Skip to main content

A modern machine learning library for high-energy physics data analysis

Project description

ColliderML

Tests Coverage Python 3.10+ License: MIT

A modern machine learning library for high-energy physics data analysis.

Features

  • Efficient parallel data downloading with resume capability
  • Support for common HEP data formats
  • Machine learning utilities for particle physics
  • Visualization tools for physics data

Installation

For Users

# Create and activate environment
conda create -n collider-env python=3.11  # 3.10 or 3.11 recommended
conda activate collider-env

# Install package
pip install colliderml

For Developers

# Create and activate environment
conda create -n collider-dev python=3.11  # 3.10 or 3.11 recommended
conda activate collider-dev

# Clone repository
git clone https://github.com/murnanedaniel/colliderml.git
cd colliderml

# Install in development mode with extra dependencies
pip install -e ".[dev]"

Quick Start

CLI

# Download 100 events from the taster campaign into ./data
colliderml get -c taster -e 100 -O data
from colliderml.core.data.manifest import ManifestClient
from colliderml.core.io import DataDownloader

manifest = ManifestClient()
files = manifest.select_files(campaign=None, datasets=["ttbar"], objects=["tracks"], max_events=1000)

downloader = DataDownloader()
results = downloader.download_files([f.path for f in files], local_dir="data", max_workers=4, resume=True)

for path, result in results.items():
    print(path, result.success, result.error)

Features

  • Manifest-driven: Always selects files from the latest portal manifest
  • Parallel Downloads: Download multiple files concurrently
  • Resume Capability: Optionally resume interrupted downloads
  • Progress Tracking: Real-time progress bars
  • Clear Errors: Helpful failure messages and HEAD checks

Development

  1. Activate your environment:

    conda activate collider-dev
    
  2. Run tests:

    # Run unit tests only
    pytest -v -m "not integration"
    
    # Run all tests including integration tests
    pytest -v
    
    # Run with coverage report
    pytest --cov=colliderml
    
  3. Build documentation:

    mkdocs build
    mkdocs serve  # View at http://127.0.0.1:8000
    

License

MIT License

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

colliderml-0.1.2.tar.gz (482.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

colliderml-0.1.2-py3-none-any.whl (14.3 kB view details)

Uploaded Python 3

File details

Details for the file colliderml-0.1.2.tar.gz.

File metadata

  • Download URL: colliderml-0.1.2.tar.gz
  • Upload date:
  • Size: 482.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/32.0 requests/2.26.0 requests-toolbelt/0.9.1 urllib3/1.26.7 tqdm/4.62.3 importlib-metadata/8.5.0 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.8.5

File hashes

Hashes for colliderml-0.1.2.tar.gz
Algorithm Hash digest
SHA256 a9876e70d7a63f0ee2ef6a93b9b473aab03a67511e0cff56fdbc9fc24ebde328
MD5 7d82db00d41d07945208aaaa024b9a34
BLAKE2b-256 43cfbb6bdf86491c6bcddf56cf7f3d3ba2be4e476623b4633610b55f51eee4b1

See more details on using hashes here.

File details

Details for the file colliderml-0.1.2-py3-none-any.whl.

File metadata

  • Download URL: colliderml-0.1.2-py3-none-any.whl
  • Upload date:
  • Size: 14.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/32.0 requests/2.26.0 requests-toolbelt/0.9.1 urllib3/1.26.7 tqdm/4.62.3 importlib-metadata/8.5.0 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.8.5

File hashes

Hashes for colliderml-0.1.2-py3-none-any.whl
Algorithm Hash digest
SHA256 21a947d860a8b2175b679f7fd5dec638a91a5878863d49a27287596b72b764dc
MD5 55a0d1de25750f26d2dfd7cfed475cf7
BLAKE2b-256 19beb92ffa0eca16b8da41e16ce5492dcdf1560bf34ac94a5492fbd2f49a9387

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page