Skip to main content

A modern machine learning library for high-energy physics data analysis

Project description

ColliderML

Tests Coverage Python 3.10+ License: MIT

A modern machine learning library for high-energy physics data analysis.

Features

  • Efficient parallel data downloading with resume capability
  • Support for common HEP data formats
  • Machine learning utilities for particle physics
  • Visualization tools for physics data

Installation

For Users

# Create and activate environment
conda create -n collider-env python=3.11  # 3.10 or 3.11 recommended
conda activate collider-env

# Install package
pip install colliderml

For Developers

# Create and activate environment
conda create -n collider-dev python=3.11  # 3.10 or 3.11 recommended
conda activate collider-dev

# Clone repository
git clone https://github.com/OpenDataDetector/ColliderML.git
cd ColliderML

# Install in development mode with extra dependencies
pip install -e ".[dev]"

Quick Start

CLI

# Download 100 events from the taster campaign into ./data
colliderml get -c taster -e 100 -O data
from colliderml.core.data.manifest import ManifestClient
from colliderml.core.io import DataDownloader

manifest = ManifestClient()
files = manifest.select_files(campaign=None, datasets=["ttbar"], objects=["tracks"], max_events=1000)

downloader = DataDownloader()
results = downloader.download_files([f.path for f in files], local_dir="data", max_workers=4, resume=True)

for path, result in results.items():
    print(path, result.success, result.error)

Features

  • Manifest-driven: Always selects files from the latest portal manifest
  • Parallel Downloads: Download multiple files concurrently
  • Resume Capability: Optionally resume interrupted downloads
  • Progress Tracking: Real-time progress bars
  • Clear Errors: Helpful failure messages and HEAD checks

Development

  1. Activate your environment:

    conda activate collider-dev
    
  2. Run tests:

    # Run unit tests only
    pytest -v -m "not integration"
    
    # Run all tests including integration tests
    pytest -v
    
    # Run with coverage report
    pytest --cov=colliderml
    
  3. Build documentation:

    mkdocs build
    mkdocs serve  # View at http://127.0.0.1:8000
    

License

MIT License

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

colliderml-0.3.0.tar.gz (505.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

colliderml-0.3.0-py3-none-any.whl (29.1 kB view details)

Uploaded Python 3

File details

Details for the file colliderml-0.3.0.tar.gz.

File metadata

  • Download URL: colliderml-0.3.0.tar.gz
  • Upload date:
  • Size: 505.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for colliderml-0.3.0.tar.gz
Algorithm Hash digest
SHA256 1c50f42ff308c20b13ae147ae4f625561297a524e44eb766d19588ed63d8665e
MD5 754b06f875ef186fbf2730b15911ae0c
BLAKE2b-256 02a3b2e5bab341c9aa38aa294e4f1c7e9267794d60fc349b766289d6a4cb29c5

See more details on using hashes here.

Provenance

The following attestation bundles were made for colliderml-0.3.0.tar.gz:

Publisher: publish-pypi.yml on OpenDataDetector/ColliderML

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file colliderml-0.3.0-py3-none-any.whl.

File metadata

  • Download URL: colliderml-0.3.0-py3-none-any.whl
  • Upload date:
  • Size: 29.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for colliderml-0.3.0-py3-none-any.whl
Algorithm Hash digest
SHA256 0444c72f6a477051c35ca84128dbe35248b6d62c3ff0141f73a585af992bf96f
MD5 a37e92d35bbd76e666dc3bf4706cc068
BLAKE2b-256 dfc9da76a93761c019083142c741117d3f20f17b7658c9501036affdda34a0b9

See more details on using hashes here.

Provenance

The following attestation bundles were made for colliderml-0.3.0-py3-none-any.whl:

Publisher: publish-pypi.yml on OpenDataDetector/ColliderML

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page