Skip to main content

Basic tools and wrappers for enabling not-too-alien syntax when running columnar Collider HEP analysis.

Project description

coffea - Columnar Object Framework For Effective Analysis

https://zenodo.org/badge/159673139.svg https://github.com/CoffeaTeam/coffea/actions/workflows/ci.yml/badge.svg https://codecov.io/gh/CoffeaTeam/coffea/branch/master/graph/badge.svg?event=schedule https://badge.fury.io/py/coffea.svg https://img.shields.io/pypi/dm/coffea.svg https://img.shields.io/conda/vn/conda-forge/coffea.svg https://badges.gitter.im/CoffeaTeam/coffea.svg https://mybinder.org/badge_logo.svg

Basic tools and wrappers for enabling not-too-alien syntax when running columnar Collider HEP analysis.

coffea is a prototype package for pulling together all the typical needs of a high-energy collider physics (HEP) experiment analysis using the scientific python ecosystem. It makes use of uproot and awkward-array to provide an array-based syntax for manipulating HEP event data in an efficient and numpythonic way. There are sub-packages that implement histogramming, plotting, and look-up table functionalities that are needed to convey scientific insight, apply transformations to data, and correct for discrepancies in Monte Carlo simulations compared to data.

coffea also supplies facilities for horizontally scaling an analysis in order to reduce time-to-insight in a way that is largely independent of the resource the analysis is being executed on. By making use of modern big-data technologies like Apache Spark, parsl, Dask , and Work Queue, it is possible with coffea to scale a HEP analysis from a testing on a laptop to: a large multi-core server, computing clusters, and super-computers without the need to alter or otherwise adapt the analysis code itself.

coffea is a HEP community project collaborating with iris-hep and is currently a prototype. We welcome input to improve its quality as we progress towards a sensible refactorization into the scientific python ecosystem and a first release. Please feel free to contribute at our github repo!

Installation

Install coffea like any other Python package:

pip install coffea

or similar (use sudo, --user, virtualenv, or pip-in-conda if you wish). For more details, see the Installing coffea section of the documentation.

Strict dependencies

The following are installed automatically when you install coffea with pip:

  • numpy (1.22+);

  • uproot for interacting with ROOT files and handling their data transparently;

  • awkward-array to manipulate complex-structured columnar data, such as jagged arrays;

  • numba just-in-time compilation of python functions;

  • scipy for many statistical functions;

  • matplotlib as a plotting backend;

  • and other utility packages, as enumerated in pyproject.toml.

Documentation

All documentation is hosted at https://coffeateam.github.io/coffea/

Citation

If you would like to cite this code in your work, you can use the zenodo DOI indicated in CITATION.cff, or the latest DOI. You may also cite the proceedings:

  • “N. Smith et al 2020 EPJ Web Conf. 245 06012”

  • “L. Gray et al 2023 J. Phys.: Conf. Ser. 2438 012033”

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

coffea-2024.4.0.tar.gz (18.0 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

coffea-2024.4.0-py3-none-any.whl (172.9 kB view details)

Uploaded Python 3

File details

Details for the file coffea-2024.4.0.tar.gz.

File metadata

  • Download URL: coffea-2024.4.0.tar.gz
  • Upload date:
  • Size: 18.0 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.12.2

File hashes

Hashes for coffea-2024.4.0.tar.gz
Algorithm Hash digest
SHA256 d1bca8bc4ca361b86ce25c526a7355f477961f291b57eb78e6b3211425b35b7a
MD5 e4ac9ef7b98d194fd76b206acd4f20c2
BLAKE2b-256 224298ffee359d4c8525c9f80c35a492f2f1eacc714ff42e439d180aaa58d30c

See more details on using hashes here.

File details

Details for the file coffea-2024.4.0-py3-none-any.whl.

File metadata

  • Download URL: coffea-2024.4.0-py3-none-any.whl
  • Upload date:
  • Size: 172.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.12.2

File hashes

Hashes for coffea-2024.4.0-py3-none-any.whl
Algorithm Hash digest
SHA256 2aa030c42d448128b6486fb639a52fedf1c17ac4e6f45d71dc28213c1cfb59aa
MD5 5d391be4628b126c97a63baa9f1939f4
BLAKE2b-256 81428931b8a4cc647e2a3d94c963bb51e9e974dfcb47307fcebe814efc39d2f4

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page