A Python package to compare cutflow histograms from ROOT files.
Project description
cutflow_compare
Overview
cutflow_compare is a Python package designed to simplify the comparison of cutflow histograms from ROOT files. It provides an intuitive command-line interface for analyzing and visualizing differences in cutflow data across multiple regions and files. Whether you're working with high-energy physics datasets or other ROOT-based analyses, cutflow_compare helps you streamline your workflow by automating comparisons and generating detailed reports.
Features
- Compare cutflow histograms from multiple ROOT files.
- Compare cutflow histograms with countflow histograms within the same file.
- Custom labels for each file using the
--labelsargument. - Generate separate CSV reports for each region.
- Calculate relative errors and standard deviations across all files for each selection.
- Calculate cumulative error between cutflow and countflow histograms.
- Easy to use with command-line arguments for file input and region selection.
Installation
You can install the package using pip:
pip install cutflow_compare
Alternatively, you can clone the repository and install it manually:
git clone https://github.com/ibeuler/cutflow_compare.git
cd cutflow_compare
Or, if running from source:
python cutflow_compare.py --files histoOut-compared.root histoOut-reference.root -r region1 region2 region3 --labels Compared Reference
Usage
After installation, you can use the command-line tool directly:
cutflow_compare --files histoOut-compared.root histoOut-reference.root -r region1 region2 region3 --labels Compared Reference
Or, if running from source:
python cutflow_compare.py --files histoOut-compared.root histoOut-reference.root -r region1 region2 region3 --labels Compared Reference
Note:
Make sure the same regions are present in all files with the same name.
Arguments
--files: List of input ROOT files to compare. Required.--regions: List of regions to compare within the cutflow histograms. Required.--labels: Custom labels for each file, used in the output CSV and terminal display. Optional.--separate-selections: Keep selections separate instead of merging them. Optional.--relative-error: Include relative error calculations in the output. Optional.--save: Save the results to CSV files. Optionally, specify a custom filename prefix. Optional.--colored: Display table with colored columns for better contrast in the terminal. Optional.--counts: Compare cutflow output with specified countflow histograms (names provided). Optional.--comulative-error: Calculate cumulative error between cutflow and countflow (only valid with--counts). Optional.--version: Check the current version of cutflow_compare. Optional.
Output
The tool generates separate CSV files for each region when the --save option is used. Each CSV file contains:
- Columns for each file's event counts after cuts.
- Calculated relative errors and standard deviations for each selection (if multiple files are compared).
- Calculated cumulative error between cutflow and countflow (if
--countsand--comulative-errorare used).
Example Output
For a region WZ and files histoOut-compared.root, histoOut-reference.root, and histoOut-third.root, the output CSV might look like this:
| Selection | Compared_Event_After_Cut | Reference_Event_After_Cut | Third_Event_After_Cut | RelativeError_AllFiles |
|---|---|---|---|---|
| Selection 1 | 100 ± 5 | 105 ± 6 | 102 ± 4 | 0.03 |
| Selection 2 | 200 ± 10 | 195 ± 9 | 198 ± 8 | 0.02 |
When comparing with countflow, the output might look like this:
| Selection | Cutflow_Event_After_Cut | countflow1_countflow | countflow2_countflow |
|---|---|---|---|
| Selection 1 | 100 ± 5 | 98 ± 3 | 101 ± 4 |
| Selection 2 | 200 ± 10 | 195 ± 8 | 202 ± 9 |
Examples
Basic Comparison
cutflow_compare --files histoOut-compared.root histoOut-reference.root -r region1 region2
This compares region1 and region2 in the two ROOT files and prints the results to the terminal.
Save Results to CSV
cutflow_compare --files histoOut-compared.root histoOut-reference.root -r region1 region2 --save
This saves the results for each region to separate CSV files, e.g., cutflow_comparison_region1.csv and cutflow_comparison_region2.csv.
Custom Filename for Saved Results
cutflow_compare --files histoOut-compared.root histoOut-reference.root -r region1 region2 --save my_results
This saves the results to my_results_region1.csv and my_results_region2.csv.
Colored Output
cutflow_compare --files histoOut-compared.root histoOut-reference.root -r region1 region2 --colored
This displays the results in the terminal with contrasting colors for each file's data.
Compare with Countflow
cutflow_compare --files histoOut-compared.root -r region1 region2 --counts countflow
This compares the cutflow histogram with countflow in histoOut-compared.root for region1 and region2.
Calculate Cumulative Error
cutflow_compare --files histoOut-compared.root -r region1 --counts countflow --comulative-error
This compares the cutflow histogram with countflow in histoOut-compared.root for region1 and calculates the cumulative error.
Requirements
- Python 3.6+
- ROOT (must be installed separately, e.g., via conda:
conda install -c conda-forge root) - pandas (automatically installed with the package)
- uncertainties (automatically installed with the package)
- prettytable (automatically installed with the package)
License
This project is licensed under the MIT License. See the LICENSE file for more details.
Contributing
Contributions are welcome! If you have ideas for new features, bug fixes, or improvements, feel free to submit a pull request or open an issue on the GitHub repository.
Acknowledgments
This package leverages the ROOT framework for data analysis and visualization. Special thanks to the open-source community for providing the tools and libraries that make this project possible.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file cutflow_compare-2.1.0.tar.gz.
File metadata
- Download URL: cutflow_compare-2.1.0.tar.gz
- Upload date:
- Size: 10.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.10.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
2ed9eeeab9382689836ab521694d53ac42154dbd534875627eb74221f32ad350
|
|
| MD5 |
37b856aaa90a621e2233bb5218423aa9
|
|
| BLAKE2b-256 |
f4636880e8ba395c6916a617cba2040d2786c5a78fe63cb76f0148890af8e333
|
File details
Details for the file cutflow_compare-2.1.0-py3-none-any.whl.
File metadata
- Download URL: cutflow_compare-2.1.0-py3-none-any.whl
- Upload date:
- Size: 9.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.10.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
f1df1f7cd09e204cdb59841aed638774cc7717db27450b0443ed404e9061d85e
|
|
| MD5 |
0a3979d4abd43b6d5fef4086f4720157
|
|
| BLAKE2b-256 |
4ec60fb6713819c1f6474494a6b39bb4bf2259d6c560ae6e29c4593435a75c69
|