Monte Carlo reweighting toolkit with multiple backends (hep_ml, XGBoost, neural networks, folding, and binning).

Project description

mcreweight

mcreweight is a Python package for Monte Carlo event reweighting to match data distributions in multiplicity and kinematic variables. It provides multiple reweighting backends, including hep_ml-based gradient boosting, XGBoost, neural-network approaches, folding variants, and bin-based reweighting, with optional Optuna hyperparameter tuning and integrated plotting/validation utilities.

[!WARNING] Bins reweighting is a useful low-dimensional baseline, but it becomes unstable quickly as the number of training variables grows. Prefer it for one or two dimensions, use extra care around three or four, and avoid relying on it as the main method in higher-dimensional problems.

Documentation

Project documentation is currently intended to be available at:

CERN Pages: http://mcreweight.docs.cern.ch
Read the Docs: https://mcreweight.readthedocs.io/ once the project is imported and built there

Setup

Run in a lb-conda environment, as

lb-conda mcreweight

Installation

If you don't run in a lb-conda environment, consider installing the python package from PyPI or cloning it from GitLab.

From PyPI

pip install mcreweight

From GitLab

Requires pixi.

git clone https://gitlab.cern.ch/lhcb-dpa/tools/mcreweight.git
cd mcreweight

To run the CLI tools you can prefix them with pixi run, i.e.

pixi run run-reweight --help
pixi run apply-weights --help

To run the verification checks used in CI:

pixi run -e lint quality
pixi run test
pixi run -e docs build-docs

To auto-format the code before rerunning the checks:

pixi run -e lint black .
pixi run -e lint quality

This repository currently uses these Pixi environments:

default: package runtime and CLI usage
lint: black and ruff
test: pytest
docs: Sphinx documentation build

Useful Pixi commands:

pixi run -e lint ruff check .
pixi run -e lint black --check .
pixi run -e docs sphinx-build -b html -n -W --keep-going docs docs/_build/html
pixi run -e test pytest tests/test_cli.py::test_run_reweight -v
pixi run -e test pytest tests/test_cli.py::test_apply_weights -v

The pixi.lock file pins all dependencies for reproducibility. To update them, run pixi update and commit the updated lock file.

Usage

Both CLIs support two usage modes:

Config-driven mode (recommended): pass --config <file.yaml>
Direct CLI mode: pass all options on the command line

Use --dry-run to validate the merged configuration without running.

Both CLIs treat weightsdir and plotdir as root directories:

run-reweight writes to <weightsdir>/<sample>/ and <plotdir>/<sample>/
apply-weights reads trained artifacts from <weightsdir>/<training-sample>/
apply-weights writes normalized weights to <weightsdir>/<application-sample>/ and plots to <plotdir>/<application-sample>/

To run reweighting do it via configuration file:

run-reweight --config <path_to_config.yaml>

or passing all options to the command line:

run-reweight --path-data <path_to_data.root> \
             --path-mc <path_to_mc.root> \
             --training-vars <variable_list> \
             --monitoring-vars <monitoring_variable_list> \
             --sample <sample> \
             --n_trials <optuna_tests> \
             --test_size <test_sample_size> \
             --weightsdir <weights_directory>

To apply saved weights to an MC sample with configuration file:

apply-weights --config <path_to_config.yaml>

or passing all options to the command line:

apply-weights --path-mc <path_to_mc.root> \
              --vars <variable_list> \
              --training-sample <training_sample> \
              --application-sample <application_sample> \
              --method <method_for_reweighter> \
              --monitoring-vars <monitoring_variable_list> \
              --output-path <output_file.root> \
              --weightsdir <weights_directory>

Options

For the reweighting (`run-reweight`):

General:

--config: YAML configuration file
--dry-run: Validate config and print effective settings only
--verbosity: Logging level (1, 2, 3, 4), default 1

Inputs:

--path-mc: MC ROOT file path(s)
--tree-mc: MC tree name (default: DecayTree)
--mcweights-name: MC input weight branch
--mc-label: MC sample label (default: MC)
--path-data: Data ROOT file path(s)
--tree-data: Data tree name (default: DecayTree)
--sweights-name: Data sWeights branch (default: sweight_sig)
--data-label: Data sample label (default: Data)
--path-xlabels: YAML mapping for axis labels (default: utils/default_xlabels.yaml)

Variables:

--training-vars: Reweighting training variables
--monitoring-vars: Variables used for monitoring plots

Reweighting:

--sample: Sample tag (default: bd_jpsikst_ee)
--methods: Methods list (defaults from YAML or internal defaults)
--transform: Feature transform (quantile, yeo-johnson, signed-log, scaler)
--n_trials: Optuna trials (default: 10)
--test_size: Test split fraction (default: 0.3)
--n_folds: Folding count (default: 10)
--n_bins: Bin count for Bins method (default: 10)
--n_neighs: Neighbors for Bins method (default: 3)
--reweight-validation-fraction: Validation split used by iterative ONNX reweighters
--reweight-early-stopping-rounds: Early-stopping patience for iterative ONNX reweighters
--reweight-metric-every: Evaluate ONNX validation metric every N stages
--folding-aggregation: Folding aggregation (weighted_geometric, geometric, median)
--shap: Enable SHAP computation

Output:

--weightsdir: Root directory for training artifacts; <sample>/ is created automatically. If omitted, MCREWEIGHTS_DATA_ROOT is used
--plotdir: Root directory for plots; <sample>/ is created automatically (default: plots)

Additional options can be found by running:

run-reweight --help

For the application of the weights (`apply-weights`):

General:

--config: YAML configuration file
--dry-run: Validate config and print effective settings only
--verbosity: Logging level (1, 2, 3, 4), default 1

Inputs:

--path-mc: MC file path(s)
--tree-mc: MC tree name (default: DecayTree)
--mcweights-name: Existing MC input weight branch
--path-data: Optional data file path(s), for comparison plots
--tree-data: Data tree name (default: DecayTree)
--sweights-name: Data sWeights branch (default: sweight_sig)
--path-xlabels: YAML mapping for axis labels

Variables:

--vars: Application variables
--training-vars: Variables used when training the saved model
--monitoring-vars: Variables used for monitoring plots

Reweighting:

--method: Method (GB, XGB, NN, Folding, XGBFolding, NNFolding, Bins), default XGB
--training-sample: Training sample tag (default: bd_jpsikst_ee)
--application-sample: Application sample tag (default: bd_jpsikst_ee)
--weightsdir: Root directory for trained artifacts; <training-sample>/ is read and <application-sample>/ is written automatically. If omitted, MCREWEIGHTS_DATA_ROOT is used
--plotdir: Root directory for plots; <application-sample>/ is created automatically (default: plots)

Output:

--output-path: Output ROOT file path
--output-ntuple: Output ntuple type (default: TTree)
--output-tree: Output tree name (default: DecayTree)
--weights-name: Name of produced weights branch (default: weights)

Additional options can be found by running:

apply-weights --help

Example

Reweighting:

pixi run run-reweight \
  --config tests_run/run_reweighting_config.yaml \
  --verbosity 2

Application of the weights:

pixi run apply-weights \
  --config tests_run/apply_weights_config.yaml \
  --method XGB \
  --output-path test_applied_weights.root

Documentation build:

pixi run -e docs build-docs

The generated HTML documentation is written to docs/_build/html/.

Contact

For questions, please contact the repository maintainer.

Project details

Release history Release notifications | RSS feed

1.0.5

May 18, 2026

1.0.4

May 14, 2026

1.0.3

Apr 10, 2026

This version

1.0.1.dev0 pre-release

Apr 10, 2026

0.3.4

Mar 20, 2026

0.3.2

Mar 18, 2026

0.3.1

Nov 24, 2025

0.3.0

Nov 24, 2025

0.2.9

Nov 18, 2025

0.2.8

Nov 18, 2025

0.2.7

Nov 18, 2025

0.2.6

Nov 18, 2025

0.2.5

Oct 2, 2025

0.2.4

Oct 1, 2025

0.2.3

Sep 3, 2025

0.2.1

Aug 4, 2025

0.2.0

Aug 1, 2025

0.1.6

Jul 30, 2025

0.1.3

Jul 22, 2025

0.1.1

Jul 21, 2025

0.1.0

Jul 21, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mcreweight-1.0.1.dev0.tar.gz (37.6 MB view details)

Uploaded Apr 10, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

mcreweight-1.0.1.dev0-py3-none-any.whl (81.7 kB view details)

Uploaded Apr 10, 2026 Python 3

File details

Details for the file mcreweight-1.0.1.dev0.tar.gz.

File metadata

Download URL: mcreweight-1.0.1.dev0.tar.gz
Upload date: Apr 10, 2026
Size: 37.6 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.4

File hashes

Hashes for mcreweight-1.0.1.dev0.tar.gz
Algorithm	Hash digest
SHA256	`267843fe78ca1a01670b0445ff5305716d73ebe589d92db2fa9e42e8e08835cd`
MD5	`d526baca00de77595472eef3aef7fc1a`
BLAKE2b-256	`1f6c5de14a8baaf8d024aa13c4e764f92775686f32732f0d226300435711e1b3`

See more details on using hashes here.

File details

Details for the file mcreweight-1.0.1.dev0-py3-none-any.whl.

File metadata

Download URL: mcreweight-1.0.1.dev0-py3-none-any.whl
Upload date: Apr 10, 2026
Size: 81.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.4

File hashes

Hashes for mcreweight-1.0.1.dev0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`e1f2d4ee37e45e706fc0c8b4e3778c6b2c573965e2943e5d251e00c88b4fd562`
MD5	`5af2b3a7169614e5dc0bc956f220d0e6`
BLAKE2b-256	`5963b9e7a8a15276083da44b730b0702e1e58a237720f9ee6b0027ae98fc5e97`

See more details on using hashes here.

mcreweight 1.0.1.dev0

Navigation

Verified details

Owner

Maintainers

Unverified details

Meta

Project description

mcreweight

Documentation

Setup

Installation

From PyPI

From GitLab

Usage

Options

For the reweighting (`run-reweight`):

For the application of the weights (`apply-weights`):

Example

Contact

Project details

Verified details

Owner

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

mcreweight 1.0.1.dev0

Navigation

Verified details

Owner

Maintainers

Unverified details

Meta

Project description

mcreweight

Documentation

Setup

Installation

From PyPI

From GitLab

Usage

Options

For the reweighting (run-reweight):

For the application of the weights (apply-weights):

Example

Contact

Project details

Verified details

Owner

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

For the reweighting (`run-reweight`):

For the application of the weights (`apply-weights`):