zfit·PyPI

scalable pythonic model fitting for high energy physics

These details have been verified by PyPI

Project links

Homepage

Owner

zfit

GitHub Statistics

These details have not been verified by PyPI

Project description

zfit: scalable pythonic fitting

https://scikit-hep.org/assets/images/Scikit--HEP-Affiliated-blue.svg

https://github.com/zfit/zfit/workflows/build/badge.svg

zfit is a highly scalable and customizable model manipulation and likelihood fitting library. It uses the same computational backend as TensorFlow and is optimised for simple and direct manipulation of probability density functions. The project is affiliated with and well integrated into Scikit-HEP, the HEP Python ecosystem.

Tutorials: Interactive introduction and tutorials
Quick start: Example scripts
Documentation: See stable documentation or latest documentation
Questions: see the FAQ, ask on StackOverflow (with the zfit tag) or contact us directly.
Physics, HEP: zfit-physics is the place to contribute and find more HEP related content
Statistical inference: hepstats for limits, CI, sWeights and more

If you use zfit in research, please consider citing.

N.B.: zfit is currently in beta stage, so while most core parts are established, some may still be missing and bugs may be encountered. It is, however, mostly ready for production, and is being used in analyses projects. If you want to use it for your project and you are not sure if all the needed functionality is there, feel free to contact us.

Installation

zfit is available on pip and conda-forge. To install it (recommended: use a virtual/conda env!) with all the dependencies (minimizers, uproot, …), use

pip install -U zfit[all]

(the -U just indicates to upgrade zfit, in case you have it already installed) or for minimal dependencies

pip install zfit

For conda/mamba, use

conda install -c conda-forge zfit

How to use

While the zfit library provides a model fitting and sampling framework for a broad list of applications, we will illustrate its main features with a simple example by fitting a Gaussian distribution with an unbinned likelihood fit and a parameter uncertainty estimation.

Example in short

obs = zfit.Space('x', -10, 10)

# create the model
mu    = zfit.Parameter("mu"   , 2.4, -1, 5)
sigma = zfit.Parameter("sigma", 1.3,  0, 5)
gauss = zfit.pdf.Gauss(obs=obs, mu=mu, sigma=sigma)

# load the data
data_np = np.random.normal(size=10_000)
data = zfit.Data(obs=obs, data=data_np)
# or sample from model
data = gauss.sample(10_000)

# build the loss
nll = zfit.loss.UnbinnedNLL(model=gauss, data=data)

# minimize (20+ interchangeable minimizers available!)
minimizer = zfit.minimize.Minuit()
result = minimizer.minimize(nll).update_params()

# calculate errors
sym_errors = result.hesse()
asym_errors = result.errors()

This follows the zfit workflow

Full explanation

The default space (e.g. normalization range) of a PDF is defined by an observable space, which is created using the zfit.Space class:

obs = zfit.Space('x', -10, 10)

To create a simple Gaussian PDF, we define its parameters and their limits using the zfit.Parameter class.

# syntax: zfit.Parameter("any_name", value, lower, upper)
  mu    = zfit.Parameter("mu"   , 2.4, -1, 5)
  sigma = zfit.Parameter("sigma", 1.3,  0, 5)
  gauss = zfit.pdf.Gauss(obs=obs, mu=mu, sigma=sigma)

For simplicity, we create the dataset to be fitted starting from a numpy array, but zfit allows for the use of other sources such as ROOT files:

mu_true = 0
sigma_true = 1
data_np = np.random.normal(mu_true, sigma_true, size=10000)
data = zfit.Data(obs=obs, data=data_np)

Fits are performed in three steps:

Creation of a loss function, in our case a negative log-likelihood.
Instantiation of our minimiser of choice, in the example the Minuit.
Minimisation of the loss function.

# Stage 1: create an unbinned likelihood with the given PDF and dataset
nll = zfit.loss.UnbinnedNLL(model=gauss, data=data)

# Stage 2: instantiate a minimiser (in this case a basic minuit)
minimizer = zfit.minimize.Minuit()

# Stage 3: minimise the given negative log-likelihood
result = minimizer.minimize(nll).update_params()

The .update_params() changes the default values of the parameters (this is currently happen by default but won’t anymore in the future)

Symmetric errors are calculated with a further function call to avoid running potentially expensive operations if not needed. Asymmetric errors using a profiling method can also be obtained:

sym_errors = result.hesse()
asym_errors = result.errors()

Once we’ve performed the fit and obtained the corresponding uncertainties, we can examine the fit results by printing it or looking at individual parts

print(result)  # nice representation of a whole result

print("Function minimum:", result.fmin)
print("Converged:", result.converged)

# Information on all the parameters in the fit
params = result.params
print(params)

# Printing information on specific parameters, e.g. mu
print("mu={}".format(params[mu]['value']))

And that’s it! For more details and information of what you can do with zfit, checkout the latest documentation.

Why?

The basic idea behind zfit is to offer a Python oriented alternative to the very successful RooFit library from the ROOT data analysis package that can integrate with the other packages that are part if the scientific Python ecosystem. Contrary to the monolithic approach of ROOT/RooFit, the aim of zfit is to be light and flexible enough t o integrate with any state-of-art tools and to allow scalability going to larger datasets.

These core ideas are supported by two basic pillars:

The skeleton and extension of the code is minimalist, simple and finite: the zfit library is exclusively designed for the purpose of model fitting and sampling with no attempt to extend its functionalities to features such as statistical methods or plotting.
zfit is designed for optimal parallelisation and scalability by making use of TensorFlow as its backend. The use of TensorFlow provides crucial features in the context of model fitting like taking care of the parallelisation and analytic derivatives.

Prerequisites

zfit works with Python versions 3.9 and above. The main dependency is tensorflow: zfit follows a close version compatibility with TensorFlow.

Contributing

Any idea of how to improve the library? Or interested to write some code? Contributions are always welcome, please have a look at the Contributing guide.

Contact

You can contact us directly:

via e-mail: zfit@physik.uzh.ch
join our Gitter channel

Original Authors

Jonas Eschle <jonas.eschle@cern.ch>
Albert Puig <albert.puig@cern.ch>
Rafael Silva Coutinho <rsilvaco@cern.ch>

See here for all authors and contributors

Acknowledgements

zfit has been developed with support from the University of Zurich and the Swiss National Science Foundation (SNSF) under contracts 168169 and 174182.

The idea of zfit is inspired by the TensorFlowAnalysis framework developed by Anton Poluektov and TensorProb by Chris Burr and Igor Babuschkin using the TensorFlow open source library and more libraries.

Project details

These details have been verified by PyPI

Project links

Homepage

Owner

zfit

GitHub Statistics

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.26.0

Jun 2, 2025

0.25.0

Mar 14, 2025

0.24.3

Jan 6, 2025

0.24.2

Dec 11, 2024

0.24.1

Dec 9, 2024

0.24.0

Dec 10, 2024

0.23.0

Nov 7, 2024

0.22.0

Aug 22, 2024

0.21.1

Jul 13, 2024

0.21.0

Jul 2, 2024

0.20.3

Apr 20, 2024

0.20.2

Apr 16, 2024

0.20.1

Apr 15, 2024

0.20.0

Apr 12, 2024

0.18.2

Mar 13, 2024

0.18.1

Feb 22, 2024

0.18.0

Jan 30, 2024

0.17.0

Jan 29, 2024

0.16.0

Oct 9, 2023

0.15.5

Jul 27, 2023

0.15.2

Jul 20, 2023

0.15.1

Jul 20, 2023

0.15.0

Jul 13, 2023

0.14.1

Jul 1, 2023

0.14.0

Jun 22, 2023

0.13.2

Jun 15, 2023

0.13.1

Apr 20, 2023

0.13.0

Apr 19, 2023

0.12.1

Apr 1, 2023

0.12.0

Mar 14, 2023

0.11.1

Nov 29, 2022

0.10.1

Aug 31, 2022

0.10.0

Aug 22, 2022

0.9.0a3 pre-release

May 17, 2022

0.9.0a2 pre-release

Mar 2, 2022

0.9.0a1 pre-release

Dec 1, 2021

0.9.0a0 pre-release

Jan 11, 2022

0.8.3

Apr 5, 2022

0.8.2

Sep 20, 2021

0.8.1

Sep 14, 2021

0.7.2

Jul 7, 2021

0.7.1

Jul 6, 2021

0.7.0

Jun 3, 2021

0.6.6

May 12, 2021

0.6.5

May 4, 2021

0.6.4

Apr 16, 2021

0.6.3

Apr 15, 2021

0.6.2

Apr 9, 2021

0.6.1

Mar 31, 2021

0.6.0

Mar 30, 2021

0.5.6 yanked

Jan 26, 2021

Reason this release was yanked: