pyhf · PyPI

pure-Python HistFactory implementation with tensors and autodiff

These details have been verified by PyPI

Maintainers

kratsg lukasheinrich matthewfeickert scikit-hep

These details have not been verified by PyPI

Project links

Project description

pure-python fitting/limit-setting/interval estimation HistFactory-style

The HistFactory p.d.f. template [CERN-OPEN-2012-016] is per-se independent of its implementation in ROOT and sometimes, it’s useful to be able to run statistical analysis outside of ROOT, RooFit, RooStats framework.

This repo is a pure-python implementation of that statistical model for multi-bin histogram-based analysis and its interval estimation is based on the asymptotic formulas of “Asymptotic formulae for likelihood-based tests of new physics” [arXiv:1007.1727]. The aim is also to support modern computational graph libraries such as PyTorch and TensorFlow in order to make use of features such as autodifferentiation and GPU acceleration.

Hello World

This is how you use the pyhf Python API to build a statistical model and run basic inference:

>>> import pyhf
>>> model = pyhf.simplemodels.hepdata_like(signal_data=[12.0, 11.0], bkg_data=[50.0, 52.0], bkg_uncerts=[3.0, 7.0])
>>> data = [51, 48] + model.config.auxdata
>>> test_mu = 1.0
>>> CLs_obs, CLs_exp = pyhf.infer.hypotest(test_mu, data, model, qtilde=True, return_expected=True)
>>> print(f"Observed: {CLs_obs}, Expected: {CLs_exp}")
Observed: 0.05251497423736956, Expected: 0.06445320535890459

Alternatively the statistical model and observational data can be read from its serialized JSON representation (see next section).

>>> import pyhf
>>> import requests
>>> wspace = pyhf.Workspace(requests.get('https://git.io/JJYDE').json())
>>> model = wspace.model()
>>> data = wspace.data(model)
>>> test_mu = 1.0
>>> CLs_obs, CLs_exp = pyhf.infer.hypotest(test_mu, data, model, qtilde=True, return_expected=True)
>>> print(f"Observed: {CLs_obs}, Expected: {CLs_exp}")
Observed: 0.3599840922126626, Expected: 0.3599840922126626

Finally, you can also use the command line interface that pyhf provides which should produce the following JSON output:

$ cat << EOF  | tee likelihood.json | pyhf cls
{
    "channels": [
        { "name": "singlechannel",
          "samples": [
            { "name": "signal",
              "data": [12.0, 11.0],
              "modifiers": [ { "name": "mu", "type": "normfactor", "data": null} ]
            },
            { "name": "background",
              "data": [50.0, 52.0],
              "modifiers": [ {"name": "uncorr_bkguncrt", "type": "shapesys", "data": [3.0, 7.0]} ]
            }
          ]
        }
    ],
    "observations": [
        { "name": "singlechannel", "data": [51.0, 48.0] }
    ],
    "measurements": [
        { "name": "Measurement", "config": {"poi": "mu", "parameters": []} }
    ],
    "version": "1.0.0"
}
EOF
{
   "CLs_exp": [
      0.0026062609501074576,
      0.01382005356161206,
      0.06445320535890459,
      0.23525643861460702,
      0.573036205919389
   ],
   "CLs_obs": 0.05251497423736956
}

What does it support

Implemented variations:

☑ HistoSys
☑ OverallSys
☑ ShapeSys
☑ NormFactor
☑ Multiple Channels
☑ Import from XML + ROOT via uproot
☑ ShapeFactor
☑ StatError
☑ Lumi Uncertainty

Computational Backends:

☑ NumPy
☑ PyTorch
☑ TensorFlow
☑ JAX

Optimizers:

☑ SciPy (scipy.optimize)
☑ MINUIT (iminuit)

All backends can be used in combination with all optimizers. Custom user backends and optimizers can be used as well.

Todo

☐ StatConfig
☐ Non-asymptotic calculators

results obtained from this package are validated against output computed from HistFactory workspaces

A one bin example

import pyhf
import numpy as np
import matplotlib.pyplot as plt
import pyhf.contrib.viz.brazil

pyhf.set_backend("numpy")
model = pyhf.simplemodels.hepdata_like(
    signal_data=[10.0], bkg_data=[50.0], bkg_uncerts=[7.0]
)
data = [55.0] + model.config.auxdata

poi_vals = np.linspace(0, 5, 41)
results = [
    pyhf.infer.hypotest(test_poi, data, model, qtilde=True, return_expected_set=True)
    for test_poi in poi_vals
]

fig, ax = plt.subplots()
fig.set_size_inches(7, 5)
ax.set_xlabel(r"$\mu$ (POI)")
ax.set_ylabel(r"$\mathrm{CL}_{s}$")
pyhf.contrib.viz.brazil.plot_results(ax, poi_vals, results)

pyhf

ROOT

A two bin example

import pyhf
import numpy as np
import matplotlib.pyplot as plt
import pyhf.contrib.viz.brazil

pyhf.set_backend("numpy")
model = pyhf.simplemodels.hepdata_like(
    signal_data=[30.0, 45.0], bkg_data=[100.0, 150.0], bkg_uncerts=[15.0, 20.0]
)
data = [100.0, 145.0] + model.config.auxdata

poi_vals = np.linspace(0, 5, 41)
results = [
    pyhf.infer.hypotest(test_poi, data, model, qtilde=True, return_expected_set=True)
    for test_poi in poi_vals
]

fig, ax = plt.subplots()
fig.set_size_inches(7, 5)
ax.set_xlabel(r"$\mu$ (POI)")
ax.set_ylabel(r"$\mathrm{CL}_{s}$")
pyhf.contrib.viz.brazil.plot_results(ax, poi_vals, results)

pyhf

ROOT

Installation

To install pyhf from PyPI with the NumPy backend run

python -m pip install pyhf

and to install pyhf with all additional backends run

python -m pip install pyhf[backends]

or a subset of the options.

To uninstall run

python -m pip uninstall pyhf

Questions

If you have a question about the use of pyhf not covered in the documentation, please ask a question on Stack Overflow with the [pyhf] tag, which the pyhf dev team watches.

If you believe you have found a bug in pyhf, please report it in the GitHub Issues. If you’re interested in getting updates from the pyhf dev team and release announcements you can join the pyhf-announcements mailing list.

Citation

As noted in Use and Citations, the preferred BibTeX entry for citation of pyhf is

@software{pyhf,
  author = "{Heinrich, Lukas and Feickert, Matthew and Stark, Giordon}",
  title = "{pyhf: v0.5.4}",
  version = {0.5.4},
  doi = {10.5281/zenodo.1169739},
  url = {https://github.com/scikit-hep/pyhf},
}

Authors

pyhf is openly developed by Lukas Heinrich, Matthew Feickert, and Giordon Stark.

Please check the contribution statistics for a list of contributors.

Milestones

2020-07-28: 1000 GitHub issues and pull requests. (See PR #1000)

Acknowledgements

Matthew Feickert has received support to work on pyhf provided by NSF cooperative agreement OAC-1836650 (IRIS-HEP) and grant OAC-1450377 (DIANA/HEP).

Project details

These details have been verified by PyPI

Maintainers

kratsg lukasheinrich matthewfeickert scikit-hep

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.7.6

Jan 5, 2024

0.7.5

Oct 26, 2023

0.7.4

Sep 7, 2023

0.7.3

Aug 17, 2023

0.7.2

May 18, 2023

0.7.1

Apr 7, 2023

0.7.0

Sep 24, 2022

0.7.0rc4 pre-release

Sep 10, 2022

0.7.0rc3 pre-release

Sep 3, 2022

0.7.0rc2 pre-release

Aug 29, 2022

0.7.0rc1 pre-release

Jun 29, 2022

0.6.3

Sep 3, 2021

0.6.2

Jun 21, 2021

0.6.1

Mar 9, 2021

0.6.0

Feb 16, 2021

This version

0.5.4

Dec 12, 2020

0.5.3

Oct 20, 2020

0.5.2

Sep 7, 2020

0.5.1

Aug 2, 2020

0.5.0

Jul 25, 2020

0.4.4

Jul 2, 2020

0.4.3

May 31, 2020

0.4.2

May 28, 2020

0.4.1

Mar 4, 2020

0.4.0

Jan 13, 2020

0.3.4

Jan 13, 2020

0.3.3

Jan 12, 2020

0.3.2

Jan 3, 2020

0.3.1

Dec 16, 2019

0.3.0

Dec 9, 2019

0.2.2

Nov 21, 2019

0.2.1

Nov 15, 2019

0.2.0

Nov 14, 2019

0.1.2

Jul 12, 2019

0.1.1

Jun 27, 2019

0.1.0

Apr 13, 2019

0.0.17

Apr 11, 2019

0.0.16

Jan 31, 2019

0.0.15

Sep 2, 2018

0.0.14

Sep 1, 2018

0.0.13

Sep 1, 2018

0.0.12

Aug 23, 2018

0.0.8

Feb 14, 2018

0.0.5

Feb 13, 2018

0.0.4

Feb 9, 2018

0.0.3

Feb 5, 2018

0.0.1

Jan 24, 2018

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pyhf-0.5.4.tar.gz (98.5 kB view hashes)

Uploaded Dec 12, 2020 Source

Built Distribution

pyhf-0.5.4-py2.py3-none-any.whl (125.6 kB view hashes)

Uploaded Dec 12, 2020 Python 2 Python 3

Hashes for pyhf-0.5.4.tar.gz

Hashes for pyhf-0.5.4.tar.gz
Algorithm	Hash digest
SHA256	`e5b0d8226bb1a79007cb9ead03f9d6b2ef18b4613f6fda8dda209809ad49164e`
MD5	`176ecdfc6ce40a046d73693fbfc6242d`
BLAKE2b-256	`32b3f761f3bce1cded897c496554c426b68ef8c5b49f26d0bc36f9cb850949da`

Hashes for pyhf-0.5.4-py2.py3-none-any.whl

Hashes for pyhf-0.5.4-py2.py3-none-any.whl
Algorithm	Hash digest
SHA256	`3c740340735aab943995f7e5f71b7ad6e68162a4be2fd7fdf576bc9a2b363dd1`
MD5	`bfd8e13a868659f9e5a531017a78c36f`
BLAKE2b-256	`66afbfb29c9e2de653b97466d59f7e65e86a333a49fa267248b3ee23848a337d`