OmniFold, a library to perform unbinned and high-dimensional unfolding for HEP.
Project description
OmniFold: A Method to Simultaneously Unfold All Observables
This repository contains the implementation and examples of the OmniFold algorithm originally described in Phys. Rev. Lett. 124 (2020) 182001, 1911.09107 [hep-ph]. The code for the original paper can be found at this repository, which includes a binder demo. This repository was created to maintain the pip installable version of OmniFold with additional functionality compared to the original package:
Installation
pip install omnifold
Getting Started
Examples for tabular data and for Point Cloud-like inputs are provided in the notebooks OmniFold_example.ipynb and OmniFold_example_pc.ipynb, respectively. To unfold your own data you can follow these steps:
Creating a DataLoader to hold your data
from omnifold import DataLoader
# Load your own dataset, possibly with weights.
mock_dataset = np.zeros((100,3,4))
mock_dataloader = DataLoader(
reco = mock_dataset,
gen = mock_dataset,
normalize=True)
The DataLoader class will automatically normalize the weights if normalize
is true. To estimate the statistical uncertainty using the Bootstrap method, you can use the optional flag bootstrap = True
.
Creating your own Keras model to be used for Unfolding
In the MultiFold class, we provide simple neural network models that you can use. For a Multilayer Perceptron you can load
from omnifold import MLP
ndim = 3 #The number of features present in your dataset
reco_model = MLP(ndim)
gen_model = MLP(ndim)
to create the models to be used at both reconstruction and generator level trainings of OmniFold. In case your data is better described by a point cloud, we also provide the implementation of the Point-Edge Transformer (PET) model that can be used similarly to the MLP implementation:
from omnifold import PET
ndim = 3 #The number of features present in your dataset
npart = 5 #Maximum number of particles present in the dataset
reco_model = PET(ndim,num_part = npart)
gen_model = PET(ndim,num_part = npart)
You can also provide your own custom keras.Model to be used by OmniFold
Creating the MultiFold Object
Now that we have the dataset and models, we can create the MultiFold object that performs the unfolding and reweighting of new datasets
omnifold = MultiFold(
"Name_of_experiment",
reco_model,
gen_model,
data, # a dataloader instance containing the measured data
mc , # a dataloader instance containing the simulation
)
The last step is to finally run the unfolding!
omnifold.Unfold()
Evaluating the Unfolded Results:
We can evaluate the reweighting function learned by OmniFold by using the reweight function
unfolded_weights = omnifold.reweight(validation_data,omnifold.model2,batch_size=1000)
These weights can be applied directly to the simulation used during the unfolding to produce the unfolded results.
Plotting the Results of the Unfolding
The omnifold package also provides a histogram functionality that you can use to plot histograms. You can use the plotting code as:
from omnifold import SetStyle, HistRoutine
SetStyle()
#Create a dictionary containing the data to plot
data_dict = {
'Distribution A': data_a,
'Distribution B': data_b,
}
HistRoutine(data_dict,'Name of the feature to plot', reference_name = 'Name of the dataset to calculate the ratio plot')
The function will create the histograms for the datasets used as part of the inputs. Specifc binnings for the histograms can be passed as numpy arrays in the binning argument, or calculated directly by the routine.
Weights can be added to the histograms by passing an additional dictionary with the same key entries and the weights to be used for each distribution. For example:
weight_dict = {
'Distribution A': weight_a,
'Distribution B': weight_b,
}
HistRoutine(data_dict,'Name of the feature to plot', reference_name = 'Name of the dataset to calculate the ratio plot', weights = weight_dict)
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file omnifold-0.1.33.tar.gz
.
File metadata
- Download URL: omnifold-0.1.33.tar.gz
- Upload date:
- Size: 16.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.9.18
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 91e6b7670239cd47de15bf181066b0d3c163510059e88729150f8a645d0e9a0c |
|
MD5 | 60cb6db351b7ee361d31cf562afd93fa |
|
BLAKE2b-256 | 68093c311cc0e871cb4076eae52bcae77981acc57ae5b55e998f855090e4f154 |
File details
Details for the file omnifold-0.1.33-py3-none-any.whl
.
File metadata
- Download URL: omnifold-0.1.33-py3-none-any.whl
- Upload date:
- Size: 17.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.9.18
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | db776a0606c9853f4403b2b21fe45ebfb1e9c20359bd319ee71029e10a8d0a93 |
|
MD5 | dde7e333ba9453f1031a8f7f488d8213 |
|
BLAKE2b-256 | bb8ee5c12fca60cd37b14a9038fd4168c2b21c82e26b14f145e8343492e94ec0 |