Sample Nuclear Data Files

Project description

NuclearDataSampler

Welcome to NuclearDataSampler, a Python-based code aiming to randomly sample evaluated nuclear data files (ENDF) under well-defined, minimal assumptions. This project is part of an effort to explore and compare different approaches to uncertainty quantification (UQ) in nuclear data, including classic sensitivity-based methods and the more direct “Total Monte Carlo” (TMC) approach. This project was made possible by the efficient handling of nuclear data files using ENDFtk and its C++/Python bindings.

:one: Dependencies and Installation

This project relies on ENDFtk for reading and writing ENDF files:

ENDFtk GitHub Repository

Before installing NuclearDataSampler, you should ensure that ENDFtk is installed and available in your environment. Please see the ENDFtk repository for instructions on building or installing ENDFtk.

Installation

You can install NuclearDataSampler via pip:

pip install NuclearDataSampler

If you want to actively develop or contribute to the project, clone this repository and install in “editable” mode:

git clone https://github.com/Pierresole/NuclearDataSampler.git
cd NuclearDataSampler
pip install -e .

This will let you edit the code locally and directly test your changes without reinstalling.

:two: Progress Overview

Perturbed Parameters	Status	Comment
Thermal Parameters	:white_check_mark:	LEAPR inputs
Resonance Parameters
- URR	:white_check_mark:
- MLBW
- RM	:white_check_mark:
- RML	:white_check_mark:
Cross Sections (groupwise)	:x:	Interpolation(1)

(1) To perturb a MF3 based on its MF33, it is necessary to create an easily interpolable XS. This has been done. What is left is to think of the code organization and the way of perturbing composed cross sections.

:three: The Core Idea of NuclearDataSampler

Input: You provide an ENDF file that contains both the nominal (mean) parameter values and the associated covariance matrix.
Sampling: We draw random samples from the multivariate Gaussian distribution parameterized by that mean vector and covariance matrix—no additional re-interpretation or modeling assumptions are introduced.
Output: Each random draw updates the relevant sections of the ENDF file, producing a new, consistent ENDF file that reflects one realization of the underlying uncertainties.

By construction, this approach works at the evaluated nuclear data level, avoiding additional data-format conversions or embedded nuclear reaction model assumptions. This makes it simple to compare with or feed into other downstream codes.

Note: A mean vector and a covariance matrix uniquely define a (multivariate) Gaussian distribution. By specifying only these two ingredients, we are implicitly stating that uncertainties follow a normal distribution in parameter space. Any more complicated shape would require higher-order moments or parametric expansions, and at least something to verify our hypothesis or some guidance from evaluated distributions.

Many researchers in the field of UQ have acknowledged the effectiveness of using of LHS sampling, which we have applied in NDSampler and LEAPRSampler.

:four: Context

UQ in Particle Transport Physics

Uncertainty Quantification (UQ) in particle transport typically involves answering the question: “How do the uncertainties in nuclear cross sections, resonance parameters, and other fundamental data propagate to engineering or physics parameters of interest (e.g., reaction rates, keff in reactor calculations, neutron and neutrinos flux spectrum)?”

Two common strategies to address this question are:

Sensitivity-based approaches: Perturb the inputs systematically using partial derivatives or adjoint solutions to infer the output uncertainties. These methods often rely on linear approximations and can have difficulties with strongly non-linear or resonance-dominated processes.
Total Monte Carlo (TMC): Re-sample the nuclear data themselves many times (often from an assumed probability distribution, typically a multivariate Gaussian) and compute the transport problem to statistically evaluate the distribution of the output. This direct approach is computationally more intensive, but it does not rely on linearity assumptions or pre-computed sensitivities.

NuclearDataSampler focuses on the TMC idea: given an ENDF file and its uncertainty information (covariances), produce randomized ENDF files ready for downstream usage—without imposing any additional assumptions or re-interpretations.

State of the Art and Motivation

Several tools exist for generating or processing perturbed nuclear data files, each with its own set of assumptions and workflows. For example:

TALYS (1) A comprehensive nuclear reaction model code that can generate cross sections, angular distributions, and other observables for many projectiles, targets, and reaction channels. TALYS includes various nuclear structure models (optical models, level densities, gamma strength functions, etc.). One typical usage is to sample model parameters (e.g., nuclear level density parameters, optical model potential parameters) multiple times, generate numerous “realizations” of cross sections, and compare these with experimental data at certain steps. This effectively yields an ensemble of cross-section evaluations.
SANDY (2) A Python package focused on sampling and analyzing nuclear data uncertainties. It can produce perturbed nuclear data in PENDF formats using the processing code NJOY.
FRENDY (3) A data processing system designed to read, process, and produce ACE-formatted data from evaluated nuclear data libraries. FRENDY also provides modules to handle uncertainties.

Despite these codes’ capabilities, many times a user just needs a straightforward way to:

Take an existing ENDF file that comes with mean values and a covariance matrix.
Sample new sets of evaluated data from a well-defined multivariate Gaussian distribution (mean vector + covariance matrix).
Output new ENDF files with minimal additional assumptions or format transformations.

That is exactly what NuclearDataSampler aims to do.

What motivated this code is a simple but faithful treatment of resonance parameters (RP) uncertainty that are lumped into group cross sections (XS), especially in SANDY and FRENDY. A very foundational code to study the effect of lumping RP uncertainties into group XS uncertainties is ENDSAM (4) developed at JSI.

ENDSAM is able to generate random files but was primarily developed to check whether the relative uncertainty of certain parameters is too high, and if so, verify if their covariance matrix is mathematically correct (log-normal transformation).

The conclusions drawn from ENDSAM results and the discussions it triggered in the community (5) led to NuclearDataSampler to avoid backend interpretation. Namely if positive parameters are sampled negatively, any backend patch will not be verifiably correct without access to the statistical evaluated distribution. However, if the patch is "acceptable," it may be applied. Similarly, if the covariance matrix is not positive definite and the problematic eigenvalues are significantly negative, one should not clip them to a positive value.

Keep cool and call an evaluator

Diagnosing and communicating such issues is essential for the evaluation of codes. Conversely, "screwdriver-ing" or "quick fixes" to make things work can obscure more serious underlying problems.

References :link: :

(1) Koning, A.J., Hilaire, S., and Goriely, S., NRG CEA ULB and IAEA TALYS Repository.

(2) Fiorito, L., SCK-CEN, SANDY Repository.

(3) Tada, K., JAEA, FRENDY Repository.

(4) Plevnik, L. and Žerovnik, G., JSI, "Computer code ENDSAM for random sampling and validation of the resonance parameters covariance matrices of some major nuclear data libraries", Annals of Nuclear Energy, 2016.
DOI: 10.1016/j.anucene.2016.04.026

(5) Taavitsainen, A. and Vanhanen, R., "On the maximum entropy distributions of inherently positive nuclear data", 2017, Aalto University School of Science, Finland. DOI: 10.1016/j.nima.2016.11.061

The (Multivariate) Gaussian Assumption

When we talk about a mean vector $\mu$ and a covariance matrix $\Sigma$, we are specifying a multivariate Gaussian (normal) distribution:

$p(\mathbf{x}) = \frac{1}{\sqrt{(2\pi)^n \det(\Sigma)}} \exp\left(-\frac{1}{2} (\mathbf{x} - \mu)^\top \Sigma^{-1} (\mathbf{x} - \mu)\right).$

$\mu \in \mathbb{R}^n$ is the vector of means for the $n$ parameters.
$\Sigma \in \mathbb{R}^{n \times n}$ is the covariance matrix describing pairwise correlations between parameters.

By definition, only specifying $\mu$ and $\Sigma$ means that the distribution is exactly Gaussian. Any higher-order "shape" information (e.g., skewness, kurtosis, etc.) is zero for a perfect Gaussian.

If the physical reality demands more complex distributions, we must add more parameters or move beyond the Gaussian assumption. NuclearDataSampler is developed to be flexible enough to be used with advanced relationship models like Copulas. Sampling from more complicated dependencies and laws is straightforward in Python, but the first move should come from evaluation codes, which should be able to communicate the parameters' distributions.

:five: Contributing :construction_worker:

Contributions are welcome—whether it’s adding new features, fixing bugs, or improving documentation.

There is still work to be done to address the six types of uncertainties present in nuclear data files: neutron multiplicities, resonance parameters, multigroup cross sections, angular distributions, energy distributions, and fission spectra.

Feel free to explore and adapt the TMC approach with this simple sampling tool. We hope it simplifies your workflows and fosters further comparisons with other uncertainty quantification methods!

Project details

Release history Release notifications | RSS feed

This version

0.0.1

Feb 3, 2025

0.0.0

Jan 29, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

nucleardatasampler-0.0.1.tar.gz (6.8 kB view details)

Uploaded Feb 3, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

NuclearDataSampler-0.0.1-py3-none-any.whl (6.7 kB view details)

Uploaded Feb 3, 2025 Python 3

File details

Details for the file nucleardatasampler-0.0.1.tar.gz.

File metadata

Download URL: nucleardatasampler-0.0.1.tar.gz
Upload date: Feb 3, 2025
Size: 6.8 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.11.11

File hashes

Hashes for nucleardatasampler-0.0.1.tar.gz
Algorithm	Hash digest
SHA256	`79150f94c0118a545160cf4176b39a6da8baa0ef3f578359c3ac57b2dac17dbd`
MD5	`40c5855f945e4072c270625c570b2968`
BLAKE2b-256	`6011659ce18efcbc23b2ca9b970509c308fd29bda7b4ce461614c2de8bc33223`

See more details on using hashes here.

File details

Details for the file NuclearDataSampler-0.0.1-py3-none-any.whl.

File metadata

Download URL: NuclearDataSampler-0.0.1-py3-none-any.whl
Upload date: Feb 3, 2025
Size: 6.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.11.11

File hashes

Hashes for NuclearDataSampler-0.0.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`8f9f08b4d05c863a0b192174d6fe573a60b8a1022d0a466de4dc1dfd38e23516`
MD5	`47aab92eb9cf393757b1642a13a4fea5`
BLAKE2b-256	`3975f95e53a13fff4fd9a208b11e6e732bb2fc010f1581989b28a68d7dbc98d2`

See more details on using hashes here.

NuclearDataSampler 0.0.1

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

NuclearDataSampler

:one: Dependencies and Installation

Installation

:two: Progress Overview

:three: The Core Idea of NuclearDataSampler

:four: Context

UQ in Particle Transport Physics

State of the Art and Motivation

The (Multivariate) Gaussian Assumption

:five: Contributing :construction_worker:

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes