Profiling and benchmarking tools for Vivarium simulations.
Installation

vivarium-profiling is published on PyPI as part of the vivarium-suite monorepo:

pip install vivarium-profiling

For local development against the monorepo source, see the monorepo README at https://github.com/ihmeuw/vivarium-suite.

Supported Python versions: 3.10, 3.11

HDF5 backing storage. Vivarium uses the Hierarchical Data Format (HDF5) for its data artifacts, and the system libraries needed to read these files may not be present on your machine. If you encounter HDF5-related errors, install the tooling into your conda environment:

conda install hdf5

git-lfs and data artifacts. When cloning the monorepo, large data artifacts are stored via git-lfs. A clone that completes very quickly likely fetched only the checksum files rather than the artifacts themselves, and your simulations will fail. If you suspect this happened, pull the data explicitly:

git-lfs pull

Source layout

The package lives at src/vivarium/profiling/ and provides these subpackages:

  • components - custom Vivarium components used by the profiling models

  • constants - project-level constants and metadata

  • data - artifact builder helpers

  • model_specifications - model spec YAMLs (e.g. model_spec_scaling.yaml)

  • plugins - the MultiComponentParser plugin

  • templates - Jupyter notebook templates for analysis

  • tools - the CLI entry points (profile_sim, run_benchmark, summarize, make_artifacts)

Profiling and Benchmarking

This repository provides tools for profiling and benchmarking Vivarium simulations to analyze their performance characteristics. See the tutorials at https://vivarium.readthedocs.io/en/latest/tutorials/running_a_simulation/index.html and https://vivarium.readthedocs.io/en/latest/tutorials/exploration.html for general instructions on running simulations with Vivarium.

Configuring Scaling Simulations

This repository includes a custom MultiComponentParser plugin that allows you to easily create scaling simulations by defining multiple instances of diseases and risks using a simplified YAML syntax.

To use the parser, add it to your model specification:

plugins:
    required:
        component_configuration_parser:
            controller: "vivarium.profiling.plugins.parser.MultiComponentParser"

Then use the causes and risks multi-config blocks:

Causes Configuration

Define multiple disease instances with automatic numbering:

components:
    causes:
        lower_respiratory_infections:
            number: 4          # Creates 4 disease instances
            duration: 28       # Disease duration in days
            observers: True    # Auto-create DiseaseObserver components

This creates components named lower_respiratory_infections_1, lower_respiratory_infections_2, etc., each with its own observer if enabled.
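The numbering behavior can be illustrated with a small Python sketch. This is a hypothetical helper mimicking the expansion, not the actual MultiComponentParser code, and the observer naming shown is an assumption:

```python
def expand_components(name: str, number: int, observers: bool) -> list[str]:
    """Mimic the multi-config expansion: produce numbered component names,
    optionally paired with one observer per disease instance."""
    components = []
    for i in range(1, number + 1):
        instance = f"{name}_{i}"
        components.append(instance)
        if observers:
            # Hypothetical observer naming, for illustration only.
            components.append(f"observer.{instance}")
    return components

print(expand_components("lower_respiratory_infections", 4, observers=True))
```

Each numbered instance is an independent component, so downstream configuration (such as a risk's affected_causes block) can target the first N instances by count.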

Risks Configuration

Define multiple risk instances and their effects on causes:

components:
    risks:
        high_systolic_blood_pressure:
            number: 2
            observers: False    # Set False for continuous risks
            affected_causes:
                lower_respiratory_infections:
                    effect_type: nonloglinear
                    measure: incidence_rate
                    number: 2   # Affects first 2 LRI instances

        unsafe_water_source:
            number: 2
            observers: True     # Set True for categorical risks
            affected_causes:
                lower_respiratory_infections:
                    effect_type: loglinear
                    number: 2

See model_specifications/model_spec_scaling.yaml for a complete working example of a scaling simulation configuration.

Running Benchmark Simulations

The profile_sim command profiles the runtime and memory usage of a single run of a Vivarium model, given a model specification file. The underlying model can be any Vivarium-based model, including the scaling simulations described above as well as models from a separate repository. In addition to the standard simulation outputs, the command generates profiling data whose format depends on the chosen profiler backend. By default, runtime profiling is performed with cProfile, but you can also use scalene for more detailed call-stack analysis.
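The default cProfile backend follows the standard-library pattern sketched below. This is a generic illustration of how cProfile collects runtime data, not profile_sim's actual implementation; run_simulation is a stand-in workload:

```python
import cProfile
import io
import pstats


def run_simulation():
    # Stand-in for a Vivarium simulation run; any workload profiles the same way.
    total = 0
    for i in range(100_000):
        total += i * i
    return total


profiler = cProfile.Profile()
profiler.enable()
run_simulation()
profiler.disable()

# Report the top entries sorted by cumulative time, the same ordering
# benchmarking tools typically use to surface bottlenecks.
stream = io.StringIO()
pstats.Stats(profiler, stream=stream).sort_stats("cumulative").print_stats(5)
print(stream.getvalue())
```

The resulting pstats data is what downstream summarization steps mine for per-function timings.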

The run_benchmark command runs multiple iterations of one or more model specifications and compares the results. It requires at least one baseline model and accepts any number of 'experiment' models, which can be passed via glob patterns, to benchmark against the baseline. You can configure the number of runs separately for the baseline and experiment models. The command aggregates the profiling results and generates summary statistics and visualizations for a default set of important function calls, helping to identify performance bottlenecks.
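The aggregation across runs can be sketched with the standard library's statistics module. This is an illustrative reimplementation of the kind of summary the tooling produces (mean, median, std, min, max, and percent difference from baseline), not the package's own code, and the metric names are assumptions:

```python
import statistics


def summarize_metric(baseline: list[float], experiment: list[float]) -> dict:
    """Aggregate repeated runs of one metric and report the experiment's
    percent difference from the baseline mean."""
    base_mean = statistics.mean(baseline)
    exp_mean = statistics.mean(experiment)
    return {
        "baseline_mean": base_mean,
        "experiment_mean": exp_mean,
        "experiment_median": statistics.median(experiment),
        "experiment_std": statistics.stdev(experiment),
        "experiment_min": min(experiment),
        "experiment_max": max(experiment),
        "pct_diff_from_baseline": 100 * (exp_mean - base_mean) / base_mean,
    }


# Example: three baseline runtimes vs. three experiment runtimes (seconds).
print(summarize_metric([10.0, 11.0, 12.0], [13.0, 14.0, 15.0]))
```

Comparing means with a percent difference is what makes a baseline model mandatory: every experiment metric is expressed relative to it.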

The command creates a timestamped directory containing:

  • benchmark_results.csv: Raw profiling data for each run

  • summary.csv: Aggregated statistics (automatically generated)

  • performance_analysis.png: Performance charts (automatically generated)

  • Additional analysis plots for runtime phases and bottlenecks

Analyzing Benchmark Results

The summarize command processes benchmark results and creates visualizations. This runs automatically after run_benchmark, but can also be run manually for custom analysis after the fact.

By default, this creates the following files in the specified output directory:

  • summary.csv: Aggregated statistics with mean, median, std, min, max for all metrics, plus percent differences from baseline

  • performance_analysis.png: Runtime and memory usage comparison charts

  • runtime_analysis_*.png: Individual phase runtime charts (setup, run, etc.)

  • bottleneck_fraction_*.png: Bottleneck fraction scaling analysis

Passing the --nb flag additionally generates an interactive Jupyter notebook containing the same default plots and summary dataframe; in that case the command also writes an analysis.ipynb file to the output directory.

Customizing Result Extraction

By default, the benchmarking tools extract standard profiling metrics:

  • Simulation phases: setup, initialize_simulants, run, finalize, report

  • Common bottlenecks: gather_results, pipeline calls, population views

  • Memory usage and total runtime

You can customize which metrics to extract by creating an extraction config YAML file. See extraction_config_example.yaml for a complete annotated example.

Basic Pattern Structure:

patterns:
  - name: my_function          # Logical name for the metric
    filename: my_module.py     # Source file containing the function
    function_name: my_function # Function name to match
    extract_cumtime: true      # Extract cumulative time (default: true)
    extract_percall: false     # Extract time per call (default: false)
    extract_ncalls: false      # Extract number of calls (default: false)

This YAML file can then be passed to the run_benchmark and summarize commands via the --extraction_config flag. summarize will automatically create runtime analysis plots for the specified functions.
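The matching a pattern entry describes can be sketched directly against raw pstats data: cProfile keys each entry by (filename, line, function name), so filename plus function_name is enough to locate it. This is a simplified illustration, not the package's extractor:

```python
import cProfile
import pstats


def my_function():
    # Toy workload standing in for a function named in the extraction config.
    return sum(i * i for i in range(10_000))


profiler = cProfile.Profile()
profiler.runcall(my_function)
stats = pstats.Stats(profiler)


def extract_cumtime(stats: pstats.Stats, filename: str, function_name: str) -> float:
    """Return cumulative time for the first stats entry whose source file and
    function name match, mirroring a pattern with extract_cumtime: true."""
    # Each stats entry maps (path, line, func) -> (cc, ncalls, tottime, cumtime, callers).
    for (path, _line, func), (_cc, _nc, _tt, cumtime, _callers) in stats.stats.items():
        if path.endswith(filename) and func == function_name:
            return cumtime
    raise KeyError(f"{filename}:{function_name} not found in profile")


print(extract_cumtime(stats, my_function.__code__.co_filename, "my_function"))
```

The other pattern options correspond to the remaining fields of the same stats tuple: ncalls to the call count and percall to time divided by calls.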
