Skip to main content

An experiment workflow and organization tool

Project description

Curifactory

Code style: black PyPI version Conda Version status tests Python versions License

Curifactory is a library and CLI tool designed to help organize and manage research experiments in python.

screenshot flow

Experiment management must fulfill several tasks, including experiment orchestration, parameterization, caching, reproducibility, reporting, and parallelization. Existing projects such as MLFlow, MetaFlow, Luigi, and Pachyderm support these tasks in several different ways and to various degrees. Curifactory provides a different opinion, with a heavier focus on supporting general research experiment workflows for individuals or small teams working primarily in python.

You can read more about these design principles in our paper in the SciPy 2022 proceedings.

Features

  • Adds a CLI layer on top of your codebase, a single entrypoint for running experiments
  • Automatic caching of intermediate data and lazy loading of stored objects
  • Jupyter notebook output for further exploration of an experiment run
  • Docker container output with copy of codebase, conda environment, full experiment run cache, and jupyter run notebook
  • HTML report output from each run with graphviz-rendered diagram of experiment
  • Easily report plots and values to HTML report
  • Configuration files are python scripts, allowing programmatic definition, parameter composition, and parameter inheritance
  • Output logs from every run
  • Run experiments directly from CLI or other python code, notebooks, etc.

Installation

pip install curifactory

Graphviz is required for certain features and can be installed through conda via:

conda install python-graphviz

Tab completion

For tab-completion in bash/zsh, install the argcomplete package (if using curifactory inside a conda environment, you'll need to install this in your system python.)

pip install argcomplete

To enable, you can either use argcomplete's global hook activate-global-python-argcomplete, which will enable tab complete on all argcomplete-enabled python packages (e.g. pytest), or you can add eval "$(register-python-argcomplete experiment)" to your shell's rc file. Curifactory can add this line for you automatically with:

curifactory completion [--bash|--zsh]  # use the shell flag appropriate

Once enabled, the experiment command will provide tab complete for experiment names, parameter names, and flags.

Requirements

OS: We primarily develop and test Curifactory on Linux, but it runs on Windows and MacOS as well.

Python: 3.9-3.11

Optional:

  • Conda/Mamba
  • Graphviz
  • Docker
  • Jupyter notebook/lab

Documentation

The documentation for the latest version of Curifactory can be found at: https://ornl.github.io/curifactory/stable/index.html.

Examples

Several small example can be found in the examples folder. examples/notebooks includes walkthroughs demonstrating usage of curifactory solely in Jupyter.

Citation

Please use the following BibTeX if citing this project:

@article{Martindale2023,
  doi = {10.21105/joss.05793},
  url = {https://doi.org/10.21105/joss.05793},
  year = {2023},
  publisher = {The Open Journal},
  volume = {8},
  number = {90},
  pages = {5793},
  author = {Nathan Martindale and Scott L. Stewart and Jason Hite and Mark B. Adams},
  title = {Curifactory: A research experiment manager},
  journal = {Journal of Open Source Software}
}

Similar Projects

Curifactory is one tool and one opinion among many, other projects that have similar goals and/or approaches:

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

curifactory-0.20.1.tar.gz (125.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

curifactory-0.20.1-py3-none-any.whl (100.1 kB view details)

Uploaded Python 3

File details

Details for the file curifactory-0.20.1.tar.gz.

File metadata

  • Download URL: curifactory-0.20.1.tar.gz
  • Upload date:
  • Size: 125.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.2

File hashes

Hashes for curifactory-0.20.1.tar.gz
Algorithm Hash digest
SHA256 6137e5178b98846ebf6ed7f512c7d264cec5768dc80f5950ead9d318cb15942e
MD5 62f1ef1b197a255331f6ae84db5fd31d
BLAKE2b-256 70ad116e8040789271a356048e5139b86708030e05d5b5c135b71719e1e78418

See more details on using hashes here.

File details

Details for the file curifactory-0.20.1-py3-none-any.whl.

File metadata

  • Download URL: curifactory-0.20.1-py3-none-any.whl
  • Upload date:
  • Size: 100.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.2

File hashes

Hashes for curifactory-0.20.1-py3-none-any.whl
Algorithm Hash digest
SHA256 bc308e375dcb3a803a8702ad08c3396267a4860fcb24bc80f5bda4c5774dd823
MD5 e13b17c9067a03c098364f797d061ad2
BLAKE2b-256 4c3385cec96acd10936808b45d08bd5db1234146e9f08e6c57a558acf9a58ed5

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page