Skip to main content

C++ IO and Preprocessing package for sparse neutrino data, with H5 for IO and python bindings.

Project description

Build Status license

LArCV (Version 3)

Software framework for image(2D)/volumetric(3D) data processing with APIs to interface deep neural network open-source softwares, written in C++ with extensive Python supports. Originally developed for analyzing data from time-projection-chamber (TPC). It is now converted to be a generic tool to handle 2D-projected images and 3D-voxelized data. LArCV is particularly suitable for sparse data processing.


You can install larcv through pypi: pip install larcv and it should work. You can also build from source:

git clone
cd larcv3
git submodule update --init  # Pulls pybind11 subpackage
python build [-j 12] # Optional parallel build for faster compilation
python install [--user | -prefix ${INSTALLATION_DIR} ] 

To verify your larcv installation, after install has completed:

cd larcv3/tests
py.test .


  • Python
  • Numpy
  • HDF5 (for IO)
  • cmake (for building)
  • scikit-build (for installation)
  • pytest (for continuous integration)

HDF5 and cmake can all be installed by package managers. Conda will also work.

For compilation, a gcc > 4.8 is required. GCC versions 5 to 8 are all known to work, as is clang on MacOS.

To install requirements on ubuntu, you can do: sudo apt-get install cmake libhdf5-serial-dev python-dev pip install numpy scikit-build pytest

To install requirements on mac, you can do: sudo port install cmake hdf5 pip install numpy scikit-build pytest

To install in a generic system, you can try conda or a virtual environment. It has been shown to work on many linux distributions.


larcv3 works on mac and many flavors of linux. It has never been tested on windows as far as I know. If you try to install and need help, please open an Issue.

Use Cases

Larcv is predominantly used as an IO framework and data preprocessing tool for machine learning and deep learning. It has run on many systems and in many scenarios. Larcv has a suite of test cases available that test the serialization, read back, threaded IO tools, and distributed IO tools.

Larcv has run on some of the biggest systems in the world, including Summit (ORNL) and Theta (ANL). It has been used for distributed io of sparse, non-uniform data up to hundreds of CPUs/GPUs, and had good performance.

If you would like to use larcv for your application and want to benchmark the performance, you are welcome to use the larcv3 open dataset (more info on and if you would like help, open an issue or contact the authors directly.

Project details

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Files for larcv, version 3.3.4
Filename, size File type Python version Upload date Hashes
Filename, size larcv-3.3.4.tar.gz (857.6 kB) File type Source Python version None Upload date Hashes View
Filename, size larcv-3.3.4-cp36-cp36m-macosx_10_9_x86_64.whl (1.2 MB) File type Wheel Python version cp36 Upload date Hashes View

Supported by

AWS AWS Cloud computing Datadog Datadog Monitoring Facebook / Instagram Facebook / Instagram PSF Sponsor Fastly Fastly CDN Google Google Object Storage and Download Analytics Huawei Huawei PSF Sponsor Microsoft Microsoft PSF Sponsor NVIDIA NVIDIA PSF Sponsor Pingdom Pingdom Monitoring Salesforce Salesforce PSF Sponsor Sentry Sentry Error logging StatusPage StatusPage Status page