Skip to main content

F.A.S.T. package for describing datasets

Project description

pypi package pipeline status coverage report

fast-curator

Create, read and write dictionary descriptions of input datasets to process. Currently all datasets are expected to be built from sets of ROOT Trees.

Requirements

Installing

pip install --user fast-curator

Usage

# Local files:
fast_curator -o output_file_list.txt -t tree_name -d dataset_name --mc input/files/*root

# Single XROOTD files:
fast_curator -o output_file_list.txt --mc root://my.domain.with.files://input/files/one_file.root

# XROOTD files with several globs
fast_curator -o output_file_list.txt --mc root://my.domain.with.files://inp*/files/*.root

Notes:

  1. If the command is called multiple times with the same output file (using the -o option), the additional files specified will be appended to the output file.
  2. Arbitrary meta-data (such as cross-section, data quality, generator precision, etc) can be added to each dataset with the -m option.

For more guidance try the built-in help:

fast_curator --help

Reading dataset files back

import fast_curator
datasets = fast_curator.read.from_yaml("my_dataset_file.yml")

Will return a list of datasets with the default section applied to each dataset.

Further Documentation

Is on its way...

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

fast-curator-0.1.6.tar.gz (6.4 kB view hashes)

Uploaded Source

Built Distribution

fast_curator-0.1.6-py2.py3-none-any.whl (8.1 kB view hashes)

Uploaded Python 2 Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page