Determine how to group together input files into batches for subsequent concatenation

Project description

Project Status: Active. The project has reached a stable, usable state and is being actively developed.

Overview

Batchee is a Python package that groups filenames into batches based on patterns in the names, so that batch operations such as concatenation can be run on each group.

What does it do?

Batchee analyzes filename patterns and groups related files together. For example (note that these are illustrative names, not actual TEMPO file names):

batchee TEMPO_NO2_L2_S006G01.nc TEMPO_NO2_L2_S006G02.nc TEMPO_NO2_L2_S007G08.nc TEMPO_NO2_L2_S007G09.nc

Output:

  • TEMPO_NO2_L2_S006G01.nc, TEMPO_NO2_L2_S006G02.nc → Group 1 (scan 6)
  • TEMPO_NO2_L2_S007G08.nc, TEMPO_NO2_L2_S007G09.nc → Group 2 (scan 7)

This enables batch processing operations on each group separately.
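To make the grouping idea concrete, here is a minimal sketch that reproduces the example above with a regular expression. It is not Batchee's implementation or API: the pattern, the `SCAN_GRANULE` name, and the `group_by_scan` helper are assumptions made for illustration, based only on the illustrative file names shown.

```python
import re
from collections import defaultdict

# Hypothetical pattern: "S" + 3-digit scan number, then "G" + 2-digit granule number.
# It mirrors the illustrative names above and is not taken from Batchee's source.
SCAN_GRANULE = re.compile(r"_S(?P<scan>\d{3})G(?P<granule>\d{2})\.nc$")


def group_by_scan(filenames):
    """Group filenames by scan number, keeping input order within each group."""
    groups = defaultdict(list)
    for name in filenames:
        match = SCAN_GRANULE.search(name)
        if match is None:
            raise ValueError(f"filename does not match the expected pattern: {name}")
        groups[int(match.group("scan"))].append(name)
    # Return the batches ordered by scan number.
    return [groups[scan] for scan in sorted(groups)]


files = [
    "TEMPO_NO2_L2_S006G01.nc",
    "TEMPO_NO2_L2_S006G02.nc",
    "TEMPO_NO2_L2_S007G08.nc",
    "TEMPO_NO2_L2_S007G09.nc",
]
batches = group_by_scan(files)
for index, batch in enumerate(batches, start=1):
    print(f"Group {index}: {', '.join(batch)}")
```

Each inner list can then be handed to a downstream step (for example, a concatenation tool) independently of the others.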

Key Features

  • Automatic filename grouping based on configurable patterns
  • Command-line interface and Python API for integration with the NASA Harmony service orchestrator
  • Verbose logging for debugging

Installation

From PyPI (Recommended)

pip install batchee

From Source (Development)

For local development or the latest features:

git clone <Repository URL>
cd batchee

(Option A) using poetry (Recommended for development):

# Install poetry: https://python-poetry.org/docs/
poetry install

(Option B) using pip:

pip install .

Usage

Basic Usage

batchee [file_names ...]

With Poetry (if installed via poetry)

poetry run batchee [file_names ...]

Options

  • -h, --help - Show help message and exit
  • -v, --verbose - Enable verbose output to stdout; useful for debugging
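When calling the command-line interface from a script, the invocation can be assembled as an argument list. The sketch below uses only the `batchee` command and the `--verbose` flag documented above; `build_batchee_command` is a hypothetical helper, not part of Batchee, and the file names are the illustrative ones from the earlier example. The command is run only if a `batchee` executable is found on the PATH.

```python
import shutil
import subprocess


def build_batchee_command(file_names, verbose=False):
    """Return the argument list for a batchee CLI call (hypothetical helper)."""
    command = ["batchee"]
    if verbose:
        command.append("--verbose")
    command.extend(str(name) for name in file_names)
    return command


command = build_batchee_command(
    ["TEMPO_NO2_L2_S006G01.nc", "TEMPO_NO2_L2_S006G02.nc"], verbose=True
)

# Run the command only when the batchee executable is available.
if shutil.which("batchee") is not None:
    result = subprocess.run(command, capture_output=True, text=True, check=False)
    print(result.stdout)
else:
    print("batchee not found on PATH; would run:", " ".join(command))
```

Passing the arguments as a list avoids shell quoting problems when file names contain spaces.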

Contributing

Issues and pull requests are welcome on GitHub.

License & Attribution

Batchee is released under the Apache License 2.0.

This package is covered by NASA Software Release Authorization (SRA) # LAR-20440-1.

Download files

Source Distribution

batchee-1.5.2.tar.gz (16.3 kB)

Uploaded Source

Built Distribution

batchee-1.5.2-py3-none-any.whl (19.5 kB)

Uploaded Python 3

File details

Details for the file batchee-1.5.2.tar.gz.

File metadata

  • Download URL: batchee-1.5.2.tar.gz
  • Size: 16.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.1.3 CPython/3.12.3 Linux/6.11.0-1018-azure

File hashes

Hashes for batchee-1.5.2.tar.gz
Algorithm     Hash digest
SHA256        558f47aed960459e9c853baeeb77b6e2abd5656aae4a636defd3bac6fc2da31a
MD5           d00638467d4fe83f838ef0fa9b9857d2
BLAKE2b-256   3614b55ca95ee479b71c6ce09a861ba9428864df24998843b2e0628ca254d206

File details

Details for the file batchee-1.5.2-py3-none-any.whl.

File metadata

  • Download URL: batchee-1.5.2-py3-none-any.whl
  • Size: 19.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.1.3 CPython/3.12.3 Linux/6.11.0-1018-azure

File hashes

Hashes for batchee-1.5.2-py3-none-any.whl
Algorithm     Hash digest
SHA256        43a6060b3bbbde05e75b57c6e41992a4063f9ded5c49fd7b65fc2ac290ae8ad5
MD5           75933afc7f392442aed5e3a09d39eee5
BLAKE2b-256   fbf71344df02715888879e8b0ee542bcfb0bcda9c71de1de90e1a7064f02d6e5
