lavello_mlips

A Python package for processing omol-25 data using MPI.

Installation

Install the package locally in editable mode from the repository root:

uv pip install -e .

Usage

This package provides three primary command-line interfaces:

1. Processing Data

Extract, process, and combine molecular data from an S3 bucket (or local directory):

lavello_mlips --help
  • MPI Support: Add --mpi and launch with mpirun to distribute tasks across multiple workers using hybrid RMA.
  • Smart Restart: Add --restart to scan the output directory, recover orphaned Parquet/XYZ pairs, and resume where the previous run stopped.
  • Logging: Pass --log-file my_log.log to write log output to disk. An existing log file is appended to, not overwritten.
  • Batch Flushing: Use --batch-size N to control how often results are written to disk. If it is not specified, workers flush dynamically at 1% increments, with a minimum of 100 output structures.
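
As a rough illustration of the default flushing rule, the sketch below computes how many structures a worker might buffer before writing. It is not taken from the package source: the function name flush_threshold and the assumption that "1% increments" means 1% of the worker's total structure count (rounded up) are hypothetical.

```python
def flush_threshold(total_structures, batch_size=None):
    """Return the number of structures to buffer before flushing to disk.

    Hypothetical helper: assumes the default is 1% of the total,
    rounded up, and never fewer than 100 structures.
    """
    if batch_size is not None:
        if batch_size < 1:
            raise ValueError("batch_size must be a positive integer")
        # An explicit --batch-size N overrides the dynamic default.
        return batch_size
    one_percent = (total_structures + 99) // 100  # ceiling division by 100
    return max(100, one_percent)


print(flush_threshold(50_000))                # 1% of 50,000
print(flush_threshold(2_000))                 # 1% would be 20, so the minimum applies
print(flush_threshold(2_000, batch_size=25))  # explicit value wins
```

Under these assumptions, small jobs always flush in batches of 100, and the 1% rule only takes over once a worker handles more than 10,000 structures.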

2. Downloading Raw Data

Download the original raw orca.out files from S3 without running any processing on them:

download_omol25 --help

3. Verification Utility

Cross-reference a generated Parquet dataset with its corresponding ExtXYZ file to check for data corruption or structural mismatches:

verify_processed_omol25 --parquet props_group.parquet --extxyz structs_group.xyz
  • The tool aligns both tables on geom_sha1 and flags any properties assigned to the wrong structure.
  • Embedded timing metadata such as process_time_s is always excluded from the comparison to prevent false-positive errors.
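
The alignment described above can be pictured with a small, library-free sketch. It is an assumption about the general approach, not the package's implementation: rows are modelled as plain dictionaries, and the function name compare_by_hash and the example property energy are hypothetical. Only geom_sha1 and process_time_s come from the description above.

```python
EXCLUDED_KEYS = {"process_time_s"}  # timing metadata, never compared


def compare_by_hash(parquet_rows, extxyz_rows, key="geom_sha1"):
    """Align two lists of row dicts on `key` and list any disagreements."""
    left = {row[key]: row for row in parquet_rows}
    right = {row[key]: row for row in extxyz_rows}
    problems = []
    # Structures present on one side only.
    for h in sorted(left.keys() - right.keys()):
        problems.append((h, "missing from ExtXYZ"))
    for h in sorted(right.keys() - left.keys()):
        problems.append((h, "missing from Parquet"))
    # Structures present on both sides: compare the shared properties.
    for h in sorted(left.keys() & right.keys()):
        shared = (left[h].keys() & right[h].keys()) - EXCLUDED_KEYS - {key}
        for name in sorted(shared):
            if left[h][name] != right[h][name]:  # exact comparison, for simplicity
                problems.append((h, f"mismatch in {name}"))
    return problems


parquet = [
    {"geom_sha1": "a", "energy": -1.0, "process_time_s": 0.5},
    {"geom_sha1": "b", "energy": -2.0},
]
extxyz = [
    {"geom_sha1": "a", "energy": -1.0, "process_time_s": 0.9},
    {"geom_sha1": "b", "energy": -2.5},
]
print(compare_by_hash(parquet, extxyz))  # [('b', 'mismatch in energy')]
```

Structure "a" passes even though its timing values differ, because process_time_s is excluded. A real check on floating-point properties would likely use a tolerance instead of exact equality.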

License

This project is licensed under the BSD 3-Clause License - see the LICENSE file for details.
