Skip to main content

A package for reading and writing OMF data with pandas

Project description

omfpandas

PyPI Run Tests Publish Docs

A pandas (and parquet) interface for the Open Mining Format package (omf).

When working with OMF files, it is often useful to convert the data to a pandas DataFrame. This package provides a simple interface to do so.

The parquet format is a nice, compact, efficient format to persist pandas DataFrames. This package also provides a simple interface to convert an omf element to a parquet file. When datasets do not fit into memory, parquet files can be read in chunks or by column.

Note: This package only supports omf 2.0, which is currently only a pre-release.

Installation

pip install omfpandas

If you intend to use the parquet functionality, you will need to install the optional dependencies.

pip install omfpandas[io]

Roadmap

  • 0.2.0 - Add support for reading a VolumeElement (Block Model) from an OMF file as a pandas DataFrame. Export a VolumeElement as a parquet file.
  • 0.3.0 - Add support for writing a DataFrame to an OMF BlockModel. Version 2.0 of the OMF spec is supported.
  • 0.4.0 - Convert to omf 2.0 support.
  • 0.5.0 - Block model profiling, with reports persisted in the omf file.
  • 0.6.0 - Optional block model validation using pandera json schemas.
  • 0.7.0 - Add support for low-memory/out-of-core writing an omf element to parquet
  • ...

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

omfpandas-0.6.11.tar.gz (14.9 kB view details)

Uploaded Source

Built Distribution

omfpandas-0.6.11-py3-none-any.whl (18.3 kB view details)

Uploaded Python 3

File details

Details for the file omfpandas-0.6.11.tar.gz.

File metadata

  • Download URL: omfpandas-0.6.11.tar.gz
  • Upload date:
  • Size: 14.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.9.20

File hashes

Hashes for omfpandas-0.6.11.tar.gz
Algorithm Hash digest
SHA256 17e45d0a074a36b0e1c8a4b01f30ad49bf93106090104d31438191a9e243033d
MD5 ead98f24edbe39d1e42b013b98bfa533
BLAKE2b-256 e57cc7f6818b113f5c43ba4a559876cd770311fb134e837140896680f8ffd3e1

See more details on using hashes here.

File details

Details for the file omfpandas-0.6.11-py3-none-any.whl.

File metadata

  • Download URL: omfpandas-0.6.11-py3-none-any.whl
  • Upload date:
  • Size: 18.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.9.20

File hashes

Hashes for omfpandas-0.6.11-py3-none-any.whl
Algorithm Hash digest
SHA256 9f698d21bde7e724e14baf5c2d4604fd387f6d85232715e5323fbb5bba5fd175
MD5 f00c838a6dd81ebbac5eb38cce490836
BLAKE2b-256 45aa0522057e9b9e022c95d028446649ee5fd9cfc5876d75ac6cf76d798608a1

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page