A package for reading and writing OMF data with pandas

omfpandas

A pandas (and parquet) interface for the Open Mining Format package (omf).

When working with OMF files, it is often useful to convert the data to a pandas DataFrame. This package provides a simple interface to do so.

Parquet is a compact, efficient columnar format for persisting pandas DataFrames. This package also provides a simple interface for converting an omf element to a parquet file. When a dataset does not fit into memory, the parquet file can be read in chunks or by column.

Note: This package only supports omf 2.0, which is currently only a pre-release.

Installation

pip install omfpandas

If you intend to use the parquet functionality, you will need to install the optional dependencies.

pip install omfpandas[io]

Roadmap

  • 0.2.0 - Add support for reading a VolumeElement (Block Model) from an OMF file as a pandas DataFrame. Export a VolumeElement as a parquet file.
  • 0.3.0 - Add support for writing a DataFrame to an OMF BlockModel. Version 2.0 of the OMF spec is supported.
  • 0.4.0 - Convert to omf 2.0 support.
  • 0.5.0 - Block model profiling, with reports persisted in the omf file.
  • 0.6.0 - Optional block model validation using pandera json schemas.
  • ...
  • 0.9.0 - Add support for low-memory/out-of-core writing of an omf element to parquet.
  • ...
