No project description provided
Project description
omfpandas
A pandas (and parquet) interface for the Open Mining Format package (omf).
When working with OMF files, it is often useful to convert the data to a pandas DataFrame. This package provides a simple interface to do so.
The parquet format is a nice, compact, efficient format to persist pandas DataFrames. This package also provides a simple interface to convert an omf element to a parquet file. When datasets do not fit into memory, parquet files can be read in chunks or by column.
Note: This package only supports omf 2.0, which is currently only a pre-release.
Installation
pip install omfpandas
If you intend to use the parquet functionality, you will need to install the optional dependencies.
pip install omfpandas[io]
Roadmap
- 0.2.0 - Add support for reading a VolumeElement (Block Model) from an OMF file as a pandas DataFrame. Export a VolumeElement as a parquet file.
- 0.3.0 - Add support for writing a DataFrame to an OMF BlockModel. Version 2.0 of the OMF spec is supported.
- 0.4.0 - Convert to omf 2.0 support.
- 0.5.0 - Block model profiling, with reports persisted in the omf file.
- 0.6.0 - Optional block model validation using pandera json schemas.
- 0.7.0 - Add support for low-memory/out-of-core writing an omf element to parquet
- ...
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for omfpandas-0.6.0-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 8c1d652b69b369da803de5d5eec498f1f092a9a7532a06a4daf08c551c847a5b |
|
MD5 | e986db711d4cfc2217567731344464d4 |
|
BLAKE2b-256 | b30ff1c27a7d7745772bf69b941127eee9c28ed447ba81ff26e5fbfeebd450c1 |