Small library of common functionalities used in various projects in the ratschlab
Project description
Small library of common code used in various projects in the ratschlab.
Free software: MIT license
Features
Writing parquet and HDF5 files with sensible defaults.
Support for working with ‘chunkfiles’, i.e. splitting up a large dataset in smaller chunks which can be processed independently:
Repartition records (i.e. increase or decrease number of chunkfiles) while keeping data belonging together in the same file (e.g. data with the same patient id associated)
simple indexing for looking up in which chunk to find data belonging e.g. to a patient
bigmatrix: support for creating and reading large matrices stored in HDF5 having additional metadata on the axes in form of data frames.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for ratschlab_common-0.2.0-py2.py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 5096b842720b8b2a81c4c0e960308f79455d72671c8b5fbd214a5944ed24e4f6 |
|
MD5 | 84b815f8e79a1b5e97297c0258a2e1e6 |
|
BLAKE2b-256 | f119a41a895b6e85250bd304e15550944d716acc9f329848ccdf52733f46e1e2 |