Skip to main content

Small library of common functionalities used in various projects in the ratschlab

Project description

https://img.shields.io/pypi/v/ratschlab_common.svg https://img.shields.io/travis/ratschlab/ratschlab_common.svg Documentation Status

Small library of common code used in various projects in the ratschlab.

  • Free software: MIT license

Features

  • Writing parquet and HDF5 files with sensible defaults.

  • Support for working with ‘chunkfiles’, i.e. splitting up a large dataset in smaller chunks which can be processed independently:

    • Repartition records (i.e. increase or decrease number of chunkfiles) while keeping data belonging together in the same file (e.g. data with the same patient id associated)

    • simple indexing for looking up in which chunk to find data belonging e.g. to a patient

  • bigmatrix: support for creating and reading large matrices stored in HDF5 having additional metadata on the axes in form of data frames.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ratschlab_common-0.1.0.tar.gz (16.2 kB view hashes)

Uploaded Source

Built Distribution

ratschlab_common-0.1.0-py2.py3-none-any.whl (2.3 kB view hashes)

Uploaded Python 2 Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page