Skip to main content

tools for accessing large amounts of genomic data

Project description

by Michael Hoffman <michael.hoffman at utoronto dot ca>

Latest Version


Genomedata is both a format for efficient storage of multiple tracks of numeric data anchored to a genome and a python interface to genomic datasets. The file format allows fast random access to hundreds of gigabytes of data, while retaining a small disk space footprint. We have also developed utilities to load data into this format.

Specifically, the genomedata package provides access to genome-scale data, either using an HDF5 container or a bigWig file.

Please see the following URL (and linked documentation) for information, installation, and support:


Live documentation based on this repository can be found on Read the Docs.


Genomedata is free software: you can redistribute it and/or modify it under the terms of version 2 of the GNU General Public License as published by the Free Software Foundation.

Genomedata is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.

Project details

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

genomedata-1.7.2.tar.gz (1.9 MB view hashes)

Uploaded Source

Built Distribution

genomedata-1.7.2-cp39-abi3-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (1.2 MB view hashes)

Uploaded CPython 3.9+ manylinux: glibc 2.17+ x86-64

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page