Efficient Storage and Querying of Linkage Disequilibrium Matrices

Setup

Install the package with:

pip install ldmat

Getting Started

The package ships with some example files, which you can find in your venv directory under ldmat/examples.
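If you are not sure where your environment put these files, a short script can list the likely locations. This is a minimal sketch, not part of ldmat: `candidate_example_dirs` is a hypothetical helper, and the two locations it checks (directly under the environment root, and inside the installed package directory) are assumptions about how the examples are laid out.

```python
import importlib.util
import sys
from pathlib import Path


def candidate_example_dirs(package: str = "ldmat") -> list[Path]:
    """Return places where <package>/examples might live (hypothetical helper)."""
    # Assumption 1: directly under the environment root, e.g. VENV/ldmat/examples.
    candidates = [Path(sys.prefix) / package / "examples"]

    # Assumption 2: inside the installed package directory, if it is importable.
    spec = importlib.util.find_spec(package)
    if spec is not None and spec.submodule_search_locations:
        for location in spec.submodule_search_locations:
            candidates.append(Path(location) / "examples")
    return candidates


if __name__ == "__main__":
    # Report which of the candidate directories actually exist.
    for path in candidate_example_dirs():
        status = "found" if path.is_dir() else "missing"
        print(f"{status}: {path}")
```

Run it with the Python interpreter of the environment where you installed ldmat, so that `sys.prefix` points at that environment.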

The unprocessed LD matrices are too large to include here, but there is a sample processed matrix, chr21_partial.h5, which covers chromosome 21 from position 13000001 to 22000001 (there is no data between positions 13000001 and 14000001). This file contains all LD values greater than 0.1, rounded to 2 decimal places.
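To make the "greater than 0.1, rounded to 2 decimals" rule concrete, here is a toy sketch of that kind of filtering. It is not ldmat's implementation: `sparsify` is a hypothetical function, the matrix values are made up, and the comparison follows the wording above literally (the text does not say whether the real threshold applies to raw or absolute values).

```python
def sparsify(ld, threshold=0.1, decimals=2):
    """Keep entries strictly greater than `threshold`, rounded to `decimals` places.

    Returns a dict mapping (row, column) to the rounded value.
    """
    kept = {}
    for i, row in enumerate(ld):
        for j, value in enumerate(row):
            if value > threshold:
                kept[(i, j)] = round(value, decimals)
    return kept


# Made-up 3x3 matrix for illustration only; not real LD data.
toy = [
    [1.0, 0.834, 0.05],
    [0.834, 1.0, 0.267],
    [0.05, 0.267, 1.0],
]

sparse = sparsify(toy)
for (i, j), value in sorted(sparse.items()):
    print(i, j, value)
```

The two 0.05 entries fall below the threshold and are dropped, which is why a thresholded matrix can take far less space than the dense original.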

For a sample query, we can extract the square submatrix of positions 14300001 through 14400001 with the following command:

ldmat submatrix YOUR_VENV_DIRECTORY/ldmat/examples/chr21_partial.h5 \
--i-start 14300001 --i-end 14400001

This prints the result as a pandas DataFrame. To save the result, add an output file with -o:

ldmat submatrix ... -o YOUR_OUTPUT.csv
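If you want to script several queries, you can assemble the same command line from Python. This sketch uses only the flags shown above (`--i-start`, `--i-end`, `-o`) and assumes they can be combined in one call. `build_submatrix_cmd` is a hypothetical helper, and the paths are the same placeholders used above.

```python
import shutil
import subprocess
from pathlib import Path


def build_submatrix_cmd(h5_path, i_start, i_end, output=None):
    """Build the argument list for `ldmat submatrix` (hypothetical helper)."""
    cmd = [
        "ldmat", "submatrix", str(h5_path),
        "--i-start", str(i_start),
        "--i-end", str(i_end),
    ]
    if output is not None:
        # Assumption: -o can be combined with the position flags.
        cmd += ["-o", str(output)]
    return cmd


if __name__ == "__main__":
    # Placeholder paths; replace them with real locations before running.
    h5_file = "YOUR_VENV_DIRECTORY/ldmat/examples/chr21_partial.h5"
    cmd = build_submatrix_cmd(h5_file, 14300001, 14400001, output="YOUR_OUTPUT.csv")
    print(" ".join(cmd))

    # Only run the command if ldmat is on PATH and the input file exists.
    if shutil.which("ldmat") is not None and Path(h5_file).is_file():
        subprocess.run(cmd, check=True)
```

Passing the arguments as a list avoids shell quoting problems if a path contains spaces.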

To see all the commands available, run ldmat.

For any specific command, you can get more information with the --help flag, like: ldmat submatrix --help.
