RFMix-reader is a Python package designed to efficiently read and process output files generated by RFMix, a popular tool for estimating local ancestry in admixed populations. The package employs a lazy loading approach, which minimizes memory consumption by reading only the loci that are accessed by the user, rather than loading the entire dataset into memory at once.

These details have not been verified by PyPI

Project links

GitHub Statistics

Project description

rfmix-reader

rfmix-reader is a Python package designed to efficiently read and process output files generated by RFMix, a popular tool for estimating local ancestry in admixed populations. The package employs a lazy loading approach, which minimizes memory consumption by reading only the loci that are accessed by the user, rather than loading the entire dataset into memory at once. Additionally, we leverage GPU acceleration to improve computational speed.

Install

rfmix-reader can be installed using pip:

pip install rfmix-reader

GPU Acceleration: rfmix-reader leverages GPU acceleration for improved performance. To use this functionality, you will need to install the following libraries for your specific CUDA version:

RAPIDS: Refer to official installation guide here
PyTorch: Installation instructions can be found here

Additoinal Notes:

We have not tested installation with Docker or Conda environemnts. Compatibility may vary.
If you do not have GPU, you can still use the basic functionality of rfmix-reader.

Key Features

Lazy Loading

Reads data on-the-fly as requested, reducing memory footprint.
Ideal for working with large RFMix output files that may not fit entirely in memory.

Efficient Data Access

Provides convenient access to specific loci or regions of interest.
Allows for selective loading of data, enabling faster processing times.

Seamless Integration

Designed to work seamlessly with existing Python data analysis workflows.
Facilitates downstream analysis and manipulation of RFMix output data.

Whether you're working with large-scale genomic datasets or have limited computational resources, RFMix-reader offers an efficient and memory-conscious solution for reading and processing RFMix output files. Its lazy loading approach ensures optimal resource utilization, making it a valuable tool for researchers and bioinformaticians working with admixed population data.

Usage

This works similarly to pandas-plink:

Two population admixture example

from rfmix_reader import read_rfmix

file_path = "examples/two_popuations/out/"
loci, rf_q, admix = read_rfmix(file_path)

Three population admixture example

Authors

Kynon JM Benjamin

Citation

Please cite: XXXX.

Project details

These details have not been verified by PyPI

Project links

GitHub Statistics

Release history Release notifications | RSS feed

0.1.15

Jul 2, 2024

0.1.14

Jun 28, 2024

0.1.13

Jun 27, 2024

This version

0.1.12

Jun 22, 2024

0.1.12a0 pre-release

Jun 20, 2024

0.1.11

Jun 20, 2024

0.1.11a0 pre-release

Jun 19, 2024

0.1.10

Jun 19, 2024

0.1.10a0 pre-release

Jun 19, 2024

0.1.9

Jun 18, 2024

0.1.9a0 pre-release

Jun 18, 2024

0.1.8

Jun 18, 2024

0.1.7

Jun 12, 2024

0.1.6

Jun 11, 2024

0.1.5

Jun 11, 2024

0.1.4

Jun 3, 2024

0.1.3

Jun 2, 2024

0.1.2

Jun 2, 2024

0.1.0

May 31, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

rfmix_reader-0.1.12.tar.gz (21.7 kB view hashes)

Uploaded Jun 22, 2024 Source

Built Distributions

rfmix_reader-0.1.12-py3-none-any.whl (22.9 kB view hashes)

Uploaded Jun 22, 2024 Python 3

rfmix_reader-0.1.12-cp39-cp39-manylinux_2_34_x86_64.whl (33.9 kB view hashes)

Uploaded Jun 22, 2024 CPython 3.9 manylinux: glibc 2.34+ x86-64

Hashes for rfmix_reader-0.1.12.tar.gz

Hashes for rfmix_reader-0.1.12.tar.gz
Algorithm	Hash digest
SHA256	`cf34543c17582919144edb87927e92656fede89b67c510c7001a5c5e6a62665b`
MD5	`56e4e4259cc331d1a3b02749def47dd5`
BLAKE2b-256	`22351d2d0d343facfd7ea383f06a53a26ef59168204e0a8f69eebb24c868d491`

Hashes for rfmix_reader-0.1.12-py3-none-any.whl

Hashes for rfmix_reader-0.1.12-py3-none-any.whl
Algorithm	Hash digest
SHA256	`30609d952e12fa8f461d9fe15229a45846435de986326e08098e850c81d5a828`
MD5	`87eb8e631997de79535d04e70bf4dede`
BLAKE2b-256	`54e5fca638a748566a0e491a5833bbdd2d7b5b9693abf1d912224fd5da3018b0`

Hashes for rfmix_reader-0.1.12-cp39-cp39-manylinux_2_34_x86_64.whl

Hashes for rfmix_reader-0.1.12-cp39-cp39-manylinux_2_34_x86_64.whl
Algorithm	Hash digest
SHA256	`898439a46699b513ba9d5c70c638a78516991f09f88d35f5a1999bee46a3bf6c`
MD5	`4d212527bbc007cf07c63e76ef9307f0`
BLAKE2b-256	`8d8e4bc433c21afd6459e74b49c2b97aa3a5c128aa13b733cf03b63cd511393d`