Convenience package for parsing RNA-seq quantification results
Project description
## readquant
This package contains helper functions for parsing both expression values
and technical features from common RNA-seq quantification tools.
The goal is to simplify going from a collection of quantification results to
expression table, as well as sample meta data.
A minimal example could look something like this:
In [1]: from readquant import read_quants, read_qcs
In [2]: tpm = read_quants('salmon/*E4_salmon_out', version='0.4.0')
In [3]: tpm.head()
Out[3]:
salmon/1771-026-195-E4_salmon_out \
Name
ERCC-00158 0.0
ERCC-00154 0.0
ERCC-00150 0.0
ERCC-00143 0.0
ERCC-00142 0.0
salmon/1771-026-190-E4_salmon_out \
Name
ERCC-00158 0.0
ERCC-00154 0.0
ERCC-00150 0.0
ERCC-00143 0.0
ERCC-00142 0.0
salmon/1771-026-196-E4_salmon_out \
Name
ERCC-00158 0.000
ERCC-00154 906.236
ERCC-00150 0.000
ERCC-00143 0.000
ERCC-00142 0.000
salmon/1771-026-198-E4_salmon_out \
Name
ERCC-00158 0.0
ERCC-00154 0.0
ERCC-00150 0.0
ERCC-00143 0.0
ERCC-00142 0.0
salmon/1771-023-118-E4_salmon_out \
Name
ERCC-00158 0.00000
ERCC-00154 9.35988
ERCC-00150 0.00000
ERCC-00143 724.44800
ERCC-00142 0.00000
salmon/1771-026-193-E4_salmon_out
Name
ERCC-00158 0.0
ERCC-00154 0.0
ERCC-00150 0.0
ERCC-00143 0.0
ERCC-00142 0.0
In [4]: sample_info = read_qcs('salmon/*_salmon_out', version='0.4.0', flen_lim=(10, 100))
In [5]: sample_info.head()
Out[5]:
percent_mapped num_processed \
salmon/1771-026-197-G4_salmon_out 71.8270 2665988.0
salmon/1771-026-198-D2_salmon_out 67.7853 3841894.0
salmon/1771-026-195-H7_salmon_out 73.4492 3875822.0
salmon/1771-026-194-E9_salmon_out 47.5425 979446.0
salmon/1771-026-195-E4_salmon_out 56.3955 5051277.0
global_fl_mode robust_fl_mode
salmon/1771-026-197-G4_salmon_out 103.0 103.0
salmon/1771-026-198-D2_salmon_out 122.0 122.0
salmon/1771-026-195-H7_salmon_out 110.0 110.0
salmon/1771-026-194-E9_salmon_out 111.0 111.0
salmon/1771-026-195-E4_salmon_out 111.0 111.0
This package contains helper functions for parsing both expression values
and technical features from common RNA-seq quantification tools.
The goal is to simplify going from a collection of quantification results to
expression table, as well as sample meta data.
A minimal example could look something like this:
In [1]: from readquant import read_quants, read_qcs
In [2]: tpm = read_quants('salmon/*E4_salmon_out', version='0.4.0')
In [3]: tpm.head()
Out[3]:
salmon/1771-026-195-E4_salmon_out \
Name
ERCC-00158 0.0
ERCC-00154 0.0
ERCC-00150 0.0
ERCC-00143 0.0
ERCC-00142 0.0
salmon/1771-026-190-E4_salmon_out \
Name
ERCC-00158 0.0
ERCC-00154 0.0
ERCC-00150 0.0
ERCC-00143 0.0
ERCC-00142 0.0
salmon/1771-026-196-E4_salmon_out \
Name
ERCC-00158 0.000
ERCC-00154 906.236
ERCC-00150 0.000
ERCC-00143 0.000
ERCC-00142 0.000
salmon/1771-026-198-E4_salmon_out \
Name
ERCC-00158 0.0
ERCC-00154 0.0
ERCC-00150 0.0
ERCC-00143 0.0
ERCC-00142 0.0
salmon/1771-023-118-E4_salmon_out \
Name
ERCC-00158 0.00000
ERCC-00154 9.35988
ERCC-00150 0.00000
ERCC-00143 724.44800
ERCC-00142 0.00000
salmon/1771-026-193-E4_salmon_out
Name
ERCC-00158 0.0
ERCC-00154 0.0
ERCC-00150 0.0
ERCC-00143 0.0
ERCC-00142 0.0
In [4]: sample_info = read_qcs('salmon/*_salmon_out', version='0.4.0', flen_lim=(10, 100))
In [5]: sample_info.head()
Out[5]:
percent_mapped num_processed \
salmon/1771-026-197-G4_salmon_out 71.8270 2665988.0
salmon/1771-026-198-D2_salmon_out 67.7853 3841894.0
salmon/1771-026-195-H7_salmon_out 73.4492 3875822.0
salmon/1771-026-194-E9_salmon_out 47.5425 979446.0
salmon/1771-026-195-E4_salmon_out 56.3955 5051277.0
global_fl_mode robust_fl_mode
salmon/1771-026-197-G4_salmon_out 103.0 103.0
salmon/1771-026-198-D2_salmon_out 122.0 122.0
salmon/1771-026-195-H7_salmon_out 110.0 110.0
salmon/1771-026-194-E9_salmon_out 111.0 111.0
salmon/1771-026-195-E4_salmon_out 111.0 111.0
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
readquant-1.0.0.tar.gz
(3.7 kB
view details)
Built Distribution
File details
Details for the file readquant-1.0.0.tar.gz
.
File metadata
- Download URL: readquant-1.0.0.tar.gz
- Upload date:
- Size: 3.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 0f2caba12b6543e98357aecb441e3cccf1ef63640b02d2177f14d09059f6aae1 |
|
MD5 | d89060957059af353a4ce0b81b9ef4bf |
|
BLAKE2b-256 | d77cf777a3f23ea7784645da5962258e072a826943c56044cdb174556d3b9f6c |
File details
Details for the file readquant-1.0.0-py2.py3-none-any.whl
.
File metadata
- Download URL: readquant-1.0.0-py2.py3-none-any.whl
- Upload date:
- Size: 5.3 kB
- Tags: Python 2, Python 3
- Uploaded using Trusted Publishing? No
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | d1e42f773363b3e35dc465623eaee5549e5f2b461f3f505ba255a33248b1de77 |
|
MD5 | 548513b0bb98952bcebb51133431c38d |
|
BLAKE2b-256 | acd11321172fb3e530777cef66f18993cc2151dbb56c4b4abb11465540c4fb42 |