HLAQuant - Get HLA allele specific expression
Project description
HLAQuant
Author: Austin Crinklaw
What is it?
HLAQuant is a pipeline that produces fast and accurate allele specific expression for HLA genes. This is done by quantifying on the peptide binding groove domain using Salmon with personalized sequences.
Requirements:
- Linux OS
- NCBI Blast+
- Salmon -- please ensure Salmon is on your PATH!
- Python 3+
- Python packages: Pandas, BioPython
How to use:
Installation:
HLAQuant can be downloaded through PyPI using the following pip command.
pip install hlaquant
Input
HLAQuant takes two input files currently.
- File one (-hla) consists of a sample_id and then a list of alleles corresponding to that sample's typing (tab separated)
- File two (-fastq) consists of a sample_id and then a list of FASTQ files corresponding to that sample (paired-end or single-end) Examples of these inputs can be found under the 'test_data/' directory
Usage
- A list of parameters and their descriptions can be found with the -h flag
python -m HLAQuant -h
Output
The output will match that of Salmons. It consists of a tab separated file containing the transcript ID (in this case, a specific HLA allele), as well as the number of reads. TPMs can be ignored as they will be inaccurate since we are only quantifying over a few sequences.
How does it work?
- We first take the list of alleles and fetch the corresponding sequences from IMGT
- Next we extract the sequences corresponding to their groove domains from these sequences
- We build an index for quantification using these G-domain sequences
- We then perform quantification using this index
The paper outlining this method in detail can be found [....somewhere when it is published]
References:
This pipeline would be unable to work without Salmon
Patro, R., Duggal, G., Love, M. I., Irizarry, R. A., & Kingsford, C. (2017). Salmon provides fast and bias-aware quantification of transcript expression. Nature Methods.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
Built Distribution
File details
Details for the file hlaquant-0.0.2-py3-none-any.whl
.
File metadata
- Download URL: hlaquant-0.0.2-py3-none-any.whl
- Upload date:
- Size: 1.2 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.32.1 CPython/3.6.8
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | ee4094cc4bda75b5248a1692b77c87903e66a859f5b83e2a17d69a0959b833e2 |
|
MD5 | d6427f4b17d532d48e3c59894842da8c |
|
BLAKE2b-256 | e3b4bc03eec2e66bcb98ec1520319eb41d654e5d2afff70f7d7b8bafa52615ab |