Skip to main content

HLAQuant - Get HLA allele specific expression

Project description

HLAQuant

Author: Austin Crinklaw

What is it?

HLAQuant is a pipeline that produces fast and accurate allele specific expression for HLA genes. This is done by quantifying on the peptide binding groove domain using Salmon with personalized sequences.

Requirements:

  • Linux OS
  • NCBI Blast+
  • Salmon -- please ensure Salmon is on your PATH!
  • Python 3+
    • Python packages: Pandas, BioPython

How to use:

Installation:

HLAQuant can be downloaded through PyPI using the following pip command.

pip install hlaquant

Input

HLAQuant takes two input files currently.

  • File one (-hla) consists of a sample_id and then a list of alleles corresponding to that sample's typing (tab separated)
  • File two (-fastq) consists of a sample_id and then a list of FASTQ files corresponding to that sample (paired-end or single-end) Examples of these inputs can be found under the 'test_data/' directory

Usage

  • A list of parameters and their descriptions can be found with the -h flag
python -m HLAQuant -h

Output

The output will match that of Salmons. It consists of a tab separated file containing the transcript ID (in this case, a specific HLA allele), as well as the number of reads. TPMs can be ignored as they will be inaccurate since we are only quantifying over a few sequences.

How does it work?

  • We first take the list of alleles and fetch the corresponding sequences from IMGT
  • Next we extract the sequences corresponding to their groove domains from these sequences
  • We build an index for quantification using these G-domain sequences
  • We then perform quantification using this index

The paper outlining this method in detail can be found [....somewhere when it is published]

References:

This pipeline would be unable to work without Salmon

Patro, R., Duggal, G., Love, M. I., Irizarry, R. A., & Kingsford, C. (2017). Salmon provides fast and bias-aware quantification of transcript expression. Nature Methods.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

hlaquant-0.0.2-py3-none-any.whl (1.2 MB view details)

Uploaded Python 3

File details

Details for the file hlaquant-0.0.2-py3-none-any.whl.

File metadata

  • Download URL: hlaquant-0.0.2-py3-none-any.whl
  • Upload date:
  • Size: 1.2 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.32.1 CPython/3.6.8

File hashes

Hashes for hlaquant-0.0.2-py3-none-any.whl
Algorithm Hash digest
SHA256 ee4094cc4bda75b5248a1692b77c87903e66a859f5b83e2a17d69a0959b833e2
MD5 d6427f4b17d532d48e3c59894842da8c
BLAKE2b-256 e3b4bc03eec2e66bcb98ec1520319eb41d654e5d2afff70f7d7b8bafa52615ab

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page