PAnno is a Pharmacogenomics Annotation tool for clinical genomic testing.
Project description
PAnno: A Pharmacogenomics Annotation Tool for Clinical Genomic Testing
PAnno reports drug responses and prescribing recommendations by parsing the germline variant call format (VCF) file from NGS and the population to which the individual belongs.
Installation
Prerequisite: To ensure smooth installation and usage, Python >= 3.7 (#1 and #3 below), or Miniconda/Anaconda (#2 below) are required.
- You can install PAnno from PyPI using pip as follows:
pip install panno==0.3.0
- Alternatively, you can create a environment using Conda.
conda create -n PAnno panno=0.3.0 -c lyaqing -c conda-forge -c bioconda
conda activate PAnno
- If you would like the development version instead, the command is:
pip install --upgrade --force-reinstall git+https://github.com/PreMedKB/PAnno.git
# Or download first and install later
git clone https://github.com/PreMedKB/PAnno.git; pip install PAnno
Usage
Once installed, you can use PAnno by navigating to your VCF file and entering the corresponding three-letter abbreviation of the population:
panno -s sample_id -i germline_vcf -p population -o outdir
- Required arguments
-s, --sample_id TEXT Sample ID that will be displayed in the PAnno report.
-i, --germline_vcf TEXT Unannotated VCF file, preferably germline variant.
-p, --population [AAC|AME|EAS|EUR|LAT|NEA|OCE|SAS|SSA]
The three-letter abbreviation for biogeographic groups:
AAC (African American/Afro-Caribbean), AME (American),
EAS (East Asian), EUR (European), LAT (Latino),
NEA (Near Eastern), OCE (Oceanian),
SAS (Central/South Asian), SSA (Sub-Saharan African).
-o, --outdir TEXT Create report in the specified output path.
Input data
1. Germline VCF file
PAnno directly uses the NGS-derived germline VCF file as input and assumes it has undergone quality control. Therefore, if the VCF file is of poor quality, inaccurate genotypes and inappropriate clinical recommendations may be reported.
PAnno requires the VCF file aligned to the GRCh38 reference genome given the increasing generality and the built-in diplotype definition dependency version.
2. Population
There are nine biogeographic groups supported by PAnno. Please use the three-letter abbreviation as input. This is to prevent errors caused by special symbols such as spaces.
AAC (African American/Afro-Caribbean), AME (American), EAS (East Asian), EUR (European), LAT (Latino), NEA (Near Eastern), OCE (Oceanian), SAS (Central/South Asian), SSA (Sub-Saharan African).
More information is available at https://www.pharmgkb.org/page/biogeographicalGroups.
Output data
The report is created in ${sample_id}.html
at the outdir
by default.
For more detailed instructions, run panno -h
.
Examples
We analyzed the germline variants of 88 samples from the GeT-RM PGx study using PAnno.
- The VCF files are available at https://github.com/PreMedKB/PAnno-analysis/tree/main/vcf.
- The PAnno reports are available at https://github.com/PreMedKB/PAnno-analysis/tree/main/report.
Here is a snapshot from the PAnno report:
Core Components
PAnno provides an end-to-end clinical pharmacogenomics decision support solution by resolving, annotating, and reporting germline variants in individuals.
PAnno ranking model for diplotype inference
Genotype resolution aims to extract the alleles of small variants (SNVs and Indels) and the diplotypes related to PGx from the user-submitted VCF file. PAnno processes the “GT” information to obtain all relevant single-locus genotypes. Afterwards, the genotypes of small variants will be passed to clinical annotation directly, while the genotypes related to diplotype definitions will be passed to the following PAnno ranking model.
PAnno annotation method for predicting drug response at individual level
PAnno annotation method translates the literal PGx knowledge about genotypes into quantitative scores. The association between multiple genotypes and a single drug is then further translated into an individual-level association with this drug. Then the individual responses to specific drugs are reported in terms of the drug response level (decreased, moderate, and increased) and the level of clinical evidence (A and B).
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.