panno

PAnno is a Pharmacogenomics Annotation tool for clinical genomic testing.

These details have not been verified by PyPI

Project links

Homepage

Development Status
- 2 - Pre-Alpha
Environment
- Console
- Web Environment
Intended Audience
- Science/Research
License
- OSI Approved :: Mozilla Public License 2.0 (MPL 2.0)
Natural Language
- English
Operating System
Programming Language

Project description

PAnno: A Pharmacogenomics Annotation Tool for Clinical Genomic Testing

PAnno reports drug responses and prescribing recommendations from two inputs: a germline variant call format (VCF) file produced by NGS and the population to which the individual belongs. It provides an end-to-end clinical pharmacogenomics decision support solution by resolving, annotating, and reporting an individual's germline variants.

PAnno introduces a ranking model for diplotype inference, built on allele definitions and population allele frequencies. Its predictive performance for diplotypes was validated against four similar tools, using the consensus diplotype data of the Genetic Testing Reference Materials Coordination Program (GeT-RM) as ground truth.

An annotation method was further proposed to summarize the drug response level (decreased, moderate, and increased) and the level of clinical evidence (A and B) for the resolved genotypes.

Status

PAnno is still under active development. In the current release, you should only use it to evaluate whether PAnno will compile and run properly on your system. All information in the PAnno report is interpreted directly from the uploaded VCF file. Users recognize that they are using PAnno at their own risk.

Installation

Prerequisite: PAnno requires Python >= 3.7 in your environment.

You can install PAnno from PyPI using pip as follows:

pip install panno

Alternatively, you can create an environment using Conda:

conda create -n PAnno panno -c lyaqing -c conda-forge -c bioconda

If you would like the development version instead, the command is:

pip install --upgrade --force-reinstall git+https://github.com/PreMedKB/PAnno.git
# Or clone the repository first and install from the local copy
git clone https://github.com/PreMedKB/PAnno.git; pip install ./PAnno

Usage

Once installed, run PAnno on your VCF file, giving the three-letter abbreviation of the individual's population:

panno -s sample_id -i germline_vcf -p population -o outdir

Required arguments

-s, --sample_id TEXT            Sample ID that will be displayed in the PAnno report.

-i, --germline_vcf TEXT         Unannotated VCF file, preferably containing germline variants.

-p, --population [AAC|AME|EAS|EUR|LAT|NEA|OCE|SAS|SSA]
                                The three-letter abbreviation for biogeographic groups:
                                AAC (African American/Afro-Caribbean), AME (American),
                                EAS (East Asian), EUR (European), LAT (Latino),
                                NEA (Near Eastern), OCE (Oceanian),
                                SAS (Central/South Asian), SSA (Sub-Saharan African).

-o, --outdir TEXT               Create report in the specified output path.
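
The four required arguments above can also be assembled from a script. The following is a minimal sketch, not part of PAnno itself: `build_panno_command` is a hypothetical helper, and the sample ID and file names are placeholders. It only uses the flags and population codes documented above.

```python
# Population codes copied from the --population choices listed above.
POPULATIONS = ("AAC", "AME", "EAS", "EUR", "LAT", "NEA", "OCE", "SAS", "SSA")


def build_panno_command(sample_id, germline_vcf, population, outdir):
    """Return the panno argument list, rejecting unknown population codes.

    Hypothetical helper for illustration; it does not run PAnno.
    """
    if population not in POPULATIONS:
        raise ValueError(
            "unknown population %r; expected one of %s"
            % (population, ", ".join(POPULATIONS))
        )
    return ["panno", "-s", sample_id, "-i", germline_vcf,
            "-p", population, "-o", outdir]


if __name__ == "__main__":
    # Placeholder sample ID, VCF path, and output directory.
    cmd = build_panno_command("sample01", "sample01.vcf", "EAS", "reports")
    print(" ".join(cmd))
    # To actually run PAnno: import subprocess; subprocess.run(cmd, check=True)
```

Checking the population code before launching the command gives an early, readable error instead of relying on the CLI's own argument validation.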

Input data

1. Germline VCF file

PAnno directly uses the NGS-derived germline VCF file as input and assumes it has undergone quality control. Therefore, if the VCF file is of poor quality, inaccurate genotypes and inappropriate clinical recommendations may be reported.

PAnno requires the VCF file to be aligned to the GRCh38 reference genome, because GRCh38 is increasingly widely used and the built-in diplotype definitions are based on it.
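
Before running PAnno, it may help to check whether a VCF header hints at the expected reference build. The sketch below is a heuristic under stated assumptions, not something PAnno does: `looks_like_grch38` is a hypothetical helper that looks for "GRCh38"/"hg38" in `##reference` or `##contig` header lines, and otherwise compares the declared chromosome 1 length with the GRCh38 value. Many VCF files carry no such hints, in which case the function returns `None`.

```python
import re

GRCH38_CHR1_LENGTH = 248956422  # length of chromosome 1 in GRCh38


def looks_like_grch38(header_lines):
    """Return True/False when the header gives a hint, None when it gives none."""
    for line in header_lines:
        if not line.startswith("##"):
            continue
        lower = line.lower()
        if lower.startswith("##reference=") or lower.startswith("##contig="):
            # An explicit build name is the strongest hint.
            if "grch38" in lower or "hg38" in lower:
                return True
            if "grch37" in lower or "hg19" in lower:
                return False
        # Otherwise fall back to the declared length of chromosome 1.
        match = re.match(r"##contig=<.*\bID=(?:chr)?1,.*\blength=(\d+)", line)
        if match:
            return int(match.group(1)) == GRCH38_CHR1_LENGTH
    return None


if __name__ == "__main__":
    header = ["##fileformat=VCFv4.2", "##contig=<ID=chr1,length=248956422>"]
    print(looks_like_grch38(header))
```

A `None` result means the build could not be determined from the header and should be confirmed from the alignment pipeline instead.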

2. Population

There are nine biogeographic groups supported by PAnno:

AAC (African American/Afro-Caribbean), AME (American), EAS (East Asian), EUR (European), LAT (Latino), NEA (Near Eastern), OCE (Oceanian), SAS (Central/South Asian), SSA (Sub-Saharan African).

More information is available at https://www.pharmgkb.org/page/biogeographicalGroups.

Please use the three-letter abbreviation as input. This is to prevent errors caused by special symbols such as spaces.

Output data

By default, the report is written to ${sample_id}.html in the output directory (outdir).

For more detailed instructions, run panno -h.
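
When PAnno is driven from a script, the default report location described above can be computed ahead of time. This is a small sketch assuming only the `${sample_id}.html` naming stated in this section; `expected_report_path` is a hypothetical helper, and the sample ID and directory are placeholders.

```python
from pathlib import Path


def expected_report_path(sample_id, outdir):
    """Return the path where the report is expected, per the default naming."""
    return Path(outdir) / (sample_id + ".html")


if __name__ == "__main__":
    report = expected_report_path("sample01", "reports")
    # exists() is only True after PAnno has been run for this sample.
    print(report, report.exists())
```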

Examples

We analyzed the germline variants of 88 samples from the GeT-RM PGx study using PAnno. The generated PGx report is available at https://github.com/PreMedKB/PAnno-analysis/report.

Here is a snapshot from the PAnno report:

Core Components

PAnno ranking model for diplotype inference

Genotype resolution aims to extract the alleles of small variants (SNVs and Indels) and the diplotypes related to PGx from the user-submitted VCF file. PAnno processes the “GT” information to obtain all relevant single-locus genotypes. Afterwards, the genotypes of small variants will be passed to clinical annotation directly, while the genotypes related to diplotype definitions will be passed to the PAnno ranking model. The output diplotypes with the highest ranking will then be annotated.
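
To illustrate what reading the "GT" information into a single-locus genotype involves, here is a minimal sketch. It is not PAnno's implementation: `genotype_from_vcf_line` is a hypothetical helper that follows the standard VCF column layout, and the record in the example is made up.

```python
def genotype_from_vcf_line(line, sample_index=0):
    """Return (chrom, pos, alleles) for one sample, or None if GT is missing.

    Assumes a tab-separated VCF data line with FORMAT and sample columns.
    """
    fields = line.rstrip("\n").split("\t")
    chrom, pos, ref, alt, fmt = fields[0], int(fields[1]), fields[3], fields[4], fields[8]
    alleles = [ref] + alt.split(",")  # index 0 = REF, 1.. = ALT alleles
    format_keys = fmt.split(":")
    if "GT" not in format_keys:
        return None
    sample = fields[9 + sample_index].split(":")
    gt = sample[format_keys.index("GT")]
    # Treat phased ("|") and unphased ("/") calls the same way here.
    calls = gt.replace("|", "/").split("/")
    if any(call == "." for call in calls):
        return None  # no-call at this locus
    return chrom, pos, tuple(alleles[int(call)] for call in calls)


if __name__ == "__main__":
    # Made-up record: two ALT alleles, phased genotype 1|2.
    record = "chr1\t1000\t.\tA\tG,T\t.\tPASS\t.\tGT:DP\t1|2:30"
    print(genotype_from_vcf_line(record))
```

This sketch discards phasing; a diplotype caller may need to keep it, so treat the simplification as an assumption of the example.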

PAnno annotation method for predicting drug response at individual level

This component aims to discover the “drug-genotype-response-evidence” relationship. The PAnno annotation method translates the literal PGx knowledge about genotypes into quantitative scores. The associations between multiple genotypes and a single drug are then combined into an individual-level association with that drug. Finally, the individual's responses to specific drugs are reported in terms of the strength of the response and the reliability of the evidence.

Release history

0.3.1

Dec 30, 2022

0.3.0

Dec 14, 2022

This version

0.2.3

Jul 30, 2022

0.2.2

Jul 30, 2022

0.2.2.dev1 pre-release

Jul 30, 2022

0.2.1

Jul 29, 2022

0.2.0

Jul 29, 2022

0.1.1

Jul 28, 2022

0.1.0

Jul 28, 2022

Download files

Source Distribution

panno-0.2.3.tar.gz (8.6 MB view hashes)

Uploaded Jul 30, 2022 Source

Hashes for panno-0.2.3.tar.gz
Algorithm	Hash digest
SHA256	`a5a23d26aade36232d950638c9c017fc145936fcb5b02e2cdafaa050c5e9aeda`
MD5	`b748e7f87c1bd7cf50b994bccd7a1204`
BLAKE2b-256	`ed12e8bb14430efbab773f6bc6061a5eb81eea57089f2de35eb268ba0f045b67`