Skip to main content

Pointed Interpretation of Clinical Variant Significance

Project description

# Picus Pointed Interpretation of Clinical Variant Significance

## Quick Install * Linux&Mac

> sudo pip3 install picus

  • Windows

> pip install picus

## Example Uses

  • Picus examples

> picus -i input.csv -o output.json

## Evidence Collection Process

### PVS1 * PVS1 null variant (nonsense, frameshift, canonical ±1 or 2 splice sites, initiation codon, single or multiexon deletion) in a gene where LOF is a known mechanism of disease.

#### Status * Implemented

#### Resources * LoF genes list from intervar. https://raw.githubusercontent.com/barslmn/InterVar/master/intervardb/PVS1.LOF.genes.hg19 * Null variants defined as HIGH IMPACT by https://www.ensembl.org/info/genome/variation/prediction/predicted_data.html

#### Conditions * “gene_symbol” is in LoF gene list. * “transcript_consequence_terms” is high impact.

#### Shortcomings * LoF gene list is only predictive and may be missing some actual LoF genes. * No checks for multiexon deletion.

### PS1 * Same amino acid change as a previously established pathogenic variant regardless of nucleotide change.

#### Status * Implemented

#### Resources * Clinvar xml (ftp://ftp.ncbi.nlm.nih.gov/pub/clinvar/)

#### Annotation Steps 1. Clinvar data is parsed using https://github.com/barslmn/clinvar. 2. Sample data and clinvar data is merged based on columns “CHR” and “POS”. 3. Clinvar feature columns “ALT”, “hgvsp”, and “clinical_significance” added to original annotation.

#### Conditions 1. “clinical_significance” is pathogenic. 2. Sample “hgvsp” and later added clinvar “hgvsp” changes are the same. 3. Sample “ALT” and clinvar “ALT” are different.

#### Shortcomings

### PS2 * De novo (both maternity and paternity confirmed) in a patient with the disease and no family history.

#### Status * Not Checked

#### Resources

#### Conditions

#### Shortcomings

### PS3 * Well-established in vitro or in vivo functional studies supportive of a damaging effect on the gene or gene product

#### Status * Not Checked

#### Resources

#### Conditions

#### Shortcomings

### PS4 * The prevalence of the variant in affected individuals is significantly increased compared with the prevalence in controls

#### Status * Implemented

#### Resources * Intervar

#### Conditions 1. “id” is in id list.

#### Shortcomings 1. No idea how the source is made.

### PM1 * Located in a mutational hot spot and/or critical and well-established functional domain (e.g., active site of an enzyme) without benign variation

#### Status * Planned.

#### Resources

#### Conditions

#### Shortcomings

### PM2 * Absent from controls (or at extremely low frequency if recessive) (Table 6) in Exome Sequencing Project, 1000 Genomes Project, or Exome Aggregation Consortium

#### Status * Implemented

#### Resources * VEP

#### Conditions * “gnomad” less than 0.001.

#### Shortcomings

### PM3 * For recessive disorders, detected in trans with a pathogenic variant

#### Status * Planned for trio

#### Resources

#### Conditions

#### Shortcomings

### PM4 * Protein length changes as a result of in-frame deletions/insertions in a nonrepeat region or stop-loss variants

#### Status * Implemented

#### Resources * VEP

#### Conditions * “transcript_consequence_terms” is “inframe_insertion”, “inframe_deletion”, or “stop_lost”.

#### Shortcomings * No checks for repeat regions.

### PM5 * Novel missense change at an amino acid residue where a different missense change determined to be pathogenic has been seen before

#### Status * Broken. (╯°□°)╯︵ ┻━┻)

#### Resources * Clinvar xml (ftp://ftp.ncbi.nlm.nih.gov/pub/clinvar/)

#### Annotation Steps 1. Clinvar data is parsed using https://github.com/barslmn/clinvar. 2. Sample data and clinvar data hgvsp columns parsed till position. 3. Synonym changes removed from clinvar data. 4. Clinvar feature columns “hgvsc”, and “clinical_significance” added to original annotation based on protein change position.

#### Conditions 1. “gnomad” less then 0.001. 2. “clinical_significance” is pathogenic. 3. “transcript_consequence_terms” is missense variant. 4. “hgvsc” of the variant and clinvar entry dont match.

#### Shortcomings

### PM6 * Assumed de novo, but without confirmation of paternity and maternity

#### Status * Planned for trio.

#### Resources

#### Conditions

#### Shortcomings

### PP1 * Cosegregation with disease in multiple affected family members in a gene definitively known to cause the disease

#### Status * Planned after Vesta.

#### Resources

#### Conditions

#### Shortcomings

### PP2 * Missense variant in a gene that has a low rate of benign missense variation and in which missense variants are a common mechanism of disease

#### Status * Implemented

#### Resources * Intervar

#### Conditions * “transcript_consequence_terms” is a missense variant. * “gene_symbol” is in PP2 gene list.

#### Shortcomings

### PP3 * Multiple lines of computational evidence support a deleterious effect on the gene or gene product (conservation, evolutionary, splicing impact, etc.)

#### Status * Implemented

#### Resources * Vep

#### Conditions * “sift_score” less than 0.05 * “polyphen_score” greater than 0.908

#### Shortcomings

### PP4 * Patient’s phenotype or family history is highly specific for a disease with a single genetic etiology

#### Status * Not Checked.

#### Resources

#### Conditions

#### Shortcomings

### PP5 * Reputable source recently reports variant as pathogenic, but the evidence is not available to the laboratory to perform an independent evaluation

#### Status * Implemented.

#### Resources * Clinvar

#### Conditions * “clinical_significance” is Pathogenic.

#### Shortcomings

### Benign

### BA1 * Allele frequency is >5% in Exome Sequencing Project, 1000 Genomes Project, or Exome Aggregation Consortium

#### Status * Implemented.

#### Resources * Vep

#### Conditions * “minor_allele_freq” is greater than 0.05

OR

  • “gnomad” is greater than 0.05.

#### Shortcomings

### BS1 * Allele frequency is greater than expected for disorder

#### Status * Planned for later.

#### Resources

#### Conditions

#### Shortcomings

### BS2 * Observed in a healthy adult individual for a recessive (homozygous), dominant (heterozygous), or X-linked (hemizygous) disorder, with full penetrance expected at an early age

#### Status * Planned

#### Resources * Intervar

#### Conditions

#### Shortcomings

### BS3 * Well-established in vitro or in vivo functional studies show no damaging effect on protein function or splicing

#### Status * Not Checked.

#### Resources

#### Conditions

#### Shortcomings

### BS4 * Lack of segregation in affected members of a family

#### Status * Not Checked.

#### Resources

#### Conditions

#### Shortcomings

### BP1 * Missense variant in a gene for which primarily truncating variants are known to cause disease

#### Status * Implemented.

#### Resources * Intervar

#### Conditions * “transcript_consequence_terms” is a missense variant. * “gene_symbol” is in BP1 gene list.

#### Shortcomings

### BP2 * Observed in trans with a pathogenic variant for a fully penetrant dominant gene/disorder or observed in cis with a pathogenic variant in any inheritance pattern

#### Status * Planned for trio.

#### Resources

#### Conditions

#### Shortcomings

### BP3 * In-frame deletions/insertions in a repetitive region without a known function

#### Status * Not Checked.

#### Resources

#### Conditions

#### Shortcomings

### BP4 * Multiple lines of computational evidence suggest no impact on gene or gene product (conservation, evolutionary, splicing impact, etc.)

#### Status * Implemented

#### Resources * VEP

#### Conditions * “sift_score” greater than or equals to 0.05 * “polyphen_score” less than or equals to 0.446

#### Shortcomings

### BP5 * Variant found in a case with an alternate molecular basis for disease

#### Status * Not Checked.

#### Resources

#### Conditions

#### Shortcomings

### BP6 * Reputable source recently reports variant as benign, but the evidence is not available to the laboratory to perform an independent evaluation

#### Status * Implemented

#### Resources * Clinvar

#### Conditions * “clinical_significance” is benign

#### Shortcomings

### BP7 * A synonymous (silent) variant for which splicing prediction algorithms predict no impact to the splice consensus sequence nor the creation of a new splice site AND the nucleotide is not highly conserved

#### Status * Planned

#### Resources

#### Conditions

#### Shortcomings

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

picus-0.0.5-py3-none-any.whl (7.3 MB view details)

Uploaded Python 3

File details

Details for the file picus-0.0.5-py3-none-any.whl.

File metadata

  • Download URL: picus-0.0.5-py3-none-any.whl
  • Upload date:
  • Size: 7.3 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/47.3.1 requests-toolbelt/0.8.0 tqdm/4.48.2 CPython/3.8.6

File hashes

Hashes for picus-0.0.5-py3-none-any.whl
Algorithm Hash digest
SHA256 94c2e8f2bb25dcb8cfa67351cad94708d425ecf8071ab3f97796b71f388fe672
MD5 c65299d2ba1e33bf69865d75591ab11c
BLAKE2b-256 13e97fe13b64c0a02b618a316e587ec9369378efc788e615f722437a4223b765

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page