
Human mitochondrial variants annotation using HmtVar.


HmtNote


Features

HmtNote is a bioinformatics tool that can be used to annotate human mitochondrial variants from a VCF, using data available on HmtVar.

Annotations are grouped into basic, cross-reference, variability and predictions, depending on the type of information they provide.

Basic

Basic information about the variant, including:

  • Locus: Locus to which the variant belongs

  • AaChange: Amino acid change caused by the variant

  • Pathogenicity: Pathogenicity predicted by HmtVar

  • DiseaseScore: Disease score calculated by HmtVar

  • HmtVar: HmtVar ID of the variant (can be used to view the related VariantCard on https://www.hmtvar.uniba.it/varCard/<HmtVarID>)
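As a small illustration of the HmtVar annotation, the VariantCard address can be built from the ID by following the URL pattern given above. This is only a sketch: the helper name variant_card_url is hypothetical and not part of HmtNote, and the ID 12345 is a made-up placeholder.

```python
VARCARD_BASE = "https://www.hmtvar.uniba.it/varCard/"


def variant_card_url(hmtvar_id):
    """Return the VariantCard URL for an HmtVar ID (hypothetical helper)."""
    # The ID is appended to the base URL as shown in the pattern above.
    return VARCARD_BASE + str(hmtvar_id)


# 12345 is a placeholder ID used only for illustration.
print(variant_card_url(12345))
```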

Cross-reference

Cross-reference information about the variant, including:

  • Clinvar: Clinvar ID of the variant

  • dbSNP: dbSNP ID of the variant

  • OMIM: OMIM ID of the variant

  • MitomapAssociatedDiseases: Diseases associated with the variant according to Mitomap

  • MitomapSomaticMutations: Diseases associated with the variant according to Mitomap Somatic Mutations

Variability

Variability and allele frequency data about the variant, including:

  • NtVarH: Nucleotide variability of the position in healthy individuals

  • NtVarP: Nucleotide variability of the position in patient individuals

  • AaVarH: Amino acid variability of the position in healthy individuals

  • AaVarP: Amino acid variability of the position in patient individuals

  • AlleleFreqH: Allele frequency of the variant in healthy individuals overall

  • AlleleFreqP: Allele frequency of the variant in patient individuals overall

  • AlleleFreqH_AF: Allele frequency of the variant in healthy individuals from Africa

  • AlleleFreqP_AF: Allele frequency of the variant in patient individuals from Africa

  • AlleleFreqH_AM: Allele frequency of the variant in healthy individuals from America

  • AlleleFreqP_AM: Allele frequency of the variant in patient individuals from America

  • AlleleFreqH_AS: Allele frequency of the variant in healthy individuals from Asia

  • AlleleFreqP_AS: Allele frequency of the variant in patient individuals from Asia

  • AlleleFreqH_EU: Allele frequency of the variant in healthy individuals from Europe

  • AlleleFreqP_EU: Allele frequency of the variant in patient individuals from Europe

  • AlleleFreqH_OC: Allele frequency of the variant in healthy individuals from Oceania

  • AlleleFreqP_OC: Allele frequency of the variant in patient individuals from Oceania

Predictions

Pathogenicity predictions for the variant from external resources, including:

  • MutPred_Prediction: Pathogenicity prediction offered by MutPred

  • MutPred_Probability: Confidence of the pathogenicity prediction offered by MutPred

  • Panther_Prediction: Pathogenicity prediction offered by Panther

  • Panther_Probability: Confidence of the pathogenicity prediction offered by Panther

  • PhDSNP_Prediction: Pathogenicity prediction offered by PhD-SNP

  • PhDSNP_Probability: Confidence of the pathogenicity prediction offered by PhD-SNP

  • SNPsGO_Prediction: Pathogenicity prediction offered by SNPs & GO

  • SNPsGO_Probability: Confidence of the pathogenicity prediction offered by SNPs & GO

  • Polyphen2HumDiv_Prediction: Pathogenicity prediction offered by Polyphen2 HumDiv

  • Polyphen2HumDiv_Probability: Confidence of the pathogenicity prediction offered by Polyphen2 HumDiv

  • Polyphen2HumVar_Prediction: Pathogenicity prediction offered by Polyphen2 HumVar

  • Polyphen2HumVar_Probability: Confidence of the pathogenicity prediction offered by Polyphen2 HumVar
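To show how the annotation names listed above might be read back from an annotated VCF, here is a minimal sketch. It assumes the annotations are stored as semicolon-separated key=value pairs in the VCF INFO column, which is not confirmed here. The helper names parse_info and predictions are hypothetical, and the sample values are invented for illustration.

```python
# Predictor prefixes taken from the Predictions list above.
PREDICTORS = [
    "MutPred", "Panther", "PhDSNP", "SNPsGO",
    "Polyphen2HumDiv", "Polyphen2HumVar",
]


def parse_info(info):
    """Split a VCF INFO string into a dict (flags without '=' map to True)."""
    fields = {}
    for entry in info.split(";"):
        if not entry:
            continue
        key, sep, value = entry.partition("=")
        fields[key] = value if sep else True
    return fields


def predictions(fields):
    """Collect (prediction, probability) pairs for each predictor present."""
    found = {}
    for name in PREDICTORS:
        prediction = fields.get(name + "_Prediction")
        if prediction is not None:
            found[name] = (prediction, fields.get(name + "_Probability"))
    return found


# Invented INFO string; real values depend on the variant and on HmtVar.
sample = "Locus=MT-ND1;MutPred_Prediction=high_pathogenicity;MutPred_Probability=0.8"
fields = parse_info(sample)
print(fields["Locus"])
print(predictions(fields))
```

Values are kept as strings; converting probabilities to floats is left to the caller, since missing values may be encoded differently.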

Usage

Command Line Interface

HmtNote can be used as a command line tool, by simply providing the original VCF and the filename where the annotated VCF will be saved:

hmtnote input_vcf.vcf annotated_vcf.vcf

By default, HmtNote annotates the VCF using all four groups of annotations (basic, cross-reference, variability and predictions). To restrict the output to a single group, use --basic, --crossref, --variab or --predict respectively (or the short forms -b, -c, -v, -p):

hmtnote input_vcf.vcf annotated_basic_vcf.vcf --basic
hmtnote input_vcf.vcf annotated_crossreferences_vcf.vcf --crossref
hmtnote input_vcf.vcf annotated_variability_vcf.vcf --variab
hmtnote input_vcf.vcf annotated_predictions_vcf.vcf --predict
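
The same commands can be driven from a Python script, for example when annotating several files in a pipeline. The following is a sketch only: build_command and run_hmtnote are hypothetical helpers, not part of HmtNote, and they assume the hmtnote executable is available on the PATH. Only one group flag is passed at a time, since combining flags is not described here.

```python
import subprocess

# Long-form flags as described in the text above.
GROUP_FLAGS = {
    "basic": "--basic",
    "crossref": "--crossref",
    "variab": "--variab",
    "predict": "--predict",
}


def build_command(input_vcf, output_vcf, group=None):
    """Build the argument list for one hmtnote call (hypothetical helper)."""
    cmd = ["hmtnote", input_vcf, output_vcf]
    if group is not None:
        # A KeyError here signals an unknown annotation group.
        cmd.append(GROUP_FLAGS[group])
    return cmd


def run_hmtnote(input_vcf, output_vcf, group=None):
    """Run hmtnote and raise CalledProcessError if it exits with an error."""
    return subprocess.run(build_command(input_vcf, output_vcf, group), check=True)


print(build_command("input_vcf.vcf", "annotated_basic_vcf.vcf", "basic"))
```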

Python Module

HmtNote can also be imported in a Python script, where its annotate_vcf() function can be used to annotate a given VCF:

from hmtnote import annotate_vcf
annotate_vcf("input_vcf.vcf", "annotated_vcf.vcf")

By default, annotate_vcf() annotates the VCF using all four groups of annotations (basic, cross-reference, variability and predictions). To restrict the output to a single group, pass the basic=True, crossref=True, variab=True or predict=True argument respectively:

annotate_vcf("input_vcf.vcf", "annotated_basic_vcf.vcf", basic=True)
annotate_vcf("input_vcf.vcf", "annotated_crossreferences_vcf.vcf", crossref=True)
annotate_vcf("input_vcf.vcf", "annotated_variability_vcf.vcf", variab=True)
annotate_vcf("input_vcf.vcf", "annotated_predictions_vcf.vcf", predict=True)
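
When the annotation group is chosen at run time, a thin wrapper around annotate_vcf() can map a group name to the matching keyword argument. This is a sketch under assumptions: group_kwargs and annotate are hypothetical helpers, and only the annotate_vcf() call and its four keyword arguments come from the text above.

```python
import os

# Keyword arguments accepted by annotate_vcf(), as listed above.
VALID_GROUPS = ("basic", "crossref", "variab", "predict")


def group_kwargs(group=None):
    """Translate a group name into keyword arguments for annotate_vcf()."""
    if group is None:
        return {}  # no argument: all four groups are annotated
    if group not in VALID_GROUPS:
        raise ValueError("unknown annotation group: {!r}".format(group))
    return {group: True}


def annotate(input_vcf, output_vcf, group=None):
    """Check that the input exists, then delegate to hmtnote.annotate_vcf()."""
    if not os.path.isfile(input_vcf):
        raise FileNotFoundError(input_vcf)
    # Imported here so the helpers above can be used without hmtnote installed.
    from hmtnote import annotate_vcf
    annotate_vcf(input_vcf, output_vcf, **group_kwargs(group))


# Example call (requires hmtnote and an existing input file):
# annotate("input_vcf.vcf", "annotated_variability_vcf.vcf", group="variab")
```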

Installation

PLEASE NOTE: HmtNote only supports Python 3!

The preferred installation method for HmtNote is using pip in a conda environment:

$ conda install requests
$ conda install -c bioconda cyvcf2
$ pip install hmtnote

If you have issues, please refer to the Installation section of the Documentation.

Credits

This package was created with Cookiecutter and the cc-pypackage project template.

History

0.1.0 (2019-03-03)

  • First release on PyPI.

0.1.1 (2019-03-04)

  • Clean installation requirements for conda;

  • Update documentation.

0.1.2 (2019-03-15)

  • Classes and methods are protected where needed;

  • Code style is clean.

0.1.3 (2019-03-17)

  • Fix issue with --predict annotation, which didn’t retrieve the correct field from HmtVar.

0.1.4 (2019-03-19)

  • Fix issue that prevented importing annotate_vcf() into Python scripts.

0.1.5 (2019-03-20)

  • Add HmtVar ID of the variant in basic and full annotation;

  • Change Disease Score annotation to DiseaseScore.

0.2.0 (WIP)

  • Add warnings to hmtnote command to be compliant with future versions;

  • Check internet connection before trying to annotate variants.

X.X.X (WIP)

  • Add options to download the required databases locally;

  • Use local databases to annotate variants (instead of calling HmtVar’s API);

  • Fall back to using local databases when web connection is not available?
