Skip to main content

Regular expression-based validation of HGVS-style variant strings for Multiplexed Assays of Variant Effect.

Project description

Build Status Coverage Status Code style: black

mavehgvs

mavehgvs is the Python reference implementation of the MAVE-HGVS variant representation standard, a strict subset of HGVS, used primarily for clinical genomics.

The MAVE-HGVS Standard

MAVE-HGVS is a strict subset of the HGVS Sequence Variant Nomenclature, version 20.05. HGVS nomenclature is comprehensive and very expressive and consequently includes a lot of syntax that is not needed to represent variants from Multiplexed Assay of Variant Effect (MAVE) data and makes the variant strings more challenging to parse.

While packages exist for parsing HGVS (most notably the biocommons hgvs package, they are intended for use in human genetics and rely on sequence databases and reference sequence (called "target sequence" for MAVE-HGVS), which are not always available for or relevant for multiplexed assays.

MAVE-HGVS is an attempt to define an easy-to-parse subset of the HGVS nomenclature that captures those variants that occur in MAVE datasets, while excluding many variant types that are unlikely to be found. Importantly, the mavehgvs implementation does not rely on external sequence databases or identifiers.

Supported Variants

MAVE-HGVS supports DNA, RNA, and protein variants. MAVE-HGVS supports a subset of HGVS variants including:

  • substitutions
  • deletions
  • duplications
  • insertions
  • frame shifts

Many HGVS variants are unsupported including:

  • inversions
  • conversions
  • extensions
  • changes in methylation state
  • RNA fusion transcripts
  • mosaicism
  • chimerism
  • variants with uncertain consequence
  • variants in trans or unknown phase
  • complex variants (e.g. translocations)

For further details, including example variants, see the specification in the package documentation.

Installation

Install mavehgvs from pip using:

pip3 install mavehgvs

To set up the package for development purposes, include the optional dependencies and install pre-commit:

pip3 install mavehgvs[dev]
pre-commit install

Feedback

To report a problem or request a new feature with either the mavehgvs package or the MAVE-HGVS standard, please use the GitHub issue tracker.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mavehgvs-0.6.1.tar.gz (25.3 kB view details)

Uploaded Source

Built Distribution

mavehgvs-0.6.1-py3-none-any.whl (19.5 kB view details)

Uploaded Python 3

File details

Details for the file mavehgvs-0.6.1.tar.gz.

File metadata

  • Download URL: mavehgvs-0.6.1.tar.gz
  • Upload date:
  • Size: 25.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.17

File hashes

Hashes for mavehgvs-0.6.1.tar.gz
Algorithm Hash digest
SHA256 b69a88fa3c4aa1fe9c1b93b7c66b51df378bb356784e88c18a74c58179c8ab12
MD5 d2a19ac14e5c754e1b042f3b5848a693
BLAKE2b-256 5c81ef990e2539d710e8f7d152e2135cdde3f9947427e55673487bf56aaf1594

See more details on using hashes here.

File details

Details for the file mavehgvs-0.6.1-py3-none-any.whl.

File metadata

  • Download URL: mavehgvs-0.6.1-py3-none-any.whl
  • Upload date:
  • Size: 19.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.17

File hashes

Hashes for mavehgvs-0.6.1-py3-none-any.whl
Algorithm Hash digest
SHA256 5bf91a8c8ddf7a1f65d6c3d7fdf33c6303392cd5e7f561ba349a19d987d2637a
MD5 0468777afe27b1503c0af947f9c24b58
BLAKE2b-256 3440c57fa6a188eccb5fddb4abf858b0bba98ac77694001675416ca07851fd05

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page