VICC normalization routine for variations

Variation Normalization

The Variation Normalizer parses and translates free-text descriptions of genomic variations into computable objects conforming to the Variation Representation Specification (VRS), enabling consistent and accurate variant harmonization across a diversity of genomic knowledge resources.

Live OpenAPI endpoint

Installation

Install from PyPI:

python3 -m pip install variation-normalizer

About

Variation Normalization works in four main steps: tokenization, classification, validation, and translation. During tokenization, the input string is split on whitespace and each piece is parsed to determine its token type. During classification, the ordered sequence of tokens is matched against the token orders that each classification allows. Validation then checks, for example, that the reference nucleotide or amino acid matches the expected value and that the position exists on the given transcript. During translation, a VRS Allele object is returned.
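
The four steps can be illustrated with a toy pipeline for a single case, a gene symbol followed by a protein substitution such as BRAF V600E. This is only a sketch of the idea: every name here (`Token`, `tokenize`, `classify`, `validate`, `translate`, `KNOWN_GENES`) is hypothetical and is not the package's API, the gene lookup is a hard-coded set, and the result is a simplified dictionary rather than a real VRS Allele.

```python
from __future__ import annotations

import re
from dataclasses import dataclass
from typing import Optional

# All names below are hypothetical and exist only for this sketch.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
PROTEIN_SUB = re.compile(rf"^(?:p\.)?([{AMINO_ACIDS}])(\d+)([{AMINO_ACIDS}])$")
KNOWN_GENES = {"BRAF", "EGFR", "KRAS"}  # stand-in for a real gene lookup


@dataclass(frozen=True)
class Token:
    text: str
    kind: str  # "gene", "protein_substitution", or "unknown"


def tokenize(query: str) -> list[Token]:
    """Split on whitespace and decide the kind of each piece."""
    tokens = []
    for part in query.split():
        if part.upper() in KNOWN_GENES:
            kind = "gene"
        elif PROTEIN_SUB.match(part):
            kind = "protein_substitution"
        else:
            kind = "unknown"
        tokens.append(Token(part, kind))
    return tokens


def classify(tokens: list[Token]) -> Optional[str]:
    """Return a classification if the token order matches an allowed pattern."""
    if [t.kind for t in tokens] == ["gene", "protein_substitution"]:
        return "protein_substitution"
    return None


def validate(tokens: list[Token], reference_sequence: str) -> bool:
    """Check that the position exists and the reference residue matches."""
    match = PROTEIN_SUB.match(tokens[1].text)
    ref, pos = match.group(1), int(match.group(2))
    return 1 <= pos <= len(reference_sequence) and reference_sequence[pos - 1] == ref


def translate(tokens: list[Token]) -> dict:
    """Build a simplified Allele-like dict (not a real VRS object)."""
    match = PROTEIN_SUB.match(tokens[1].text)
    pos, alt = int(match.group(2)), match.group(3)
    # Inter-residue coordinates: residue N spans [N - 1, N).
    return {"type": "Allele", "start": pos - 1, "end": pos, "sequence": alt}
```

Calling `tokenize("BRAF V600E")`, then `classify`, `validate` (with a protein sequence that has V at position 600), and `translate` walks one query through all four stages in order.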

Variation Normalization is limited to the following types of variants:

- HGVS expressions and text representations (e.g., BRAF V600E):
  - protein (p.): substitution, deletion, insertion, deletion-insertion
  - coding DNA (c.): substitution, deletion, insertion, deletion-insertion
  - genomic (g.): substitution, deletion, ambiguous deletion, insertion, deletion-insertion, duplication
- gnomAD-style VCF (chr-pos-ref-alt, e.g., 7-140753336-A-T):
  - genomic (g.): substitution, deletion, insertion
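
To make the gnomAD-style VCF format concrete, the sketch below splits a `chr-pos-ref-alt` string into its parts and guesses which of the three supported genomic types it describes. The function and class names are hypothetical, and the classification rule (comparing allele lengths and a shared leading base, as is conventional in VCF) is an assumption for illustration, not the package's own logic.

```python
from typing import NamedTuple

NUCLEOTIDES = set("ACGT")


class VcfVariant(NamedTuple):
    chromosome: str
    position: int
    ref: str
    alt: str


def parse_gnomad_vcf(text: str) -> VcfVariant:
    """Parse a chr-pos-ref-alt string such as 7-140753336-A-T."""
    parts = text.strip().split("-")
    if len(parts) != 4:
        raise ValueError(f"expected chr-pos-ref-alt, got {text!r}")
    chrom, pos, ref, alt = parts
    if chrom.lower().startswith("chr"):
        chrom = chrom[3:]  # accept both "chr7" and "7"
    ref, alt = ref.upper(), alt.upper()
    if not chrom or not pos.isdigit() or int(pos) < 1:
        raise ValueError(f"invalid chromosome or position in {text!r}")
    if not ref or not alt or not set(ref + alt) <= NUCLEOTIDES:
        raise ValueError(f"ref and alt must be non-empty A/C/G/T strings in {text!r}")
    return VcfVariant(chrom, int(pos), ref, alt)


def classify_vcf(variant: VcfVariant) -> str:
    """Guess the variant type from allele lengths (an assumed rule)."""
    ref, alt = variant.ref, variant.alt
    if len(ref) == 1 and len(alt) == 1 and ref != alt:
        return "substitution"
    if len(ref) > len(alt) and ref.startswith(alt):
        return "deletion"
    if len(alt) > len(ref) and alt.startswith(ref):
        return "insertion"
    return "unsupported"
```

For the example above, `parse_gnomad_vcf("7-140753336-A-T")` yields chromosome 7, position 140753336, reference A, and alternate T, which `classify_vcf` reports as a substitution.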

Variation Normalizer accepts input from GRCh37 or GRCh38 assemblies.

We are working towards adding more types of variations, coordinates, and representations.

VRS Versioning

The variation-normalization repo depends on VRS models, and therefore each variation-normalizer package on PyPI uses a particular version of VRS. The correspondences between packages may be summarized as:

variation-normalization branch	variation-normalizer version	gene-normalizer version	VRS version
main	>=0.14.Z	>=0.9.Z	2.0

Previous VRS Versioning

The correspondences between the packages that are no longer maintained may be summarized as:

variation-normalization branch	variation-normalizer version	gene-normalizer version	VRS version
vrs-1.3	0.6.Z	0.1.Z	1.3
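
The two tables can also be read as a lookup from a variation-normalizer version to the VRS version it targets. The helper below is a hypothetical sketch: it assumes that `>=0.14.Z` means any release with minor version 14 or later, and it returns `None` for versions that neither table lists.

```python
import re
from typing import Optional


def vrs_version_for(normalizer_version: str) -> Optional[str]:
    """Map a variation-normalizer version string to a VRS version."""
    match = re.match(r"^(\d+)\.(\d+)", normalizer_version)
    if match is None:
        raise ValueError(f"unrecognized version: {normalizer_version!r}")
    major_minor = (int(match.group(1)), int(match.group(2)))
    if major_minor >= (0, 14):  # main branch row
        return "2.0"
    if major_minor == (0, 6):  # vrs-1.3 branch row
        return "1.3"
    return None  # not covered by either table
```

Comparing numeric tuples rather than raw strings matters here, because a plain string comparison would sort `0.9` after `0.14`.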

Available Endpoints

`/to_vrs`

Returns a list of validated VRS Variations.

`/normalize`

Returns a VRS Variation aligned to the prioritized transcript. The Variation Normalizer relies on Common Operations On Lots-of Sequences Tool (cool-seq-tool) for retrieving the prioritized transcript data. More information on the transcript selection algorithm can be found here.

If a genomic variation query includes a gene (e.g., BRAF g.140753336A>T), the associated cDNA representation is returned, because the gene provides additional strand context. If the query does not include a gene, the GRCh38 representation is returned.
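
A request to either endpoint could be built as in the sketch below. The base URL is taken from the Docker instructions in this document (http://localhost:8001/variation/); the query parameter name `q` and the helper names are assumptions, so check the live OpenAPI documentation for the actual parameters before relying on them.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

BASE_URL = "http://localhost:8001/variation"  # from the Docker section
ENDPOINTS = {"to_vrs", "normalize"}


def build_query_url(endpoint: str, query: str, base_url: str = BASE_URL) -> str:
    """Build a GET URL; the parameter name "q" is an assumption."""
    if endpoint not in ENDPOINTS:
        raise ValueError(f"unknown endpoint: {endpoint!r}")
    return f"{base_url.rstrip('/')}/{endpoint}?{urlencode({'q': query})}"


def fetch(endpoint: str, query: str, base_url: str = BASE_URL) -> dict:
    """Send the request and decode the JSON body (needs a running service)."""
    with urlopen(build_query_url(endpoint, query, base_url), timeout=30) as response:
        return json.load(response)
```

`urlencode` takes care of escaping, so a query such as `BRAF g.140753336A>T` is sent with the space and the `>` character encoded.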

Development

Clone the repo:

git clone https://github.com/cancervariants/variation-normalization.git
cd variation-normalization

For a development install, we recommend using Pipenv. See the pipenv docs for directions on installing pipenv in your compute environment.

Once installed, from the project root dir, just run:

pipenv shell
pipenv update && pipenv install --dev

Required resources

Variation Normalization relies on some local data caches which you will need to set up. We provide instructions on how to set up your development environment using Docker.

SeqRepo: You must set up SeqRepo locally following these steps.
Gene Normalizer: The Variation Normalizer uses Gene Normalizer to get normalized gene concept information.
Universal Transcript Archive (UTA): The Variation Normalizer uses Common Operations On Lots-of Sequences Tool (cool-seq-tool) which uses UTA as the underlying PostgreSQL database.

SeqRepo

Variation Normalization relies on seqrepo, which you must download yourself.

Variation Normalizer uses seqrepo to retrieve sequences at given positions on a transcript.

From the root directory:

pip install seqrepo
sudo mkdir /usr/local/share/seqrepo
sudo chown $USER /usr/local/share/seqrepo
seqrepo pull -i 2024-12-20/  # Replace with latest version using `seqrepo list-remote-instances` if outdated

If you get an error similar to the one below:

PermissionError: [Errno 13] Permission denied: '/usr/local/share/seqrepo/2024-12-20/._fkuefgd' -> '/usr/local/share/seqrepo/2024-12-20/'

Move the temporary directory into place yourself. The suffix may differ from ._fkuefgd, so use the path from your own error message:

sudo mv /usr/local/share/seqrepo/2024-12-20._fkuefgd /usr/local/share/seqrepo/2024-12-20
exit

Use the SEQREPO_ROOT_DIR environment variable to set the path of an already existing SeqRepo directory. The default is /usr/local/share/seqrepo/latest.

Docker Installation (Preferred)

We recommend installing the Variation Normalizer using Docker.

Requirements

Docker

Build, (re)create, and start containers

docker volume create --name=uta_vol
docker compose up

[!IMPORTANT] This assumes you have a local SeqRepo installed at /usr/local/share/seqrepo/2024-12-20. If you have it installed elsewhere, please update the SEQREPO_ROOT_DIR environment variable in compose.yaml.
If you're using Docker Desktop, you'll want to go to Settings -> Resources -> File sharing and add /usr/local/share/seqrepo under the Virtual file shares section. Otherwise, you will get the following error: OSError: Unable to open SeqRepo directory /usr/local/share/seqrepo/2024-12-20.

[!TIP] If you want a clean slate, run docker compose down -v to remove containers and volumes, then docker compose up --build to rebuild and start fresh containers.

Point your browser to http://localhost:8001/variation/.

Code QC

Code style is managed by Ruff and checked prior to commit.

To perform formatting and check style:

python3 -m ruff format . && python3 -m ruff check --fix .

We use pre-commit to run conformance tests.

This ensures:

Style correctness
No large files
No AWS credentials are present
No private key is present

Pre-commit must be installed before your first commit. Use the following command:

pre-commit install

Testing

From the root directory of the repository:

pytest tests/

Release history

Version	Release date	Notes
0.15.2	Jan 2, 2026	This version
0.15.1	Jul 31, 2025
0.15.0	Jul 18, 2025
0.14.2	Jul 17, 2025
0.14.1	Apr 22, 2025
0.14.0	Apr 3, 2025
0.13.0	Feb 14, 2025
0.12.2	Feb 13, 2025
0.12.1	Jan 30, 2025
0.12.0	Jan 7, 2025
0.11.0	Jan 3, 2025
0.10.0	Jul 22, 2024
0.9.1	Jul 16, 2024
0.9.0	Jul 15, 2024	Yanked: Does not work
0.8.2	Mar 21, 2024
0.8.1	Feb 23, 2024
0.8.1.dev0	Feb 15, 2024	Pre-release
0.8.0.dev1	Feb 5, 2024	Pre-release
0.8.0.dev0	Nov 10, 2023	Pre-release
0.7.0.dev7	May 2, 2023	Pre-release
0.7.0.dev6	Apr 19, 2023	Pre-release
0.7.0.dev5	Apr 11, 2023	Pre-release
0.7.0.dev4	Apr 6, 2023	Pre-release
0.7.dev0	Oct 3, 2022	Pre-release
0.6.3	Sep 23, 2022	Yanked: This is now on the 0.7.x release
0.6.0	Aug 25, 2022	Yanked: This is now on the 0.7.x release
0.6.0.dev1	Nov 15, 2023	Pre-release
0.6.0.dev0	Sep 22, 2023	Pre-release
0.5.5	May 9, 2023
0.5.4	May 7, 2023
0.5.3	Apr 6, 2023
0.5.2	Jan 10, 2023
0.5.1	Nov 8, 2022
0.4.0a7	Jun 13, 2022	Pre-release
0.4.0a6	Jun 3, 2022	Pre-release
0.4.0a5	May 24, 2022	Pre-release
0.4.0a4	May 23, 2022	Pre-release
0.4.0a3	May 3, 2022	Pre-release
0.4.0a2	Apr 20, 2022	Pre-release
0.4.0a1	Apr 12, 2022	Pre-release
0.3.0	Apr 4, 2022
0.2.22	Mar 30, 2022
0.2.21	Mar 7, 2022
0.2.20	Feb 21, 2022
0.2.19	Feb 3, 2022
0.2.18	Feb 2, 2022
0.2.17	Jan 27, 2022
0.2.15	Dec 24, 2021
0.2.14	Dec 7, 2021
0.2.13	Nov 23, 2021
0.2.12	Nov 7, 2021
0.2.11	Sep 1, 2021
0.2.10	Aug 27, 2021
0.2.9	Aug 24, 2021
0.2.8	Aug 13, 2021
0.2.7	Aug 12, 2021
0.2.5	Aug 4, 2021

Download files

Source Distribution

variation_normalizer-0.15.2.tar.gz (97.4 kB)

Uploaded Jan 2, 2026 Source

Built Distribution

variation_normalizer-0.15.2-py3-none-any.whl (153.8 kB)

Uploaded Jan 2, 2026 Python 3

File details

Details for the file variation_normalizer-0.15.2.tar.gz.

File metadata

Download URL: variation_normalizer-0.15.2.tar.gz
Upload date: Jan 2, 2026
Size: 97.4 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for variation_normalizer-0.15.2.tar.gz
Algorithm	Hash digest
SHA256	`4d8731e4ac7d9134db50525abb328c4b012505111fb76649e2f97ca4beb9a197`
MD5	`66d92c15bc701f28e3c266a552bc05f0`
BLAKE2b-256	`bb0d74b50eb0406652c3d8bd1a797e2da7fd4b99a7f7631189fbb1c2f874a4ac`

Provenance

The following attestation bundles were made for variation_normalizer-0.15.2.tar.gz:

Publisher: release.yml on cancervariants/variation-normalization

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: variation_normalizer-0.15.2.tar.gz
- Subject digest: 4d8731e4ac7d9134db50525abb328c4b012505111fb76649e2f97ca4beb9a197
- Sigstore transparency entry: 788332483
- Sigstore integration time: Jan 2, 2026
Source repository:
- Permalink: cancervariants/variation-normalization@902b988e43875dddec066a713502c639b3c27480
- Branch / Tag: refs/tags/0.15.2
- Owner: https://github.com/cancervariants
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@902b988e43875dddec066a713502c639b3c27480
- Trigger Event: release

File details

Details for the file variation_normalizer-0.15.2-py3-none-any.whl.

File metadata

Download URL: variation_normalizer-0.15.2-py3-none-any.whl
Upload date: Jan 2, 2026
Size: 153.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for variation_normalizer-0.15.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`acf58c1772e723590ea7884b0ae43c3501d19e0c289aced8705e9fa77d47fb21`
MD5	`6aa13283efa8a98952c98c62ed7dac52`
BLAKE2b-256	`d5f6e0b51e648ecb63965c82284fd58e2d3c44d9748587e7555d5f3ccbad49a9`

Provenance

The following attestation bundles were made for variation_normalizer-0.15.2-py3-none-any.whl:

Publisher: release.yml on cancervariants/variation-normalization

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: variation_normalizer-0.15.2-py3-none-any.whl
- Subject digest: acf58c1772e723590ea7884b0ae43c3501d19e0c289aced8705e9fa77d47fb21
- Sigstore transparency entry: 788332484
- Sigstore integration time: Jan 2, 2026
Source repository:
- Permalink: cancervariants/variation-normalization@902b988e43875dddec066a713502c639b3c27480
- Branch / Tag: refs/tags/0.15.2
- Owner: https://github.com/cancervariants
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@902b988e43875dddec066a713502c639b3c27480
- Trigger Event: release
