Map genetic variants to rsIDs

Project description

variant-mapper

version: 0.1.1b0

The variant-mapper is a package to map genetic variants to the genome, in order to validate them and assign an rs ID.

This will:

  1. Localise the genetic variant based on chromosome position, using either a file join approach or tabix.
  2. Determine if the ref/alt alleles match a known variant site; it assumes that ref/alt can be flipped.
  3. If a site can be identified, then it will annotate the variant with functional information.
  4. If no site can be identified, fall back to the following steps.
  5. If the variant is an INDEL, normalise the alleles and attempt mapping again.
  6. Finally, if it still can't be mapped, validate one of the alleles against the reference genome assembly.
  7. This can also handle cases where only a single allele is known, assuming the site is bi-allelic and the ref allele can be localised.
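
The allele matching and INDEL handling in the steps above can be illustrated with a minimal sketch. This is not the package's own code or API: the function names `match_alleles` and `trim_alleles` are hypothetical, and the trimming shown is only the shared-base removal part of normalisation (full left-alignment needs the reference sequence and is not attempted here).

```python
from typing import Optional, Tuple


def match_alleles(query: Tuple[str, str], known: Tuple[str, str]) -> Optional[str]:
    """Compare the (ref, alt) of a query variant against a known site.

    Hypothetical helper: returns "exact" if the alleles match as given,
    "flipped" if they match with ref and alt swapped, otherwise None.
    """
    q_ref, q_alt = (allele.upper() for allele in query)
    k_ref, k_alt = (allele.upper() for allele in known)
    if (q_ref, q_alt) == (k_ref, k_alt):
        return "exact"
    if (q_ref, q_alt) == (k_alt, k_ref):
        return "flipped"
    return None


def trim_alleles(pos: int, ref: str, alt: str) -> Tuple[int, str, str]:
    """Trim bases shared by ref and alt, keeping at least one base in each.

    Hypothetical helper: the shared suffix is removed first, then the
    shared prefix, which shifts the position to the right.
    """
    # Remove matching trailing bases.
    while len(ref) > 1 and len(alt) > 1 and ref[-1] == alt[-1]:
        ref, alt = ref[:-1], alt[:-1]
    # Remove matching leading bases and move the position along.
    while len(ref) > 1 and len(alt) > 1 and ref[0] == alt[0]:
        ref, alt = ref[1:], alt[1:]
        pos += 1
    return pos, ref, alt
```

After trimming, an INDEL written with redundant flanking bases (for example `ATCG`/`ACG`) reduces to a shorter representation that can be compared against the known site again with `match_alleles`.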

The mapper works by having a common mapper file and a full mapper file. The common mapper file contains common variants usually used in GWAS, and the full mapper file has all known variants from dbSNP and from other projects as well.

You can map by localising the genetic variants using either tabix or a table scan (file join) approach. The file join is most efficient if you have millions of variants, or rather if your input file has ~10-20M variants. In this case the common file is used for the join, and where something can't be mapped a tabix query is tried against the full file. In many cases the common file is good enough, but it might miss some variants. In any case, please contact me for a download link. There is nothing super secret about the mapping file; UCL does not offer any file distribution and I have no other official way of distributing it, so it is on my personal pCloud at the moment.
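
The file join with a fallback to the full file can be pictured as a merge join over two position-sorted streams. The sketch below is an illustration under stated assumptions, not the package's implementation: `join_with_fallback` is a hypothetical name, each position is assumed to carry a single record, both inputs are assumed to be sorted by the same (chromosome, position) ordering, and `full_lookup` is a stand-in for a tabix query against the full mapper file.

```python
from typing import Callable, Iterable, Iterator, Optional, Tuple

Key = Tuple[str, int]  # (chromosome, position)


def join_with_fallback(
    queries: Iterable[Key],
    common: Iterable[Tuple[Key, str]],
    full_lookup: Callable[[Key], Optional[str]],
) -> Iterator[Tuple[Key, Optional[str], str]]:
    """Merge-join sorted queries against sorted common records.

    Yields (key, identifier, source) where source is "common", "full",
    or "unmapped". Misses in the common stream go to full_lookup.
    """
    common_iter = iter(common)
    current = next(common_iter, None)
    for key in queries:
        # Advance the common stream until it reaches or passes the query.
        while current is not None and current[0] < key:
            current = next(common_iter, None)
        if current is not None and current[0] == key:
            yield key, current[1], "common"
        else:
            # Stand-in for the random-access (tabix-style) query.
            identifier = full_lookup(key)
            source = "full" if identifier is not None else "unmapped"
            yield key, identifier, source


# Made-up records purely for illustration.
common_records = [(("1", 100), "rs1"), (("1", 200), "rs2"), (("2", 50), "rs3")]
full_records = {("1", 150): "rs9"}
query_keys = [("1", 100), ("1", 150), ("2", 50), ("2", 60)]
results = list(join_with_fallback(query_keys, common_records, full_records.get))
```

The join reads each stream once, which is why a scan of this kind tends to suit very large inputs, while the per-variant lookup is only paid for the variants the common file misses.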

Installation

This can be installed from PyPI or conda.

To install from PyPI:

pip install variant-mapper

To install using conda:

conda install -c cfin -c conda-forge variant-mapper

Documentation

There is online documentation for variant mapper.
