Project VCF-level coordinates from chromosomes to transcripts

Project description

VCF projection from chromosome to transcriptome

The program uses ncls, a fast interval-overlap search library (a nested containment list, used in place of an interval tree), which considerably speeds up the "stabbing" problem of finding the variants that overlap each transcript interval.
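To make the "stabbing" query concrete, here is a minimal sketch in plain Python. It does not use ncls or reproduce how vcfproj calls it; it is a simple sorted-list stand-in built on bisect, and the transcript names and coordinates are made up for illustration. Coordinates are assumed to be 1-based and inclusive, as in GTF and VCF.

```python
from bisect import bisect_right

# Hypothetical transcript intervals: (start, end, transcript_id),
# 1-based inclusive, sorted by start position.
transcripts = sorted([
    (100, 500, "tx_A"),
    (300, 900, "tx_B"),
    (1000, 1200, "tx_C"),
])
starts = [t[0] for t in transcripts]


def overlapping_transcripts(pos):
    """Return the ids of all transcripts whose interval contains pos."""
    # Only intervals that start at or before pos can contain it.
    hi = bisect_right(starts, pos)
    # Among those, keep the ones that have not ended before pos.
    return [tx for (_start, end, tx) in transcripts[:hi] if pos <= end]


# Stab the interval set with a few hypothetical variant positions.
hits = {pos: overlapping_transcripts(pos) for pos in (50, 350, 950, 1000)}
print(hits)
```

This stand-in still scans every interval that starts before the query position, so it degrades on large annotations; a dedicated structure such as the one ncls provides avoids that scan.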


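from vcfproj import vcfproj

# Input files: a GTF annotation and the VCF whose variants are to be projected
GTF_FILE = 'Mus_musculus.GRCm38.91.gtf'
VCF_FILE = 'mgp_v4_indels_NOD_PWK.vcf'

# Project the VCF records onto the transcripts defined in the GTF
vcf_gtf = vcfproj.projection(GTF_FILE, VCF_FILE)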

vcf_gtf is a dataframe that contains the following columns:

[
'chrom_x', - chromosome of the variant
'start', - start of the transcript
'end', - end of the transcript
'gene', - gene name
'txome', - transcript name
'vcf_index', - ignore
'relative_pos', - relative position of the variant on the transcript
'transcript_length', - length of the transcript
'chrom_y', - same as chrom_x
'pos', - position of the variant
'id', - ID
'ref', - REF
'alt', - ALT
'qual', - QUAL
'filter', - FILTER
'info', - INFO
'format', - FORMAT
'samples' - SAMPLES
]
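As a rough illustration of how 'relative_pos' and 'transcript_length' relate to 'pos', 'start' and 'end', the sketch below computes an offset within a transcript interval. This is an assumption, not the package's confirmed method: it treats the transcript as one contiguous 1-based inclusive interval and returns a 0-based offset, while vcfproj itself may handle strand, exon structure, or indexing differently. The function name relative_position is hypothetical.

```python
def relative_position(pos, tx_start, tx_end):
    """Return (offset, length) for a variant inside a transcript interval.

    Assumes 1-based inclusive coordinates and a contiguous interval;
    the returned offset is 0-based (an illustrative choice).
    """
    if not tx_start <= pos <= tx_end:
        raise ValueError("variant does not overlap the transcript interval")
    offset = pos - tx_start          # distance from the transcript start
    length = tx_end - tx_start + 1   # inclusive interval length
    return offset, length


# Hypothetical variant at position 350 on a transcript spanning 300..900.
rel, length = relative_position(350, 300, 900)
print(rel, length)
```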

Download files

Source Distribution

vcfproj-0.0.3.tar.gz (4.3 kB view details)

Uploaded Source

Built Distribution

vcfproj-0.0.3-py3-none-any.whl (16.9 kB view details)

Uploaded Python 3

File details

Details for the file vcfproj-0.0.3.tar.gz.

File metadata

  • Download URL: vcfproj-0.0.3.tar.gz
  • Upload date:
  • Size: 4.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.14.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.2.0 requests-toolbelt/0.9.1 tqdm/4.35.0 CPython/3.6.8

File hashes

Hashes for vcfproj-0.0.3.tar.gz
Algorithm Hash digest
SHA256 93c48afc10a18fe2ab68a9c19df735c651550ae132ab7f62de64b5998a49abe5
MD5 411717aaa7846ae685ec838b99ec14a6
BLAKE2b-256 95a3343438659440424937bb0058b18daa32f48e5aa579d14c0cc2674e658b99

File details

Details for the file vcfproj-0.0.3-py3-none-any.whl.

File metadata

  • Download URL: vcfproj-0.0.3-py3-none-any.whl
  • Upload date:
  • Size: 16.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.14.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.2.0 requests-toolbelt/0.9.1 tqdm/4.35.0 CPython/3.6.8

File hashes

Hashes for vcfproj-0.0.3-py3-none-any.whl
Algorithm Hash digest
SHA256 96f0695f43eae7ff640da80c1a3a3f8639ebf8a26390b242a65424465a96bc48
MD5 195b20492e795ddb88809ef1bbeeaeaf
BLAKE2b-256 c29ae620bbf4232237adbdfdcd4ea02454f806fc69d01d480f8abbe939dd6f0e
