Project vcf level coordinates from chromosomes to transcript
Project description
VCF projection from chromosome to transcriptome
The program uses ncls, a fast interval tree search program that considerably speeds up the ''stabbing'' problem of finding out the variant that overlaps the transcriptome interval.
GTF_FILE = 'Mus_musculus.GRCm38.91.gtf'
VCF_FILE = 'mgp_v4_indels_NOD_PWK.vcf'
from vcfproj import vcfproj
vcf_gtf = vcfproj.projection(GTF_FILE, VCF_FILE)
vcf_gtf is a dataframe, that would contain the following columns
[
'chrom_x', - chromosome of variation
'start', - start of transcript
'end', - end of transcript
'gene', - gene name
'txome', - transcript name
'vcf_index', - ignore
'relative_pos', - relative position of variation on transcript
'transcript_length', - length of transcript
'chrom_y', - same as chrom_x
'pos', - position of variation
'id', - ID
'ref', - REF
'alt', - ALT
'qual', - QUAL
'filter', - FILTER
'info', - INFO
'format', - FORMAT
'samples' - SAMPLES
]
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file vcfproj-0.0.3.tar.gz.
File metadata
- Download URL: vcfproj-0.0.3.tar.gz
- Upload date:
- Size: 4.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.14.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.2.0 requests-toolbelt/0.9.1 tqdm/4.35.0 CPython/3.6.8
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
93c48afc10a18fe2ab68a9c19df735c651550ae132ab7f62de64b5998a49abe5
|
|
| MD5 |
411717aaa7846ae685ec838b99ec14a6
|
|
| BLAKE2b-256 |
95a3343438659440424937bb0058b18daa32f48e5aa579d14c0cc2674e658b99
|
File details
Details for the file vcfproj-0.0.3-py3-none-any.whl.
File metadata
- Download URL: vcfproj-0.0.3-py3-none-any.whl
- Upload date:
- Size: 16.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.14.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.2.0 requests-toolbelt/0.9.1 tqdm/4.35.0 CPython/3.6.8
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
96f0695f43eae7ff640da80c1a3a3f8639ebf8a26390b242a65424465a96bc48
|
|
| MD5 |
195b20492e795ddb88809ef1bbeeaeaf
|
|
| BLAKE2b-256 |
c29ae620bbf4232237adbdfdcd4ea02454f806fc69d01d480f8abbe939dd6f0e
|