No project description provided
Project description
Python pakcage for genomic variant analysis
How to use?
pip install variant
- run
variant-effect
in the command line - more functions will be supported in the future
variant-effect
command can infer the effect of a mutation
Usage: variant-effect [OPTIONS]
Variant (genomic variant analysis in python)
╭─ Options ────────────────────────────────────────────────────────────────────────────────────────────────────────────╮
│ --input -i TEXT Input position file. │
│ --output -o TEXT Output annotation file │
│ --reference -r TEXT reference species │
│ --reference-gtf TEXT Customized reference gtf file. │
│ --reference-transcript TEXT Customized reference transcript fasta file. │
│ --reference-protein TEXT Customized reference protein fasta file. │
│ --release -e INTEGER ensembl release │
│ --type -t [DNA|RNA] (deprecated) │
│ --strandness -s Use strand infomation or not? │
│ --pU-mode -u Make rRNA, tRNA, snoRNA into top priority. │
│ --npad -n INTEGER Number of padding base to call motif. │
│ --all-effects -a Output all effects. │
│ --with-header -H With header line in input file. │
│ --columns -c TEXT Sets columns for site info. (Chrom,Pos,Strand,Ref,Alt) [default: 1,2,3,4,5] │
│ --help -h Show this message and exit. │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
demo:
Store the following table in file (sites.tsv
).
Chrom | Position | Strand | Ref | Alt |
---|---|---|---|---|
chr1 | 230703034 | - | C | T |
chr12 | 69353439 | + | A | T |
chr14 | 23645352 | + | G | T |
chr2 | 215361150 | - | A | T |
chr2 | 84906537 | + | C | T |
chr22 | 39319077 | - | T | A |
chr22 | 39319095 | - | T | A |
chr22 | 39319098 | - | T | A |
chr3 | 10301112 | - | G | T |
chr3 | 20301112 | - | G | T |
chr5 | 181241858 | - | T | A |
chr6 | 85677602 | - | T | A |
chr7 | 45893389 | + | G | T |
chr7 | 94669540 | + | G | N |
chr8 | 66922519 | - | T | A |
chr8 | 66922525 | - | T | A |
chrX | 136879224 | - | T | A |
chrX | 136879226 | - | T | A |
Run command:
variant-effect -i sites.tsv -H -r human -e 108 -t RNA -H -c 1,2,3
-i
specify the input file-H
means the file is with header line, and the first row will be skipped;-r
use the specific genome, default is human-e
specify the Ensembl release version-c
means only use some of the columns in the input file. default will use the first 5 columns.
You will have this output
#chrom | pos | strand | ref | alt | mut_type | gene_name | gene_pos | transcript_name | transcript_pos | transcript_motif | coding_pos | codon_ref | aa_pos | aa_ref | distance2splice |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
chr1 | 230703034 | - | G | A | ThreePrimeUTR | ENSG00000135744(AGT) | 42543 | ENST00000680041(AGT-208) | 1753 | TGTGTCACCCCCAGTCTCCCA | None | None | None | None | 295 |
chr2 | 215361150 | - | T | A | ThreePrimeUTR | ENSG00000115414(FN1) | 74924 | ENST00000323926(FN1-201) | 8012 | GGCCCGCAATACTGTAGGAAC | None | None | None | None | 476 |
chr2 | 84906537 | + | C | T | ThreePrimeUTR | ENSG00000034510(TMSB10) | 882 | ENST00000233143(TMSB10-201) | 327 | CCTGGGCACTCCGCGCCGATG | None | None | None | None | 148 |
chr3 | 10301112 | - | C | A | Silent | ENSG00000157020(SEC13) | 20001 | ENST00000397117(SEC13-209) | 1441 | TTGATCATCTGCCTTAACGTG | 849 | CTG | 283 | L | 35 |
chr3 | 20301112 | - | C | A | Intergenic | None | None | None | None | None | None | None | None | None | None |
chr7 | 45893389 | + | G | T | ThreePrimeUTR | ENSG00000146678(IGFBP1) | 5030 | ENST00000275525(IGFBP1-201) | 1243 | CAAAGCTCCTGCGTCTGTTTT | None | None | None | None | 429 |
chr7 | 94669540 | + | G | N | ThreePrimeUTR | ENSG00000242265(PEG10) | 13216 | ENST00000612941(PEG10-206) | 6240 | TTTTACCCCTGTCAGTAGCCC | None | None | None | None | 5030 |
chr12 | 69353439 | + | A | T | ThreePrimeUTR | ENSG00000090382(LYZ) | 5059 | ENST00000261267(LYZ-201) | 695 | TAGAACTAATACTGGTGAAAA | None | None | None | None | 286 |
chr14 | 23645352 | + | G | T | ThreePrimeUTR | ENSG00000100867(DHRS2) | 15238 | ENST00000344777(DHRS2-202) | 1391 | CTGCCATTCTGCCAGACTAGC | None | None | None | None | 210 |
TODO:
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
variant-0.0.47.tar.gz
(10.8 kB
view details)
Built Distribution
File details
Details for the file variant-0.0.47.tar.gz
.
File metadata
- Download URL: variant-0.0.47.tar.gz
- Upload date:
- Size: 10.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.3.2 CPython/3.10.9 Darwin/22.1.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | e57c96d2fbe320dc27694478f1935c6cc9b0f69c7441739fb6c88c2ab3db1702 |
|
MD5 | 0b332f8033f74696de732559d3158cb9 |
|
BLAKE2b-256 | b6cc5712f611e99522db5bd229e8ab4e51d171d125929d33c7282c2d955d83d4 |
File details
Details for the file variant-0.0.47-py3-none-any.whl
.
File metadata
- Download URL: variant-0.0.47-py3-none-any.whl
- Upload date:
- Size: 9.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.3.2 CPython/3.10.9 Darwin/22.1.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 19d1d8306f740706c29accb3d0863ee20cf2eaa2a6656cefa613d09b6ba9f399 |
|
MD5 | 327b671da619495cefbe0f37c733333d |
|
BLAKE2b-256 | 74d794198ba13dd700de613e8027daefd512983bb2b4854012b6ee90038ce34b |