Skip to main content

No project description provided

Project description

Python pakcage for genomic variant analysis

Pypi Releases Downloads

How to use?

pip install variant
  • run variant-effect in the command line
  • more functions will be supported in the future

variant-effect command can infer the effect of a mutation

 Usage: variant-effect [OPTIONS]

 Variant (genomic variant analysis in python)

╭─ Options ────────────────────────────────────────────────────────────────────────────────────────────────────────────╮
│ --input                 -i  TEXT       Input position file.                                                          │
│ --output                -o  TEXT       Output annotation file                                                        │
│ --reference             -r  TEXT       reference species                                                             │
│ --reference-gtf             TEXT       Customized reference gtf file.                                                │
│ --reference-transcript      TEXT       Customized reference transcript fasta file.                                   │
│ --reference-protein         TEXT       Customized reference protein fasta file.                                      │
│ --release               -e  INTEGER    ensembl release                                                               │
│ --type                  -t  [DNA|RNA]  (deprecated)                                                                  │
│ --strandness            -s             Use strand infomation or not?                                                 │
│ --pU-mode               -u             Make rRNA, tRNA, snoRNA into top priority.                                    │
│ --npad                  -n  INTEGER    Number of padding base to call motif.                                         │
│ --all-effects           -a             Output all effects.                                                           │
│ --with-header           -H             With header line in input file.                                               │
│ --columns               -c  TEXT       Sets columns for site info. (Chrom,Pos,Strand,Ref,Alt) [default: 1,2,3,4,5]   │
│ --help                  -h             Show this message and exit.                                                   │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯

demo:

Store the following table in file (sites.tsv).

Chrom Position Strand Ref Alt
chr1 230703034 - C T
chr12 69353439 + A T
chr14 23645352 + G T
chr2 215361150 - A T
chr2 84906537 + C T
chr22 39319077 - T A
chr22 39319095 - T A
chr22 39319098 - T A

Run command:

variant-effect -i sites.tsv -H -r human -e 108 -t RNA -H -c 1,2,3
  • -i specify the input file
  • -H means the file is with header line, and the first row will be skipped;
  • -r use the specific genome, default is human
  • -e specify the Ensembl release version
  • -c means only use some of the columns in the input file. default will use the first 5 columns.

You will have this output

Chrom Position Strand Ref Alt mut_type gene_type gene_name gene_pos transcript_name transcript_pos transcript_motif coding_pos codon_ref aa_pos aa_ref distance2splice
chr1 230703034 - C T ThreePrimeUTR protein_coding ENSG00000135744(AGT) 42543 ENST00000680041(AGT-208) 1753 TGTGTCACCCCCAGTCTCCCA None None None None 295
chr12 69353439 + A T ThreePrimeUTR protein_coding ENSG00000090382(LYZ) 5059 ENST00000261267(LYZ-201) 695 TAGAACTAATACTGGTGAAAA None None None None 286
chr14 23645352 + G T ThreePrimeUTR protein_coding ENSG00000100867(DHRS2) 15238 ENST00000344777(DHRS2-202) 1391 CTGCCATTCTGCCAGACTAGC None None None None 210
chr2 215361150 - A T ThreePrimeUTR protein_coding ENSG00000115414(FN1) 74924 ENST00000323926(FN1-201) 8012 GGCCCGCAATACTGTAGGAAC None None None None 476
chr2 84906537 + C T ThreePrimeUTR protein_coding ENSG00000034510(TMSB10) 882 ENST00000233143(TMSB10-201) 327 CCTGGGCACTCCGCGCCGATG None None None None 148
chr22 39319077 - T A Intronic protein_coding ENSG00000100316(RPL3) 1313 ENST00000216146(RPL3-201) None None None None None None None
chr22 39319095 - T A Intronic protein_coding ENSG00000100316(RPL3) 1295 ENST00000216146(RPL3-201) None None None None None None None
chr22 39319098 - T A Intronic protein_coding ENSG00000100316(RPL3) 1292 ENST00000216146(RPL3-201) None None None None None None None

TODO:

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

variant-0.0.71.tar.gz (11.1 kB view details)

Uploaded Source

Built Distribution

variant-0.0.71-py3-none-any.whl (11.6 kB view details)

Uploaded Python 3

File details

Details for the file variant-0.0.71.tar.gz.

File metadata

  • Download URL: variant-0.0.71.tar.gz
  • Upload date:
  • Size: 11.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.7.0 CPython/3.11.5 Linux/5.15.133.1-microsoft-standard-WSL2

File hashes

Hashes for variant-0.0.71.tar.gz
Algorithm Hash digest
SHA256 100d43f7093b82bddbd2d15919efbddd333c2c3c14a38204ba293503e92ef490
MD5 93244a25346f5e0a32bf6ceaec378206
BLAKE2b-256 1c6b4596d762c1615b96b5157a0d695a27c463ff8526b73b4c2ebe9a3c2cd0f8

See more details on using hashes here.

File details

Details for the file variant-0.0.71-py3-none-any.whl.

File metadata

  • Download URL: variant-0.0.71-py3-none-any.whl
  • Upload date:
  • Size: 11.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.7.0 CPython/3.11.5 Linux/5.15.133.1-microsoft-standard-WSL2

File hashes

Hashes for variant-0.0.71-py3-none-any.whl
Algorithm Hash digest
SHA256 73407bb3e66924693ecc16a4549f4f5de54555fad5b9b406a417331ce7658c10
MD5 370bcf3b54fc45a8211355c7eda53284
BLAKE2b-256 bc804d6f3970beff8d5e610a6cfb83d6505d1243f7c9a828a5d82c317414cb7b

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page