Skip to main content

like seqTailor, give a VCF file, it return genomic sequence.

Project description

vcf2seq

Aim

like seqTailor, give a VCF file, it return genomic sequence.

Installation

pip install vcf2ses

usage

usage: vcf2seq.py [-h] -g genome [-s SIZE] [-t {alt,ref,both}] [-b BLANK] [-a ADD_COLUMNS [ADD_COLUMNS ...]] [-o OUTPUT] [-v] vcf

================================================================================
Like seqtailor, give a VCF file, it return genomic sequence (default length: 31)

Nota:
- When a insertion is larger than '--size' option, only first '--size' nucleotides are outputed.
- header ID are formated like "<chr>_<position>_<ref>_<alt>".

VCF format specifications: https://github.com/samtools/hts-specs/blob/master/VCFv4.4.pdf
================================================================================

positional arguments:
  vcf                   vcf file (mandatory)

options:
  -h, --help            show this help message and exit
  -g genome, --genome genome
                        genome as fasta file (mandatory)
  -s SIZE, --size SIZE  size of the output sequence (defalt: 31)
  -t {alt,ref,both}, --type {alt,ref,both}
                        alt, ref, or both output? (default: alt)
  -b BLANK, --blank BLANK
                        Missing nucleotide character, default is dot (.)
  -a ADD_COLUMNS [ADD_COLUMNS ...], --add-columns ADD_COLUMNS [ADD_COLUMNS ...]
                        Add one or more columns to header (ex: '-a 3 AA' will add columns 3 and 27). The first column is '1' (or 'A')
  -o OUTPUT, --output OUTPUT
                        Output file (default: <input_file>-vcf2seq.fa)
  -v, --version         show program's version number and exit

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vcf2seq-0.4.0a0.tar.gz (18.8 kB view details)

Uploaded Source

Built Distribution

vcf2seq-0.4.0a0-py3-none-any.whl (19.7 kB view details)

Uploaded Python 3

File details

Details for the file vcf2seq-0.4.0a0.tar.gz.

File metadata

  • Download URL: vcf2seq-0.4.0a0.tar.gz
  • Upload date:
  • Size: 18.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.11.5

File hashes

Hashes for vcf2seq-0.4.0a0.tar.gz
Algorithm Hash digest
SHA256 85ab9da68724f9227c23538fa928d802bfa774aa2d0c935a4fe4e573df069deb
MD5 4f20b2d1a9c3ce8afe5b7f097924ca17
BLAKE2b-256 b0f11e42eb98feac8cbf4b9ee08778111dd9df0401cf4b4fe47a1885288f65a7

See more details on using hashes here.

File details

Details for the file vcf2seq-0.4.0a0-py3-none-any.whl.

File metadata

  • Download URL: vcf2seq-0.4.0a0-py3-none-any.whl
  • Upload date:
  • Size: 19.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.11.5

File hashes

Hashes for vcf2seq-0.4.0a0-py3-none-any.whl
Algorithm Hash digest
SHA256 8c598cc0584dceae724f62754a580b3960773c4539a49a77a7f69dbf38ba6ab5
MD5 df42444ffe009267e7965d14331f4601
BLAKE2b-256 9cf8cb9d8e83f10e000911dac05abf922eecce0f00f3e6474541558c3f43d063

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page