Extract genome feature sequences for biologists

Project description

Overview

featurExtract is a Python package for bioinformatics.
The package contains two executable command-line programs.
The first program, featurExtract, includes ten subroutines:
create, gene, promoter, UTR, uORF, CDS, dORF, exon, intron,
and intergenic. The create subroutine builds the annotation
database. The promoter subroutine extracts promoter sequences.
The uORF subroutine extracts upstream open reading frame
sequences. The UTR subroutine extracts untranslated region
sequences. The CDS subroutine extracts coding sequences. The
intergenic subroutine extracts the intergenic sequence between
two genes. The second program, genBankExtract, includes four
subroutines: gene, CDS, rRNA, and tRNA.
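
To make the promoter subroutine's task concrete, here is a minimal Python sketch of strand-aware promoter extraction. It is an illustration only, not featurExtract's implementation: the function names, the `upstream` and `downstream` parameters, and the choice to approximate the transcription start site (TSS) by the gene start (plus strand) or gene end (minus strand) are all assumptions. Coordinates are taken to be 1-based and inclusive, as in GFF.

```python
# Hypothetical helper names; this is not the featurExtract API.
COMPLEMENT = str.maketrans("ACGTNacgtn", "TGCANtgcan")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def promoter_sequence(chrom_seq, gene_start, gene_end, strand,
                      upstream=200, downstream=100):
    """Return `upstream` bases before the TSS plus `downstream` bases
    starting at the TSS, read in the gene's own orientation.

    gene_start and gene_end are 1-based inclusive coordinates (assumed,
    following the GFF convention). The window is clipped at the
    chromosome ends.
    """
    if strand == "+":
        tss = gene_start
        lo = max(1, tss - upstream)
        hi = min(len(chrom_seq), tss + downstream - 1)
        return chrom_seq[lo - 1:hi]
    if strand == "-":
        # On the minus strand the TSS is the highest coordinate and
        # "upstream" lies at higher genomic positions.
        tss = gene_end
        lo = max(1, tss - downstream + 1)
        hi = min(len(chrom_seq), tss + upstream)
        return reverse_complement(chrom_seq[lo - 1:hi])
    raise ValueError("strand must be '+' or '-'")
```

The minus-strand branch is the part that is easy to get wrong by hand: the window is taken on the forward strand and then reverse complemented, so the result always reads 5' to 3' relative to the gene.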

Brief introduction of featurExtract package

Install

There are two ways to install the featurExtract package.

Install from the command line

pip install featurExtract
# or install from source
git clone https://github.com/SitaoZ/featurExtract.git
cd featurExtract
python setup.py install

Requirements

python >= 3.7.6
pandas >= 1.2.4
gffutils >= 0.10.1
setuptools >= 49.2.0
biopython >= 1.78
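
A quick way to confirm that an environment meets these minimums is sketched below. It uses `importlib.metadata` from the standard library, which is only available from Python 3.8 onward, so this check itself assumes a slightly newer interpreter than the 3.7.6 minimum listed above. The helper names are hypothetical, and the version comparison is a simplification that looks only at leading numeric components rather than a full PEP 440 parser.

```python
import re
from importlib import metadata

# Minimum versions copied from the Requirements list above.
REQUIREMENTS = {
    "pandas": "1.2.4",
    "gffutils": "0.10.1",
    "setuptools": "49.2.0",
    "biopython": "1.78",
}


def version_tuple(version):
    """Turn '1.2.4' into (1, 2, 4); stop at the first non-numeric piece."""
    parts = []
    for piece in version.split("."):
        match = re.match(r"\d+", piece)
        if match is None:
            break
        parts.append(int(match.group()))
    return tuple(parts)


def check_requirements(requirements=REQUIREMENTS):
    """Return a dict mapping each package name to a short status string."""
    report = {}
    for name, minimum in requirements.items():
        try:
            installed = metadata.version(name)
        except metadata.PackageNotFoundError:
            report[name] = "missing"
            continue
        if version_tuple(installed) >= version_tuple(minimum):
            report[name] = "ok"
        else:
            report[name] = f"too old ({installed} < {minimum})"
    return report

# Usage: for name, status in check_requirements().items(): print(name, status)
```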

Usage

featurExtract is designed for GFF and GTF files,
and genBankExtract is designed for GenBank files.

featurExtract

# gff or gtf database 
which featurExtract
featurExtract -h 
featurExtract create -h 
featurExtract promoter -h 
featurExtract UTR -h 
featurExtract uORF -h 
featurExtract CDS -h 
featurExtract dORF -h
featurExtract exon -h
featurExtract intron -h
featurExtract intergenic -h

genBankExtract

# GenBank database
which genBankExtract
genBankExtract -h
genBankExtract gene -h
genBankExtract CDS  -h
genBankExtract rRNA -h
genBankExtract tRNA -h

Examples

featurExtract

# step 1: create the database from the annotation file
featurExtract create -g ath.gff3
# step 2: extract features
featurExtract promoter -l 200 -u 100 -f ath.fa -o promoter.csv
featurExtract UTR  -o UTR.csv
featurExtract uORF -o uORF.csv
featurExtract CDS  -o CDS.csv
featurExtract exon -f ath.fa -t AT1G01010.1 -p 
featurExtract intron -f ath.fa -t AT1G01010.1 -p  
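
The commands above write CSV files such as promoter.csv. Since the column layout of those files is not documented here, the sketch below inspects a file without assuming any column names: it returns the header row and the first few records using only the standard library. The function name `preview_csv` is hypothetical, and the sketch assumes the file is comma-separated with a header on its first line.

```python
import csv


def preview_csv(path, n_rows=5):
    """Return (header, rows) for the first n_rows records of a CSV file.

    Assumes a comma-separated file whose first line is a header; no
    column names are assumed.
    """
    with open(path, newline="") as handle:
        reader = csv.reader(handle)
        header = next(reader, [])
        # zip stops after n_rows without reading the rest of the file.
        rows = [row for _, row in zip(range(n_rows), reader)]
    return header, rows

# Usage: header, rows = preview_csv("promoter.csv")
```

Checking the header this way before loading the file into pandas avoids guessing at column names.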

genBankExtract

# GenBank step 3
genBankExtract gene -g NC_000932.gb -f dna -p  
genBankExtract CDS  -g NC_000932.gb -f dna -p 
genBankExtract rRNA -g NC_000932.gb -f dna -p
genBankExtract tRNA -g NC_000932.gb -f dna -p
