Skip to main content

Extract genome ferature sequence for biologists

Project description

Overview

The featurExtract is python package for bioinformatics.
The packages contains two executable command programs.
The first executable program is featurExtract including
ten subroutines termed create, gene, promoter, UTR, uORF,
CDS, dORF, exon, intron, intergenic. The create subroutine is
used for creating database. The promoter subroutine is used
for extracting promoter sequence. uORF subroutine is used
for extracting upstream open reading frames sequence. UTR
subroutine is used for extracting untranslated region sequence.
CDS subroutine is used for extracting coding sequence.intergenic
subroutine is used for extracting intergenic sequence between two
genes. The second executable program is genBankExtract including
four subroutines termed gene, CDS, rRNA, tRNA.

Brief introduction of featurExtract package

Install

Two way offer to install featurExtract module.

install command line

pip install featurExtract
# other
git clone https://github.com/SitaoZ/featurExtract.git
cd featurExtract
python setup.py install

Requirements

python >= 3.7.6 python
pandas >= 1.2.4 pandas
gffutils >= 0.10.1 gffutils
setuptools >= 49.2.0 setuptools
biopython >= 1.78 biopython

Usage

featurExtract is designed for GFF and GTF file
and GenBankExtract is suited for GenBank file.

featurExtract

# gff or gtf database 
which featurExtract
featurExtract -h 
featurExtract create -h 
featurExtract promoter -h 
featurExtract UTR -h 
featurExtract uORF -h 
featurExtract CDS -h 
featurExtract dORF -h
featurExtract exon -h
featurExtract intron -h
featurExtract intergenic -h

genBankExtract

# GenBank database
which genBankExtract
genBankExtract -h
genBankExtract gene -h
genBankExtract CDS  -h
genBankExtract rRNA -h
genBankExtract tRNA -h

Examples

featurExtract

# step 1 
featurExtract create -g ath.gff3 
# step 2 command
featurExtract promoter -l 200 -u 100 -f ath.fa -o promoter.csv
featurExtract UTR  -o UTR.csv
featurExtract uORF -o uORF.csv
featurExtract CDS  -o CDS.csv
featurExtract exon -f ath.fa -t AT1G01010.1 -p 
featurExtract intron -f ath.fa -t AT1G01010.1 -p  

genBankExtract

# GenBank step 3
genBankExtract gene -g NC_000932.gb -f dna -p  
genBankExtract CDS  -g NC_000932.gb -f dna -p 
genBankExtract rRNA -g NC_000932.gb -f dna -p
genBankExtract tRNA -g NC_000932.gb -f dna -p

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

featurExtract-0.2.4.5.tar.gz (17.7 kB view details)

Uploaded Source

File details

Details for the file featurExtract-0.2.4.5.tar.gz.

File metadata

  • Download URL: featurExtract-0.2.4.5.tar.gz
  • Upload date:
  • Size: 17.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/4.6.1 pkginfo/1.5.0.1 requests/2.22.0 requests-toolbelt/0.9.1 tqdm/4.42.1 CPython/3.7.6

File hashes

Hashes for featurExtract-0.2.4.5.tar.gz
Algorithm Hash digest
SHA256 7d1a7b8eb3d38a2447572d98361cf0edb1f53ef2af1957fecce6dd07303b8e8d
MD5 825d4d59eb3cd71f5ea065ad3f2b28be
BLAKE2b-256 3d4b41fd3b1548b24cee8e5ea77b49284430f42003276625bb426a8a611b5618

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page