
Extract genome feature sequences for biologists


# Overview

featurExtract is a Python package for bioinformatics. The package contains two executable command-line programs. The first, featurExtract, includes ten subroutines: create, gene, promoter, UTR, uORF, CDS, dORF, exon, intron, and intergenic. The create subroutine builds the database. The promoter subroutine extracts promoter sequences. The uORF subroutine extracts upstream open reading frame sequences. The UTR subroutine extracts untranslated region sequences. The CDS subroutine extracts coding sequences. The intergenic subroutine extracts the intergenic sequence between two genes. The second program, genBankExtract, includes four subroutines: gene, CDS, rRNA, and tRNA.
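To make the idea of a promoter subroutine concrete, here is a minimal sketch of strand-aware promoter extraction in plain Python. It is not the package's implementation. The function `promoter_sequence` and its `upstream` and `downstream` parameters are hypothetical names chosen for illustration, and the sketch assumes 1-based inclusive GFF coordinates with the transcription start site (TSS) at the gene start on the `+` strand and at the gene end on the `-` strand.

```python
def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string (A/C/G/T/N, either case)."""
    table = str.maketrans("ACGTNacgtn", "TGCANtgcan")
    return seq.translate(table)[::-1]


def promoter_sequence(chrom_seq: str, start: int, end: int, strand: str,
                      upstream: int, downstream: int = 0) -> str:
    """Extract a promoter window around the TSS of one gene.

    start/end: 1-based inclusive gene coordinates, as in GFF/GTF.
    upstream/downstream: hypothetical parameters giving the number of bases
    kept before the TSS and from the TSS onward. The window is clipped at
    the chromosome ends.
    """
    if strand == "+":
        tss = start  # on the plus strand the TSS is the leftmost base
        left = max(1, tss - upstream)
        right = min(len(chrom_seq), tss + downstream - 1)
        return chrom_seq[left - 1:right]
    if strand == "-":
        tss = end  # on the minus strand the TSS is the rightmost base
        left = max(1, tss - downstream + 1)
        right = min(len(chrom_seq), tss + upstream)
        # Report the sequence in the gene's own 5'->3' orientation.
        return reverse_complement(chrom_seq[left - 1:right])
    raise ValueError(f"unknown strand: {strand!r}")


# Toy chromosome: A at 1-5, C at 6-10, G at 11-15, T at 16-20.
chrom = "AAAAACCCCCGGGGGTTTTT"
plus_promoter = promoter_sequence(chrom, 11, 15, "+", upstream=5)
minus_promoter = promoter_sequence(chrom, 6, 10, "-", upstream=7)
```

For a minus-strand gene the upstream region lies to the right of the gene on the reference, which is why the window is taken after `end` and then reverse-complemented.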

## Brief introduction to the featurExtract package

### Install

There are two ways to install the featurExtract module.

#### Install from the command line

```bash
pip install featurExtract
# or install from source
git clone https://github.com/SitaoZ/featurExtract.git
cd featurExtract
python setup.py install
```

#### Requirements

- python >= 3.7.6 [python](https://www.python.org/)
- pandas >= 1.2.4 [pandas](https://pandas.pydata.org/docs/)
- gffutils >= 0.10.1 [gffutils](https://pythonhosted.org/gffutils/)
- setuptools >= 49.2.0 [setuptools](https://pypi.org/project/setuptools/)
- biopython >= 1.78 [biopython](https://biopython.org/wiki/Documentation/)
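One way to confirm that an environment satisfies these minimums is a small check script. The sketch below is an illustration, not part of featurExtract: the helper names are hypothetical, the version comparison only looks at leading numeric components, and `importlib.metadata` is assumed to be available (it ships with Python 3.8 and later, so Python 3.7 would need the `importlib_metadata` backport instead).

```python
import re
from importlib.metadata import PackageNotFoundError, version

# Minimum versions copied from the requirements listed above.
MINIMUMS = {
    "pandas": "1.2.4",
    "gffutils": "0.10.1",
    "setuptools": "49.2.0",
    "biopython": "1.78",
}


def version_tuple(text):
    """Turn '1.2.4' into (1, 2, 4); stop at the first non-numeric component."""
    numbers = []
    for piece in text.split("."):
        match = re.match(r"\d+", piece)
        if match is None:
            break
        numbers.append(int(match.group()))
    return tuple(numbers)


def meets_minimum(installed, minimum):
    """Compare two version strings numerically, padding the shorter with zeros."""
    a, b = version_tuple(installed), version_tuple(minimum)
    width = max(len(a), len(b))
    a += (0,) * (width - len(a))
    b += (0,) * (width - len(b))
    return a >= b


def check_requirements(minimums=MINIMUMS):
    """Return a dict mapping each package name to 'ok', 'missing', or 'too old (x)'."""
    report = {}
    for name, minimum in minimums.items():
        try:
            installed = version(name)
        except PackageNotFoundError:
            report[name] = "missing"
            continue
        ok = meets_minimum(installed, minimum)
        report[name] = "ok" if ok else f"too old ({installed})"
    return report


if __name__ == "__main__":
    for package, status in check_requirements().items():
        print(f"{package}: {status}")
```

Comparing numeric tuples rather than raw strings matters here because, as strings, `"0.9"` would sort after `"0.10.1"`.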

### Usage

featurExtract is designed for GFF and GTF files, and genBankExtract is suited to GenBank files.

#### featurExtract

```bash
# gff or gtf database
which featurExtract
featurExtract -h
featurExtract create -h
featurExtract promoter -h
featurExtract UTR -h
featurExtract uORF -h
featurExtract CDS -h
featurExtract dORF -h
featurExtract exon -h
featurExtract intron -h
featurExtract intergenic -h
```

#### genBankExtract

```bash
# GenBank database
which genBankExtract
genBankExtract -h
genBankExtract gene -h
genBankExtract CDS -h
genBankExtract rRNA -h
genBankExtract tRNA -h
```

### Examples

#### featurExtract

```bash
# step 1
featurExtract create -g ath.gff3
# step 2 command
featurExtract promoter -l 200 -u 100 -f ath.fa -o promoter.csv
featurExtract UTR -o UTR.csv
featurExtract uORF -o uORF.csv
featurExtract CDS -o CDS.csv
featurExtract exon -f ath.fa -t AT1G01010.1 -p
featurExtract intron -f ath.fa -t AT1G01010.1 -p
```
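The exon and intron commands above operate on a single transcript. As a rough illustration of how the two feature types relate, the sketch below derives intron intervals as the gaps between consecutive exons of one transcript. This is an assumption about the general approach, not a description of what featurExtract does internally. The function name `introns_from_exons` is hypothetical and the coordinates are made up, not real AT1G01010.1 values.

```python
from typing import List, Tuple

Interval = Tuple[int, int]  # 1-based, inclusive (start, end), as in GFF/GTF


def introns_from_exons(exons: List[Interval]) -> List[Interval]:
    """Return the gaps between consecutive exons of one transcript.

    Exons may be given in any order; they are sorted by start coordinate.
    Adjacent or overlapping exons produce no intron between them.
    """
    ordered = sorted(exons)
    introns = []
    for (_, prev_end), (next_start, _) in zip(ordered, ordered[1:]):
        if next_start - prev_end > 1:
            introns.append((prev_end + 1, next_start - 1))
    return introns


# Made-up exon coordinates for illustration only.
example_exons = [(300, 400), (100, 200), (401, 500)]
example_introns = introns_from_exons(example_exons)
```

The intervals are in reference coordinates. For a minus-strand transcript the corresponding sequences would still need to be reverse-complemented and listed in reverse order to follow the transcript's 5' to 3' direction.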

#### genBankExtract

```bash
# GenBank step 3
genBankExtract gene -g NC_000932.gb -f dna -p
genBankExtract CDS -g NC_000932.gb -f dna -p
genBankExtract rRNA -g NC_000932.gb -f dna -p
genBankExtract tRNA -g NC_000932.gb -f dna -p
```


