Skip to main content

Open Reading Frame finder natively coded in Python.

Project description


banner


Downloads

ORFFinder

ORFFinder in Python. Inspired by NCBI's version: https://www.ncbi.nlm.nih.gov/orffinder/

Finds the open reading frame (6-frame scan) on a given 5' to 3' nucleotide.

Installation:

pip3 install orffinder

Terminal Usage

Two command-line executable commands are available: orffinder-to-gtf orffinder-to-sequence.

Documentation for these commands can be retrieved by specifying <command> -h.

API Usage

Import the package

IMPORTANT: Your DNA/RNA strand should always be from the 5' to 3' direction when input! Transcription direction (Image credit: Khan Academy)

from Bio import SeqIO
from orffinder import orffinder

sequence = SeqIO.read("gene.fasta", "fasta")
orffinder.getORFs(sequence, minimum_length=75, remove_nested=True)

Documentation

getORFs()

Returns the loci of discovered ORFs in a dictionary format.

sequence: sequence in Biopython Seq or String format.
minimum_length: minimum size of ORF in nucleotides. Default: 75
start_codons: recognised 3-base-pair codons for initialisation. Default: ["ATG"]
stop_codons: recognised 3-base pair condons for termination. Default: ["TAA", "TAG", "TGA"]
remove_nested: remove all ORFs completely encased in another. Default: False
trim_trailing: remove ORFs are the edge of the sequence that do not have a defined stop codon. Default: False

getORFNucleotides()

Returns a list of Biopython Seq objects or loci of discovered ORFs with Biopython Seq objects in a dictionary format.

sequence: sequence in Biopython Seq or String format.
return_loci: return the loci together with the nucleotide sequences. Default: False
minimum_length: minimum size of ORF in nucleotides. Default: 75
start_codons: recognised 3-base-pair codons for initialisation. Default: ["ATG"]
stop_codons: recognised 3-base pair condons for termination. Default: ["TAA", "TAG", "TGA"]
remove_nested: remove all ORFs completely encased in another. Default: False
trim_trailing: remove ORFs are the edge of the sequence that do not have a defined stop codon. Default: False

getORFProteins()

Returns a list of Biopython Seq objects or loci of discovered ORFs with Biopython Seq objects in a dictionary format.

sequence: sequence in Biopython Seq or String format.
translation_table: translation table as per BioPython. Default: 1
return_loci: return the loci together with the protein sequences. Default: False
minimum_length: minimum size of ORF in nucleotides. Default: 75
start_codons: recognised 3-base-pair codons for initialisation. Default: ["ATG"]
stop_codons: recognised 3-base pair condons for termination. Default: ["TAA", "TAG", "TGA"]
remove_nested: remove all ORFs completely encased in another. Default: False
trim_trailing: remove ORFs are the edge of the sequence that do not have a defined stop codon. Default: False

Dependencies

Biopython (https://biopython.org/)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

orffinder-1.5.tar.gz (5.6 kB view details)

Uploaded Source

Built Distribution

orffinder-1.5-py3-none-any.whl (8.4 kB view details)

Uploaded Python 3

File details

Details for the file orffinder-1.5.tar.gz.

File metadata

  • Download URL: orffinder-1.5.tar.gz
  • Upload date:
  • Size: 5.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.2 importlib_metadata/4.8.1 pkginfo/1.7.1 requests/2.22.0 requests-toolbelt/0.9.1 tqdm/4.23.4 CPython/3.6.4

File hashes

Hashes for orffinder-1.5.tar.gz
Algorithm Hash digest
SHA256 ec8c8e625ba83f9d5f21732fdd40bc37a462cf254449f85eae1b78a01a53a9d5
MD5 3ddb94c6dfbabba7f05a3edbe822b3af
BLAKE2b-256 32aeb4816c2b9d558a783f11ef45f61ffaad9d95a6024df9dcd014d255a04355

See more details on using hashes here.

File details

Details for the file orffinder-1.5-py3-none-any.whl.

File metadata

  • Download URL: orffinder-1.5-py3-none-any.whl
  • Upload date:
  • Size: 8.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.2 importlib_metadata/4.8.1 pkginfo/1.7.1 requests/2.22.0 requests-toolbelt/0.9.1 tqdm/4.23.4 CPython/3.6.4

File hashes

Hashes for orffinder-1.5-py3-none-any.whl
Algorithm Hash digest
SHA256 9948de94b5d4b87058053e7c909aebfe73e274c486f1ffaa5cef0132df343bf3
MD5 f63e4bb26004d5c542d5f7a30f21842a
BLAKE2b-256 bb5a85294acdd15243236d8631d663037eacff7b9e4ee1d796efa9a86f35c259

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page