A a tool to find coding orfs
Project description
Introduction
GOODORFS is a tool to classify open reading frames into coding and noncoding
It takes as input a fasta file representing the entire genome. It then finds all potential open reading frames, and then for each, calculates the Energy Density Profile from the amino acid frequency.
To install GOODORFS
,
git clone https://github.com/deprekate/goodorfs.git
cd goodorfs
python3 setup.py install
To run GOODORFS
simply provide the path to a fasta file. The default output is the same format as Glimmers LONGORFS program, in order to serve as a drop in replacement.
The columns are: orf_id, start_location, stop_location, frame, a bunch of zeros as filler
$ goodorfs.py tests/NC_001416.fna
00001 191 736 +2 0.000
00002 711 2636 +3 0.000
00003 2633 2839 +2 0.000
00004 3270 2830 -1 0.000
00005 2836 4437 +1 0.000
00006 5095 4604 -2 0.000
00007 4283 5737 +2 0.000
...
Additionally GOODORFS
can also output the nucleotide sequences in fasta format for use in other applications:
$ good-orfs.py -Y fna tests/NC_001416.fna | head
>NC_001416_orf1 [START=191] [STOP=736]
ATGGAAGTCAACAAAAAGCAGCTGGCTGACATTTTCGGTGCGAGTATCCGTACCATTCA...
>NC_001416_orf2 [START=711] [STOP=2636]
GTGAATATATCGAACAGTCAGGTTAACAGGCTGCGGCATTTTGTCCGCGCCGGGCTTCG...
>NC_001416_orf3 [START=2633] [STOP=2839]
ATGACGCGACAGGAAGAACTTGCCGCTGCCCGTGCGGCACTGCATGACCTGATGACAGG...
>NC_001416_orf4 [START=3270] [STOP=2830]
GTGCATGGCCACACCTTCCCGAATCATCATGGTAAACGTGCGTTTTCGCTCAACGTCAA...
...
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distributions
File details
Details for the file goodorfs-1.0.tar.gz
.
File metadata
- Download URL: goodorfs-1.0.tar.gz
- Upload date:
- Size: 24.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.24.0 setuptools/49.6.0 requests-toolbelt/0.9.1 tqdm/4.48.2 CPython/3.6.8
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 01fad0af0b8756f364b78ae8a3efb951552e16ae69bea9cd64a307af3d015224 |
|
MD5 | 4be61f93544b2341a3eaa95bf8e73040 |
|
BLAKE2b-256 | 427708048885e3b09787609d12476f6d11ae0891370ad25a1f218eed121fdf00 |
File details
Details for the file goodorfs-1.0-py3.6.egg
.
File metadata
- Download URL: goodorfs-1.0-py3.6.egg
- Upload date:
- Size: 29.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.24.0 setuptools/49.6.0 requests-toolbelt/0.9.1 tqdm/4.48.2 CPython/3.6.8
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 7994ddd90cdd51c8b9ee3a72570851778405b0d0ed79dccda1a444fc0d544562 |
|
MD5 | 731aa73283bc61a0b5d1d18173c7b8ab |
|
BLAKE2b-256 | 71092cc7c1dc8915525eb786dbe4d629f3031c8c01ef807ce9e58e26d1f16256 |
File details
Details for the file goodorfs-1.0-py2.py3-none-any.whl
.
File metadata
- Download URL: goodorfs-1.0-py2.py3-none-any.whl
- Upload date:
- Size: 25.8 kB
- Tags: Python 2, Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.24.0 setuptools/49.6.0 requests-toolbelt/0.9.1 tqdm/4.48.2 CPython/3.6.8
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 0714f1ca0eb6055f9b7fa59c543310cc12fa9ba7a47450c49e75ecb7db49b4ff |
|
MD5 | 57312f058b8744e217242de33a6fd7e3 |
|
BLAKE2b-256 | 4b94de4e160da4ab29d6cf8453f478bba20e04b17c176f86bc7bd92ce80ae26e |