software tool designed to accurately identify the genetic translation table (GTT) used in prokaryotic organisms.
Project description
gTranslate
gTranslate is a machine learning-based command-line tool for predicting the translation table (TT) used by prokaryotic genomes. By analyzing specific sequence features — such as coding density differences, Trp ratios, and Gly ratios — gTranslate can accurately distinguish between the genetic codes associated with reassignment of the UGA stop codon, i.e. translation tables 11 (standard prokaryotic code), 4 (UGA=Trp), and 25 (UGA=Gly).
Features
- Automated Table Detection: Rapidly predict the correct translation table for a single genome or large batches of genomes.
- Interactive Visualizations: Generate dynamic HTML dashboards to explore the feature space used by the classifiers.
Installation
gTranslate requires Python >= 3.12 and Prodigal >= 2.6.2 on your system path.
Option 1: Bioconda (Recommended)
We recommend using Mamba for a faster setup:
conda create -n gtranslate -c conda-forge -c bioconda gtdb-gtranslate
conda activate gtranslate
Option 2: pip
It is recommended to use a virtual environment:
# Install
python -m pip install -i https://test.pypi.org/simple/ gtdb-gtranslate
# Upgrade
python -m pip install -i https://test.pypi.org/simple/ --upgrade gtdb-gtranslate
Usage
gTranslate is operated via two subcommands: detect_table and generate_plot.
You can view the general help menu at any time:
gtranslate -h
Detect table
The core pipeline for detecting the translation table used by prokaryotic organisms. You must provide input genomes either via a directory or a batch file.
Basic Usage:
# Process a directory of genomic FASTA files
gtranslate detect_table --genome_dir /path/to/genomes --out_dir /path/to/output
# Process genomic FASTA files defined in a batch file
gtranslate detect_table --batchfile genomes.tsv --out_dir /path/to/output
The file provided to --batchfile is a two-column, tab-separated values (TSV) file that indicates the path to the genomic FASTA file and the desired genome identifier, e.g.:
/example/path/GCF_001729785.1_ASM172978v1/GCF_001729785.1.fna.gz G001729785
/a/different/path/GCF_043834535.1.fna.gz G043834535
...
Generate plot (optional)
Generates an interactive HTML dashboard to visually explore the features (e.g., coding density difference, amino acid ratios) used by gTranslate to predict the translation table.
Basic Usage:
gtranslate generate_plot --feature_file features.tsv --output_file dashboard.html
The file features.tsv provided to --feature_file is generated by the detect_table command.
Testing
Validation that gTranslate has been installed correctly and is operating as expected can be done using the check_install and test subcommands:
Check installation
Verifies your installation and ensures that all required dependencies and reference data files are correctly configured and present.
Basic Usage:
gtranslate check_install
Test operation of gTranslate
Runs the built-in test suite on bundled genomic data to validate that gTranslate is functioning correctly on your system.
Basic Usage:
gtranslate test --cpus 4
Note: If --out_dir is not specified, the test will securely execute in a temporary directory.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file gtdb_gtranslate-0.0.2.tar.gz.
File metadata
- Download URL: gtdb_gtranslate-0.0.2.tar.gz
- Upload date:
- Size: 75.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
d72bafed7bfc8bed16e46c236bfbdc3cbd229cabe31d863006ded7b692f38426
|
|
| MD5 |
8282a032c90801012d9583ec012c923d
|
|
| BLAKE2b-256 |
db2406c2862a94de4f4c524716bb7c4c98714bec6a0fd42929dc8c2e9f0a249f
|
Provenance
The following attestation bundles were made for gtdb_gtranslate-0.0.2.tar.gz:
Publisher:
release-publish.yml on cmc-aau/gTranslate
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
gtdb_gtranslate-0.0.2.tar.gz -
Subject digest:
d72bafed7bfc8bed16e46c236bfbdc3cbd229cabe31d863006ded7b692f38426 - Sigstore transparency entry: 1437146899
- Sigstore integration time:
-
Permalink:
cmc-aau/gTranslate@60cb644a940b31c5d6da45f6b4733af2d51df723 -
Branch / Tag:
refs/tags/0.0.2 - Owner: https://github.com/cmc-aau
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release-publish.yml@60cb644a940b31c5d6da45f6b4733af2d51df723 -
Trigger Event:
release
-
Statement type:
File details
Details for the file gtdb_gtranslate-0.0.2-py3-none-any.whl.
File metadata
- Download URL: gtdb_gtranslate-0.0.2-py3-none-any.whl
- Upload date:
- Size: 102.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
d57cbbf04cc169b9f3c793255e8e993d20ce1e9b3f21c773cae1afafc31a922e
|
|
| MD5 |
2b35042febc536015166c25698d78a6e
|
|
| BLAKE2b-256 |
2ae930c5a9d96877896fb1d58c1dc947dc12c0d270e8f6115f8ce331892e592e
|
Provenance
The following attestation bundles were made for gtdb_gtranslate-0.0.2-py3-none-any.whl:
Publisher:
release-publish.yml on cmc-aau/gTranslate
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
gtdb_gtranslate-0.0.2-py3-none-any.whl -
Subject digest:
d57cbbf04cc169b9f3c793255e8e993d20ce1e9b3f21c773cae1afafc31a922e - Sigstore transparency entry: 1437146902
- Sigstore integration time:
-
Permalink:
cmc-aau/gTranslate@60cb644a940b31c5d6da45f6b4733af2d51df723 -
Branch / Tag:
refs/tags/0.0.2 - Owner: https://github.com/cmc-aau
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release-publish.yml@60cb644a940b31c5d6da45f6b4733af2d51df723 -
Trigger Event:
release
-
Statement type: