Skip to main content

Extract amino acid sequences from a fasta file based on a GFF

Project description

CodeFactor GitHub

AminoExtract

AminoExtract is a CLI tool and library to work with GFF files, and extract amino acid sequences from nucleotide sequences based on a GFF input file.

AminoExtract is able to filter the genomic features in the input files to make sure your output makes sense and to write the resulting amino acid sequences to either a single file or to individual files for every feature, depending on your inputs as a user.

Additionally, AminoExtract allows you to use GFF data in python by making it accessible in a pandas dataframe structure.

Why this tool?

Because sometimes you just want a dedicated tool to do a mundane task. And sometimes it's just simply necessary to, for example, ensure reproducibility, portability or to facilitate long-term maintainability of larger projects.

Instead of copying this one script used for translating and writing amino acids across all projects, we can now point to AminoExtract for just that.

Installation requirements

AminoExtract requires python 3.10 or later to work.

Dependencies such as Pandas, Biopython and python-magic are installed during the installation procedure.

Installation instructions

AminoExtract can be instsalled easily with conda or pip.
Installation through conda is recommended.

Installation with conda

conda install -c bioconda -c conda-forge aminoextract

Installation with pip

pip install AminoExtract

Installation from source

  1. Download the latest version of AminoExtract by cloning this repository and navigate to the newly created directory.
    Copy and paste the code-snippet below in order to do so.
git clone https://github.com/RIVM-bioinformatics/AminoExtract.git && cd AminoExtract/
  1. If necessary, create a conda-environment and install the necessary dependencies.
    Copy and paste the code-snippet below in order to do so.
mamba env create -f env.yml && conda activate AminoExtract
  1. Now install AminoExtract into the conda environment with the following:
pip install .

AminoExtract is now installed!
You can use AminoExtract from anywhere on your system as long as the conda-environment in which it is installed is active.
You can test if installation was succesful by typing AminoExtract -v which should display the installed version.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

aminoextract-0.4.1.tar.gz (113.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

aminoextract-0.4.1-py3-none-any.whl (21.7 kB view details)

Uploaded Python 3

File details

Details for the file aminoextract-0.4.1.tar.gz.

File metadata

  • Download URL: aminoextract-0.4.1.tar.gz
  • Upload date:
  • Size: 113.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for aminoextract-0.4.1.tar.gz
Algorithm Hash digest
SHA256 ead4496a819e86519f0a6b7a3119539c81b0bec289652692ed13617fcbb570f0
MD5 48e8544a6940a114cf13287c972ee1ef
BLAKE2b-256 365d79b45e01998b6f5989d99821534f9c655d42d628bb285608e2cfdb384a18

See more details on using hashes here.

File details

Details for the file aminoextract-0.4.1-py3-none-any.whl.

File metadata

  • Download URL: aminoextract-0.4.1-py3-none-any.whl
  • Upload date:
  • Size: 21.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for aminoextract-0.4.1-py3-none-any.whl
Algorithm Hash digest
SHA256 d17067299be3fa31cb01fef6b6ac1519ca8977bc37fc34c405eeaced857b75cc
MD5 cba5377626c294f960d4a04af867f831
BLAKE2b-256 d414ab535a0c914f903968f29dc22a5e30cec2b85990fd5ea7a1735ad725a77e

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page