Skip to main content

Extract, Retrieve and Predict kcat values for a metabolic model to run enzyme constrained metabolic pipelines.

Project description

WILDkCAT

WILDkCAT is a set of scripts designed to extract, retrieve, and predict enzyme turnover numbers (kcat) for genome-scale metabolic models.


Installation

  1. Clone the repository:
git clone https://github.com/h-escoffier/WILDkCAT.git
cd WILDkCAT
  1. Install Miniconda or Anaconda.

  2. Create and activate the environment:

conda env create -f environment.yml
conda activate wildkcat-env
  1. Provide your BRENDA login credentials and Entrez API email adress to query the BRENDA enzyme database and NCBI database.

Create a file named .env in the root of your project with the following content:

ENTREZ_EMAIL=your_registered_email@example.com
BRENDA_EMAIL=your_registered_email@example.com
BRENDA_PASSWORD=your_password
  • Replace the placeholders with the credentials from the account you created on the BRENDA website.
  • Make sure this file is not shared publicly (e.g., add .env to your .gitignore) since it contains sensitive information.
  • The scripts will automatically read these environment variables to authenticate and retrieve kcat values.

Scripts Overview

extract_kcat.py

  • Verifies whether the reaction EC number exists.
  • Retains inputs where reaction-associated genes/enzymes are not supported by KEGG.
  • Retains inputs where no enzymes are provided by the model.

retrieve_kcat.py

  • If multiple enzymes are provided, searches UniProt for catalytic activity.
  • If multiple catalytic enzymes are identified, store all.
  • When multiple enzymes are found, computes identity percentages relative to the identified catalytic enzyme.
  • Applies Arrhenius correction to values within the appropriate pH range.
  • For rows with multiple scores, selects:
    • The best score
    • The highest identity percentage
    • The lowest kcat value

predict_kcat.py

  • If multiple enzymes are provided, searches UniProt for catalytic activity.
  • Skips entries missing KEGG compound IDs.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

wildkcat-0.0.2-py3-none-any.whl (40.5 kB view details)

Uploaded Python 3

File details

Details for the file wildkcat-0.0.2-py3-none-any.whl.

File metadata

  • Download URL: wildkcat-0.0.2-py3-none-any.whl
  • Upload date:
  • Size: 40.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.1

File hashes

Hashes for wildkcat-0.0.2-py3-none-any.whl
Algorithm Hash digest
SHA256 e14520ae0215e0cafb8ce7198001b6eef0bc8d5951f344b7712c18796f03e413
MD5 6119f64f0332f1ff733e3a1b712f3aca
BLAKE2b-256 53a0739d846161b17fed9c8742ec196d816711b7c218946ac7e7f9cd448073ad

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page