f2xba modelling framework: from FBA to extended genome-scale modelling
Project description
Welcome to f2xba's documentation
f2xba modelling framework: from FBA to extended genome-scale modelling
In the domain of systems biology, the f2xba modeling framework has been developed for the purpose of generating a variety of extended genome-scale metabolic model types using simple and consistent workflows. This modeling framework was developed at the research group for Computational Cell Biology (CCB) at Heinrich-Heine-University Düsseldorf, Germany.
The CCB research group has developed a suite of software tools to facilitate genome-scale metabolic modeling. Sybil is an R package that utilizes genome-scale metabolic network optimization through the use of flux balance analysis (FBA)-based methods. SybilccFBA is an extension designed to enhance the optimization of enzyme constraint models. TurNuP is a machine learning model that predicts turnover numbers, which are required to parametrize extended genome-scale models. smblxdf is a Python package that converts between SBML coded genome-scale metabolic models and tabular formats. It is used to create and modify SBML coded models, as well as to access model information.
Extended model types
f2xba support generation of enzyme constraint models, such as GECKO (Sánchez et al., 2017), ccFBA[^1], MOMENT (Adadi et al., 2012) and MOMENTmr[^2], resource balance constraint RBA models (Bulović et al., 2019; Goelzer et al., 2011), and thermodynamics constraint models, such as TFA (Henry et al., 2007; Salvy et al., 2019) and TGECKO (thermodynamic GECKO) and TRBA (thermodynamic RBA). These advanced model types, which have been developed in recent years, are based on existing genome-scale metabolic models used for FBA (flux balance analysis), a methodology that has been utilized for decades (Watson, 1986). Genome-scale metabolic models can be obtained from databases such as the BiGG models database (King, Lu, et al., 2015), or retrieved from publications.
Relevance of extended modelling
The advent of high-throughput data has led to a growing importance of these extended models. Fundamentally, FBA can be regarded as a predictor of the macroscopic behavior of metabolic networks, while extended models offer insights into the intricate functioning of these networks. Extended models contain considerably more parameters. While some of these additional parameters require definition, the majority are automatically retrieved from online databases and tools, including NCBI, UniProt, BioCyc, and TurNuP (Kroll et al., 2023). The development of these extended models and the enhancement of their parameters can be facilitated through simple and consistent workflows. Furthermore, the sharing of configuration data among different model types is encouraged. All extended models are exported in stand-alone SBML (Systems Biology Markup Language) coded files (Hucka et al., 2019) to facilitate model sharing and processing by downstream tools, such as cobrapy (Ebrahim et al., 2013). Additionally, the f2xba modeling framework provides optimization support via cobrapy or gurobipy interfaces. Optimization results are structured and enriched with additional data. This includes tables for each variable type, correlation plots, and exports to Esher (King, Dräger, et al., 2015). This facilitates interpretation of model predictions and supports workflows for model parameter adjustments.
Integrated solution
Research groups have already developed tools to support extended genome-scale modeling. These tools have been implemented in various programming environments, each exhibiting a distinct approach to model parametrization, generation, and optimization. However, none of these tools generate stand-alone models coded in SBML. ccFBA and MOMENT modeling is supported by the R package sybilccFBA, GECKO modeling by the MATLAB package geckomat, RBA modeling by the Python package RBApy, and thermodynamics modeling by the Python package pyTFA. f2xba is the first integrated tool to support model generation of various extended model types within a single programming environment, compatible model parametrizations, shareable configuration files, and consistent workflows for both model generation and optimization. The resulting models are exported to files and are fully compliant with the SBML standard. Furthermore, all annotation data from the original genome-scale (FBA) model is carried over. Depending on the availability of organism-specific data and actual requirements, different extended model types and differently parametrized versions of a target organism can be generated with relative ease. It is our hope that the f2xba modeling framework will support the community in actively using these extended model types, which have been published in the previous few years.
Tutorials
The documentation includes a set of tutorials with detailed descriptions, where different types of extended models are created based on the most recent genome-scale metabolic network reconstruction of Escherichia coli, iML1515 (Monk et al., 2017). Similar jupyter notebooks are available upon request for the generation of extended models based on yeast9 (Zhang et al., 2024), Saccharomyces cerevisiae, iJN678 (Nogales et al., 2012), Synechocystis sp. PCC 6803, and MMSYN (Breuer et al., 2019), the synthetic cell JCVI-Syn3A based on Mycoplasma mycoides capri.
Outlook
Growth balance analysis (GBA) (Dourado & Lercher, 2020) modeling is an active research project in CCB. In GBA models, reaction fluxes are coupled with protein requirements using non-linear kinetic functions, where enzyme saturation depends on variable metabolite concentrations. We have previously demonstrated the generation of small, schematic GBA models in SBML, the loading of these models from SBML, and the optimization of them using non-linear solvers. However, the optimization of genome-scale GBA models remains challenging. Once this optimization problem is resolved, f2xba could be extended to support GBA model generation, e.g., by extending GECKO or RBA configuration data, and GBA model optimization, either using nonlinear optimization features available in gurobi 12 or using a dedicated nonlinear solver like IPOP.
References:
[^1]: Desouki, A. A. (2015). sybilccFBA: Cost Constrained FLux Balance Analysis: MetabOlic Modeling with ENzyme kineTics (MOMENT). In CRAN. https://cran.r-project.org/web/packages/sybilccFBA/index.html
[^2]: Desouki, A. A. (2015). sybilccFBA: Cost Constrained FLux Balance Analysis: MetabOlic Modeling with ENzyme kineTics (MOMENT). In CRAN. https://cran.r-project.org/web/packages/sybilccFBA/index.html
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file f2xba-1.9.10.tar.gz.
File metadata
- Download URL: f2xba-1.9.10.tar.gz
- Upload date:
- Size: 197.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
81f275ef1599ea03b9c6c2cecd8633907075324334d9526c1057255b4caf3ee7
|
|
| MD5 |
2388c85715417ecd5e1dab3c72e11606
|
|
| BLAKE2b-256 |
977eb67e8ee3689119764a7a73b1b6f880a332b270cfcdf2c0c34614a02afd51
|
File details
Details for the file f2xba-1.9.10-py3-none-any.whl.
File metadata
- Download URL: f2xba-1.9.10-py3-none-any.whl
- Upload date:
- Size: 229.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
de5ea0143739b8ff30796fde698745a66f36a3ffbca23d4ce7257ca8c84043eb
|
|
| MD5 |
3e06fa5c94453385da2985d8576168b3
|
|
| BLAKE2b-256 |
4ad84d5eb622a802146176203dab16b2c5011a92b55202f4801b1651c4d8bef6
|