CobraBabel: COBRA Model Translator
Metabolic models are available from many sources and different sources use different ID namespaces. That makes combining and comparing models from different sources difficult. CobraBabel provides functions to translate between ID namespaces and convert metabolic models to a common ID namespace. CobraBabel supports the following namespaces:
- Virtual Metabolic Human (VMH)
- Biochemical, Genetic and Genomic knowledge base (BiGG)
- Kyoto Encyclopedia of Genes and Genomes (KEGG)
- Take a list of models, open all of them, create a universal model with all unique reactions and metabolites.
- Download specific files, can get the whole BiGG database, VMH mapping file, MetaNetX translation file.
- Function to convert compartment suffixes.
- Import from a Excel file or csv file.
- ID match without special characters
VMH provides a SEED2VMH_translation.csv file which is not current with the translation table in Supplementary Table 15 in the AGORA paper. Supplementary Table 15 is provided as an Excel workbook which was parsed into two files: (1) vmh_metabolite_xref.tsv and (2) vmh_reaction_xref.tsv which are provided in the “data” folder.
Universal metabolites can have more than one formula if different models use a different formula for a metabolite with the same ID (same situation can occur with charge). Currently, CobraBabel blindly picks the first formula in the list of formulae for a universal metabolite.
Universal metabolites have a list of all the compartments where the metabolite is used. Currently, CobraBabel assumes that all compartment IDs are defined the same across all models. For example, compartment ID “c” means “cytosol” in all BiGG models. The first instance of the compartment in a metabolite is the one that sets the name.
Some reactions are defined with unspecified stoichiometry coefficients, for example: (2n) MNXM1471 + 1 MNXM3341 = (2n) MNXM1 + 1 MNXM4074 + (2n) MNXM537. Currently, CobraBabel does not include these reactions when creating an universal model.
There are some reactions where there is a BIOMASS metabolite in the reaction definition. But the BIOMASS metabolite is not defined so any reactions with the BIOMASS metabolite are not included in the universal model.
If a reaction has a value in the Source field, there is only one source in the format: source:id. Sources include Rhea (rhea), KEGG (kegg), MetaCyc (metacyc), UniPathway (upa), The Seed (seed), BiGG (bigg), BioPath (biopath), and Reactome (reactome). Set the verbose parameter when calling create_metanetx_universal_model() to show a warning for reactions with an invalid format in the Source field.
There are no compartments so metabolites are defined without a compartment. Still not sure how exchange and transport reactions are defined.
All reactions are defined as bi-directional so the lower and upper bounds are set to the default values (-1000.0, 1000.0) for all reactions.
There are some limitations on the KEGG API, most importantly:
Restrictions: KEGG API is provided for academic use by academic users belonging to academic institutions. This service should not be used for bulk data downloads. Please obtain KEGG FTP academic subscription for downloading KEGG data.
When setting OTU representatives in the organism database, there can be variability in which organism is selected when there are multiple good matches. For example, in the KEGG database there are two Mycobacterium tuberculosis H37Rv organisms, one with ID “T00015” and one with ID “T02178”. It is random as to which one gets picked as the OTU representative.
MetaNetX.org: a website and repository for accessing, analyzing and manipulating metabolic networks. Bioinformatics (2013)
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
|Filename, size||File type||Python version||Upload date||Hashes|
|Filename, size cobrababel-0.1.2.tar.gz (29.2 kB)||File type Source||Python version None||Upload date||Hashes View|