Skip to main content

A comprehensive toolkit for processing and analyzing metabolomics data.

Project description

Rodin: Metabolomics Data Analysis Toolkit

DOI PyPI Monthly installs All-time installs

Rodin is a Python library specifically designed for the comprehensive processing and analysis of metabolomics data and other omics data. It is a class-methods based toolkit, facilitating a range of tasks from basic data manipulation to advanced statistical evaluations, visualization, and metabolic pathway analysis.

📘 Documentation: https://minasenko.com/rodin

Now, most of its functionality is available in the Web App at https://rodin-meta.com.

Features

  • Efficient Data Handling: Streamlined manipulation and transformation of metabolomics data and other omics.
  • Robust Statistical Analysis: Includes ANOVA, t-tests, and more.
  • Machine Learning Methods: Random Forest, Logistic and Linear regressions.
  • Advanced Dimensionality Reduction: Techniques like PCA, t-SNE, UMAP.
  • Interactive Data Visualization: Tools for effective data visualization.
  • Pathway Analysis: Features for metabolic pathway analysis.

Installation

We recommend installing Rodin in a separate environment for effective dependency management.

Prerequisites

  • Python (3.10 or higher)

Install Rodin

pip install rodin

or install Rodin directly from GitHub:

pip install git+https://github.com/BM-Boris/rodin.git

Basic Example

Here's a basic example demonstrating the usage of Rodin for data analysis. Comprehensive Jupyter notebook guides can be found in the 'guides' folder

import rodin

# Assume 'features.csv' and 'class_labels.csv' are your datasets
features_path = 'path/to/features.csv'
classes_path = 'path/to/class_labels.csv'

# Creating an instance of Rodin_Class
rodin_instance = rodin.create(features_path, classes_path)

# Transform the data (imputation, normalization, and log-transformation steps)
rodin_instance.transform()

# Run t-test comparing two groups based on 'age'
rodin_instance.ttest('age')

# Run two-way anova test comparing groups based on 'age' and 'region'
rodin_instance.twoway_anova(['age','region'])

# Run multiple logistic regressions and linear regressions to get pvalues for each feature
rodin_instance.sf_lg('sex')
rodin_instance.sf_lr('age')

#Run a random forest classifier and regressor to obtain the metrics of the trained model using k-fold validation, with assigned feature importance scores to each variable
rodin_instance.rf_class('region')
rodin_instance.rf_regress('age')

#Slice the whole object using the pattern from pandas
rodin_instance = rodin_instance[rodin_instance.features[rodin_instance.features['imp(rf) age']>0]]

# Perform PCA with 2 principal components (UMAP and t-SNE are available as well)
rodin_instance.run_pca(n_components=2)

# Plotting the PCA results
# 'region' column in the 'samples' DataFrame is used for coloring the points
rodin_instance.plot(dr_name='pca', hue='region', title='PCA Plot')
# Volcano Plot
rodin_instance.volcano(p='p_adj(owa) region', effect_size='lfc (New York vs Georgia)', sign_line=0.01)
# Box Plot
rodin_instance.boxplot(rows=[9999,4561], hue='region')
# Clustergram 
rodin_instance.clustergram(hue='sex',standardize='row')

# Pathway analysis 
rodin_instance.analyze_pathways(pvals='p_value', stats='statistic',mode='positive')
# Replace 'p_value' and 'statistic' with the actual column names in your 'features' DataFrame(rodin_instance.features) and provide Mass spectrometry analysis mode.

The updated guide can be accessed here: https://bm-boris.github.io/rodin_guide/basics.html. Test data from the guide can be found at https://github.com/BM-Boris/rodin_guide/tree/main/data.

Contact

For questions, suggestions, or feedback, please contact boris.minasenko@emory.edu

Citation

If you use Rodin in your research, please cite the following paper:

Minasenko B, Wang D, Cirillo P, Krigbaum N, Cohn B, Jones DP, Collins JM, Hu X.
Rodin: a streamlined metabolomics data analysis and visualization tool. Bioinformatics Advances. 2025; 5(1): vbaf088.
https://doi.org/10.1093/bioadv/vbaf088

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

rodin-1.9.25.tar.gz (2.1 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

rodin-1.9.25-py3-none-any.whl (2.2 MB view details)

Uploaded Python 3

File details

Details for the file rodin-1.9.25.tar.gz.

File metadata

  • Download URL: rodin-1.9.25.tar.gz
  • Upload date:
  • Size: 2.1 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.6

File hashes

Hashes for rodin-1.9.25.tar.gz
Algorithm Hash digest
SHA256 28107d5297e9db76dc6773cc46fffb11aa3becde3e674fe256e5890d0fe21bbf
MD5 da5e40af205bced4b0a67c8b599eca04
BLAKE2b-256 c8279e699576f98e82a316840a6f2835eb9ab14f86093e6aeccef625f87f302d

See more details on using hashes here.

File details

Details for the file rodin-1.9.25-py3-none-any.whl.

File metadata

  • Download URL: rodin-1.9.25-py3-none-any.whl
  • Upload date:
  • Size: 2.2 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.6

File hashes

Hashes for rodin-1.9.25-py3-none-any.whl
Algorithm Hash digest
SHA256 322eb20a3117d2a51945b23f818b5030379edaad72eff75d442180db8afe12ff
MD5 5057069c9f4e0bb24bd99f1ff46dce41
BLAKE2b-256 34e2e031acfc808ddcaac1773a4637f088583b1c5fa52d5da604947dd63f0161

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page