GPU-accelerated pipeline to compute a Mitochondrial Health Index (MHI)
Project description
MitoOmics-GPU [Work in Progress]
GPU-accelerated multi-omics pipeline to quantify and visualize the Mitochondrial Health Index (MHI) by integrating extracellular vesicle/mitochondrial-derived vesicle (EV/MDV) proteomics with single-cell RNA-seq.
Hackathon project by Team Go Getters at the NVIDIA Accelerate Omics Hackathon (8-25 Sept 2025).
👥 Team Go Getters
- Sayane Shome, PhD (AI in Healthcare, Stanford)[Team Lead]
- Seema Parte, PhD (Ophthalmology, Stanford)
- Hirenkumar Patel, PhD (Ophthalmology, Stanford)
- Ankit Maisuriya (PhD candidate, Quantum Photonics, Northeastern)
- Medha Bhattacharya (CS undergrad, UC Irvine)
🚀 Project Objective
-
Develop a GPU-accelerated pipeline for mitochondrial health analysis.
-
Link blood-derived EV/MDV proteomics with mitochondrial DNA copy-number proxies from scRNA-seq.
-
Provide interpretable measures:
- Biogenesis (capacity to grow new mitochondria)
- Fusion/Fission (structural remodeling)
- Mitophagy (repair/recycling)
- Heterogeneity (variation across cells).
-
Output: a unified Mitochondrial Health Index (MHI) summarizing mitochondrial resilience, fitness, and disease risk.
⚡ Installation
pip install mitoomics-gpu
🖥️ GPU Acceleration
- Optimized with RAPIDS + GPU backends.
- Clear CPU vs GPU speedups for large datasets.
- Open-source, designed for integration with scverse/rapids-singlecell.
📊 Key Insights
- Unified mitochondrial health scoring (MHI).
- Patient-level and cell-type–level insights.
- Supports biomarker discovery, disease progression prediction, and drug response stratification.
📂 Real Data (Bundled)
All datasets are pre-bundled under mitoomics_gpu/data/:
| File | Description |
|---|---|
data/scrna.h5ad |
Full scRNA-seq dataset (AnnData, with subject_id, cell_type, batch) |
data/scrna.mito.h5ad |
Mito-filtered scRNA-seq (pre-subsetted to mitochondrial genes) |
data/ev_human.csv |
EV/MDV proteomics from PRIDE (PXD018301) — columns: subject_id, protein, abundance |
data/mitocarta3_table.csv |
MitoCarta 3.0 pathway table (parsed from Human.MitoCarta3.0.xls) |
data/genesets_curated.csv |
Curated gene sets (fusion, fission, mitophagy, biogenesis) |
data/ev_whitelist.csv |
EV-specific protein whitelist for filtering |
🧬 Usage with Real Data
Standard CPU pipeline
python -m mitoomics_gpu \
--scrna mitoomics_gpu/data/scrna.h5ad \
--proteomics mitoomics_gpu/data/ev_human.csv \
--mitocarta-table mitoomics_gpu/data/mitocarta3_table.csv \
--ev-whitelist mitoomics_gpu/data/ev_whitelist.csv \
--outdir results/
GPU-accelerated pipeline (recommended for large datasets)
python -m mitoomics_gpu.gpu_cli \
--scrna mitoomics_gpu/data/scrna.h5ad \
--proteomics mitoomics_gpu/data/ev_human.csv \
--genesets_csv mitoomics_gpu/data/genesets_curated.csv \
--ev_whitelist mitoomics_gpu/data/ev_whitelist.csv \
--outdir results/ \
--do_umap
Using the mito-filtered scRNA (faster, recommended)
Replace scrna.h5ad with scrna.mito.h5ad in either command above to use the
pre-filtered mitochondrial gene subset, which significantly reduces memory and
compute time:
python -m mitoomics_gpu.gpu_cli \
--scrna mitoomics_gpu/data/scrna.mito.h5ad \
--proteomics mitoomics_gpu/data/ev_human.csv \
--genesets_csv mitoomics_gpu/data/genesets_curated.csv \
--ev_whitelist mitoomics_gpu/data/ev_whitelist.csv \
--outdir results/
Outputs written to results/:
results_summary.csv/results_summary_GPU.csv— subject-level MHI scoresembedding_pca_GPU.csv— PCA embedding (GPU run)embedding_umap_GPU.csv— UMAP embedding (if--do_umappassed)report.md+ figures — visual summary
🔮 Future Directions
- Add modalities: scATAC, metabolomics, spatial transcriptomics.
- Deploy web-server / pip package for biologist-friendly use.
- Clinical validation with partners & cohorts.
- ML upgrades for pattern discovery & prediction on MHI.
📬 Contact
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file mitoomics_gpu-0.2.0.tar.gz.
File metadata
- Download URL: mitoomics_gpu-0.2.0.tar.gz
- Upload date:
- Size: 55.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
158181bf255ca0a46ea9f833d2d39a4405a2fec57cf1b134776bc6be2b11fe5e
|
|
| MD5 |
01a1c0f9205ed104725080f6075297aa
|
|
| BLAKE2b-256 |
049ca480f8259c125acf94f0198d5355fc8dd8c0c3d5d274907f3fb6a44973a5
|
File details
Details for the file mitoomics_gpu-0.2.0-py3-none-any.whl.
File metadata
- Download URL: mitoomics_gpu-0.2.0-py3-none-any.whl
- Upload date:
- Size: 74.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
80776b7847088e982d486b87aa41d7dfc908c103dc7bcbaa2ea3ab4c54d339e4
|
|
| MD5 |
20dba5c857bcad6303ef31279e54105a
|
|
| BLAKE2b-256 |
cf79838d2780d6a5fd512f55fcdf1d361e53701a17681cac294e133766237aa6
|