BERTIS LC-MS/MS analysis library through MuData
Project description
Python toolkit for modular and traceable LC-MS/MS proteomics analysis based on MuData
Overview
msmu is an open-source Python package for modular and traceable post-DB search preprocessing and statistical analysis of bottom-up proteomics data.
It supports modules for every step of end-to-end processing—from search output parsing through hierarchical summarization, normalization, batch correction, statistical analysis, and visualization—implemented with commonly used analytical and statistical methods.
Central to msmu is the highly versatile and standardized MuData (and AnnData) as a unifying, provenance-aware data container for organizing and storing annotations and representations of multi-dimensional MS data and processing history.
This unique marriage between flexible processing pipeline and MuData empowers FAIR principle-aligned downstream analysis for biomarker discovery and systems biology.
Key Features
- Flexible data ingestion from Sage, DIA-NN, and other popular DB search tools
- MuData/AnnData-compatible object structure for organizing multi-level MS data
- Protein inference: infer protein groups from peptide evidence using parsimony rule
- Normalization: median centering, quantile normalization, etc.
- Batch correction for discrete and continuous variations
- Built-in QC: identification count, peptide length, charge, missed cleavage, intensity distribution, etc.
- Statistical analysis: differential expression analysis, dimensionality reduction
- PTM data support and stoichiometry adjustment with matched global dataset (if available)
- Visualization: PCA, UMAP, volcano plots, heatmaps, QC metrics
Supporting DB Search Tools
- Sage: https://sage-docs.vercel.app
- DIA-NN: https://github.com/vdemichev/DIA-NN
- MaxQuant: https://www.maxquant.org/
- FragPipe: https://fragpipe.nesvilab.org/
- and more upcoming.
Documentation
Comprehensive documentation, including installation instructions, tutorials, and API references, is available at: https://bertis-informatics.github.io/msmu/
Citation
If you use msmu in your research, please cite the following publication (preprint):
msmu: a Python toolkit for modular and traceable LC-MS proteomics data analysis based on MuData
Hyung-Wook Choi, Byeongchan Lee, Un-Beom Kang, Sunghyun Huh
bioRxiv 2026.01.07.698308; doi: 10.64898/2026.01.07.698308
License
BSD 3-Clause License. See LICENSE for details.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file msmu-0.2.8.tar.gz.
File metadata
- Download URL: msmu-0.2.8.tar.gz
- Upload date:
- Size: 17.8 MB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
818f582daa1c41c74365a8ad344de7b1168b02c7f45c3fff21d3f7d0a5f51d25
|
|
| MD5 |
15083a2fdf7a6f04b23693a36142e90e
|
|
| BLAKE2b-256 |
2fd7d532f13022cefd5cd7b94fbefa35c493ebccdf461d8a020f77418be248bb
|
Provenance
The following attestation bundles were made for msmu-0.2.8.tar.gz:
Publisher:
publish.yml on bertis-informatics/msmu
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
msmu-0.2.8.tar.gz -
Subject digest:
818f582daa1c41c74365a8ad344de7b1168b02c7f45c3fff21d3f7d0a5f51d25 - Sigstore transparency entry: 942804108
- Sigstore integration time:
-
Permalink:
bertis-informatics/msmu@3181b414a31a11a0528e23bda8b23c35ab75f3be -
Branch / Tag:
refs/tags/0.2.8 - Owner: https://github.com/bertis-informatics
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@3181b414a31a11a0528e23bda8b23c35ab75f3be -
Trigger Event:
push
-
Statement type:
File details
Details for the file msmu-0.2.8-py3-none-any.whl.
File metadata
- Download URL: msmu-0.2.8-py3-none-any.whl
- Upload date:
- Size: 17.6 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
4b5f794ec818f2ea83ae3a1beabadf90d476abb86ad0b230681d9732a1df8647
|
|
| MD5 |
4af6c6718f088f8dc327b2bd83308aa9
|
|
| BLAKE2b-256 |
a8874d088d0347e846788132195d554e48326e04bba6e32830600212050aeb1c
|
Provenance
The following attestation bundles were made for msmu-0.2.8-py3-none-any.whl:
Publisher:
publish.yml on bertis-informatics/msmu
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
msmu-0.2.8-py3-none-any.whl -
Subject digest:
4b5f794ec818f2ea83ae3a1beabadf90d476abb86ad0b230681d9732a1df8647 - Sigstore transparency entry: 942804128
- Sigstore integration time:
-
Permalink:
bertis-informatics/msmu@3181b414a31a11a0528e23bda8b23c35ab75f3be -
Branch / Tag:
refs/tags/0.2.8 - Owner: https://github.com/bertis-informatics
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@3181b414a31a11a0528e23bda8b23c35ab75f3be -
Trigger Event:
push
-
Statement type: