Skip to main content

Post processing and analysis of quantitative proteomics data

Project description

MsReport

Project Status: WIP – Initial development is in progress, but there has not yet been a stable, usable release suitable for the public. Python Version from PEP 621 TOML Run tests

MsReport is a Python library for post-processing quantitative proteomics data from bottom-up mass spectrometry experiments.

Table of Contents

What is MsReport?

MsReport is a Python library designed to simplify the post-processing and analysis of quantitative proteomics data from bottom-up mass spectrometry experiments. It provides a high-level, abstraction-focused API for efficient and standardized workflows. The modular design of the library provides the flexibility to meet project specific data processing needs and customize workflows as required.

The library supports importing protein and peptide-level quantification results from MaxQuant, FragPipe, and Spectronaut, as well as post-translational modification (PTM) data from MaxQuant and FragPipe. MsReport provides tools for data annotation, normalization and transformation, statistical testing, and data visualization.

Key features of MsReport

Data Import and Standardization

The reader module provides software-specific reader classes for importing data from MaxQuant, FragPipe, and Spectronaut that enable the import of protein, peptide and ion tables. During the import process, these classes transform tables column names and table values into a standardized format to ensure that the rest of the library can operate in a tool-agnostic manner.

Data management

The qtable module provides a structured approach to managing quantitative data through its central Qtable class. This class combines quantitative data with an experimental design table that defines the relationship between samples and experimental conditions. The quantitative data is stored in a wide format, where each sample's measurements are stored in separate columns. The Qtable class serves as the foundation for data analysis workflows in MsReport, providing the standardized data structure used by the analyze, plot, and export modules.

Data processing and analysis

The analyze module provides tools for post-processing of mass spectrometry data generated by software such as MaxQuant, FragPipe, or Spectronaut. It includes functions for filtering, normalization, imputation of missing values, and statistical testing. The library integrates with the R package LIMMA to enable differential expression analysis.

[!NOTE]
In order to use the R integration you need to install msreport with optional dependencies, see Optional Dependencies for more information.

Data visualization

The plot module supports the generation of visualizations for quality control and data analysis. It includes functions for creating various plots, such as intensity and ratio distributions, heatmaps, volcano plots, and PCA plots.

Data export

Finally, the export module enables the conversion and export into formats compatible with external tools. This includes generating input files for Amica and exporting tables for easier integration with Perseus.

Installation

If you do not already have a Python installation, we recommend installing the Anaconda distribution or Miniconda distribution from Continuum Analytics, which already contains a large number of popular Python packages for Data Science. Alternatively, you can also get Python from the Python homepage. Note that MsReport requires Python version 3.10 or higher.

The following command will install MsReport and its dependencies by using a wheel file.

pip install msreport

To uninstall the MsReport library use:

pip uninstall msreport

Installation when using Anaconda

To install the MsReport library using Anaconda, you need to either activate a custom conda environment or install it into the default base environment. Open the Anaconda Navigator, activate the desired conda environment or use the base environment, and then open a command line by running the "CMD.exe" application. Finally, use the pip install command as before.

Optional Dependencies

R Integration

MsReport provides an interface to the R package LIMMA for differential expression analysis. To use this functionality, you need:

  • A local installation of R (version 4.0 or higher).
  • The system environment variable R_HOME set to the R home directory.
  • To install msreport with the optional dependencies for R integration.
pip install msreport[R]

Setting the R_HOME environment variable

On Windows, you may need to restart your computer after modifying the system environment variables for the changes to take effect. To find the R home directory, you can run the following command in R:

normalizePath(R.home("home"))

For example, the R home directory might look like this on Windows: C:\Program Files\R\R-4.2.1

Development status

MsReport is a stable and reliable library that has been used on a daily basis for over two years in the Mass Spectrometry Facility at the Max Perutz Labs and the Mass Spectrometry Facility of IMP/IMBA/GMI. While the current interface of MsReport is stable, the library is still under active development, with new features being added regularly. Please note that a major rewrite is planned, which may introduce changes to the API in the future.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

msreport-0.0.29.tar.gz (103.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

msreport-0.0.29-py3-none-any.whl (100.8 kB view details)

Uploaded Python 3

File details

Details for the file msreport-0.0.29.tar.gz.

File metadata

  • Download URL: msreport-0.0.29.tar.gz
  • Upload date:
  • Size: 103.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.6.17

File hashes

Hashes for msreport-0.0.29.tar.gz
Algorithm Hash digest
SHA256 7d548d7918c40d7b838da85c553eb92678c9ec0d940a34b94008e9f3daffa963
MD5 6230e9e2e883f974bf99cf16aadb2405
BLAKE2b-256 8f2ff808cd3290ec5450dd0e95f76574ef5d7562e57058039e91072f1f2793ea

See more details on using hashes here.

File details

Details for the file msreport-0.0.29-py3-none-any.whl.

File metadata

  • Download URL: msreport-0.0.29-py3-none-any.whl
  • Upload date:
  • Size: 100.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.6.17

File hashes

Hashes for msreport-0.0.29-py3-none-any.whl
Algorithm Hash digest
SHA256 e7c1c30a8ef56fc3955f3e02dd0a1a9fda3fdddb85dd7a158dd71730e1ad3e49
MD5 0dc952c1d17cf7ab9fdf59fe9ee9869d
BLAKE2b-256 ffb706b867c6f70827a363a4ad7dacc3d6c6f6bdb970402b15139e9c3d18fa52

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page