Skip to main content

A Python library that presents a standardized dataset-based algorithm designed to reduce variation in large-scale data-independent acquisition (DIA) mass spectrometry data.

Project description



STAVER: A Standardized Dataset-Based Algorithm for Efficient Variation Reduction

Table of Contents

Introduction

STAVER is Python library that presents a standardized dataset-based algorithm designed to reduce variation in large-scale data-independent acquisition (DIA) mass spectrometry data. By employing a reference dataset to standardize mass spectrometry signals, STAVER effectively reduces noise and enhances protein quantification accuracy, especially in the context of multi-library search. The effectiveness of STAVER is demonstrated in several large-scale DIA datasets, showing improved identification and quantification of thousands of proteins. STAVER, featuring a modular design, provides flexible compatibility with existing DIA MS data analysis pipelines. The project aims to promote the adoption of multi-library search and improve the quality of DIA proteomics data through the open-source STAVER software package. A comprehensive overview of the research workflow and STAVER algorithm architecture are summarized in the following figure: alt text

Installation

You can install staver package from PyPI by calling the following command:

pip install staver

You may install from source by cloning the STAVER repo, navigating to the root directory and using one of the following commands pip install ., or pip install -e . to install in editable mode:

# clone the source repo
git clone https://github.com/Ran485/STAVER.git

# install the package in editable mode
pip install .

# or using the following command
pip install -e .

You may install additional environmental dependencies:

pip install -r requirements_dev.txt
pip install -r requirements.txt

Getting Started

For example code and an introduction to the library, see the Jupyter notebooks in tutorials, and the guided walkthrough here. A straightforward command-line demonstration for a quick start can be discovered in the following block.

python  ./staver_pipeline.py \
        --thread_numbers < The CPU worker numbers, Default to [nmax-2] > \
        --input < The DIA data input directory > \
        --output_peptide < The processed DIA peptide data output directory > \
        --output_protein < The processed DIA protein data output directory > \
        --count_cutoff_same_libs < Default to 1 > \
        --count_cutoff_diff_libs < Default to 2 > \
        --proteins_cv_thresh < Default to 0.3 > \
        --na_threshold < Default to 0.3 > \
        --top_precursor_ions < Default to 3 > \
        --file_suffix < Default to "_F1_R1" >  \

Run the test-data in the following block

python  ./staver/staver_pipeline.py \
        --thread_numbers 16 \
        --input ./staver/data/likai-diann-raw-20/ \
        --reference_dataset_path ./data/likai-diann-raw \
        --output_peptide ./staver/results/peptides/ \
        --output_protein ./staver/results/proteins/ \
        --count_cutoff_same_libs 1 \
        --count_cutoff_diff_libs 2 \
        --peptides_cv_thresh 0.3 \
        --proteins_cv_thresh 0.3 \
        --na_threshold 0.3 \
        --top_precursor_ions 5 \
        --file_suffix _F1_R1 \

Documentation

To gain a comprehensive understanding of STAVER's application and to thoroughly appreciate the function and purpose of each parameter, we highly recommend perusing the all-encompassing STAVER documentation. This resource provides detailed, step-by-step instructions, accompanied by illustrative examples and clear explanations, equipping users with the knowledge to skillfully navigate and exploit the software's complete potential.

How to Contribute

We welcome the contribution from the open-source community to improve the library!

To add a new explanation method/feature into the library, please follow the template and steps demonstrated in this documentation.

Contact Us

If you have any questions, comments or suggestions, please do not hesitate to contact us at 21112030023@m.fudan.edu.cn

License

The STAVER project licensed under the MIT License, granting users open access and the freedom to employ, adapt, and share the software as needed, while preserving the original copyright and license acknowledgements.

======= History

0.1.0 (2023-03-25)

  • First release on PyPI.

Credits

This package was created with Cookiecutter_ and the audreyr/cookiecutter-pypackage_ project template.

.. _Cookiecutter: https://github.com/audreyr/cookiecutter .. _audreyr/cookiecutter-pypackage: https://github.com/audreyr/cookiecutter-pypackage

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dia-staver-0.1.0.tar.gz (974.5 kB view details)

Uploaded Source

File details

Details for the file dia-staver-0.1.0.tar.gz.

File metadata

  • Download URL: dia-staver-0.1.0.tar.gz
  • Upload date:
  • Size: 974.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.12

File hashes

Hashes for dia-staver-0.1.0.tar.gz
Algorithm Hash digest
SHA256 bed8810d913a6e2562f605567b42cee8f37c2da980f3350b23c8ea5fc815be3e
MD5 ba863c23a96786d3883d4c91043a385d
BLAKE2b-256 729bca8a3a90b3cdef3074385e6c37bed5b36dca7abd1f675949323857123676

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page