Skip to main content

A cool project that does something awesome on diaPASEF data.

Project description

Beta-DIA

Beta-DIA is a partially open-source, free-to-use Python software that provides comprehensive peptide/protein identification and accurate quantification results for single-shot diaPASEF data.


Contents

Installation
Usage
Output


Installation

We recommend using Conda to create a Python environment for using Beta-DIA, whether on Windows or Linux.

  1. Create a Python environment with version 3.9.18.

    conda create -n beta_env python=3.9.18 "numpy<2.0.0"
    conda activate beta_env
    
  2. Install the corresponding PyTorch and CuPy packages based on your CUDA version (which can be checked using the nvidia-smi command). Beta-DIA will fail on computers without a GPU.

  • CUDA-12
    pip install torch==2.3.1 --index-url https://download.pytorch.org/whl/cu121
    pip install cupy-cuda12x
    conda install cudatoolkit
    
  • CUDA-11
    pip install torch==2.3.1 --index-url https://download.pytorch.org/whl/cu118
    pip install cupy-cuda11x
    conda install cudatoolkit
    
  1. Install Beta-DIA
    pip install beta_dia
    

Usage

beta_dia -lib "Absolute path of the spectral library" -ws "Absolute path of the .d folder or a folder containing multiple .d folders"

(Please note that the path needs to be enclosed in quotes.)

  • -lib
    This parameter is used to specify the absolute path of the spectral library. Beta-DIA currently supports the spectral library with the suffix .speclib predicted by DIA-NN (>=v1.9). It supports oxygen modifications on methionine (M) but does not include modifications such as phosphorylation or acetylation. Refer to this for instructions on how to generate prediction spectral libraries using DIA-NN. Beta-DIA will develop its own predictor capable of forecasting the peptide retention time, ion mobility, and fragmentation pattern. It may also be compatible with other formats of spectral libraries based on requests.

  • -ws
    This parameter is used to specify the .d folder or the folder containing multiple .d folders that need to be analyzed.

Other optional params are list below by entering beta_dia -h:

       ******************
       * Beta-DIA x.y.z *
       ******************
Usage: beta_dia [-h] -ws WS -lib LIB [-out_name OUT_NAME] [-gpu_id GPU_ID]

optional arguments:
  -h, --help          show this help message and exit
  -ws WS              specify the folder that is .d or contains .d files.
  -lib LIB            specify the absolute path of a .speclib spectra library.
  -out_name OUT_NAME  specify the folder name of outputs. Default: beta_dia.
  -gpu_id GPU_ID      specify the GPU-ID (e.g. 0, 1, 2) which will be used. Default: 0.

Output

Beta-DIA will generate beta_dia/report_beta.log.txt and beta_dia/report_beta.tsv in each .d folder. The report_beta.tsv contains precursor and protein IDs, as well as plenty of associated information. Most column names are consistent with DIA-NN and are self-explanatory.

  • Protein.Group - inferred proteins. Beta-DIA uses IDPicker algorithm to infer proteins.
  • Protein.Ids - all proteins matched to the precursor in the library.
  • Protein.Names names (UniProt names) of the proteins in the Protein.Group.
  • PG.Quantity quantity of the Protein.Group.
  • Precursor.Id peptide seq + precursor charge.
  • Precursor.Charge the charge of precursor.
  • Q.Value run-specific precursor q-value.
  • Protein.Q.Value run-specific q-value for the unique protein, that is protein identified with proteotypic (=specific to it) peptides.
  • PG.Q.Value run-specific q-value for the protein group.
  • Proteotypic indicates the peptide is specific to a protein.
  • Precursor.Quantity quantity of the precursor.
  • RT the retention time of the precursor.
  • IM the ion mobility of the precursor.
  • CScore the final precursor score calculated by Beta-DIA after merging all sub-scores.
  • CScore.PG the final protein group score calculated by Beta-DIA after merging multiple peptide scores.

Troubleshooting


Citing Beta-DIA

Check out: Beta-DIA: Beta-DIA: Integrating learning-based and function-based feature scores to optimize the proteome profiling of diaPASEF mass spectrometry data


Changelog

0.1.0

  • FEAT: first commit on GitHub.

0.1.1

  • FIX: loading .d even with missing or occupied frame.

0.1.2

  • FEAT: support the selection of a GPU to be used.

0.2.0

  • FEAT: polishing the prs with share fgs to optimize FDR control.

0.2.1

  • FIX: polishing should be followed by qvalue calculation again.

0.3.0

  • FEAT: refactor code to speed up

0.4.0

  • FEAT: use model_ys; select locus with x == 1 or deep == 1

0.5.0

  • FEAT: speed up load_ms by numba index; add deep_big cut before fast fdr

0.5.1

  • FIX: train mlp by data with good_big_cut of 1.5sigma; rank by deep_big

0.5.2

  • FEAT: add param start_id

0.5.3

  • FIX: calib m/z using np.float64; remove autocast; model_ys_fast is ok

0.5.4

  • FEAT: two methods (ascending or descending) to polish prs; remove left > center

0.6.0

  • FEAT: 5% method to polish dubilous prs, may less the ids

0.7.0

  • FIX: polish dubious prs considering two more confident prs
  • FEAT: support .parquet lib

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

beta_dia-0.7.0.tar.gz (9.0 MB view details)

Uploaded Source

Built Distribution

beta_dia-0.7.0-py3-none-any.whl (9.1 MB view details)

Uploaded Python 3

File details

Details for the file beta_dia-0.7.0.tar.gz.

File metadata

  • Download URL: beta_dia-0.7.0.tar.gz
  • Upload date:
  • Size: 9.0 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.9.18

File hashes

Hashes for beta_dia-0.7.0.tar.gz
Algorithm Hash digest
SHA256 4c56b0544946508f9d51f0914bd202742986a50ffaa5da605fccf92ca25cb088
MD5 a66f930acfc7d9b369f447c7dd2c9406
BLAKE2b-256 b271921e4d9c21cfa032150090135ed0bcb3c62c7b7372f741072bcf348134b6

See more details on using hashes here.

File details

Details for the file beta_dia-0.7.0-py3-none-any.whl.

File metadata

  • Download URL: beta_dia-0.7.0-py3-none-any.whl
  • Upload date:
  • Size: 9.1 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.9.18

File hashes

Hashes for beta_dia-0.7.0-py3-none-any.whl
Algorithm Hash digest
SHA256 a3baf16baf053dca8791de03d32eb4085b451a0e14c8e223f9b9a25da47d6fb6
MD5 b5ae256e05f88b4355b0b3a5c49b279a
BLAKE2b-256 0cbd2a804c43cd041e87d7e8ecd83ac68267f6b935d4ab69a634ceaf4d2469b7

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page