Skip to main content

A proteomics search engine for LC-MS1 spectra.

Project description

ms1searchpy - a DirectMS1 proteomics search engine for LC-MS1 spectra

The .tsv (or mzML) and .fasta files are required for basic operation of the script. tsv file is tab-separated text file with peptide features generated by Dinosaur software (J.Teleman et al., “Dinosaur: A Refined Open-Source Peptide MS Feature Detector”, JPR 2016) or Biosaur (https://github.com/abdrakhimov1/Biosaur) from mzML file. This file can be generated by any other software for peak-picking and must contain columns ‘massCalib’, ‘rtApex’, ‘charge’ and ‘nIsotopes’ columns. For a сonvenient usage, mzML files can be used directly and the script will run an attached version of Dinosaur (installed Java is required). For an efficient usage of retention time, user can install and use ELUDE prediction algorithm (-elude path_to_elude_binary should be used in parameters). For the most efficient usage of retention time, user can install and use DeepLC prediction algorithm (-deeplc path_to_deeplc_binary should be used in parameters).

Algorithm can be run with following command:

ms1searchpy path_to_MZML -d path_to_fasta

OR

ms1searchpy path_to_peptideFeatures -d path_to_fasta

The script output contains files: all identified proteins (filename_proteins_full.csv), filtered proteins (filename_proteins.csv), all matched peptide match fingerprints (filename_PFMs.csv), all matched peptide match fingerprints with features prepared for Machnine Learning (filename_PFMs_ML.csv) and log file with estimated mass and RT accuracies (filename_log.txt).

Citing ms1searchpy

Ivanov et al. DirectMS1: MS/MS-free identification of 1000 proteins of cellular proteomes in 5 minutes. https://doi.org/10.1021/acs.analchem.9b05095

Installation

Using the pip:

pip install ms1searchpy

Dependencies

  • pyteomics

  • numpy

  • scipy

  • sklearn

  • lightgbm

  • pandas

  • biosaur

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ms1searchpy-2.0.5.tar.gz (13.6 MB view details)

Uploaded Source

Built Distribution

ms1searchpy-2.0.5-py3-none-any.whl (13.7 MB view details)

Uploaded Python 3

File details

Details for the file ms1searchpy-2.0.5.tar.gz.

File metadata

  • Download URL: ms1searchpy-2.0.5.tar.gz
  • Upload date:
  • Size: 13.6 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/50.3.0 requests-toolbelt/0.9.1 tqdm/4.49.0 CPython/3.8.5

File hashes

Hashes for ms1searchpy-2.0.5.tar.gz
Algorithm Hash digest
SHA256 6054e76d93bcdcf09f22321a5cbb3a727376fbae09604a64480357d0d8aea57f
MD5 5fe940922c14856b64d8ff84060e48be
BLAKE2b-256 2f3853d69ab3ce01699e3433b9434e35108fe495cc358913226c9e5979a80e25

See more details on using hashes here.

File details

Details for the file ms1searchpy-2.0.5-py3-none-any.whl.

File metadata

  • Download URL: ms1searchpy-2.0.5-py3-none-any.whl
  • Upload date:
  • Size: 13.7 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/50.3.0 requests-toolbelt/0.9.1 tqdm/4.49.0 CPython/3.8.5

File hashes

Hashes for ms1searchpy-2.0.5-py3-none-any.whl
Algorithm Hash digest
SHA256 e163bbfdb823f7ac1fa2aabc8f65ecdd0d31e271733bd6aec450b9f7d36e9701
MD5 bad6ee0d3963a26a66abbfc2a217196a
BLAKE2b-256 f869a82c39d08d80dd4a90831e09524c41184711722204bfefb52e5f1ed4f1ed

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page