A proteomics search engine for LC-MS1 spectra.
DirectMS1 - a proteomics search engine for LC-MS1 spectra
A .tsv (or mzML) file and a .fasta file are required for basic operation of the script. The .tsv file is a tab-separated text file with peptide features generated from an mzML file by Dinosaur (J. Teleman et al., “Dinosaur: A Refined Open-Source Peptide MS Feature Detector”, JPR 2016) or Biosaur (https://github.com/abdrakhimov1/Biosaur). The file can also be generated by any other peak-picking software, provided it contains the columns ‘massCalib’, ‘rtApex’, ‘charge’ and ‘nIsotopes’. For convenience, mzML files can be used directly; the script then runs a bundled version of Dinosaur (an installed Java is required). To make use of retention time, install the ELUDE prediction algorithm and pass -elude path_to_elude_binary in the parameters. For the most efficient use of retention time, install the DeepLC prediction algorithm and pass -deeplc path_to_deeplc_binary in the parameters.
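A feature table produced by third-party peak-picking software can be checked for the four required columns before it is passed to the search. The following is a minimal sketch using only the Python standard library; the helper name missing_feature_columns and the sample data are illustrative and not part of ms1searchpy.

```python
import csv
import io

# Column names taken from the description above.
REQUIRED_COLUMNS = ("massCalib", "rtApex", "charge", "nIsotopes")


def missing_feature_columns(handle):
    """Return the required columns that are absent from a tab-separated table.

    `handle` is any open text file or file-like object; only the header
    row is read.
    """
    reader = csv.reader(handle, delimiter="\t")
    header = next(reader, [])  # empty file -> empty header
    present = {name.strip() for name in header}
    return [name for name in REQUIRED_COLUMNS if name not in present]


# Made-up example table that lacks the 'nIsotopes' column.
sample = io.StringIO(
    "massCalib\trtApex\tcharge\tintensity\n"
    "1000.5\t12.3\t2\t1e6\n"
)
print(missing_feature_columns(sample))  # ['nIsotopes']
```

For a real file, open it with open(path, newline="") and pass the handle to the same function.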
The algorithm can be run with the following command:
ms1searchpy path_to_MZML -d path_to_fasta
OR
ms1searchpy path_to_peptideFeatures -d path_to_fasta
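When the search is launched from a Python script, the command line can be assembled as a list of arguments. The sketch below uses only the options named in this description (-d, -elude, -deeplc); the function name build_command and all paths are hypothetical placeholders.

```python
import shlex


def build_command(input_path, fasta_path, elude=None, deeplc=None):
    """Assemble an ms1searchpy argument list from the documented options."""
    cmd = ["ms1searchpy", input_path, "-d", fasta_path]
    if elude:
        cmd += ["-elude", elude]    # path to the ELUDE binary
    if deeplc:
        cmd += ["-deeplc", deeplc]  # path to the DeepLC binary
    return cmd


# Placeholder paths, for illustration only.
cmd = build_command("sample.mzML", "db.fasta", deeplc="/path/to/deeplc")
print(shlex.join(cmd))
# To actually run the search: subprocess.run(cmd, check=True)
```

Passing the list to subprocess.run avoids shell quoting problems with paths that contain spaces.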
The script writes the following output files: all identified proteins (filename_proteins_full.csv), filtered proteins (filename_proteins.csv), all matched peptide match fingerprints (filename_PFMs.csv), all matched peptide match fingerprints with features prepared for Machine Learning (filename_PFMs_ML.csv) and a log file with estimated mass and RT accuracies (filename_log.txt).
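A downstream script may want to know which files to expect after a run. The sketch below builds the five names from the suffixes listed above. It assumes that "filename" is the input file name without its last extension and that outputs are written next to the input; neither detail is stated here, so verify both against an actual run.

```python
from pathlib import Path

# Suffixes taken from the output description above.
OUTPUT_SUFFIXES = (
    "_proteins_full.csv",
    "_proteins.csv",
    "_PFMs.csv",
    "_PFMs_ML.csv",
    "_log.txt",
)


def expected_outputs(input_path):
    """List the output paths expected for one input file.

    Assumption: the base name is the input name minus its last extension,
    and the outputs sit in the same directory as the input.
    """
    path = Path(input_path)
    return [path.with_name(path.stem + suffix) for suffix in OUTPUT_SUFFIXES]


for out in expected_outputs("sample.mzML"):
    print(out.name)
```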
Citing ms1searchpy
Ivanov et al. DirectMS1: MS/MS-free identification of 1000 proteins of cellular proteomes in 5 minutes. https://doi.org/10.1021/acs.analchem.9b05095
Installation
Using pip:
pip install ms1searchpy
Dependencies
pyteomics
numpy
scipy
scikit-learn (imported as sklearn)
lightgbm
pandas
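pip normally installs these dependencies automatically. If an environment needs to be checked by hand, the following sketch reports which of the listed modules cannot be found; it uses the import names from the list above, and the helper name missing_modules is illustrative.

```python
from importlib.util import find_spec

# Import names of the dependencies listed above.
DEPENDENCIES = ("pyteomics", "numpy", "scipy", "sklearn", "lightgbm", "pandas")


def missing_modules(names):
    """Return the top-level module names that cannot be located."""
    return [name for name in names if find_spec(name) is None]


missing = missing_modules(DEPENDENCIES)
if missing:
    print("Missing modules:", ", ".join(missing))
else:
    print("All dependencies found.")
```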
Links
GitHub repo & issue tracker: https://github.com/markmipt/ms1searchpy
Mailing list: markmipt@gmail.com
Dinosaur repo: https://github.com/fickludd/dinosaur