Skip to main content

A feature detection LC-MS1 spectra.

Project description

biosaur2 - A feature detection LC-MS1 spectra. This project is a rewriten version of Biosaur software (https://github.com/abdrakhimov1/Biosaur).

The centroided mzML file is required for of the script.

Algorithm can be run with following command:

biosaur2 path_to_MZML

The script output contains tsv table with peptide features.

All available arguments can be shown with command "biosaur2 -h".

The default parameter minlh (the minimal number of consecutive scans for peptide feature) is 1 and this value is optimimal for ultra-short LC gradients (a few minutes). For the longer LC gradients, this value can be increased for reducing of feature detection time and removing noise isotopic clusters.

For TOF data please add "-tof" argument.

For PASEF data please convert mzML file using msconvert and '--combineIonMobilitySpectra --filter "msLevel 1" ' options. Do not use option --filter "scanSumming"! The latter is often required for MS/MS data analysis but breaks MS1 feature detection.

For negative mode data please add "-nm" argument.

Citing biosaur2

Abdrakhimov, et al. Biosaur: An open-source Python software for liquid chromatography-mass spectrometry peptide feature detection with ion mobility support. https://doi.org/10.1002/rcm.9045

Installation

Using the pip:

pip install biosaur2

Available parameters

-minlh: Minimum number of MS1 scans for peaks extracted from the mzML file. Optimal usually is in 1-3 range for 5-15 min LC gradients and 5-10 for 60-180 min gradients. Default = 2

-mini : Minimal intensity threshold for peaks extracted from the mzML file. Default = 1

-minmz : Minimal m/z value for peaks extracted from the mzML file. Default = 350

-maxmz : Maximal m/z value for peaks extracted from the mzML file. Default = 1500

-htol : Mass accuracy in ppm to combine peaks into hills between scans. Default = 8 ppm

-itol : Mass accuracy in ppm for isotopic hills. Default = 8 ppm

-ignore_iso_calib : Turn off accurate isotope error estimation if added as the parameter. Input "itol" value will be used instead of gaussian fitting of mass errors and systematic shifts for every isotope number.

-o : Path to output feature files. Default is the name of the input mzML file with added “.features.tsv” mask stored in the folder of the original mzML file

-hvf: Threshold to split hills into multiple if local minimum intensity multiplied by hvf is less than both surrounding local maximums. All peaks after splitting must have at least max(2, minlh) MS1 scans. Default = 1.3

-ivf: Threshold to split isotope pattern into multiple features if local minimum intensity multiplied by ivf is less right local maximum. Local minimum position should be higher than max(4rd isotope, isotope position with maximum intensity according to averagine model). Default = 5.0

-nm : Negative mode. 1-true, 0-false. Affect only neutral mass column calculated in the output features table. Default = 0

-cmin: Minimum allowed charge for isotopic clusters. Default = 1

-cmax: Maximal allowed charge for isotopic clusters. Default = 6

-nprocs: Number of processes used by biosau2. Automatically set to 1 for Windows system due to multiprocessing issues. Default = 4

-write_hills: Add hills output if added as the parameter

-paseminlh: For TIMS-TOF data. Minimum number of ion mobility values for m/z peaks to be kept in the analysis. Default = 1

-paseftol: For TIMS-TOF data. Ion mobility tolerance used to combine close peaks into a single one. Default = 0.05

-pasefmini: For TIMS-TOF data. Minimal intensity threshold for peaks after combining peaks with close m/z (itol option) and ion mobility (paseftol option) values. Default = 100

-tof: Experimental. If added as the parameter, biosaur2 estimates noise intensity distribution across m/z range and automatically calculates intensity cutoffs for different m/z value ranges. This is an alternative way to reduce noise to the "-mini" option which is a fixed intensity threshold for all m/z values. Can be usefull for TOF data

Links

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

biosaur2-0.2.23.tar.gz (195.1 kB view details)

Uploaded Source

Built Distributions

biosaur2-0.2.23-cp311-cp311-win_amd64.whl (134.7 kB view details)

Uploaded CPython 3.11 Windows x86-64

biosaur2-0.2.23-cp311-cp311-musllinux_1_2_x86_64.whl (1.1 MB view details)

Uploaded CPython 3.11 musllinux: musl 1.2+ x86-64

biosaur2-0.2.23-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (979.7 kB view details)

Uploaded CPython 3.11 manylinux: glibc 2.17+ x86-64

biosaur2-0.2.23-cp310-cp310-win_amd64.whl (134.6 kB view details)

Uploaded CPython 3.10 Windows x86-64

biosaur2-0.2.23-cp310-cp310-musllinux_1_2_x86_64.whl (998.7 kB view details)

Uploaded CPython 3.10 musllinux: musl 1.2+ x86-64

biosaur2-0.2.23-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (880.7 kB view details)

Uploaded CPython 3.10 manylinux: glibc 2.17+ x86-64

biosaur2-0.2.23-cp39-cp39-win_amd64.whl (134.6 kB view details)

Uploaded CPython 3.9 Windows x86-64

biosaur2-0.2.23-cp39-cp39-musllinux_1_2_x86_64.whl (996.1 kB view details)

Uploaded CPython 3.9 musllinux: musl 1.2+ x86-64

biosaur2-0.2.23-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (881.4 kB view details)

Uploaded CPython 3.9 manylinux: glibc 2.17+ x86-64

File details

Details for the file biosaur2-0.2.23.tar.gz.

File metadata

  • Download URL: biosaur2-0.2.23.tar.gz
  • Upload date:
  • Size: 195.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.12.3

File hashes

Hashes for biosaur2-0.2.23.tar.gz
Algorithm Hash digest
SHA256 fbd8f9d40e669f998f56c7fb7d43b2c496faa308fc048fa87d182268f3f3c512
MD5 c80ed50d569f8ec5ebe20349c97fa0ee
BLAKE2b-256 a8b7a4a1a4203911ad92c851796d99c8dc2e5fb57821fa6baa5e6932e16e01e3

See more details on using hashes here.

File details

Details for the file biosaur2-0.2.23-cp311-cp311-win_amd64.whl.

File metadata

File hashes

Hashes for biosaur2-0.2.23-cp311-cp311-win_amd64.whl
Algorithm Hash digest
SHA256 dc67423124751a4856e27235cacd8a4a45f5999344ba7ee2db0bb043984e828a
MD5 bfae08c7c8e32fb0fe95255b7ee7cd24
BLAKE2b-256 bcdddaf6f14c97fb656479fa7aadd69a495418e3cdd48cffb391338fbbd0ba78

See more details on using hashes here.

File details

Details for the file biosaur2-0.2.23-cp311-cp311-musllinux_1_2_x86_64.whl.

File metadata

File hashes

Hashes for biosaur2-0.2.23-cp311-cp311-musllinux_1_2_x86_64.whl
Algorithm Hash digest
SHA256 dc9ba40d774b90f963ffad681a0d4b26ac93d568cd5a80bfd20aa08eb879cf55
MD5 62e6b6d1dd749165b2dcd3849843aeb1
BLAKE2b-256 9c2f9fdf2a324a911ad86a240b5e06afbe9093fa71281510a72e9882677ebdae

See more details on using hashes here.

File details

Details for the file biosaur2-0.2.23-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for biosaur2-0.2.23-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 728b3b75cee6fa0f002a3309cf7a75f7bd1c3e44454ee983227e049e455210cb
MD5 d81eb2422de663a4cffcef3e1c90e269
BLAKE2b-256 3eefc0f009da1205457abb9880f5994044977b1524dfabcb6acc491ad9f40d60

See more details on using hashes here.

File details

Details for the file biosaur2-0.2.23-cp310-cp310-win_amd64.whl.

File metadata

File hashes

Hashes for biosaur2-0.2.23-cp310-cp310-win_amd64.whl
Algorithm Hash digest
SHA256 6becd68c9df1922184482175bf23ad6559e01e2a0a6a2687358296d3b9504b49
MD5 a34e23abfcbda451d9004bd5a932f3c4
BLAKE2b-256 72d1a8f4ab8eac58a80eca09a2dc242d8102cd229b2985fd16793d40447fcbcf

See more details on using hashes here.

File details

Details for the file biosaur2-0.2.23-cp310-cp310-musllinux_1_2_x86_64.whl.

File metadata

File hashes

Hashes for biosaur2-0.2.23-cp310-cp310-musllinux_1_2_x86_64.whl
Algorithm Hash digest
SHA256 07e2e03d0f99ac1916b758ede21fc2c9255a9eadd94e551c59842f0547407bdd
MD5 45c421679c96667c9ff0ee1c4a8e2ea3
BLAKE2b-256 47d662f0d2a2f3b9f49550314d538f0c12d96c0351e4403886ff071fba7ce77c

See more details on using hashes here.

File details

Details for the file biosaur2-0.2.23-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for biosaur2-0.2.23-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 2d083b1bccb0feababad3a5cb256e26a118615a3de8a276bab265d843802c2b5
MD5 383f429c0ab357ff8b5f1dae5b64224c
BLAKE2b-256 34b43eacdbd8b7648b4bef8ce8f38c0afc7241a1b5ba6a75a9f3d55437f8854b

See more details on using hashes here.

File details

Details for the file biosaur2-0.2.23-cp39-cp39-win_amd64.whl.

File metadata

  • Download URL: biosaur2-0.2.23-cp39-cp39-win_amd64.whl
  • Upload date:
  • Size: 134.6 kB
  • Tags: CPython 3.9, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.12.3

File hashes

Hashes for biosaur2-0.2.23-cp39-cp39-win_amd64.whl
Algorithm Hash digest
SHA256 5d3644c8eabdee4c986e263868f324b7b3bca63f9e50add49ebeda7d9ad1ce7b
MD5 9f96a7a2ed54bd4ff4f694416c54d8e0
BLAKE2b-256 00b6b480cc26fa564ff4b8c9d166261cc06167746220c257ffdbcbbc216ee0b4

See more details on using hashes here.

File details

Details for the file biosaur2-0.2.23-cp39-cp39-musllinux_1_2_x86_64.whl.

File metadata

File hashes

Hashes for biosaur2-0.2.23-cp39-cp39-musllinux_1_2_x86_64.whl
Algorithm Hash digest
SHA256 c7b8226f63c0285176e22847162cc99ae99b806e3650d3e833f73acaef5a1d14
MD5 f6c1548a06dd384ac26599f2d9628bdf
BLAKE2b-256 135c9d4d975fbc95b494c0b9c0fba245453ef102d9abc62b911501edca47cbf2

See more details on using hashes here.

File details

Details for the file biosaur2-0.2.23-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for biosaur2-0.2.23-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 1e06d689a9e24e4ea62840c88dbd99b19e5ef54bbd0d906a705b1103e6b067fa
MD5 c4a6190f0663c8b2cbf0bd6d760bb29b
BLAKE2b-256 1e4f0509cac644e335b1566ba7619e8491949487bd2636dd80e53f3d147dc2e9

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page