Feature detection for LC-MS1 spectra.

Project description

biosaur2 - a feature detection tool for LC-MS1 spectra. This project is a rewritten version of the Biosaur software (https://github.com/abdrakhimov1/Biosaur).

A centroided mzML file is required as input for the script.

The algorithm can be run with the following command:

biosaur2 path_to_MZML

The script output is a TSV table with peptide features.
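The output table can be loaded with any TSV reader. Below is a minimal sketch using only the Python standard library; read_features is a hypothetical helper, not part of biosaur2, and no particular column names are assumed because they are not listed here.

```python
import csv


def read_features(tsv_path):
    """Read a tab-separated feature table into a list of dicts.

    Hypothetical helper: keys are whatever column names the header row holds.
    """
    with open(tsv_path, newline="") as handle:
        return list(csv.DictReader(handle, delimiter="\t"))
```

All values are returned as strings, so numeric columns need an explicit conversion (for example with float) before any arithmetic.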

All available arguments can be shown with command "biosaur2 -h".

The default value of the minlh parameter (the minimal number of consecutive scans for a peptide feature) is 1, and this value is optimal for ultra-short LC gradients (a few minutes). For longer LC gradients, this value can be increased to reduce feature detection time and remove noise isotopic clusters.

For TOF data please add "-tof" argument.

For PASEF data please convert the data to mzML using msconvert with the '--combineIonMobilitySpectra --filter "msLevel 1"' options. Do not use the --filter "scanSumming" option! It is often required for MS/MS data analysis but breaks MS1 feature detection.

For negative mode data please add "-nm" argument.
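For scripted or batch runs, the command line can be assembled and launched from Python. The sketch below is only an illustration: build_biosaur2_command and run_biosaur2 are hypothetical helpers, not part of biosaur2, and they use only options mentioned above (-minlh and -tof).

```python
import subprocess
from pathlib import Path


def build_biosaur2_command(mzml_path, minlh=None, tof=False):
    """Return an argv list for the biosaur2 CLI (hypothetical helper)."""
    cmd = ["biosaur2", str(Path(mzml_path))]
    if minlh is not None:
        # Minimal number of consecutive scans for a peptide feature.
        cmd += ["-minlh", str(minlh)]
    if tof:
        # Flag suggested above for TOF data.
        cmd.append("-tof")
    return cmd


def run_biosaur2(mzml_path, **options):
    """Run biosaur2 and raise CalledProcessError on a non-zero exit code."""
    return subprocess.run(build_biosaur2_command(mzml_path, **options), check=True)
```

Passing the arguments as a list avoids shell quoting problems with file names that contain spaces.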

Citing biosaur2

Abdrakhimov, et al. Biosaur: An open-source Python software for liquid chromatography-mass spectrometry peptide feature detection with ion mobility support. https://doi.org/10.1002/rcm.9045

Installation

Using pip:

pip install biosaur2

Available parameters

-minlh: Minimum number of MS1 scans for peaks extracted from the mzML file. The optimal value is usually in the 1-3 range for 5-15 min LC gradients and 5-10 for 60-180 min gradients. Default = 2

-mini : Minimal intensity threshold for peaks extracted from the mzML file. Default = 1

-minmz : Minimal m/z value for peaks extracted from the mzML file. Default = 350

-maxmz : Maximal m/z value for peaks extracted from the mzML file. Default = 1500

-htol : Mass accuracy in ppm to combine peaks into hills between scans. Default = 8 ppm

-itol : Mass accuracy in ppm for isotopic hills. Default = 8 ppm
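To give a feel for what a ppm tolerance means in absolute m/z units, here is a small worked sketch. The helper names are hypothetical, and the choice of which peak serves as the reference for the relative error is an assumption; biosaur2 may compute it differently.

```python
def ppm_window(mz, tol_ppm):
    """Return (low, high) m/z bounds for a symmetric tolerance given in ppm."""
    delta = mz * tol_ppm * 1e-6
    return mz - delta, mz + delta


def within_ppm(mz_ref, mz_other, tol_ppm=8.0):
    """True if mz_other differs from mz_ref by no more than tol_ppm (assumed reference: mz_ref)."""
    return abs(mz_ref - mz_other) / mz_ref * 1e6 <= tol_ppm
```

With the default 8 ppm, a peak at m/z 500 matches peaks within about ±0.004 m/z, while at m/z 1500 the window grows to about ±0.012 m/z.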

-ignore_iso_calib : Turn off accurate isotope error estimation if added as a parameter. The input "itol" value will be used instead of Gaussian fitting of mass errors and systematic shifts for every isotope number.

-o : Path to the output feature file. By default, it is the name of the input mzML file with the “.features.tsv” suffix added, stored in the folder of the original mzML file
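The default output location can be predicted in a script, for example to check whether a file has already been processed. This sketch assumes that the ".mzML" extension is replaced by ".features.tsv" rather than appended to; that detail is not stated here, and default_features_path is a hypothetical helper.

```python
from pathlib import Path


def default_features_path(mzml_path):
    """Guess the default output path (assumption: the extension is replaced)."""
    source = Path(mzml_path)
    # Same folder as the input file, same base name, new suffix.
    return source.with_name(source.stem + ".features.tsv")
```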

-hvf: Threshold to split a hill into multiple hills if the local minimum intensity multiplied by hvf is less than both surrounding local maxima. All peaks after splitting must have at least max(2, minlh) MS1 scans. Default = 1.3
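The hvf rule can be written as a simple predicate. This is a sketch of the criterion as described above, not the actual biosaur2 implementation; the function and argument names are hypothetical, and how local minima and maxima are located (for example, after smoothing) is left out.

```python
def should_split_hill(left_max, local_min, right_max,
                      left_scans, right_scans, hvf=1.3, minlh=2):
    """Decide whether a hill should be split at a local intensity minimum.

    left_max / right_max: intensities of the surrounding local maxima.
    left_scans / right_scans: number of MS1 scans in each resulting part.
    """
    # The valley must be deep enough relative to both neighbouring maxima.
    deep_enough = local_min * hvf < min(left_max, right_max)
    # Both parts must keep at least max(2, minlh) scans.
    long_enough = min(left_scans, right_scans) >= max(2, minlh)
    return deep_enough and long_enough
```

For example, with hvf = 1.3 a valley of intensity 500 between maxima of 1000 and 900 qualifies (650 is below both), whereas a valley of 800 does not (1040 exceeds both).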

-ivf: Threshold to split an isotope pattern into multiple features if the local minimum intensity multiplied by ivf is less than the right local maximum. The local minimum position should be higher than max(4th isotope, isotope position with maximum intensity according to the averagine model). Default = 5.0

-nm : Negative mode. 1-true, 0-false. Affects only the neutral mass column calculated in the output features table. Default = 0
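The effect of the ionization mode on the neutral mass can be illustrated with the standard conversion from m/z and charge. This is a sketch under the assumption that the charge carriers are protons (added in positive mode, removed in negative mode); the exact constant and formula used by biosaur2 are not stated here, and neutral_mass is a hypothetical helper.

```python
PROTON_MASS = 1.007276466  # Da; assumed charge carrier mass


def neutral_mass(mz, charge, negative_mode=False):
    """Convert an observed m/z and charge state to a neutral mass.

    Positive mode: M = z * (m/z - proton mass)
    Negative mode: M = z * (m/z + proton mass)
    """
    sign = -1.0 if negative_mode else 1.0
    return mz * charge - sign * charge * PROTON_MASS
```

For a doubly charged ion at m/z 500, the two modes differ by four proton masses, roughly 4.03 Da.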

-cmin: Minimum allowed charge for isotopic clusters. Default = 1

-cmax: Maximal allowed charge for isotopic clusters. Default = 6

-nprocs: Number of processes used by biosaur2. Automatically set to 1 on Windows systems due to multiprocessing issues. Default = 4

-write_hills: Add hills output if added as the parameter

-pasefminlh: For TIMS-TOF data. Minimum number of ion mobility values for m/z peaks to be kept in the analysis. Default = 1

-paseftol: For TIMS-TOF data. Ion mobility tolerance used to combine close peaks into a single one. Default = 0.05

-pasefmini: For TIMS-TOF data. Minimal intensity threshold for peaks after combining peaks with close m/z (itol option) and ion mobility (paseftol option) values. Default = 100

-tof: Experimental. If added as a parameter, biosaur2 estimates the noise intensity distribution across the m/z range and automatically calculates intensity cutoffs for different m/z ranges. This is an alternative way to reduce noise compared to the "-mini" option, which is a fixed intensity threshold for all m/z values. Can be useful for TOF data

Download files

Source Distribution

biosaur2-0.2.22.tar.gz (195.1 kB)

Uploaded Source

Built Distributions

biosaur2-0.2.22-cp311-cp311-win_amd64.whl (134.7 kB)

Uploaded CPython 3.11 Windows x86-64

biosaur2-0.2.22-cp311-cp311-musllinux_1_2_x86_64.whl (1.1 MB)

Uploaded CPython 3.11 musllinux: musl 1.2+ x86-64

biosaur2-0.2.22-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (979.7 kB)

Uploaded CPython 3.11 manylinux: glibc 2.17+ x86-64

biosaur2-0.2.22-cp310-cp310-win_amd64.whl (134.6 kB)

Uploaded CPython 3.10 Windows x86-64

biosaur2-0.2.22-cp310-cp310-musllinux_1_2_x86_64.whl (998.7 kB)

Uploaded CPython 3.10 musllinux: musl 1.2+ x86-64

biosaur2-0.2.22-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (880.7 kB)

Uploaded CPython 3.10 manylinux: glibc 2.17+ x86-64

biosaur2-0.2.22-cp39-cp39-win_amd64.whl (134.6 kB)

Uploaded CPython 3.9 Windows x86-64

biosaur2-0.2.22-cp39-cp39-musllinux_1_2_x86_64.whl (996.1 kB)

Uploaded CPython 3.9 musllinux: musl 1.2+ x86-64

biosaur2-0.2.22-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (881.4 kB)

Uploaded CPython 3.9 manylinux: glibc 2.17+ x86-64

File details

Details for the file biosaur2-0.2.22.tar.gz.

File metadata

  • Download URL: biosaur2-0.2.22.tar.gz
  • Size: 195.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.12.3

File hashes

Hashes for biosaur2-0.2.22.tar.gz
Algorithm Hash digest
SHA256 72e6b77befd405e35a75d6b8773f85c5d08f714d3509943a26ac64ea837fdcdb
MD5 35e629f567e0012f593880296d278093
BLAKE2b-256 7d82dde5591128e9784025fc1b602d3f2d9e5a399a006d3a621555008474f3cd
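A downloaded file can be checked against the SHA256 digest listed for it. The following is a minimal sketch using the standard hashlib module; sha256_of_file is a hypothetical helper name, and the file is read in chunks so that large archives do not have to fit in memory.

```python
import hashlib


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the hexadecimal SHA256 digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # iter() with a sentinel stops when read() returns empty bytes.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()
```

The returned string should equal the SHA256 value listed for the file that was downloaded; any difference means the download is corrupted or is not the published file.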

See more details on using hashes here.

File details

Details for the file biosaur2-0.2.22-cp311-cp311-win_amd64.whl.

File hashes

Hashes for biosaur2-0.2.22-cp311-cp311-win_amd64.whl
Algorithm Hash digest
SHA256 857e3da7b3655e4556aacf5081e8a4ea6110f3956cccc6fd5c15b24e44112ea5
MD5 7544ccd57d1a1ff1c4c783bb94412b83
BLAKE2b-256 68a391b296fded1acc2b1f96415b5d4c70fc15e7044e58050ed041765df97ac4

File details

Details for the file biosaur2-0.2.22-cp311-cp311-musllinux_1_2_x86_64.whl.

File hashes

Hashes for biosaur2-0.2.22-cp311-cp311-musllinux_1_2_x86_64.whl
Algorithm Hash digest
SHA256 14ea4ea930690d580dd056a8af5704080ca4acbf2bffbd14c54d5d9bceb57fb9
MD5 9a3d63b4329b006d56af3a4ffe93944b
BLAKE2b-256 1371a9e6ba7d5be4153bbd857a46c7554c26e5e5b6967b89dc158c6b270230c5

File details

Details for the file biosaur2-0.2.22-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File hashes

Hashes for biosaur2-0.2.22-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 08ec9f6be5d50ef1279819cb3e9d75ddb6c207cf0eff944020c8394d07d33723
MD5 6adf7efadc99ff661e64a178e9dfaccc
BLAKE2b-256 9c12bc17a8f0a355edc066ea1dac7b8496b282d46c6d52958caca45a75b1e2bb

File details

Details for the file biosaur2-0.2.22-cp310-cp310-win_amd64.whl.

File hashes

Hashes for biosaur2-0.2.22-cp310-cp310-win_amd64.whl
Algorithm Hash digest
SHA256 c8e1d97826b3e47633fc1fc5d081f846420a35ad618b72a51ac3c8fe3aa1b058
MD5 f4c58a9923ae51f8a8b459b434512f6e
BLAKE2b-256 70d27334c29d61fdf98ea5b3d9babfdca06768a03976ea0ae32c1c02339dc603

File details

Details for the file biosaur2-0.2.22-cp310-cp310-musllinux_1_2_x86_64.whl.

File hashes

Hashes for biosaur2-0.2.22-cp310-cp310-musllinux_1_2_x86_64.whl
Algorithm Hash digest
SHA256 ebd8272d5900754713261371eff3a23873e8da80c6079074bb61d3203d71a02e
MD5 bf1f8f6d475d36344847ca313a9e11e8
BLAKE2b-256 04ac1e2011dccef063bcda032a75981694b8812d44e1b5be38625d014300de38

File details

Details for the file biosaur2-0.2.22-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File hashes

Hashes for biosaur2-0.2.22-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 41f4ff0bbf67365d343f6ffeb168f5f438dcd5912b45f67a48a2a668de8ea81f
MD5 8034e5553c2300c0440f52b10b3d2716
BLAKE2b-256 6a618e43f9501c235c502b3be3882406dfa5f746f7d285475780458bb195c1d0

File details

Details for the file biosaur2-0.2.22-cp39-cp39-win_amd64.whl.

File metadata

  • Download URL: biosaur2-0.2.22-cp39-cp39-win_amd64.whl
  • Size: 134.6 kB
  • Tags: CPython 3.9, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.12.3

File hashes

Hashes for biosaur2-0.2.22-cp39-cp39-win_amd64.whl
Algorithm Hash digest
SHA256 b363fd6fb27ef53f9a4d11ac7989722fbf28812d616e9953e18436d284f88f5f
MD5 7eee2147fd5ffe79552ad39e707b21f9
BLAKE2b-256 c5c5b59798d02bcf42de73afe48efe55741bc0b3211aee289fb3a571cabe2939

See more details on using hashes here.

File details

Details for the file biosaur2-0.2.22-cp39-cp39-musllinux_1_2_x86_64.whl.

File metadata

File hashes

Hashes for biosaur2-0.2.22-cp39-cp39-musllinux_1_2_x86_64.whl
Algorithm Hash digest
SHA256 e8e08a1c070be4fc20736cc44a5c2a17b431bef5ff80f765493018b277c23c9e
MD5 b95a48fbaed61a31d5c34f348fbad3b5
BLAKE2b-256 6489a93d3e26770ba182479b9013d4fa43bed4a05438f01487571574324ce4e3

See more details on using hashes here.

File details

Details for the file biosaur2-0.2.22-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for biosaur2-0.2.22-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 bc768c827e501b484ec61afb4cc4ef6c8b420e97e9a8086650c3f72cbf4eff79
MD5 58c54b36d8c5b723b8b20ac4f409302d
BLAKE2b-256 a785deae1c15f606471eacbca267185b2f6a07a120b2fdf130f00c51ee2b94b1
