Skip to main content

A feature detection LC-MS1 spectra.

Project description

biosaur2 - A feature detection LC-MS1 spectra. This project is a rewriten version of Biosaur software (https://github.com/abdrakhimov1/Biosaur).

The centroided mzML file is required for of the script.

Algorithm can be run with following command:

biosaur2 path_to_MZML

The script output contains tsv table with peptide features.

All available arguments can be shown with command "biosaur2 -h".

The default parameter minlh (the minimal number of consecutive scans for peptide feature) is 1 and this value is optimimal for ultra-short LC gradients (a few minutes). For the longer LC gradients, this value can be increased for reducing of feature detection time and removing noise isotopic clusters.

For TOF data please add "-tof" argument.

For PASEF data please convert mzML file using msconvert and '--combineIonMobilitySpectra --filter "msLevel 1" ' options. Do not use option --filter "scanSumming"! The latter is often required for MS/MS data analysis but breaks MS1 feature detection.

For negative mode data please add "-nm" argument.

Citing biosaur2

Abdrakhimov, et al. Biosaur: An open-source Python software for liquid chromatography-mass spectrometry peptide feature detection with ion mobility support. https://doi.org/10.1002/rcm.9045

Installation

Using the pip:

pip install biosaur2

Available parameters

-minlh: Minimum number of MS1 scans for peaks extracted from the mzML file. Optimal usually is in 1-3 range for 5-15 min LC gradients and 5-10 for 60-180 min gradients. Default = 2

-mini : Minimal intensity threshold for peaks extracted from the mzML file. Default = 1

-minmz : Minimal m/z value for peaks extracted from the mzML file. Default = 350

-maxmz : Maximal m/z value for peaks extracted from the mzML file. Default = 1500

-htol : Mass accuracy in ppm to combine peaks into hills between scans. Default = 8 ppm

-itol : Mass accuracy in ppm for isotopic hills. Default = 8 ppm

-ignore_iso_calib : Turn off accurate isotope error estimation if added as the parameter. Input "itol" value will be used instead of gaussian fitting of mass errors and systematic shifts for every isotope number.

-o : Path to output feature files. Default is the name of the input mzML file with added “.features.tsv” mask stored in the folder of the original mzML file

-hvf: Threshold to split hills into multiple if local minimum intensity multiplied by hvf is less than both surrounding local maximums. All peaks after splitting must have at least max(2, minlh) MS1 scans. Default = 1.3

-ivf: Threshold to split isotope pattern into multiple features if local minimum intensity multiplied by ivf is less right local maximum. Local minimum position should be higher than max(4rd isotope, isotope position with maximum intensity according to averagine model). Default = 5.0

-nm : Negative mode. 1-true, 0-false. Affect only neutral mass column calculated in the output features table. Default = 0

-cmin: Minimum allowed charge for isotopic clusters. Default = 1

-cmax: Maximal allowed charge for isotopic clusters. Default = 6

-nprocs: Number of processes used by biosau2. Automatically set to 1 for Windows system due to multiprocessing issues. Default = 4

-write_hills: Add hills output if added as the parameter

-paseminlh: For TIMS-TOF data. Minimum number of ion mobility values for m/z peaks to be kept in the analysis. Default = 1

-paseftol: For TIMS-TOF data. Ion mobility tolerance used to combine close peaks into a single one. Default = 0.05

-pasefmini: For TIMS-TOF data. Minimal intensity threshold for peaks after combining peaks with close m/z (itol option) and ion mobility (paseftol option) values. Default = 100

-tof: Experimental. If added as the parameter, biosaur2 estimates noise intensity distribution across m/z range and automatically calculates intensity cutoffs for different m/z value ranges. This is an alternative way to reduce noise to the "-mini" option which is a fixed intensity threshold for all m/z values. Can be usefull for TOF data

Links

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

biosaur2-0.2.24.tar.gz (195.2 kB view details)

Uploaded Source

Built Distributions

biosaur2-0.2.24-cp311-cp311-win_amd64.whl (124.4 kB view details)

Uploaded CPython 3.11 Windows x86-64

biosaur2-0.2.24-cp311-cp311-musllinux_1_2_x86_64.whl (1.0 MB view details)

Uploaded CPython 3.11 musllinux: musl 1.2+ x86-64

biosaur2-0.2.24-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (955.0 kB view details)

Uploaded CPython 3.11 manylinux: glibc 2.17+ x86-64

biosaur2-0.2.24-cp310-cp310-win_amd64.whl (124.3 kB view details)

Uploaded CPython 3.10 Windows x86-64

biosaur2-0.2.24-cp310-cp310-musllinux_1_2_x86_64.whl (958.2 kB view details)

Uploaded CPython 3.10 musllinux: musl 1.2+ x86-64

biosaur2-0.2.24-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (864.1 kB view details)

Uploaded CPython 3.10 manylinux: glibc 2.17+ x86-64

biosaur2-0.2.24-cp39-cp39-win_amd64.whl (124.7 kB view details)

Uploaded CPython 3.9 Windows x86-64

biosaur2-0.2.24-cp39-cp39-musllinux_1_2_x86_64.whl (957.8 kB view details)

Uploaded CPython 3.9 musllinux: musl 1.2+ x86-64

biosaur2-0.2.24-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (858.7 kB view details)

Uploaded CPython 3.9 manylinux: glibc 2.17+ x86-64

File details

Details for the file biosaur2-0.2.24.tar.gz.

File metadata

  • Download URL: biosaur2-0.2.24.tar.gz
  • Upload date:
  • Size: 195.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.12.7

File hashes

Hashes for biosaur2-0.2.24.tar.gz
Algorithm Hash digest
SHA256 6b7d4d491405e1e630b45e827cf2ff9377590cd7031d51d5008584339faea36b
MD5 8664f63843ba34a72bf0ddbbce237234
BLAKE2b-256 5b9abcf65e7fdddc035052b1a55305a1e07d5dd14dafbc6b892e5529fa751e5f

See more details on using hashes here.

File details

Details for the file biosaur2-0.2.24-cp311-cp311-win_amd64.whl.

File metadata

File hashes

Hashes for biosaur2-0.2.24-cp311-cp311-win_amd64.whl
Algorithm Hash digest
SHA256 583dfec27bfc32d239efe9042a561d0fcf92f98b529dbb5737e7f8b1ea54d918
MD5 fb6c219fbef55691358d2fe8792e4f94
BLAKE2b-256 7a934c60b303a84373ec8a69b0e7c2ac7e1ae618e9606f45c182f6e068f13e80

See more details on using hashes here.

File details

Details for the file biosaur2-0.2.24-cp311-cp311-musllinux_1_2_x86_64.whl.

File metadata

File hashes

Hashes for biosaur2-0.2.24-cp311-cp311-musllinux_1_2_x86_64.whl
Algorithm Hash digest
SHA256 0403411ada38fe8e1d23f7eae73e6b2d5a9bc71eefb351e127fde2e8c41742cd
MD5 d390039609a604dc20ba215bfec36716
BLAKE2b-256 1aaac24626bf6567d226a4d104418aaacd069c44250db5f8004b5335f94eb9db

See more details on using hashes here.

File details

Details for the file biosaur2-0.2.24-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for biosaur2-0.2.24-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 7006f0e668d9b6dc68b2b211fe8d58f346c4e4fcee6b12c9e193407181eb411b
MD5 61a655fb2d7d291f630647bb2f94c6fa
BLAKE2b-256 586fd1395793a0cb3a4559c95d1e21843ca0e9c6200e74429aa616d8c11c5000

See more details on using hashes here.

File details

Details for the file biosaur2-0.2.24-cp310-cp310-win_amd64.whl.

File metadata

File hashes

Hashes for biosaur2-0.2.24-cp310-cp310-win_amd64.whl
Algorithm Hash digest
SHA256 13b4dba5e9b046c4b7dbc9000e9ef81ef48d73c74967be3a7c3e5cd65d566f8a
MD5 2101a639657b1693969691e8b71143d7
BLAKE2b-256 d59166a61eaf42264d25aa72b388a25de5c71e45b83741867ee9c9c049b66b80

See more details on using hashes here.

File details

Details for the file biosaur2-0.2.24-cp310-cp310-musllinux_1_2_x86_64.whl.

File metadata

File hashes

Hashes for biosaur2-0.2.24-cp310-cp310-musllinux_1_2_x86_64.whl
Algorithm Hash digest
SHA256 e994ea3f3622d801a71cc8c3625df8a0a046c4ed5fa5cc07926ad489f40134fe
MD5 9293ca3a9575a92ed6103f2961d1e8c3
BLAKE2b-256 181c0c554cd7897c74837d6e83a5e5d82060bf858935fa2fd32f01c5d3d3bbfd

See more details on using hashes here.

File details

Details for the file biosaur2-0.2.24-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for biosaur2-0.2.24-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 e9c8ddea14209965ec66c593fc0ae715237aa2e6b51b89fc96b2e8f3689a1547
MD5 7aed9ae37660008ae46b3ba71a7d3367
BLAKE2b-256 dac3d66f515cce1dcb34b545563e269b0b7bc7b3b5a49709fc63fd79f6e37465

See more details on using hashes here.

File details

Details for the file biosaur2-0.2.24-cp39-cp39-win_amd64.whl.

File metadata

  • Download URL: biosaur2-0.2.24-cp39-cp39-win_amd64.whl
  • Upload date:
  • Size: 124.7 kB
  • Tags: CPython 3.9, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.12.7

File hashes

Hashes for biosaur2-0.2.24-cp39-cp39-win_amd64.whl
Algorithm Hash digest
SHA256 1ad98aaa59ef2d3ca80582c6d3f9f346a44aab8785d429cb198f416eeebdc9cd
MD5 4b47daec255798ea8ba984e05c73f320
BLAKE2b-256 b7ec8ba6cb06a5f4d83c52f34a37931e26b063f14c75a3de514d3fa47852d5a9

See more details on using hashes here.

File details

Details for the file biosaur2-0.2.24-cp39-cp39-musllinux_1_2_x86_64.whl.

File metadata

File hashes

Hashes for biosaur2-0.2.24-cp39-cp39-musllinux_1_2_x86_64.whl
Algorithm Hash digest
SHA256 e1e5d3e65bbe626bd2a86cc700792058782086f02a0bb19429731eb998380462
MD5 8f36b49f6523b113d5527abe1f74616f
BLAKE2b-256 c168cc9434a920be6d33f249e85593e8dbdae4e86754c10a5f749d77eebfb7e3

See more details on using hashes here.

File details

Details for the file biosaur2-0.2.24-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for biosaur2-0.2.24-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 9bc401852acbc95c467df2839a8ec72c3030b8b206ba195ea347b125f82decec
MD5 6048214a00f503ad79cbe99a917fd31b
BLAKE2b-256 3ae93d6f1bb5c442bc5a668590cf5bfd5bf575a91a685a8f1274c23486770630

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page