Auto MS mass checker
Project description
MScheck for locating target compound masses in mass spectra
MScheck is a python package that hunts for a target compound mass + given ion mass (eg. H+, Na+). MScheck was created to assist with the automated mass spectrum analysis of target compounds synthesised using a high throughput approach.
MSCheck uses the excellent Python library pyOpenMS for reading and handling .mzML mass spectra files. The spectra are stored and handled as a MassSpectrum class object - see spectrum.py.
The AnalyseSpectum class - see analyse.py - use Scipy's signal peak finding algorithms (find_peaks and peak_widths) to find peaks and calculate the full width at half maximum height (FWHM) of the peaks found. Mass spectrum data points are analysed in the area of the peak above the FWHM height by searching for the sum of the parent mass of the target molecule and ion. Different ions can be included in the search - see the example below.
MScheck installation
MScheck relies on rdkit for generating molecule SVG images and for calculating molecular weights
Installing rdkit using conda works best followed by a pip install of MScheck
-
Create a conda environment
conda create -c conda-forge -n MScheck python=3.11
-
Activate the MScheck conda environment created
conda activate MScheck
-
Pip install pyOpenMS
pip install --index-url https://pypi.cs.uni-tuebingen.de/simple/ pyopenms
-
Pip install MScheck
pip install mscheck
Preparing vendor files
MScheck has been tested on using Agilent LCMS files (.D) as the starting file format. One challenge is to convert vendor file formats into a format with the binary decoded.
- Convert .D Agilent folder to a .d MassHunter format using Agilent's ChemStation to MassHunter Translator (B.04.00)
- Convert .d format into .mzML format using ProteoWizard's MSConvert tool
- Finally - we have an file format that we can use!
- In your favourite IDE or Jupyter notebook - a basic example of using MSCheck is provided below:
from mscheck.analyse import AnalyseSpectrum
# Create MS scptrum object and find peaks
test = AnalyseSpectrum("<path to .mzML file>", mode="Positive")
# Set SMILES of target to search for
target_SMILES = "CCOC(=O)N1CCN(C(=O)N2CCN(C(=O)c3ccco3)CC2)CC1"
# Analyse test spectrum searching for target SMILES
test.analyse(compoundsmiles=target_SMILES,
ionstoadd=["[H]", "[Na]", "[K]", "[NH4+]"],
tolerance=1)
# Create a .svg report - if you do not give a compound_name
# the ending leaf of the file name will be used
test.create_report(compound_name="Test")
The .svg report will be in a folder called Reports
Example of report output:
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
File details
Details for the file mscheck-0.2.9b0.tar.gz
.
File metadata
- Download URL: mscheck-0.2.9b0.tar.gz
- Upload date:
- Size: 10.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.11.3
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | ed8f8a7426bce0afb1b05fd2aa46af453c098b43f15c28d657c851087cdbfb1e |
|
MD5 | 590b56001bef47c6ea0668c39697a125 |
|
BLAKE2b-256 | 6b02b08c8b11dc780808430f4124160093b2f8ef277a8c514f3e505786d1f958 |