Skip to main content

Auto MS mass checker

Project description

MScheck for locating target compound masses in mass spectra

PyPI version

build test

MScheck is a python package that hunts for a target compound mass + given ion mass (eg. H+, Na+). MScheck was created to assist with the automated mass spectrum analysis of target compounds synthesised using a high throughput approach.

MSCheck uses the excellent Python library pyOpenMS for reading and handling .mzML mass spectra files. The spectra are stored and handled as a MassSpectrum class object - see spectrum.py.

The AnalyseSpectum class - see analyse.py - use Scipy's signal peak finding algorithms (find_peaks and peak_widths) to find peaks and calculate the full width at half maximum height (FWHM) of the peaks found. Mass spectrum data points are analysed in the area of the peak above the FWHM height by searching for the sum of the parent mass of the target molecule and ion. Different ions can be included in the search - see the example below.

MScheck installation

MScheck relies on rdkit for generating molecule SVG images and for calculating molecular weights
Installing rdkit using conda works best followed by a pip install of MScheck

  1. Create a conda environment with rdkit

    conda create -c conda-forge -n MScheck rdkit

  2. Activate the MScheck conda environment created

    conda activate MScheck

  3. Pip install MScheck

    pip install mscheck

Preparing vendor files

MScheck has been tested on using Agilent LCMS files (.D) as the starting file format. One challenge is to convert vendor file formats into a format with the binary decoded.

  1. Convert .D Agilent folder to a .d MassHunter format using Agilent's ChemStation to MassHunter Translator (B.04.00)
  2. Convert .d format into .mzML format using ProteoWizard's MSConvert tool
  3. Finally - we have an file format that we can use!
  4. In your favourite IDE or Jupyter notebook - a basic example of using MSCheck is provided below:
from mscheck.analyse import AnalyseSpectrum

# Create MS scptrum object and find peaks
test = AnalyseSpectrum("<path to .mzML file>", mode="Positive")

# Set SMILES of target to search for
target_SMILES = "CCOC(=O)N1CCN(C(=O)N2CCN(C(=O)c3ccco3)CC2)CC1"

# Analyse test spectrum searching for target SMILES
test.analyse(compoundsmiles=target_SMILES,
             ionstoadd=["[H]", "[Na]", "[K]", "[NH4+]"],
             tolerance=1)

# Create a .svg report - if you do not give a compound_name
# the ending leaf of the file name will be used
test.create_report(compound_name="Test")

The .svg report will be in a folder called Reports

Example of report output:

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mscheck-0.2.4b0.tar.gz (8.5 kB view details)

Uploaded Source

File details

Details for the file mscheck-0.2.4b0.tar.gz.

File metadata

  • Download URL: mscheck-0.2.4b0.tar.gz
  • Upload date:
  • Size: 8.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.9.15

File hashes

Hashes for mscheck-0.2.4b0.tar.gz
Algorithm Hash digest
SHA256 292414803601209d4115e5d390614c7f149b8528b6f883a1136ec15cf3076e52
MD5 fe998f34cede9b6a391d203738e1f27e
BLAKE2b-256 2cad82822b41cd38d3c2b607c7be39f7427408a9367dec3670a8bfce2875250f

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page