Skip to main content

ModiFinder package

Project description

Welcome to ModiFinder's documentation!

ModiFinder is a tool for site localization of structural modifications using MS/MS data.

This project is under active development.

The documentation is available at: https://wang-bioinformatics-lab.github.io/ModiFinder_base/

Citing

ModiFinder: Tandem Mass Spectral Alignment Enables Structural Modification Site Localization

Mohammad Reza Zare Shahneh, Michael Strobel, Giovanni Andrea Vitale, Christian Geibel, Yasin El Abiead, Neha Garg, Berenike Wagner, Karl Forchhammer, Allegra Aron, Vanessa V Phelan, Daniel Petras, and Mingxun Wang

Journal of the American Society for Mass Spectrometry 2024 35 (11), 2564-2578

DOI: 10.1021/jasms.4c00061

License

Academic Software License: © 2024 UCR (“Institution”). Academic or nonprofit researchers are permitted to use this Software (as defined below) subject to Paragraphs 1-4:

1. Institution hereby grants to you free of charge, so long as you are an academic or nonprofit researcher, a nonexclusive license under Institution’s copyright ownership interest in this software and any derivative works made by you thereof (collectively, the “Software”) to use, copy, and make derivative works of the Software solely for educational or academic research purposes, and to distribute such Software free of charge to other academic or nonprofit researchers for their educational or academic research purposes, in all cases subject to the terms of this Academic Software License. Except as granted herein, all rights are reserved by Institution, including the right to pursue patent protection of the Software.
2. Any distribution of copies of this Software -- including any derivative works made by you thereof -- must include a copy (including the copyright notice above), and be made subject to the terms, of this Academic Software License; failure by you to adhere to the requirements in Paragraphs 1 and 2 will result in immediate termination of the license granted to you pursuant to this Academic Software License effective as of the date you first used the Software.
3. IN NO EVENT WILL INSTITUTION BE LIABLE TO ANY ENTITY OR PERSON FOR DIRECT, INDIRECT, SPECIAL, INCIDENTAL, OR CONSEQUENTIAL DAMAGES, INCLUDING LOST PROFITS, ARISING OUT OF THE USE OF THIS SOFTWARE, EVEN IF INSTITUTION HAS BEEN ADVISED OF THE POSSIBILITY OF SUCH DAMAGE. INSTITUTION SPECIFICALLY DISCLAIMS ANY AND ALL WARRANTIES, EXPRESS AND IMPLIED, INCLUDING, BUT NOT LIMITED TO, ANY IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE. THE SOFTWARE IS PROVIDED “AS IS.” INSTITUTION HAS NO OBLIGATION TO PROVIDE MAINTENANCE, SUPPORT, UPDATES, ENHANCEMENTS, OR MODIFICATIONS OF THIS SOFTWARE.
4. Any academic or scholarly publication arising from the use of this Software or any derivative works thereof will include the following acknowledgment:  The Software used in this research was created by [INSERT AUTHOR NAMES] of UC Riverside. © 2024 UCR.

Commercial entities: please contact mingxun.wang@cs.ucr.edu or tp@ucr.edu for licensing opportunities.

Core API (Working)

ModiFinder requires two spectrum objects:

main_compound = Compound(
    spectrum=s1_peaks,                       # Formatted as [[mz, int], ...]
    precursor_mz=s1_prec_mz,                 # Float
    precursor_charge=s1_charge,              # Int
    adduct=s1_adduct,                        # Str
    smiles=s1_smiles                         # Str
)
mod_compound = Compound(
    spectrum=s2_peaks,
    precursor_mz=s2_prec_mz,
    precursor_charge=s2_charge,
    adduct=s2_adduct,
    smiles=s2_smiles                         # Optional
)

Pass these to a ModiFinder Object

helper_compounds = None
siteLocator = ModiFinder(
    main_compound,
    mod_compound,
    helpers=helper_compounds,
    **args)

Collect dictionary results:

peaksObj, fragmentsObj = siteLocator.get_result()

Dictionary format:

peaksObj = {
            "main_compound_peaks": main_compound_peaks,
            "mod_compound_peaks": mod_compound_peaks,
            "matched_peaks": matched_peaks,
            "main_precursor_mz": knownCompound.spectrum.precursor_mz,
            "mod_precursor_mz": unknownCompound.spectrum.precursor_mz,
        }
fragmentsObj = {
    "frags_map": knownCompound.spectrum.peak_fragment_dict,
    "structure": knownCompound.structure,
    "peaks": main_compound_peaks,
    "Precursor_MZ": knownCompound.spectrum.precursor_mz,
}

Useful Utility Functions

ModiFinder includes several useful utility functions for mass spectrometry data analysis and visualization, exposed under modifinder.utilities.

Reading MGF Files You can easily read MGF files into a pandas DataFrame using read_mgf.

from modifinder.utilities import read_mgf

# Read MGF file
df = read_mgf("path/to/your/spectrum.mgf")
print(df.head())

Visualization The vis module provides powerful visualization tools.

from modifinder.utilities import vis
import matplotlib.pyplot as plt

# Draw a molecule
img = vis.draw_molecule("C1=CC=C(C=C1)O", output_type="png")
plt.imshow(img)
plt.show()

# Draw a spectrum
# spectrum_data is a list of (mz, intensity) tuples
spectrum_data = df.iloc[0]['spectrum'].T.tolist()
vis.draw_spectrum(spectrum_data)
plt.show()

Developer Guide

Running Tests

To run the automated tests, ensure you have pytest installed. Then run:

pytest

or specifically for utilities:

pytest modifinder/utilities/tests/

Release Process

ModiFinder uses GitHub Actions for automated releases and documentation deployment.

Creating a New Release (PyPI)

  1. Create and push a new tag starting with v (e.g., v1.2.3).
    git tag v1.2.3
    git push origin v1.2.3
    
    The version is automatically derived from the tag using setuptools_scm.
  2. This triggers the pypi_publish.yml workflow, which:
    • Builds the package.
    • Publishes it to PyPI (and TestPyPI).
    • Creates a GitHub Release.

Updating Documentation Documentation is automatically rebuilt and deployed to GitHub Pages whenever changes are pushed to the main branch (via documentation.yml).

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

modifinder-2.0.2b0.tar.gz (13.6 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

modifinder-2.0.2b0-py3-none-any.whl (10.4 MB view details)

Uploaded Python 3

File details

Details for the file modifinder-2.0.2b0.tar.gz.

File metadata

  • Download URL: modifinder-2.0.2b0.tar.gz
  • Upload date:
  • Size: 13.6 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for modifinder-2.0.2b0.tar.gz
Algorithm Hash digest
SHA256 c6a1339b0d022698180a2004bfa4ae81817616ea61d6f6332abe5c815ad042d7
MD5 b158d89ef6af1045ef1a9beb9e1ff69a
BLAKE2b-256 00c062b5d26fca3741277921a2569224f0cde43a3e4d251b2a405b1fa2c94631

See more details on using hashes here.

Provenance

The following attestation bundles were made for modifinder-2.0.2b0.tar.gz:

Publisher: pypi_publish.yml on Wang-Bioinformatics-Lab/ModiFinder_base

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file modifinder-2.0.2b0-py3-none-any.whl.

File metadata

  • Download URL: modifinder-2.0.2b0-py3-none-any.whl
  • Upload date:
  • Size: 10.4 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for modifinder-2.0.2b0-py3-none-any.whl
Algorithm Hash digest
SHA256 7fee3217f98b87adb1833fb3dc1a054730b46236f9d8bb6a6699106e63d5f6cc
MD5 f41bb1f8ae8fb0f519bcab99bcbf75c0
BLAKE2b-256 ac6e5252a3e6dad2969b44017bd3a52e331beaa23a656a004691a471428e09b2

See more details on using hashes here.

Provenance

The following attestation bundles were made for modifinder-2.0.2b0-py3-none-any.whl:

Publisher: pypi_publish.yml on Wang-Bioinformatics-Lab/ModiFinder_base

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page