LCMS Processing tools used by the Metabolomics Platform at the Broad Institute.

These details have not been verified by PyPI

Project links

Repository

Project description

BMXP - The Metabolomics Platform at the Broad Institute

pip install bmxp

Please cite: https://www.biorxiv.org/content/10.1101/2023.06.09.544417v1.full

This is a collection of tools for processing our data, which powers our cloud processing workflow. Each tool is meant to be a standalone module that performs a step in our processing pipeline. They are written in Python and C, and designed to be perfomant and cloud-compatible.

Eclipse - Align two or more same-method nontargeted LCMS datasets
Gravity - Cluster redundant LCMS features based on RT and Correlation (And someday, XIC shape)
Blueshift - Drift Correction via pooled technical replicates and internal standards
Formation - Formatting and Final QC
Chroma - Read .raw and .mzml files

We expect users to be familiar with Python and already have an understanding of LCMS Metabolomics data processing and the specific steps they wish to accomplish.

While the tools are and always will be standalone, we are working on linking them closer together with a shared schema, and eventually may have a pipeline ability to run all steps, given a set of parameters.

We are open to feedback and suggestions, with a focus on performance and application in pipelines.

Shared Schema

All BMXP modules use a shared schema and file formats with our prefered columns headers. These files are (along with their labels):

Feature Metadata bmxp.FMDATA - Describes the feature. Index default is Compound_ID
Injection Metadata bmxp.IMDATA - Describes the Injection. Index default is injection_id
Sample Metadata bmxp.SMDATA - Describes the biospecimen from which the Injection is derived. Index default is broad_id
Feature Abundances - Pivot table of Feature x Injection (Compound_ID x injection_id) containing the abundances.

Some modules (Blueshift, Eclipse) require merging Feature Metadata + Feature Abundances.

These can be changed globally so that all packages will use the same terminology. To update the schema, modify the dictionary objects in the module directly prior to running code. For example:

import bmxp
from bxmp.eclipse import MSAligner
from bxmp.blueshift import DriftCorrection
from bmxp.gravity import cluster
bmxp.FMDATA['Compound_ID'] = 'Feature_ID'
bmxp.IMDATA['injection_id'] = 'Filename'

# continue with work...

With those changes above, Eclipse, Blushift and Gravity will use "Feature_ID" and "Filename" as column headers instead of "Compound_ID" and "injection_id".

Feature Metadata - bmxp.FMDATA

Feature Metadata describes the LCMS feature. This is a mixture of fundamental nontargeted feature information, annotation info, and anything else.

Feature Specific

Compound_ID - Index, Project-unique feature ID (a bit of a misnomer)
RT - Unitless retention time, may or may not be scaled
MZ - Unsigned mass-to-charge ratio
Intensity - Average feature intensity
Method - Human Readable name of LCMS method used
__extraction_method - Name of extraction method/software used. Used to denote mixed Targeted/Nontargeted

Annotation

Annotation_ID - Method-unique annotation label
Adduct - Adduct form of the annotation
__annotation_id - Globally unique annotation identifier
Metabolite - Preferred display/reporting name of metabolite
Non_Quant - Boolean denoting that a feature is not quanitifiable

Generated by Gravity

Cluster_Num - Cluster number assigned during Gravity clustering
Cluster_Size - Number of members in the cluster

Generated by Blueshift

Batches Skipped - Batches that were skipped due to lack of PREFs

Injection Metadata - bmxp.IMDATA

injection_id - Index, Injection name, usually filename without the extension
broad_id - Assigned biospeciemn label
program_id - Biospecimen label as received (inherited from Sample Metadata)
injection_type - Type of injection ("sample", "prefa", "prefb", "blank", "other-", "not_used-")
comments - Comments about the injection
column_number - Column number, in multi-column studies
injection_order - Injection number, not skipping blanks or non-samples
batches - Denotes batches ('batch_start' or 'batch_end')

Generated by Blueshift

QCRole - Role in drift correction ("QC-drift_correction", "QC-pooled_ref", "QC-not_used", "sample")

Sample Metadata - bmxp.SMDATA

broad_id - Assigned biospecimen label
Arbitrary Metadata Columns - Any column label except labels in Injection Metadata

Project details

These details have not been verified by PyPI

Project links

Repository

Release history Release notifications | RSS feed

0.5.1

Apr 16, 2026

This version

0.5.0

Feb 26, 2026

0.4.8

Dec 24, 2025

0.4.7

Dec 24, 2025

0.4.6

Nov 26, 2025

0.4.5

Nov 21, 2025

0.4.4

Nov 19, 2025

0.4.3

Nov 7, 2025

0.4.2

Oct 31, 2025

0.4.1

Oct 21, 2025

0.4.0

Oct 15, 2025

0.3.18

Sep 30, 2025

0.3.17

Sep 22, 2025

0.3.16

Aug 12, 2025

0.3.15

Jul 25, 2025

0.3.13

May 23, 2025

0.3.12

Apr 30, 2025

0.3.11

Apr 25, 2025

0.3.10

Mar 25, 2025

0.3.9

Mar 4, 2025

0.3.8

Mar 3, 2025

0.3.7

Feb 25, 2025

0.3.6

Feb 21, 2025

0.3.5

Feb 13, 2025

0.3.4

Feb 7, 2025

0.3.3

Feb 6, 2025

0.3.2

Feb 6, 2025

0.3.1

Jan 31, 2025

0.3.0

Jan 24, 2025

0.2.5

Jan 22, 2025

0.2.4

Jan 8, 2025

0.2.3

Dec 9, 2024

0.2.2

Nov 5, 2024

0.2.0

Oct 1, 2024

0.1.14

Sep 30, 2024

0.1.13

Sep 19, 2024

0.1.12

Sep 5, 2024

0.1.11

Aug 30, 2024

0.1.10

Aug 13, 2024

0.1.9

Aug 7, 2024

0.1.8

Jul 26, 2024

0.1.7

Jul 18, 2024

0.1.6

Jul 9, 2024

0.1.5

Jul 9, 2024

0.1.4

Jul 9, 2024

0.1.3

Jul 8, 2024

0.1.2

May 3, 2024

0.1.1 yanked

May 3, 2024

Reason this release was yanked:

Broken distribution

0.1.0 yanked

May 3, 2024

Reason this release was yanked:

Broken distribution

0.0.18

Apr 29, 2024

0.0.17

Feb 9, 2024

0.0.15

Aug 14, 2023

0.0.14

Apr 14, 2023

0.0.13

Mar 7, 2023

0.0.11

Jan 31, 2023

0.0.10

Jan 18, 2023

0.0.9

Jan 3, 2023

0.0.8

Jan 3, 2023

0.0.7

Dec 21, 2022

0.0.6

Dec 21, 2022

0.0.5

Dec 21, 2022

0.0.4

Nov 18, 2022

0.0.3

Nov 8, 2022

0.0.2

Sep 26, 2022

0.0.1

Sep 26, 2022

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

bmxp-0.5.0.tar.gz (2.6 MB view details)

Uploaded Feb 26, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

bmxp-0.5.0-py3-none-any.whl (1.3 MB view details)

Uploaded Feb 26, 2026 Python 3

File details

Details for the file bmxp-0.5.0.tar.gz.

File metadata

Download URL: bmxp-0.5.0.tar.gz
Upload date: Feb 26, 2026
Size: 2.6 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.12

File hashes

Hashes for bmxp-0.5.0.tar.gz
Algorithm	Hash digest
SHA256	`3529196e689580fc2771ec28a2e561f6ba84758de83d3dd60c96288b0601dda7`
MD5	`18fb68c8963dfe52e094c112fde42167`
BLAKE2b-256	`15efa10121420e2aac8afc91cd35e97a65ecd3f11bd9b1a01db406c9a6bb2ec4`

See more details on using hashes here.

File details

Details for the file bmxp-0.5.0-py3-none-any.whl.

File metadata

Download URL: bmxp-0.5.0-py3-none-any.whl
Upload date: Feb 26, 2026
Size: 1.3 MB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.12

File hashes

Hashes for bmxp-0.5.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`9ac115430adb1465557db808cc1cb7e35d59735acf0d319e9b35341e79da9394`
MD5	`39bc740621dd51c389ac05183f2760d9`
BLAKE2b-256	`305ac354856927cb1440ca2a14239fc3fff5c099b9dfd4bf7d413cd37713e49b`

See more details on using hashes here.

bmxp 0.5.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

BMXP - The Metabolomics Platform at the Broad Institute

Shared Schema

Feature Metadata - bmxp.FMDATA

Feature Specific

Annotation

Generated by Gravity

Generated by Blueshift

Injection Metadata - bmxp.IMDATA

Generated by Blueshift

Sample Metadata - bmxp.SMDATA

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes