Skip to main content

MatchMiner-AI package

Project description

matchminer-ai

Overview

matchminer-ai is a Python package for running the clinical trial matching inference workflow described in Altreuter et al., MatchMiner-AI: An Open-Source Solution for Cancer Clinical Trial Matching. The package provides modular functions for the core MatchMiner-AI workflow: summarizing trials and patient histories, generating embeddings of each, retrieving candidate matches, scoring match quality, and assessing exclusion criteria.

This package is currently pre-v1 and under active development. APIs, configuration options, and outputs may change.

Compute requirements

The most compute-intensive step is summarizing patient notes with the default Gemma 4 language model. Full pipeline runs can use either a local high-memory GPU environment, such as an NVIDIA H100 80GB, or a compatible remote vLLM inference server configured with the Gemma 4 reasoning parser. See the example notebook for details on these two options.

Other entry points, such as running from precomputed summaries, may require less compute.

Installation

This package requires Python 3.13+.

pip install matchminer-ai

Quickstart

See the example notebook for a full walkthrough using sample input data: example notebook

Citation

If you use matchminer-ai, please cite:

Altreuter J, Trukhanov P, Paul MA, Hassett MJ, Riaz IB, Afzal MU, Mohammed AA, Sammons S, Lindsay J, Mallaber E, Klein HR, Gungor G, Galvin M, Deletto M, Van Nostrand SC, Provencher J, Yu J, Tahir N, Wischhusen J, Kozyreva O, Ortiz T, Tuncer H, Masri JE, Malcolm A, Mazor T, Cerami E, Kehl KL. MatchMiner-AI: An Open-Source Solution for Cancer Clinical Trial Matching. arXiv. 2026. doi: 10.48550/arXiv.2412.17228

Contributing

Clone the repository and install the package in editable mode. We recommend working in a virtual or conda environment.

git clone https://github.com/dfci/matchminer-ai-inference.git
cd matchminer-ai-inference
python -m venv .venv
source .venv/bin/activate
pip install -e ".[dev]"

This repository uses pre-commit for local code quality checks. To enable the hooks:

pre-commit install

Run the test suite with:

# lightweight tests
pytest
# tests requiring GPU
pytest -m resource_heavy

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

matchminer_ai-0.2.1.tar.gz (68.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

matchminer_ai-0.2.1-py3-none-any.whl (64.6 kB view details)

Uploaded Python 3

File details

Details for the file matchminer_ai-0.2.1.tar.gz.

File metadata

  • Download URL: matchminer_ai-0.2.1.tar.gz
  • Upload date:
  • Size: 68.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.12

File hashes

Hashes for matchminer_ai-0.2.1.tar.gz
Algorithm Hash digest
SHA256 92dba5628f68674ebd6617f811f211afd8d633c7f3259c429bcb86cf1dbd0ed8
MD5 9241e2def3d0d4e15bb5aafbaee9f6a9
BLAKE2b-256 aa464deec1296895181d6577a29caf185b1b327c0ad13ff907c242ffda596329

See more details on using hashes here.

File details

Details for the file matchminer_ai-0.2.1-py3-none-any.whl.

File metadata

  • Download URL: matchminer_ai-0.2.1-py3-none-any.whl
  • Upload date:
  • Size: 64.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.12

File hashes

Hashes for matchminer_ai-0.2.1-py3-none-any.whl
Algorithm Hash digest
SHA256 a7f4fb36ab68986e2e492609bcf016119ceef1036e70ba73599dd7341f485ca4
MD5 f290a2282675e88e0a093607c774e04d
BLAKE2b-256 cc25861024e167779a04d30b924bc4d751949923a863c657ec7ad63222cd88b2

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page