MatchMiner-AI package
Project description
matchminer-ai
Overview
matchminer-ai is a Python package for running the clinical trial matching inference workflow described in Altreuter et al., MatchMiner-AI: An Open-Source Solution for Cancer Clinical Trial Matching. The package provides modular functions for the core MatchMiner-AI workflow: summarizing trials and patient histories, generating embeddings of each, retrieving candidate matches, scoring match quality, and assessing exclusion criteria.
This package is currently pre-v1 and under active development. APIs, configuration options, and outputs may change.
Compute requirements
The most compute-intensive step is summarizing patient notes with the default Gemma 4 language model. Full pipeline runs can use either a local high-memory GPU environment, such as an NVIDIA H100 80GB, or a compatible remote vLLM inference server configured with the Gemma 4 reasoning parser. See the example notebook for details on these two options.
Other entry points, such as running from precomputed summaries, may require less compute.
Installation
This package requires Python 3.13+.
pip install matchminer-ai
Quickstart
See the example notebook for a full walkthrough using sample input data: example notebook
Citation
If you use matchminer-ai, please cite:
Altreuter J, Trukhanov P, Paul MA, Hassett MJ, Riaz IB, Afzal MU, Mohammed AA, Sammons S, Lindsay J, Mallaber E, Klein HR, Gungor G, Galvin M, Deletto M, Van Nostrand SC, Provencher J, Yu J, Tahir N, Wischhusen J, Kozyreva O, Ortiz T, Tuncer H, Masri JE, Malcolm A, Mazor T, Cerami E, Kehl KL. MatchMiner-AI: An Open-Source Solution for Cancer Clinical Trial Matching. arXiv. 2026. doi: 10.48550/arXiv.2412.17228
Contributing
Clone the repository and install the package in editable mode. We recommend working in a virtual or conda environment.
git clone https://github.com/dfci/matchminer-ai-inference.git
cd matchminer-ai-inference
python -m venv .venv
source .venv/bin/activate
pip install -e ".[dev]"
This repository uses pre-commit for local code quality checks. To enable the hooks:
pre-commit install
Run the test suite with:
# lightweight tests
pytest
# tests requiring GPU
pytest -m resource_heavy
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file matchminer_ai-0.2.1.tar.gz.
File metadata
- Download URL: matchminer_ai-0.2.1.tar.gz
- Upload date:
- Size: 68.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
92dba5628f68674ebd6617f811f211afd8d633c7f3259c429bcb86cf1dbd0ed8
|
|
| MD5 |
9241e2def3d0d4e15bb5aafbaee9f6a9
|
|
| BLAKE2b-256 |
aa464deec1296895181d6577a29caf185b1b327c0ad13ff907c242ffda596329
|
File details
Details for the file matchminer_ai-0.2.1-py3-none-any.whl.
File metadata
- Download URL: matchminer_ai-0.2.1-py3-none-any.whl
- Upload date:
- Size: 64.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
a7f4fb36ab68986e2e492609bcf016119ceef1036e70ba73599dd7341f485ca4
|
|
| MD5 |
f290a2282675e88e0a093607c774e04d
|
|
| BLAKE2b-256 |
cc25861024e167779a04d30b924bc4d751949923a863c657ec7ad63222cd88b2
|