
Merlin: A Vision Language Foundation Model for 3D Computed Tomography


Merlin is a 3D VLM for computed tomography that leverages both structured electronic health records (EHR) and unstructured radiology reports for pretraining.

⚡️ Installation

To install Merlin from PyPI, run:

pip install merlin-vlm

For an editable installation, use the following commands to clone and install this repository.

conda create --name merlin python==3.10
conda activate merlin

git clone https://github.com/StanfordMIMI/Merlin.git
cd Merlin
pip install -e .

# Alternatively, to install exact package versions as tested:
# uv sync
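
To confirm that either installation route worked, one option is to look up the installed distribution with the standard library. This is a minimal sketch, not part of Merlin itself: `installed_version` is a hypothetical helper name, and the only detail taken from the instructions above is the distribution name `merlin-vlm`.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(dist_name: str = "merlin-vlm") -> Optional[str]:
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return version(dist_name)
    except PackageNotFoundError:
        # The distribution is not installed in the active environment.
        return None


if __name__ == "__main__":
    found = installed_version()
    if found is None:
        print("merlin-vlm is not installed in this environment")
    else:
        print(f"merlin-vlm {found} is installed")
```

Run this inside the `merlin` conda environment (or whichever environment you installed into) so that the lookup checks the right set of packages.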

🚀 Inference with Merlin

To create a Merlin model with both image and text embeddings enabled, use the following:

from merlin import Merlin

model = Merlin()

To initialize the model with only image embeddings active, use:

from merlin import Merlin

model = Merlin(ImageEmbedding=True)

To initialize the model for phenotype classification, use:

from merlin import Merlin

model = Merlin(PhenotypeCls=True)

To initialize the model for five-year disease prediction, use:

from merlin import Merlin

model = Merlin(FiveYearPred=True)

To initialize the model for radiology report generation, use:

from merlin import Merlin

model = Merlin(RadiologyReport=True)
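
When the mode is chosen at runtime (for example from a config file), the constructor calls above can be gathered into a single lookup. The sketch below is illustrative only: the flag names are copied from the examples above, while the task labels, `TASK_FLAGS`, `merlin_kwargs`, and `build_model` are hypothetical names introduced here. It assumes one flag per mode, as in the examples; whether flags can be combined is not covered here.

```python
from typing import Any, Dict

# Flag names come from the constructor calls shown above.
# The task labels (dictionary keys) are hypothetical names for this sketch.
TASK_FLAGS: Dict[str, Dict[str, Any]] = {
    "image_text": {},                          # Merlin()
    "image_only": {"ImageEmbedding": True},    # Merlin(ImageEmbedding=True)
    "phenotype": {"PhenotypeCls": True},       # Merlin(PhenotypeCls=True)
    "five_year": {"FiveYearPred": True},       # Merlin(FiveYearPred=True)
    "report": {"RadiologyReport": True},       # Merlin(RadiologyReport=True)
}


def merlin_kwargs(task: str) -> Dict[str, Any]:
    """Return the constructor keyword arguments for a task label."""
    if task not in TASK_FLAGS:
        raise ValueError(f"Unknown task {task!r}; expected one of {sorted(TASK_FLAGS)}")
    # Return a copy so callers cannot mutate the shared table.
    return dict(TASK_FLAGS[task])


def build_model(task: str = "image_text"):
    """Construct a Merlin model for the given task label."""
    # Imported lazily so the lookup above works even without the package installed.
    from merlin import Merlin

    return Merlin(**merlin_kwargs(task))
```

For example, `build_model("report")` is equivalent to `Merlin(RadiologyReport=True)`.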

For inference on a demo CT scan, please check out the general demo and report generation demo.

For additional information, please read the inference documentation and report generation documentation.

📂 Merlin Abdominal CT Dataset

We are excited to release the Merlin Abdominal CT Dataset to the community!

For details on accessing and using the dataset, please see the download documentation!

📎 Citation

If you find this repository useful for your work, please cite the original paper:

@article{blankemeier2024merlin,
  title={Merlin: A vision language foundation model for 3d computed tomography},
  author={Blankemeier, Louis and Cohen, Joseph Paul and Kumar, Ashwin and Van Veen, Dave and Gardezi, Syed Jamal Safdar and Paschali, Magdalini and Chen, Zhihong and Delbrouck, Jean-Benoit and Reis, Eduardo and Truyts, Cesar and others},
  journal={Research Square},
  pages={rs--3},
  year={2024}
}

