A Scalable Modular Framework for Multimodal AI in Oncology
HoneyBee
Publication
HoneyBee has been published in npj Digital Medicine:
Tripathi, A., Waqas, A., Schabath, M.B. et al. HONeYBEE: enabling scalable multimodal AI in oncology through foundation model-driven embeddings. npj Digit. Med. 8, 622 (2025). https://doi.org/10.1038/s41746-025-02003-4
Overview
HoneyBee is a multimodal AI framework for oncology research and clinical applications. It integrates and processes four medical data types (clinical text, radiology images, pathology slides, and molecular data) through a unified, modular architecture. It is built for scalability and extensibility, so researchers can develop AI models for cancer diagnosis, prognosis, and treatment planning.
Warning (alpha release): This framework is currently in alpha. APIs may change, and some features are still under development.
Key Features
- Multimodal data support: clinical text, radiology (DICOM/NIfTI), pathology (WSI), and molecular data
- 3-layer modular architecture: clean separation between loaders, processors, and embedding models
- Clinical NLP pipeline: OCR, cancer entity extraction, temporal parsing, and medical ontology mapping
- Whole Slide Image processing: tissue detection, patch extraction, stain normalization, and quality filtering
- State-of-the-art embedding models: GatorTron, BioBERT, PubMedBERT, UNI, REMEDIS, RadImageNet, and more
- Cross-modal integration: unified patient-level representations from multiple data modalities
- Survival analysis: Cox PH, Random Survival Forest, and DeepSurv
- Similar patient retrieval: find patients with matching clinical profiles
- Interactive visualization: t-SNE dashboards for embedding exploration
- GPU-accelerated: cuCIM backend for WSI processing, with OpenSlide as a fallback
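The separation between loaders, processors, and embedding models listed above can be pictured with a small sketch. This is only an illustration of the layering idea, not HoneyBee's actual API: every function and class name below (`load_text`, `split_sentences`, `toy_embed`, `Pipeline`) is hypothetical, and the "embedding model" is a hash-based stand-in rather than a real foundation model.

```python
import hashlib
from dataclasses import dataclass
from typing import Callable, List, Sequence


# NOTE: all names here are hypothetical; this is NOT HoneyBee's API.
def load_text(raw: str) -> str:
    """Loader layer: bring raw input into memory (here, just trim whitespace)."""
    return raw.strip()


def split_sentences(text: str) -> List[str]:
    """Processor layer: turn loaded data into model-ready units."""
    return [part.strip() for part in text.split(".") if part.strip()]


def toy_embed(units: Sequence[str], dim: int = 4) -> List[List[float]]:
    """Embedding layer: stand-in for a real model; hashes text to a small vector."""
    vectors = []
    for unit in units:
        digest = hashlib.sha256(unit.encode("utf-8")).digest()
        vectors.append([byte / 255.0 for byte in digest[:dim]])
    return vectors


@dataclass
class Pipeline:
    """Chains the three layers; each one can be swapped independently."""
    loader: Callable[[str], str]
    processor: Callable[[str], List[str]]
    embedder: Callable[[Sequence[str]], List[List[float]]]

    def run(self, raw: str) -> List[List[float]]:
        return self.embedder(self.processor(self.loader(raw)))


pipeline = Pipeline(load_text, split_sentences, toy_embed)
embeddings = pipeline.run("  Stage II adenocarcinoma. Started chemotherapy in March.  ")
print(len(embeddings), len(embeddings[0]))  # one vector per sentence, 4 values each
```

Because each layer is passed in as a plain callable, replacing the toy embedder with a different model would not require touching the loader or the processor, which is the point of keeping the layers separate.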
Quick Start
System Dependencies
# Ubuntu/Debian
sudo apt-get install -y openslide-tools tesseract-ocr
# macOS
brew install openslide tesseract
Installation
pip install honeybee-ml
python -c "import nltk; nltk.download('punkt'); nltk.download('punkt_tab')"
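To confirm that the install succeeded, one option is to query the installed distribution's metadata from the standard library. The sketch below assumes only the distribution name `honeybee-ml` used in the `pip` command above; it deliberately does not import the package, since the import name is not stated here. The helper name `installed_version` is made up for this example.

```python
from importlib import metadata
from typing import Optional


def installed_version(distribution: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return metadata.version(distribution)
    except metadata.PackageNotFoundError:
        return None


version = installed_version("honeybee-ml")
print(version if version else "honeybee-ml is not installed in this environment")
```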
Optional Extras
| Extra | Command | Includes |
|---|---|---|
| Clinical | `pip install honeybee-ml[clinical]` | NLP, OCR, and text processing dependencies |
| Pathology | `pip install honeybee-ml[pathology]` | WSI loading and image processing |
| Molecular | `pip install honeybee-ml[molecular]` | Genomics and expression data support |
| All | `pip install honeybee-ml[all]` | Everything above |
Research Applications
HoneyBee has been successfully applied to:
- Cancer Subtype Classification: Automated identification of cancer subtypes from multimodal data
- Survival Prediction: Risk stratification and outcome prediction for treatment planning
- Similar Patient Retrieval: Finding patients with similar clinical profiles for precision medicine
- Biomarker Discovery: Identifying multimodal patterns associated with treatment response
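As a rough illustration of similar patient retrieval, the sketch below ranks a cohort by cosine similarity to a query embedding. It assumes each patient is already represented by a fixed-length vector; the patient IDs, the three-dimensional vectors, and the function names are invented for the example, and this is not necessarily how HoneyBee implements retrieval.

```python
import math
from typing import Dict, List, Sequence, Tuple


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two vectors; 0.0 if either has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def most_similar(
    query: Sequence[float],
    cohort: Dict[str, Sequence[float]],
    k: int = 2,
) -> List[Tuple[str, float]]:
    """Return the k cohort members with the highest similarity to the query."""
    scored = [(pid, cosine_similarity(query, vec)) for pid, vec in cohort.items()]
    scored.sort(key=lambda item: item[1], reverse=True)
    return scored[:k]


# Toy patient-level embeddings (made up for illustration only).
cohort = {
    "patient_a": [1.0, 0.0, 0.0],
    "patient_b": [0.9, 0.1, 0.0],
    "patient_c": [0.0, 1.0, 0.0],
}
query = [1.0, 0.05, 0.0]

top = most_similar(query, cohort, k=2)
for patient_id, score in top:
    print(patient_id, round(score, 4))
```

For large cohorts, a brute-force scan like this would typically be replaced by a vectorized or approximate nearest-neighbor search, but the ranking criterion stays the same.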
License
See the LICENSE file for details.
Citation
If you use HoneyBee in your research, please cite our paper:
Tripathi, A., Waqas, A., Schabath, M.B. et al. HONeYBEE: enabling scalable multimodal AI in
oncology through foundation model-driven embeddings. npj Digit. Med. 8, 622 (2025).
https://doi.org/10.1038/s41746-025-02003-4
File details
Details for the file honeybee_ml-0.2.0.tar.gz.
File metadata
- Download URL: honeybee_ml-0.2.0.tar.gz
- Upload date:
- Size: 145.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.8.22
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 1a916cd0c6c76b1aac4da63a23052e8d3eeb8d993d16cad026111c08ec2c6079 |
| MD5 | fe1cffb890b2b8442a84833e49b51fa4 |
| BLAKE2b-256 | 624c67282e4b4866b0c0ab1786ddf6688187d63fb47031e1676bef5b2b0a3f27 |
File details
Details for the file honeybee_ml-0.2.0-py3-none-any.whl.
File metadata
- Download URL: honeybee_ml-0.2.0-py3-none-any.whl
- Upload date:
- Size: 166.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.8.22
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 3dea90caeb64a545adfcb3f41898318636c3d03b5187a1922288caf56e3c5e96 |
| MD5 | effc58b88c767c8b3e49afc8ab94e125 |
| BLAKE2b-256 | 7112548fc68f5e4e8f5aa2ab6acb181b0913fe5c1b98511465a646ec1fbe44a9 |
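A downloaded file can be checked against the SHA256 digests listed above with the standard library. The sketch below is one way to do it; the helper names and the assumption that the files sit in the current working directory are illustrative, while the digests are copied from the hash tables for the two 0.2.0 files.

```python
import hashlib
from pathlib import Path

# SHA256 digests copied from the file hash tables above.
EXPECTED_SHA256 = {
    "honeybee_ml-0.2.0.tar.gz":
        "1a916cd0c6c76b1aac4da63a23052e8d3eeb8d993d16cad026111c08ec2c6079",
    "honeybee_ml-0.2.0-py3-none-any.whl":
        "3dea90caeb64a545adfcb3f41898318636c3d03b5187a1922288caf56e3c5e96",
}


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Hash a file in chunks so large archives need not fit in memory."""
    hasher = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest()


def matches(path: Path, expected_hex: str) -> bool:
    """Compare a file's digest to the published one, ignoring letter case."""
    return sha256_of(path) == expected_hex.lower()


# Assumption: the files were downloaded into the current directory.
for name, expected in EXPECTED_SHA256.items():
    candidate = Path(name)
    if candidate.exists():
        print(name, "OK" if matches(candidate, expected) else "MISMATCH")
    else:
        print(name, "not found, skipping")
```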