MedDiscover is an AI-powered tool that assists biomedical researchers by applying Retrieval-Augmented Generation (RAG) with fine-tuned LLMs to PubMed literature. It supports document retrieval, knowledge extraction, and interactive querying over biomedical research papers, so researchers can find relevant findings quickly. The package supports both GPU-based embeddings (MedCPT) and a CPU-friendly alternative (OpenAI Ada-002 embeddings), so it can be used with or without a GPU.

Project description

MedDiscover

MedDiscover is an AI-powered tool designed to assist biomedical researchers using RAG-LLM models fine-tuned on PubMed literature.

CLI evaluation (headless)

Install the package (or use it in editable mode), set your OPENAI_API_KEY, and run the built-in evaluator:

pip install .
export OPENAI_API_KEY=...
# optional: ALLOW_MEDCPT_CPU=1 to force MedCPT on CPU

meddiscover-eval \
  --pdfs med_discover_ai/eval_samples/sample_pdfs/fmed-11-1345659.pdf med_discover_ai/eval_samples/sample_pdfs/reviewer_comments_aug12.pdf \
  --qa_csv med_discover_ai/eval_samples/sample_qa.csv \
  --embedding_model "MedCPT (GPU Recommended)" \
  --llm_models gpt-4.1-mini \
  --k 3 \
  --max_tokens 64 \
  --out_dir ./eval_outputs_demo

  • For Ada-based retrieval, switch --embedding_model to "OpenAI Ada-002 (CPU/Cloud)". Keep the quotes, since the value contains spaces and parentheses.
  • RAGAS metrics are optional; if their dependencies are missing or the QA CSV lacks a reference column, the metrics are reported as None.
  • Re-ranking stays disabled on CPU; pass --rerank only when a GPU and a cross-encoder are available.
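
The Ada-based variant mentioned in the notes can be sketched as follows. It reuses the sample PDFs, QA CSV, and flags from the command above and changes only the embedding model. Several things here are assumptions: that the full label OpenAI Ada-002 (CPU/Cloud) is the value the flag expects (by analogy with "MedCPT (GPU Recommended)"), the output directory ./eval_outputs_ada (a hypothetical name), and the command -v guard, which is only a convenience and not part of the tool.

```shell
# Assumed flag value: the full label, quoted because it contains
# spaces and parentheses (unquoted parentheses are a shell syntax error).
EMBEDDING_MODEL="OpenAI Ada-002 (CPU/Cloud)"
# Hypothetical output directory, chosen to keep Ada results separate.
OUT_DIR=./eval_outputs_ada

# Run the evaluator only if the console script is on PATH.
if command -v meddiscover-eval >/dev/null 2>&1; then
  meddiscover-eval \
    --pdfs med_discover_ai/eval_samples/sample_pdfs/fmed-11-1345659.pdf med_discover_ai/eval_samples/sample_pdfs/reviewer_comments_aug12.pdf \
    --qa_csv med_discover_ai/eval_samples/sample_qa.csv \
    --embedding_model "$EMBEDDING_MODEL" \
    --llm_models gpt-4.1-mini \
    --k 3 \
    --max_tokens 64 \
    --out_dir "$OUT_DIR"
else
  echo "meddiscover-eval not found; install the package first (pip install .)" >&2
fi
```

Because --rerank is omitted, this run matches the CPU guidance above: retrieval uses the cloud embeddings and re-ranking stays off.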

Download files

Source Distribution

med_discover_ai-1.0.7.tar.gz (188.1 kB)

Uploaded Source

Built Distribution

med_discover_ai-1.0.7-py3-none-any.whl (192.7 kB)

Uploaded Python 3

File details

Details for the file med_discover_ai-1.0.7.tar.gz.

File metadata

  • Download URL: med_discover_ai-1.0.7.tar.gz
  • Upload date:
  • Size: 188.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for med_discover_ai-1.0.7.tar.gz
Algorithm Hash digest
SHA256 df5c488949e1cf60b1595ac845277c173edd8c9ad1c0c04244e3867b5c3cdb00
MD5 99928f7dbdc614e531e41f5a2ba41dc2
BLAKE2b-256 268d0c093e15e5c13b4854da24605d644933b75912f103347a752f2ce48a638a
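
A downloaded archive can be checked against the SHA256 digest listed above. The following is a minimal sketch, assuming GNU coreutils sha256sum is available (on macOS, shasum -a 256 is the usual substitute); verify_sha256 is a hypothetical helper name and not part of the package.

```shell
# verify_sha256 FILE EXPECTED_HEX
# Succeeds (exit 0) only if FILE's SHA256 digest equals EXPECTED_HEX.
# The body runs in a subshell, so the variable does not leak to the caller.
verify_sha256() (
  actual="$(sha256sum "$1" | awk '{print $1}')"
  [ "$actual" = "$2" ]
)

# Example call, with the source distribution in the current directory:
#   verify_sha256 med_discover_ai-1.0.7.tar.gz \
#     df5c488949e1cf60b1595ac845277c173edd8c9ad1c0c04244e3867b5c3cdb00 \
#     && echo "hash OK" || echo "hash MISMATCH"
```

If the file is missing or unreadable, the computed digest is empty and the check fails, which is the safe outcome.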

File details

Details for the file med_discover_ai-1.0.7-py3-none-any.whl.

File hashes

Hashes for med_discover_ai-1.0.7-py3-none-any.whl
Algorithm Hash digest
SHA256 a6d1a3ca0755d93f7e76453ce94c1d9aa6dccf634bf8668ada42d2a298c7b4d5
MD5 9738341c09010fa6eda3495a36541bdb
BLAKE2b-256 2de032c570ed316973a471246b5c553d1a2bcc1e1ceb7d3e84e452cd89b9731d
