Medical RAG with Asset-Aware MCP - Precise PDF asset retrieval (tables, figures, sections) for AI Agents

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

These details have not been verified by PyPI

Project description

asset-aware-mcp

🏥 Medical RAG with Asset-Aware MCP - Precise PDF asset retrieval (tables, figures, sections) and Knowledge Graph for AI Agents.

🌐 繁體中文

🎯 Why Asset-Aware MCP?

AI cannot directly read image files on your computer. This is a common misconception.

Method	Can AI analyze image content?	Description
❌ Provide PNG path	No	AI cannot access the local file system
✅ Asset-Aware MCP	Yes	Retrieves Base64 via MCP, allowing AI vision to understand directly

Real-world Effect

# After retrieving the image via MCP, the AI can analyze it directly:

User: What is this figure about?

AI: This is the architecture diagram for Scaled Dot-Product Attention:
    1. Inputs: Q (Query), K (Key), V (Value)
    2. MatMul of Q and K
    3. Scale (1/√dₖ)
    4. Optional Mask (for decoder)
    5. SoftMax normalization
    6. Final MatMul with V to get the output

This is the value of Asset-Aware MCP - enabling AI Agents to truly "see" and understand charts and tables in your PDF literature.

✨ Features

📄 Asset-Aware ETL - PDF → Markdown with dual-engine PDF parsing:
- PyMuPDF (default) - Fast extraction (~50MB)
- Marker (optional, use_marker=True) - High-precision structured parsing with blocks.json (bbox/coordinates)
🧩 Unified Segmentation Export - Normalized segmentation.json merges manifest, blocks, reading order, and persisted markdown line spans for downstream tools and extensions.
🖼️ Layout Overlay Debugging - Render page overlays from original.pdf to inspect bbox, segment type, and reading order visually.
🔤 On-Demand OCR Preprocessing - Optional ocrmypdf preprocessing path for scanned PDFs before ETL.
🧭 Section Navigation - Dynamic hierarchy section tree with 5 tools: browse, search, detail, content reading, and block extraction for any depth of headings.
🔄 Async Job Pipeline - Supports asynchronous task processing and progress tracking for large documents.
🗺️ Document Manifest - Provides a structured "map" of the document for precise data access by Agents.
🧠 LightRAG Integration - Knowledge Graph + Vector Index, supporting cross-document comparison and reasoning.
🧾 Citation-Aware KG Output - consult_knowledge_graph now supports structured answer/reference payloads for downstream agent workflows.
📝 Docx Editing (DFM) - Edit .docx files in Markdown via Docx-Flavored Markdown format. Supports legacy .doc, .odt, and .ods ingest via LibreOffice auto-conversion. 14 tools: ingest, read, save, list, delete, export, strict round-trip validation, DOCX→PDF, DOCX→DOC, DOCX→ODT, and Docx ↔ A2T bridges.
🛡️ DFM Integrity Checker - Automatic validation and auto-repair at every pipeline stage (post-ingest, pre-save, post-save). Catches orphan markers, column mismatches, and format inconsistencies.
📊 A2T (Anything to Table) - 7 operation-based tools for building professional tables from any source (PDF assets, Knowledge Graph, URLs, user input). Features: Citations (AssetRef), Audit Trail, Schema Evolution, Templates, Drafting, and Token-efficient resumption.
🖥️ VS Code Management Extension - Graphical interface for monitoring server status, ingested documents, and A2T tables/drafts with one-click Excel export.
🔌 MCP Server - Exposes tools and resources to Copilot/Claude via FastMCP.
🏥 Medical Research Focus - Optimized for medical literature, supporting Base64 image transmission for Vision AI analysis.

🏗️ Architecture

Asset-Aware MCP Architecture

┌─────────────────────────────────────────────────────────┐
│                    AI Agent (Copilot)                   │
└─────────────────────┬───────────────────────────────────┘
                      │ MCP Protocol (Tools & Resources)
┌─────────────────────▼───────────────────────────────────┐
│            MCP Server (Modular Presentation)            │
│  ┌─────────────────────────────────────────────────┐   │
│  │ tools/: 48 tools in 7 modules                   │   │
│  │   document (11) │ docx (14) │ section (5)       │   │
│  │   job (3) │ knowledge (2) │ table (7) │ profile (5) │
│  └─────────────────────────────────────────────────┘   │
│  ┌─────────────────────────────────────────────────┐   │
│  │ resources/: 13 resources in 2 modules           │   │
│  └─────────────────────────────────────────────────┘   │
└─────────────────────┬───────────────────────────────────┘
                      │
┌─────────────────────▼───────────────────────────────────┐
│                  ETL Pipeline (DDD)                     │
│  ┌──────────┐  ┌──────────┐  ┌──────────┐              │
│  │ PyMuPDF  │  │  Asset   │  │ LightRAG │              │
│  │ Adapter  │→ │  Parser  │→ │  Index   │              │
│  └──────────┘  └──────────┘  └──────────┘              │
└─────────────────────┬───────────────────────────────────┘
                      │
┌─────────────────────▼───────────────────────────────────┐
│                   Local Storage                         │
│  ./data/                                                │
│  ├── doc_{id}/        # Document Assets                 │
│  ├── docx_{id}/       # Docx IR + DFM + Assets          │
│  ├── tables/          # A2T Tables (JSON/MD/XLSX)       │
│  │   └── drafts/      # Table Drafts (Persistence)      │
│  └── lightrag_db/     # Knowledge Graph                 │
└─────────────────────────────────────────────────────────┘

📁 Project Structure (DDD)

asset-aware-mcp/
├── src/
│   ├── domain/              # 🔵 Domain: Entities, Value Objects, Interfaces
│   ├── application/         # 🟢 Application: Doc Service, Table Service (A2T), Asset Service
│   ├── infrastructure/      # 🟠 Infrastructure: PyMuPDF, LightRAG, Excel Renderer
│   └── presentation/        # 🔴 Presentation: MCP Server (FastMCP)
├── data/                    # Document and Asset Storage
├── docs/
│   └── spec.md              # Technical Specification
├── tests/                   # Unit and Integration Tests
├── vscode-extension/        # VS Code Management Extension
└── pyproject.toml           # uv Project Config

📐 Architecture Diagrams

Visual overview for the project. All diagrams use consistent GitHub README style.

Diagram	Description
01 — System Architecture	Full stack: Telegram → Gateway → MCP Adapter → 3 MCP servers → Ollama
02 — Data Layout	48 tools organized in 7 categories with asset-aware data tree
03 — PDF Ingestion Pipeline	7-stage flow from PDF upload to knowledge graph
04 — DOCX Bidirectional Edit	DOCX ingest → TableContext edit → round-trip save workflow
05 — Knowledge Graph Search	Cross-document search with 3 parallel query paths
06 — Installation Steps	7-step installation from clone to verification
07 — PDF ETL Pipeline	Dual-engine parsing: PyMuPDF + Marker
08 — KG Architecture	lightrag-hku 3-layer KG architecture
08 — KG Architecture	lightrag-hku 3-layer KG architecture

💡 All generation prompts are saved in docs/diagrams/prompts/README.md for style consistency and regeneration.

🚀 Quick Start

# Install dependencies (using uv) — default install skips Marker/torch
uv sync

# Optional: install Marker backend only if you need structured parsing
uv sync --extra marker

# Run MCP Server
uv run python -m src.presentation.server

# Or use the VS Code extension for graphical management

Runtime note: The VS Code extension prefers a managed Python 3.11 runtime when launching the MCP server via uv or uvx. This avoids native package builds on end-user machines, especially macOS systems without Xcode Command Line Tools, while keeping the project itself compatible with newer Python versions.

Installation scope note:

The VS Code extension installs once per user (global). The MCP server launched through uvx asset-aware-mcp reuses the user uv cache rather than reinstalling per workspace.
Runtime data stays with your repo: .env and assetAwareMcp.dataDir default to ./data, so ingested assets remain scoped to the current workspace.

Marker note: marker-pdf is now an optional dependency because it may pull in torch, surya, and platform-specific ML wheels. Default installs use the PyMuPDF backend only. Enable Marker only when you need use_marker=True or parse_pdf_structure.

🔌 MCP Tools

Document & Asset Tools

Tool	Purpose
`ingest_documents`	Process PDF files with optional Marker backend (`use_marker=True` for blocks.json)
`list_documents`	List all ingested documents and their asset counts
`delete_document`	Delete an ingested PDF, its local artifacts, and LightRAG index entries when enabled
`convert_pdf_to_docx`	Reconstruct a readable DOCX from extracted PDF content
`convert_pdf_to_pptx`	Rebuild editable PPTX slides from extracted PDF markdown and figures
`inspect_document_manifest`	Inspect document structure before fetching specific assets
`fetch_document_asset`	Precisely retrieve tables (MD) / figures (B64) / sections
`parse_pdf_structure`	Run high-precision Marker parsing and emit structured blocks
`search_source_location`	Search exact source locations with page + bbox for verification
`export_document_segmentation`	Export normalized `segmentation.json` with reading order + line ranges
`visualize_document_layout`	Render page overlay images for bbox / type / reading-order inspection
`ocr_pdf_document`	Run OCR preprocessing and generate a cleaned PDF for later ETL

Job Management Tools

Tool	Purpose
`get_job_status`	Get async ingestion job progress and final result
`list_jobs`	List active or historical ETL jobs
`cancel_job`	Cancel a running ETL job

Knowledge Graph Tools

Tool	Purpose
`consult_knowledge_graph`	Citation-aware knowledge graph query with `structured`, `data`, and `text` response modes
`export_knowledge_graph`	Export graph summary / JSON / Mermaid for inspection

Knowledge graph note:

consult_knowledge_graph defaults to response_mode="structured" and can return answer, references, metadata, retrieval, and counts for agent-side citation workflows.
Use response_mode="data" when you want retrieval payloads without final answer synthesis, or response_mode="text" for legacy plain-text behavior.

Section Navigation Tools (Dynamic Hierarchy)

Tool	Purpose
`list_section_tree`	Display complete section hierarchy tree (supports any depth)
`get_section_detail`	Get detailed info for a specific section
`get_section_blocks`	Extract all blocks from a section with page + bbox
`search_sections`	Search section titles
`get_section_content`	Read section content via asset service

Docx Editing Tools (DFM — Docx-Flavored Markdown)

Edit .docx files as Markdown. Preserves formatting, tables, media on round-trip.

Tool	Purpose
`ingest_docx`	Import .docx and decompose into DFM blocks
`get_docx_content`	Read DFM content of specific blocks
`save_docx`	Write DFM edits back to .docx
`list_docx_blocks`	List document block structure
`list_docx_documents`	List all ingested DOCX/DFM documents
`delete_docx`	Delete an ingested DOCX/DFM document and its local artifacts
`convert_docx_to_pdf`	Export the current DOCX/DFM state to PDF in fidelity mode
`convert_docx_to_doc`	Export the current DOCX/DFM state to DOC in fidelity mode
`docx_validate_roundtrip`	6-dimension round-trip fidelity validation + file-level comparison (SHA-256, ZIP diff)
`docx_table_to_context`	Bridge: Docx table → A2T context
`docx_table_from_context`	Bridge: A2T table → Docx table
`docx_chart_data`	Extract chart data from Docx
`export_markdown`	Export Markdown to .docx/.pdf/.doc

A2T (Anything to Table) Tools — 7 Operation-Based Tools

Agent-friendly design: each tool handles multiple operations via operation parameter. Tables accept any source — PDF assets, KG entities, external URLs, or user input.

Tool	Operations	Purpose
`plan_table`	`schema` / `templates` / `from_template`	Schema planning, browse 4 built-in templates, create from template
`table_manage`	`create` / `delete` / `list` / `preview` / `resume` / `render` / `add_column` / `remove_column` / `rename_column`	Table lifecycle + Schema evolution
`table_data`	`add_rows` / `get_row` / `update_row` / `delete_row` / `get_cell` / `update_cell` / `clear_cell`	Row & cell CRUD
`table_cite`	`add` / `get` / `remove` / `cell_history`	Citation management with AssetRef (7 source types)
`table_history`	`changes` / `tokens`	Audit trail & token estimation
`table_draft`	`create` / `update` / `add_rows` / `resume` / `commit` / `list` / `delete`	Draft workflow with persistence
`discover_sources`	—	Cross-document source discovery (sections, tables, figures, KG)

ETL Profile Tools

Different journals/formats need different extraction settings. Use these tools to switch profiles.

Tool	Purpose
`list_etl_profiles`	List all available profiles (default, arxiv, nature, ieee, elsevier)
`get_etl_profile`	Get detailed configuration of a specific profile
`get_current_etl_profile`	Show currently active profile
`set_etl_profile`	Switch profile for subsequent document ingestion
`load_etl_profile_from_json`	Load custom profile from JSON file

🔧 Tech Stack

Category	Technology
Language	Python 3.10+
Package Manager	uv (all pip/setup-python removed)
ETL	PyMuPDF (fitz) + Marker (optional, high-precision)
RAG	LightRAG (lightrag-hku)
MCP	FastMCP
Storage	Local filesystem (JSON/Markdown/PNG)

📋 Documentation

Installation guidance:

Default install: uv sync
Install Marker backend only when needed: uv sync --extra marker
Safer extension Marker setup: enable Marker backend in settings and keep torchBackend=cpu unless you explicitly need GPU wheels
Technical Spec - Detailed technical specification
Architecture - System architecture
Constitution - Project principles
Competitive Analysis - MCP + DOCX ecosystem landscape

📄 License

Apache License 2.0

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

u9401066

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.6.13

Apr 24, 2026

0.6.12

Apr 24, 2026

This version

0.6.11

Apr 23, 2026

0.6.10

Apr 23, 2026

0.6.8

Apr 19, 2026

0.6.4

Apr 13, 2026

0.6.3

Mar 23, 2026

0.6.2

Mar 19, 2026

0.6.0

Mar 18, 2026

0.5.2

Mar 18, 2026

0.5.1

Mar 14, 2026

0.5.0

Mar 14, 2026

0.4.2

Mar 9, 2026

0.2.10

Feb 9, 2026

0.2.5

Jan 5, 2026

0.2.4

Jan 5, 2026

0.2.3

Jan 5, 2026

0.2.2

Jan 5, 2026

0.2.1

Jan 5, 2026

0.2.0

Jan 5, 2026

0.1.1

Jan 5, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

asset_aware_mcp-0.6.11.tar.gz (27.3 MB view details)

Uploaded Apr 23, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

asset_aware_mcp-0.6.11-py3-none-any.whl (254.5 kB view details)

Uploaded Apr 23, 2026 Python 3

File details

Details for the file asset_aware_mcp-0.6.11.tar.gz.

File metadata

Download URL: asset_aware_mcp-0.6.11.tar.gz
Upload date: Apr 23, 2026
Size: 27.3 MB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for asset_aware_mcp-0.6.11.tar.gz
Algorithm	Hash digest
SHA256	`2b8e5c643050d79646a1e0b1c062a1c6af1568192edcb7164193013070107c03`
MD5	`a8c225442371e3930b14f7f676fcb66e`
BLAKE2b-256	`648d9c279a956fc9fca58cb34316f835f7cfd0cdf6f87e95980f2d9d0259a52e`

See more details on using hashes here.

Provenance

The following attestation bundles were made for asset_aware_mcp-0.6.11.tar.gz:

Publisher: release.yml on u9401066/asset-aware-mcp

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: asset_aware_mcp-0.6.11.tar.gz
- Subject digest: 2b8e5c643050d79646a1e0b1c062a1c6af1568192edcb7164193013070107c03
- Sigstore transparency entry: 1361570940
- Sigstore integration time: Apr 23, 2026
Source repository:
- Permalink: u9401066/asset-aware-mcp@fc835cf06f5a83a806af6ea2545c6e3bc9da69a5
- Branch / Tag: refs/tags/v0.6.11
- Owner: https://github.com/u9401066
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@fc835cf06f5a83a806af6ea2545c6e3bc9da69a5
- Trigger Event: push

File details

Details for the file asset_aware_mcp-0.6.11-py3-none-any.whl.

File metadata

Download URL: asset_aware_mcp-0.6.11-py3-none-any.whl
Upload date: Apr 23, 2026
Size: 254.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for asset_aware_mcp-0.6.11-py3-none-any.whl
Algorithm	Hash digest
SHA256	`81ccce614f84e8d4ec49933751f7131aaab09f5e7f2ff9ca5c3b7ae99c085a41`
MD5	`e35d79f06093bd3b8feb40396ab87cf8`
BLAKE2b-256	`59683fddd0c9bd8ebb5ae5b032e7bef36e1ddb2a243807a4cc907ed16fcaa3c6`

See more details on using hashes here.

Provenance

The following attestation bundles were made for asset_aware_mcp-0.6.11-py3-none-any.whl:

Publisher: release.yml on u9401066/asset-aware-mcp

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: asset_aware_mcp-0.6.11-py3-none-any.whl
- Subject digest: 81ccce614f84e8d4ec49933751f7131aaab09f5e7f2ff9ca5c3b7ae99c085a41
- Sigstore transparency entry: 1361570958
- Sigstore integration time: Apr 23, 2026
Source repository:
- Permalink: u9401066/asset-aware-mcp@fc835cf06f5a83a806af6ea2545c6e3bc9da69a5
- Branch / Tag: refs/tags/v0.6.11
- Owner: https://github.com/u9401066
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@fc835cf06f5a83a806af6ea2545c6e3bc9da69a5
- Trigger Event: push

asset-aware-mcp 0.6.11

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

asset-aware-mcp

🎯 Why Asset-Aware MCP?

Real-world Effect

✨ Features

🏗️ Architecture

📁 Project Structure (DDD)

📐 Architecture Diagrams

🚀 Quick Start

🔌 MCP Tools

Document & Asset Tools

Job Management Tools

Knowledge Graph Tools

Section Navigation Tools (Dynamic Hierarchy)

Docx Editing Tools (DFM — Docx-Flavored Markdown)

A2T (Anything to Table) Tools — 7 Operation-Based Tools

ETL Profile Tools

🔧 Tech Stack

📋 Documentation

📄 License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance