Skip to main content

Develop MCP Servers for automate the work processes of the Clinical Data Manager.

Project description

Clinical Data Management MCP Server

A unified MCP (Model Context Protocol) server for automating Clinical Data Manager (CDM) workflows. Provides Excel comparison, DVS validation, PDF comparison, and more through any MCP-compatible AI agent.


Available Tools

Excel & DB Spec

  • read_excel_info: Reads an Excel file and returns its structure — sheet names, column names, row counts, and a preview of the first few rows. Use this first when sheet names or column names are unknown.
  • compare_common_excel: Compares specific sheets of two Excel files and generates a difference report. Requires specifying the sheet name, header row, and primary key column.
  • compare_db_spec: Compares full CDM DB Spec workbooks. Automatically detects header positions and updates the Revision History sheet.

DVS (Data Validation Specifications)

  • download_dvs_grammar: Copies the bundled DVS Specification Grammar reference file (DVS_SPEC_GRAMMAR.md) to a specified directory. Use this when a user needs the grammar definition for writing or reviewing DVS Specifications column entries.
  • validate_dvs_specifications: Reads a DVS Excel file, validates all Specification entries against the DVS grammar, and generates a detailed validation report. Reports syntax errors and auto-corrected suggestions where possible.
  • check_dvs_consistency (requires MCP Sampling): Checks whether the natural language Description is logically consistent with the formal Specification for each DVS entry. Results are written to a new Excel report.

Google Calendar Integration

  • download_timeline_template: Copies the bundled DM Timeline Excel template to a specified directory. Use this before filling in timeline data and calling make_google_calendar_csv.
  • make_google_calendar_csv: Extracts key schedules from a Timeline Excel file and converts them into a CSV formatted for Google Calendar import.

Blank eCRF PDF Comparison

  • compare_ecrf_pdf_single: Compares two Blank eCRF PDF files and produces a single side-by-side comparison PDF. Changed pages are shown with Before (left) and After (right) layouts. Annotation colors: red = deleted, green = added, yellow = modified.

DB Spec Validation

  • validate_db_spec: Validates a DB Specification Excel file against CDASH standards (CDASHIG v2.3 / SDTMIG v3.4). Runs three checks: domain validity, ITEM ID suffix vs. DB TYPE consistency, and CODE field duplicate code numbers. Generates an Excel validation report with per-check result sheets and a summary.
  • generate_cdash_variables: Generates or updates the CDASH variables JSON reference from a CDASH Model Excel file. Use this when a new version of the CDASH Model is released. Supports merge mode to add new entries without replacing existing data.

Installation

Windows users: This server must be run inside WSL (Windows Subsystem for Linux). All commands below should be run in a WSL terminal, not PowerShell or CMD. If WSL is not installed, follow Microsoft's WSL setup guide first.

After installing WSL, open a WSL terminal and execute below commands.

Step 1. Install uv

uv is a fast Python package manager. Installing it also gives you uvx, which runs packages without a separate Python installation.

Mac / Linux / WSL

curl -LsSf https://astral.sh/uv/install.sh | sh

Step 2. Set Up Your AI Agent

Follow the section for your AI agent. Each section covers registration, skill installation, and connection verification.


Claude Code

Register the MCP server

claude mcp add --scope user cdm-server uvx cdm-mcp-server@latest

Install skills (optional)

uvx --from cdm-mcp-server cdm-install-skills --tool claude

Skills are installed into .claude/skills/ in your project directory.

Verify the connection

claude mcp list

Gemini CLI

Prerequisite — Node.js

Gemini CLI requires Node.js. Install it via nvm if you don't have it.

Mac / Linux

curl -o- https://raw.githubusercontent.com/nvm-sh/nvm/v0.40.3/install.sh | bash
nvm install --lts

Windows — Download and install nvm-windows, then:

nvm install lts
nvm use lts

Install Gemini CLI and register the MCP server

npm install -g @google/gemini-cli
gemini mcp add cdm-server uvx --args cdm-mcp-server@latest --scope user

Install skills (optional)

uvx --from cdm-mcp-server cdm-install-skills --tool gemini

Skills are installed into .gemini/skills/ in your project directory.

Verify the connection

gemini mcp list

Codex CLI

Register the MCP server

codex mcp add cdm-server -- uvx cdm-mcp-server@latest

Install skills (optional)

uvx --from cdm-mcp-server cdm-install-skills --tool codex

Skills are installed into .agents/skills/ in your project directory.

Verify the connection

codex mcp list

You should see cdm-server listed as connected.

Install skills for all agents at once

If you use multiple AI tools in the same project directory, run:

uvx --from cdm-mcp-server cdm-install-skills --all

Usage

Once connected, simply describe what you need in natural language:

"Compare the previous and current DB Spec files and generate a difference report."

"Validate the DVS file and report any syntax errors."

"Download the Timeline template, then convert the completed file into a Google Calendar CSV."

The AI agent will automatically select and call the appropriate tool.


Troubleshooting

MCP server shows as disconnected

The most common cause is that uvx is not in the PATH when the AI tool launches the MCP server process.

1. Find the uvx location

which uvx

2. Add its directory to PATH

Replace ~/.local/bin with the directory from the output above:

echo 'export PATH="$HOME/.local/bin:$PATH"' >> ~/.bashrc && source ~/.bashrc

For zsh:

echo 'export PATH="$HOME/.local/bin:$PATH"' >> ~/.zshrc && source ~/.zshrc

Then restart the AI tool and verify the connection again.


Project Structure

src/
└── cdm_mcp_server/
    ├── __init__.py
    ├── server.py                   # MCP server entry point
    ├── templates/
    │   └── DM Timeline 20251120.xlsx  # Bundled Timeline template
    └── modules/
        ├── excel_theme.py          # Shared Excel formatting standards
        ├── excel_compare.py        # Excel & DB Spec comparison engine
        ├── excel_info.py           # Excel structure inspector
        ├── calendar_maker.py       # Timeline parsing & Calendar CSV generator
        ├── dvs_parser.py           # DVS specification grammar parser
        ├── dvs_excel_writer.py     # DVS validation report writer
        ├── dvs_consistency_checker.py  # LLM-powered DVS consistency checker
        └── pdf_ecrf_compare.py     # Blank eCRF PDF comparison engine

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cdm_mcp_server-0.3.0.tar.gz (100.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

cdm_mcp_server-0.3.0-py3-none-any.whl (110.3 kB view details)

Uploaded Python 3

File details

Details for the file cdm_mcp_server-0.3.0.tar.gz.

File metadata

  • Download URL: cdm_mcp_server-0.3.0.tar.gz
  • Upload date:
  • Size: 100.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for cdm_mcp_server-0.3.0.tar.gz
Algorithm Hash digest
SHA256 56e52d99d9b3f9b6fff3ef62deed981b0f6450c3fd53149cdecab2ee069aa690
MD5 22e75d8a05b0bdbbd0dc82976f88a7cc
BLAKE2b-256 78537821aed8c49982401108f28507348f8bb244dabcf3f59e1b2e1b2b256606

See more details on using hashes here.

File details

Details for the file cdm_mcp_server-0.3.0-py3-none-any.whl.

File metadata

  • Download URL: cdm_mcp_server-0.3.0-py3-none-any.whl
  • Upload date:
  • Size: 110.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for cdm_mcp_server-0.3.0-py3-none-any.whl
Algorithm Hash digest
SHA256 faf09841048c69d0fdeddd39cb63b5a41377429079d980a144cc88fad8b8d564
MD5 336587634a4bb5d9e191871621b49d68
BLAKE2b-256 dd3cf3c207cdef5d07f3ac93f339e40b02ff8e21542aa3e7b2e47fd88c51b538

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page