An IBM watsonx.data MCP server that seamlessly connects AI agents with document libraries

Watsonx.data Document Library Retrieval MCP Server

The Watsonx.data Document Library Retrieval MCP Server is a Model Context Protocol (MCP)-compliant service that seamlessly connects AI agents with document libraries in watsonx.data, enabling intelligent data retrieval and interaction.

Key Features

Dynamic Discovery & Registration
Automatically detects and registers document libraries as MCP tools.
Natural Language Interface
Query document libraries using conversational language and receive human-readable responses.
Minimal Configuration
Deploy by installing a single package and setting a few environment variables.
Framework-Agnostic Integration
Plug directly into your preferred agentic framework with native MCP compatibility.

Overview

Protocol: Model Context Protocol (MCP)
Purpose: Acts as a bridge between agentic AI frameworks and watsonx.data document libraries
Supported Environments: IBM Cloud Pak for Data (CPD), Watsonx SaaS
Agent Compatibility: The agentic framework must support the MCP standard (via SSE or Stdio).
Note: This server will not function with agents that do not support MCP.

Prerequisites

Python version 3.11 or later
Access to your CPD or SaaS environment
Access credentials and a CA certificate bundle for CPD
An agent framework that supports MCP
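
The Python requirement can be checked from the interpreter itself before anything is installed. The following is a minimal sketch; the helper name `meets_minimum` is hypothetical and not part of the package.

```python
import sys

# Minimum interpreter version stated in the prerequisites.
MINIMUM_PYTHON = (3, 11)


def meets_minimum(version_info=None):
    """Return True if the given (or the running) interpreter is 3.11 or later."""
    current = tuple(version_info if version_info is not None else sys.version_info)
    # Compare only (major, minor); patch level does not matter here.
    return current[:2] >= MINIMUM_PYTHON


print("Python version OK" if meets_minimum() else "Python 3.11 or later is required")
```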

Getting CA Bundle for CPD

oc login -u kubeadmin -p '<your_openshift_password>' https://<your_openshift_cpd_url>:6443

Extract the root CA bundle:

oc get configmap kube-root-ca.crt -o jsonpath='{.data.ca\.crt}' > cabundle.crt

NOTE: Use the OpenShift login command (`oc login`) shown above. The username and password are the ones you use to log in to the OpenShift portal.
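
Before pointing the server at the extracted file, it can help to confirm that `cabundle.crt` actually contains PEM certificates, since an empty or truncated file is an easy mistake to miss. The sketch below is a structural check only and assumes the bundle is PEM-encoded text; the function names are hypothetical, and it does not validate the certificates themselves.

```python
from pathlib import Path

PEM_HEADER = "-----BEGIN CERTIFICATE-----"
PEM_FOOTER = "-----END CERTIFICATE-----"


def count_pem_certificates(text):
    """Count PEM certificate blocks by pairing BEGIN and END markers."""
    begins = text.count(PEM_HEADER)
    ends = text.count(PEM_FOOTER)
    if begins != ends:
        raise ValueError("unbalanced PEM markers: the file may be truncated")
    return begins


def check_bundle(path):
    """Read the bundle at `path` and fail loudly if it holds no certificate."""
    count = count_pem_certificates(Path(path).read_text())
    if count == 0:
        raise ValueError(f"{path} contains no PEM certificates")
    return count
```

Calling `check_bundle("cabundle.crt")` returns the number of certificates found, or raises `ValueError` if the file looks empty or cut off.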

Setup

Step 1: Install Python

Official Installer: https://www.python.org/downloads/

Step 2: Create a virtual environment

python -m venv .venv

Step 3: Activate the virtual environment

source .venv/bin/activate  # macOS/Linux
.venv\Scripts\activate     # Windows

Step 4: Install the `uv` package manager

pip install uv

uv package: https://pypi.org/project/uv/

Step 5: Install the MCP server package

pip install ibm-watsonxdata-dl-retrieval-mcp-server

Configuration

For Cloud Pak for Data (CPD):

export CPD_ENDPOINT="<cpd-endpoint>"
export CPD_USERNAME="<cpd-username>"
export CPD_PASSWORD="<cpd-password>"
export CA_BUNDLE_PATH="<absolute_path_to_cabundle.crt>"
export LH_CONTEXT="CPD"

NOTE:

For CPD_ENDPOINT, use the endpoint URL of the installed CPD instance. Example: "https://cpd-cpd-instance.apps.perf10.5y2z.openshiftapps.com"
For CPD_USERNAME and CPD_PASSWORD, use the username and password that you use to log in to CPD.

For Watsonx SaaS:

export WATSONX_DATA_API_KEY="<api-key>"
export WATSONX_DATA_RETRIEVAL_ENDPOINT="<retrieval-service-endpoint>"
export DOCUMENT_LIBRARY_API_ENDPOINT="<document-library-endpoint>"
export WATSONX_DATA_TOKEN_GENERATION_ENDPOINT="<token-generation-endpoint>"
export LH_CONTEXT="SAAS"

NOTE:

For DOCUMENT_LIBRARY_API_ENDPOINT, use the endpoint URL for your region from https://cloud.ibm.com/apidocs/data-ai-common-core#endpoint-url. Example: https://api.ca-tor.dai.cloud.ibm.com
For WATSONX_DATA_RETRIEVAL_ENDPOINT, use the watsonx.data endpoint. Example: https://console-ibm-cator.lakehouse.saas.ibm.com
For WATSONX_DATA_TOKEN_GENERATION_ENDPOINT, use the endpoint URL from https://cloud.ibm.com/apidocs/iam-identity-token-api#endpoints. Example: https://iam.cloud.ibm.com
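
A quick pre-flight check can catch a missing variable before the server is started. The sketch below uses the variable names from the export commands above; the helper `missing_variables` is hypothetical, and treating every listed variable as mandatory is an assumption rather than documented server behavior.

```python
import os

# Variable names are taken from the export commands in this section.
REQUIRED_VARIABLES = {
    "CPD": ["CPD_ENDPOINT", "CPD_USERNAME", "CPD_PASSWORD", "CA_BUNDLE_PATH"],
    "SAAS": [
        "WATSONX_DATA_API_KEY",
        "WATSONX_DATA_RETRIEVAL_ENDPOINT",
        "DOCUMENT_LIBRARY_API_ENDPOINT",
        "WATSONX_DATA_TOKEN_GENERATION_ENDPOINT",
    ],
}


def missing_variables(env):
    """Return the names that are unset or empty for the selected LH_CONTEXT."""
    context = env.get("LH_CONTEXT", "")
    if context not in REQUIRED_VARIABLES:
        raise ValueError('LH_CONTEXT must be "CPD" or "SAAS"')
    return [name for name in REQUIRED_VARIABLES[context] if not env.get(name)]


def report(env=None):
    """Print a one-line summary for the current (or a supplied) environment."""
    missing = missing_variables(os.environ if env is None else env)
    print("Missing: " + ", ".join(missing) if missing else "Configuration looks complete")
```

Running `report()` in the shell session where the variables were exported lists anything that still needs to be set.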

Running the Server

uv run ibm-watsonxdata-dl-retrieval-mcp-server

By default, the server runs in SSE transport mode on port 8000.

Transport: SSE

uv run ibm-watsonxdata-dl-retrieval-mcp-server --port <desired_port> --transport sse

Transport: stdio

uv run ibm-watsonxdata-dl-retrieval-mcp-server --port <desired_port> --transport stdio
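
Once the server is running in SSE mode, any MCP client can list the document libraries it has registered as tools. The sketch below assumes the MCP Python SDK is installed (`pip install mcp`), that the server is reachable over plain HTTP on localhost, and that the endpoint path is `/sse`, as in the WXO SSE example in this document. The helper names `sse_url` and `list_tool_names` are hypothetical.

```python
import asyncio


def sse_url(host, port, scheme="http"):
    """Build the SSE endpoint URL for a server at host:port."""
    return f"{scheme}://{host}:{port}/sse"


async def list_tool_names(url):
    """Connect over SSE, initialize the session, and return the tool names."""
    # Imported lazily so sse_url() stays usable without the SDK installed.
    from mcp import ClientSession
    from mcp.client.sse import sse_client

    async with sse_client(url) as (read_stream, write_stream):
        async with ClientSession(read_stream, write_stream) as session:
            await session.initialize()
            result = await session.list_tools()
            return [tool.name for tool in result.tools]


def main():
    """Print the tools exposed by a server on the default port (8000)."""
    print(asyncio.run(list_tool_names(sse_url("localhost", 8000))))
```

Call `main()` only while the server is running; each name printed should correspond to one document library.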

Integrating with WXO

Prerequisite:

Install the watsonx Orchestrate (WXO) ADK and complete the initial setup. Refer to the documentation for more details: https://developer.watson-orchestrate.ibm.com

Transport: STDIO

To add the MCP server to WXO in stdio transport, refer to the example below.

Create connection

orchestrate connections add -a <app id>

Configure connection

orchestrate connections configure --app-id <app id> --environment draft -t team -k key_value

Setting credentials

orchestrate connections set-credentials --app-id=<app id> --env draft -e WATSONX_DATA_API_KEY="<api_key>" -e WATSONX_DATA_RETRIEVAL_ENDPOINT="<wxd retrieval endpoint>" -e DOCUMENT_LIBRARY_API_ENDPOINT="<DL endpoint>" -e WATSONX_DATA_TOKEN_GENERATION_ENDPOINT="<token generation endpoint>" -e LH_CONTEXT="SAAS"

Example for SaaS:

orchestrate toolkits import \
    --kind mcp \
    --name "mcp-toolkit" \
    --description "mcp server for watsonx retrieval service" \
    --package "ibm-watsonxdata-dl-retrieval-mcp-server" \
    --command "uv run ibm-watsonxdata-dl-retrieval-mcp-server --port <port> --transport stdio" \
    --language python \
    --tools "*" \
    --app-id <app id>

Transport: SSE

Install mcp-proxy

pip install mcp-proxy

Run ibm-watsonxdata-dl-retrieval-mcp-server in SSE transport.

Once the prerequisites are met, the tools can be added as a toolkit in WXO.

Example:

orchestrate toolkits import \
  --kind mcp \
  --name mcp_toolkit \
  --description "MCP server (hosted, SSE)" \
  --package "mcp-proxy" \
  --language python \
  --command "uvx mcp-proxy https://<mcp server endpoint>/sse" \
  --tools "*"

NOTE:
When WXO runs in SaaS and the MCP server runs locally, expose the MCP server endpoint so that WXO can reach it, if required.

Refer to the WXO documentation for more details: https://www.ibm.com/docs/en/watsonx/watson-orchestrate/base?topic=tools-importing-from-mcp-server

Integrating with other Agentic Frameworks

For more examples of using the Watsonx.data Document Library Retrieval MCP Server with agentic frameworks, refer to the examples.

Limitations

Environment credentials cannot be changed at runtime.
- To change credentials, either:
  - Start a new server with the new environment variables, OR
  - Source the new environment variables and restart the server.
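
Because credentials are read from the environment, rotating them means starting a fresh process. The following sketch shows one way to script that restart from Python. It assumes the server was started with the `uv run` command from this document and that terminating the old process is acceptable; the helper names are hypothetical.

```python
import os
import subprocess

# Launch command taken from the "Running the Server" section.
SERVER_COMMAND = ["uv", "run", "ibm-watsonxdata-dl-retrieval-mcp-server"]


def build_environment(overrides, base=None):
    """Return a copy of the base environment with the new values applied."""
    env = dict(os.environ if base is None else base)
    env.update(overrides)
    return env


def restart_server(process, overrides):
    """Stop the running server, if any, and start one with updated variables."""
    if process is not None and process.poll() is None:
        process.terminate()
        process.wait(timeout=30)
    return subprocess.Popen(SERVER_COMMAND, env=build_environment(overrides))
```

For example, `restart_server(old_process, {"WATSONX_DATA_API_KEY": new_key})` returns the handle of the new process; pass `None` as the first argument when no server is running yet.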

Tool Naming

Each document library is registered with a unique tool name:

tool_name = <library_name><library_id>

Example:

invoice_document_library77e4b4dd_479e_4406_acc4_ce154c96266c
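
The naming rule can be sketched in a few lines of Python, which is handy when an agent needs to predict a tool name from a known library. This sketch assumes the library ID is a hyphenated UUID and that hyphens become underscores. Both points are inferred from the example above and are not documented behavior; the function `tool_name` is hypothetical.

```python
def tool_name(library_name, library_id):
    """Concatenate name and ID, replacing hyphens in the ID with underscores."""
    # Assumption: only the hyphens in the ID are rewritten, as in the example.
    return f"{library_name}{library_id.replace('-', '_')}"


print(tool_name("invoice_document_library", "77e4b4dd-479e-4406-acc4-ce154c96266c"))
```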
