Skip to main content

MCP tools for AI agents to search and query your DataHub metadata catalog — works with Claude, Cursor, Copilot, and any MCP-compatible AI assistant

Project description

DataHub Agent Context

MCP tools for AI agents to search and query your DataHub metadata catalog — works with Claude, Cursor, Copilot, and any MCP-compatible AI assistant.

What you can do

  • Search datasets, dashboards, pipelines, and other data assets by name or description
  • Retrieve entity details — schema, lineage, ownership, tags, glossary terms, and more
  • Trace lineage upstream and downstream across your data assets
  • Mutate metadata — update descriptions, tags, owners, domains, and glossary terms
  • Build LangChain or Google ADK agents with pre-built tool bindings
  • Set up Snowflake AI agents with one CLI command

Installation

pip install datahub-agent-context

# With LangChain support
pip install "datahub-agent-context[langchain]"

Quickstart

LangChain agent

from datahub.sdk.main_client import DataHubClient
from datahub_agent_context.langchain_tools import build_langchain_tools

client = DataHubClient.from_env()

# Read-only tools (search, lineage, entity details)
tools = build_langchain_tools(client, include_mutations=False)

# Include write tools (tags, descriptions, owners, etc.)
tools = build_langchain_tools(client, include_mutations=True)

# DataHub Cloud: add Ask DataHub AI assistant
from datahub_agent_context.langchain_tools import build_langchain_cloud_tools
tools += build_langchain_cloud_tools(client, ask_datahub=True)

Snowflake AI agent setup

datahub agent create snowflake \
  --datahub-url https://your-datahub-instance \
  --datahub-token your-token

Available tools

Searchsearch(), search_documents(), grep_documents()

Entitiesget_entities(), list_schema_fields()

Lineageget_lineage(), get_lineage_paths_between()

Queriesget_dataset_queries()

Mutationsadd_tags(), remove_tags(), update_description(), set_domains(), add_owners(), add_glossary_terms(), add_structured_properties(), save_document()

Cloud-onlyask_datahub_chat() (DataHub Cloud AI assistant)

Links

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

datahub_agent_context-1.6.0.10rc4.tar.gz (98.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

datahub_agent_context-1.6.0.10rc4-py3-none-any.whl (135.0 kB view details)

Uploaded Python 3

File details

Details for the file datahub_agent_context-1.6.0.10rc4.tar.gz.

File metadata

File hashes

Hashes for datahub_agent_context-1.6.0.10rc4.tar.gz
Algorithm Hash digest
SHA256 f24b8978ee058352d6ee8fc9592f5dee27b568fe5d1f6e07bd20f9d8476ccd48
MD5 5029798449281045ec7da9813bc1247c
BLAKE2b-256 1f0fce764df2cd7fc2bb3b1ecb8bedae7c539aafb9a1bdb269f9ce99619b81be

See more details on using hashes here.

File details

Details for the file datahub_agent_context-1.6.0.10rc4-py3-none-any.whl.

File metadata

File hashes

Hashes for datahub_agent_context-1.6.0.10rc4-py3-none-any.whl
Algorithm Hash digest
SHA256 818e7f6bb192e41291d839b93efa43c9aa487590aa98d25f2c79b48bb79c78a6
MD5 d969eb4a7b4b5d5215cdba6813c006ec
BLAKE2b-256 473e97191d70145749d3d97f8e0a3aef38923e8c214438fda94aef43a653c28c

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page