DataHub Agent Context

MCP tools for AI agents to search and query your DataHub metadata catalog — works with Claude, Cursor, Copilot, and any MCP-compatible AI assistant.

What you can do

  • Search datasets, dashboards, pipelines, and other data assets by name or description
  • Retrieve entity details — schema, lineage, ownership, tags, glossary terms, and more
  • Trace lineage upstream and downstream across your data assets
  • Mutate metadata — update descriptions, tags, owners, domains, and glossary terms
  • Build LangChain or Google ADK agents with pre-built tool bindings
  • Set up Snowflake AI agents with one CLI command

Installation

pip install datahub-agent-context

# With LangChain support
pip install "datahub-agent-context[langchain]"

Quickstart

LangChain agent

from datahub.sdk.main_client import DataHubClient
from datahub_agent_context.langchain_tools import build_langchain_tools

client = DataHubClient.from_env()

# Read-only tools (search, lineage, entity details)
tools = build_langchain_tools(client, include_mutations=False)

# Or, instead, include write tools as well (tags, descriptions, owners, etc.)
tools = build_langchain_tools(client, include_mutations=True)

# DataHub Cloud: add Ask DataHub AI assistant
from datahub_agent_context.langchain_tools import build_langchain_cloud_tools
tools += build_langchain_cloud_tools(client, ask_datahub=True)
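
A common follow-up is to narrow the returned list to an explicit allowlist before handing it to an agent. The sketch below is illustrative only: it assumes each object returned by build_langchain_tools exposes a `name` attribute (as LangChain tools generally do) and that those names match the function names listed under "Available tools". Neither assumption is confirmed here. `select_tools` is a hypothetical helper, not part of the package, and the stand-in objects exist only so the snippet runs without a DataHub instance.

```python
from types import SimpleNamespace
from typing import Any, Iterable, List, Set


def select_tools(tools: Iterable[Any], allowed: Set[str]) -> List[Any]:
    """Keep only tools whose .name is in the allowlist, preserving order."""
    return [tool for tool in tools if getattr(tool, "name", None) in allowed]


# Stand-ins for real tool objects (assumption: real tools expose .name too).
fake_tools = [SimpleNamespace(name=n) for n in ("search", "get_lineage", "add_tags")]

# Keep the two read-only tools and drop the mutation tool.
read_only = select_tools(fake_tools, {"search", "get_lineage"})
print([t.name for t in read_only])
```

With real tools, the same call would be `select_tools(tools, {...})` on the list built above; check the actual tool names first, for example by printing `[t.name for t in tools]`.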

Snowflake AI agent setup

datahub agent create snowflake \
  --datahub-url https://your-datahub-instance \
  --datahub-token your-token
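
If you would rather not hard-code the URL and token in a script, the same invocation can be assembled from environment variables. This is a minimal sketch: the variable names DATAHUB_GMS_URL and DATAHUB_GMS_TOKEN are an assumption (adjust them to your deployment), and `build_snowflake_agent_command` is a hypothetical helper. Only the subcommand and the two flags shown above are used.

```python
from typing import List, Mapping


def build_snowflake_agent_command(env: Mapping[str, str]) -> List[str]:
    """Assemble the CLI invocation shown above as an argument list."""
    # Assumed variable names; a missing variable raises KeyError.
    url = env["DATAHUB_GMS_URL"]
    token = env["DATAHUB_GMS_TOKEN"]
    return [
        "datahub", "agent", "create", "snowflake",
        "--datahub-url", url,
        "--datahub-token", token,
    ]


# Example with placeholder values instead of the real environment.
example_env = {
    "DATAHUB_GMS_URL": "https://your-datahub-instance",
    "DATAHUB_GMS_TOKEN": "your-token",
}
command = build_snowflake_agent_command(example_env)
print(" ".join(command))
```

To actually run it, pass `os.environ` instead of `example_env` and hand the resulting list to `subprocess.run(command, check=True)`.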

Available tools

  • Search: search(), search_documents(), grep_documents()
  • Entities: get_entities(), list_schema_fields()
  • Lineage: get_lineage(), get_lineage_paths_between()
  • Queries: get_dataset_queries()
  • Mutations: add_tags(), remove_tags(), update_description(), set_domains(), add_owners(), add_glossary_terms(), add_structured_properties(), save_document()
  • Cloud-only: ask_datahub_chat() (DataHub Cloud AI assistant)
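
The categories above can be captured as a small lookup table, which is handy for things like logging or asking for confirmation before a write. The sketch below only restates the list; `TOOL_CATEGORIES`, `category_of`, and `is_mutation` are hypothetical names, and whether the runtime tool names match these function names exactly is an assumption.

```python
from typing import Dict, Optional, Tuple

# Tool names copied from the list above, grouped by category.
TOOL_CATEGORIES: Dict[str, Tuple[str, ...]] = {
    "Search": ("search", "search_documents", "grep_documents"),
    "Entities": ("get_entities", "list_schema_fields"),
    "Lineage": ("get_lineage", "get_lineage_paths_between"),
    "Queries": ("get_dataset_queries",),
    "Mutations": (
        "add_tags", "remove_tags", "update_description", "set_domains",
        "add_owners", "add_glossary_terms", "add_structured_properties",
        "save_document",
    ),
    "Cloud-only": ("ask_datahub_chat",),
}


def category_of(tool_name: str) -> Optional[str]:
    """Return the category a tool belongs to, or None if it is not listed."""
    for category, names in TOOL_CATEGORIES.items():
        if tool_name in names:
            return category
    return None


def is_mutation(tool_name: str) -> bool:
    """True for tools that write metadata rather than read it."""
    return category_of(tool_name) == "Mutations"


print(category_of("get_lineage"), is_mutation("add_tags"))
```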

Download files

Source Distribution

datahub_agent_context-1.6.0.10rc3.tar.gz (98.6 kB, source)

Built Distribution

datahub_agent_context-1.6.0.10rc3-py3-none-any.whl (135.0 kB, Python 3)

Hashes for datahub_agent_context-1.6.0.10rc3.tar.gz
Algorithm    Hash digest
SHA256       3f3dc57745a91ede49b1c396ca81fe093e019d0a3865f7f23031f1e040d68e89
MD5          fe4fba9278a24eeca3cbf2d204c794a5
BLAKE2b-256  0f38527f227799313b16bb761d09538041e3cbbe71acec3424c50cbd897c6e74

Hashes for datahub_agent_context-1.6.0.10rc3-py3-none-any.whl
Algorithm    Hash digest
SHA256       ce75582dde3e3c865242cf31ed494428ddb02e4b5c99755813900d3143ad2aaa
MD5          46b1b3fd4ee8be9e5e07a1b638881da4
BLAKE2b-256  59259fab1f19934c7421037c7de842d81b224ecb47b7be8a449b279c831f7c31
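
To check a downloaded file against the SHA256 digests listed above, a short standard-library script is enough. This is a sketch, not an official verification tool: `sha256_of` and `verify_file` are hypothetical helper names, and the file is assumed to sit at whatever local path you pass in.

```python
import hashlib
from pathlib import Path

# SHA256 digests copied from the listings above, keyed by file name.
EXPECTED_SHA256 = {
    "datahub_agent_context-1.6.0.10rc3.tar.gz":
        "3f3dc57745a91ede49b1c396ca81fe093e019d0a3865f7f23031f1e040d68e89",
    "datahub_agent_context-1.6.0.10rc3-py3-none-any.whl":
        "ce75582dde3e3c865242cf31ed494428ddb02e4b5c99755813900d3143ad2aaa",
}


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Hash a file in chunks so large files need not fit in memory."""
    digest = hashlib.sha256()
    with path.open("rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_file(path: Path, expected_hex: str) -> bool:
    """Return True if the file's SHA256 digest equals the expected value."""
    return sha256_of(path) == expected_hex.strip().lower()
```

For a downloaded wheel at `path`, the check would be `verify_file(path, EXPECTED_SHA256[path.name])`. pip can also enforce digests itself through its hash-checking mode.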
