DataHub Agent Context

MCP tools for AI agents to search and query your DataHub metadata catalog — works with Claude, Cursor, Copilot, and any MCP-compatible AI assistant.

What you can do

  • Search datasets, dashboards, pipelines, and other data assets by name or description
  • Retrieve entity details — schema, lineage, ownership, tags, glossary terms, and more
  • Trace lineage upstream and downstream across your data assets
  • Mutate metadata — update descriptions, tags, owners, domains, and glossary terms
  • Build LangChain or Google ADK agents with pre-built tool bindings
  • Set up Snowflake AI agents with one CLI command

Installation

pip install datahub-agent-context

# With LangChain support
pip install "datahub-agent-context[langchain]"

Quickstart

LangChain agent

from datahub.sdk.main_client import DataHubClient
from datahub_agent_context.langchain_tools import build_langchain_tools

client = DataHubClient.from_env()

# Option 1: read-only tools (search, lineage, entity details)
tools = build_langchain_tools(client, include_mutations=False)

# Option 2: also include write tools (tags, descriptions, owners, etc.)
# Use one of the two calls above, not both.
tools = build_langchain_tools(client, include_mutations=True)

# DataHub Cloud only: add the Ask DataHub AI assistant tool
from datahub_agent_context.langchain_tools import build_langchain_cloud_tools
tools += build_langchain_cloud_tools(client, ask_datahub=True)
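
One possible way to choose between the read-only and write-enabled tool sets at deploy time is an environment flag. The sketch below is illustrative only: the variable name `DATAHUB_AGENT_ALLOW_WRITES` and the helper `mutations_enabled` are hypothetical and not part of the package. Only the `include_mutations` parameter comes from the quickstart above.

```python
import os

# Hypothetical variable name; it is not defined by datahub-agent-context.
ALLOW_WRITES_VAR = "DATAHUB_AGENT_ALLOW_WRITES"

_TRUTHY = {"1", "true", "yes", "on"}


def mutations_enabled(environ=None):
    """Return True only when the flag is explicitly set to a truthy value."""
    env = os.environ if environ is None else environ
    return env.get(ALLOW_WRITES_VAR, "").strip().lower() in _TRUTHY


if __name__ == "__main__":
    # The result is meant to be passed as include_mutations= to
    # build_langchain_tools(client, ...), as in the quickstart.
    print("include_mutations =", mutations_enabled())
```

Defaulting to read-only when the flag is unset or unrecognized means an agent can change metadata only after someone opts in.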

Snowflake AI agent setup

datahub agent create snowflake \
  --datahub-url https://your-datahub-instance \
  --datahub-token your-token
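
To avoid pasting a token into shell history, the same command could be assembled in Python from environment variables. This is a sketch under assumptions: the helper names are hypothetical, and reading `DATAHUB_GMS_URL` and `DATAHUB_GMS_TOKEN` is an assumed convention, not something this README specifies. The two flags are the ones shown in the command above.

```python
import os
import shlex


def snowflake_setup_argv(environ=None):
    """Build the CLI argument list; raise if a required value is missing."""
    env = os.environ if environ is None else environ
    # Assumed variable names; adjust to whatever your deployment uses.
    url = env.get("DATAHUB_GMS_URL")
    token = env.get("DATAHUB_GMS_TOKEN")
    missing = [
        name
        for name, value in (("DATAHUB_GMS_URL", url), ("DATAHUB_GMS_TOKEN", token))
        if not value
    ]
    if missing:
        raise RuntimeError("missing environment variables: " + ", ".join(missing))
    return [
        "datahub", "agent", "create", "snowflake",
        "--datahub-url", url,
        "--datahub-token", token,
    ]


def redacted(argv):
    """Return a printable command line with the token value hidden."""
    out = list(argv)
    for i, arg in enumerate(out[:-1]):
        if arg == "--datahub-token":
            out[i + 1] = "REDACTED"
    return shlex.join(out)
```

The returned list can be handed to `subprocess.run(argv, check=True)`. Log the output of `redacted(argv)` rather than the raw arguments.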

Available tools

Search: search(), search_documents(), grep_documents()

Entities: get_entities(), list_schema_fields()

Lineage: get_lineage(), get_lineage_paths_between()

Queries: get_dataset_queries()

Mutations: add_tags(), remove_tags(), update_description(), set_domains(), add_owners(), add_glossary_terms(), add_structured_properties(), save_document()

Cloud-only: ask_datahub_chat() (DataHub Cloud AI assistant)
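
For auditing which tools an agent may call, the categories above can be written down as a plain lookup table. This is a minimal sketch: the tool names are copied from the list above, while `TOOLS_BY_CATEGORY`, `category_of`, and `writes_metadata` are hypothetical helpers, not package APIs. It also assumes the tool objects you receive report these same names.

```python
# Tool names copied from the "Available tools" list; the structure is illustrative.
TOOLS_BY_CATEGORY = {
    "Search": ("search", "search_documents", "grep_documents"),
    "Entities": ("get_entities", "list_schema_fields"),
    "Lineage": ("get_lineage", "get_lineage_paths_between"),
    "Queries": ("get_dataset_queries",),
    "Mutations": (
        "add_tags",
        "remove_tags",
        "update_description",
        "set_domains",
        "add_owners",
        "add_glossary_terms",
        "add_structured_properties",
        "save_document",
    ),
    "Cloud-only": ("ask_datahub_chat",),
}


def category_of(tool_name):
    """Return the category a tool name belongs to, or None if it is unknown."""
    for category, names in TOOLS_BY_CATEGORY.items():
        if tool_name in names:
            return category
    return None


def writes_metadata(tool_name):
    """True for tools listed under Mutations."""
    return category_of(tool_name) == "Mutations"
```

A check such as `writes_metadata(name)` could gate an approval step before an agent is allowed to run a mutation tool.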
