Skip to main content

MCP tools for AI agents to search and query your DataHub metadata catalog — works with Claude, Cursor, Copilot, and any MCP-compatible AI assistant

Project description

DataHub Agent Context

MCP tools for AI agents to search and query your DataHub metadata catalog — works with Claude, Cursor, Copilot, and any MCP-compatible AI assistant.

What you can do

  • Search datasets, dashboards, pipelines, and other data assets by name or description
  • Retrieve entity details — schema, lineage, ownership, tags, glossary terms, and more
  • Trace lineage upstream and downstream across your data assets
  • Mutate metadata — update descriptions, tags, owners, domains, and glossary terms
  • Build LangChain or Google ADK agents with pre-built tool bindings
  • Set up Snowflake AI agents with one CLI command

Installation

pip install datahub-agent-context

# With LangChain support
pip install "datahub-agent-context[langchain]"

Quickstart

LangChain agent

from datahub.sdk.main_client import DataHubClient
from datahub_agent_context.langchain_tools import build_langchain_tools

client = DataHubClient.from_env()

# Read-only tools (search, lineage, entity details)
tools = build_langchain_tools(client, include_mutations=False)

# Include write tools (tags, descriptions, owners, etc.)
tools = build_langchain_tools(client, include_mutations=True)

# DataHub Cloud: add Ask DataHub AI assistant
from datahub_agent_context.langchain_tools import build_langchain_cloud_tools
tools += build_langchain_cloud_tools(client, ask_datahub=True)

Snowflake AI agent setup

datahub agent create snowflake \
  --datahub-url https://your-datahub-instance \
  --datahub-token your-token

Available tools

Searchsearch(), search_documents(), grep_documents()

Entitiesget_entities(), list_schema_fields()

Lineageget_lineage(), get_lineage_paths_between()

Queriesget_dataset_queries()

Mutationsadd_tags(), remove_tags(), update_description(), set_domains(), add_owners(), add_glossary_terms(), add_structured_properties(), save_document()

Cloud-onlyask_datahub_chat() (DataHub Cloud AI assistant)

Links

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

datahub_agent_context-1.6.0.10rc1.tar.gz (98.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

datahub_agent_context-1.6.0.10rc1-py3-none-any.whl (135.0 kB view details)

Uploaded Python 3

File details

Details for the file datahub_agent_context-1.6.0.10rc1.tar.gz.

File metadata

File hashes

Hashes for datahub_agent_context-1.6.0.10rc1.tar.gz
Algorithm Hash digest
SHA256 ac005d832d883c60224f02cb304874c2212b3be6ddace0102199222c1dc8afcd
MD5 41fc9eb975015b7738de422752d3078d
BLAKE2b-256 43f3a2fb77bab551f299bd8978e157e336dfec1eacaf61e5db3a5e15afc7b45a

See more details on using hashes here.

File details

Details for the file datahub_agent_context-1.6.0.10rc1-py3-none-any.whl.

File metadata

File hashes

Hashes for datahub_agent_context-1.6.0.10rc1-py3-none-any.whl
Algorithm Hash digest
SHA256 eb61110d5aceed8e59bfe982c1a0ac3fcf70a831d7f0e7234e0f3a6cf38f849c
MD5 1920b3bf2c9b3c432b20ea5685bb5844
BLAKE2b-256 e4919fbb94f210b3c07460accec834b05858bb2b2521d4b272891282bef51033

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page