Semantic context for your LLMs — generated automatically
Project description
Databao Context Engine
Semantic context for your LLMs — generated automatically.
No more copying schemas. No manual documentation. Just accurate answers.
Website • Quickstart • Data Sources • Contributing
What is Databao Context Engine?
Databao Context Engine is a Python library that automatically generates governed semantic context from your databases, BI tools, documents, and spreadsheets.
Use it with any LLM to deliver accurate, context-aware answers — without copying schemas or writing documentation by hand.
You can add Databao Context Engine as a standard Python dependency in your code or via Databao CLI (coming soon).
Your data sources → Context Engine → Unified semantic graph → Any LLM
Why choose Databao Context Engine?
| Feature | What it means for you |
|---|---|
| Auto-generated context | Extracts schemas, relationships, and semantics automatically |
| Runs locally | Your data never leaves your environment |
| MCP integration | Works with Claude Desktop, Cursor, and any MCP-compatible tool |
| Multiple sources | Databases, dbt projects, spreadsheets, documents |
| Built-in benchmarks | Measure and improve context quality over time |
| LLM agnostic | OpenAI, Anthropic, Ollama, Gemini — use any model |
| Governed & versioned | Track, version, and share context across your team |
| Dynamic or static | Serve context via MCP server or export as artifact |
Installation
Databao Context Engine is available on PyPI and can be installed with uv, pip, or another package manager.
Using uv
uv add databao-context-engine
Using pip
pip install databao-context-engine
Supported data sources
- Athena
- BigQuery
- ClickHouse
- DuckDB
- MSSQL
- MySQL
- PostgreSQL
- Snowflake
- SQLite
- dbt projects
- PDF files
- Markdown and text files
Supported LLMs
| Provider | Configuration |
|---|---|
| Ollama | languageModel: OLLAMA: runs locally, free |
Quickstart
1. Create a domain
# Initialize the domain in an existing directory
from databao_context_engine import init_dce_domain
domain_manager = init_dce_domain(Path(tempfile.mkdtemp()))
# Or use an existing project
from databao_context_engine import DatabaoContextDomainManager
domain_manager = DatabaoContextDomainManager(domain_dir=Path("path/to/project"))
2. Configure data sources
from databao_context_engine import (
DatasourceConnectionStatus,
DatasourceType,
)
# Create a new datasource
postgres_datasource_id = domain_manager.create_datasource_config(
DatasourceType(full_type="postgres"),
datasource_name="my_postgres_datasource",
config_content={
"connection": {"host": "localhost", "user": "dev", "password": "pass"}
},
).datasource.id
# Check the connection to the datasource is valid
check_result = domain_manager.check_datasource_connection()
assert len(check_result) == 1
assert check_result[0].datasource_id == postgres_datasource_id
assert check_result[0].connection_status == DatasourceConnectionStatus.VALID
3. Build context
build_result = domain_manager.build_context()
assert len(build_result) == 1
assert build_result[0].datasource_id == postgres_datasource_id
assert build_result[0].datasource_type == DatasourceType(full_type="postgres")
assert build_result[0].context_file_path.is_file()
4. Use the built contexts
Create a context engine
# Switch to the engine if you're already using a domain_manager
context_engine = domain_manager.get_engine_for_domain()
# Or directly create a context engine from the path to your DCE domaint
from databao_context_engine import DatabaoContextEngine
context_engine = DatabaoContextEngine(domain_dir=Path("path/to/project"))
Get all built contexts
# Switch to the engine to use the context built
all_built_contexts = context_engine.get_all_contexts()
assert len(all_built_contexts) == 1
assert all_built_contexts[0].datasource_id == postgres_datasource_id
print(all_built_contexts[0].context)
Search in built contexts
# Run a vector similarity search
results = context_engine.search_context("my search query")
print(f"Found {len(results)} results for query")
print(
"\n\n".join(
[f"{str(result.datasource_id)}\n{result.context_result}" for result in results]
)
)
Contributing
We’d love your help! Here’s how to get involved:
- ⭐ Star this repo — it helps others find us!
- 🐛 Found a bug? Open an issue
- 💡 Have an idea? We’re all ears — create a feature request
- 👍 Upvote issues you care about — helps us prioritize
- 🔧 Submit a PR
- 📝 Improve docs — typos, examples, tutorials — everything helps!
New to open source? No worries! We're friendly and happy to help you get started. 🌱
For more details, see CONTRIBUTING.
📄 License
Apache 2.0 — use it however you want. See the LICENSE file for details.
Like Databao Context Engine? Give us a ⭐ — it means a lot!
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file databao_context_engine-0.5.0.tar.gz.
File metadata
- Download URL: databao_context_engine-0.5.0.tar.gz
- Upload date:
- Size: 101.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: uv/0.9.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
cda05481ee65c011557745b4395ee6d314ca33c9c3a981c0b7db0c1e9b88e2f8
|
|
| MD5 |
0fccbfe3d5eef859810b1376fe53330f
|
|
| BLAKE2b-256 |
508e31dd85cb0e01feedcd7e34e751c12a6fe2f30c168c715457a6204be1fa53
|
File details
Details for the file databao_context_engine-0.5.0-py3-none-any.whl.
File metadata
- Download URL: databao_context_engine-0.5.0-py3-none-any.whl
- Upload date:
- Size: 169.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: uv/0.9.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
d104ccc7e4ec0a5f19f203f196da017bd742bb2b6e5321a75f005377ac69a71c
|
|
| MD5 |
b46eb778d0a4e39d7c28bd0be637a980
|
|
| BLAKE2b-256 |
9a8a166a6cf0e650faf57c93a6022a0862cd490acfc50a9a70e1d0e6686b95e7
|