Read-only database MCP proxy for AI - safe SELECT access with 5-layer defense
Project description
dbread
Read-only database MCP proxy for AI — safe SELECT access with 5-layer defense
Why · Quickstart · Tools · Security Model · Docs
🤔 Why
Handing a raw database connection string to an AI is like handing a stranger your car keys. They probably won't crash it, but you wouldn't bet the car on it.
dbread sits between your AI and your DBs and enforces read-only access through five independent layers — if one layer has a bug, the next one still blocks you.
⚡ Quickstart (2 minutes, no clone needed)
1. Install as a tool
# From PyPI (recommended):
uv tool install "dbread[postgres]" # extras: postgres, mysql, mssql, oracle, duckdb, clickhouse
# OR straight from GitHub (no PyPI needed):
uv tool install "git+https://github.com/tvtdev94/dbread[postgres]"
2. Scaffold config (one command)
dbread init
Creates ~/.dbread/config.yaml, ~/.dbread/.env, and ~/.dbread/sample.db (a tiny read-only SQLite demo so everything works immediately). Prints the exact claude mcp add line to paste in step 4. Skip to step 4 if you only want the demo; otherwise edit config.yaml / .env first (step 3).
2b. Create a read-only DB user (when pointing at a real DB)
See docs/setup-db-readonly.md — copy-paste SQL snippets for PostgreSQL / MySQL / MSSQL / Oracle / SQLite / DuckDB / ClickHouse, plus compat notes for CockroachDB · Timescale · Aurora · SingleStore · PlanetScale · Yugabyte.
3. Create config.yaml + .env
# ~/.dbread/config.yaml
connections:
mydb:
url_env: MYDB_URL
dialect: postgres
rate_limit_per_min: 60
statement_timeout_s: 30
max_rows: 1000
audit:
path: ~/.dbread/audit.jsonl # ~ expansion supported
rotate_mb: 50 # rotates current → .1 → .2 → .3 (oldest dropped)
timezone: UTC # IANA name; default UTC
redact_literals: false # true → SQL literals become "?" in log (PII hardening)
# ~/.dbread/.env
MYDB_URL=postgresql+psycopg2://ai_readonly:password@host:5432/mydb
4. Register with Claude Code
claude mcp add --scope user dbread \
--env DBREAD_CONFIG=/path/to/config.yaml \
-- dbread
Or without install (one-shot via uvx):
claude mcp add --scope user dbread \
--env DBREAD_CONFIG=/path/to/config.yaml \
-- uvx --from "dbread[postgres]" dbread
5. Use it
Restart Claude Code → /mcp → dbread appears. Ask Claude: "list connections in dbread, then count rows per status in the orders table."
Alternative: clone the repo (for development)
git clone https://github.com/tvtdev94/dbread && cd dbread
uv sync --extra postgres --extra dev
cp config.example.yaml config.yaml && cp .env.example .env
claude mcp add --scope user dbread -- uv --directory $(pwd) run dbread
Ask Claude: "List connections in dbread, then count rows per status in the orders table."
🏗️ Architecture
Data flow for a query call:
sequenceDiagram
participant AI as Claude
participant T as tools.query
participant G as SqlGuard
participant R as RateLimiter
participant D as Database
participant A as Audit
AI->>T: query(sql, connection)
T->>G: validate(sql, dialect)
alt SQL is DML/DDL
G-->>T: rejected
T->>A: log(rejected, reason)
T-->>AI: ❌ sql_guard error
else SQL is SELECT
G->>T: ✓ plus inject LIMIT N
T->>R: acquire(connection)
alt Rate limit hit
R-->>T: denied
T->>A: log(rejected, rate_limit)
T-->>AI: ❌ rate_limit_exceeded
else Rate limit OK
R->>T: ✓
T->>D: execute(sql)
D-->>T: rows
T->>A: log(ok, rows, ms)
T-->>AI: ✅ rows JSON
end
end
🧰 Tools
| Tool | Purpose | Input |
|---|---|---|
list_connections |
Configured connections + dialects | — |
list_tables |
Tables in a connection | connection, schema? |
describe_table |
Columns, types, nullability, PKs, indexes | connection, table, schema? |
query |
Run SELECT/WITH/EXPLAIN/SHOW. Auto-limited. Rate-limited. Audited. |
connection, sql, max_rows? |
explain |
Query execution plan | connection, sql |
🛡️ Security Model
| Layer | Mechanism | What it rejects |
|---|---|---|
| 0 | DB user with GRANT SELECT only |
All writes — mandatory, non-bypassable |
| 1 | sqlglot AST validation |
INSERT · UPDATE · DELETE · MERGE · CREATE · ALTER · DROP · TRUNCATE · GRANT · REVOKE · multi-statement (SELECT 1; DROP...) · PG CTE-DML trick (WITH d AS (DELETE...) SELECT...) · time-based DoS (pg_sleep*, sleep, benchmark, MSSQL WAITFOR DELAY/TIME) · function blacklist (pg_read_file, xp_cmdshell, load_file, dblink_exec, ClickHouse url/s3/remote, DuckDB read_csv/read_parquet, …) |
| 2 | Rate limit + statement_timeout |
Runaway loops · long-running queries |
| 3 | Auto-inject LIMIT N |
Oversized result sets |
| 4 | JSONL audit log (fsync each write, 3-backup rotate, opt-in PII redact) |
(detection, not prevention — grep-friendly forensics) |
💡 Principle: Never rely on a single layer. Layer 0 is the guarantee; Layers 1–4 make attacks loud and rare.
Full threat model: docs/security-threat-model.md (STRIDE analysis).
📋 Example Prompts
💬 "List connections in dbread."
💬 "Describe the schema of the orders table in analytics_prod."
💬 "Top 10 customers by lifetime value — use dbread."
💬 "Run EXPLAIN on: SELECT ... ORDER BY created_at"
💬 "Update user 1 to 'hacked'."
→ ❌ sql_guard: node_rejected: Update
💬 "WITH d AS (DELETE FROM users RETURNING *) SELECT * FROM d"
→ ❌ sql_guard: node_rejected: Delete (PG CTE-DML blocked)
💬 "SELECT 1; DROP TABLE users;"
→ ❌ sql_guard: multi_statement_not_allowed
📜 Audit Log
Every call lands in audit.jsonl — one JSON per line, append-only, fsync'd on each write (survives kill -9), auto-rotated at 50 MB through a 3-backup chain (.1 → .2 → .3).
{"ts":"2026-04-22T12:30:12+00:00","conn":"analytics","sql":"SELECT * FROM users LIMIT 100","rows":100,"ms":42,"status":"ok"}
{"ts":"2026-04-22T12:30:15+00:00","conn":"analytics","sql":"DELETE FROM users","rows":0,"ms":0,"status":"rejected","reason":"node_rejected: Delete"}
Default timezone is UTC; override with audit.timezone: Asia/Bangkok (IANA). Enable audit.redact_literals: true to rewrite SQL literals to ? before logging — handy when prompts may contain PII.
jq 'select(.status=="rejected")' audit.jsonl # just rejections
jq 'select(.ms > 1000)' audit.jsonl # slow queries
jq -s 'group_by(.status)|map({s:.[0].status,n:length})' audit.jsonl # counts
🗂️ Config
config.yaml (gitignored — safe to edit with real values):
connections:
analytics_prod:
url_env: ANALYTICS_PROD_URL # credentials from .env
dialect: postgres
rate_limit_per_min: 60
statement_timeout_s: 30
max_rows: 1000
local_mysql:
url: mysql+pymysql://readonly:pw@localhost/shop
dialect: mysql
rate_limit_per_min: 120
statement_timeout_s: 15
max_rows: 500
local_duckdb:
url: duckdb:///./analytics.duckdb?access_mode=read_only
dialect: duckdb
rate_limit_per_min: 200
statement_timeout_s: 30
max_rows: 5000
clickhouse_prod:
url_env: CLICKHOUSE_URL # clickhouse+http://readonly:pw@host:8123/db
dialect: clickhouse
rate_limit_per_min: 60
statement_timeout_s: 30
max_rows: 1000
audit:
path: ~/.dbread/audit.jsonl # ~ expansion supported
rotate_mb: 50 # rotate chain: current → .1 → .2 → .3
timezone: UTC # IANA; default UTC
redact_literals: false # true → SQL literals → "?"
Supported dialects: postgres · mysql · mssql · sqlite · oracle · duckdb · clickhouse.
Compat (no new dialect): CockroachDB, TimescaleDB, Aurora PG (use postgres) · Aurora MySQL, SingleStore, PlanetScale (use mysql). See docs/setup-db-readonly.md.
🧪 Testing
uv sync --extra dev
uv run pytest # 122 passing
uv run pytest --cov=dbread # coverage report (92% overall, 85% server.py)
uv run ruff check src/ # lint
# Integration tests with real PG + MySQL + ClickHouse (needs Docker):
cd tests/integration && docker compose up -d
uv run pytest tests/integration/ -v
- ~110 unit tests cover config, connections, audit (fsync/tz/redact/rotate), SQL guard (57 evasion cases incl. WAITFOR & sleep variants), rate limiter, tools.
- 4 subprocess smoke tests drive
server.pyvia real stdio JSON-RPC. - 4 SQLite + 4 DuckDB E2E tests always run (no Docker).
- 4 PG + 4 MySQL + ClickHouse E2E tests skip gracefully without Docker.
- CI runs on GitHub Actions matrix: Python 3.11/3.12 × Ubuntu/Windows.
📚 Docs
| Document | What's in it |
|---|---|
docs/setup-db-readonly.md |
Copy-paste SQL for Layer 0 DB user on PG / MySQL / MSSQL / Oracle / SQLite |
docs/architecture.md |
Component diagram · 5-layer details · data flow · design decisions |
docs/security-threat-model.md |
Full STRIDE analysis · residual risks · response plan |
docs/manual-smoke-test.md |
Step-by-step checklist for verifying integration with Claude Code |
🧱 Project Layout
src/dbread/
├── server.py # MCP stdio entry — registers 5 tools
├── tools.py # tool handlers (guard → limit → rate → exec → audit)
├── sql_guard.py # sqlglot AST validator + LIMIT injection
├── rate_limiter.py # thread-safe token bucket per connection
├── connections.py # SQLAlchemy engine manager (lazy, per-dialect)
├── config.py # pydantic Settings (YAML + env)
└── audit.py # append-only JSONL with size rotation
Every source file is under 200 LOC — designed to be readable end-to-end.
🙏 Credits
Built with mcp · sqlglot · SQLAlchemy 2.x · pydantic · uv.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file dbread-0.2.2.tar.gz.
File metadata
- Download URL: dbread-0.2.2.tar.gz
- Upload date:
- Size: 159.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.9.27 {"installer":{"name":"uv","version":"0.9.27","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":null,"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
005ddedace96e69e8880d1b280c08f3e4d7bd048eb635ac5f2d5a45b08d04fce
|
|
| MD5 |
4130d0c19e4bc85744a620357f8c5e9a
|
|
| BLAKE2b-256 |
33e920341388a8f02e29b9bee2d97479a50740984ba19a4d8e1611c0c15e5283
|
File details
Details for the file dbread-0.2.2-py3-none-any.whl.
File metadata
- Download URL: dbread-0.2.2-py3-none-any.whl
- Upload date:
- Size: 20.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.9.27 {"installer":{"name":"uv","version":"0.9.27","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":null,"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
2cb1921995a355c5da83fcc382ff76042de16cce93cf7fa17817c1a744e3c6a2
|
|
| MD5 |
ec9ca439496a017c9e0f687ab4fff26b
|
|
| BLAKE2b-256 |
bf25d4f29ccc78f94d92c6a9778b75fb1d0f52d198d6d0d91b55a2e81d261a58
|