SQL analytics for AI systems — query LLM responses, agent traces, and RAG pipelines like a database

Maintainers

sadayamuthu

Project description

anySQL

SQL Analytics for AI Systems

From vibes to queries.

Quick Start · How It Works · 5 Use Cases · Installation · CLI Usage · Examples

What is anySQL?

anySQL is an open-source SQL analytics engine for AI systems. It lets engineers query LLM responses, agent traces, and RAG pipelines with standard SQL. Queries run on an in-memory DuckDB engine, data is persisted to SQLite, and no configuration is required.

AI engineers debug with print() statements, JSON log files, and pre-built dashboards that show what the tool designer thought you'd want to see. What's missing is raw SQL over normalized AI telemetry data — specifically the cross-layer JOIN that lets you ask whether your RAG pipeline is failing at retrieval or generation.

Quick Start

# Install
pip install anysql

# Install with provider support
pip install "anysql[openai]"
pip install "anysql[anthropic]"
pip install "anysql[all]"        # OpenAI + Anthropic + LangChain

import anthropic
import openai
import anysql

# Initialize (in-memory by default, or pass a file path for persistence)
db = anysql.init()

# Wrap your OpenAI client; all calls are auto-logged
openai_client = anysql.openai(openai.OpenAI())

# Wrap your Anthropic client (separate name so it does not overwrite the OpenAI one)
claude_client = anysql.claude(anthropic.Anthropic())

# Tag pipeline runs for cost attribution
@anysql.context(feature="search", version="v2")
def run_search(query):
    ...

# Query anything with standard SQL
df = db.query("SELECT model, AVG(cost_usd) FROM llm_responses GROUP BY model")

# Or use built-in analytics methods
df = db.model_comparison()       # UC1: multi-model comparison
df = db.prompt_regressions()     # UC2: regression detection
df = db.cost_by_feature()        # UC3: cost attribution
df = db.tool_failure_rates()     # UC4: agent debugging
df = db.rag_failure_modes()      # UC5: RAG forensics

How It Works

User Code
    │
    ├── @anysql.context(feature="x")     ← Python contextvars, sync+async safe
    ├── OpenAI/Claude wrapped client      ← transparent proxy, one-line swap
    ├── AgentTracer (LangChain callback)  ← manual or callback-based
    └── RAGTracer.after_retrieval()       ← auto-detects LangChain/LlamaIndex/dict
              │
              ▼ insert()
    AnySQL engine
    ├── in-memory buffer (dict lists per table)
    ├── SQLite persistence (JSON blobs, cross-session)
    └── DuckDB (Arrow views, SQL at query time)
              │
              ▼ query()
    6 PyArrow tables as DuckDB views:
    llm_responses, eval_results, pipeline_runs,
    agent_tool_calls, agent_trace, rag_chunks
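The flow in the diagram can be approximated with the standard library alone. The sketch below is an illustration under stated assumptions, not anySQL's actual code: it buffers rows as dicts, persists them as JSON blobs in SQLite, and only gives them columns when they are read back. The `TinyStore` class and the `rows` table layout are hypothetical, and plain Python tuples stand in for the Arrow tables and DuckDB views.

```python
import json
import sqlite3


class TinyStore:
    """Hypothetical stand-in for the buffer -> SQLite -> query flow."""

    def __init__(self, path=":memory:"):
        self.buffer = {}  # table name -> list of row dicts
        self.conn = sqlite3.connect(path)
        self.conn.execute(
            "CREATE TABLE IF NOT EXISTS rows (table_name TEXT, payload TEXT)"
        )

    def insert(self, table, row):
        # Keep the row in memory and persist the raw JSON blob.
        self.buffer.setdefault(table, []).append(row)
        self.conn.execute(
            "INSERT INTO rows VALUES (?, ?)", (table, json.dumps(row))
        )
        self.conn.commit()

    def load(self, table, columns):
        # Columns are applied when reading, not when writing:
        # keys missing from a blob come back as None.
        cur = self.conn.execute(
            "SELECT payload FROM rows WHERE table_name = ? ORDER BY rowid",
            (table,),
        )
        return [
            tuple(json.loads(payload).get(c) for c in columns)
            for (payload,) in cur
        ]


store = TinyStore()
store.insert("llm_responses", {"model": "gpt-4o", "cost_usd": 0.012})
store.insert("llm_responses", {"model": "gpt-4o-mini"})  # no cost recorded
rows = store.load("llm_responses", ["model", "cost_usd"])
print(rows)
```

Because the blob is stored untouched, a row with a missing field is still accepted at write time and only shows up as a gap when the columns are projected.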

Key design decisions:

- Schema enforcement happens at the Arrow layer: SQLite stores raw JSON, and validation happens at query time.
- Dot-namespace tables (llm.responses) map to flat SQL view names (llm_responses).
- Contextvars provide thread-safe and async-safe tagging, so no manual pass-through is required.
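To illustrate the contextvars point, here is a minimal sketch of how a tagging decorator could work for both sync and async functions. It is not anySQL's implementation: the `_tags` variable, the `context` decorator body, and the `current_tags` helper are assumptions made for this example, and only the standard library is used.

```python
import asyncio
import contextvars
import functools
import inspect

# Hypothetical module-level variable holding the active tags.
_tags = contextvars.ContextVar("tags", default={})


def context(**tags):
    """Decorator that merges `tags` into the active context for one call."""

    def decorate(func):
        if inspect.iscoroutinefunction(func):
            @functools.wraps(func)
            async def async_wrapper(*args, **kwargs):
                token = _tags.set({**_tags.get(), **tags})
                try:
                    return await func(*args, **kwargs)
                finally:
                    _tags.reset(token)  # restore the previous tags
            return async_wrapper

        @functools.wraps(func)
        def wrapper(*args, **kwargs):
            token = _tags.set({**_tags.get(), **tags})
            try:
                return func(*args, **kwargs)
            finally:
                _tags.reset(token)  # restore the previous tags
        return wrapper

    return decorate


def current_tags():
    # A logging layer would call this when it writes a row.
    return dict(_tags.get())


@context(feature="search", version="v2")
def run_search(query):
    return current_tags()


@context(feature="chat")
async def run_chat(prompt):
    await asyncio.sleep(0)  # yield to the event loop; tags survive the await
    return current_tags()


search_tags = run_search("hello")
chat_tags = asyncio.run(run_chat("hi"))
outside_tags = current_tags()
```

Code called inside the decorated function can read the tags without receiving them as arguments, and the tags are gone again once the call returns.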

The 6 Canonical Tables

| Table | Use Cases | Join Keys |
| --- | --- | --- |
| `llm_responses` | UC1, UC2 | `response_id` |
| `eval_results` | UC1, UC2, UC5 | `response_id`, `run_id`, `query_id` |
| `pipeline_runs` | UC3 | `run_id`, `session_id` |
| `agent_tool_calls` | UC4 | `session_id` |
| `agent_trace` | UC4 | `session_id` |
| `rag_chunks` | UC5 | `query_id` ← cross-layer join key |
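As an illustration of how these join keys are meant to be used, the sketch below joins `llm_responses` to `eval_results` on `response_id` to compare models. It runs on the standard-library `sqlite3` module as a stand-in for DuckDB so that it needs no extra packages. The sample rows are made up, and the column set is an assumption: `model` and `cost_usd` appear in the Quick Start query, while `score` and `passed` are borrowed from the `record_eval()` arguments.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE llm_responses (response_id TEXT, model TEXT, cost_usd REAL);
    CREATE TABLE eval_results (response_id TEXT, score REAL, passed INTEGER);

    -- Made-up sample rows for illustration only.
    INSERT INTO llm_responses VALUES
        ('r1', 'gpt-4o', 0.010),
        ('r2', 'gpt-4o', 0.030),
        ('r3', 'gpt-4o-mini', 0.002);
    INSERT INTO eval_results VALUES
        ('r1', 1.0, 1),
        ('r2', 0.5, 1),
        ('r3', 0.5, 0);
    """
)

# One row per model: number of evaluated responses, mean score, pass count.
rows = conn.execute(
    """
    SELECT r.model,
           COUNT(*)      AS n,
           AVG(e.score)  AS avg_score,
           SUM(e.passed) AS n_passed
    FROM llm_responses AS r
    JOIN eval_results  AS e ON e.response_id = r.response_id
    GROUP BY r.model
    ORDER BY r.model
    """
).fetchall()

for row in rows:
    print(row)
```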

The 5 Use Cases

| UC | Name | Key Methods | What It Answers |
| --- | --- | --- | --- |
| UC1 | Multi-Model Comparison | `model_comparison()`, `model_by_task()` | Which model performs best on my task? |
| UC2 | Prompt Regression Detection | `prompt_regressions()`, `eval_debt()`, `silent_degradation()` | Did my last prompt change break something? |
| UC3 | Cost Attribution | `cost_by_feature()`, `cost_anomalies()` | Which feature is burning my LLM budget? |
| UC4 | Agent Debugging | `tool_failure_rates()`, `loop_detector()`, `session_diff()`, `human_intervention_points()` | Where is my agent getting stuck? |
| UC5 | RAG Forensics | `rag_failure_modes()`, `chunk_quality_ranking()`, `similarity_calibration()` | Is my RAG failing at retrieval or generation? |

The cross-layer join (UC5) is the central feature: `query_id` links RAG retrieval rows to eval results, which makes it possible to classify each failure as a retrieval failure or a generation failure.
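One way to picture the cross-layer join is the sketch below, which classifies each failed query by whether any retrieved chunk looked relevant. It is not the logic behind `rag_failure_modes()`: the `similarity` column, the 0.7 threshold, the classification rule, and the sample rows are all assumptions made for this example, and `sqlite3` stands in for DuckDB.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE rag_chunks (query_id TEXT, chunk_id TEXT, similarity REAL);
    CREATE TABLE eval_results (query_id TEXT, passed INTEGER);

    -- Made-up sample rows for illustration only.
    INSERT INTO rag_chunks VALUES
        ('q1', 'c1', 0.91), ('q1', 'c2', 0.85),
        ('q2', 'c3', 0.88), ('q2', 'c4', 0.40),
        ('q3', 'c5', 0.22), ('q3', 'c6', 0.31);
    INSERT INTO eval_results VALUES ('q1', 1), ('q2', 0), ('q3', 0);
    """
)

THRESHOLD = 0.7  # assumed cut-off for "a relevant chunk was retrieved"

# Failed eval + good chunk available  -> the generator is to blame.
# Failed eval + no good chunk at all  -> retrieval is to blame.
rows = conn.execute(
    """
    SELECT e.query_id,
           CASE
               WHEN e.passed = 1 THEN 'ok'
               WHEN MAX(c.similarity) >= ? THEN 'generation_failure'
               ELSE 'retrieval_failure'
           END AS failure_mode
    FROM eval_results AS e
    JOIN rag_chunks   AS c ON c.query_id = e.query_id
    GROUP BY e.query_id, e.passed
    ORDER BY e.query_id
    """,
    (THRESHOLD,),
).fetchall()

for row in rows:
    print(row)
```

Without a shared `query_id`, the eval result and the retrieved chunks would live in separate logs and this distinction could not be made in a single query.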

Installation

From PyPI

pip install anysql

Provider extras

pip install "anysql[openai]"      # + openai>=1.0.0
pip install "anysql[anthropic]"   # + anthropic>=0.25.0
pip install "anysql[langchain]"   # + langchain>=0.2.0
pip install "anysql[all]"         # everything

CLI Usage

# Run a SQL query against a persisted database
anysql query "SELECT model, COUNT(*) FROM llm_responses GROUP BY model"

# Show table row counts and basic stats
anysql stats

# Query a specific database file
anysql query "SELECT * FROM eval_results LIMIT 10" --db ./myproject.db

Examples

Three runnable demos are included in examples/. Each one detects missing API keys and falls back to mock mode, so no downloads are required.

| Demo | Dataset | Models |
| --- | --- | --- |
| `realtime_openai_demo.py` | BBC News (2004–05), 12 articles | `gpt-4o`, `gpt-4o-mini` |
| `realtime_claude_demo.py` | AG News, 15 articles | `claude-sonnet-4-6`, `claude-haiku-4-5` |
| `realtime_combined_demo.py` | Reuters R8, 20 articles | All 4 models head-to-head |

# Run combined demo (works without API keys)
python examples/realtime_combined_demo.py

Adapter Usage

OpenAI

import openai
import anysql

db = anysql.init()
client = anysql.openai(openai.OpenAI())

# All calls now logged automatically
response = client.chat.completions.create(
    model="gpt-4o",
    messages=[{"role": "user", "content": "Summarize this article..."}]
)

Anthropic

import anthropic
import anysql

db = anysql.init()
client = anysql.claude(anthropic.Anthropic())

response = client.messages.create(
    model="claude-sonnet-4-6",
    max_tokens=1024,
    messages=[{"role": "user", "content": "Classify this text..."}]
)
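The "transparent proxy" idea behind both adapters can be sketched in a few lines. The code below is a generic illustration, not the code in anySQL's adapters: `LoggingProxy`, the log record fields, and the `FakeClient` stub are hypothetical, and the stub replaces a real SDK client so the example runs without network access or API keys.

```python
import time


class LoggingProxy:
    """Forward attribute access to `target` and record calls made through it."""

    def __init__(self, target, log, path=""):
        self._target = target
        self._log = log
        self._path = path

    def __getattr__(self, name):
        # Only reached for names not defined on the proxy itself.
        attr = getattr(self._target, name)
        path = f"{self._path}.{name}" if self._path else name
        if callable(attr):
            def logged(*args, **kwargs):
                start = time.perf_counter()
                result = attr(*args, **kwargs)
                self._log.append({
                    "call": path,
                    "kwargs": kwargs,
                    "latency_ms": (time.perf_counter() - start) * 1000,
                })
                return result
            return logged
        # Simplification: every non-callable attribute is treated as a
        # nested namespace (e.g. client.chat.completions) and wrapped too.
        return LoggingProxy(attr, self._log, path)


# Fake client standing in for a real SDK object.
class _Completions:
    def create(self, model, messages):
        return {"model": model, "text": "stub reply"}


class _Chat:
    completions = _Completions()


class FakeClient:
    chat = _Chat()


log = []
client = LoggingProxy(FakeClient(), log)
response = client.chat.completions.create(
    model="gpt-4o", messages=[{"role": "user", "content": "hi"}]
)
print(response, log[0]["call"])
```

The calling code is unchanged apart from the one line that wraps the client, which is what makes the swap a one-liner.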

Agent Tracing

tracer = anysql.agent_tracer()

# Manual tracing
tracer.trace_tool_call(
    session_id="sess-001",
    tool_name="web_search",
    input_data={"query": "latest news"},
    output_data={"results": [...]},
    success=True,
    latency_ms=320,
)

# LangChain callback (automatic)
from langchain_openai import ChatOpenAI
llm = ChatOpenAI(callbacks=[tracer])
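Once tool calls are recorded, a per-tool failure rate is a plain aggregate. The sketch below shows the kind of query that could sit behind a method like `tool_failure_rates()`, without claiming to match it. The column names are taken from the `trace_tool_call()` arguments above but the actual schema is an assumption, the rows are made up, and `sqlite3` stands in for DuckDB.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE agent_tool_calls "
    "(session_id TEXT, tool_name TEXT, success INTEGER, latency_ms REAL)"
)
# Made-up sample rows for illustration only.
conn.executemany(
    "INSERT INTO agent_tool_calls VALUES (?, ?, ?, ?)",
    [
        ("sess-001", "web_search", 1, 320),
        ("sess-001", "web_search", 0, 5000),
        ("sess-001", "calculator", 1, 12),
        ("sess-002", "web_search", 0, 4800),
        ("sess-002", "calculator", 1, 8),
    ],
)

# success is stored as 0/1, so (1 - success) counts failures.
rows = conn.execute(
    """
    SELECT tool_name,
           COUNT(*)                   AS calls,
           SUM(1 - success)           AS failures,
           ROUND(AVG(1 - success), 2) AS failure_rate
    FROM agent_tool_calls
    GROUP BY tool_name
    ORDER BY failure_rate DESC, tool_name
    """
).fetchall()

for row in rows:
    print(row)
```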

RAG Tracing

rag = anysql.rag_tracer()

# `retriever` is assumed to be an existing LangChain retriever
query = "What is anySQL?"
query_id = rag.before_retrieval(query=query)
chunks = retriever.get_relevant_documents(query)
rag.after_retrieval(query_id=query_id, chunks=chunks)

# Record eval result with cross-layer join key
rag.record_eval(
    query_id=query_id,
    score=0.92,
    passed=True,
    eval_type="faithfulness",
)

Development

pip install -e ".[dev]"

pytest tests/ -v           # Run tests
pytest tests/ --tb=short   # Short failure output
ruff check anysql/         # Lint
ruff format anysql/        # Format

Repository Structure

anysql/
├── anysql/
│   ├── __init__.py        # Public API surface
│   ├── engine.py          # DuckDB engine + UC analytics methods
│   ├── schema.py          # 6 PyArrow schemas
│   ├── storage.py         # SQLite persistence
│   ├── context.py         # @context decorator + context_scope()
│   ├── cli.py             # CLI entry point
│   ├── adapters/
│   │   ├── openai.py      # OpenAI transparent proxy
│   │   ├── claude.py      # Anthropic transparent proxy
│   │   └── generic.py     # Generic JSON/dict adapter
│   └── tracers/
│       ├── agent.py       # AgentTracer (manual + LangChain)
│       └── rag.py         # RAGTracer (LangChain/LlamaIndex/dict)
├── tests/                 # 94 tests, all passing
├── examples/              # 3 runnable demos
└── docs/
    └── QUERIES.md         # Canonical SQL query library

License

Apache 2.0

anySQL is managed by OpenAstra · anysql.org

This version

0.1.0

Mar 10, 2026

Download files

Source Distribution

anysql_sdk-0.1.0.tar.gz (72.7 kB view details)

Uploaded Mar 10, 2026 Source

Built Distribution

anysql_sdk-0.1.0-py3-none-any.whl (22.5 kB view details)

Uploaded Mar 10, 2026 Python 3

File details

Details for the file anysql_sdk-0.1.0.tar.gz.

File metadata

Download URL: anysql_sdk-0.1.0.tar.gz
Upload date: Mar 10, 2026
Size: 72.7 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for anysql_sdk-0.1.0.tar.gz
| Algorithm | Hash digest |
| --- | --- |
| SHA256 | `e217b40569d352d86922b2bcf8b41159ff7579a9f8b3e545fd3feb0d0ef652d6` |
| MD5 | `ba1f3c1f1a22f8d774d5ea03b32b501b` |
| BLAKE2b-256 | `7b57dd46798f8aaabd10312b1913935c7737aeced91b4a82be21bbc3a4f7e665` |

Provenance

The following attestation bundles were made for anysql_sdk-0.1.0.tar.gz:

Publisher: release.yml on sadayamuthu/anySQL

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: anysql_sdk-0.1.0.tar.gz
- Subject digest: e217b40569d352d86922b2bcf8b41159ff7579a9f8b3e545fd3feb0d0ef652d6
- Sigstore transparency entry: 1070783366
- Sigstore integration time: Mar 10, 2026
Source repository:
- Permalink: sadayamuthu/anySQL@59431f038d9591983c6e584bc6af4d397a389686
- Branch / Tag: refs/heads/main
- Owner: https://github.com/sadayamuthu
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@59431f038d9591983c6e584bc6af4d397a389686
- Trigger Event: push

File details

Details for the file anysql_sdk-0.1.0-py3-none-any.whl.

File metadata

Download URL: anysql_sdk-0.1.0-py3-none-any.whl
Upload date: Mar 10, 2026
Size: 22.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for anysql_sdk-0.1.0-py3-none-any.whl
| Algorithm | Hash digest |
| --- | --- |
| SHA256 | `f382bd71e7d0c9a6b4dd06dd4047b9dd571284a628b162678488dbfcbf2e91b9` |
| MD5 | `6221169cd065b59551e49d8daedd705f` |
| BLAKE2b-256 | `006644151c6998641698d1feb5a6ab3ae195f3962edfdce04b4db1839fafab55` |

Provenance

The following attestation bundles were made for anysql_sdk-0.1.0-py3-none-any.whl:

Publisher: release.yml on sadayamuthu/anySQL

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: anysql_sdk-0.1.0-py3-none-any.whl
- Subject digest: f382bd71e7d0c9a6b4dd06dd4047b9dd571284a628b162678488dbfcbf2e91b9
- Sigstore transparency entry: 1070783430
- Sigstore integration time: Mar 10, 2026
Source repository:
- Permalink: sadayamuthu/anySQL@59431f038d9591983c6e584bc6af4d397a389686
- Branch / Tag: refs/heads/main
- Owner: https://github.com/sadayamuthu
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@59431f038d9591983c6e584bc6af4d397a389686
- Trigger Event: push

anysql-sdk 0.1.0
