Maintainers

patrick.roebuck2

attune-rag

Lightweight, LLM-agnostic RAG pipeline with pluggable corpora. Works with Claude, Gemini, or any LLM.

No LLM SDK at install time. All provider deps are optional extras.
Pluggable corpus. Use attune-help (the default), any markdown directory, or your own CorpusProtocol.
Returns a prompt string by default — send it to whatever LLM you like. Optional provider adapters ship convenience wrappers.
Optional hybrid retrieval. QueryExpander and LLMReranker layer Claude Haiku on top of keyword retrieval to improve recall and precision — both opt-in, both fail-safe.

Install

pip install attune-rag                     # core only
pip install 'attune-rag[attune-help]'      # + bundled help corpus
pip install 'attune-rag[claude]'           # + Claude adapter
pip install 'attune-rag[gemini]'           # + Gemini adapter
pip install 'attune-rag[all]'              # everything

Quick start — Claude

pip install 'attune-rag[attune-help,claude]'

import asyncio
from attune_rag import RagPipeline

async def main():
    pipeline = RagPipeline()  # defaults to AttuneHelpCorpus
    response, result = await pipeline.run_and_generate(
        "How do I run a security audit with attune?",
        provider="claude",
    )
    print(response)
    print("\nSources:", [h.entry.path for h in result.citation.hits])

asyncio.run(main())

Quick start — Gemini

pip install 'attune-rag[attune-help,gemini]'

# Run inside an async function, with `pipeline = RagPipeline()` as in the Claude quick start.
response, result = await pipeline.run_and_generate(
    "...", provider="gemini", model="gemini-1.5-pro",
)

Quick start — custom corpus, any LLM

from pathlib import Path
from attune_rag import RagPipeline, DirectoryCorpus

pipeline = RagPipeline(corpus=DirectoryCorpus(Path("./my-docs")))
result = pipeline.run("How do I...?")

# Send result.augmented_prompt to whatever LLM you use.
# The pipeline itself does NOT call an LLM unless you use
# run_and_generate or call a provider adapter yourself.
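
To make the "prompt string, no LLM call" idea concrete, here is a minimal, self-contained sketch of keyword retrieval followed by prompt assembly. It is not attune-rag's implementation: `tokenize`, `retrieve`, `build_prompt`, the scoring rule, and the prompt wording are all hypothetical and only illustrate the general shape.

```python
import re


def tokenize(text: str) -> set[str]:
    """Lowercase a string and split it into a set of alphanumeric tokens."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def retrieve(query: str, docs: dict[str, str], k: int = 3) -> list[str]:
    """Return up to k doc paths ranked by shared-token count (assumed scoring)."""
    q = tokenize(query)
    scored = [(len(q & tokenize(body)), path) for path, body in docs.items()]
    # Drop docs with no shared tokens; highest overlap first, ties broken by path.
    ranked = sorted((s for s in scored if s[0] > 0), key=lambda s: (-s[0], s[1]))
    return [path for _, path in ranked[:k]]


def build_prompt(query: str, docs: dict[str, str], paths: list[str]) -> str:
    """Join the retrieved passages and the question into one prompt string."""
    context = "\n\n".join(f"[{p}]\n{docs[p]}" for p in paths)
    return f"Answer using only the context below.\n\n{context}\n\nQuestion: {query}"


# Toy corpus, invented for this example.
docs = {
    "audit.md": "Run a security audit with the audit command.",
    "install.md": "Install the package with pip.",
}
question = "How do I run a security audit?"
hits = retrieve(question, docs)
prompt = build_prompt(question, docs, hits)
```

The resulting `prompt` is an ordinary string, so it can be passed to any LLM client unchanged.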

Hybrid retrieval (optional)

QueryExpander and LLMReranker require the [claude] extra and an ANTHROPIC_API_KEY. Both are opt-in and fail-safe — any API error falls back to keyword-only order automatically.

from attune_rag import RagPipeline, LLMReranker, QueryExpander

# Reranker only (recommended for precision):
pipeline = RagPipeline(reranker=LLMReranker())

# Expander + reranker (max coverage):
pipeline = RagPipeline(
    expander=QueryExpander(),
    reranker=LLMReranker(),
)
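
The fail-safe behaviour described above can be pictured with a small wrapper. This is a sketch under assumptions, not the library's code: `rerank_fail_safe` and `broken_reranker` are hypothetical names, and the extra check that the reranker returns the same set of hits is an added safeguard that the text does not mention.

```python
from typing import Callable, Sequence


def rerank_fail_safe(
    hits: Sequence[str],
    rerank: Callable[[Sequence[str]], list[str]],
) -> list[str]:
    """Apply rerank(hits); on any error return the original keyword order."""
    try:
        reordered = rerank(hits)
    except Exception:
        # Any failure (network, auth, parsing) falls back to keyword order.
        return list(hits)
    # Assumed extra guard: reject output that drops or invents hits.
    if sorted(reordered) != sorted(hits):
        return list(hits)
    return reordered


def broken_reranker(hits: Sequence[str]) -> list[str]:
    """Stand-in for an LLM call that fails."""
    raise TimeoutError("simulated API failure")


keyword_order = ["a.md", "b.md", "c.md"]
safe = rerank_fail_safe(keyword_order, broken_reranker)
reversed_order = rerank_fail_safe(keyword_order, lambda h: list(reversed(h)))
```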

Template editor primitives (`attune_rag.editor`)

Headless toolkit for tools that need to validate, lint, and refactor a template corpus — used by the attune-gui template editor and the attune-author edit CLI, but works standalone with any CorpusProtocol.

API	What it does
`load_schema()`	Loads `template_schema.json` (the v1 frontmatter contract: required `type` enum + `name`; optional `tags`, `aliases`, `summary`, `source`, `hash`; `additionalProperties: true`).
`parse_frontmatter(text)` / `validate_frontmatter(data)`	Split a template into frontmatter + body and report typed `FrontmatterIssue`s — used by linters and editors.
`lint_template(text, rel_path, corpus)`	Returns `Diagnostic[]` for schema violations, broken `[[alias]]` references, and depth-marker sequence errors. 1-indexed line/col ranges.
`autocomplete_tags(corpus, prefix, limit)` / `autocomplete_aliases(corpus, prefix, limit)`	Prefix-match completions ranked by frequency (tags) or lexical proximity (aliases). Sub-ms on 1k templates.
`find_references(corpus, name, kind)`	Locate every alias/tag/path occurrence across body, frontmatter, and `cross_links.json`.
`plan_rename(corpus, old, new, kind)`	Build a `RenamePlan` (one `FileEdit` per affected file with unified-diff hunks) for `kind="alias"` or `"tag"`. Raises `RenameCollisionError` on existing alias targets.
`apply_rename(corpus, plan)`	Atomically apply the plan (tempfile-per-file + sequential rename + drift-detection rollback). Returns the list of affected paths.
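
As an illustration of "prefix-match completions ranked by frequency", the sketch below counts tags across templates and returns the most common ones that start with a prefix. `suggest_tags` and its input shape (a list of tag lists) are hypothetical; the real `autocomplete_tags` takes a corpus and may rank ties differently.

```python
from collections import Counter


def suggest_tags(tags_per_template: list[list[str]], prefix: str, limit: int = 5) -> list[str]:
    """Return up to `limit` tags starting with `prefix`, most frequent first."""
    counts = Counter(tag for tags in tags_per_template for tag in tags)
    matches = [(tag, n) for tag, n in counts.items() if tag.startswith(prefix)]
    # Higher count first; alphabetical order breaks ties (an assumption).
    matches.sort(key=lambda m: (-m[1], m[0]))
    return [tag for tag, _ in matches[:limit]]


# Invented tag data for illustration.
templates = [["security", "audit"], ["security", "setup"], ["search"], ["security"]]
suggestions = suggest_tags(templates, "se")
```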

Schema validation, linting, and rename planning are pure functions over CorpusProtocol: no I/O, no global state. The exception is apply_rename, which writes the planned edits to disk. All three pieces are tested as a unit and used live by the attune-gui editor's /api/corpus/<id>/lint, /autocomplete, and /refactor/rename/{preview,apply} routes.

from pathlib import Path

from attune_rag import DirectoryCorpus
from attune_rag.editor import lint_template, plan_rename, apply_rename

corpus = DirectoryCorpus(Path("./templates")).load()

# Validate a template before saving
diagnostics = lint_template(
    text=Path("./templates/concepts/foo.md").read_text(),
    rel_path="concepts/foo.md",
    corpus=corpus,
)

# Rename an alias across the whole corpus
plan = plan_rename(corpus, old="oldname", new="newname", kind="alias")
print(f"Affects {len(plan.edits)} files")
affected = apply_rename(corpus, plan)
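
The "tempfile-per-file + rename" step mentioned for `apply_rename` follows a common pattern for replacing a file without leaving it half-written. The sketch below shows that pattern for a single file only; `atomic_write` is a hypothetical helper, and the drift detection and multi-file rollback that the library describes are not reproduced here.

```python
import os
import tempfile
from pathlib import Path


def atomic_write(path: Path, new_text: str) -> None:
    """Write new_text to a temp file next to `path`, then rename it over `path`."""
    fd, tmp_name = tempfile.mkstemp(dir=path.parent, suffix=".tmp")
    try:
        with os.fdopen(fd, "w", encoding="utf-8") as handle:
            handle.write(new_text)
        # os.replace swaps the file in one step when both paths share a filesystem.
        os.replace(tmp_name, path)
    except BaseException:
        # Leave the original untouched and remove the temp file if it still exists.
        if os.path.exists(tmp_name):
            os.unlink(tmp_name)
        raise


with tempfile.TemporaryDirectory() as workdir:
    target = Path(workdir) / "foo.md"
    target.write_text("see [[oldname]]", encoding="utf-8")
    updated = target.read_text(encoding="utf-8").replace("[[oldname]]", "[[newname]]")
    atomic_write(target, updated)
    result_text = target.read_text(encoding="utf-8")
    leftovers = [p.name for p in Path(workdir).iterdir() if p.suffix == ".tmp"]
```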

Dashboard

attune-rag dashboard show    # live terminal dashboard
attune-rag dashboard render --out report.html  # HTML snapshot

Quality baseline

Every PR is gated against a locked retrieval + faithfulness baseline. Thresholds are set at mean − 2σ per metric, measured from 20 back-to-back benchmark runs on an unchanged HEAD — empirically grounded rather than guessed.

Metric	Threshold (current)	Source
`precision_at_1`	0.95	retrieval, deterministic
`recall_at_3`	1.00	retrieval, deterministic
`mean_faithfulness`	0.9686	Claude judge, σ ≈ 0.005
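
The mean − 2σ rule can be worked through in a few lines. The run scores below are made up for illustration (they are not the project's 20 measured runs), `gate_threshold` and `passes_gate` are hypothetical names, and using the sample standard deviation is an assumption, since the text does not say which estimator was used.

```python
import statistics


def gate_threshold(scores: list[float]) -> float:
    """Return mean minus two sample standard deviations of the scores."""
    mean = statistics.fmean(scores)
    sigma = statistics.stdev(scores)  # sample standard deviation (assumed)
    return mean - 2 * sigma


def passes_gate(score: float, threshold: float) -> bool:
    """A run passes when its metric is at or above the threshold."""
    return score >= threshold


runs = [0.98, 0.97, 0.99, 0.98, 0.98]  # invented values, not real benchmark data
threshold = gate_threshold(runs)
```

Setting the bar two standard deviations below the mean leaves room for ordinary run-to-run noise while still flagging a drop that falls clearly outside it.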

The CI workflow at .github/workflows/benchmark.yml runs the benchmark on every PR. Faithfulness gating engages when the PR touches retrieval, reranker, expander, pipeline, prompts, or eval paths, or when the PR title contains [full-bench]. Methodology, raw numbers, and the re-measurement procedure live under docs/specs/release-quality-baseline/.
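
The gating condition reads as a simple predicate over the changed files and the PR title. The sketch below is only a paraphrase of that rule in Python: the real logic lives in the workflow file, and both `faithfulness_gate_engaged` and the substring matching on path names are assumptions.

```python
GATED_AREAS = ("retrieval", "reranker", "expander", "pipeline", "prompts", "eval")


def faithfulness_gate_engaged(changed_files: list[str], pr_title: str) -> bool:
    """Return True when the PR should run the faithfulness gate."""
    if "[full-bench]" in pr_title:
        return True
    # Assumed matching rule: any gated area name appearing in a changed path.
    return any(area in path for path in changed_files for area in GATED_AREAS)


docs_only = faithfulness_gate_engaged(["docs/guide.md"], "Fix typo")
touches_reranker = faithfulness_gate_engaged(["src/attune_rag/reranker.py"], "Tune reranker")
forced = faithfulness_gate_engaged(["docs/guide.md"], "Docs pass [full-bench]")
```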

Roadmap — embeddings (next minor release)

Keyword retrieval + optional Claude reranker currently carry attune-rag past 87% P@1 on the attune-help golden set. The remaining misses are queries with zero token overlap against their target doc (e.g. "vulnerability scan" → tool-security-audit.md). Closing that gap needs vector search.
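
The zero-overlap case is easy to reproduce. The sketch below tokenizes the example query and, as a stand-in for the document text (which is not shown here), the target file name; `tokens` is a hypothetical helper and not the library's tokenizer.

```python
import re


def tokens(text: str) -> set[str]:
    """Lowercase a string and split it into a set of alphanumeric tokens."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


query = "vulnerability scan"
target = "tool-security-audit.md"
shared = tokens(query) & tokens(target)  # tokens present in both
```

With no shared tokens, a purely lexical scorer has nothing to rank on, which is the gap that embeddings are meant to close.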

The next minor release will ship attune-rag[embeddings], using fastembed for local, CPU-only embeddings: no new network dependency and no API key required at retrieval time. Keyword retrieval stays the default; embeddings are layered in as an opt-in, in the same way as QueryExpander and LLMReranker.

See CHANGELOG.md for the decision record and remaining-gap analysis.

Prompt caching (Claude only)

When using the Claude provider, run_and_generate automatically enables Anthropic prompt caching on the stable RAG context prefix (≥ 1,024 chars). This reduces repeated token costs on the corpus portion of the prompt when the same context block is reused across calls, because cached input is billed at a lower rate than uncached input.

No configuration needed: the provider sets the cache_control field on the request automatically.
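
For orientation, Anthropic's Messages API accepts `cache_control` as a field on a content block, for example on a text block in the `system` list. The sketch below builds such a block and applies the 1,024-character threshold quoted above. `build_system_blocks` is a hypothetical helper; how attune-rag's Claude provider actually assembles the request is not shown here.

```python
MIN_CACHEABLE_CHARS = 1024  # threshold quoted in the text above


def build_system_blocks(rag_context: str) -> list[dict]:
    """Return a `system` block list, marking long contexts as cacheable."""
    block: dict = {"type": "text", "text": rag_context}
    if len(rag_context) >= MIN_CACHEABLE_CHARS:
        # Ask the API to cache the prompt prefix up to and including this block.
        block["cache_control"] = {"type": "ephemeral"}
    return [block]


short_blocks = build_system_blocks("tiny context")
long_blocks = build_system_blocks("x" * 2000)
```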

Public API

attune-rag's public surface is documented below and snapshot-tested in tests/unit/test_api_surface.py. Formal SemVer commitments begin with the 0.2.0 release — see docs/POLICY.md for the deprecation policy. Until then the surface is honor-system: the lock test catches drift, but treat 0.1.x as still-evolving.

Top-level (from attune_rag import ...):

Pipeline — RagPipeline, RagResult
Corpus — CorpusProtocol, RetrievalEntry, DirectoryCorpus, AttuneHelpCorpus
Retrieval — KeywordRetriever, RetrievalHit, RetrieverProtocol
Provenance — CitationRecord, CitedSource, ClaimCitation, format_citations_markdown, format_claim_citations_markdown
Prompting — build_augmented_prompt, PROMPT_VARIANTS
Hybrid retrieval — QueryExpander, LLMReranker

PUBLIC submodules (importable by qualified path):

attune_rag.corpus — exposes AliasInfo, DuplicateAliasError in addition to the top-level corpus names
attune_rag.corpus.attune_help — AttuneHelpCorpus
attune_rag.corpus.help_adapter — HelpCorpusAdapter Protocol
attune_rag.providers — LLMProvider, get_provider, list_available
attune_rag.editor — template-editor primitives (lint, schema, rename, autocomplete, references); see "Template editor primitives" above for the symbol list
attune_rag.editor.{rename,schema,lint,autocomplete,references} — the individual editor submodules

Anything not listed above is INTERNAL and may change in any release. The underscore-prefixed editor modules (attune_rag.editor._rename etc.) shipped in 0.1.x are deprecation shims as of 0.2.0; they re-export the new non-underscore names and emit DeprecationWarning. They will be removed in 0.3.0.
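
A deprecation shim of this kind usually forwards to the new name and warns on use. The sketch below shows the idea at function level, with hypothetical names throughout (`make_shim`, `plan_rename_new`, `legacy_plan_rename`); the library's shims are modules, and their exact mechanism is not shown here.

```python
import warnings
from typing import Any, Callable


def plan_rename_new(old: str, new: str) -> tuple[str, str]:
    """Stand-in for the new, non-underscore implementation."""
    return (old, new)


def make_shim(new_func: Callable[..., Any], old_name: str, removed_in: str) -> Callable[..., Any]:
    """Wrap new_func so calls through the old name emit DeprecationWarning."""

    def shim(*args: Any, **kwargs: Any) -> Any:
        warnings.warn(
            f"{old_name} is deprecated and will be removed in {removed_in}; "
            f"use {new_func.__name__} instead",
            DeprecationWarning,
            stacklevel=2,  # point the warning at the caller, not the shim
        )
        return new_func(*args, **kwargs)

    return shim


legacy_plan_rename = make_shim(plan_rename_new, "_rename.plan_rename", "0.3.0")

with warnings.catch_warnings(record=True) as caught:
    warnings.simplefilter("always")
    outcome = legacy_plan_rename("oldname", "newname")
```

Running a test suite with `-W error::DeprecationWarning` is one way to find remaining uses of the old names before they disappear.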

Status

v0.1.19. Part of the attune ecosystem (attune-ai, attune-help, attune-author).

License

Apache 2.0. See LICENSE.

Release history

Version	Released
0.2.0	May 25, 2026
0.1.23	May 21, 2026
0.1.22	May 20, 2026
0.1.21	May 20, 2026
0.1.19 (this version)	May 16, 2026
0.1.18	May 16, 2026
0.1.17	May 16, 2026
0.1.16	May 15, 2026
0.1.15	May 15, 2026
0.1.14	May 8, 2026
0.1.13	May 8, 2026
0.1.12	May 5, 2026
0.1.11	May 1, 2026
0.1.10	May 1, 2026
0.1.9	Apr 27, 2026
0.1.8	Apr 24, 2026
0.1.6	Apr 23, 2026
0.1.5	Apr 19, 2026
0.1.4	Apr 19, 2026
0.1.3	Apr 19, 2026
0.1.2	Apr 19, 2026
0.1.1	Apr 18, 2026
0.1.0	Apr 18, 2026

Download files

Source Distribution

attune_rag-0.1.19.tar.gz (83.1 kB)

Uploaded May 16, 2026 Source

Built Distribution

attune_rag-0.1.19-py3-none-any.whl (96.0 kB)

Uploaded May 16, 2026 Python 3

File details

Details for the file attune_rag-0.1.19.tar.gz.

File metadata

Download URL: attune_rag-0.1.19.tar.gz
Upload date: May 16, 2026
Size: 83.1 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for attune_rag-0.1.19.tar.gz
Algorithm	Hash digest
SHA256	`4bccccb783c046ecf39ce78683dd13fc62151377489714947cb92fb5c5946a1f`
MD5	`3cebb1951f38f822514d6d02ea27c284`
BLAKE2b-256	`3590ce2704c58591df5bcdc80a9161350a40dab600fb647838d4fb8bbe33e56b`
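
A downloaded archive can be checked against the SHA256 digest listed above with the standard library. In this sketch, `sha256_of` and `matches_published_hash` are hypothetical helper names; the expected digest is copied from the table, and the temporary file exists only to exercise the functions.

```python
import hashlib
import tempfile
from pathlib import Path

# SHA256 digest published above for attune_rag-0.1.19.tar.gz.
EXPECTED_SHA256 = "4bccccb783c046ecf39ce78683dd13fc62151377489714947cb92fb5c5946a1f"


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the hex SHA256 digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path: Path, expected: str = EXPECTED_SHA256) -> bool:
    """Compare a file's digest with the published one."""
    return sha256_of(path) == expected.lower()


# Self-check on a throwaway file; it is not the real archive, so it must not match.
with tempfile.TemporaryDirectory() as workdir:
    sample = Path(workdir) / "sample.bin"
    sample.write_bytes(b"abc")
    sample_digest = sha256_of(sample)
    sample_matches = matches_published_hash(sample)
```

For the real check, call `matches_published_hash(Path("attune_rag-0.1.19.tar.gz"))` on the downloaded file.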

Provenance

The following attestation bundles were made for attune_rag-0.1.19.tar.gz:

Publisher: publish.yml on Smart-AI-Memory/attune-rag

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: attune_rag-0.1.19.tar.gz
- Subject digest: 4bccccb783c046ecf39ce78683dd13fc62151377489714947cb92fb5c5946a1f
- Sigstore transparency entry: 1553241373
- Sigstore integration time: May 16, 2026
Source repository:
- Permalink: Smart-AI-Memory/attune-rag@8edd93e7f81062917d8ab59ede77c3dabe2e4186
- Branch / Tag: refs/tags/v0.1.19
- Owner: https://github.com/Smart-AI-Memory
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@8edd93e7f81062917d8ab59ede77c3dabe2e4186
- Trigger Event: release

File details

Details for the file attune_rag-0.1.19-py3-none-any.whl.

File metadata

Download URL: attune_rag-0.1.19-py3-none-any.whl
Upload date: May 16, 2026
Size: 96.0 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for attune_rag-0.1.19-py3-none-any.whl
Algorithm	Hash digest
SHA256	`ef41e6056eb38c48d6397c1319326a92914228a3596578e0578dc51c1bb0c6da`
MD5	`803eec540eb17377272576f3ae83e239`
BLAKE2b-256	`48ff85a7e855aa912f655597da247819bd3ccc3f9586e5b6dee6017bf9c7d040`

Provenance

The following attestation bundles were made for attune_rag-0.1.19-py3-none-any.whl:

Publisher: publish.yml on Smart-AI-Memory/attune-rag

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: attune_rag-0.1.19-py3-none-any.whl
- Subject digest: ef41e6056eb38c48d6397c1319326a92914228a3596578e0578dc51c1bb0c6da
- Sigstore transparency entry: 1553241385
- Sigstore integration time: May 16, 2026
Source repository:
- Permalink: Smart-AI-Memory/attune-rag@8edd93e7f81062917d8ab59ede77c3dabe2e4186
- Branch / Tag: refs/tags/v0.1.19
- Owner: https://github.com/Smart-AI-Memory
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@8edd93e7f81062917d8ab59ede77c3dabe2e4186
- Trigger Event: release
