
pragma

Atomic-fact reasoning over a knowledge graph. A RAG alternative that needs no vector database.


Quickstart · Why · Benchmarks · How it works · Colab demo


Why pragma

Vector RAG has predictable failure modes: keyword mismatch, irrelevant chunks in the prompt, no multi-hop reasoning, no citations, no temporal awareness.

pragma stores documents as a graph of atomic (subject, predicate, object) facts in a single SQLite file. Queries traverse the graph, surface only the relevant facts, and return cited reasoning paths.
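
A single sentence typically yields one or more such triples. Illustrative shape only (the stored fact also carries a fact_id and a confidence score, per the API reference below):

# "Steve Jobs co-founded Apple in 1976." decomposes into, roughly:
fact = ("Steve Jobs", "co-founded", "Apple")   # subject, predicate, object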

|                     | Vector RAG | GraphRAG | LightRAG | pragma |
|---------------------|------------|----------|----------|--------|
| Vector DB required  | Yes | Yes | No | No |
| Multi-hop reasoning | Manual | Yes | Yes | Yes |
| Reasoning trace     | No | Partial | Yes | Full + fact IDs |
| Temporal queries    | No | No | No | Yes (as_of) |
| Storage             | Vector DB | Vector DB | LMDB | SQLite |
| Infra to operate    | Server | Server | Server | None |
| Token efficiency    | Bounded by chunk count × chunk size | Similar | Low | Scales with relevant facts, not corpus size |
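
Temporal queries go through the as_of parameter of kb.query (see the API reference below). The ISO date string here is an assumption about the accepted format:

# Ask what was true at a point in time
result = kb.query("Who was the CEO of Apple?", as_of="2010-01-01")
print(result.answer)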

Token budget — measured, not claimed

All numbers are true LLM tokens as reported by the model's tokenizer (prompt_eval_count / eval_count), not internal approximations. Captured by benchmarks_run/run.py against a real Ollama model on a 4-paragraph corpus.
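
You can sanity-check these counters yourself: Ollama's non-streaming /api/generate response includes prompt_eval_count and eval_count. A minimal stdlib sketch (the model name is an assumption; use any model you have pulled):

import json, urllib.request

payload = {"model": "llama3.2", "prompt": "Who founded Apple?", "stream": False}
req = urllib.request.Request("http://localhost:11434/api/generate",
                             data=json.dumps(payload).encode(),
                             headers={"Content-Type": "application/json"})
resp = json.load(urllib.request.urlopen(req))
# true token counts as reported by the model's tokenizer
print(resp.get("prompt_eval_count"), resp.get("eval_count"))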

| Metric (per-query average over 2 representative queries) | Vector-RAG baseline | pragma |
|-----------------------------------------------------------|---------------------|--------|
| tokens_used (relevant-facts prompt size) | n/a | 192 |
| Prompt tokens (full LLM input) | 346 | 234 (−32 %) |
| Completion tokens | 61 | model-dependent |
| LLM calls / query | 1 | 1 (decompose auto-skipped) |
| Cited reasoning steps | None | Every step cites a fact_id |

The tokens_used figure is what the original ~280-token claim referred to — the size of the curated fact prompt pragma builds, which is bounded by graph structure, NOT by corpus size. We measured an average of 192 tokens — 31 % below the ~280 headline across two representative queries:

  • "Who founded Apple?" → answered correctly with tokens_used = 265.
  • "Where was Tim Cook born?" → answered correctly with tokens_used = 120 (the direct-answer fast path even skipped the LLM call entirely on a previous run).

Both answers cite the exact fact_id they used. Vector-RAG also answered both correctly, but its prompts averaged 346 tokens to pragma's 234 (32 % fewer for pragma).

Honest caveats so you don't get burned reproducing this:

  1. Completion tokens are model-dependent. The benchmark above was run against minimax-m2.7:cloud, a reasoning model that emits ~250 completion tokens regardless of prompt size. Switch to Groq Llama-3.3-70B and completions drop to ~80, putting pragma's measured total around 380 tokens vs vector-RAG's 553 — a real ~30 % win.
  2. On tiny corpora (≤ ~2 k tokens) vector-RAG wins on absolute token count because it can stuff the whole corpus in one prompt. pragma pays off when the corpus grows past roughly 5 k tokens: at that point vector-RAG must include 3 k+ retrieved tokens regardless of relevance, while pragma's prompt stays bounded at the size of the relevant facts (see the sketch after this list).
  3. Fact-extraction quality affects answer quality. If the extractor truncated an object value at ingestion time (rare, but happens on long predicates), pragma will honestly answer "unknown" rather than hallucinate. Vector-RAG, working from raw text, may guess correctly. We consider the honest behaviour the right default; you can re-ingest with a stronger model to fix the underlying facts.
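
A back-of-the-envelope version of caveat 2. The 3 k retrieved-token figure comes from the caveat and the 192 from the measurements above; the 50-token question is an assumption:

question = 50
vector_rag_prompt = 3_000 + question   # retrieved chunks grow with the corpus
pragma_prompt = 192 + question         # bounded by the relevant facts, not the corpus
print(vector_rag_prompt, pragma_prompt)  # 3050 vs 242 once the corpus outgrows one prompt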

Reproduce these numbers yourself:

ollama pull minimax-m2.7:cloud      # or any chat model
python benchmarks_run/run.py        # writes results.json + prints summary

Quickstart

pip install pragma-ai
from pragma import KnowledgeBase
from pragma.llm import get_provider

llm = get_provider("groq")            # or "openai", "anthropic", "inception", "ollama"
kb = KnowledgeBase(llm=llm, kb_dir="./my_kb")

# Ingest anything: pdf, csv, json/jsonl, txt, md, docx, html, URL, dict, or directory
kb.ingest("./docs/")

# Query with full reasoning trace
result = kb.query("Which company did Steve Jobs co-found in 1976?")
print(result.answer)
print(f"confidence={result.confidence:.2f}  tokens={result.tokens_used}")
for step in result.reasoning_path:
    print(f"  [{step.fact_id[:8]}] {step.explanation}")

kb.close()

Streaming:

async for token in kb.stream("Who is the CEO of Apple?"):
    print(token, end="", flush=True)
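
kb.stream returns an async iterator (see the API reference below), so outside an already-running event loop you drive it with asyncio. A minimal sketch, reusing the kb from the Quickstart:

import asyncio

async def main() -> None:
    # print tokens as the answer is synthesized
    async for token in kb.stream("Who is the CEO of Apple?"):
        print(token, end="", flush=True)

asyncio.run(main())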

CLI:

pragma ingest ./docs/
pragma query "What does pragma do?"
pragma stats
pragma facts --entity "Apple"
pragma entities

Try it without installing: Open the Colab quickstart →

How it works

┌──────────────  INGESTION  ──────────────┐    ┌──────────────  QUERY  ──────────────┐
│  Documents → Segment                    │    │  Question → Decompose                │
│            → Extract atomic facts (LLM) │    │           → BM25 seed entities       │
│            → Resolve entities (fuzzy)   │    │           → Multi-hop graph traversal│
│            → Build NetworkX graph       │    │           → Assemble facts (budget)  │
│            → BM25 index + SQLite        │    │           → Synthesize + citations   │
└─────────────────────────────────────────┘    └──────────────────────────────────────┘

Storage: A single SQLite file (pragma_kb/pragma.db) holds entities, facts, edges, and a query cache. The graph lives in NetworkX; BM25 powers seed retrieval. Nothing else to run.
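
Because the store is one SQLite file, the stdlib is enough to poke at it. This sketch discovers table names rather than assuming them:

import sqlite3

con = sqlite3.connect("pragma_kb/pragma.db")
for (name,) in con.execute("SELECT name FROM sqlite_master WHERE type='table'"):
    print(name)  # expect tables for entities, facts, edges, and the query cache
con.close()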

Reasoning: Every answer comes with reasoning_path: List[ReasoningStep] — each step cites the exact fact_id it depends on, so users (and you) can audit why the model said what it said.

Benchmarks

The honest end-to-end harness lives in benchmarks_run/run.py and uses real LLM token counts. See Token budget above for the latest measured numbers.

# Token / answer-quality harness against a live Ollama model
python benchmarks_run/run.py

# Internal unit-style benchmarks (mocked LLM, no network)
pytest tests/benchmarks -q

We have not validated the often-quoted "vector RAG accuracy on HotpotQA" numbers ourselves. If you produce a clean comparison on a real public benchmark with this codebase, a PR adding the harness + results to tests/benchmarks/ is very welcome.

Providers

| Provider | Env var | Free tier | Notes |
|----------|---------|-----------|-------|
| Groq | GROQ_API_KEY | Yes | Fast, recommended for getting started |
| OpenAI | OPENAI_API_KEY | $5 credit | gpt-4o-mini default |
| Anthropic | ANTHROPIC_API_KEY | $5 credit | Claude Haiku default |
| Inception (Mercury) | INCEPTION_API_KEY | Yes | Diffusion LLM |
| Ollama | (none) | Local | Offline, requires ollama serve |

All providers implement complete, acomplete, and stream_complete. Add your own in 30 lines — see CONTRIBUTING.md.
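
A sketch of the shape those three methods take. The base class to inherit from and any constructor arguments are assumptions here; CONTRIBUTING.md documents the real interface:

from typing import AsyncIterator

class MyProvider:  # inherit from the provider base class documented in CONTRIBUTING.md
    def complete(self, prompt: str) -> str:
        ...  # call your model synchronously, return its text

    async def acomplete(self, prompt: str) -> str:
        ...  # async variant of complete

    async def stream_complete(self, prompt: str) -> AsyncIterator[str]:
        # yield tokens as they arrive from your model
        yield ...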

Supported document formats

.pdf · .csv · .json · .jsonl · .md · .txt · .docx · .html · URLs · Python dict · List[Path | str | dict] · directories (recursive)

Add a new loader in ~50 lines — see CONTRIBUTING.md.

API reference (essentials)

kb = KnowledgeBase(llm, kb_dir="./kb")           # or pass config=PragmaConfig(...)

kb.ingest(source, show_progress=False)           # IngestResult(documents, facts, entities, skipped)
kb.query(q, hop_depth=2, min_confidence=0.5,
         as_of=None, top_k=5)                    # PragmaResult
kb.stream(q)                                     # AsyncIterator[str]
kb.stats()                                       # KBStats(documents, facts, entities, relationships)
kb.close()                                       # or use as a context manager

PragmaResult fields: answer, reasoning_path: List[ReasoningStep], source_facts: List[AtomicFact], confidence, tokens_used, latency_ms, subgraph_size.
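
The query knobs compose with these fields; for example, widening traversal and then checking cost metadata (parameter names from the signature above):

result = kb.query("Which company did Steve Jobs co-found in 1976?",
                  hop_depth=3, min_confidence=0.7, top_k=10)
print(result.latency_ms, result.subgraph_size)   # cost metadata alongside the answer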

Configuration

from pragma import PragmaConfig
config = PragmaConfig(
    kb_dir="./pragma_kb",
    default_hop_depth=2,
    max_subgraph_nodes=5,
    fact_confidence_threshold=0.6,
    llm_provider="groq",
)
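
The config then goes to KnowledgeBase via the config= keyword from the API reference:

kb = KnowledgeBase(llm=llm, config=config)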
Environment-variable overrides:

| Env var | Default |
|---------|---------|
| PRAGMA_KB_DIR | ./pragma_kb |
| PRAGMA_DEFAULT_HOP_DEPTH | 2 |
| PRAGMA_MAX_SUBGRAPH_NODES | 5 |
| PRAGMA_FACT_CONFIDENCE_THRESHOLD | 0.6 |
| PRAGMA_PROMPT_<NAME> | path to a file overriding a built-in prompt |
| GROQ_API_KEY / OPENAI_API_KEY / ANTHROPIC_API_KEY / INCEPTION_API_KEY | provider credentials (no default) |

Evaluation harness

from pragma.eval import Evaluator, TestCase
report = Evaluator(kb, [
    TestCase(query="Who founded Apple?",
             expected_answer_contains=["Steve Jobs"],
             expected_entities=["Apple", "Steve Jobs"]),
]).run()
print(report.summary())

Status

  • 289 tests passing locally (Windows / Python 3.12)
  • ruff clean, type-annotated (py.typed shipped)
  • Stable public API at v1.0 — see CHANGELOG.md

Run the suite yourself:

pip install -e ".[dev]"
pytest tests -q
ruff check pragma tests

Contributing

Bug reports, loaders, providers, and benchmark contributions welcome. See CONTRIBUTING.md.

git clone https://github.com/kbpr21/pragma-ai
cd pragma-ai
pip install -e ".[dev]"
pytest tests -q
ruff check pragma tests

License

MIT — see LICENSE.


Built because vector search is the wrong primitive for reasoning. Star ⭐ the repo if pragma earned it.



Download files

Download the file for your platform.

Source Distribution

pragma_ai-1.0.1.post1.tar.gz (86.7 kB)


Built Distribution


pragma_ai-1.0.1.post1-py3-none-any.whl (67.8 kB)


File details

Details for the file pragma_ai-1.0.1.post1.tar.gz.

File metadata

  • Download URL: pragma_ai-1.0.1.post1.tar.gz
  • Upload date:
  • Size: 86.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for pragma_ai-1.0.1.post1.tar.gz
| Algorithm | Hash digest |
|-----------|-------------|
| SHA256 | 131568454517286d31e86f2de9edeb942764e6ef796a451bf50362f4894b4006 |
| MD5 | 461292ce7bfa52586b662c97d8837af9 |
| BLAKE2b-256 | fc974c06d273d6aae3958b4a45a2c13338cc782509449f2807efe9ed169b44df |


Provenance

The following attestation bundles were made for pragma_ai-1.0.1.post1.tar.gz:

Publisher: publish.yml on kbpr21/pragma-ai

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file pragma_ai-1.0.1.post1-py3-none-any.whl.

File metadata

  • Download URL: pragma_ai-1.0.1.post1-py3-none-any.whl
  • Upload date:
  • Size: 67.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for pragma_ai-1.0.1.post1-py3-none-any.whl
| Algorithm | Hash digest |
|-----------|-------------|
| SHA256 | ae93ef9e44d284f20ef5d608d58d7bfe451dd37c9f1456f2b220eae0ca7944bc |
| MD5 | f086760ae8f5f0b3310a2a6a21128071 |
| BLAKE2b-256 | 6ef3499ff288c4a40631cfd6c4211d70a5cca972cefe75fc61a74a4273c62f37 |


Provenance

The following attestation bundles were made for pragma_ai-1.0.1.post1-py3-none-any.whl:

Publisher: publish.yml on kbpr21/pragma-ai

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.
