
Project description

pragma

Atomic-fact reasoning over a knowledge graph. A RAG alternative that needs no vector database.


Quickstart · Why · Benchmarks · How it works · Colab demo


Why pragma

Vector RAG fails silently in predictable ways: keyword mismatch, irrelevant context, no multi-hop reasoning, no citations, no temporal awareness, ~3,000 tokens per query.

pragma stores documents as a graph of atomic (subject, predicate, object) facts. Queries traverse the graph, return cited reasoning paths, and use ~6× fewer tokens than vector RAG.
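The core mechanism is simple enough to sketch in plain Python. The snippet below is a toy illustration of atomic-fact storage and multi-hop traversal, not pragma's internals:

```python
from collections import defaultdict

# A tiny in-memory store of atomic (subject, predicate, object) facts.
facts = [
    ("Steve Jobs", "co-founded", "Apple"),
    ("Apple", "headquartered_in", "Cupertino"),
    ("Cupertino", "located_in", "California"),
]

# Index facts by subject for cheap traversal.
by_subject = defaultdict(list)
for s, p, o in facts:
    by_subject[s].append((p, o))

def traverse(start, hops):
    """Collect every fact path reachable from `start` within `hops` steps."""
    paths = []
    def walk(node, path, depth):
        if depth == 0:
            return
        for p, o in by_subject[node]:
            paths.append(path + [(node, p, o)])
            walk(o, path + [(node, p, o)], depth - 1)
    walk(start, [], hops)
    return paths

# 2-hop question: "Where is the company Steve Jobs co-founded headquartered?"
for path in traverse("Steve Jobs", 2):
    print(" -> ".join(f"{s} {p} {o}" for s, p, o in path))
```

Only the facts along the traversed path reach the LLM, which is where the token savings over shipping whole retrieved chunks come from.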

|                     | Vector RAG     | GraphRAG       | LightRAG | pragma          |
|---------------------|----------------|----------------|----------|-----------------|
| Vector DB required  | Yes            | Yes            | No       | No              |
| Multi-hop reasoning | Manual         | Yes            | Yes      | Yes             |
| Tokens per query    | ~3,000         | ~5,000         | ~400     | ~280            |
| Reasoning trace     | No             | Partial        | Yes      | Full + fact IDs |
| Temporal queries    | No             | No             | No       | Yes (`as_of`)   |
| Storage             | Vector DB      | Vector DB      | LMDB     | SQLite          |
| Infra to operate    | Server         | Server         | Server   | None            |

Quickstart

pip install pragma-ai
from pragma import KnowledgeBase
from pragma.llm import get_provider

llm = get_provider("groq")            # or "openai", "anthropic", "inception", "ollama"
kb = KnowledgeBase(llm=llm, kb_dir="./my_kb")

# Ingest anything: pdf, csv, json/jsonl, txt, md, docx, html, URL, dict, or directory
kb.ingest("./docs/")

# Query with full reasoning trace
result = kb.query("Which company did Steve Jobs co-found in 1976?")
print(result.answer)
print(f"confidence={result.confidence:.2f}  tokens={result.tokens_used}")
for step in result.reasoning_path:
    print(f"  [{step.fact_id[:8]}] {step.explanation}")

kb.close()

Streaming:

async for token in kb.stream("Who is the CEO of Apple?"):
    print(token, end="", flush=True)
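Because `stream` returns an async iterator, it has to be consumed inside a coroutine. A minimal driver, shown here with a stand-in generator in place of a live `KnowledgeBase`, looks like:

```python
import asyncio
from typing import AsyncIterator

async def fake_stream(q: str) -> AsyncIterator[str]:
    # Stand-in for kb.stream(q): yields answer tokens one at a time.
    for token in ["Tim", " Cook", "."]:
        yield token

async def main() -> str:
    chunks = []
    async for token in fake_stream("Who is the CEO of Apple?"):
        print(token, end="", flush=True)
        chunks.append(token)
    return "".join(chunks)

answer = asyncio.run(main())
```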

CLI:

pragma ingest ./docs/
pragma query "What does pragma do?"
pragma stats
pragma facts --entity "Apple"
pragma entities

Try it without installing: Open the Colab quickstart →

How it works

┌──────────────  INGESTION  ──────────────┐    ┌──────────────  QUERY  ──────────────┐
│  Documents → Segment                    │    │  Question → Decompose                │
│            → Extract atomic facts (LLM) │    │           → BM25 seed entities       │
│            → Resolve entities (fuzzy)   │    │           → Multi-hop graph traversal│
│            → Build NetworkX graph       │    │           → Assemble facts (budget)  │
│            → BM25 index + SQLite        │    │           → Synthesize + citations   │
└─────────────────────────────────────────┘    └──────────────────────────────────────┘

Storage: A single SQLite file (pragma_kb/pragma.db) holds entities, facts, edges, and a query cache. The graph lives in NetworkX; BM25 powers seed retrieval. Nothing else to run.
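A single-file layout like this is easy to picture. The schema below is a hypothetical sketch of what such a database might hold; table and column names are illustrative, not pragma's actual DDL:

```python
import sqlite3

conn = sqlite3.connect(":memory:")  # pragma uses a file, e.g. pragma_kb/pragma.db
conn.executescript("""
CREATE TABLE entities (id INTEGER PRIMARY KEY, name TEXT UNIQUE);
CREATE TABLE facts (
    id TEXT PRIMARY KEY,          -- the fact_id cited in reasoning paths
    subject_id INTEGER REFERENCES entities(id),
    predicate TEXT,
    object_id INTEGER REFERENCES entities(id),
    confidence REAL,
    valid_from TEXT               -- what an as_of temporal filter would use
);
CREATE TABLE edges (src INTEGER, dst INTEGER, fact_id TEXT);
CREATE TABLE query_cache (question TEXT PRIMARY KEY, answer TEXT);
""")

conn.execute("INSERT INTO entities (name) VALUES ('Steve Jobs'), ('Apple')")
conn.execute(
    "INSERT INTO facts VALUES ('a1b2c3d4', 1, 'co-founded', 2, 0.95, '1976-04-01')"
)
row = conn.execute("""
    SELECT s.name, f.predicate, o.name FROM facts f
    JOIN entities s ON s.id = f.subject_id
    JOIN entities o ON o.id = f.object_id
""").fetchone()
print(row)
```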

Reasoning: Every answer comes with reasoning_path: List[ReasoningStep] — each step cites the exact fact_id it depends on, so users (and you) can audit why the model said what it said.
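Because each step carries a `fact_id`, auditing reduces to a lookup. A self-contained sketch of the idea (the `ReasoningStep` shape mirrors this README; the checker itself is hypothetical, not part of pragma):

```python
from dataclasses import dataclass

@dataclass
class ReasoningStep:
    fact_id: str
    explanation: str

# Stored facts keyed by id, standing in for what the SQLite file holds.
fact_table = {
    "a1b2c3d4": ("Steve Jobs", "co-founded", "Apple"),
}

def audit(path: list) -> list:
    """Return the fact_ids in a reasoning path that cite no stored fact."""
    return [step.fact_id for step in path if step.fact_id not in fact_table]

path = [
    ReasoningStep("a1b2c3d4", "Steve Jobs co-founded Apple"),
    ReasoningStep("deadbeef", "unsupported claim"),
]
print(audit(path))  # any non-empty result flags an uncited step
```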

Benchmarks

pytest tests/benchmarks -q

Representative runs (Groq Llama-3.3-70B, 100 multi-hop questions over a 50-document corpus):

| Metric               | Vector RAG      | GraphRAG        | pragma     |
|----------------------|-----------------|-----------------|------------|
| Tokens / query (avg) | 3,142           | 4,890           | 278        |
| 2-hop accuracy       | 41 %            | 76 %            | 82 %       |
| Cite-able reasoning  | No              | Partial         | Yes        |
| Cold-start infra     | Pinecone/Qdrant | Neo4j+Pinecone  | 0 services |

Numbers vary with corpus and model; see tests/benchmarks/ for reproducible harnesses.

Providers

| Provider            | Env var             | Free tier | Notes                                 |
|---------------------|---------------------|-----------|---------------------------------------|
| Groq                | `GROQ_API_KEY`      | Yes       | Fast, recommended for getting started |
| OpenAI              | `OPENAI_API_KEY`    | $5 credit | gpt-4o-mini default                   |
| Anthropic           | `ANTHROPIC_API_KEY` | $5 credit | Claude Haiku default                  |
| Inception (Mercury) | `INCEPTION_API_KEY` | Yes       | Diffusion LLM                         |
| Ollama              | (none)              | Local     | Offline, requires `ollama serve`      |

All providers implement complete, acomplete, and stream_complete. Add your own in 30 lines — see CONTRIBUTING.md.
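The three-method surface is small enough to show in full. Below is a toy provider with that shape; the method names come from this README, but the actual base class and signatures may differ (see CONTRIBUTING.md):

```python
import asyncio
from typing import AsyncIterator

class EchoProvider:
    """Toy provider: returns the prompt back. A real provider calls an LLM API."""

    def complete(self, prompt: str) -> str:
        return f"echo: {prompt}"

    async def acomplete(self, prompt: str) -> str:
        # Real providers would await an HTTP call here.
        return self.complete(prompt)

    async def stream_complete(self, prompt: str) -> AsyncIterator[str]:
        # Yield the response piecewise to simulate token streaming.
        for word in self.complete(prompt).split():
            yield word

provider = EchoProvider()
print(provider.complete("hello"))

async def collect() -> list:
    return [tok async for tok in provider.stream_complete("hello world")]

tokens = asyncio.run(collect())
```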

Supported document formats

.pdf · .csv · .json · .jsonl · .md · .txt · .docx · .html · URLs · Python dict · List[Path | str | dict] · directories (recursive)

Add a new loader in ~50 lines — see CONTRIBUTING.md.

API reference (essentials)

kb = KnowledgeBase(llm, kb_dir="./kb")           # or pass config=PragmaConfig(...)

kb.ingest(source, show_progress=False)           # IngestResult(documents, facts, entities, skipped)
kb.query(q, hop_depth=2, min_confidence=0.5,
         as_of=None, top_k=5)                    # PragmaResult
kb.stream(q)                                     # AsyncIterator[str]
kb.stats()                                       # KBStats(documents, facts, entities, relationships)
kb.close()                                       # or use as a context manager

PragmaResult fields: answer, reasoning_path: List[ReasoningStep], source_facts: List[AtomicFact], confidence, tokens_used, latency_ms, subgraph_size.

Configuration

from pragma import PragmaConfig
config = PragmaConfig(
    kb_dir="./pragma_kb",
    default_hop_depth=2,
    max_subgraph_nodes=5,
    fact_confidence_threshold=0.6,
    llm_provider="groq",
)
| Env var                                                                   | Default                          |
|---------------------------------------------------------------------------|----------------------------------|
| `PRAGMA_KB_DIR`                                                           | `./pragma_kb`                    |
| `PRAGMA_DEFAULT_HOP_DEPTH`                                                | 2                                |
| `PRAGMA_MAX_SUBGRAPH_NODES`                                               | 5                                |
| `PRAGMA_FACT_CONFIDENCE_THRESHOLD`                                        | 0.6                              |
| `PRAGMA_PROMPT_<NAME>`                                                    | path to override built-in prompt |
| `GROQ_API_KEY` / `OPENAI_API_KEY` / `ANTHROPIC_API_KEY` / `INCEPTION_API_KEY` | (unset)                      |
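Overrides follow the usual pattern: read the `PRAGMA_*` variable if set, otherwise fall back to the default. A sketch of that pattern (this is the general idiom, not pragma's actual loader):

```python
import os
from dataclasses import dataclass, field

def env(name: str, default, cast=str):
    """Return the env var cast to the right type, or the default if unset."""
    raw = os.environ.get(name)
    return default if raw is None else cast(raw)

@dataclass
class Config:
    kb_dir: str = field(
        default_factory=lambda: env("PRAGMA_KB_DIR", "./pragma_kb"))
    default_hop_depth: int = field(
        default_factory=lambda: env("PRAGMA_DEFAULT_HOP_DEPTH", 2, int))
    max_subgraph_nodes: int = field(
        default_factory=lambda: env("PRAGMA_MAX_SUBGRAPH_NODES", 5, int))
    fact_confidence_threshold: float = field(
        default_factory=lambda: env("PRAGMA_FACT_CONFIDENCE_THRESHOLD", 0.6, float))

os.environ["PRAGMA_DEFAULT_HOP_DEPTH"] = "3"   # simulate an override
cfg = Config()
print(cfg.kb_dir, cfg.default_hop_depth)
```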

Evaluation harness

from pragma.eval import Evaluator, TestCase
report = Evaluator(kb, [
    TestCase(query="Who founded Apple?",
             expected_answer_contains=["Steve Jobs"],
             expected_entities=["Apple", "Steve Jobs"]),
]).run()
print(report.summary())
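Under the hood, grading like this reduces to substring checks against the answer. A self-contained sketch of the idea (not `pragma.eval` itself; the `TestCase` shape mirrors the snippet above):

```python
from dataclasses import dataclass, field

@dataclass
class TestCase:
    query: str
    expected_answer_contains: list = field(default_factory=list)

def grade(answer: str, case: TestCase) -> bool:
    """Pass iff every expected substring appears in the answer."""
    return all(needle in answer for needle in case.expected_answer_contains)

case = TestCase("Who founded Apple?", expected_answer_contains=["Steve Jobs"])
print(grade("Apple was founded by Steve Jobs and Steve Wozniak.", case))
```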

Status

  • 289 tests passing across 3 OS × 4 Python versions
  • ruff clean, type-annotated (py.typed)
  • Stable public API at v1.0 — see CHANGELOG.md

Contributing

Bug reports, loaders, providers, and benchmark contributions welcome. See CONTRIBUTING.md.

git clone https://github.com/kbpr21/pragma
cd pragma
pip install -e ".[dev]"
pytest tests -q
ruff check pragma tests

License

MIT — see LICENSE.


Built because vector search is the wrong primitive for reasoning. Star ⭐ the repo if pragma earned it.

Project details


Download files


Source Distribution

pragma_ai-1.0.1.tar.gz (73.5 kB)

Uploaded Source

Built Distribution


pragma_ai-1.0.1-py3-none-any.whl (61.9 kB)

Uploaded Python 3

File details

Details for the file pragma_ai-1.0.1.tar.gz.

File metadata

  • Download URL: pragma_ai-1.0.1.tar.gz
  • Upload date:
  • Size: 73.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.10

File hashes

Hashes for pragma_ai-1.0.1.tar.gz
Algorithm Hash digest
SHA256 fae77535b9e4ddba53d890d9496040d561012c13e82c5455fd538d9e0724fad2
MD5 0c511dd3d6f43de6056a20a7b934dab3
BLAKE2b-256 f2c0ffeb2adee6579290d466fba1b9827a256d4d04b6d9851d62616d4416ca2c


File details

Details for the file pragma_ai-1.0.1-py3-none-any.whl.

File metadata

  • Download URL: pragma_ai-1.0.1-py3-none-any.whl
  • Upload date:
  • Size: 61.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.10

File hashes

Hashes for pragma_ai-1.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 35bc60989014dfb6a7da5c9ea501d0261bcfc396c53153b44112805203910092
MD5 9aa3bd35d381fd2da2e61f597c7530c4
BLAKE2b-256 88f096c4aedeee10a0a136fb16cc4f2433a62391885c16e797efebf44ee355c5

