Framework-agnostic RAG pipeline SDK. Plug in any component, swap any stage, configure everything in YAML

These details have not been verified by PyPI

Project links

Project description

NexRAG

███╗   ██╗███████╗██╗  ██╗██████╗  █████╗  ██████╗
████╗  ██║██╔════╝╚██╗██╔╝██╔══██╗██╔══██╗██╔════╝
██╔██╗ ██║█████╗   ╚███╔╝ ██████╔╝███████║██║  ███╗
██║╚██╗██║██╔══╝   ██╔██╗ ██╔══██╗██╔══██║██║   ██║
██║ ╚████║███████╗██╔╝ ██╗██║  ██║██║  ██║╚██████╔╝
╚═╝  ╚═══╝╚══════╝╚═╝  ╚═╝╚═╝  ╚═╝╚═╝  ╚═╝ ╚═════╝

●plug ⇄swap ▶scale

Framework-agnostic RAG pipeline SDK. Plug in any component, swap any stage, configure everything in YAML.

What is NexRAG?

NexRAG is a production-grade RAG (Retrieval-Augmented Generation) pipeline SDK for Python.

NexRAG owns the pipeline shape. You own the components.

Every stage — loading, chunking, embedding, retrieval, generation — is a clean interface. NexRAG ships default implementations for each. You can swap any of them by implementing the interface and declaring it in YAML. No framework lock-in. No magic. No hidden behavior.

Quickstart

pip install "nexrag[openai,chromadb,pdf]"
export OPENAI_API_KEY=sk-...
cp nexrag.example.yaml nexrag.yaml   # edit to taste

from nexrag import NexRAG, RunMetrics

pipeline = NexRAG.from_config("nexrag.yaml")

# Ingest a PDF
result = pipeline.ingest("contracts/agreement.pdf")
print(f"Ingested {result.documents_loaded} doc, {result.chunks_produced} chunks")

# Query (blocking)
result = pipeline.query("What are the termination clauses?")
print(result.answer)
for source in result.sources:
    print(f"  [{source.rank}] score={source.score:.3f}  {source.chunk.metadata.get('source')}")

# Streaming — tokens arrive live; RunMetrics is the final item
metrics = None
for item in pipeline.stream_query("Summarise the key obligations."):
    if isinstance(item, RunMetrics):
        metrics = item
    else:
        print(item, end="", flush=True)
print(f"\n\n{metrics.total_latency_ms:.0f}ms — {metrics.chunks_retrieved} chunks retrieved")

# nexrag.yaml (minimal)
ingestion:
  loader:
    type: pdf
  embedder:
    provider: openai
    model: text-embedding-3-small
    api_key: ${OPENAI_API_KEY}
  vector_db:
    provider: chroma
    default_collection: documents
    collections:
      documents:
        mode: persistent
        path: ./.nexrag/chroma

query:
  embedder: inherit
  llm:
    provider: openai
    model: gpt-4o
    api_key: ${OPENAI_API_KEY}

See docs/ for the full documentation site.

Installation

# Core only — pydantic + pyyaml. No provider SDKs; add the extras you use below.
pip install nexrag

# Default getting-started stack (OpenAI embedder/LLM + ChromaDB vector store)
pip install "nexrag[openai,chromadb]"

# Provider extras — install only what you use
pip install "nexrag[openai]"         # OpenAI embedder + LLM
pip install "nexrag[chromadb]"       # ChromaDB vector store
pip install "nexrag[anthropic]"      # Anthropic (Claude) LLM
pip install "nexrag[ollama]"         # Ollama local LLM + embedder
pip install "nexrag[huggingface]"    # HuggingFace embedder

# Document loaders
pip install "nexrag[pdf]"            # PDFLoader (pypdf)
pip install "nexrag[word]"           # Word documents (python-docx)
pip install "nexrag[html]"           # HTML pages (beautifulsoup4)

# Retrieval extras
pip install "nexrag[bm25]"           # BM25Retriever keyword search (rank-bm25)
pip install "nexrag[cohere]"         # CohereReranker
pip install "nexrag[cross-encoder]"  # CrossEncoderReranker (sentence-transformers)

# Observability
pip install "nexrag[observability]"  # OpenTelemetry metrics + traces + logs (Prometheus / OTLP)

# Convenience bundles
pip install "nexrag[all-sparse]"     # all sparse retrievers (bm25)
pip install "nexrag[all-rerankers]"  # all rerankers (cohere + cross-encoder)
pip install "nexrag[all-retrieval]"  # all-sparse + all-rerankers
pip install "nexrag[all-loaders]"    # all document loaders
pip install "nexrag[all-providers]"  # all LLM + embedder providers

# Full bundle — dev/CI; pulls PyTorch via sentence-transformers
pip install "nexrag[all]"

Design Principles

Principle	What it means
Interface-first	Every stage is a contract. Implementation is secondary.
Config-driven	YAML configures the pipeline. Code defines the logic.
Zero lock-in	Core has no dependency on LangChain, LlamaIndex, or any AI SDK.
Explicit over implicit	No hidden defaults. Every behavior is declared or documented.
Extensible by design	New components plug in without touching core.

Architecture

NexRAG has two independent pipelines:

INGESTION  →  Loader → Sanitizer → Chunker → Embedder → VectorDB
QUERY      →  Embedder → Retriever → PromptBuilder → LLM → PipelineResult

See Architecture Documentation for full pipeline diagrams.

Supported Providers

Category	Providers
Embedders	OpenAI, Gemini, Ollama, HuggingFace
Vector DBs	ChromaDB (in-memory, persistent, server), Pinecone
LLMs	OpenAI, Anthropic, Gemini, Ollama
Loaders	PDF, plain text, Word, HTML, Excel
Chunkers	Recursive, token, sentence, sentence-window, markdown, code, semantic, proposition
Retrievers	Dense (cosine similarity), BM25 (keyword), Hybrid (dense + BM25)
Rerankers	Cohere, CrossEncoder (sentence-transformers)
Observability	OpenTelemetry — Prometheus pull, OTLP push (Grafana Alloy, Jaeger, etc.)

Observability

Connect Prometheus + Grafana with two YAML lines:

pip install "nexrag[observability]"

# nexrag.yaml
observability:
  enabled: true
  service_name: my-rag-app
  exporters:
    prometheus:
      enabled: true    # exposes GET http://host:9464/metrics
      port: 9464
    otlp:
      enabled: false   # push to OTel Collector / Grafana Alloy
      endpoint: ${OTEL_EXPORTER_OTLP_ENDPOINT:-http://localhost:4317}
      protocol: grpc
  signals:
    metrics: true
    traces: true
    logs: true

Always-on metrics cover every pipeline stage — latency, token counts, retrieval scores, ingestion stats, and cost. Optional LLM-as-judge evaluators (faithfulness, answer relevance, completeness, coherence, context diversity) run off the response path at a configurable sample rate, each with its own model and embedder.

Contributing

NexRAG is in early development. Contribution guidelines will be published with v1.0.

Changelog

See CHANGELOG.md.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.5.0

Jun 24, 2026

0.4.0

Jun 16, 2026

0.3.3

Jun 13, 2026

0.3.2

Jun 12, 2026

0.3.1

Jun 10, 2026

0.3.0

May 31, 2026

0.2.0

May 28, 2026

0.1.0

May 24, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

nexrag-0.5.0.tar.gz (127.9 kB view details)

Uploaded Jun 24, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

nexrag-0.5.0-py3-none-any.whl (185.3 kB view details)

Uploaded Jun 24, 2026 Python 3

File details

Details for the file nexrag-0.5.0.tar.gz.

File metadata

Download URL: nexrag-0.5.0.tar.gz
Upload date: Jun 24, 2026
Size: 127.9 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.11.24 {"installer":{"name":"uv","version":"0.11.24","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for nexrag-0.5.0.tar.gz
Algorithm	Hash digest
SHA256	`0ede79c2d7fbf2b104d13afbfc7b5bb4f7ac664953f68c2416563faf2ba5ce83`
MD5	`b5596bfbf05b1fdd5a2beeabbb85b44b`
BLAKE2b-256	`8f3a7cb11970725e01b3d0a18a45b8cdf6a790f3a07c146219d7ec558b30732d`

See more details on using hashes here.

File details

Details for the file nexrag-0.5.0-py3-none-any.whl.

File metadata

Download URL: nexrag-0.5.0-py3-none-any.whl
Upload date: Jun 24, 2026
Size: 185.3 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.11.24 {"installer":{"name":"uv","version":"0.11.24","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for nexrag-0.5.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`5f3f9a9f1963723c0fca42c4e5fe57d969e2751ce44b8a3fb4d6d890a7b87247`
MD5	`8e2f8010aef50c6cd4f7e704857baa6b`
BLAKE2b-256	`9cd45c69b6b797ea241d383b9f916cd72713bea656efbd828cc3c9c143ff1216`

See more details on using hashes here.

nexrag 0.5.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

NexRAG

What is NexRAG?

Quickstart

Installation

Design Principles

Architecture

Supported Providers

Observability

Contributing

Changelog

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes