Portable executable RAG artifacts for Python

These details have not been verified by PyPI

Project description

RagBucket

Portable Executable RAG Artifacts for Python

Build Retrieval-Augmented Generation systems as reusable, shareable, and executable .rag artifacts.

PyPI • Portable RAG Runtime • Semantic Retrieval • Groq-Powered Generation

Overview

Traditional machine learning models are portable.

model.pt
model.onnx
model.gguf

They can be:

saved
shared
deployed
reused anywhere

But modern Retrieval-Augmented Generation (RAG) systems are different.

A typical RAG pipeline depends on:

vector databases
embedding models
chunking logic
retrievers
metadata
external storage systems

This makes RAG systems:

fragmented
difficult to distribute
hard to reproduce
tightly coupled to infrastructure

Introducing `.rag`

RagBucket introduces:

`.rag`

A portable executable artifact format for Retrieval-Augmented Generation systems.

The .rag artifact packages:

semantic vector memory
chunked knowledge
retrieval metadata
indexing structures
runtime configuration

into a single reusable file.

Core Philosophy

RagBucket treats RAG systems as:

portable intelligence artifacts

instead of:

fragmented retrieval pipelines

This allows developers to:

package knowledge systems
distribute semantic memory
separate retrieval from generation
reuse RAG artifacts across projects
execute portable knowledge bases anywhere

How It Works

Documents
    ↓
RagBuilder
    ↓
model.rag
    ↓
RagRuntime
    ↓
Question Answering

The builder serializes the retrieval pipeline into a portable artifact.

The runtime loads the artifact and executes semantic retrieval using an external language model.

What a `.rag` Artifact Contains

A .rag file stores everything required for retrieval:

vector embeddings
FAISS semantic index
chunked document memory
retrieval metadata
artifact manifest
runtime configuration

The only external dependency required during inference is:

an LLM provider API key (Groq)

Features

Portable Artifact System

Serialize RAG knowledge systems into reusable .rag files.

Semantic Retrieval Engine

Built-in FAISS vector indexing for fast semantic search.

Lightweight Runtime

Load and execute .rag artifacts anywhere using Python.

Groq Integration

Generate responses using Groq-hosted LLMs.

Self-Contained Knowledge

Store retrieval memory locally inside the artifact.

Simple Developer API

Minimal abstraction for building and querying artifacts.

Extensible Architecture

Designed for future support of:

hybrid retrieval
reranking
metadata filtering
distributed vector stores

Installation

Using uv

uv add ragbucket

Using pip

pip install ragbucket

Creating a `.rag` Artifact

from ragbucket import RagBuilder

builder = RagBuilder()

builder.build(
    documents_path="docs/",
    output_path="finance_assistant.rag"
)

This generates:

finance_assistant.rag

containing:

vector memory
document chunks
retrieval metadata
semantic index

Using a `.rag` Artifact

from ragbucket import RagRuntime

rag = RagRuntime(
    rag_path="finance_assistant.rag",
    api_key="your_groq_api_key"
)

response = rag.ask(
    "What is EBITDA?"
)

print(response)

Runtime Pipeline

User Query
    ↓
Query Embedding
    ↓
FAISS Semantic Search
    ↓
Relevant Context Retrieval
    ↓
Groq LLM Generation
    ↓
Final Response

Technology Stack

Component	Technology
Embeddings	Sentence Transformers
Vector Search	FAISS
Chunking	LangChain
Runtime	Python
Generation	Groq
Packaging	zipfile
Artifact Format	`.rag`

Why RagBucket Matters

Current RAG systems are deeply tied to infrastructure.

RagBucket introduces a cleaner abstraction:

Knowledge
≠
Generation

The .rag artifact becomes:

portable retrieval memory

while the external LLM becomes:

the reasoning engine

This separation enables:

reusable RAG systems
portable semantic memory
simplified deployment
reproducible retrieval pipelines

Current Scope

RagBucket currently supports:

local artifact packaging
FAISS vector indexing
semantic retrieval
Groq-powered generation

The project is intentionally lightweight and focused on:

portable RAG execution

Future Roadmap

Planned Features

Hybrid retrieval
Metadata-aware search
Artifact versioning
Reranking support
Multi-provider LLM runtime
Milvus integration
Distributed retrieval
.rag registries and marketplaces

Vision

RagBucket aims to become:

“The portable runtime layer for Retrieval-Augmented Generation systems.”

A future where RAG systems can be:

built once
shared anywhere
executed everywhere

through standardized portable intelligence artifacts.

License

MIT License

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.2.6

May 20, 2026

0.2.5

May 20, 2026

0.2.4

May 20, 2026

0.2.3

May 20, 2026

0.2.2

May 20, 2026

0.2.1

May 20, 2026

0.2.0

May 20, 2026

0.1.3

May 19, 2026

0.1.2

May 19, 2026

This version

0.1.1

May 19, 2026

0.1.0

May 19, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ragbucket-0.1.1.tar.gz (9.5 kB view details)

Uploaded May 19, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

ragbucket-0.1.1-py3-none-any.whl (11.1 kB view details)

Uploaded May 19, 2026 Python 3

File details

Details for the file ragbucket-0.1.1.tar.gz.

File metadata

Download URL: ragbucket-0.1.1.tar.gz
Upload date: May 19, 2026
Size: 9.5 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.10.20

File hashes

Hashes for ragbucket-0.1.1.tar.gz
Algorithm	Hash digest
SHA256	`80dcb285e844c0d19d0d55f0c2dfef458565b5133b89ca25614c0e6255f56dcb`
MD5	`039ba1b33b616955dd49c41d94d98827`
BLAKE2b-256	`d40d52bb09ed4e8d9d80e55b75fe921c3861fab2e8df51bab4b12785ac999df0`

See more details on using hashes here.

File details

Details for the file ragbucket-0.1.1-py3-none-any.whl.

File metadata

Download URL: ragbucket-0.1.1-py3-none-any.whl
Upload date: May 19, 2026
Size: 11.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.10.20

File hashes

Hashes for ragbucket-0.1.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`286b1651aa7e0b2887e957e195d39f44c9265b9fb1488a47374cb908a9d50189`
MD5	`9099757fe35171cbb933a0fbada4386e`
BLAKE2b-256	`7dbaaa9bfb587a3573fb4ae7d73878dd259d44c631ccb5f718a0824748422343`

See more details on using hashes here.

ragbucket 0.1.1

Navigation

Verified details

Maintainers

Unverified details

Meta

Classifiers

Project description

RagBucket

Portable Executable RAG Artifacts for Python

Overview

Introducing .rag

.rag

Core Philosophy

portable intelligence artifacts

fragmented retrieval pipelines

How It Works

What a .rag Artifact Contains

Features

Portable Artifact System

Semantic Retrieval Engine

Lightweight Runtime

Groq Integration

Self-Contained Knowledge

Simple Developer API

Extensible Architecture

Installation

Using uv

Using pip

Creating a .rag Artifact

Using a .rag Artifact

Runtime Pipeline

Technology Stack

Why RagBucket Matters

Current Scope

portable RAG execution

Future Roadmap

Planned Features

Vision

License

Project details

Verified details

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

Introducing `.rag`

`.rag`

What a `.rag` Artifact Contains

Creating a `.rag` Artifact

Using a `.rag` Artifact