Portable executable RAG artifacts for Python

These details have not been verified by PyPI

Project description

RagBucket

Portable Executable RAG Artifacts for Python

Build once. Load anywhere.

Installation • Quickstart • What is RagBucket? • Providers • Roadmap

The Problem

Traditional ML models are portable by default:

model.pt   model.onnx   model.gguf   model.h5

They can be saved, shared, and deployed anywhere. RAG systems can't.

A typical RAG pipeline is a fragile web of:

vector databases tied to infrastructure
embedding pipelines that must be re-run
chunking configs scattered across codebases
provider-specific integrations with no portability
metadata that lives nowhere and everywhere

RagBucket solves this. It packages your entire RAG pipeline — vectors, chunks, config, and runtime metadata — into a single portable .rag artifact.

Introducing `.rag`

A .rag artifact is a self-contained, executable unit of retrieval intelligence. It packages:

What	How
Semantic embeddings	via Sentence Transformers
Vector index	via FAISS
Chunked knowledge	via LangChain splitters
Retrieval configuration	embedded in manifest
Runtime metadata	versioned artifact manifest

Build it once. Drop it anywhere. Query it with one line of code.

Full Architecture

✦ Installation

# Using uv (recommended)
uv add ragbucket

# Using pip
pip install ragbucket

## ⚡ Quickstart

### Step 1 — Build a Portable `.rag` Artifact

```python
from ragbucket import RagBuilder
from ragbucket import RagConfig

import os

from dotenv import load_dotenv


load_dotenv()


config = RagConfig(

    # ------------------------------------
    # EMBEDDING PROVIDER
    # ------------------------------------
    embedding_provider="cohere",

    embedding_model="embed-english-v3.0",

    embedding_api_key=os.getenv("COHERE_API_KEY"),

    # ------------------------------------
    # CHUNKING
    # ------------------------------------
    chunk_size=512,

    chunk_overlap=50,

    # ------------------------------------
    # RETRIEVAL
    # ------------------------------------
    top_k=3
)


builder = RagBuilder(
    config=config
)


builder.build(
    doc_path="docs/",
    op_path="artifacts/demo.rag"
)
```

This generates:

```text
artifacts/demo.rag
```

The `.rag` artifact contains:

- vector embeddings
- FAISS index
- document chunks
- retrieval configuration
- artifact metadata

Build once. Query anywhere.

---

### Step 2 — Load and Query the Artifact

```python
from ragbucket import RagRuntime

import os

from dotenv import load_dotenv


load_dotenv()


system_prompt = """
You are a helpful assistant.

Keep answers short and crisp.
"""


rag = RagRuntime(

    # ------------------------------------
    # RAG ARTIFACT
    # ------------------------------------
    rag_path="artifacts/demo.rag",

    # ------------------------------------
    # GENERATION PROVIDER
    # ------------------------------------
    provider="groq",

    api_key=os.getenv("GROQ_API_KEY"),

    model="llama-3.1-8b-instant",

    # ------------------------------------
    # EMBEDDING PROVIDER KEY
    # ------------------------------------
    embedding_api_key=os.getenv("COHERE_API_KEY"),

    # ------------------------------------
    # SYSTEM PROMPT
    # ------------------------------------
    system_prompt=system_prompt
)


response = rag.ask(
    "What are Anik's AI/ML skills?"
)

print(response)
```

---

## 🔌 Multi-Provider Runtime

RagBucket separates:

- retrieval embeddings
- response generation

This allows fully modular AI pipelines.

---

### Supported Generation Providers

| Provider    | Example Model             |
| ----------- | ------------------------- |
| `groq`      | `llama-3.1-8b-instant`    |
| `openai`    | `gpt-4o-mini`             |
| `gemini`    | `gemini-1.5-flash`        |
| `anthropic` | `claude-3-haiku-20240307` |

Example:

```python
rag = RagRuntime(

    rag_path="demo.rag",

    provider="openai",

    api_key=os.getenv("OPENAI_API_KEY"),

    model="gpt-4o-mini",

    embedding_api_key=os.getenv("COHERE_API_KEY")
)
```

---

## 🧠 Modular Embedding Providers

RagBucket supports multiple embedding systems.

| Provider | Example Model            |
| -------- | ------------------------ |
| `local`  | `BAAI/bge-small-en-v1.5` |
| `cohere` | `embed-english-v3.0`     |
| `openai` | `text-embedding-3-small` |
| `gemini` | `models/embedding-001`   |
| `voyage` | `voyage-large-2`         |

Example:

```python
config = RagConfig(

    embedding_provider="openai",

    embedding_model="text-embedding-3-small",

    embedding_api_key=os.getenv("OPENAI_API_KEY")
)
```

---

## ⚙️ Dynamic Retrieval Configuration

Customize every stage of the retrieval pipeline:

```python
from ragbucket import RagConfig


config = RagConfig(

    # Embedding system
    embedding_provider="local",

    embedding_model="sentence-transformers/all-MiniLM-L6-v2",

    # Chunking
    chunk_size=1024,

    chunk_overlap=100,

    # Retrieval
    top_k=5
)
```

All missing values are automatically filled using framework defaults.

---

## 🪶 Lightweight by Default

RagBucket no longer forces heavyweight local AI dependencies.

The core package remains lightweight.

Local embedding dependencies are only required when using:

```python
embedding_provider="local"
```

This enables:

- lightweight installations
- faster deployments
- cloud/serverless compatibility
- provider-based embedding pipelines

---

## 📦 What a `.rag` Artifact Contains

```text
demo.rag
│
├── vectors.faiss
├── chunks.json
├── manifest.json
```

The artifact stores:

- semantic vectors
- retrieval memory
- embedding configuration
- retrieval settings
- runtime metadata

making the retrieval system:

- portable
- reusable
- executable
- shareable

🏗️ Project Structure

ragbucket/
├── builder/
│   ├── builder.py      ← RagBuilder orchestrator
│   ├── chunker.py      ← LangChain recursive splitter
│   ├── embedder.py     ← Sentence Transformers encoder
│   ├── indexer.py      ← FAISS index builder
│   └── packager.py     ← .rag artifact packaging
├── runtime/
│   ├── runtime.py      ← RagRuntime orchestrator
│   ├── loader.py       ← .rag artifact loader
│   ├── retriever.py    ← Semantic vector retrieval
│   ├── models.py       ← Cached embedding model singleton
│   └── providers/      ← Groq / OpenAI / Gemini / Anthropic
├── schemas/
│   ├── config.py       ← RagConfig Pydantic model
│   └── manifest.py     ← Artifact manifest schema
└── utils/
    ├── file_utils.py   ← Document loading helpers
    └── hashing.py      ← Artifact integrity utilities

🧰 Technology Stack

Component	Technology
Embeddings	Sentence Transformers
Vector Search	FAISS
Chunking	LangChain Text Splitters
Artifact Packaging	Python `zipfile`
Config Validation	Pydantic
Runtime	Pure Python

✦ Philosophy

RagBucket treats RAG systems as portable intelligence artifacts — not fragile infrastructure pipelines.

This cleanly separates:

Retrieval memory (what you built) → lives in the .rag file
Language generation (how you query it) → any provider, any environment

The result: reusable semantic memory that travels with your code.

📄 License

MIT License — see LICENSE for details.

RagBucket · Built by Anik Chand

The portable runtime layer for Retrieval-Augmented Generation systems.

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.2.6

May 20, 2026

This version

0.2.5

May 20, 2026

0.2.4

May 20, 2026

0.2.3

May 20, 2026

0.2.2

May 20, 2026

0.2.1

May 20, 2026

0.2.0

May 20, 2026

0.1.3

May 19, 2026

0.1.2

May 19, 2026

0.1.1

May 19, 2026

0.1.0

May 19, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ragbucket-0.2.5.tar.gz (3.2 MB view details)

Uploaded May 20, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

ragbucket-0.2.5-py3-none-any.whl (17.8 kB view details)

Uploaded May 20, 2026 Python 3

File details

Details for the file ragbucket-0.2.5.tar.gz.

File metadata

Download URL: ragbucket-0.2.5.tar.gz
Upload date: May 20, 2026
Size: 3.2 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.10.20

File hashes

Hashes for ragbucket-0.2.5.tar.gz
Algorithm	Hash digest
SHA256	`bc22e321b90ac081dc5a6eaab9325d1c222b1cfc4d8184fd9de986525759b1aa`
MD5	`e9c7fe2f8d1b47d4e7a666c32d096152`
BLAKE2b-256	`bfab0c453f4c4bf42336ebaa6ba4b59f1a3b60f8e988e6a20e6e2754ced2c68a`

See more details on using hashes here.

File details

Details for the file ragbucket-0.2.5-py3-none-any.whl.

File metadata

Download URL: ragbucket-0.2.5-py3-none-any.whl
Upload date: May 20, 2026
Size: 17.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.10.20

File hashes

Hashes for ragbucket-0.2.5-py3-none-any.whl
Algorithm	Hash digest
SHA256	`0f0ad9c74df9c7488b06c0684056e8773801e4a39d0b139a394dde8f336596bb`
MD5	`cc89079537fbdfa82871b45a1a3cf769`
BLAKE2b-256	`8dbc805ecc5271a492b0ee1680965d97a4929dc82951691df19b416564da56a5`

See more details on using hashes here.

ragbucket 0.2.5

Navigation

Verified details

Maintainers

Unverified details

Meta

Classifiers

Project description

RagBucket

Portable Executable RAG Artifacts for Python

The Problem

Introducing `.rag`

Full Architecture

✦ Installation

🏗️ Project Structure

🧰 Technology Stack

✦ Philosophy

📄 License

Project details

Verified details

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

ragbucket 0.2.5

Navigation

Verified details

Maintainers

Unverified details

Meta

Classifiers

Project description

RagBucket

Portable Executable RAG Artifacts for Python

The Problem

Introducing .rag

Full Architecture

✦ Installation

🏗️ Project Structure

🧰 Technology Stack

✦ Philosophy

📄 License

Project details

Verified details

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

Introducing `.rag`