Building blocks for rapid development of GenAI applications

These details have not been verified by PyPI

Project links

Project description

🐰 Ragbits

Building blocks for rapid development of GenAI applications

Homepage | Documentation | Contact

Features

🔨 Build Reliable & Scalable GenAI Apps

Swap LLMs anytime – Switch between 100+ LLMs via LiteLLM or run local models.
Type-safe LLM calls – Use Python generics to enforce strict type safety in model interactions.
Bring your own vector store – Connect to Qdrant, PgVector, and more with built-in support.
Developer tools included – Manage vector stores, query pipelines, and test prompts from your terminal.
Modular installation – Install only what you need, reducing dependencies and improving performance.

📚 Fast & Flexible RAG Processing

Ingest 20+ formats – Process PDFs, HTML, spreadsheets, presentations, and more. Process data using Docling, Unstructured or create a custom parser.
Handle complex data – Extract tables, images, and structured content with built-in VLMs support.
Connect to any data source – Use prebuilt connectors for S3, GCS, Azure, or implement your own.
Scale ingestion – Process large datasets quickly with Ray-based parallel processing.

🤖 Build Multi-Agent Workflows with Ease

Multi-agent coordination – Create teams of specialized agents with role-based collaboration using A2A protocol for interoperability.
Real-time data integration – Leverage Model Context Protocol (MCP) for live web access, database queries, and API integrations.
Conversation state management – Maintain context across interactions with automatic history tracking.

🚀 Deploy & Monitor with Confidence

Real-time observability – Track performance with OpenTelemetry and CLI insights.
Built-in testing – Validate prompts with promptfoo before deployment.
Auto-optimization – Continuously evaluate and refine model performance.
Chat UI – Deploy chatbot interface with API, persistance and user feedback.

Installation

Stable Release

To get started quickly, you can install the latest stable release with:

pip install ragbits

Nightly Builds

For the latest development features, you can install nightly builds that are automatically published from the develop branch:

pip install ragbits --pre

Note: Nightly builds include the latest features and bug fixes but may be less stable than official releases. They follow the version format X.Y.Z.devYYYYMMDDHHMM.

Package Contents

This is a starter bundle of packages, containing:

ragbits-core - fundamental tools for working with prompts, LLMs and vector databases.
ragbits-agents - abstractions for building agentic systems.
ragbits-document-search - retrieval and ingestion piplines for knowledge bases.
ragbits-evaluate - unified evaluation framework for Ragbits components.
ragbits-guardrails - utilities for ensuring the safety and relevance of responses.
ragbits-chat - full-stack infrastructure for building conversational AI applications.
ragbits-cli - ragbits shell command for interacting with Ragbits components.

Alternatively, you can use individual components of the stack by installing their respective packages.

Quickstart

Basics

To define a prompt and run LLM:

import asyncio
from pydantic import BaseModel
from ragbits.core.llms import LiteLLM
from ragbits.core.prompt import Prompt

class QuestionAnswerPromptInput(BaseModel):
    question: str

class QuestionAnswerPrompt(Prompt[QuestionAnswerPromptInput, str]):
    system_prompt = """
    You are a question answering agent. Answer the question to the best of your ability.
    """
    user_prompt = """
    Question: {{ question }}
    """

llm = LiteLLM(model_name="gpt-4.1-nano")

async def main() -> None:
    prompt = QuestionAnswerPrompt(QuestionAnswerPromptInput(question="What are high memory and low memory on linux?"))
    response = await llm.generate(prompt)
    print(response)

if __name__ == "__main__":
    asyncio.run(main())

Document Search

To build and query a simple vector store index:

import asyncio
from ragbits.core.embeddings import LiteLLMEmbedder
from ragbits.core.vector_stores import InMemoryVectorStore
from ragbits.document_search import DocumentSearch

embedder = LiteLLMEmbedder(model_name="text-embedding-3-small")
vector_store = InMemoryVectorStore(embedder=embedder)
document_search = DocumentSearch(vector_store=vector_store)

async def run() -> None:
    await document_search.ingest("web://https://arxiv.org/pdf/1706.03762")
    result = await document_search.search("What are the key findings presented in this paper?")
    print(result)

if __name__ == "__main__":
    asyncio.run(run())

Retrieval-Augmented Generation

To build a simple RAG pipeline:

import asyncio
from collections.abc import Iterable
from pydantic import BaseModel
from ragbits.core.embeddings import LiteLLMEmbedder
from ragbits.core.llms import LiteLLM
from ragbits.core.prompt import Prompt
from ragbits.core.vector_stores import InMemoryVectorStore
from ragbits.document_search import DocumentSearch
from ragbits.document_search.documents.element import Element

class QuestionAnswerPromptInput(BaseModel):
    question: str
    context: Iterable[Element]

class QuestionAnswerPrompt(Prompt[QuestionAnswerPromptInput, str]):
    system_prompt = """
    You are a question answering agent. Answer the question that will be provided using context.
    If in the given context there is not enough information refuse to answer.
    """
    user_prompt = """
    Question: {{ question }}
    Context: {% for chunk in context %}{{ chunk.text_representation }}{%- endfor %}
    """

llm = LiteLLM(model_name="gpt-4.1-nano")
embedder = LiteLLMEmbedder(model_name="text-embedding-3-small")
vector_store = InMemoryVectorStore(embedder=embedder)
document_search = DocumentSearch(vector_store=vector_store)

async def run() -> None:
    question = "What are the key findings presented in this paper?"

    await document_search.ingest("web://https://arxiv.org/pdf/1706.03762")
    chunks = await document_search.search(question)

    prompt = QuestionAnswerPrompt(QuestionAnswerPromptInput(question=question, context=chunks))
    response = await llm.generate(prompt)
    print(response)

if __name__ == "__main__":
    asyncio.run(run())

Agentic RAG

To build an agentic RAG pipeline:

import asyncio
from ragbits.agents import Agent
from ragbits.core.embeddings import LiteLLMEmbedder
from ragbits.core.llms import LiteLLM
from ragbits.core.vector_stores import InMemoryVectorStore
from ragbits.document_search import DocumentSearch

embedder = LiteLLMEmbedder(model_name="text-embedding-3-small")
vector_store = InMemoryVectorStore(embedder=embedder)
document_search = DocumentSearch(vector_store=vector_store)

llm = LiteLLM(model_name="gpt-4.1-nano")
agent = Agent(llm=llm, tools=[document_search.search])

async def main() -> None:
    await document_search.ingest("web://https://arxiv.org/pdf/1706.03762")
    response = await agent.run("What are the key findings presented in this paper?")
    print(response.content)

if __name__ == "__main__":
    asyncio.run(main())

Chat UI

To expose your GenAI application through Ragbits API:

from collections.abc import AsyncGenerator
from ragbits.agents import Agent, ToolCallResult
from ragbits.chat.api import RagbitsAPI
from ragbits.chat.interface import ChatInterface
from ragbits.chat.interface.types import ChatContext, ChatResponse, LiveUpdateType
from ragbits.core.embeddings import LiteLLMEmbedder
from ragbits.core.llms import LiteLLM, ToolCall
from ragbits.core.prompt import ChatFormat
from ragbits.core.vector_stores import InMemoryVectorStore
from ragbits.document_search import DocumentSearch

embedder = LiteLLMEmbedder(model_name="text-embedding-3-small")
vector_store = InMemoryVectorStore(embedder=embedder)
document_search = DocumentSearch(vector_store=vector_store)

llm = LiteLLM(model_name="gpt-4.1-nano")
agent = Agent(llm=llm, tools=[document_search.search])

class MyChat(ChatInterface):
    async def setup(self) -> None:
        await document_search.ingest("web://https://arxiv.org/pdf/1706.03762")

    async def chat(
        self,
        message: str,
        history: ChatFormat,
        context: ChatContext,
    ) -> AsyncGenerator[ChatResponse]:
        async for result in agent.run_streaming(message):
            match result:
                case str():
                    yield self.create_live_update(
                        update_id="1",
                        type=LiveUpdateType.START,
                        label="Answering...",
                    )
                    yield self.create_text_response(result)
                case ToolCall():
                    yield self.create_live_update(
                        update_id="2",
                        type=LiveUpdateType.START,
                        label="Searching...",
                    )
                case ToolCallResult():
                    yield self.create_live_update(
                        update_id="2",
                        type=LiveUpdateType.FINISH,
                        label="Search",
                        description=f"Found {len(result.result)} relevant chunks.",
                    )

        yield self.create_live_update(
            update_id="1",
            type=LiveUpdateType.FINISH,
            label="Answer",
        )

if __name__ == "__main__":
    api = RagbitsAPI(MyChat)
    api.run()

Rapid development

Create Ragbits projects from templates:

uvx create-ragbits-app

Explore create-ragbits-app repo here. If you have a new idea for a template, feel free to contribute!

Documentation

Tutorials - Get started with Ragbits in a few minutes
How-to - Learn how to use Ragbits in your projects
CLI - Learn how to run Ragbits in your terminal
API reference - Explore the underlying Ragbits API

Contributing

We welcome contributions! Please read CONTRIBUTING.md for more information.

License

Ragbits is licensed under the MIT License.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

1.7.0.dev202605130309 pre-release

May 13, 2026

1.7.0.dev202604280307 pre-release

Apr 28, 2026

1.7.0.dev202604240307 pre-release

Apr 24, 2026

1.7.0.dev202604150306 pre-release

Apr 15, 2026

1.7.0.dev202604020305 pre-release

Apr 2, 2026

1.6.2

Mar 31, 2026

1.6.1

Mar 24, 2026

1.6.0

Mar 18, 2026

1.5.0

Feb 25, 2026

1.4.2

Feb 18, 2026

1.4.1

Feb 10, 2026

1.4.0

Feb 5, 2026

1.4.0.dev202603100257 pre-release

Mar 10, 2026

This version

1.4.0.dev202603070252 pre-release

Mar 7, 2026

1.4.0.dev202602261352 pre-release

Feb 26, 2026

1.4.0.dev202602190302 pre-release

Feb 19, 2026

1.4.0.dev202602170301 pre-release

Feb 17, 2026

1.4.0.dev202602130304 pre-release

Feb 13, 2026

1.4.0.dev202602120304 pre-release

Feb 12, 2026

1.4.0.dev202602100304 pre-release

Feb 10, 2026

1.4.0.dev202602070256 pre-release

Feb 7, 2026

1.4.0.dev202602030301 pre-release

Feb 3, 2026

1.4.0.dev202601310254 pre-release

Jan 31, 2026

1.4.0.dev202601300258 pre-release

Jan 30, 2026

1.4.0.dev202601261217 pre-release

Jan 26, 2026

1.4.0.dev202601170236 pre-release

Jan 17, 2026

1.4.0.dev202601130240 pre-release

Jan 13, 2026

1.4.0.dev202601010248 pre-release

Jan 1, 2026

1.4.0.dev202512160238 pre-release

Dec 16, 2025

1.4.0.dev202512151244 pre-release

Dec 15, 2025

1.4.0.dev202512110238 pre-release

Dec 11, 2025

1.4.0.dev202512100237 pre-release

Dec 10, 2025

1.4.0.dev202512090236 pre-release

Dec 9, 2025

1.4.0.dev202512050236 pre-release

Dec 5, 2025

1.4.0.dev202512030235 pre-release

Dec 3, 2025

1.4.0.dev202512021005 pre-release

Dec 2, 2025

1.4.0.dev202511290233 pre-release

Nov 29, 2025

1.4.0.dev202511160236 pre-release

Nov 16, 2025

1.4.0.dev202509220622 pre-release

Sep 22, 2025

1.4.0.dev202509220615 pre-release

Sep 22, 2025

1.3.0

Sep 11, 2025

1.3.0.dev202509120609 pre-release

Sep 12, 2025

1.2.2

Aug 9, 2025

1.2.1

Aug 5, 2025

1.2.0

Aug 2, 2025

1.1.0

Jul 9, 2025

1.0.0

Jun 4, 2025

0.20.1

Jun 4, 2025

0.20.0

Jun 3, 2025

0.19.1

May 27, 2025

0.19.0

May 27, 2025

0.18.0

May 22, 2025

0.17.1

May 12, 2025

0.17.0

May 6, 2025

0.16.0

Apr 29, 2025

0.15.0

Apr 28, 2025

0.14.0

Apr 22, 2025

0.13.0

Apr 2, 2025

0.12.0

Mar 25, 2025

0.11.0

Mar 25, 2025

0.10.2

Mar 21, 2025

0.10.1

Mar 19, 2025

0.10.0

Mar 17, 2025

0.9.0

Feb 25, 2025

0.8.0

Jan 29, 2025

0.7.0

Jan 21, 2025

0.6.0

Dec 27, 2024

0.5.1

Dec 9, 2024

0.5.0

Dec 5, 2024

0.4.0

Nov 27, 2024

0.3.0

Nov 6, 2024

0.2.0

Oct 23, 2024

0.1.0

Oct 8, 2024

0.0.30rc1 pre-release

Dec 19, 2025

0.0.30.dev29302392 pre-release

Dec 9, 2025

0.0.8.dev23005 pre-release

Dec 9, 2025

0.0.1

Sep 13, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ragbits-1.4.0.dev202603070252.tar.gz (13.7 kB view details)

Uploaded Mar 7, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

ragbits-1.4.0.dev202603070252-py3-none-any.whl (5.5 kB view details)

Uploaded Mar 7, 2026 Python 3

File details

Details for the file ragbits-1.4.0.dev202603070252.tar.gz.

File metadata

Download URL: ragbits-1.4.0.dev202603070252.tar.gz
Upload date: Mar 7, 2026
Size: 13.7 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.10.19

File hashes

Hashes for ragbits-1.4.0.dev202603070252.tar.gz
Algorithm	Hash digest
SHA256	`b20107992ba584ecebe9645b79928d02af78867e9e1b704873d61912afcfaf8a`
MD5	`6d3648ec00d6fbad8c45e8249d6f3e8e`
BLAKE2b-256	`b7d9a08f6031348104836d466e597b70da823c1fb979ec5554284028e4d94d19`

See more details on using hashes here.

File details

Details for the file ragbits-1.4.0.dev202603070252-py3-none-any.whl.

File metadata

Download URL: ragbits-1.4.0.dev202603070252-py3-none-any.whl
Upload date: Mar 7, 2026
Size: 5.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.10.19

File hashes

Hashes for ragbits-1.4.0.dev202603070252-py3-none-any.whl
Algorithm	Hash digest
SHA256	`6d3b47e40ce9364b26de756e34c056031902883c30c5f77b8287023b697bcbd8`
MD5	`36bb8c59547b71f4b910b260bd67c7e2`
BLAKE2b-256	`b3768206fcffe540a39574f14a76c7d1988a319de1b945646f7e59f4994e73ed`

See more details on using hashes here.

ragbits 1.4.0.dev202603070252

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

🐰 Ragbits

Features

🔨 Build Reliable & Scalable GenAI Apps

📚 Fast & Flexible RAG Processing

🤖 Build Multi-Agent Workflows with Ease

🚀 Deploy & Monitor with Confidence

Installation

Stable Release

Nightly Builds

Package Contents

Quickstart

Basics

Document Search

Retrieval-Augmented Generation

Agentic RAG

Chat UI

Rapid development

Documentation

Contributing

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes