RAG framework

These details have not been verified by PyPI

Project links

Project description

Rago

License

Rago is a lightweight framework for RAG.

Software License: BSD 3 Clause
Documentation: https://osl-incubator.github.io/rago

Features

Vector Database support
- FAISS
Retrieval features
- Support PDF extraction via Langchain
Augmentation (Embedding + Vector Database Search)
- Support for Sentence Transformer (Hugging Face)
- Support for Open AI
- Support for SpaCy
Generation (LLM)
- Support for Hugging Face
- Support for Llama (Hugging Face)
- Support for OpenAI
- Support for Gemini

Roadmap

1. Add new Backends

As noted in several GitHub issues, our initial goal is to support as many backends as possible. This approach will provide valuable insights into user needs and inform the structure for the next phase.

2. Declarative API for Rago

Objective

To simplify and streamline the user experience in configuring RAG by introducing a declarative, composable API—similar to how Plotnine or Altair allows users to build visualizations.

Overview

The current procedural approach in Rago requires users to instantiate and connect individual components (retrieval, augmentation, generation, etc.) manually. This can become cumbersome as support for multiple backends grows. We propose a new declarative interface that lets users define their entire RAG steps in a single, fluent expression using operator overloading.

Proposed Syntax Example

from pathlib import Path

from rago import Rago, Retrieval, Augmented, Generation, DB, Cache

datasource = ...

rag = (
    Rago()
    | DB(backend="faiss")
    | Cache(backend="file", target_dir=Path(".rago-cache"))
    | Retrieval(backend="string")
    | Augmented(
        backend="openai",
        model_name="text-embedding-3-small",
        top_k=5,
    )
    | Generation(
        backend="openai",
        model_name="gpt-4o-mini",
        prompt_template="Question: {query}\nContext: {context}\nAnswer:"
    )
)

result = rag.run(query="What is the capital of France?", source=datasource)
print(result.result)

Key Benefits

Intuitive Composition: Users can build complex pipelines by simply adding layers together.
Modularity: Each component is encapsulated, making it easy to swap or extend backends without altering the overall architecture.
Reduced Boilerplate: The declarative syntax minimizes the need for repetitive setup code, focusing on the "what" rather than the "how."
Enhanced Readability: The pipeline’s structure becomes immediately clear, promoting easier maintenance and collaboration.

Implementation Plan

Define Base Classes: Develop abstract base classes for each component (DB, Cache, Retrieval, Augmented, Generation) to standardize interfaces and facilitate future extensions.
Operator Overloading: Implement the __or__ method in the main Rago class to allow chaining of components, effectively building the pipeline through a fluent interface.
Configuration and Defaults: Integrate sensible defaults and validation (using tools like Pydantic) so that users can override only when necessary.
Documentation and Examples: Provide comprehensive documentation and examples to illustrate the new declarative syntax and usage scenarios.

Installation

If you want to install it for cpu only, you can run:

$ pip install rago[cpu]

But, if you want to install it for gpu (cuda), you can run:

$ pip install rago[gpu]

Setup

Llama 3

In order to use a Llama model, visit its page on Hugging Face and request access via its form, for example: https://huggingface.co/meta-llama/Llama-3.2-1B.

After you are granted access to the desired model, you will be able to use it with Rago.

You will also need to provide a Hugging Face token in order to download the models locally, for example:

from rago import Augmented, Generation, Rago, Retrieval

# For Gated LLMs
HF_TOKEN = 'YOUR_HUGGING_FACE_TOKEN'

animals_data = [
    "The Blue Whale is the largest animal ever known to have existed, even "
    "bigger than the largest dinosaurs.",
    "The Peregrine Falcon is renowned as the fastest animal on the planet, "
    "capable of reaching speeds over 240 miles per hour.",
    "The Giant Panda is a bear species endemic to China, easily recognized by "
    "its distinctive black-and-white coat.",
    "The Cheetah is the world's fastest land animal, capable of sprinting at "
    "speeds up to 70 miles per hour in short bursts covering distances up to "
    "500 meters.",
    "The Komodo Dragon is the largest living species of lizard, found on "
    "several Indonesian islands, including its namesake, Komodo.",
]

rag = (
    Rago()
    | Retrieval(backend='string')
    | Augmented(
        backend='sentence_transformers',
        model_name='paraphrase-MiniLM-L12-v2',
        top_k=2,
    )
    | Generation(
        backend='llama',
        model_name='meta-llama/Llama-3.2-1B',
        api_key=HF_TOKEN,
    )
)

rag.prompt('What is the fastest animal on Earth?', source=animals_data)

Ollama

For testing the generation with Ollama, run first the following commands:

$ ollama pull llama3.2:1b
$ ollama serve

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.14.5

Mar 14, 2026

0.14.4

Sep 5, 2025

0.14.3

Aug 18, 2025

0.14.2

Aug 17, 2025

0.14.0

Apr 18, 2025

0.13.0

Mar 13, 2025

0.12.0

Feb 11, 2025

0.11.3

Feb 7, 2025

0.11.2

Jan 22, 2025

0.11.1

Jan 22, 2025

0.11.0

Jan 21, 2025

0.10.1

Jan 19, 2025

0.10.0

Jan 16, 2025

0.9.0

Nov 21, 2024

0.8.1

Nov 19, 2024

0.8.0

Nov 19, 2024

0.7.1

Nov 15, 2024

0.7.0

Nov 14, 2024

0.6.0

Nov 7, 2024

0.5.1

Nov 4, 2024

0.5.0

Nov 1, 2024

0.4.0

Oct 31, 2024

0.3.0

Oct 28, 2024

0.2.0

Oct 23, 2024

0.1.0

Oct 23, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

rago-0.14.5.tar.gz (38.7 kB view details)

Uploaded Mar 14, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

rago-0.14.5-py3-none-any.whl (48.3 kB view details)

Uploaded Mar 14, 2026 Python 3

File details

Details for the file rago-0.14.5.tar.gz.

File metadata

Download URL: rago-0.14.5.tar.gz
Upload date: Mar 14, 2026
Size: 38.7 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.15

File hashes

Hashes for rago-0.14.5.tar.gz
Algorithm	Hash digest
SHA256	`44563353e7d7769bedbb6624083067185d8e1b815feb78a267647fcab7dbcea8`
MD5	`66c9520fd3dae05e01da24d758b6462f`
BLAKE2b-256	`6ab1daca2b1499f236de289550e4443281d8374fd956a40c4e53e91ec9aabec5`

See more details on using hashes here.

File details

Details for the file rago-0.14.5-py3-none-any.whl.

File metadata

Download URL: rago-0.14.5-py3-none-any.whl
Upload date: Mar 14, 2026
Size: 48.3 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.15

File hashes

Hashes for rago-0.14.5-py3-none-any.whl
Algorithm	Hash digest
SHA256	`d3cb9e55ee7c79f72f03fa8fcb7506070107b661ce15a5af5ea7506dae5b6b30`
MD5	`1a977511b3c3f4eb3782b6a5af7aa696`
BLAKE2b-256	`4dc337b73fe409e51b7e42d7d5f7ca36cbc31a96b0dfb4ebe4409224b3718fac`

See more details on using hashes here.

rago 0.14.5

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Project description

Rago

Features

Roadmap

1. Add new Backends

2. Declarative API for Rago

Objective

Overview

Proposed Syntax Example

Key Benefits

Implementation Plan

Installation

Setup

Llama 3

Ollama

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes