SciPhi R2R

These details have not been verified by PyPI

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Project description

R2R: Production-ready RAG systems.

A semi-opinionated RAG framework.

R2R was conceived to bridge the gap between experimental RAG models and robust, production-ready systems. Our semi-opinionated framework cuts through the complexity, offering a straightforward path to deploy, adapt, and maintain RAG pipelines in production. We prioritize simplicity and practicality, aiming to set a new industry benchmark for ease of use and effectiveness.

Demo(s)

Launching the server locally, running the client, and pipeline observabiilty application:

Launching the basic chat client and running a test query:

https://github.com/SciPhi-AI/R2R/assets/68796651/dc44309f-a52c-4c2a-b0ab-80140e6a2fff

Quick Install:

Install R2R directly using pip:

# use the `'r2r[all]'` to download all required deps
pip install 'r2r[parsing,eval]'

# setup env 
export OPENAI_API_KEY=sk-...
export LOCAL_DB_PATH=local.sqlite

# OR do `vim .env.example && cp .env.example .env`
# INCLUDE secrets and modify config.json
# if using cloud providers (e.g. pgvector, supabase, ...)

Run the server with Docker:

docker pull emrgntcmplxty/r2r:latest

# Place your secrets in `.env` before deploying
docker run -d --name r2r_container -p 8000:8000 --env-file .env r2r

Links

Join the Discord server

Read our Docs

Basic Examples

The project includes several basic examples that demonstrate application deployment and interaction:

basic app: This example runs the backend server, which includes the ingestion, embedding, and RAG pipelines served via FastAPI.
```
# If using a venv, replace `uvicorn` with `venv_path/bin/uvicorn`
uvicorn r2r.examples.basic.app:app
```
basic client: This example should be run after starting the server. It demonstrates uploading text entries as well as a PDF to the local server with the python client. Further, it shows document and user-level vector management with built-in features.
```
python -m r2r.examples.basic.run_client
```

pdf chat: An example demonstrating upload and chat with a more realistic pdf.

# Ingest pdf
python -m r2r.examples.pdf_chat.run_client ingest

# Return search results
python -m r2r.examples.pdf_chat.run_client search "What are the key themes of Meditations?"

# Stream a rag response
poetry run python -m r2r.examples.pdf_chat.run_client rag_completion_streaming "According to Meditaitons, what are some principles to live by?"

academy: A more sophisticated demo demonstrating how to build a more novel pipeline which involves synthetic queries

# Launch the `academy` example application
# If using a venv, replace `uvicorn` with `venv_path/bin/uvicorn`
uvicorn r2r.examples.academy.app:app

# Ask a question
python -m r2r.examples.academy.run_client search "What are the key themes of Meditations?"

intelligence: A web application which communicates with the backend server to provide visual intelligence.
```
cd $workdir/web && pnpm install

# Serve the web app
pnpm dev
```
chat: A chat application which communicates with the basic pipeline to stream chat responses and sources in real time.
```
cd $workdir/chat && pnpm install

# Serve the web app
pnpm dev
```

Full Install:

Follow these steps to ensure a smooth setup:

Install Poetry:
- Before installing the project, make sure you have Poetry on your system. If not, visit the official Poetry website for installation instructions.
Clone and Install Dependencies:

Clone the project repository and navigate to the project directory:
```
git clone git@github.com:SciPhi-AI/r2r.git
cd r2r
```

Copy the .env.example file to .env. This file is in the main project folder:

cp .env.example .env

# Add secrets, `OPENAI_API_KEY` at a minimum
vim .env

Install the project dependencies with Poetry:

# See pyproject.toml for available extras
# use "all" to include every optional dependency
poetry install -E parsing -E eval

Execute with poetry run:

python -m r2r.examples.pdf_chat.run_client ingest

Configure Environment Variables:
- You need to set up cloud provider secrets in your .env. At a minimum, you will need an OpenAI key.
- The framework currently supports PostgreSQL (locally), pgvector and Qdrant with plans to extend coverage.

Key Features

🚀 Rapid Deployment: Facilitates a smooth setup and development of production-ready RAG systems.
⚖️ Flexible Standardization: Ingestion, Embedding, and RAG with proper Observability.
🧩 Easy to modify: Provides a structure that can be extended to deploy your own custom pipelines.
📦 Versioning: Ensures your work remains reproducible and traceable through version control.
🔌 Extensibility: Enables a quick and robust integration with various VectorDBs, LLMs and Embeddings Models.
🤖 OSS Driven: Built for and by the OSS community, to help startups and enterprises to quickly build with RAG.
📝 Deployment Support: Available to help you build and deploy your RAG systems end-to-end.

Core Abstractions

The framework primarily revolves around three core abstractions:

The Ingestion Pipeline: Facilitates the preparation of embeddable 'Documents' from various data formats (json, txt, pdf, html, etc.). The abstraction can be found in ingestion.py.
The Embedding Pipeline: Manages the transformation of text into stored vector embeddings, interacting with embedding and vector database providers through a series of steps (e.g., extract_text, transform_text, chunk_text, embed_chunks, etc.). The abstraction can be found in embedding.py.
The RAG Pipeline: Works similarly to the embedding pipeline but incorporates an LLM provider to produce text completions. The abstraction can be found in rag.py.
The Eval Pipeline: Samples some subset of rag_completion calls for evaluation. Currently DeepEval is supported. The abstraction can be found in eval.py.

Each pipeline incorporates a logging database for operation tracking and observability.

Project details

These details have not been verified by PyPI

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Release history Release notifications | RSS feed

0.2.22

Jun 16, 2024

0.2.21

Jun 15, 2024

0.2.20

Jun 15, 2024

0.2.19

Jun 15, 2024

0.2.18

Jun 13, 2024

0.2.17

Jun 13, 2024

0.2.16

Jun 13, 2024

0.2.15

Jun 13, 2024

0.2.12

Jun 7, 2024

0.2.11

Jun 6, 2024

0.2.4

Jun 2, 2024

0.2.3

May 30, 2024

0.2.2

May 30, 2024

0.2.1

May 30, 2024

0.2.0

May 23, 2024

0.1.35

Apr 17, 2024

0.1.34

Apr 10, 2024

0.1.33

Apr 10, 2024

0.1.32

Apr 10, 2024

0.1.31

Apr 5, 2024

0.1.29

Apr 4, 2024

0.1.28

Mar 29, 2024

0.1.27

Mar 25, 2024

0.1.26

Mar 12, 2024

0.1.25

Mar 11, 2024

This version

0.1.24

Mar 9, 2024

0.1.23

Mar 3, 2024

0.1.22

Feb 29, 2024

0.1.21

Feb 27, 2024

0.1.2

Feb 27, 2024

0.1.1

Feb 26, 2024

0.1.0

Feb 21, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

r2r-0.1.24.tar.gz (3.8 MB view hashes)

Uploaded Mar 9, 2024 Source

Built Distribution

r2r-0.1.24-py3-none-any.whl (3.8 MB view hashes)

Uploaded Mar 9, 2024 Python 3

Hashes for r2r-0.1.24.tar.gz

Hashes for r2r-0.1.24.tar.gz
Algorithm	Hash digest
SHA256	`80148b85bed13a3d37cc53295080fa7b48bc164dc9d7596e73ed315230d2fe41`
MD5	`79ad357a483fe24c12ab6bf2b8eccfed`
BLAKE2b-256	`eb0be76badb65e8142dfacf981848bd27d8bb2a835bf6e9ab33aa137b2eae089`

Hashes for r2r-0.1.24-py3-none-any.whl

Hashes for r2r-0.1.24-py3-none-any.whl
Algorithm	Hash digest
SHA256	`06a2468a8623ae3f861cefbe7e4603dd1428ed411f3bfdfafb2fa1bab7f33593`
MD5	`2381599ae480505848acef08d1ee93a8`
BLAKE2b-256	`bdb4053fca52cc4a34aa18a7013b5a8d906031f9c73c68b0113830850624b0f2`