Aquiles-RAG

Description

Aquiles-RAG is a high-performance Retrieval-Augmented Generation (RAG) solution built on Redis. It offers a high-level interface through FastAPI REST APIs to:

  • Create RAG indexes in Redis.
  • Send raw text alongside its embeddings (the client must chunk the text and compute embeddings before submission).
  • Query the index to retrieve the most relevant chunks.
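Because chunking happens on the client, a small helper is usually needed before anything is sent to the server. The following is a minimal sketch of one possible approach, fixed-size character windows with overlap. The function name `chunk_text` and its default sizes are hypothetical and not part of Aquiles-RAG; the embedding step is left out because it depends on whichever model the client chooses.

```python
def chunk_text(text: str, chunk_size: int = 200, overlap: int = 50) -> list[str]:
    """Split text into overlapping character windows (hypothetical client-side helper)."""
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must satisfy 0 <= overlap < chunk_size")

    step = chunk_size - overlap  # how far each window advances
    chunks = []
    for start in range(0, len(text), step):
        chunks.append(text[start:start + chunk_size])
        # Stop once the current window already reaches the end of the text.
        if start + chunk_size >= len(text):
            break
    return chunks
```

Each returned chunk would then be passed to an embedding model, and the resulting vector sent together with the chunk text. Overlap is a common way to avoid cutting a sentence's context at a window boundary; splitting on sentences or tokens instead is equally valid.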

Features

  • Optimized Performance: Uses Redis as a vector search engine.
  • Simple API: Endpoints for index creation, insertion, and querying.
  • Extensible: Basic implementation ready for enhancements and integration into ML pipelines.

High-Level Architecture

Here's a diagram illustrating how Aquiles-RAG connects clients to Redis using an asynchronous FastAPI server:

[Architecture diagram: client → asynchronous FastAPI server → Redis]

Usage

Create Index

curl -X POST http://localhost:5500/create/index \
     -H 'Content-Type: application/json' \
     -d '{"indexname": "my_index"}'

Send RAG

curl -X POST http://localhost:5500/rag/create \
     -H 'Content-Type: application/json' \
     -d '{
           "index": "my_index",
           "raw_text": "Full text goes here...",
           "embeddings": [0.12, 0.34, ...]
         }'
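In the curl example, `...` stands for the remaining vector components, so a real request has to carry the full embedding. As a rough sketch, the request bodies for several chunks could be assembled like this. The helper name `build_insert_payloads` is hypothetical; the field names `index`, `raw_text`, and `embeddings` are taken from the example above, and one request per chunk is an assumption based on that example.

```python
import json


def build_insert_payloads(index: str, chunks: list[str],
                          embeddings: list[list[float]]) -> list[dict]:
    """Pair each chunk with its vector and build one JSON-ready body per chunk."""
    if len(chunks) != len(embeddings):
        raise ValueError("each chunk needs exactly one embedding vector")
    return [
        {
            "index": index,
            "raw_text": chunk,
            # Cast to plain floats so values from array libraries serialize cleanly.
            "embeddings": [float(x) for x in vector],
        }
        for chunk, vector in zip(chunks, embeddings)
    ]


if __name__ == "__main__":
    # Toy two-dimensional vectors for illustration only; a real vector's
    # length depends on the embedding model in use.
    payloads = build_insert_payloads(
        "my_index",
        ["first chunk", "second chunk"],
        [[0.12, 0.34], [0.56, 0.78]],
    )
    for payload in payloads:
        print(json.dumps(payload))
```

Each printed body can be sent as the `-d` argument of the `POST /rag/create` call shown above.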

Query RAG

curl -X POST http://localhost:5500/rag/query-rag \
     -H 'Content-Type: application/json' \
     -d '{
           "index": "my_index",
           "embeddings": [0.56, 0.78, ...],
           "top_k": 5
         }'
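The same query can be issued from Python using only the standard library. This is a sketch under a few assumptions: the server is reachable at the host and port used in the curl examples, and the response body is JSON (its schema is not documented here, so it is returned unparsed beyond `json.loads`). The names `build_query_request` and `run_query` are hypothetical.

```python
import json
import urllib.request

BASE_URL = "http://localhost:5500"  # host and port taken from the curl examples


def build_query_request(index: str, query_embedding: list[float], top_k: int = 5,
                        base_url: str = BASE_URL) -> urllib.request.Request:
    """Build the POST request for /rag/query-rag without sending it."""
    if top_k < 1:
        raise ValueError("top_k must be at least 1")
    body = json.dumps({
        "index": index,
        "embeddings": [float(x) for x in query_embedding],
        "top_k": top_k,
    }).encode("utf-8")
    return urllib.request.Request(
        f"{base_url}/rag/query-rag",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def run_query(request: urllib.request.Request, timeout: float = 10.0):
    """Send the request and decode the body, assuming the server replies with JSON."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

With a server running, `run_query(build_query_request("my_index", query_vector))` would return whatever the endpoint responds with. The query vector should come from the same embedding model that produced the stored vectors, since similarity search across different models is not meaningful.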

Command-Line Interface (CLI)

Usage Examples

# Save configuration
aquiles-rag configs --local True --host redis.local --port 6380

# Start server
aquiles-rag serve
