

Aquiles-RAG


Description

Aquiles-RAG is a high-performance Retrieval-Augmented Generation (RAG) solution built on Redis. It offers a high-level interface through FastAPI REST APIs to:

  • Create RAG indexes in Redis.
  • Send raw text alongside its embeddings (the client must chunk the text and compute embeddings before submission).
  • Query the index to retrieve the most relevant chunks.
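Because chunking happens on the client, a small helper is usually needed before anything is sent to the API. The following is a minimal sketch of fixed-size character chunking with overlap. The function name `chunk_text` and its `size` and `overlap` parameters are hypothetical and not part of Aquiles-RAG; the embedding model is likewise the client's own choice and is not shown.

```python
def chunk_text(text: str, size: int = 200, overlap: int = 50) -> list[str]:
    """Split text into windows of `size` characters that overlap by `overlap`.

    Hypothetical client-side helper; Aquiles-RAG does not ship this function.
    """
    if size <= 0:
        raise ValueError("size must be positive")
    if not 0 <= overlap < size:
        raise ValueError("overlap must satisfy 0 <= overlap < size")

    step = size - overlap  # how far the window advances each iteration
    chunks = []
    for start in range(0, len(text), step):
        chunks.append(text[start:start + size])
        # Stop once the current window already reaches the end of the text,
        # so no redundant tail chunk is produced.
        if start + size >= len(text):
            break
    return chunks
```

Each returned chunk would then be embedded by the client and submitted together with its vector. Overlap is optional; it is commonly used so that a sentence cut at a chunk boundary still appears whole in one of the neighbouring chunks.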

Features

  • Optimized Performance: Uses Redis as a vector search engine.
  • Simple API: Endpoints for index creation, insertion, and querying.
  • Extensible: Basic implementation ready for enhancements and integration into ML pipelines.

High-Level Architecture

Here's a diagram illustrating how Aquiles-RAG connects clients to Redis using an asynchronous FastAPI server:

[Figure: Aquiles-RAG architecture diagram]

Usage

Create Index

curl -X POST http://localhost:5500/create/index \
     -H 'Content-Type: application/json' \
     -d '{"indexname": "my_index"}'
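The same request can be issued from Python using only the standard library. This is a sketch under the assumption that the server runs at the host and port shown in the curl example. The helper names `build_json_post` and `create_index` are hypothetical, and the response is returned as raw bytes because its format is not described here.

```python
import json
import urllib.request

BASE_URL = "http://localhost:5500"  # host and port taken from the curl example


def build_json_post(path: str, payload: dict) -> urllib.request.Request:
    """Build (but do not send) a JSON POST request for the given path."""
    return urllib.request.Request(
        url=BASE_URL + path,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def create_index(indexname: str) -> bytes:
    """Send the create-index request and return the raw response body."""
    req = build_json_post("/create/index", {"indexname": indexname})
    with urllib.request.urlopen(req) as resp:
        return resp.read()
```

Calling `create_index("my_index")` against a running server mirrors the curl command above. Keeping request construction separate from sending makes the payload easy to inspect without a live server.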

Send RAG

curl -X POST http://localhost:5500/rag/create \
     -H 'Content-Type: application/json' \
     -d '{
           "index": "my_index",
           "raw_text": "Full text goes here...",
           "embeddings": [0.12, 0.34, ...]
         }'
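When a document has been split into several chunks, each chunk needs its own request body. The sketch below builds those bodies from the three fields shown in the curl example (`index`, `raw_text`, `embeddings`). It assumes one chunk per request, which is inferred from the shape of that example rather than confirmed, and the function name `make_insert_payloads` is hypothetical.

```python
import json


def make_insert_payloads(
    index: str, chunks: list[str], vectors: list[list[float]]
) -> list[str]:
    """Return one JSON request body per (chunk, embedding) pair.

    Hypothetical helper: assumes /rag/create takes a single chunk per call.
    """
    if len(chunks) != len(vectors):
        raise ValueError("each chunk needs exactly one embedding vector")
    # All vectors should come from the same model, so their lengths must match.
    if len({len(v) for v in vectors}) > 1:
        raise ValueError("embedding vectors have inconsistent dimensions")

    return [
        json.dumps({"index": index, "raw_text": chunk, "embeddings": vector})
        for chunk, vector in zip(chunks, vectors)
    ]
```

Each string in the returned list can be sent as the `-d` body of the curl command above, or posted from any HTTP client.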

Query RAG

curl -X POST http://localhost:5500/rag/query-rag \
     -H 'Content-Type: application/json' \
     -d '{
           "index": "my_index",
           "embeddings": [0.56, 0.78, ...],
           "top_k": 5
         }'
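To make the meaning of `top_k` concrete, here is a purely illustrative brute-force ranking in plain Python. It is not how Redis performs vector search, and the similarity metric used by the server is not stated here; cosine similarity is an assumption made for this sketch. The names `cosine_similarity` and `top_k_chunks` are hypothetical.

```python
import math


def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine of the angle between two vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def top_k_chunks(
    query: list[float], stored: list[tuple[str, list[float]]], k: int
) -> list[str]:
    """Return the texts of the k stored chunks most similar to the query.

    Illustration only: assumes cosine similarity and scans every vector.
    """
    scored = [(cosine_similarity(query, vec), text) for text, vec in stored]
    scored.sort(key=lambda pair: pair[0], reverse=True)  # most similar first
    return [text for _, text in scored[:k]]
```

With `"top_k": 5`, the query asks for the five highest-ranked chunks in the same spirit, with the ranking done by the vector index instead of a linear scan.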

Command-Line Interface (CLI)

Usage Examples

# Save configuration
aquiles-rag configs --local True --host redis.local --port 6380

# Start server
aquiles-rag serve
