Aquiles-RAG is a high-performance Augmented Recovery-Generation (RAG) solution based on Redis, Qdrant or PostgreSQLRAG. It offers a high-level interface using FastAPI REST APIs.

These details have not been verified by PyPI

Project links

Project description

Aquiles-RAG

High-performance Retrieval-Augmented Generation (RAG) on Redis, Qdrant or PostgreSQL (pgvector)
🚀 FastAPI • Redis / Qdrant / PostgreSQL • Async • Embedding-agnostic

📖 Documentation

⭐ Features

📈 High Performance: Vector search powered by Redis HNSW, Qdrant, or PostgreSQL with pgvector.
🛠️ Simple API: Endpoints for index creation, insertion, querying, and optional re-ranking.
🔌 Embedding-agnostic: Works with any embedding model (OpenAI, Llama 3, HuggingFace, etc.).
💻 Interactive Setup Wizard: aquiles-rag configs walks you through full configuration for Redis, Qdrant, or PostgreSQL.
⚡ Sync & Async clients: AquilesRAG (requests) and AsyncAquilesRAG (httpx) with embedding_model and metadata support.
🧩 Extensible: Designed to integrate into ML pipelines, microservices, or serverless deployments; supports an optional re-ranker stage for improved result ordering.

🛠 Tech Stack

Python 3.9+
FastAPI
Redis, Qdrant or PostgreSQL + pgvector as vector store
NumPy
Pydantic
Jinja2
Click (CLI)
Requests (sync client)
HTTPX (async client)
Platformdirs (config management)

⚙️ Requirements

Redis (standalone or cluster) — or Qdrant (HTTP / gRPC) — or PostgreSQL with the pgvector extension.
Python 3.9+
pip

Optional: run Redis locally with Docker:

docker run -d --name redis-stack -p 6379:6379 redis/redis-stack-server:latest

🚀 Installation

Via PyPI (recommended)

pip install aquiles-rag

From Source (optional)

git clone https://github.com/Aquiles-ai/Aquiles-RAG.git
cd Aquiles-RAG

python -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt

# optional development install
pip install -e .

🔧 Configuration & Connection Options

Configuration is persisted at:

~/.local/share/aquiles/aquiles_config.json

Setup Wizard (recommended)

The previous manual per-flag config flow was replaced by an interactive wizard. Run:

aquiles-rag configs

The wizard prompts for everything required for either Redis, Qdrant, or PostgreSQL (host, ports, TLS/gRPC options, API keys, admin user). At the end it writes aquiles_config.json to the standard location.

The wizard also includes optional re-ranker configuration (enable/disable, execution provider, model name, concurrency, preload) so you can activate a re-ranking stage that scores (query, doc) pairs after the vector store returns candidates.

Manual config (advanced / CI)

If you prefer automation, generate the same JSON schema the wizard writes and place it at ~/.local/share/aquiles/aquiles_config.json before starting the server (or use the deploy pattern described below).

Redis connection modes (examples)

Aquiles-RAG supports multiple Redis modes:

Local Cluster

RedisCluster(host=host, port=port, decode_responses=True)

Standalone Local

redis.Redis(host=host, port=port, decode_responses=True)

Remote with TLS/SSL

redis.Redis(host=host, port=port, username=username or None,
            password=password or None, ssl=True, decode_responses=True,
            ssl_certfile=ssl_certfile, ssl_keyfile=ssl_keyfile, ssl_ca_certs=ssl_ca_certs)

Remote without TLS/SSL

redis.Redis(host=host, port=port, username=username or None, password=password or None, decode_responses=True)

If you select PostgreSQL in the wizard, the wizard will prompt for connection and pool settings for your Postgres instance. Note: Aquiles-RAG does not run DB migrations automatically — if you use Postgres you must prepare the pgvector and pgcrypto extension, tables and indexes yourself.

📖 Usage

CLI

Interactive Setup Wizard (recommended):

aquiles-rag configs

Serve the API:

aquiles-rag serve --host "0.0.0.0" --port 5500

Deploy with bootstrap script (pattern: deploy_*.py with run() that calls gen_configs_file()):

# Redis example
aquiles-rag deploy --host "0.0.0.0" --port 5500 --workers 2 deploy_redis.py

# Qdrant example
aquiles-rag deploy --host "0.0.0.0" --port 5500 --workers 2 deploy_qdrant.py

# PostgreSQL example
aquiles-rag deploy --host "0.0.0.0" --port 5500 --workers 2 deploy_postgres.py

The deploy command imports the given Python file, executes its run() to generate the config (writes aquiles_config.json), then starts the FastAPI server.

REST API — common examples

Create Index

curl -X POST http://localhost:5500/create/index \
  -H "X-API-Key: YOUR_API_KEY" \
  -H 'Content-Type: application/json' \
  -d '{
    "indexname": "documents",
    "embeddings_dim": 768,
    "dtype": "FLOAT32",
    "delete_the_index_if_it_exists": false
  }'

Insert Chunk (ingest)

curl -X POST http://localhost:5500/rag/create \
  -H "X-API-Key: YOUR_API_KEY" \
  -H 'Content-Type: application/json' \
  -d '{
    "index": "documents",
    "name_chunk": "doc1_part1",
    "dtype": "FLOAT32",
    "chunk_size": 1024,
    "raw_text": "Text of the chunk...",
    "embeddings": [0.12, 0.34, 0.56, ...]
  }'

Query Top-K

curl -X POST http://localhost:5500/rag/query-rag \
  -H "X-API-Key: YOUR_API_KEY" \
  -H 'Content-Type: application/json' \
  -d '{
    "index": "documents",
    "embeddings": [0.78, 0.90, ...],
    "dtype": "FLOAT32",
    "top_k": 5,
    "cosine_distance_threshold": 0.6
  }'

The API supports an optional re-ranking stage (configurable in the server). When enabled, the typical flow is: vector search → candidate filtering/metadata match → optional re-ranker scores pairs to improve ordering. (See configuration wizard to enable/disable and set re-ranker options.)

Python Client

Sync client

from aquiles.client import AquilesRAG

client = AquilesRAG(host="http://127.0.0.1:5500", api_key="YOUR_API_KEY")

# Create an index (returns server text)
resp_text = client.create_index("documents", embeddings_dim=768, dtype="FLOAT32")

# Insert chunks using your embedding function
def get_embedding(text):
    return embedding_model.encode(text)

responses = client.send_rag(
    embedding_func=get_embedding,
    index="documents",
    name_chunk="doc1",
    raw_text=full_text,
    embedding_model="text-embedding-v1"  # optional metadata sent with each chunk
)

# Query the index (returns parsed JSON)
results = client.query("documents", query_embedding, top_k=5)
print(results)

Async client

import asyncio
from aquiles.client import AsyncAquilesRAG

client = AsyncAquilesRAG(host="http://127.0.0.1:5500", api_key="YOUR_API_KEY")

async def main():
    await client.create_index("documents_async")
    responses = await client.send_rag(
        embedding_func=async_embedding_func,   # supports sync or async callables
        index="documents_async",
        name_chunk="doc_async",
        raw_text=full_text
    )
    results = await client.query("documents_async", query_embedding)
    print(results)

asyncio.run(main())

Notes

Both clients accept an optional embedding_model parameter forwarded as metadata — helpful when storing/querying embeddings produced by different models.
send_rag chunks text using chunk_text_by_words() (default ≈600 words / ≈1024 tokens) and uploads each chunk (concurrently in the async client).
If the re-ranker is enabled on the server, the client can call the re-rank endpoint after receiving RAG results to re-score/re-order candidates.

UI Playground

Open the web UI (protected) at:

http://localhost:5500/ui

Use it to:

Run the Setup Wizard link (if available) or inspect live configs
Test /create/index, /rag/create, /rag/query-rag
Access protected Swagger UI & ReDoc after logging in

🏗 Architecture

Architecture

Clients (HTTP/HTTPS, Python SDK, or UI Playground) make asynchronous HTTP requests.
FastAPI Server — orchestration and business logic; validates requests and translates them to vector store operations.
Vector Store — Redis (HASH + HNSW/COSINE search), Qdrant (collections + vector search), or PostgreSQL with pgvector and pgcrypto (manual DB preparation required).
Optional Re-ranker — when enabled, a re-ranking component scores (query, doc) pairs to improve final ordering.

⚠️ Backend differences & notes

Metrics / /status/ram: Redis offers INFO memory and memory_stats() — for Qdrant the same Redis-specific metrics are not available (the endpoint will return a short message explaining this). For PostgreSQL, metrics exposed differ from Redis and Qdrant; check your Postgres monitoring tooling for memory and indexing statistics.
Dtype handling: Server validates dtype for Redis (converts embeddings to the requested NumPy dtype). Qdrant accepts float arrays directly — dtype is informational/compatibility metadata. For PostgreSQL+pgvector, ensure the stored vector dimension and any normalization required for cosine/inner product are handled by your ingestion pipeline.
gRPC: Qdrant can be used over HTTP or gRPC (prefer_grpc=true in the config). Ensure your environment allows gRPC outbound/inbound as needed.
PostgreSQL note: Aquiles-RAG does not run automatic migrations for Postgres — create the pgvector extension, tables and indexes manually (or via your own migration tool) before using Postgres as a vector store.

🔎 Test Suite

See the test/ directory for automated tests:

client tests for the Python SDK
API tests for endpoint behavior
test_deploy.py for deployment / bootstrap validation

If you add Postgres to CI, prepare the DB (create pgvector extension and required tables/indexes) in your test fixtures since there are no automatic migrations.

📄 License

Apache License

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.5.5

Jan 6, 2026

0.5.0

Nov 29, 2025

0.4.54

Nov 22, 2025

This version

0.4.53

Nov 21, 2025

0.4.52

Nov 21, 2025

0.4.51

Nov 21, 2025

0.4.5

Nov 20, 2025

0.4.2

Nov 15, 2025

0.4.0

Sep 7, 2025

0.3.75

Sep 1, 2025

0.3.72

Aug 29, 2025

0.3.7

Aug 29, 2025

0.3.6

Aug 29, 2025

0.3.4

Aug 28, 2025

0.3.3

Aug 21, 2025

0.3.2

Aug 20, 2025

0.3.1

Aug 18, 2025

0.3.0

Aug 18, 2025

0.2.9

Aug 14, 2025

0.2.8.5

Aug 11, 2025

0.2.8

Aug 9, 2025

0.2.7.1

Aug 6, 2025

0.2.7

Aug 6, 2025

0.2.6.1

Aug 3, 2025

0.2.6

Jul 31, 2025

0.2.5.4

Jul 30, 2025

0.2.5.3

Jul 30, 2025

0.2.5.2

Jul 29, 2025

0.2.5.1

Jul 29, 2025

0.2.5

Jul 27, 2025

0.2.2

Jul 25, 2025

0.2.1

Jul 25, 2025

0.2.0

Jul 24, 2025

0.1.9.1

Jul 24, 2025

0.1.9

Jul 23, 2025

0.1.8

Jul 23, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

aquiles_rag-0.4.53.tar.gz (1.3 MB view details)

Uploaded Nov 21, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

aquiles_rag-0.4.53-py3-none-any.whl (1.3 MB view details)

Uploaded Nov 21, 2025 Python 3

File details

Details for the file aquiles_rag-0.4.53.tar.gz.

File metadata

Download URL: aquiles_rag-0.4.53.tar.gz
Upload date: Nov 21, 2025
Size: 1.3 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.12.3

File hashes

Hashes for aquiles_rag-0.4.53.tar.gz
Algorithm	Hash digest
SHA256	`e851da1aed116325b5025170a69babec69e6b99acccf36d46bf03cfb79156067`
MD5	`7033566dcad1ab7a369466c92cb227d8`
BLAKE2b-256	`acbbf0141e638b419329dc0713374bb98bb7859446f07cf9b07cbd5ef15e5c04`

See more details on using hashes here.

File details

Details for the file aquiles_rag-0.4.53-py3-none-any.whl.

File metadata

Download URL: aquiles_rag-0.4.53-py3-none-any.whl
Upload date: Nov 21, 2025
Size: 1.3 MB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.12.3

File hashes

Hashes for aquiles_rag-0.4.53-py3-none-any.whl
Algorithm	Hash digest
SHA256	`c6e935f931c8ae126a29927ad45f8c95b673b82b32d07e614b526a537d7f82f6`
MD5	`5fdf4d9c87d47171a77e2d3874f09e8e`
BLAKE2b-256	`86fbc8a14972b7a2ccd09325e7af299703ba85462e87650a41eb2e0b312936ea`

See more details on using hashes here.

aquiles-rag 0.4.53

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Aquiles-RAG

📑 Table of Contents

⭐ Features

🛠 Tech Stack

⚙️ Requirements

🚀 Installation

Via PyPI (recommended)

From Source (optional)

🔧 Configuration & Connection Options

Setup Wizard (recommended)

Manual config (advanced / CI)

Redis connection modes (examples)

📖 Usage

CLI

REST API — common examples

Python Client

Sync client

Async client

UI Playground

🏗 Architecture

⚠️ Backend differences & notes

🔎 Test Suite

📄 License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes