Modular RAG framework with support for FAISS, Weaviate, and Hybrid Search.

These details have not been verified by PyPI

Project links

Homepage

Project description

Hayagriva – Modular Retrieval-Augmented Generation Framework

Hayagriva

सर्वविद्याप्रसूत्यर्थं हयग्रीवोऽवतारतः वेदान् संरक्ष्य जगतां हितकारी सदा भवेत्

"To restore and protect all knowledge, Hayagriva has manifested. He safeguards the Vedas for the good of the world."

Hayagriva is a lightweight, modular Retrieval-Augmented Generation (RAG) framework that connects LLMs with efficient document retrieval. It focuses on grounded answers, fast iteration, and easy integration for developers and research teams.

Key Features

Retrieval-Augmented Generation

Combine contextual retrieval with LLMs to produce grounded, source-aware answers.

Supported LLM backend today: Groq.

Retrieval Strategies

Hayagriva supports multiple retrieval strategies (depending on vector store):

Vector Search: Dense semantic retrieval.
BM25: Sparse keyword retrieval (Weaviate).
Hybrid Search: Vector + keyword with configurable weighting (Weaviate).

Chunking Strategies

Word (Default): Sliding window on word count.
Recursive: Split by separators to preserve structure.
Semantic: Embedding-aware topic shifts.
Hierarchical: Parent/child chunking for broad + precise context.

Modular Vector Stores

FAISS: Lightweight, in-memory vector store.
Weaviate: Production-grade vector DB with hybrid/BM25.

Flexible Document Handling

Programmatic document ingestion.
File and directory ingestion helpers.
- Directory ingestion reads .txt files by default.

Automatic chunking and metadata assignment are built in.

Installation

Default (Lightweight)

Core libraries only:

pip install hayagriva

CPU Support (Recommended for Local Testing)

Installs sentence-transformers and faiss-cpu:

pip install "hayagriva[cpu]"

GPU Support

Installs sentence-transformers and faiss-gpu:

pip install "hayagriva[cuda]"

Python Usage

Example setup scripts are available in setup/, and capability-focused examples are grouped under examples/.

Example Categories

The examples are split by what part of the RAG pipeline you want to try:

examples/ingestion/: load text or .txt files without embeddings or LLM calls.
examples/chunking/: run word, recursive, semantic, or hierarchical chunking.
examples/indexing/: embed chunks and write them to FAISS or Weaviate.
examples/retrieval/: retrieve chunks without calling Groq.
examples/end_to_end/: complete Groq-powered RAG flows.
examples/vectorstores/weaviate/: Weaviate vector, BM25, and hybrid examples.

For local FAISS and embedding examples:

bash setup/install_local_deps.sh
export GROQ_API_KEY="YOUR_GROQ_KEY"

For Weaviate examples:

bash setup/install_local_deps.sh
bash setup/start_weaviate.sh
export GROQ_API_KEY="YOUR_GROQ_KEY"

Basic Example (FAISS + Vector Search)

from hayagriva import Hayagriva, HayagrivaConfig

config = HayagrivaConfig(
    backend="groq",
    api_key="YOUR_GROQ_KEY",
    model="llama-3.1-8b-instant",
)

rag = Hayagriva(config)
rag.add_documents(["Hayagriva restores forgotten knowledge."])

response = "".join(rag.ask("Who retrieved the lost Vedas?"))
print(response)

Building Block Example (Chunking Only)

from hayagriva.config import ChunkingConfig
from hayagriva.core.chunker import WordChunker

chunker = WordChunker(ChunkingConfig(chunk_size=8, overlap=2))
chunks, metadata = chunker.chunk([
    "Hayagriva restores knowledge and RAG retrieves context before generation."
])

print(chunks)
print(metadata)

Structured Response (Answer + Metadata)

resp = rag.ask("Who retrieved the lost Vedas?", return_metadata=True)
print(resp["answer"])
print(resp["chunks"][0])
print(resp["retrieval"]["strategy"])

Returned metadata includes: retrieved chunk ranks/scores, chunking strategy, retrieval strategy, model backend, and vector store.

Customizing Embeddings

config = HayagrivaConfig(
    backend="groq",
    api_key="YOUR_KEY",
    embedding_model="intfloat/multilingual-e5-large",
)

Advanced Example (Weaviate + Hybrid Search)

from hayagriva import Hayagriva, HayagrivaConfig
from hayagriva.config import WeaviateConfig, RetrievalConfig, ChunkingConfig

weaviate_config = WeaviateConfig(
    url="http://localhost:8080",
    index_name="HayagrivaDocs",
)

config = HayagrivaConfig(
    backend="groq",
    api_key="YOUR_GROQ_KEY",
    vector_store="weaviate",
    weaviate=weaviate_config,
    retrieval=RetrievalConfig(strategy="hybrid", alpha=0.5, top_k=4),
    chunking=ChunkingConfig(strategy="recursive", chunk_size=500, overlap=50),
)

rag = Hayagriva(config)
rag.add_documents(["Hayagriva is an avatar of Vishnu."])

for token in rag.ask("Who is Hayagriva?"):
    print(token, end="", flush=True)

Requirements

Python 3.10+
API key for Groq

Optional (only if you use local embeddings or FAISS):

sentence-transformers
faiss-cpu or faiss-gpu

If using external vector databases:

weaviate-client

Roadmap

Expanded LLM Provider Support

Anthropic Claude
Google Gemini
DeepSeek
Local GGUF models and llama.cpp

Additional Vector Database Integration

ChromaDB
Additional pluggable backends

Memory-Augmented Chat

Multi-turn memory
Embedding-based long-term memory
Summarization-based memory compression

Use Cases

Retrieval-augmented assistants
Knowledge-base and enterprise search
Research and benchmarking of RAG pipelines
Lightweight production deployments
Internal document Q&A systems

Contributing

Contributions to model integrations, retrieval modules, and documentation are welcome. Submit issues or pull requests.

Support

For questions, bugs, or feature requests, open an issue on the project repository.

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

1.0.0

Apr 25, 2026

0.2.5.1

Apr 25, 2026

This version

0.2.5

Apr 25, 2026

0.2.4

Jan 26, 2026

0.2.2

Dec 17, 2025

0.2.0

Dec 17, 2025

0.1.0

Dec 11, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

hayagriva-0.2.5.tar.gz (18.0 kB view details)

Uploaded Apr 25, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

hayagriva-0.2.5-py3-none-any.whl (22.6 kB view details)

Uploaded Apr 25, 2026 Python 3

File details

Details for the file hayagriva-0.2.5.tar.gz.

File metadata

Download URL: hayagriva-0.2.5.tar.gz
Upload date: Apr 25, 2026
Size: 18.0 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.10.12

File hashes

Hashes for hayagriva-0.2.5.tar.gz
Algorithm	Hash digest
SHA256	`ce466ed5ba94693831ffa431400ae2c95bb9222ecbbd1b1c745ec4cd590cd226`
MD5	`3dc66c6bcaf0b7581eed34d02d61ce9c`
BLAKE2b-256	`b1398c0284803c2496c362be3de6738cc37984bbf28a4f14a45ee1a59ef0b9c2`

See more details on using hashes here.

File details

Details for the file hayagriva-0.2.5-py3-none-any.whl.

File metadata

Download URL: hayagriva-0.2.5-py3-none-any.whl
Upload date: Apr 25, 2026
Size: 22.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.10.12

File hashes

Hashes for hayagriva-0.2.5-py3-none-any.whl
Algorithm	Hash digest
SHA256	`97bb35578c20592c3687e5944f806d154416cab2b13e798e1166d91c40a3b416`
MD5	`5ca94d9656ca07eeaa4506df530ce179`
BLAKE2b-256	`481edb527c7bce4ab69f0aafffc137495bf976a68839e5d6c7e18f0ee026b783`

See more details on using hashes here.

hayagriva 0.2.5

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Project description

Hayagriva – Modular Retrieval-Augmented Generation Framework

Key Features

Retrieval-Augmented Generation

Retrieval Strategies

Chunking Strategies

Modular Vector Stores

Flexible Document Handling

Installation

Default (Lightweight)

CPU Support (Recommended for Local Testing)

GPU Support

Python Usage

Example Categories

Basic Example (FAISS + Vector Search)

Building Block Example (Chunking Only)

Structured Response (Answer + Metadata)

Customizing Embeddings

Advanced Example (Weaviate + Hybrid Search)

Requirements

Roadmap

Expanded LLM Provider Support

Additional Vector Database Integration

Memory-Augmented Chat

Use Cases

Contributing

Support

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes