A minimal Python library for Retrieval-Augmented Generation with multiple vector store backends

These details have not been verified by PyPI

Project links

Project description

TinyRag 🚀

A minimal, powerful Python library for Retrieval-Augmented Generation (RAG) with support for multiple document formats and vector storage backends.

🌟 Features

🔌 Multiple Vector Stores: Faiss, ChromaDB, In-Memory, Pickle-based
📄 Document Support: PDF, DOCX, TXT, and raw text
🧠 Default Embeddings: Uses all-MiniLM-L6-v2 by default (no API key needed)
🚀 Multithreading Support: Parallel document processing for faster indexing
🔍 Query Without LLM: Direct similarity search functionality
💬 Optional LLM Integration: Chat completion with retrieved context
⚡ Minimal Setup: Works out of the box without configuration
🎯 Easy to Use: Simple API with powerful features

🚀 Quick Start

Installation

# Basic installation
pip install tinyrag

# With all optional dependencies
pip install tinyrag[all]

# Specific vector stores
pip install tinyrag[faiss]    # High performance
pip install tinyrag[chroma]   # Persistent storage
pip install tinyrag[docs]     # Document processing

Usage Examples

Basic Usage (No API Key Required)

from tinyrag import TinyRag

# Initialize with default all-MiniLM-L6-v2 embeddings
rag = TinyRag()

# Add multiple documents with multithreading
rag.add_documents([
    "path/to/doc1.pdf",
    "path/to/doc2.txt", 
    "Raw text content here"
])

# Query without LLM
results = rag.query("What is this about?")
print("Similar chunks:", results)

With LLM for Chat

from tinyrag import Provider, TinyRag

provider = Provider(
    api_key="sk-xxxxxx",
    model="gpt-4",
    embedding_model="default",
    base_url="https://api.openai.com/v1"
)

rag = TinyRag(provider=provider, vector_store="faiss", max_workers=4)

rag.add_documents([
    "path/to/docs_or_raw_text",
    "Another document", 
    "More content"
])

response = rag.chat("Summarize the documents.")
print(response)

📖 Documentation

Core Components

Provider Class

Handles API interactions and embeddings:

from tinyrag import Provider

# Local embeddings only (no API key needed)
provider = Provider(embedding_model="default")

# With OpenAI API
provider = Provider(
    api_key="sk-your-key",
    model="gpt-4",
    embedding_model="text-embedding-ada-002",
    base_url="https://api.openai.com/v1"
)

TinyRag Class

Main interface for RAG operations:

from tinyrag import TinyRag

# Initialize with different vector stores
rag = TinyRag(provider, vector_store="memory")     # No dependencies
rag = TinyRag(provider, vector_store="faiss")      # High performance  
rag = TinyRag(provider, vector_store="chroma")     # Persistent
rag = TinyRag(provider, vector_store="pickle")     # Simple file-based

Vector Store Comparison

Store	Performance	Persistence	Memory	Dependencies	Best For
Memory	Good	Manual	High	None	Development, small datasets
Faiss	Excellent	Manual	Low	faiss-cpu	Large-scale, performance-critical
ChromaDB	Good	Automatic	Medium	chromadb	Production, automatic persistence
Pickle	Fair	Manual	Medium	scikit-learn	Simple file-based storage

API Reference

Core Methods

# Document Management
rag.add_documents(data)                    # Add documents/text
rag.get_chunk_count()                      # Get number of chunks
rag.get_all_chunks()                       # Get all text chunks
rag.clear_documents()                      # Clear all data

# Querying (No LLM)
rag.query(query, k=5, return_scores=True) # Basic similarity search
rag.search_documents(query, k=5, min_score=0.0) # With score filtering
rag.get_similar_chunks(text, k=5)         # Find similar to given text

# LLM Integration
rag.chat(query, k=3)                      # Generate response with context

# Persistence
rag.save_vector_store(filepath)           # Save to disk
rag.load_vector_store(filepath)           # Load from disk

🔧 Configuration Options

Vector Store Configuration

# Faiss with custom settings
rag = TinyRag(
    provider=provider,
    vector_store="faiss",
    chunk_size=1000,  # Larger chunks
    vector_store_config={}
)

# ChromaDB with persistence
rag = TinyRag(
    provider=provider,
    vector_store="chroma", 
    vector_store_config={
        "collection_name": "my_collection",
        "persist_directory": "./chroma_db"
    }
)

# Memory store (no config needed)
rag = TinyRag(provider=provider, vector_store="memory")

# Pickle store with scikit-learn
rag = TinyRag(provider=provider, vector_store="pickle")

Provider Configuration

# Local embeddings only
provider = Provider(embedding_model="default")

# OpenAI with custom settings
provider = Provider(
    api_key="sk-your-key",
    model="gpt-3.5-turbo",
    embedding_model="text-embedding-ada-002",
    base_url="https://api.openai.com/v1"
)

# Custom API endpoint
provider = Provider(
    api_key="your-key",
    model="custom-model",
    base_url="https://your-custom-api.com/v1"
)

📦 Installation Options

# Minimal installation
pip install tinyrag

# With specific vector stores
pip install tinyrag[faiss]      # For high-performance similarity search
pip install tinyrag[chroma]     # For persistent vector database
pip install tinyrag[pickle]     # For simple file-based storage

# With document processing
pip install tinyrag[docs]       # PDF and DOCX support

# Everything included
pip install tinyrag[all]        # All optional dependencies

🛠️ Development

Requirements

Python 3.7+
sentence-transformers (core)
requests (core)
numpy (core)

Optional Dependencies

faiss-cpu: High-performance vector search
chromadb: Persistent vector database
scikit-learn: Pickle vector store similarity
PyPDF2: PDF document processing
python-docx: Word document processing

Contributing

Fork the repository: https://github.com/Kenosis01/TinyRag.git
Create a feature branch: git checkout -b feature-name
Make your changes and add tests
Submit a pull request

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

🤝 Support

GitHub Issues: Report bugs or request features
Documentation: Full documentation
Examples: Check the examples/ directory in the repository

🎯 Use Cases

Document Q&A: Query your documents without LLM costs
Knowledge Base: Build searchable knowledge repositories
Content Discovery: Find similar content in large document collections
RAG Applications: Full retrieval-augmented generation workflows
Research Tools: Semantic search through research papers
Customer Support: Query company documentation and policies

TinyRag - Making RAG simple, powerful, and accessible! 🚀

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.3.5

Aug 23, 2025

0.3.4

Aug 22, 2025

0.3.3

Aug 22, 2025

0.3.2

Aug 22, 2025

0.3.1

Aug 22, 2025

0.3.0

Aug 22, 2025

This version

0.2.0

Aug 22, 2025

0.1.0

Aug 21, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tinyrag-0.2.0.tar.gz (19.7 kB view details)

Uploaded Aug 22, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

tinyrag-0.2.0-py3-none-any.whl (16.7 kB view details)

Uploaded Aug 22, 2025 Python 3

File details

Details for the file tinyrag-0.2.0.tar.gz.

File metadata

Download URL: tinyrag-0.2.0.tar.gz
Upload date: Aug 22, 2025
Size: 19.7 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.10.10

File hashes

Hashes for tinyrag-0.2.0.tar.gz
Algorithm	Hash digest
SHA256	`25b58e8e9a9cec97ee740d38b4a31f4ab4f7758f08f645d91e7d0eaec0b28daf`
MD5	`bd19f6a62553b6d9331742d00c136815`
BLAKE2b-256	`3bad5c4de2be6ab70d6a3a1f8c87b869e57739b14c135681722aa275246ec749`

See more details on using hashes here.

File details

Details for the file tinyrag-0.2.0-py3-none-any.whl.

File metadata

Download URL: tinyrag-0.2.0-py3-none-any.whl
Upload date: Aug 22, 2025
Size: 16.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.10.10

File hashes

Hashes for tinyrag-0.2.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`665fc425d1a1617259bc0753dc8ff9e65fd8dd0ba4c69fd6e36534a9fbc5e0ef`
MD5	`90b9d3f02fd0b9290b854645ab973563`
BLAKE2b-256	`ebe85bcf0ab3b37849bb27bbec7fcafbb2dbe42e2cbab91d62c62de99ecf7488`

See more details on using hashes here.

tinyrag 0.2.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

TinyRag 🚀

🌟 Features

🚀 Quick Start

Installation

Usage Examples

Basic Usage (No API Key Required)

With LLM for Chat

📖 Documentation

Core Components

Provider Class

TinyRag Class

Vector Store Comparison

API Reference

Core Methods

🔧 Configuration Options

Vector Store Configuration

Provider Configuration

📦 Installation Options

🛠️ Development

Requirements

Optional Dependencies

Contributing

📄 License

🤝 Support

🎯 Use Cases

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes