A customizable RAG pipeline supporting multiple LLM providers and embedding backends.

These details have not been verified by PyPI

Project description

🧠 customrag

customrag is a customizable Retrieval-Augmented Generation (RAG) pipeline that supports multiple LLMs and embedding models via a simple YAML config. It’s built for developers who want a plug-and-play RAG setup that works across:

✅ OpenAI (ChatGPT, Embeddings)
✅ Gemini (Cloud Console SDK or Gemini Studio via LangChain)
✅ HuggingFace Hub
✅ xAI
✅ Local models via Sentence Transformers

🚀 Features

📆 Easy pip install (pip install customrag)
⚙️ YAML-based config — switch providers anytime
🔗 Supports multiple file formats (.txt, .pdf, .csv, .json, .docx, .md)
📁 Saves vectorstore using FAISS
🧐 Built-in SDK support for Gemini Cloud Console (not supported by LangChain)
🛠️ LangChain-native support for OpenAI, xAI, Gemini Studio, and HuggingFace

👅 Installation

pip install customrag

🛠️ One-Time Setup

Create a default config file in your project directory:

customrag-setup

This generates a config.yaml file with placeholders for your API keys and model settings.

📁 Example `config.yaml`

embedding:
  provider: gemini            # Options: gemini, openai, huggingface, sentence-transformers, xai, gemini_studio
  model: models/embedding-001 # Model for embeddings

llm:
  provider: gemini            # Options: gemini, gemini_studio, openai, huggingface, xai
  model: gemini-1.5-pro       # Chat model

api_keys:
  gemini: your_gemini_api_key_here
  gemini_studio: your_gemini_studio_api_key_here
  openai: your_openai_api_key_here
  huggingface: your_huggingface_token_here
  xai: your_xai_api_key_here

🔧 Usage

1⃣ Initialize the Pipeline

from customrag import RAGPipeline

pipeline = RAGPipeline(config_path="config.yaml")

2⃣ Build a Vectorstore from Documents

pipeline.build_vectorstore("resume.pdf")  # Accepts .pdf, .txt, .docx, .md, .json, .csv

This will:

Load and chunk your document
Embed it using the configured embedding model
Save the FAISS vectorstore locally

3⃣ Ask a Question

answer = pipeline.query("What are my key skills?")

Depending on your config, it will:

Retrieve top matching chunks using FAISS
Generate an answer using either LangChain or Gemini SDK

🤖 Supported Providers

Provider	Embeddings ✅	Chat (LLM) ✅	Chat SDK Support
OpenAI	✅ `text-embedding-ada-002`	✅ `gpt-3.5 / gpt-4`	❌
Gemini	✅ `models/embedding-001`	❌ (SDK only)	✅
Gemini Studio	✅	✅ `gemini-pro`	❌
HuggingFace	✅	✅ via `HuggingFaceHub`	❌
xAI	✅	✅ `Grok (xAI)`	❌
Local (sentence-transformers)	✅	❌	❌

📆 Example Project Structure

your-project/
├── config.yaml
├── resume.pdf
├── script.py
└── faiss_index/

👨‍💼 CLI Tool

Run this once in your project root:

customrag-setup

It will create a config.yaml you can edit with your API keys and model names.

💃 Supported Document Formats

You can ingest files of type:

📄 .txt, .pdf, .docx, .md
📈 .csv
🧾 .json (array of objects)

🧠 How It Works

graph TD
    A[User Input] -->|Query| B[RAGPipeline]
    B --> C[FAISS Vectorstore]
    C --> D[Top-K Context]
    D --> E[LLM or Gemini SDK]
    E --> F[Answer Returned]

👨‍💻 Author

Made by Anuj Goel

📬 Contribute

Issues and PRs are welcome. Add support for more LLMs or improve CLI! 🚀

📄 License

MIT License – free for personal and commercial use.

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.1.2

Jun 29, 2025

This version

0.1.1

Jun 29, 2025

0.1.0

Jun 29, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

customrag-0.1.1.tar.gz (5.4 kB view details)

Uploaded Jun 29, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

customrag-0.1.1-py3-none-any.whl (5.9 kB view details)

Uploaded Jun 29, 2025 Python 3

File details

Details for the file customrag-0.1.1.tar.gz.

File metadata

Download URL: customrag-0.1.1.tar.gz
Upload date: Jun 29, 2025
Size: 5.4 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.12.6

File hashes

Hashes for customrag-0.1.1.tar.gz
Algorithm	Hash digest
SHA256	`36ad462880de3df6854be02ed6dd280a52a4aa0b8022db4c0525208b2d85090d`
MD5	`384c55013ed7a2cd3842585734a3e063`
BLAKE2b-256	`d6b0b5317a863128c1d4b85454d52800c561ef80fd5160fe83adf0e1e7e4ca63`

See more details on using hashes here.

File details

Details for the file customrag-0.1.1-py3-none-any.whl.

File metadata

Download URL: customrag-0.1.1-py3-none-any.whl
Upload date: Jun 29, 2025
Size: 5.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.12.6

File hashes

Hashes for customrag-0.1.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`d4c4c9bc00c223ff44c91e6eb7fe4f72d7b6cfff6ee9afde5cbc5585c056a187`
MD5	`885464c0dc619f0c73f83b4372182edb`
BLAKE2b-256	`f7144512682e887bf3a3ed3e4f02f6e8e1edb716a30310afb0d488f08b6176e3`

See more details on using hashes here.

customrag 0.1.1

Navigation

Verified details

Maintainers

Unverified details

Meta

Classifiers

Project description

🧠 customrag

🚀 Features

👅 Installation

🛠️ One-Time Setup

📁 Example config.yaml

🔧 Usage

1⃣ Initialize the Pipeline

2⃣ Build a Vectorstore from Documents

3⃣ Ask a Question

🤖 Supported Providers

📆 Example Project Structure

👨‍💼 CLI Tool

💃 Supported Document Formats

🧠 How It Works

👨‍💻 Author

📬 Contribute

📄 License

Project details

Verified details

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

📁 Example `config.yaml`