A modular framework for evaluating and optimizing RAG pipelines.

These details have not been verified by PyPI

Project links

Project description

Ragmint

Python License Tests Optuna Status

Ragmint (Retrieval-Augmented Generation Model Inspection & Tuning) is a modular, developer-friendly Python library for evaluating, optimizing, and tuning RAG (Retrieval-Augmented Generation) pipelines.

It provides a complete toolkit for retriever selection, embedding model tuning, and automated RAG evaluation with support for Optuna-based Bayesian optimization.

✨ Features

✅ Automated hyperparameter optimization (Grid, Random, Bayesian via Optuna)
🔍 Built-in RAG evaluation metrics — faithfulness, recall, BLEU, ROUGE, latency
⚙️ Retrievers — FAISS, Chroma, ElasticSearch
🧩 Embeddings — OpenAI, HuggingFace
🧠 Rerankers — MMR, CrossEncoder (extensible via plugin interface)
💾 Caching, experiment tracking, and reproducibility out of the box
🧰 Clean modular structure for easy integration in research and production setups

🚀 Quick Start

1️⃣ Installation

git clone https://github.com/andyolivers/ragmint.git
cd ragmint
pip install -e .

The -e flag installs Ragmint in editable (development) mode.
Requires Python ≥ 3.9.

2️⃣ Run a RAG Optimization Experiment

python ragmint/main.py --config configs/default.yaml --search bayesian

Example configs/default.yaml:

retriever: faiss
embedding_model: text-embedding-3-small
reranker:
  mode: mmr
  lambda_param: 0.5
optimization:
  search_method: bayesian
  n_trials: 20

3️⃣ Manual Pipeline Usage

from ragmint.core.pipeline import RAGPipeline

pipeline = RAGPipeline({
    "embedding_model": "text-embedding-3-small",
    "retriever": "faiss",
})

result = pipeline.run("What is retrieval-augmented generation?")
print(result)

🧪 Dataset Options

Ragmint can automatically load evaluation datasets for your RAG pipeline:

Mode	Example	Description
🧱 Default	`validation_set=None`	Uses built-in `experiments/validation_qa.json`
📁 Custom File	`validation_set="data/my_eval.json"`	Load your own QA dataset (JSON or CSV)
🌐 Hugging Face Dataset	`validation_set="squad"`	Automatically downloads benchmark datasets (requires `pip install datasets`)

Example

from ragmint.tuner import RAGMint

ragmint = RAGMint(
    docs_path="data/docs/",
    retrievers=["faiss", "chroma"],
    embeddings=["text-embedding-3-small"],
    rerankers=["mmr"],
)

# Use built-in default
ragmint.optimize(validation_set=None)

# Use Hugging Face benchmark
ragmint.optimize(validation_set="squad")

# Use your own dataset
ragmint.optimize(validation_set="data/custom_qa.json")

🧩 Folder Structure

ragmint/
├── core/
│   ├── pipeline.py         # RAGPipeline implementation
│   ├── retriever.py        # Retriever logic (FAISS, Chroma)
│   ├── reranker.py         # MMR + CrossEncoder rerankers
│   └── embedding.py        # Embedding backends
├── tuner.py                # Grid, Random, Bayesian optimization (Optuna)
├── utils/                  # Metrics, logging, caching helpers
├── configs/                # Default experiment configs
├── experiments/            # Saved experiment results
├── tests/                  # Unit tests for all components
├── main.py                 # CLI entrypoint for tuning
└── pyproject.toml          # Project dependencies & build metadata

🧪 Running Tests

To verify your setup:

pytest -v

Or to test a specific component (e.g., reranker):

pytest tests/test_reranker.py -v

All tests are designed for Pytest and run with lightweight mock data.

⚙️ Configuration via `pyproject.toml`

Your pyproject.toml automatically includes:

[project]
name = "ragmint"
version = "0.1.0"
dependencies = [
    "numpy",
    "optuna",
    "scikit-learn",
    "faiss-cpu",
    "chromadb",
    "pytest",
    "openai",
    "tqdm",
]

📊 Example Experiment Workflow

Define your retriever and reranker configuration in YAML
Launch an optimization search (Grid, Random, or Bayesian)
Ragmint evaluates combinations automatically and reports top results
Export best parameters for production pipelines

🧬 Architecture Overview

flowchart TD
    A[Query] --> B[Embedder]
    B --> C[Retriever]
    C --> D[Reranker]
    D --> E[Generator]
    E --> F[Evaluation]
    F --> G[Optuna Tuner]
    G -->|Best Params| B

📘 Example Output

[INFO] Starting Bayesian optimization with Optuna
[INFO] Trial 7 finished: recall=0.83, latency=0.42s
[INFO] Best parameters: {'lambda_param': 0.6, 'retriever': 'faiss'}

🧠 Why Ragmint?

Built for RAG researchers, AI engineers, and LLM ops
Works with LangChain, LlamaIndex, or standalone RAG setups
Designed for extensibility — plug in your own models, retrievers, or metrics

⚖️ License

Licensed under the Apache License 2.0 — free for personal, research, and commercial use.

👤 Author

André Oliveira
andyolivers.com
Data Scientist | AI Engineer

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.4.12

Nov 21, 2025

0.4.11

Nov 21, 2025

0.4.10

Nov 20, 2025

0.4.9

Nov 15, 2025

0.4.8

Nov 15, 2025

0.4.7

Nov 10, 2025

0.4.6

Nov 10, 2025

0.4.5

Nov 10, 2025

0.4.4

Nov 9, 2025

0.4.3

Nov 7, 2025

0.4.2

Nov 7, 2025

0.4.1

Nov 7, 2025

0.4.0

Nov 7, 2025

0.3.1

Nov 7, 2025

0.3.0

Nov 6, 2025

0.2.3

Nov 5, 2025

0.2.2

Nov 5, 2025

0.2.1

Nov 5, 2025

0.2.0

Nov 5, 2025

This version

0.1.1

Nov 4, 2025

0.1.0

Nov 3, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ragmint-0.1.1.tar.gz (15.1 kB view details)

Uploaded Nov 4, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

ragmint-0.1.1-py3-none-any.whl (16.9 kB view details)

Uploaded Nov 4, 2025 Python 3

File details

Details for the file ragmint-0.1.1.tar.gz.

File metadata

Download URL: ragmint-0.1.1.tar.gz
Upload date: Nov 4, 2025
Size: 15.1 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.10.18

File hashes

Hashes for ragmint-0.1.1.tar.gz
Algorithm	Hash digest
SHA256	`b22154a74f0c4e83217f8fc80c46d6ad8baea55c090f0655d96e3c984a07929c`
MD5	`5b77d47714e7e120fd40fb3e8187130e`
BLAKE2b-256	`12e697fee6f0606887c0e03811fec169114e85d6dc54a0101d76424f82ac78d7`

See more details on using hashes here.

File details

Details for the file ragmint-0.1.1-py3-none-any.whl.

File metadata

Download URL: ragmint-0.1.1-py3-none-any.whl
Upload date: Nov 4, 2025
Size: 16.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.10.18

File hashes

Hashes for ragmint-0.1.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`69381dcf527cf6d10c84fa8681452c54c59f173e47e6dc244b834aced2540f0b`
MD5	`6ac7cecfd871ba93ecc00fe8998c687b`
BLAKE2b-256	`b8a00b3a7a44f3c184ac235d1ff817b41b4bf000cc856caddafd6d63b4ec1f59`

See more details on using hashes here.

ragmint 0.1.1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Project description

Ragmint

✨ Features

🚀 Quick Start

1️⃣ Installation

2️⃣ Run a RAG Optimization Experiment

3️⃣ Manual Pipeline Usage

🧪 Dataset Options

Example

🧩 Folder Structure

🧪 Running Tests

⚙️ Configuration via pyproject.toml

📊 Example Experiment Workflow

🧬 Architecture Overview

📘 Example Output

🧠 Why Ragmint?

⚖️ License

👤 Author

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

⚙️ Configuration via `pyproject.toml`