Generate QA pairs from JSON documents to evaluate RAG pipelines

Project description

QA Pairs Generator for RAG Evaluation

Automatically generate question-answer pairs from a corpus of JSON documents to evaluate Retrieval-Augmented Generation (RAG) pipelines. The tool extracts named entities from your documents, matches each entity to its most relevant documents via hybrid search (keyword + embeddings), and then prompts an LLM to produce one QA pair per (entity, document) combination.
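The hybrid matching step described above can be pictured with a small sketch. This is not the package's actual implementation: the function names, the token-overlap keyword score, and the `alpha` weighting are all assumptions made for illustration, and the toy vectors stand in for real embeddings.

```python
import math


def keyword_score(entity: str, text: str) -> float:
    """Fraction of the entity's tokens that appear in the text (case-insensitive)."""
    entity_tokens = entity.lower().split()
    if not entity_tokens:
        return 0.0
    text_tokens = set(text.lower().split())
    hits = sum(1 for tok in entity_tokens if tok in text_tokens)
    return hits / len(entity_tokens)


def cosine(a: list[float], b: list[float]) -> float:
    """Cosine similarity between two vectors; 0.0 if either has zero norm."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def hybrid_rank(entity, entity_vec, docs, doc_vecs, top_n=3, alpha=0.5):
    """Return indices of the top_n documents for an entity.

    alpha is a hypothetical weight: 1.0 means keyword only, 0.0 means embeddings only.
    """
    scored = []
    for i, (text, vec) in enumerate(zip(docs, doc_vecs)):
        score = alpha * keyword_score(entity, text) + (1 - alpha) * cosine(entity_vec, vec)
        scored.append((score, i))
    # Highest score first; ties broken by original document order.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [i for _, i in scored[:top_n]]
```

Each (entity, document index) pair returned this way would then correspond to one LLM call that produces a single QA pair.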

Installation
Input document format
Environment variables
Usage
Output format

Installation

Requires Python ≥ 3.10.

With pip

# runtime only
pip install -e .

# runtime + dev dependencies (pytest)
pip install -e ".[dev]"

With uv

uv sync          # runtime only
uv sync --extra dev   # include dev dependencies

Input document format

Each document must be a single .json file inside the input directory. A document can have any top-level fields; you tell the pipeline which ones to use via --search-fields.

Minimal example (docs/doc_001.json):

{
  "title": "Introduction to Transformers",
  "description": "Transformers are a type of neural network architecture introduced in the paper Attention Is All You Need.",
  "author": "Vaswani et al.",
  "year": 2017
}

If you run the pipeline with --search-fields title description, the tool will concatenate the title and description fields to build the corpus text used for entity extraction and search. Fields listed in --search-fields that are absent from a document are silently skipped.
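As a rough illustration of the behaviour described above, the sketch below loads every `.json` file from a directory and joins the selected fields. The function names are hypothetical, and the newline separator is an assumption inferred from the `source_document` sample in the output format; the package may build the text differently.

```python
import json
from pathlib import Path


def load_documents(input_dir: str) -> list[dict]:
    """Read every .json file in input_dir (non-recursive), in sorted filename order."""
    docs = []
    for path in sorted(Path(input_dir).glob("*.json")):
        with path.open(encoding="utf-8") as fh:
            docs.append(json.load(fh))
    return docs


def build_corpus_text(doc: dict, search_fields: list[str]) -> str:
    """Concatenate the requested fields; fields missing from the document are skipped."""
    parts = [str(doc[field]) for field in search_fields if field in doc]
    return "\n".join(parts)  # separator is an assumption
```

With the minimal example document, `build_corpus_text(doc, ["title", "description"])` would yield the title on the first line followed by the description, and a field such as `abstract` that the document lacks would simply be left out.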

Environment variables

OpenAI (default)

| Variable | Required | Description |
| --- | --- | --- |
| `OPENAI_API_KEY` | Yes | Your OpenAI secret key |

Azure OpenAI (`--client azure`)

| Variable | Required | Description |
| --- | --- | --- |
| `AZURE_OPENAI_API_KEY` | Yes | Your Azure OpenAI key |
| `AZURE_OPENAI_ENDPOINT` | Yes | Your Azure endpoint URL (e.g. `https://<resource>.openai.azure.com/`) |
| `OPENAI_API_VERSION` | Yes | API version (e.g. `2024-02-01`) |

You can export the variables in your shell or store them in a .env file and load it before running the pipeline.

# Linux / macOS
export OPENAI_API_KEY="sk-..."

# Windows PowerShell
$env:OPENAI_API_KEY = "sk-..."
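Before starting a long run, it can help to confirm that the variables for the chosen client are set. The following is a minimal sketch using only the variable names from the tables above; the helper `missing_env_vars` is hypothetical and not part of the package.

```python
import os

# Required variables per client, taken from the tables above.
REQUIRED_ENV = {
    "openai": ["OPENAI_API_KEY"],
    "azure": ["AZURE_OPENAI_API_KEY", "AZURE_OPENAI_ENDPOINT", "OPENAI_API_VERSION"],
}


def missing_env_vars(client: str, env=None) -> list[str]:
    """Return the required variables that are unset or empty for the given client."""
    env = os.environ if env is None else env
    return [name for name in REQUIRED_ENV[client] if not env.get(name)]


if __name__ == "__main__":
    missing = missing_env_vars("openai")
    if missing:
        print("Missing environment variables:", ", ".join(missing))
```

Passing a plain dictionary as `env` makes the check easy to exercise without touching the real environment.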

Usage

python run_pipeline.py \
    --input-dir   <DIR>          \
    --search-fields <FIELD ...>  \
    --output      <FILE.json>    \
    [--client     openai|azure]  \
    [--model      <MODEL>]       \
    [--embedding-model <MODEL>]  \
    [--top-n      <N>]

| Argument | Required | Default | Description |
| --- | --- | --- | --- |
| `--input-dir` | Yes | (none) | Directory containing `.json` input files |
| `--search-fields` | Yes | (none) | One or more document fields to use for entity extraction and corpus building |
| `--output` | Yes | (none) | Path to the output JSON file |
| `--client` | No | `openai` | LLM provider: `openai` or `azure` |
| `--model` | No | `gpt-4o-mini` | Chat model used for entity extraction and QA generation |
| `--embedding-model` | No | `text-embedding-3-small` | Embedding model used for semantic search |
| `--top-n` | No | `3` | Number of documents retrieved per entity via embedding search |
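For readers who want to see how these arguments fit together, here is an `argparse` sketch that mirrors the table. It is an assumption about how such a command line could be declared, not a copy of `run_pipeline.py`; the function name `build_parser` is hypothetical.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Declare the arguments listed in the table, with the same defaults."""
    parser = argparse.ArgumentParser(prog="run_pipeline.py")
    parser.add_argument("--input-dir", required=True,
                        help="Directory containing .json input files")
    # nargs="+" accepts one or more field names after the flag.
    parser.add_argument("--search-fields", required=True, nargs="+",
                        help="Document fields used for entity extraction and search")
    parser.add_argument("--output", required=True,
                        help="Path to the output JSON file")
    parser.add_argument("--client", choices=["openai", "azure"], default="openai")
    parser.add_argument("--model", default="gpt-4o-mini")
    parser.add_argument("--embedding-model", default="text-embedding-3-small")
    parser.add_argument("--top-n", type=int, default=3)
    return parser


if __name__ == "__main__":
    args = build_parser().parse_args()
    print(args)
```

Note that `argparse` exposes dashed flags with underscores, so `--search-fields` is read as `args.search_fields`.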

Complete example

python run_pipeline.py \
    --input-dir   ./docs \
    --search-fields title description \
    --output      qa_output.json \
    --client      openai \
    --model       gpt-4o-mini \
    --embedding-model text-embedding-3-small \
    --top-n       3

Azure OpenAI example

python run_pipeline.py \
    --input-dir   ./docs \
    --search-fields title description \
    --output      qa_output.json \
    --client      azure \
    --model       my-gpt4o-deployment \
    --embedding-model my-embedding-deployment

Output format

The output is a JSON array. Each element is a QA pair with the following fields:

[
  {
    "entity": "Transformers",
    "question": "What problem do Transformers solve compared to RNNs?",
    "answer": "Transformers solve the sequential computation bottleneck of RNNs by relying entirely on self-attention mechanisms, enabling parallelisation during training.",
    "source_document": "Introduction to Transformers\nTransformers are a type of neural network architecture..."
  }
]

| Field | Type | Description |
| --- | --- | --- |
| `entity` | `string` | Named entity extracted from the documents |
| `question` | `string` | Generated question about the entity |
| `answer` | `string` | Generated answer grounded in the source document |
| `source_document` | `string` | Concatenated text of the document used to generate the pair |
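When consuming the output in an evaluation harness, a quick structural check can catch truncated or malformed files early. The sketch below assumes only the four fields documented above; `validate_qa_pairs` is a hypothetical helper, not something the package provides.

```python
import json

EXPECTED_FIELDS = ("entity", "question", "answer", "source_document")


def validate_qa_pairs(raw: str) -> list[dict]:
    """Parse the output JSON and check that every element has the four string fields."""
    pairs = json.loads(raw)
    if not isinstance(pairs, list):
        raise ValueError("output must be a JSON array")
    for index, pair in enumerate(pairs):
        if not isinstance(pair, dict):
            raise ValueError(f"element {index} is not an object")
        for field in EXPECTED_FIELDS:
            if not isinstance(pair.get(field), str):
                raise ValueError(f"element {index}: field '{field}' is missing or not a string")
    return pairs
```

A typical call would read the file written via `--output` and pass its contents to the function, for example `validate_qa_pairs(Path("qa_output.json").read_text(encoding="utf-8"))` after importing `Path` from `pathlib`.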

Project details

Release history

0.4.0: Apr 13, 2026

0.1.0 (this version): Apr 10, 2026

Provenance

Publisher: cd.yml on ViniciusKos/rag_evaluation_dataset

Source repository:
- Permalink: ViniciusKos/rag_evaluation_dataset@096c896b0f1108f68ce1b3c1ac8e348ad6fede9d
- Branch / Tag: refs/tags/v0.2.0
- Owner: https://github.com/ViniciusKos
- Access: public