Generate ideal question-answer dataset for testing your LLM.
Project description
FiddleCube - Generate ideal question-answers for testing RAG
FiddleCube generates an ideal question-answer dataset for testing your LLM. Run tests on this dataset before pushing any prompt or RAG upgrades.
Quickstart
Install FiddleCube
pip3 install fiddlecube
API Key
Get the API key here.
Usage
from fiddlecube import FiddleCube
fc = FiddleCube(api_key="<api-key>")
dataset = fc.generate(
[
"The cat did not want to be petted.",
"The cat was not happy with the owner's behavior.",
],
10,
)
dataset
{
"results": [
{
"query": "Question: Why did the cat not want to be petted?",
"contexts": ["The cat did not want to be petted."],
"answer": "The cat did not want to be petted because it was not in the mood for physical affection at that moment.",
"score": 0.8,
"question_type": "SIMPLE"
},
{
"query": "Was the cat pleased with the owner's actions?",
"contexts": ["The cat was not happy with the owner's behavior."],
"answer": "No, the cat was not pleased with the owner's actions.",
"score": 0.8,
"question_type": "NEGATIVE"
}
],
"status": "COMPLETED",
"num_tokens_generated": 44,
"rate_limited": false
}
Ideal QnA datasets for testing, eval and training LLMs
Testing, evaluation or training LLMs requires an ideal QnA dataset aka the golden dataset.
This dataset needs to be diverse, covering a wide range of queries with accurate responses.
Creating such a dataset takes significant manual effort.
As the prompt or RAG contexts are updated, which is nearly all the time for early applications, the dataset needs to be updated to match.
FiddleCube generates ideal QnA from vector embeddings
- The questions cover the entire RAG knowledge corpus.
- Complex reasoning, safety alignment and 5 other question types are generated.
- Filtered for correctness, context relevance and style.
- Auto-updated with prompt and RAG updates.
Roadmap
- Question-answers, complex reasoning from RAG
- Multi-turn conversations
- Evaluation Setup - Integrate metrics
- CI setup - Run as part of CI/CD pipeline
- Diagnose failures - step-by-step analysis of failed queries
More Questions?
Book a demo
Contact us at founders@fiddlecube.ai for any feature requests, feedback or questions.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file fiddlecube-0.1.6.tar.gz
.
File metadata
- Download URL: fiddlecube-0.1.6.tar.gz
- Upload date:
- Size: 2.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.5.1 CPython/3.11.4 Darwin/22.3.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 6b48032a16c2c16e01e6305643759e977d9d51d3a3496de6e8e8d3fe2deefd46 |
|
MD5 | fd1cb341202432c703afa0d6d2631388 |
|
BLAKE2b-256 | 75c3ceedd47bc46b0b9159c80889b06fa469942bf6700382c52bd3d5086823da |
File details
Details for the file fiddlecube-0.1.6-py3-none-any.whl
.
File metadata
- Download URL: fiddlecube-0.1.6-py3-none-any.whl
- Upload date:
- Size: 2.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.5.1 CPython/3.11.4 Darwin/22.3.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | c3f659d86f59cb3e33839ce3cf16023d248b773c0c7b5153f2f0506574456dc7 |
|
MD5 | dc3f791f96cc09369fb6fadfba440037 |
|
BLAKE2b-256 | ecdeca053ae0d409549e59f8f1f3f33b3e0f8b9caa7bd9cc4dc7b5e1ee651fc4 |