Visualize OpenAI evals with Zeno
Zeno 🤝 OpenAI Evals
Use Zeno to visualize the results of OpenAI Evals.
Usage
pip install zeno-evals
Run an evaluation following the evals instructions. This will produce a cache file in /tmp/evallogs/.
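Before loading the cache file into Zeno, it can help to check that it is well-formed. The following is a minimal sketch, assuming only that the file is in JSON Lines format (one JSON value per line, as the .jsonl extension suggests). The "type" field used for grouping is an assumption about the record layout, not something this README documents, and summarize_jsonl is a hypothetical helper, not part of zeno-evals.

```python
import json
from collections import Counter
from pathlib import Path


def summarize_jsonl(path):
    """Count the records in a JSON Lines file, grouped by their "type" field."""
    counts = Counter()
    with Path(path).open(encoding="utf-8") as f:
        for line in f:
            line = line.strip()
            if not line:
                continue  # skip blank lines
            record = json.loads(line)  # raises ValueError on a malformed line
            if isinstance(record, dict):
                # "type" is an assumed field name; records without it
                # are grouped under "<no type>".
                key = record.get("type", "<no type>")
            else:
                key = "<not an object>"
            counts[key] += 1
    return counts
```

Calling summarize_jsonl("/tmp/evallogs/my_eval_cache.jsonl") returns a Counter mapping each group to its record count; a ValueError means some line is not valid JSON.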
Pass this file to the zeno-evals command:
zeno-evals /tmp/evallogs/my_eval_cache.jsonl
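If /tmp/evallogs/ holds several cache files, picking the newest one by hand gets tedious. The sketch below selects the most recently modified .jsonl file and passes it to the zeno-evals command as its single positional argument, exactly as in the command above. The helper names (latest_eval_log, zeno_evals_command, launch_latest) are hypothetical, and the script assumes zeno-evals is installed and on your PATH.

```python
import subprocess
from pathlib import Path


def latest_eval_log(log_dir="/tmp/evallogs"):
    """Return the most recently modified .jsonl file in log_dir, or None."""
    logs = sorted(Path(log_dir).glob("*.jsonl"), key=lambda p: p.stat().st_mtime)
    return logs[-1] if logs else None


def zeno_evals_command(log_path):
    """Build the argument list for: zeno-evals <log_path>."""
    return ["zeno-evals", str(log_path)]


def launch_latest(log_dir="/tmp/evallogs"):
    """Run zeno-evals on the newest cache file in log_dir."""
    log = latest_eval_log(log_dir)
    if log is None:
        raise FileNotFoundError(f"No .jsonl files found in {log_dir}")
    # check=True raises CalledProcessError if zeno-evals exits with an error.
    subprocess.run(zeno_evals_command(log), check=True)
```

Call launch_latest() to start Zeno on the newest file. Building the command as a list rather than a shell string avoids quoting problems with unusual file names.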
Example
We include an example looking at the MedMCQA dataset:
zeno-evals example.jsonl
Todo
- Support model-graded evaluations
- Support custom evaluation templates (e.g. BLEU for translation)