Project description

Zeno 🤝 OpenAI Evals

Use Zeno to visualize the results of OpenAI Evals.

pip install zeno-evals

Run an evaluation following the evals instructions. This will produce a cache file in /tmp/evallogs/.

Pass this file to the zeno-evals command:

zeno-evals /tmp/evallogs/my_eval_cache.jsonl

We include an example looking at the MedMCQA dataset (Thanks to @SinanAkkoyun):

zeno-evals ./example_medicine/example.jsonl --functions_file=./example_medicine/distill.py

These details have not been verified by PyPI

0.1.10

Apr 21, 2023

0.1.9

Apr 21, 2023

0.1.8

Apr 21, 2023

0.1.7

Apr 18, 2023

0.1.6

Apr 18, 2023

0.1.5

Apr 18, 2023

0.1.4

Apr 11, 2023

0.1.3

Apr 10, 2023

This version

0.1.2

Mar 16, 2023

0.1.1

Mar 15, 2023

0.1.0

Mar 15, 2023

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Uploaded Mar 16, 2023 Source

Uploaded Mar 16, 2023 Python 3

Hashes for zeno_evals-0.1.2.tar.gz
Algorithm	Hash digest
SHA256	`149a0c469f7c3495aaf443aac2d054e993caacf2a3dc02a938d5ce567ee70d2e`
MD5	`1dea1fb7d8ee797475216530c2b72c37`
BLAKE2b-256	`221c2ff8b2ba246ad7209ce6350eb7d1cb98e1a69f928cd0eaa7173fe36c1765`

Hashes for zeno_evals-0.1.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`d5c8ad1919051c7377e2b1b4202960ab81b2f2282a5feef1a8ccf17cc84500a2`
MD5	`ea5045f71d1040c3366feaa4f86e57f2`
BLAKE2b-256	`337c0fb28361f35cadfbc9eda5079ed5601b8f82d3fee27b725c22b7eb986d47`