Debug code generation models
Project description
CodeGaze [Beta]: A library for evaluating and debugging code generation models
Still in development.
Code gaze implements a set of evaluation metrics and visualization tools for debugging code generation models.
CodeGaze is build around a set of abstractions that allow for the evaluation of code generation models.
- Dataset: An example code generation dataset e.g. humaneval.
- Experiment: A set of parameters that define the evaluation of a code generation model. Each experiment specifies things like the dataset, model properties (temperature, n_completions), and some metric properties.
- Model: A code generation model that can be evaluated. This is either an OpenAI model or a HuggingFace model.
The basic starting point is to run an experiment on a dataset with a list of models.
Installation
pip install codegaze
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
codegaze-0.0.2a0.tar.gz
(11.3 kB
view details)
Built Distribution
File details
Details for the file codegaze-0.0.2a0.tar.gz
.
File metadata
- Download URL: codegaze-0.0.2a0.tar.gz
- Upload date:
- Size: 11.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.9.12
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | a67bede2bb380a32da660c6f45dff8984115dfe7840eb6926028485441474585 |
|
MD5 | e507026f476f0f27ad550a5c2e61e58a |
|
BLAKE2b-256 | e70b362ba4de3da62060e63461bfefeea4ce43ce63baa00d39a4c048ddbff3c3 |
File details
Details for the file codegaze-0.0.2a0-py3-none-any.whl
.
File metadata
- Download URL: codegaze-0.0.2a0-py3-none-any.whl
- Upload date:
- Size: 12.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.9.12
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 7d2c7c3f2a9ff92146ee4be8a16c79afa2e3279680084370093d66aa404451c1 |
|
MD5 | b0c098f0583f7ae503dcee413f75a6b7 |
|
BLAKE2b-256 | 6c51fe228f863278a1d22356f7d23e0cef1a9464d8c98151bfaa23b8849012f6 |