Debug code generation models
Project description
CodeGaze [Beta]: A library for evaluating and debugging code generation models
Still in development.
Code gaze implements a set of evaluation metrics and visualization tools for debugging code generation models.
CodeGaze is build around a set of abstractions that allow for the evaluation of code generation models.
- Dataset: An example code generation dataset e.g. humaneval.
- Experiment: A set of parameters that define the evaluation of a code generation model. Each experiment specifies things like the dataset, model properties (temperature, n_completions), and some metric properties.
- Model: A code generation model that can be evaluated. This is either an OpenAI model or a HuggingFace model.
The basic starting point is to run an experiment on a dataset with a list of models.
Installation
pip install codegaze
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
codegaze-0.0.23a0.tar.gz
(1.1 MB
view details)
Built Distribution
codegaze-0.0.23a0-py3-none-any.whl
(814.2 kB
view details)
File details
Details for the file codegaze-0.0.23a0.tar.gz
.
File metadata
- Download URL: codegaze-0.0.23a0.tar.gz
- Upload date:
- Size: 1.1 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.9.12
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | e4e128652f8ac8e623752452040cd70960e47581f2040bcc7f41c9ba8de9de5a |
|
MD5 | c51ac7697fa80dbad891571ecfba858b |
|
BLAKE2b-256 | 4e44107a098d7bad8090a5e432cfd35318713534f55fe25ec270749b02f6e675 |
File details
Details for the file codegaze-0.0.23a0-py3-none-any.whl
.
File metadata
- Download URL: codegaze-0.0.23a0-py3-none-any.whl
- Upload date:
- Size: 814.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.9.12
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | e8d560e5d1399cef5853918523de3f6576654aeb95a449e9ee4ac4ee2e95fe79 |
|
MD5 | 5f76277a8c726a286869b250f2d3df61 |
|
BLAKE2b-256 | 5a01bc2df907f294d850dde9377813c8e3ff8ffa623326249bc801f4ccaf9aa1 |