The official InterCode benchmark package - a framework for interactive code tasks
Project description
🤖💻 Intercode
Build interactive code environments for training, testing, and augmenting code and decision making agents
👋 Overview
InterCode is a lightweight, flexible, and easy-to-use framework for designing interactive code environments. Following the popular gym
interface definition, InterCode makes it easy to quickly define a code environment and deploy an agent to operate in code within the context of the environment.
For an overview of InterCode, building interactive code tasks with InterCode, and evaluating agents on InterCode environments, please check out our wiki, website and the original paper:
InterCode: Standardizing and Benchmarking Interactive Coding with Execution Feedback
🛠️ Installation
Note InterCode requires
python
>= 3.8 a localdocker
installation to run. Learn more here to install.
pip install intercode-bench
🚀 Quick Start
- Clone the InterCode repository
- Run
./setup.sh
- Run
python run_sql.py
If InterCode was installed successfully, the InterCode SQL environment should be started successfully and a CLI interpreter should appear, allowing you to enter SQL
commands to interact with the task setting.
🔎 Learn More
To learn more about the InterCode framework, please check out the website and GitHub repository
🪪 License
Check LICENSE.md
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for intercode_bench-0.1.10-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | eaf3df7fb30370ec64ca8476241f52ceea8dccb3f8ea947120bf964fe86e7f4b |
|
MD5 | e11c8819773de1d7289229435b5677bb |
|
BLAKE2b-256 | b6d029b251fd4b9d9ad45dc6aa51cc873e3c43cdd63315ee9e185609ff3b2b65 |