The official InterCode benchmark package - a framework for interactive code tasks
Project description
🤖💻 Intercode
Build interactive code environments for training, testing, and augmenting code and decision making agents
👋 Overview
InterCode is a lightweight, flexible, and easy-to-use framework for designing interactive code environments. Following the popular gym
interface definition, InterCode makes it easy to quickly define a code environment and deploy an agent to operate in code within the context of the environment.
For an overview of InterCode, building interactive code tasks with InterCode, and evaluating agents on InterCode environments, please check out our wiki, website and the original paper:
InterCode: Standardizing and Benchmarking Interactive Coding with Execution Feedback
🛠️ Installation
Note InterCode requires the following installations to run:
python
>= 3.8docker
: Learn more here to install.
pip install intercode-bench
🚀 Quick Start
Before running the below code, make sure the Docker daemon/application is running locally.
Bash
Create a python file and copy + paste the following code to interact with the InterCode Bash environment.
import readline
from intercode.assets import bash_build_docker, bash_image_name, bash_test_data
from intercode.envs import BashEnv
if __name__ == '__main__':
bash_build_docker()
env = BashEnv(bash_image_name, data_path=bash_test_data, traj_dir="logs/", verbose=True)
try:
for idx in range(3):
env.reset()
obs, done = env.observation, False
while not done:
action = input('> ')
obs, reward, done, info = env.step(action)
except KeyboardInterrupt:
print("Keyboard interrupt detected")
finally:
env.close()
If InterCode was installed successfully, the InterCode Bash environment should be started successfully and a CLI interpreter should appear, allowing you to enter bash
commands to interact with the task setting's file system. You can ^c
at any to time to exit the environment.
SQL
Create a python file and copy + paste the following code to interact with the InterCode SQL environment.
import readline
from intercode.assets import sql_build_docker, sql_image_name, sql_test_data
from intercode.envs import SqlEnv
from typing import Dict
def preprocess(record: Dict) -> str:
db = record["extra"]["db"]
return f"use {db}"
if __name__ == '__main__':
sql_build_docker()
env = SqlEnv(sql_image_name, data_path=sql_test_data, preprocess=preprocess, traj_dir="logs/", verbose=True)
try:
for idx in range(3):
env.reset()
obs, done = env.observation, False
while not done:
action = input('> ')
obs, reward, done, info = env.step(action)
except KeyboardInterrupt:
print("Keyboard interrupt detected")
finally:
env.close()
If InterCode was installed successfully, the InterCode SQL environment should be started successfully and a CLI interpreter should appear, allowing you to enter SQL
commands to interact with the task setting's MySQL database. You can ^c
at any to time to exit the environment.
🔎 Learn More
To learn more about the InterCode framework, please check out the website and GitHub repository
🪪 License
Check LICENSE.md
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file intercode-bench-0.1.22.tar.gz
.
File metadata
- Download URL: intercode-bench-0.1.22.tar.gz
- Upload date:
- Size: 141.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.9.16
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 3cbceb06febaf2a1de1c981757d4bd154ec44fe4a344995b6305a3d1d2717e2a |
|
MD5 | ee54a64961ed94bbd2d889e3a99a8631 |
|
BLAKE2b-256 | 18028d99e54630c8d8bd9c827ae9f80a52def3c0806eb31ee9681226fc8b3fbe |
File details
Details for the file intercode_bench-0.1.22-py3-none-any.whl
.
File metadata
- Download URL: intercode_bench-0.1.22-py3-none-any.whl
- Upload date:
- Size: 143.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.9.16
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 3cecd6796bdb70276a5915ca1db725b5e419291638a34383b16747530d95b8cc |
|
MD5 | 87e00348f437315601ad07d2d92636ec |
|
BLAKE2b-256 | e23d2539ef4f919bbb601c4a92eaf27ef66d85f72af4d4ac5f0c1d048389f10f |