The official InterCode benchmark package - a framework for interactive code tasks

🤖💻 InterCode

Build interactive code environments for training, testing, and augmenting code and decision making agents

👋 Overview

InterCode is a lightweight, flexible, and easy-to-use framework for designing interactive code environments. Following the popular gym interface definition, InterCode makes it easy to quickly define a code environment and deploy an agent to operate in code within the context of the environment.

For an overview of InterCode, building interactive code tasks with InterCode, and evaluating agents on InterCode environments, please check out our wiki, website and the original paper:

InterCode: Standardizing and Benchmarking Interactive Coding with Execution Feedback

🛠️ Installation

Note: InterCode requires the following to be installed:

  • python >= 3.8
  • docker (see the Docker installation instructions)

Then install the package from PyPI:

pip install intercode-bench
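
Before installing, it can help to confirm both prerequisites are present. The following is a minimal sketch using only the Python standard library; `check_prerequisites` is a hypothetical helper, not part of InterCode, and it only verifies that a `docker` executable is on the PATH, not that the Docker daemon is running.

```python
import shutil
import sys

MIN_PYTHON = (3, 8)  # minimum version from the requirements list above


def check_prerequisites(min_python=MIN_PYTHON):
    """Return a list of human-readable problems; an empty list means all checks passed."""
    problems = []
    if sys.version_info < min_python:
        problems.append(
            f"Python {min_python[0]}.{min_python[1]}+ required, "
            f"found {sys.version_info.major}.{sys.version_info.minor}"
        )
    # Only checks that a `docker` executable is on PATH, not that the daemon is up.
    if shutil.which("docker") is None:
        problems.append("docker executable not found on PATH")
    return problems


if __name__ == "__main__":
    for problem in check_prerequisites():
        print("WARNING:", problem)
```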

🚀 Quick Start

Before running the code below, make sure the Docker daemon/application is running locally.

Bash

Create a Python file and paste in the following code to interact with the InterCode Bash environment.

import readline  # line editing and history for input(); Unix-only in the standard library
from intercode.assets import bash_build_docker, bash_image_name, bash_test_data
from intercode.envs import BashEnv

if __name__ == '__main__':
    # Build the Docker image that backs the Bash environment
    bash_build_docker()
    env = BashEnv(bash_image_name, data_path=bash_test_data, traj_dir="logs/", verbose=True)

    try:
        # Run three episodes, reading each action from the terminal
        for _ in range(3):
            env.reset()
            obs, done = env.observation, False
            while not done:
                action = input('> ')
                obs, reward, done, info = env.step(action)
    except KeyboardInterrupt:
        print("Keyboard interrupt detected")
    finally:
        # Always release the environment, even after an interrupt
        env.close()

If InterCode was installed successfully, the InterCode Bash environment should start and a CLI interpreter should appear, allowing you to enter bash commands to interact with the task setting's file system. You can press ^C at any time to exit the environment.
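
The interactive loop above reads actions from the keyboard, but the same `reset`/`step` pattern can be driven by a scripted list of actions. The sketch below illustrates this without Docker by using `StubEnv`, a hypothetical stand-in that is not part of InterCode; its termination and reward rules are invented for illustration, and only the `(obs, reward, done, info)` return shape is taken from the quick-start code.

```python
from typing import Dict, List, Tuple


class StubEnv:
    """Hypothetical stand-in for BashEnv; NOT part of InterCode."""

    def __init__(self, max_steps: int = 3) -> None:
        self.max_steps = max_steps
        self.observation = None
        self._steps = 0

    def reset(self) -> None:
        self._steps = 0
        self.observation = "ready"

    def step(self, action: str) -> Tuple[str, float, bool, Dict]:
        # Invented rule: the episode ends on "submit" or after max_steps actions.
        self._steps += 1
        self.observation = f"ran: {action}"
        done = action == "submit" or self._steps >= self.max_steps
        reward = 1.0 if action == "submit" else 0.0
        return self.observation, reward, done, {}

    def close(self) -> None:
        pass


def run_episode(env, actions: List[str]) -> float:
    """Feed scripted actions to the environment and return the last reward seen."""
    env.reset()
    reward = 0.0
    for action in actions:
        obs, reward, done, info = env.step(action)
        if done:
            break
    return reward


if __name__ == "__main__":
    env = StubEnv()
    try:
        print(run_episode(env, ["ls", "submit"]))
    finally:
        env.close()
```

Under the assumption that the real environment exposes the same methods as in the quick-start code, `run_episode` could be pointed at it in place of `StubEnv`.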

SQL

Create a Python file and paste in the following code to interact with the InterCode SQL environment.

import readline  # line editing and history for input(); Unix-only in the standard library
from typing import Dict

from intercode.assets import sql_build_docker, sql_image_name, sql_test_data
from intercode.envs import SqlEnv

def preprocess(record: Dict) -> str:
    # Select the database named in the task record
    db = record["extra"]["db"]
    return f"use {db}"

if __name__ == '__main__':
    # Build the Docker image that backs the SQL environment
    sql_build_docker()
    env = SqlEnv(sql_image_name, data_path=sql_test_data, preprocess=preprocess, traj_dir="logs/", verbose=True)

    try:
        # Run three episodes, reading each action from the terminal
        for _ in range(3):
            env.reset()
            obs, done = env.observation, False
            while not done:
                action = input('> ')
                obs, reward, done, info = env.step(action)
    except KeyboardInterrupt:
        print("Keyboard interrupt detected")
    finally:
        # Always release the environment, even after an interrupt
        env.close()

If InterCode was installed successfully, the InterCode SQL environment should start and a CLI interpreter should appear, allowing you to enter SQL commands to interact with the task setting's MySQL database. You can press ^C at any time to exit the environment.
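
To see what the `preprocess` function from the SQL example produces, it can be called on its own. The record below is hypothetical: only the `"extra"` → `"db"` nesting comes from the code above, and `example_db` is a made-up database name.

```python
from typing import Dict


def preprocess(record: Dict) -> str:
    # Same function as in the SQL quick-start example
    db = record["extra"]["db"]
    return f"use {db}"


# Hypothetical task record; real records may carry additional fields.
sample_record = {"extra": {"db": "example_db"}}

print(preprocess(sample_record))  # prints: use example_db
```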

🔎 Learn More

To learn more about the InterCode framework, please check out the website and GitHub repository.

🪪 License

Check LICENSE.md

Download files

Source Distribution

intercode-bench-0.1.22.tar.gz (141.5 kB)

Built Distribution

intercode_bench-0.1.22-py3-none-any.whl (143.6 kB, Python 3)
