Kaggle Environments

These details have been verified by PyPI

Maintainers

Project description

Environments

pip install kaggle-environments

TLDR;

from kaggle_environments import make

# Setup a tictactoe environment.
env = make("tictactoe")

# Basic agent which marks the first available cell.
def my_agent(obs):
  return [c for c in range(len(obs.board)) if obs.board[c] == 0][0]

# Run the basic agent against a default agent which chooses a "random" move.
env.run([my_agent, "random"])

# Render an html ipython replay of the tictactoe game.
env.render(mode="ipython")

Overview

Kaggle Environments was created to evaluate episodes. While other libraries have set interface precedents (such as Open.ai Gym), the emphasis of this library focuses on:

Episode evaluation (compared to training agents).
Configurable environment/agent lifecycles.
Simplified agent and environment creation.
Cross language compatible/transpilable syntax/interfaces.

Help Documentation

# Additional documentation (especially interfaces) can be found on all public functions:
from kaggle_environments import make
help(make)
env = make("tictactoe")
dir(env)
help(env.reset)

Agents

A function which given an observation generates an action.

Writing

Agent functions can have observation and configuration parameters and must return a valid action. Details about the observation, configuration, and actions can seen by viewing the specification.

from kaggle_environments import make
env = make("connectx", {"rows": 10, "columns": 8, "inarow": 5})

def agent(observation, configuration):
  print(observation) # {board: [...], mark: 1}
  print(configuration) # {rows: 10, columns: 8, inarow: 5}
  return 3 # Action: always place a mark in the 3rd column.

# Run an episode using the agent above vs the default random agent.
env.run([agent, "random"])

# Print schemas from the specification.
print(env.specification.observation)
print(env.specification.configuration)
print(env.specification.action)

Loading Agents

Agents are always functions, however there are some shorthand syntax options to make generating/using them easier.

# Agent def accepting an observation and returning an action.
def agent1(obs):
  return [c for c in range(len(obs.board)) if obs.board[c] == 0][0]

# Load a default agent called "random".
agent2 = "random"

# Load an agent from source.
agent3 = """
def act(obs):
  return [c for c in range(len(obs.board)) if obs.board[c] == 0][0]
"""

# Load an agent from a file.
agent4 = "C:\path\file.py"

# Return a fixed action.
agent5 = 3

# Return an action from a url.
agent6 = "http://localhost:8000/run/agent"

Default Agents

Most environments contain default agents to play against. To see the list of available agents for a specific environment run:

from kaggle_environments import make
env = make("tictactoe")

# The list of available default agents.
print(*env.agents)

# Run random agent vs reaction agent.
env.run(["random", "reaction"])

Training

Open AI Gym interface is used to assist with training agents. The None keyword is used below to denote which agent to train (i.e. train as first or second player of connectx).

from kaggle_environments import make

env = make("connectx", debug=True)

# Training agent in first position (player 1) against the default random agent.
trainer = env.train([None, "random"])

obs = trainer.reset()
for _ in range(100):
    env.render()
    action = 0 # Action for the agent being trained.
    obs, reward, done, info = trainer.step(action)
    if done:
        obs = trainer.reset()

Debugging

There are 3 types of errors which can occur from agent execution:

Timeout - the agent runtime exceeded the allowed limit. There are 2 timeouts:
1. agentTimeout - Used for initialization of an agent on first "act".
2. actTimeout - Used for obtaining an action.
Error - the agent raised and error during execution.
Invalid - the agent action response didn't match the action specification or the environment deemed it invalid (i.e. playing twice in the same cell in tictactoe).

To help debug your agent and why it threw the errors above, add the debug flag when setting up the environment.

from kaggle_environments import make

def agent():
  return "Something Bad"

env = make("tictactoe", debug=True)

env.run([agent, "random"])
# Prints: "Invalid Action: Something Bad"

Environments

A function which given a state and agent actions generates a new state.

Name	Description	Make
connectx	Connect 4 in a row but configurable.	`env = make("connectx")`
tictactoe	Classic Tic Tac Toe	`env = make("tictactoe")`
identity	For debugging, action is the reward.	`env = make("identity")`

Making

An environment instance can be made from an existing specification (such as those listed above).

from kaggle_environments import make

# Create an environment instance.
env = make(
  # Specification or name to registered specification.
  "connectx",

  # Override default and environment configuration.
  configuration={"rows": 9, "columns": 10},

  # Initialize the environment from a prior state (episode resume).
  steps=[],

  # Enable verbose logging.
  debug=True
)

Configuration

There are two types of configuration: Defaults applying to every environment and those specific to the environment. The following is a list of the default configuration:

Name	Description
episodeSteps	Maximum number of steps in the episode.
agentTimeout	Maximum runtime (seconds) to initialize an agent.
actTimeout	Maximum runtime (seconds) to obtain an action from an agent.
runTimeout	Maximum runtime (seconds) of an episode (not necessarily DONE).
maxLogLength	Maximum log length (number of characters, `None` -> no limit)

env = make("connectx", configuration={
  "columns": 19, # Specific to ConnectX.
  "actTimeout": 10,
})

Resetting

Environments are reset by default after "make" (unless starting steps are passed in) as well as when calling "run". Reset can be called at anytime to clear the environment.

num_agents = 2
reset_state = env.reset(num_agents)

Running

Execute an episode against the environment using the passed in agents until they are no longer running (i.e. status != ACTIVE).

steps = env.run([agent1, agent2])
print(steps)

Evaluating

Evaluation is used to run an episode (environment + agents) multiple times and just return the rewards.

from kaggle_environments import evaluate

# Same definitions as "make" above.
environment = "connectx"
configuration = {"rows": 10, "columns": 8, "inarow": 5}
steps = []

# Which agents to run repeatedly.  Same as env.run(agents)
agents = ["random", agent1]

# How many times to run them.
num_episodes = 10

rewards = evaluate(environment, agents, configuration, steps, num_episodes)

Stepping

Running above essentially just steps until no agent is still active. To execute a singular game loop, pass in actions directly for each agent. Note that this is normally used for training agents (most useful in a single agent setup such as using the gym interface).

agent1_action = agent1(env.state[0].observation)
agent2_action = agent2(env.state[1].observation)
state = env.step([agent1_action, agent2_action])

Playing

A few environments offer an interactive play against agents within jupyter notebooks. An example of this is using connectx:

from kaggle_environments import make

env = make("connectx")
# None indicates which agent will be manually played.
env.play([None, "random"])

Rendering

The following rendering modes are supported:

json - Same as doing a json dump of env.toJSON()
ansi - Ascii character representation of the environment.
human - ansi just printed to stdout
html - HTML player representation of the environment.
ipython - html just printed to the output of a ipython notebook.

out = env.render(mode="ansi")
print(out)

Command Line

> python main.py -h

List Registered Environments

> python main.py list

Evaluate Episode Rewards

python main.py evaluate --environment tictactoe --agents random random --episodes 10

Run an Episode

> python main.py run --environment tictactoe --agents random /pathtomy/agent.py --debug True

Load an Episode

This is useful when converting an episode json output into html.

python main.py load --environment tictactoe --steps [...] --render '{"mode": "html"}'

HTTP Server

The HTTP server contains the same interface/actions as the CLI above merging both POST body and GET params.

Setup

python main.py http-server --port=8012 --host=0.0.0.0

Running Agents on Separate Servers

# How to run agent on a separate server.
import requests
import json

path_to_agent1 = "/home/ajeffries/git/playground/agent1.py"
path_to_agent2 = "/home/ajeffries/git/playground/agent2.py"

agent1_url = f"http://localhost:5001?agents[]={path_to_agent1}"
agent2_url = f"http://localhost:5002?agents[]={path_to_agent2}"

body = {
    "action": "run",
    "environment": "tictactoe",
    "agents": [agent1_url, agent2_url]
}
resp = requests.post(url="http://localhost:5000", data=json.dumps(body)).json()

# Inflate the response replay to visualize.
from kaggle_environments import make
env = make("tictactoe", steps=resp["steps"], debug=True)
env.render(mode="ipython")
print(resp)

Project details

These details have been verified by PyPI

Maintainers

bobfraser bovard erdalsivri kaggle

Release history Release notifications | RSS feed

1.16.6

Nov 22, 2024

This version

1.16.5

Nov 19, 2024

1.16.4

Nov 15, 2024

1.16.3

Nov 13, 2024

1.16.2

Nov 12, 2024

1.16.1

Nov 8, 2024

1.16.0

Nov 6, 2024

1.15.3

Oct 10, 2024

1.15.2

Oct 10, 2024

1.15.1

Oct 9, 2024

1.15.0

Oct 9, 2024

1.14.17

Oct 2, 2024

1.14.16

Oct 2, 2024

1.14.15

Jun 18, 2024

1.14.14

Jun 17, 2024

1.14.13

Jun 5, 2024

1.14.12

Jun 4, 2024

1.14.11

May 31, 2024

1.14.9

May 28, 2024

1.14.8

May 24, 2024

1.14.7

May 24, 2024

1.14.6

May 23, 2024

1.14.5

May 13, 2024

1.14.3

Sep 20, 2023

1.14.1

Aug 31, 2023

1.14.0

Aug 21, 2023

1.13.0

Mar 2, 2023

1.12.0

Jan 24, 2023

1.11.2

Dec 19, 2022

1.11.1

Dec 9, 2022

1.11.0

Dec 8, 2022

1.10.3

Nov 6, 2022

1.10.2

Oct 31, 2022

1.10.1

Oct 30, 2022

1.10.0

Oct 10, 2022

1.9.11

Jul 11, 2022

1.9.10

May 16, 2022

1.9.9

Apr 12, 2022

1.9.8

Apr 11, 2022

1.9.7

Apr 5, 2022

1.9.5

Mar 28, 2022

1.9.4

Mar 26, 2022

1.9.3

Mar 21, 2022

1.9.2

Mar 17, 2022

1.9.1

Mar 17, 2022

1.9.0

Mar 10, 2022

1.8.12

Sep 1, 2021

1.8.11

Aug 23, 2021

1.8.10

Aug 19, 2021

1.8.9

Aug 19, 2021

1.8.8

Aug 17, 2021

1.8.7

Aug 14, 2021

1.8.6

Aug 13, 2021

1.8.5

Aug 6, 2021

1.8.4

Jul 29, 2021

1.8.3

Jul 24, 2021

1.8.2

Jul 19, 2021

1.8.1

Jul 16, 2021

1.8.0

Jul 12, 2021

1.7.11

Feb 3, 2021

1.7.10

Jan 27, 2021

1.7.9

Jan 27, 2021

1.7.7

Jan 25, 2021

1.7.6

Jan 20, 2021

1.7.3

Dec 9, 2020

1.7.2

Dec 8, 2020

1.7.0

Dec 4, 2020

1.6.0

Nov 16, 2020

1.5.0

Nov 13, 2020

1.4.0

Nov 9, 2020

1.3.14

Oct 27, 2020

1.3.13

Oct 27, 2020

1.3.12

Oct 27, 2020

1.3.11

Oct 27, 2020

1.3.10

Oct 27, 2020

1.3.9

Oct 26, 2020

1.3.8

Oct 19, 2020

1.3.6

Sep 29, 2020

1.3.5

Sep 26, 2020

1.3.4

Sep 26, 2020

1.3.3

Sep 25, 2020

1.3.2

Sep 22, 2020

1.3.1

Sep 17, 2020

1.3.0

Sep 16, 2020

1.2.18

Sep 15, 2020

1.2.17

Sep 15, 2020

1.2.16

Sep 12, 2020

1.2.15

Sep 12, 2020

1.2.14

Sep 11, 2020

1.2.13

Sep 11, 2020

1.2.12

Sep 11, 2020

1.2.11

Sep 11, 2020

1.2.10

Sep 11, 2020

1.2.9

Sep 10, 2020

1.2.7

Sep 9, 2020

1.2.6

Sep 3, 2020

1.2.5

Aug 28, 2020

1.2.4

Aug 27, 2020

1.2.3

Aug 27, 2020

1.2.2

Aug 20, 2020

1.2.1

Jul 31, 2020

1.2.0

Jul 30, 2020

1.1.2

Jul 30, 2020

1.1.1

Jul 29, 2020

1.1.0

Jul 28, 2020

1.0.15

Jul 24, 2020

1.0.14

Jul 24, 2020

1.0.13

Jul 21, 2020

1.0.12

Jul 7, 2020

1.0.11

Jun 30, 2020

1.0.10

Jun 23, 2020

1.0.9

Jun 23, 2020

1.0.8

Jun 18, 2020

1.0.5

Jun 16, 2020

1.0.4

Jun 15, 2020

1.0.3

Jun 14, 2020

1.0.2

Jun 13, 2020

1.0.1

Jun 13, 2020

1.0.0

Jun 13, 2020

0.3.13

Jun 10, 2020

0.3.12

Jun 3, 2020

0.3.11

May 29, 2020

0.3.10

May 29, 2020

0.3.9

May 29, 2020

0.3.8

May 29, 2020

0.3.7

May 29, 2020

0.3.6

May 29, 2020

0.3.5

May 26, 2020

0.3.4

May 21, 2020

0.3.3

May 19, 2020

0.3.2

May 14, 2020

0.3.1

May 13, 2020

0.3.0

May 13, 2020

0.2.8

May 12, 2020

0.2.7

May 12, 2020

0.2.6

May 8, 2020

0.2.5

May 8, 2020

0.2.4

May 8, 2020

0.2.3

May 7, 2020

0.2.2

May 6, 2020

0.2.1

Apr 7, 2020

0.2.0

Apr 7, 2020

0.1.6

Jan 10, 2020

0.1.5

Jan 9, 2020

0.1.4

Jan 3, 2020

0.1.3

Jan 3, 2020

0.1.2

Jan 3, 2020

0.1.1

Jan 3, 2020

0.1.0

Jan 3, 2020

0.0.1

Dec 17, 2019

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

kaggle-environments-1.16.5.tar.gz (1.3 MB view details)

Uploaded Nov 19, 2024 Source

Built Distribution

kaggle_environments-1.16.5-py2.py3-none-any.whl (1.4 MB view details)

Uploaded Nov 19, 2024 Python 2 Python 3

File details

Details for the file kaggle-environments-1.16.5.tar.gz.

File metadata

Download URL: kaggle-environments-1.16.5.tar.gz
Upload date: Nov 19, 2024
Size: 1.3 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.2 CPython/3.11.9

File hashes

Hashes for kaggle-environments-1.16.5.tar.gz
Algorithm	Hash digest
SHA256	`84f0d55b8baa193e0e6015a8ba4818efae359e229a2905f12ff3809a91d068f7`
MD5	`ccedac8cdd4f6aec90c9c37bf8e61bb3`
BLAKE2b-256	`6ffc66db0114ffc347d668ac14455c1524295b4b97d9ff305e36d15f520f7802`

See more details on using hashes here.

File details

Details for the file kaggle_environments-1.16.5-py2.py3-none-any.whl.

File metadata

Download URL: kaggle_environments-1.16.5-py2.py3-none-any.whl
Upload date: Nov 19, 2024
Size: 1.4 MB
Tags: Python 2, Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.2 CPython/3.11.9

File hashes

Hashes for kaggle_environments-1.16.5-py2.py3-none-any.whl
Algorithm	Hash digest
SHA256	`f283afe1b8f97a50c385af94a85bf5a7b9b2cde4ef8a3c578bdfee9b45469f9a`
MD5	`2ef0eaed9fd63eb1d03c147303250aab`
BLAKE2b-256	`d0962c9b29bd0475761cd9ae5ebc87b4e28b4ac6f8404a83935fbdbb9b4f9b4d`

See more details on using hashes here.

kaggle-environments 1.16.5

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Project description

Environments

TLDR;

Overview

Help Documentation

Agents

Writing

Loading Agents

Default Agents

Training

Debugging

Environments

Making

Configuration

Resetting

Running

Evaluating

Stepping

Playing

Rendering

Command Line

List Registered Environments

Evaluate Episode Rewards

Run an Episode

Load an Episode

HTTP Server

Setup

Running Agents on Separate Servers

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes