Rust Game Collection with Reninforcement Learning gym environments
Project description
Zarena
🦀 Rust Game Collection with Reninforcement Learning gym environments. This library aims to serve the same purpose as OpenSpiel, except in Rust to make it easier to use & maintain. The current games are gato, blackjack, chess & poker texas hold'em. All of these additionally support Web Assembly. You can play gato & chess against our Artificial Intelligence at Zeti Games
Configurations
Depending on the cargo file you want. You must change your cargo.toml to match that build.
Cargo.py.toml -> Python Build
Cargo.rs.toml -> Development Build
Cargo.wa.toml -> Web Assembly Build
Cargo.toml -> The actual file that Rust will build on. Copy from py/rs/wa to this file.
Commands
If you don't have Rust, no worries. Download Rust for Linux or Windows Subsystem. If you need more help.
curl --proto '=https' --tlsv1.2 -sSf https://sh.rustup.rs | sh
Download the C compiler
sudo apt-get update && sudo apt-get install build-essential
Install poetry
curl -sSL https://raw.githubusercontent.com/python-poetry/poetry/master/get-poetry.py | python -
Install Maturin via Poetry
poetry install
Build the Maturin Develop Build
poetry run maturin develop
Build the Maturin Test Build
poetry run maturin build
Build the Maturin Production Build. The Python Wheel & source distribution.
poetry run maturin build --release
Build the Web Assembly file
wasm-pack build --target web -- --features wasm
Usage
You can import the Python classes directly, or create pre-defined environments with gym in this case it is also necessary to import the class:
# import gym to use training environments
import gym
# from zarena import the training environment of your choice, for:
# option 1.- use the python class directly
# option 2.- register the environment in gym and use it with gym.make(environment_name)
from zarena import gym_chess
env = gym_chess.ChessEnv() # Option 1
env = gym.make('ChessEnv-v3') # Option 2
# reset the environment and get the initial state observation
observation = env.reset()
# obtain legal actions
actions = env.legal_actions()
# select action according to a criterion, in this case random
action = random.choice(actions)
# pass it to the env and get the next state observation, reward, if the game is over and environment information
observation, reward, done, info = env.step(action)
# get the player to play
env.to_play()
# properly close the game
env.close()
# display the game ovservation
env.render()
Environments id
- Tictactoe:
TictactoeR-v2 - Chess:
ChessEnv-v3 - Blackjack:
BlackjackR-v1 - Poker:
PokerR-v1
Testing
Run all the tests with pytest.
Code linting and fixing
Python code is formatted with black.
Rust code is formatted with cargo fmt.
Building the Rust code
The environment uses a chess engine implemented in Rust that uses PyO3 Maturin to bind to the Python interpreter. Rust is an amazing compiled language and this project holds 2 configurations:
Cargo.py.tomlis used to build the library into a Python module with maturinCargo.rs.tomlis used to build directly withcargoin Rust to access the library in themain.rsscript for developmentCargo.wa.tomlis used to build to build for Javascript with Web Assembly. The games can be played via Web Assembly on Zeti's website https://zeti.ai
Note: we haven't found a way to specify the Cargo toml file to either process, so copy the contents of the config you want to use into Cargo.toml to make it work.
Game of Gato
The game of Xs & Os
API
Initialize environment
>>> env = BlackjackEnv(n_players=1)
n_players: specify the number of players2<=n_players<=7(default:1)
Set actions
>>> env.step(action)
action: mark a position, could be0<=action<=8
> 0 | 1 | 2
> 3 | 4 | 5
> 6 | 7 | 8
Available opponents
- random
- expert
Notes:
Tests not implemented yet.
Blackjack
API
Initialize environment
>>> env = BlackjackEnv(n_players=1)
n_players: specify the number of players2<=n_players<=7(default:1)
Set actions
>>> env.step(action)
action: can be0-> stand1-> HIT2-> double down3-> pull apart (currently disabled)
Notes:
Tests not implemented yet.
Chess
See the chess board and moves
Visualise the current state of the chess game:
env.render()
-------------------------
8 | ♖ ♘ ♗ ♕ ♔ ♗ ♘ ♖ |
7 | ♙ ♙ ♙ ♙ ♙ ♙ ♙ ♙ |
6 | . . . . . . . . |
5 | . . . . . . . . |
4 | . . . . . . . . |
3 | . . . . . . . . |
2 | ♟ ♟ ♟ ♟ ♟ ♟ ♟ ♟ |
1 | ♜ ♞ ♝ ♛ ♚ ♝ ♞ ♜ |
-------------------------
a b c d e f g h
You can also visualise multiple moves:
>>> moves = env.possible_moves
>>> env.render_moves(moves[10:12] + moves[16:18])
API
Initialize environment
>>> env = ChessEnv(player_color="WHITE", opponent="random", log=True, initial_state=DEFAULT_BOARD)
opponent: can be"random","none"or a function. Tells the environment whether to use a bot that picks a random move, play against self or use a specific bot policy (default:"random")log:TrueorFalse, specifies whether to log every move and render every new state (default:True)initial_state: initial board positions, the default value is the default chess starting board. You can specify a custom board. View scriptsgym_chess/test/for some examplesplayer_color:"WHITE"or"BLACK", only useful if playing against a bot (default:"WHITE")
>>> env.get_possible_moves(state=state, player="WHITE", attack=False)
This method will calculate the possible moves. By default they are calculated at the current state for the current player (state.current_player).
state: (optional) state for which to calculate the movesplayer: (optional) "WHITE" or "BLACK", specifies the player
Move specification:
Moves are encoded as either:
- a tuple of coordinates
((from_x, from_y), (to_x, to_y)) - or a string e.g.
"CASTLE_KING_SIDE_WHITE","CASTLE_QUEEN_SIDE_BLACK","RESIGN"
Moves are pre-calculated for every new state and stored in possible_moves.
Get State
>>> print(env.state['board'])
[[-3, -5, -4, -2, -1, -4, -5, -3],
[-6, -6, -6, -6, -6, -6, -6, -6],
[0, 0, 0, 0, 0, 0, 0, 0],
[0, 0, 0, 0, 0, 0, 0, 0],
[0, 0, 0, 0, 0, 0, 0, 0],
[0, 0, 0, 0, 0, 0, 0, 0],
[6, 6, 6, 6, 6, 6, 6, 6],
[3, 5, 4, 2, 1, 4, 5, 3]]
Every integer represents a piece. Positive pieces are white and negative ones are black.
Piece IDs are stored in constants that can be imported.
from gym_chess.envs.chess import (
KING_ID,
QUEEN_ID,
ROOK_ID,
BISHOP_ID,
KNIGHT_ID,
PAWN_ID,
)
The schema is:
EMPTY_SQUARE_ID = 0
KING_ID = 1
QUEEN_ID = 2
ROOK_ID = 3
BISHOP_ID = 4
KNIGHT_ID = 5
PAWN_ID = 6
Additional information can be found in other attributes of the environment:
env.current_player
env.white_king_castle_possible
env.white_queen_castle_possible
env.black_king_castle_possible
env.black_queen_castle_possible
env.white_king_on_the_board
env.black_king_on_the_board
Notes:
En-passant has not been implemented yet.
Poker
API
Initialize environment
>>> env = PokerEnv(n_players=2, infinite_game=True)
n_players: specify the number of players2<=n_players<=9(default:2)infinite_game:TrueorFalse, specify if players get their starting credit back after each round (default:True)
Set actions
>>> env.step(action)
action: can be0-> small blind1-> big blind2-> fold3-> check4-> bet5-> call6-> raise to 257-> raise to 508-> raise to 1009-> raise to 50010-> raise to 100011-> all in
Notes:
Tests not implemented yet.
References
- https://github.com/PyO3/maturin
- https://github.com/genyrosk/gym-chess (Thanks to genyrosk for gym-chess)
- https://github.com/deepmind/open_spiel
Contrbutions
Pull Request Are Welcomed!
License
MIT
Social
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distributions
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file zarena-0.1.3.tar.gz.
File metadata
- Download URL: zarena-0.1.3.tar.gz
- Upload date:
- Size: 58.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: maturin/0.12.5
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
d36ffa0f9e316717e13318cbfbcc2a2948b31db0b6d706f429b5f61266d17507
|
|
| MD5 |
3dfefe6da8ff017fa7ce7d8edc6986c5
|
|
| BLAKE2b-256 |
417cf5edbd7753cf7af7e5c85902fb46fc6b3606117932944cf05a098a314200
|
File details
Details for the file zarena-0.1.3-cp36-abi3-win_amd64.whl.
File metadata
- Download URL: zarena-0.1.3-cp36-abi3-win_amd64.whl
- Upload date:
- Size: 233.8 kB
- Tags: CPython 3.6+, Windows x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: maturin/0.12.5
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
461f1c1de7476baa63dcebb414cc4536b3551b812b46f949ecfefa1e7947ff07
|
|
| MD5 |
60dfb89718e3c47230d30ed41ae04c5e
|
|
| BLAKE2b-256 |
057c9667f35077a9893ee8aa6d67cdce54bda7bb481e39b73ca170aef5338542
|
File details
Details for the file zarena-0.1.3-cp36-abi3-manylinux_2_5_x86_64.manylinux1_x86_64.whl.
File metadata
- Download URL: zarena-0.1.3-cp36-abi3-manylinux_2_5_x86_64.manylinux1_x86_64.whl
- Upload date:
- Size: 1.1 MB
- Tags: CPython 3.6+, manylinux: glibc 2.5+ x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: maturin/0.12.5
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
b551c559948634fdcdd96d59f390a5a1707f17677847f095f5b7995cfbd0f744
|
|
| MD5 |
61ba44fbadb6774ca8d9a1ea1d1b278f
|
|
| BLAKE2b-256 |
39f3acc3d34734d3333ea43e9a92dda58af7fec6f116c8b0e1527c39be5eee5e
|
File details
Details for the file zarena-0.1.3-cp36-abi3-manylinux_2_5_i686.manylinux1_i686.whl.
File metadata
- Download URL: zarena-0.1.3-cp36-abi3-manylinux_2_5_i686.manylinux1_i686.whl
- Upload date:
- Size: 1.1 MB
- Tags: CPython 3.6+, manylinux: glibc 2.5+ i686
- Uploaded using Trusted Publishing? No
- Uploaded via: maturin/0.12.5
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
e9918ed0cf8d78658818395b5fe3a34ed85687ac609fe7f1270abf789d4922c2
|
|
| MD5 |
81ddfdccc29110c3e9b5bf6c968cf545
|
|
| BLAKE2b-256 |
94156b271e348536852c662dad08669ca5d4c4df9dfdb41267e1d8e0938d15a3
|
File details
Details for the file zarena-0.1.3-cp36-abi3-macosx_10_9_x86_64.macosx_11_0_arm64.macosx_10_9_universal2.whl.
File metadata
- Download URL: zarena-0.1.3-cp36-abi3-macosx_10_9_x86_64.macosx_11_0_arm64.macosx_10_9_universal2.whl
- Upload date:
- Size: 657.8 kB
- Tags: CPython 3.6+, macOS 10.9+ universal2 (ARM64, x86-64), macOS 10.9+ x86-64, macOS 11.0+ ARM64
- Uploaded using Trusted Publishing? No
- Uploaded via: maturin/0.12.5
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
7fc76bbb83632fb4545505e486bd848bca0a2a519c46b821355c5aace59bee3c
|
|
| MD5 |
d2bfc8e7fd83dd41d0b590900a236c84
|
|
| BLAKE2b-256 |
510a69bfad5936edbc9ff6e4420719330c4d2a95f2e5e26ba191db18562b725e
|
File details
Details for the file zarena-0.1.3-cp36-abi3-macosx_10_7_x86_64.whl.
File metadata
- Download URL: zarena-0.1.3-cp36-abi3-macosx_10_7_x86_64.whl
- Upload date:
- Size: 349.2 kB
- Tags: CPython 3.6+, macOS 10.7+ x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: maturin/0.12.5
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
87ff50aae5eab86fc667c4343fecff0ff51b8c6a7e54b74394da79811a574cc0
|
|
| MD5 |
07a7c9fb59d18a51940cd3bd89b10ebc
|
|
| BLAKE2b-256 |
aab8f556af1ae9b9dfd82dba57fd31f9e721f9abccb53d1ffeb737afcc9c432c
|