Interactive reinforcement learning sandbox for experimenting with AI agents in a classic Snake Game environment.

These details have not been verified by PyPI

Project links

Project description

AI Snake Lab

Introduction

AI Snake Lab is an interactive reinforcement learning sandbox for experimenting with AI agents in a classic Snake Game environment — featuring a live Textual TUI interface, flexible replay memory database, and modular model definitions.

🚀 Features

🐍 Classic Snake environment with customizable grid and rules
🧠 AI agent interface supporting multiple architectures (Linear, RNN, CNN)
🎮 Textual-based simulator for live visualization and metrics
💾 SQLite-backed replay memory for storing frames, episodes, and runs
🧩 Experiment metadata tracking — models, hyperparameters, state-map versions
📊 Built-in plotting for hashrate, scores, and learning progress

🧰 Tech Stack

Component	Description
Python 3.11+	Core language
Textual	Terminal UI framework
SQLite3	Lightweight replay memory + experiment store
PyTorch (optional)	Deep learning backend for models
Plotext / Matplotlib	Visualization tools

Dynamic Training

The Dynamic Training checkbox implements an increasing amount of long training that is executed at the end of each epoch. The AIAgent:train_long_memory() code is run an increasing number of times at the end of each epoch as the total number of epochs increases to a maximum of MAX_ADAPTIVE_TRAINING_LOOPS, which is currently 16 according to the calculation shown below:

    if self.dynamic_training():
        loops = max(
            1, min(self.epoch() // 250, DAIAgent.MAX_DYNAMIC_TRAINING_LOOPS)
        )
    else:
        loops = 1

    while loops > 0:
        loops -= 1
        self.train_long_memory()

Epsilon N

The Epsilon N class, inspired by Richard Sutton's The Bitter Lesson, is a drop-in replacement for the traditional Epsilon Greedy algorithm. It instantiates a traditional Epsilon Greedy instance for each score. For example, when the AI has a score of 7, an Epsilon Greedy instance at self._epsilons[7] is used.

The screenshot below shows the Highscores plot with the traditional, single instance Epsilon Greedy algorithm and with Dynamic Training enabled. The single-instance epsilon algorithm quickly converges but shows uneven progress. The abrupt jumps in the high score line indicate that exploration declines before the AI fully masters each stage of play.

Traditional Epsilon Greedy

The next screenshot shows the Highscores plot without Dynamic Training enabled and with the Epsilon N being used. By maintaining a separate epsilon for each score level, Epsilon N sustains exploration locally. This results in a smoother, more linear high-score curve as the AI consolidates learning at each stage before progressing.

Epsilon N

As shown above, the traditional Epsilon Greedy approach leads to slower improvement, with the highscore curve flattening early. By contrast, Epsilon N maintains steady progress as the AI masters each score level independently.

Installation

This project is on PyPI. You can install the AI Snake Lab software using pip.

Create a Sandbox

python3 -m venv snake_venv
. snake_venv/bin/activate

Install the AI Snake Lab

After you have activated your venv environment:

pip install ai-snake-lab

Running the AI Snake Lab

From within your venv environment:

ai-snake-lab

Limitations

Because the simulation runs in a Textual TUI, terminal resizing and plot redraws can subtly affect the simulation timing. As a result, highscore achievements may appear at slightly different game numbers across runs. This behavior is expected and does not indicate a bug in the AI logic.

If you have a requirement to make simulation runs repeatable, drop me a note and I can implement a headless mode where the simulation runs in a separate process outside of Textual.

Acknowledgements

The original code for this project was based on a YouTube tutorial, Python + PyTorch + Pygame Reinforcement Learning – Train an AI to Play Snake by Patrick Loeber. You can access his original code here on GitHub. Thank you Patrick!!! You are amazing!!!! This project is a port of the pygame and matplotlib solution.

Thanks also go out to Will McGugan and the Textual team. Textual is an amazing framework. Talk about Rapid Application Development. Porting this from a Pygame and MatPlotLib solution to Textual took less than a day.

Inspiration

Creating an artificial intelligence agent, letting it loose and watching how it performs is an amazing process. It's not unlike having children, except on a much, much, much smaller scale, at least today! Watching the AI driven Snake Game is mesmerizing. I'm constantly thinking of ways I could improve it. I credit Patrick Loeber for giving me a fun project to explore the AI space.

Much of my career has been as a Linux Systems administrator. My comfort zone is on the command line. I've never worked as a programmer and certainly not as a front end developer. Textual, as a framework for building rich Terminal User Interfaces is exactly my speed and when I saw Dolphie, I was blown away. Built-in, real-time plots of MySQL metrics: Amazing!

Richard S. Sutton is also an inspiration to me. His thoughts on Reinforcement Learning are a slow motion revolution. His criticisms of the existing AI landscape with it's focus on engineering a specific AI to do a specific task and then considering the job done is spot on. His vision for an AI agent that does continuous, non-linear learning remains the next frontier on the path to General Artificial Intelligence.

Technical Docs

Links

Patrick Loeber's YouTube Tutorial
Will McGugan's Textual Rapid Application Development framework
Dolphie: A single pane of glass for real-time analytics into MySQL/MariaDB & ProxySQL
Richard Sutton's Homepage
Richard Sutton quotes and other materials.
Useful Plots to Diagnose your Neural Network by George V Jose
A Deep Dive into Learning Curves in Machine Learning by Mostafa Ibrahim

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.12.1

Oct 31, 2025

0.12.0

Oct 25, 2025

0.11.3

Oct 23, 2025

0.11.1

Oct 23, 2025

0.10.1

Oct 19, 2025

This version

0.10.0

Oct 16, 2025

0.9.0

Oct 14, 2025

0.8.1

Oct 13, 2025

0.8.0

Oct 13, 2025

0.7.0

Oct 13, 2025

0.6.0

Oct 12, 2025

0.5.0

Oct 12, 2025

0.4.8

Oct 11, 2025

0.4.7

Oct 11, 2025

0.4.6

Oct 11, 2025

0.4.5

Oct 11, 2025

0.4.4

Oct 11, 2025

0.4.3

Oct 11, 2025

0.1.0

Oct 10, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ai_snake_lab-0.10.0.tar.gz (40.7 kB view details)

Uploaded Oct 16, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

ai_snake_lab-0.10.0-py3-none-any.whl (50.8 kB view details)

Uploaded Oct 16, 2025 Python 3

File details

Details for the file ai_snake_lab-0.10.0.tar.gz.

File metadata

Download URL: ai_snake_lab-0.10.0.tar.gz
Upload date: Oct 16, 2025
Size: 40.7 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: poetry/2.2.1 CPython/3.11.13 Linux/6.11.0-1018-azure

File hashes

Hashes for ai_snake_lab-0.10.0.tar.gz
Algorithm	Hash digest
SHA256	`1418ea2e57877495e95da241070ab494ea18da1e915e015f04bcbee9e3fbba74`
MD5	`5d553b28180da8f09b20b7c6e7925c68`
BLAKE2b-256	`01acad7ba3671910e63f0893b00afb1cb51c93612d0a4aab01aa554bbb3d5384`

See more details on using hashes here.

File details

Details for the file ai_snake_lab-0.10.0-py3-none-any.whl.

File metadata

Download URL: ai_snake_lab-0.10.0-py3-none-any.whl
Upload date: Oct 16, 2025
Size: 50.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: poetry/2.2.1 CPython/3.11.13 Linux/6.11.0-1018-azure

File hashes

Hashes for ai_snake_lab-0.10.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`d46ec29c0356b0658ce163dadb62adb6bfda9875cfbe12a4965a3c6659f0c1f6`
MD5	`35266e7f034dc918006e4e4950b35489`
BLAKE2b-256	`fd3b9885a27dd5153f497fb7ef1e00a829628b09c1ba0871179464f2c3345dbf`

See more details on using hashes here.

ai-snake-lab 0.10.0

Navigation

Verified details

Maintainers

Meta

Unverified details

Project links

Meta

Classifiers

Project description

AI Snake Lab

Introduction

🚀 Features

🧰 Tech Stack

Dynamic Training

Epsilon N

Installation

Create a Sandbox

Install the AI Snake Lab

Running the AI Snake Lab

Limitations

Acknowledgements

Inspiration

Technical Docs

Links

Project details

Verified details

Maintainers

Meta

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes