Skip to main content

Texas Hold'em environment with LLM players

Project description

README.md

llm-poker

A minimal Texas Hold’em environment that seats multiple LLM-based players (via the llm library) and manages everything from dealing hole cards to forced blinds, betting rounds, and a straightforward showdown.

Core features:

  • Blinds: Each hand forces a small blind and a big blind, ensuring there’s money in the pot.
  • Betting: We query each LLM once per betting round, requesting an action in strict JSON form (fold, call, or raise).
  • Local showdown logic: The environment determines the best 5-card hand from each player’s 7 cards and awards the pot.
  • Pydantic-based JSON validation: The LLM responses are parsed and validated. If invalid, we retry.
  • Optional CLI: The poker-eval command can run multiple rounds using the specified LLMs.

Installation

  1. Clone this repository or download the files.
  2. Install the package (in editable mode) from the project root (where setup.py is located):
    pip install -e .
    
  3. Verify You should see llm-poker installed
    pip list | grep llm-poker
    

You must also configure your llm library with the API keys for whichever LLM models you plan to use (e.g., gpt-4o, Anthropic, etc.). For example:

llm keys set openai

Quickstart Examples

  1. Running the Sample run.py
python run.py

Deals up to 5 rounds between multiple players: gpt-4o, claude-3-5-haiku-latest, claude-3-5-sonnet-latest, deepseek-chat. Uses elimination_count=0 so the game does not stop early (unless someone busts). The minimum raise is 500 chips. Logs each hand’s actions, culminating in a final standings table.

  1. Using the CLI If you installed with the included console script, you can do:
poker-eval --models gpt-4o --models claude-3-5-haiku-latest --rounds 5

This deals 5 rounds of heads-up between gpt-4o and claude-3-5-haiku-latest.

Once installed, you have access to:

poker-eval [OPTIONS]

--models/-m: Multiple model names or aliases recognized by llm (defaults to ["gpt-4o"]). --rounds/-r: How many hands to deal (default 3). --elimination-count/-e: Stop once only this many players remain (default 1). --stack/-s: Starting chip stack (default 10000).


Known Limitations

  • No side pots: Currently, if a player goes all-in, the environment doesn’t handle side pots.
  • Manual environment checks: If the LLM returns “check” while facing a bet, the code interprets it as invalid and re-prompts.
  • Fictitious ‘expert-level poker AI’: The LLM’s strategic brilliance is not guaranteed. This is more a demonstration environment than a truly advanced solver.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

llm_poker-0.1.2.tar.gz (9.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

llm_poker-0.1.2-py3-none-any.whl (12.0 kB view details)

Uploaded Python 3

File details

Details for the file llm_poker-0.1.2.tar.gz.

File metadata

  • Download URL: llm_poker-0.1.2.tar.gz
  • Upload date:
  • Size: 9.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.6

File hashes

Hashes for llm_poker-0.1.2.tar.gz
Algorithm Hash digest
SHA256 bb08ea77d84eafe2135c70a091324119fc2eec2cc216181a38ed4819c8755c9b
MD5 53fe7e0290d030c5ca57798e33782585
BLAKE2b-256 1d918e685268d76479ef0f319780df63ccd3525130695b967ec2bfb7f835921a

See more details on using hashes here.

File details

Details for the file llm_poker-0.1.2-py3-none-any.whl.

File metadata

  • Download URL: llm_poker-0.1.2-py3-none-any.whl
  • Upload date:
  • Size: 12.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.6

File hashes

Hashes for llm_poker-0.1.2-py3-none-any.whl
Algorithm Hash digest
SHA256 ac2de168f4a94aaad19ab67daf13860028dedcc81f1a31d10466ab1ca9c2b08b
MD5 706422ed746c6e3ebfe8c7f82001c531
BLAKE2b-256 157b3fabe1237cac4a4c0bb58a6f811fa0fcd7e29293092b92947c66403f4858

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page