code-thing | status |
---|---|
master | |
develop | |
maintainability | |
coverage | |
license |
This repository will contain a best effort, open source implementation of the key ideas from the Pluribus poker AI that plays Texas Hold'em Poker. This includes the game engine needed to manage a hand of poker, and will implement the ideas from the paper with respect to the AI algorithms.
This repository assumes Python 3.7 or newer is used.
There isn't much to do with this repository at the moment but one could install the Python package by cloning this repo and pip installing it:
git clone https://github.com/fedden/pluribus-poker-AI.git # Though really we should use ssh here!
cd /path/to/pluribus-poker-AI
pip install .
I'm working on improving the testing as I progress. You can run the tests by moving to this repositories root directory (i.e pluribus-poker-AI/
) and call the python test library pytest
:
cd /path/to/pluribus-poker-AI
pip install pytest
pytest
Below is a rough structure of the repository.
βββ paper # Main source of info and documentation :)
βββ pluribus # Main Python library.
βΒ Β βββ ai # Stub functions for ai algorithms.
βΒ Β βββ games # Implementations of poker games as node based objects that
β β # can be traversed in a depth-first recursive manner.
βΒ Β βββ poker # WIP general code for managing a hand of poker.
βΒ Β βββ utils # Utility code like seed setting.
βββ research # A directory for research/development scripts
β # to help formulate understanding and ideas.
βββ scripts # Scripts to help develop the main library.
βββ test # Python tests.
βββ functional # Functional tests that test multiple components
β # together.
βββ unit # Individual tests for functions and objects.
There are two parts to this repository, the code to manage a game of poker, and the code to train an AI algorithm to play the game of poker. The reason the poker engine is being implemented is because it will likely be useful to have a well-integrated poker environment available during the development of the AI algorithm, incase there are tweaks that must be made to accomadate things like the history of state or the replay of a scenario during Monte Carlo Counterfactual Regret Minimisation. The following code is how one might program a round of poker that is deterministic using the engine. This engine is now the first pass that will be used support self play.
from pluribus import utils
from pluribus.ai.dummy import RandomPlayer
from pluribus.poker.table import PokerTable
from pluribus.poker.engine import PokerEngine
from pluribus.poker.pot import Pot
# Seed so things are deterministic.
utils.random.seed(42)
# Some settings for the amount of chips.
initial_chips_amount = 10000
small_blind_amount = 50
big_blind_amount = 100
# Create the pot.
pot = Pot()
# Instanciate six players that will make random moves, make sure
# they can reference the pot so they can add chips to it.
players = [
RandomPlayer(
name=f'player {player_i}',
initial_chips=initial_chips_amount,
pot=pot)
for player_i in range(6)
]
# Create the table with the players on it.
table = PokerTable(players=players, pot=pot)
# Create the engine that will manage the poker game lifecycle.
engine = PokerEngine(
table=table,
small_blind=small_blind_amount,
big_blind=big_blind_amount)
# Play a round of Texas Hold'em Poker!
engine.play_one_round()
The Pluribus AI algorithm is the next thing to implement so more coming on that as soon as possible...
The following todo will change dynamically as my understanding of the algorithms and the pluribus project evolves.
At first, the goal is to prototype in Python as iteration will be much easier and quicker. Once there is a working prototype, write in a systems level language like C++ and optimise for performance.
Implement a multiplayer working heads up no limit poker game engine to support the self-play.
- Lay down the foundation of game objects (player, card etc).
- Add poker hand evaluation code to the engine.
- Support a player going all in during betting.
- Support a player going all in during payouts.
- Lots of testing for various scenarios to ensure logic is working as expected.
Iterate on the AI algorithms and the integration into the poker engine.
- Integrate the AI strategy to support self-play in the multiplayer poker game engine.
- In the game-engine, allow the replay of any round the current hand to support MCCFR.
- Implement the creation of the blueprint strategy using Monte Carlo CFR miminisation.
- Add the real-time search for better strategies during the game.
Strengthen the game engine with more tests and allow users to see live visualisation of game state.
- Add a simple visualisation to allow a game to be visualised as it progresses.
- Triple check that the rules are implemented in the poker engine as described in the supplimentary material.
- Work through the coverage, adding more tests, can never have enough.
This is an open effort and help, criticisms and ideas are all welcome.
First of all, please check out the CONTRIBUTING guide.
Feel free to start a discussion on the github issues or to reach out to me at leonfedden at gmail dot com.
There have already been a lot of helpful discussions and codebases on the path to building this project, which I'll try to keep updated with links to as I progress.
Naturally the first thing that should be acknowledged is the original paper. Here are the links to the paper that will be referenced to build the AI.
Following are blogposts and discussions on the paper that served as helpful references.
Big shout out to the authors of the following repositories! Here are some MIT licensed codebases that I have found, pillaged and refactored to serve as the basis of the poker engine.
- Poker game code based on this (dead!?!) python package
- Pretty darn efficient poker hand evaluation (python 3 fork)
Useful tools that contributed to the making of the poker engine:
Linked Notes
MISC:
Other useful blog links, papers and resources: