The Method Behind the Poker AI That Took the Pros for $1.7 Million
7 years ago

19 Dec
âThe Machines are coming!â was the cry earlier this year when poker-playing Artificial Intelligence (AI) bot Libratus defeated four of the best pros in the game for the best part of $1.76million, and the scientific paper released this week by the brains behind the machine will do little to allay fears that at some point, humans â in poker at least â will become redundant.
Back in January, the team of Jason Les, Dong Kim, Daniel McAuley and Jimmy Chou started out quite optimistic that they would prevail against a newer version of the program which some of them had defeated the previous year â but by the end of the match they were in awe of the AIâs performance, though little in the way of detail as to how Libratus managed to overpower them was released.
That changed when Sciencemag.org released a paper by the machineâs creators - Noam Brown and Tuomas Sandholm of Carnegie Mellon University in Pittsburgh, Pennsylvania â which went into depth as to how they overcame the challenge of programming an AI which could somehow compete with humans in poker â a game of imperfect information.
The duoâs paper starts out by stating: âDespite AI successes in perfect-information games, the private information and massive game tree have made no-limit poker difficult to tackleâ, the pair referencing perhaps this monthâs big news that saw chess, Go and Shogi-lovers all stunned when an AI program took mere hours to basically learn the game from scratch and then massacre the top computer programs.
Poker, for those in the know, is a different matter altogether â so how did Brown and Sandholm solve the problems inherent in a game which just a couple of months ago they described as having â10^163 (thatâs 10 followed by 163 zeroes!) different game situationsâ?
âOur game-theoretic approach features application-independent techniques: an algorithm for computing a blueprint for the overall strategy, an algorithm that fleshes out the details of the strategy for subgames that are reached during play, and a self-improver algorithm that fixes potential weaknesses that opponents have identified in the blueprint strategy,â write Brown and Sandholm, and if that sound a bit too technical, letâs see if we can break down the three steps to AI victoryâŠ
First up, the duo set up a simple version of the game â an âabstractionâ â and part of the program then âcomputes game-theoretic strategiesâ for this smaller version. This solution becomes the âblueprintâ for later stages of the AI development â âan approximation for how to play in the more numerous later parts of the gameâ.
Secondly, Libratusâ 2nd module refines this blueprint strategy and solves it in âreal-timeâ - a necessary part of competitive play in almost any game, poker of course included. The advantages of their new approach means that âWhenever the opponent makes a move that is not in the abstractionâ - the simpler version in part 1 - a âsubgame is solved with that action included. We call this nested subgame solving.â Basically refining the process and solution as it goes along.
Part 3 of Libratusâ innovative and now-proven technique is called âthe self-improverâ by the computer science geniuses and is designed to âenhances the blueprint strategyâ.
The simpler version could be imagined as having far more intricate âbranchesâ to become the full game, and the self-improver module âcomputes a game-theoretic strategy for those branchesâ.
Because it would become way too complex to solve all of these possibilities, as seen by the numbers given above in the Match Poker post, the authors of the paper on Libratus decided to âtame this complexityâ, by using âthe opponentsâ actual moves to suggest where in the game tree such filling is worthwhileâ. Not every branch is useful in practice, so the AI works on what it faces in reality.

The ground-breaking work done by Brown and Sandholm includes extremely detailed information which only computer science and AI buffs and experts will understand fully, but the basic news that the breakthrough which the match against the top pros appeared to be, is now backed by scientific process - open to peer review and doubtless the beginning of even exciting and fresh approaches to taming the game of poker.
It may or may not happen in our lifetimes â but it would be a brave (or perhaps foolish) person who bets against it!






Comments
You need to be logged in to post a new comment