Mastering the game of Stratego with model-free multiagent reinforcement learning.
Julien PerolatBart De VylderDaniel HennesEugene TarassovFlorian StrubVincent de BoerPaul MullerJerome T ConnorNeil BurchThomas AnthonyStephen McAleerRomuald ElieSarah H CenZhe WangAudrunas GruslysAleksandra MalyshevaMina KhanSherjil OzairFinbarr TimbersToby PohlenTom EcclesMark RowlandMarc LanctotJean-Baptiste LespiauBilal PiotShayegan OmidshafieiEdward LockhartLaurent SifreNathalie BeauguerlangeRemi MunosDavid SilverSatinder SinghDemis HassabisKarl TuylsPublished in: Science (New York, N.Y.) (2022)
We introduce DeepNash, an autonomous agent that plays the imperfect information game Stratego at a human expert level. Stratego is one of the few iconic board games that artificial intelligence (AI) has not yet mastered. It is a game characterized by a twin challenge: It requires long-term strategic thinking as in chess, but it also requires dealing with imperfect information as in poker. The technique underpinning DeepNash uses a game-theoretic, model-free deep reinforcement learning method, without search, that learns to master Stratego through self-play from scratch. DeepNash beat existing state-of-the-art AI methods in Stratego and achieved a year-to-date (2022) and all-time top-three ranking on the Gravon games platform, competing with human expert players.