
Multi-armed bandits are also well known in game AI. They became popular with the introduction of Monte Carlo Tree Search, where a MAB policy is used to select which subtrees to search for the largest expected payoff, e.g.:

https://www.aaai.org/ocs/index.php/AIIDE/AIIDE13/paper/view/...

https://courses.cs.washington.edu/courses/cse599i/18wi/resou...

etc.

For what it's worth, the MAB algorithm in the original post looks like epsilon-greedy. It's probably better to look into Upper Confidence Bound variants like UCB1, which dynamically adjust how much to explore vs. exploit, e.g.:

https://towardsdatascience.com/comparing-multi-armed-bandit-...
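To make the comparison concrete, here is a minimal sketch of UCB1 (not the code from the original post; names like `counts` and `rewards` are illustrative). Instead of exploring at a fixed epsilon rate, each arm gets a confidence bonus that shrinks as it is pulled more often:

```python
import math
import random

def ucb1_select(counts, rewards, c=math.sqrt(2)):
    """Pick the arm maximizing mean reward + c*sqrt(ln(total)/n_arm)."""
    # Pull each arm once before trusting the confidence bounds.
    for i, n in enumerate(counts):
        if n == 0:
            return i
    total = sum(counts)
    def bound(i):
        mean = rewards[i] / counts[i]
        return mean + c * math.sqrt(math.log(total) / counts[i])
    return max(range(len(counts)), key=bound)

# Simple simulation against Bernoulli arms with (unknown) win rates.
def run(true_probs, steps=10000, seed=0):
    rng = random.Random(seed)
    counts = [0] * len(true_probs)
    rewards = [0.0] * len(true_probs)
    for _ in range(steps):
        arm = ucb1_select(counts, rewards)
        counts[arm] += 1
        rewards[arm] += 1.0 if rng.random() < true_probs[arm] else 0.0
    return counts

counts = run([0.2, 0.5, 0.8])
# The best arm (index 2) ends up pulled far more often than the others.
```

The key property: exploration falls out of the bound itself, so there is no epsilon hyperparameter to tune, and pulls of clearly inferior arms decay logarithmically rather than at a constant rate.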



UCB on game trees (MCTS) was the first breakthrough that produced decently strong Go programs, if I remember correctly.


You are correct. MCTS + UCB (UCT) and other variants were state of the art leading up to AlphaGo, and AlphaGo itself still used MCTS.

The main change in AlphaGo was adding deep networks: a policy network to guide move selection (rather than a pure UCB rule) and a value network to evaluate positions alongside fast rollouts. They later removed the rollouts entirely (AlphaGo Zero merged the policy and value heads into a single network), but even AlphaZero still uses MCTS.
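The selection rule AlphaGo-style MCTS uses in place of plain UCB is often called PUCT: the exploration term is scaled by the policy network's prior for each move. A minimal sketch (field names like `prior` and the constant `c_puct` are illustrative, not from the post):

```python
import math

def puct_select(children, c_puct=1.5):
    """Select a child by the PUCT rule used in AlphaGo-style MCTS:

        Q(s,a) + c_puct * P(s,a) * sqrt(N(s)) / (1 + N(s,a))

    `children` maps each move to a dict with 'visits', 'value_sum',
    and 'prior' (the policy network's probability for that move).
    """
    total_visits = sum(ch["visits"] for ch in children.values())
    def score(item):
        _, ch = item
        q = ch["value_sum"] / ch["visits"] if ch["visits"] else 0.0
        u = c_puct * ch["prior"] * math.sqrt(total_visits) / (1 + ch["visits"])
        return q + u
    return max(children.items(), key=score)[0]

# A rarely visited move with a high prior gets explored despite a low Q.
children = {
    "a": {"visits": 50, "value_sum": 30.0, "prior": 0.1},
    "b": {"visits": 1, "value_sum": 0.0, "prior": 0.8},
}
move = puct_select(children)  # picks "b": its prior-driven bonus dominates
```

Note the contrast with UCB1: the prior P(s,a) lets the network focus search on promising moves immediately, instead of having to try every move once.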



