Clip

The Power of Regret Minimization in Learning AI Strategies
18:31 - 20:25 (01:54)

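By choosing each action with probability proportional to its accumulated positive regret (how much better that action would have done than what was actually played), algorithms can learn to play games through repeated self-play. In two-player zero-sum games, the players' average strategies converge to a Nash equilibrium. The clip illustrates this with an AI learning to play poker.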
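The idea can be illustrated with a minimal sketch of regret matching in self-play. This is not code from the clip: rock-paper-scissors stands in for poker because it fits in a few lines, and the names `PAYOFF`, `strategy_from_regrets`, and `train`, the iteration count, and the biased starting regrets are all assumptions made for illustration. Each player updates regrets using the expected payoff of every action against the opponent's current mixed strategy.

```python
# Payoff to the row player in rock-paper-scissors (order: rock, paper, scissors).
# The column player receives the negative of these values (zero-sum game).
PAYOFF = [
    [0, -1, 1],
    [1, 0, -1],
    [-1, 1, 0],
]
N = 3  # number of actions


def strategy_from_regrets(regrets):
    """Play each action with probability proportional to its positive regret."""
    positive = [max(r, 0.0) for r in regrets]
    total = sum(positive)
    if total > 0:
        return [p / total for p in positive]
    # No action has positive regret: fall back to the uniform strategy.
    return [1.0 / N] * N


def train(iterations=20000):
    """Run regret matching in self-play and return both average strategies."""
    # Hypothetical starting point: player 0 begins biased toward rock,
    # so the learning dynamics are visible instead of starting at equilibrium.
    regrets = [[1.0, 0.0, 0.0], [0.0, 0.0, 0.0]]
    strategy_sums = [[0.0] * N, [0.0] * N]

    for _ in range(iterations):
        s0 = strategy_from_regrets(regrets[0])
        s1 = strategy_from_regrets(regrets[1])

        # Expected payoff of each pure action against the opponent's current mix.
        u0 = [sum(PAYOFF[a][b] * s1[b] for b in range(N)) for a in range(N)]
        u1 = [sum(-PAYOFF[a][b] * s0[a] for a in range(N)) for b in range(N)]

        for player, (strategy, utils) in enumerate(((s0, u0), (s1, u1))):
            expected = sum(p * u for p, u in zip(strategy, utils))
            for a in range(N):
                # Regret: how much better action a would have done than the mix.
                regrets[player][a] += utils[a] - expected
                strategy_sums[player][a] += strategy[a]

    # The average strategy, not the final one, is what approaches equilibrium.
    return [[x / iterations for x in sums] for sums in strategy_sums]


if __name__ == "__main__":
    for player, average in enumerate(train()):
        print(player, [round(p, 3) for p in average])
```

In this sketch both average strategies should drift toward the uniform mix (one third each), which is the Nash equilibrium of rock-paper-scissors. Poker programs typically rely on a more elaborate variant that applies the same rule at every decision point of a game tree, which is beyond this example.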
