The Power of Regret Minimization in Learning AI Strategies
18:31 - 20:25 (01:54)
By picking actions that have higher regret with higher probability, algorithms can learn how to play games and converge to Nash equilibrium. An example is given with AI learning how to play poker.