The Power of Regret Minimization in Learning AI Strategies

18:31 - 20:25 (01:54)

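By choosing each action with probability proportional to its accumulated positive regret (how much better that action would have done in hindsight), algorithms can learn to play games through repeated self-play. In two-player zero-sum games, the players' average strategies converge to a Nash equilibrium. The clip gives the example of an AI learning to play poker.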
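The idea in the summary above can be sketched in a few lines. This is a minimal illustration of regret matching in self-play, not the method shown in the clip: it assumes rock-paper-scissors instead of poker so that it stays short, and the names `payoff`, `strategy_from_regret`, and `train` are hypothetical helpers chosen for this example.

```python
import random

NUM_ACTIONS = 3  # assumed toy game: 0 = rock, 1 = paper, 2 = scissors


def payoff(a, b):
    """Payoff to the player choosing a against b: +1 win, 0 tie, -1 loss."""
    return [0, 1, -1][(a - b) % 3]


def strategy_from_regret(regret):
    """Regret matching: play each action in proportion to its positive regret."""
    positive = [max(r, 0.0) for r in regret]
    total = sum(positive)
    if total > 0:
        return [p / total for p in positive]
    # No action has positive regret yet, so fall back to uniform play.
    return [1.0 / len(regret)] * len(regret)


def train(iterations, seed=0):
    """Run two regret-matching players against each other.

    Returns each player's average strategy over all iterations.
    """
    rng = random.Random(seed)
    regrets = [[0.0] * NUM_ACTIONS for _ in range(2)]
    strategy_sums = [[0.0] * NUM_ACTIONS for _ in range(2)]
    for _ in range(iterations):
        strategies = [strategy_from_regret(r) for r in regrets]
        moves = [rng.choices(range(NUM_ACTIONS), weights=s)[0] for s in strategies]
        for p in (0, 1):
            opponent_move = moves[1 - p]
            actual = payoff(moves[p], opponent_move)
            for a in range(NUM_ACTIONS):
                # Regret of a: what a would have earned minus what was earned.
                regrets[p][a] += payoff(a, opponent_move) - actual
                strategy_sums[p][a] += strategies[p][a]
    return [[s / iterations for s in sums] for sums in strategy_sums]


if __name__ == "__main__":
    for player, average in enumerate(train(50_000)):
        print(player, [round(x, 3) for x in average])
```

In this toy game the Nash equilibrium is to play each action one third of the time, so the printed averages should drift toward that mix as the iteration count grows. Note that it is the average strategy, not the strategy used on the final iteration, that approaches equilibrium. Poker needs an extension of this idea to sequential games with hidden information, which this sketch does not attempt.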
