Clip

Understanding the Policy Gradient Algorithm
The policy gradient algorithm is a type of deep reinforcement learning that updates neural networks to make certain actions more likely than others. This algorithm can be used to create more interactive robots that can eventually figure out what kind of behavior is desirable through trial and error.