Understanding Q learning and Predictions Over Time

Clip

Understanding Q learning and Predictions Over Time

50:10 - 53:02 (02:52)

The concept of Q learning is discussed along with another algorithm for reinforcement learning, both of which are off-policy and enable one to learn about the environment's value of different actions while figuring out how to behave optimally. The interesting idea of making predictions over time is also considered in relation to neural networks and backpropagation.

Clip

Understanding Q learning and Predictions Over Time

50:10 - 53:02 (02:52)

Similar Clips