Chapter
Clips
The concept of Q learning is discussed along with another algorithm for reinforcement learning, both of which are off-policy and enable one to learn about the environment's value of different actions while figuring out how to behave optimally.
50:10 - 53:02 (02:52)
Summary
The concept of Q learning is discussed along with another algorithm for reinforcement learning, both of which are off-policy and enable one to learn about the environment's value of different actions while figuring out how to behave optimally. The interesting idea of making predictions over time is also considered in relation to neural networks and backpropagation.
ChapterThe Magic of One-Line Code
Episode#144 – Michael Littman: Reinforcement Learning and the Future of AI
PodcastLex Fridman Podcast
The worst-case scenario in computing is often used to make developers feel good about the performance of their code, but in reality, it doesn't really matter as long as the code works with the actual set of data.
53:02 - 55:34 (02:32)
Summary
The worst-case scenario in computing is often used to make developers feel good about the performance of their code, but in reality, it doesn't really matter as long as the code works with the actual set of data.