The Evolution of Benchmarks in AI Research

Lex Fridman Podcast

/Yann LeCun: Deep Learning, Convolutional Neural Networks, and Self-Supervised Learning

/Exploring the benchmarks of Artificial Intelligence and Machine Learning

/The Evolution of Benchmarks in AI Research

Clip

The Evolution of Benchmarks in AI Research

34:46 - 38:08 (03:22)

The classical paradigm of supervised learning involves partitioning data into a training, validation, and test set. The community has accepted benchmark tasks such as the BABY tasks proposed by FAIR to test machines' ability to reason and access working memory.

Clip

The Evolution of Benchmarks in AI Research

34:46 - 38:08 (03:22)

Similar Clips