Clip

The Evolution of Benchmarks in AI Research
listen on Spotify
34:46 - 38:08 (03:22)

The classical paradigm of supervised learning involves partitioning data into a training, validation, and test set. The community has accepted benchmark tasks such as the BABY tasks proposed by FAIR to test machines' ability to reason and access working memory.

Similar Clips