#206 – Ishan Misra: Self-Supervised Deep Learning in Computer Vision

Lex Fridman Podcast

/#206 – Ishan Misra: Self-Supervised Deep Learning in Computer Vision

Episode

#206 – Ishan Misra: Self-Supervised Deep Learning in Computer Vision

2:35:49

Published: Sat Jul 31 2021

Description

Ishan Misra is a research scientist at FAIR working on self-supervised visual learning. Please support this podcast by checking out our sponsors: - Onnit: https://lexfridman.com/onnit to get up to 10% off - The Information: https://theinformation.com/lex to get 75% off first month - Grammarly: https://grammarly.com/lex to get 20% off premium - Athletic Greens: https://athleticgreens.com/lex and use code LEX to get 1 month of fish oil EPISODE LINKS: Ishan's twitter: https://twitter.com/imisra_ Ishan's website: https://imisra.github.io Ishan's FAIR page: https://ai.facebook.com/people/ishan-misra/ PODCAST INFO: Podcast website: https://lexfridman.com/podcast Apple Podcasts: https://apple.co/2lwqZIr Spotify: https://spoti.fi/2nEwCF8 RSS: https://lexfridman.com/feed/podcast/ YouTube Full Episodes: https://youtube.com/lexfridman YouTube Clips: https://youtube.com/lexclips SUPPORT & CONNECT: - Check out the sponsors above, it's the best way to support this podcast - Support on Patreon: https://www.patreon.com/lexfridman - Twitter: https://twitter.com/lexfridman - Instagram: https://www.instagram.com/lexfridman - LinkedIn: https://www.linkedin.com/in/lexfridman - Facebook: https://www.facebook.com/lexfridman - Medium: https://medium.com/@lexfridman OUTLINE: Here's the timestamps for the episode. On some podcast players you should be able to click the timestamp to jump to that time. (00:00) - Introduction (07:49) - Self-supervised learning (16:24) - Self-supervised learning is the dark matter of intelligence (20:17) - Categorization (28:50) - Is computer vision still really hard? (32:35) - Understanding Language (42:14) - Harder to solve: vision or language (48:59) - Contrastive learning & energy-based models (52:59) - Data augmentation (57:19) - Fixed audio spike by lowering sound with pen tool (1:05:33) - Real data vs. augmented data (1:09:16) - Non-contrastive learning energy based self supervised learning methods (1:12:54) - Unsupervised learning (SwAV) (1:15:37) - Self-supervised Pretraining (SEER) (1:20:44) - Self-supervised learning (SSL) architectures (1:26:43) - VISSL pytorch-based SSL library (1:29:38) - Multi-modal (1:37:06) - Active learning (1:42:45) - Autonomous driving (1:54:12) - Limits of deep learning (1:58:19) - Difference between learning and reasoning (2:03:26) - Building super-human AI (2:11:14) - Most beautiful idea in self-supervised learning (2:15:02) - Simulation for training AI (2:18:27) - Video games replacing reality (2:19:40) - How to write a good research paper (2:24:08) - Best programming language for beginners (2:25:01) - PyTorch vs TensorFlow (2:28:26) - Advice for getting into machine learning (2:30:31) - Advice for young people (2:32:58) - Meaning of life

Chapters

Self-Supervised Learning in Computer Vision with Ishan Misra

Ishan Misra, research scientist at Facebook AI Research, discusses self-supervised machine learning in computer vision, including the use of transformers and self-attention in language models.

00:00 - 02:00 (02:00)

Self-Supervised Learning

Summary

Ishan Misra, research scientist at Facebook AI Research, discusses self-supervised machine learning in computer vision, including the use of transformers and self-attention in language models.

Episode

#206 – Ishan Misra: Self-Supervised Deep Learning in Computer Vision

2:35:49

Published: Sat Jul 31 2021

Description

Chapters

Self-Supervised Learning in Computer Vision with Ishan Misra

Ishan Misra, research scientist at Facebook AI Research, discusses self-supervised machine learning in computer vision, including the use of transformers and self-attention in language models.

00:00 - 02:00 (02:00)

Summary

#206 – Ishan Misra: Self-Supervised Deep Learning in Computer Vision

Lex Fridman Podcast

Tips for Deep Work Sessions

This podcast discusses the benefits of deep work sessions when tackling specific problems that require depth versus breadth, and provides tips for improving writing and thinking.

02:00 - 06:16 (04:15)

Summary

#206 – Ishan Misra: Self-Supervised Deep Learning in Computer Vision

Lex Fridman Podcast

Exploring Self-supervised and Semi-supervised Learning

The podcast discusses different learning paradigms including self-supervised and semi-supervised learning, which overcome some of the challenges of traditional supervised learning.

06:16 - 12:40 (06:23)

Summary

#206 – Ishan Misra: Self-Supervised Deep Learning in Computer Vision

Lex Fridman Podcast

Self-Supervised Learning is the Future of Machine Learning Algorithms

The acceptance of the fact that self-supervised learning is likely to play an important role in future machine learning algorithms is growing.

12:40 - 20:06 (07:26)

Summary

#206 – Ishan Misra: Self-Supervised Deep Learning in Computer Vision

Lex Fridman Podcast

The Role of Categorization and Understanding in Problem Solving

The usefulness of categorization and its limitations in problem-solving are discussed, with a focus on the role of self-supervised learning versus supervised learning.

20:06 - 26:03 (05:56)

Summary

#206 – Ishan Misra: Self-Supervised Deep Learning in Computer Vision

Lex Fridman Podcast

Understanding Symbolic AI and Self-supervised Learning for Deep Sense of the World

The podcast discusses the importance of common sense in building a deep understanding of the world and how self-supervised learning can play a role in achieving this understanding.

26:03 - 31:26 (05:23)

Summary

#206 – Ishan Misra: Self-Supervised Deep Learning in Computer Vision

Lex Fridman Podcast

The Advantages of Self-Supervised Learning

Self-supervised learning can be useful for many tasks, particularly prior to the point where the machine needs to communicate with a human.

31:26 - 36:37 (05:11)

Summary

#206 – Ishan Misra: Self-Supervised Deep Learning in Computer Vision

Lex Fridman Podcast

Contextual Understanding of Language and Images by Neural Networks

The improvement in the neural networks used for natural language processing and image recognition has been achieved through the use of context in the form of a wide context to understand a word in context, or local context to understand a pattern in an image.

36:39 - 42:53 (06:13)

Summary

#206 – Ishan Misra: Self-Supervised Deep Learning in Computer Vision

Lex Fridman Podcast

The Common Thread in Modern Machine Learning Methods

This podcast explores how modern machine learning methods such as GANs, VAEs, and contrastive models are related through a common language and energy function.

42:53 - 53:07 (10:14)

Summary

#206 – Ishan Misra: Self-Supervised Deep Learning in Computer Vision

Lex Fridman Podcast

Understanding Data Augmentation for Neural Networks

This podcast discusses the concept of data augmentation, which involves perturbing and augmenting data to improve a neural network's performance.

53:07 - 58:25 (05:17)

Summary

#206 – Ishan Misra: Self-Supervised Deep Learning in Computer Vision

Lex Fridman Podcast

The Role of Data Augmentation in Machine Learning

This podcast discusses the importance and potential benefits of data augmentation in machine learning, particularly in the context of medical imaging.

58:25 - 1:05:22 (06:56)

Summary

#206 – Ishan Misra: Self-Supervised Deep Learning in Computer Vision

Lex Fridman Podcast

The Importance of Data Augmentation for Learning Algorithms in Vision

The success of learning algorithms for vision is heavily dependent on good data augmentation, even with an infinite source of image data.

1:05:22 - 1:15:27 (10:05)

Summary

#206 – Ishan Misra: Self-Supervised Deep Learning in Computer Vision

Lex Fridman Podcast

The Challenges of Uncurated Data for Self-Supervised Learning

The use of uncurated data for self-supervised learning presents challenges due to the inherent biases of photographers, and the reliance on data augmentation techniques designed for ImageNet.