Clip

A Transformer Model for Predicting the Next Action and Observation
This podcast discusses a transformer model that uses a sequence of modalities, such as observations and actions, to predict what the next action and observation will be. The model is more than just a language model and serves as an agent.