Episode

V-JEPA, AI Reasoning from a Non-Generative Architecture with Mido Assran - #677

Podcast
The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)
Published
Mar 25, 2024
Duration seconds
2867
Processing state
failed
Canonical source
https://twimlai.com/podcast/twimlai/v-jepa-ai-reasoning-from-a-non-generative-architecture/
Audio
https://pscrb.fm/rss/p/traffic.megaphone.fm/MLN9999482985.mp3?updated=1711380866
JSON
/v1/public/podcasts/twiml-ai-podcast/episodes/v-jepa-ai-reasoning-from-a-non-generative-architecture-with-mido-assran-677
Markdown
/podcast/twiml-ai-podcast/v-jepa-ai-reasoning-from-a-non-generative-architecture-with-mido-assran-677.md

Actions

  • POST https://stenobird.com/v1/public/podcasts/twiml-ai-podcast/episodes/v-jepa-ai-reasoning-from-a-non-generative-architecture-with-mido-assran-677/transcription-requests
    Idempotently request low-priority transcript generation for this episode.
  • GET https://stenobird.com/podcast/twiml-ai-podcast/v-jepa-ai-reasoning-from-a-non-generative-architecture-with-mido-assran-677.md
    Read the agent-friendly Markdown representation of this episode resource.

Summary

Today we’re joined by Mido Assran, a research scientist at Meta’s Fundamental AI Research (FAIR). In this conversation, we discuss V-JEPA, a new model being billed as “the next step in Yann LeCun's vision” for true artificial reasoning. V-JEPA, the video version of Meta’s Joint Embedding Predictive Architecture, aims to bridge the gap between human and machine intelligence by training models to learn abstract concepts in a more efficient predictive manner than generative models. V-JEPA uses a novel self-supervised training approach that allows it to learn from unlabeled video data without being distracted by pixel-level detail. Mido walks us through the process of developing the architecture and explains why it has the potential to revolutionize AI. The complete show notes for this episode can be found at twimlai.com/go/677.