Episode
Udio & the age of multi-modal AI
- Podcast
- Practical AI
- Published
- Apr 16, 2024
- Duration seconds
- 2332
- Processing state
failed- Canonical source
- https://share.transistor.fm/s/e0bd1784
Actions
POST https://stenobird.com/v1/public/podcasts/practical-ai/episodes/udio-the-age-of-multi-modal-ai/transcription-requests
Idempotently request low-priority transcript generation for this episode.GET https://stenobird.com/podcast/practical-ai/udio-the-age-of-multi-modal-ai.md
Read the agent-friendly Markdown representation of this episode resource.
Summary
2024 promises to be the year of multi-modal AI, and we are already seeing some amazing things. In this “fully connected” episode, Chris and Daniel explore the new Udio product/service for generating music. Then they dig into the differences between recent multi-modal efforts and more “traditional” ways of combining data modalities. Sponsors: Fly.io – The home of Changelog.com — Deploy your apps and databases close to your users. In minutes you can run your Ruby, Go, Node, Deno, Python, or Elixir app (and databases!) all over the world. No ops required. Learn more at fly.io/changelog and check out the speedrun in their docs . Changelog News – A podcast+newsletter combo that’s brief, entertaining & always on-point. Subscribe today . Featuring: Chris Benson – Website , GitHub , LinkedIn , X Daniel Whitenack – Website , GitHub , X Show Notes: Udio CLIP BridgeTower LLaVA Upcoming Events: Register for upcoming webinars here !