Episode

Udio & the age of multi-modal AI

Podcast: Practical AI
Published: Apr 16, 2024
Duration seconds: 2332
Processing state: failed
Canonical source: https://share.transistor.fm/s/e0bd1784
Audio: https://pscrb.fm/rss/p/dts.podtrac.com/redirect.mp3/media.transistor.fm/e0bd1784/3a615bf2.mp3
JSON: /v1/public/podcasts/practical-ai/episodes/udio-the-age-of-multi-modal-ai
Markdown: /podcast/practical-ai/udio-the-age-of-multi-modal-ai.md

Actions

POST https://stenobird.com/v1/public/podcasts/practical-ai/episodes/udio-the-age-of-multi-modal-ai/transcription-requests
Idempotently request low-priority transcript generation for this episode.
GET https://stenobird.com/podcast/practical-ai/udio-the-age-of-multi-modal-ai.md
Read the agent-friendly Markdown representation of this episode resource.
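
The two actions above could be prepared from a script roughly as follows. This is a minimal sketch using only the Python standard library: the URLs are copied from the listing, while the helper names, the empty POST body, and the absence of authentication headers are assumptions, since this page does not document them. The requests are only constructed here, not sent.

```python
from urllib.request import Request

# URLs copied from the Actions listing above.
TRANSCRIPTION_URL = (
    "https://stenobird.com/v1/public/podcasts/practical-ai/"
    "episodes/udio-the-age-of-multi-modal-ai/transcription-requests"
)
MARKDOWN_URL = (
    "https://stenobird.com/podcast/practical-ai/"
    "udio-the-age-of-multi-modal-ai.md"
)


def build_transcription_request() -> Request:
    """Hypothetical helper: POST asking for transcript generation.

    Assumption: the endpoint accepts an empty body and needs no auth.
    """
    return Request(TRANSCRIPTION_URL, data=b"", method="POST")


def build_markdown_request() -> Request:
    """Hypothetical helper: GET for the Markdown view of the episode."""
    return Request(MARKDOWN_URL, method="GET")


if __name__ == "__main__":
    for req in (build_transcription_request(), build_markdown_request()):
        # Print what would be sent; nothing touches the network here.
        print(req.get_method(), req.full_url)
```

To actually send either request, pass it to `urllib.request.urlopen`. Because the page describes the POST as idempotent, repeating it for the same episode should not queue duplicate work, though that behaviour is taken from the description above rather than verified here.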

Summary

2024 promises to be the year of multi-modal AI, and we are already seeing some amazing things. In this “fully connected” episode, Chris and Daniel explore the new Udio product/service for generating music. Then they dig into the differences between recent multi-modal efforts and more “traditional” ways of combining data modalities.

Sponsors:
Fly.io: The home of Changelog.com. Deploy your apps and databases close to your users. In minutes you can run your Ruby, Go, Node, Deno, Python, or Elixir app (and databases!) all over the world. No ops required. Learn more at fly.io/changelog and check out the speedrun in their docs.
Changelog News: A podcast+newsletter combo that’s brief, entertaining & always on-point. Subscribe today.

Featuring:
Chris Benson: Website, GitHub, LinkedIn, X
Daniel Whitenack: Website, GitHub, X

Show Notes:
Udio
CLIP
BridgeTower
LLaVA

Upcoming Events:
Register for upcoming webinars here!