# Udio & the age of multi-modal AI

- Page: https://stenobird.com/podcast/practical-ai/udio-the-age-of-multi-modal-ai
- Text version: https://stenobird.com/podcast/practical-ai/udio-the-age-of-multi-modal-ai.md
- Podcast: [Practical AI](https://stenobird.com/podcast/practical-ai)
- Published: 2024-04-16T18:20:00+00:00
- Episode link: https://share.transistor.fm/s/e0bd1784
- Audio file: https://pscrb.fm/rss/p/dts.podtrac.com/redirect.mp3/media.transistor.fm/e0bd1784/3a615bf2.mp3
- Processing state: failed
- JSON: https://stenobird.com/v1/public/podcasts/practical-ai/episodes/udio-the-age-of-multi-modal-ai
- Duration seconds: 2332

## Resource

2024 promises to be the year of multi-modal AI, and we are already seeing some amazing things. In this “fully connected” episode, Chris and Daniel explore the new Udio product/service for generating music. Then they dig into the differences between recent multi-modal efforts and more “traditional” ways of combining data modalities.

Sponsors:

- Fly.io – The home of Changelog.com. Deploy your apps and databases close to your users. In minutes you can run your Ruby, Go, Node, Deno, Python, or Elixir app (and databases!) all over the world. No ops required. Learn more at fly.io/changelog and check out the speedrun in their docs.
- Changelog News – A podcast+newsletter combo that’s brief, entertaining & always on-point. Subscribe today.

Featuring:

- Chris Benson – Website, GitHub, LinkedIn, X
- Daniel Whitenack – Website, GitHub, X

Show Notes:

- Udio
- CLIP
- BridgeTower
- LLaVA

Upcoming Events:

- Register for upcoming webinars here!

## Actions

- request_transcript: `POST https://stenobird.com/v1/public/podcasts/practical-ai/episodes/udio-the-age-of-multi-modal-ai/transcription-requests`: Idempotently request low-priority transcript generation for this episode.
- read_markdown: `GET https://stenobird.com/podcast/practical-ai/udio-the-age-of-multi-modal-ai.md`: Read the agent-friendly Markdown representation of this episode resource.
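
As a rough illustration of how an agent might call the two actions listed above, here is a minimal Python sketch using only the standard library. The two URLs come from the Actions list; everything else is an assumption made for illustration: the helper names (`transcript_request`, `markdown_request`, `send`), the empty POST body, the idea that the same URL pattern applies to other podcast and episode slugs, and the shape of whatever the server returns. None of these details are documented on this page.

```python
import urllib.request

# Host taken from the Actions list on this page.
BASE = "https://stenobird.com"


def transcript_request(podcast: str, episode: str) -> urllib.request.Request:
    """Build the POST for the request_transcript action.

    Assumption: the endpoint accepts an empty body and needs no auth headers;
    the page documents neither.
    """
    url = f"{BASE}/v1/public/podcasts/{podcast}/episodes/{episode}/transcription-requests"
    return urllib.request.Request(url, data=b"", method="POST")


def markdown_request(podcast: str, episode: str) -> urllib.request.Request:
    """Build the GET for the read_markdown action."""
    url = f"{BASE}/podcast/{podcast}/{episode}.md"
    return urllib.request.Request(url, method="GET")


def send(request: urllib.request.Request, timeout: float = 10.0) -> tuple[int, str]:
    """Send a prepared request and return (status code, body text).

    Hypothetical helper: the response format is not specified on this page,
    so the body is returned as undecoded-safe text for the caller to inspect.
    """
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.status, response.read().decode("utf-8", errors="replace")


if __name__ == "__main__":
    # Only prints what would be sent; call send(...) to actually hit the network.
    for req in (
        transcript_request("practical-ai", "udio-the-age-of-multi-modal-ai"),
        markdown_request("practical-ai", "udio-the-age-of-multi-modal-ai"),
    ):
        print(req.get_method(), req.full_url)
```

Because the page describes `request_transcript` as idempotent, repeating the POST for the same episode should be safe, although how the server signals a duplicate request is not stated here.
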

A page view does not enqueue transcription. Agents should invoke `request_transcript` explicitly when they need this episode processed.

## Transcript

Full transcripts are not published on public pages unless there is a clear rights basis.