Episode

Udio & the age of multi-modal AI

Podcast
Practical AI
Published
Apr 16, 2024
Duration seconds
2332
Processing state
failed
Canonical source
https://share.transistor.fm/s/e0bd1784
Audio
https://pscrb.fm/rss/p/dts.podtrac.com/redirect.mp3/media.transistor.fm/e0bd1784/3a615bf2.mp3
JSON
/v1/public/podcasts/practical-ai/episodes/udio-the-age-of-multi-modal-ai
Markdown
/podcast/practical-ai/udio-the-age-of-multi-modal-ai.md

Actions

  • POST https://stenobird.com/v1/public/podcasts/practical-ai/episodes/udio-the-age-of-multi-modal-ai/transcription-requests
    Idempotently request low-priority transcript generation for this episode.
  • GET https://stenobird.com/podcast/practical-ai/udio-the-age-of-multi-modal-ai.md
    Read the agent-friendly Markdown representation of this episode resource.

Summary

2024 promises to be the year of multi-modal AI, and we are already seeing some amazing things. In this “fully connected” episode, Chris and Daniel explore the new Udio product/service for generating music. Then they dig into the differences between recent multi-modal efforts and more “traditional” ways of combining data modalities. Sponsors: Fly.io – The home of Changelog.com — Deploy your apps and databases close to your users. In minutes you can run your Ruby, Go, Node, Deno, Python, or Elixir app (and databases!) all over the world. No ops required. Learn more at fly.io/changelog and check out the speedrun in their docs . Changelog News – A podcast+newsletter combo that’s brief, entertaining & always on-point. Subscribe today . Featuring: Chris Benson – Website , GitHub , LinkedIn , X Daniel Whitenack – Website , GitHub , X Show Notes: Udio CLIP BridgeTower LLaVA Upcoming Events: Register for upcoming webinars here !