Episode

Unlocking AI Potential with AMD's ROCm Stack

Podcast
AI Engineering Podcast
Published
Jun 23, 2025
Duration seconds
2538
Processing state
processed
Canonical source
https://www.aiengineeringpodcast.com/amd-rocm-training-and-inference-episode-54
Audio
https://op3.dev/e/dts.podtrac.com/redirect.mp3/serve.podhome.fm/episode/f6ff0caa-931b-4c08-bfdd-08dc7f5cd336/6388628845029162589f1410e3-4278-4244-91e6-93231214fd31v1.mp3
JSON
/v1/public/podcasts/ai-engineering-podcast/episodes/unlocking-ai-potential-with-amd-s-rocm-stack
Markdown
/podcast/ai-engineering-podcast/unlocking-ai-potential-with-amd-s-rocm-stack.md

Actions

  • POST https://stenobird.com/v1/public/podcasts/ai-engineering-podcast/episodes/unlocking-ai-potential-with-amd-s-rocm-stack/transcription-requests
    Idempotently request low-priority transcript generation for this episode.
  • GET https://stenobird.com/podcast/ai-engineering-podcast/unlocking-ai-potential-with-amd-s-rocm-stack.md
    Read the agent-friendly Markdown representation of this episode resource.

Summary

Summary In this episode of the AI Engineering podcast Anush Elangovan, VP of AI software at AMD, discusses the strategic integration of software and hardware at AMD. He emphasizes the open-source nature of their software, fostering innovation and collaboration in the AI ecosystem, and highlights AMD's performance and capability advantages over competitors like NVIDIA. Anush addresses challenges and opportunities in AI development, including quantization, model efficiency, and future deployment across various platforms, while also stressing the importance of open standards and flexible solutions that support efficient CPU-GPU communication and diverse AI workloads. Announcements Hello and welcome to the AI Engineering Podcast, your guide to the fast-moving world of building scalable and maintainable AI systems Your host is Tobias Macey and today I'm interviewing Anush Elangovan about AMD's work to expand the playing field for AI training and inference Interview Introduction How did you get involved in machine learning? Can you describe what your work at AMD is focused on? A lot of the current attention on hardware for AI training and inference is focused on the raw GPU hardware. What is the role of the software stack in enabling and differentiating that underlying compute? CUDA has gained a significant amount of attention and adoption in the numeric computation space (AI, ML, scientific computing, etc.). What are the elements of platform risk associated with relying on CUDA as a developer or organization? The ROCm stack is the key element in AMD's AI and HPC strategy. What are the elements that comprise that ecosystem? What are the incentives for anyone outside of AMD to contribute to the ROCm project? How would you characterize the current competitive landscape for AMD…