Episode

Evolving MLOps Platforms for Generative AI and Agents with Abhijit Bose - #714

Podcast
The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)
Published
Jan 13, 2025
Duration seconds
3488
Processing state
failed
Canonical source
https://twimlai.com/podcast/twimlai/evolving-mlops-platforms-for-generative-ai-and-agents/
Audio
https://pscrb.fm/rss/p/traffic.megaphone.fm/MLN4141194502.mp3?updated=1736914659
JSON
/v1/public/podcasts/twiml-ai-podcast/episodes/evolving-mlops-platforms-for-generative-ai-and-agents-with-abhijit-bose-714
Markdown
/podcast/twiml-ai-podcast/evolving-mlops-platforms-for-generative-ai-and-agents-with-abhijit-bose-714.md

Actions

  • POST https://stenobird.com/v1/public/podcasts/twiml-ai-podcast/episodes/evolving-mlops-platforms-for-generative-ai-and-agents-with-abhijit-bose-714/transcription-requests
    Idempotently request low-priority transcript generation for this episode.
  • GET https://stenobird.com/podcast/twiml-ai-podcast/evolving-mlops-platforms-for-generative-ai-and-agents-with-abhijit-bose-714.md
    Read the agent-friendly Markdown representation of this episode resource.

Summary

Today, we're joined by Abhijit Bose, head of enterprise AI and ML platforms at Capital One to discuss the evolution of the company’s approach and insights on Generative AI and platform best practices. In this episode, we dig into the company’s platform-centric approach to AI, and how they’ve been evolving their existing MLOps and data platforms to support the new challenges and opportunities presented by generative AI workloads and AI agents. We explore their use of cloud-based infrastructure—in this case on AWS—to provide a foundation upon which they then layer open-source and proprietary services and tools. We cover their use of Llama 3 and open-weight models, their approach to fine-tuning, their observability tooling for Gen AI applications, their use of inference optimization techniques like quantization, and more. Finally, Abhijit shares the future of agentic workflows in the enterprise, the application of OpenAI o1-style reasoning in models, and the new roles and skillsets required in the evolving GenAI landscape. The complete show notes for this episode can be found at https://twimlai.com/go/714.