{"podcast":{"title":"AI Engineering Podcast","slug":"ai-engineering-podcast","podcast_index_feed_id":5875646,"rss_url":"https://serve.podhome.fm/rss/c9abdd38-a5dc-5eb2-96fd-f833f93208a7","website_url":"https://www.aiengineeringpodcast.com","image_url":"https://assets.podhome.fm/f6ff0caa-931b-4c08-bfdd-08dc7f5cd336/638557211890591941ai_engineering_podcast_logo.jpg","author":"Tobias Macey","episode_count":79,"summary":"This show is your guidebook to building scalable and maintainable AI systems. You will learn how to architect AI applications, apply AI to your work, and the considerations involved in building or customizing new models. Everything that you need to know to deliver real impact and value with machine learning and artificial intelligence.","last_synced_at":null,"page_url":"https://stenobird.com/podcast/ai-engineering-podcast"},"episode":{"title":"Optimize Your AI Applications Automatically With The TensorZero LLM Gateway","slug":"optimize-your-ai-applications-automatically-with-the-tensorzero-llm-gateway","published_at":"2025-01-22T16:32:38+00:00","page_url":"https://stenobird.com/podcast/ai-engineering-podcast/optimize-your-ai-applications-automatically-with-the-tensorzero-llm-gateway","show_page_url":"https://stenobird.com/podcast/ai-engineering-podcast","url":"https://www.aiengineeringpodcast.com/tensorzero-llm-gateway-prompt-optimization-episode-45","audio_url":"https://op3.dev/e/dts.podtrac.com/redirect.mp3/serve.podhome.fm/episode/f6ff0caa-931b-4c08-bfdd-08dc7f5cd336/638729251179606751c93f9b44-5412-4e68-8e3a-4dee94a1a3dav1.mp3","summary":"Summary In this episode of the AI Engineering podcast Viraj Mehta, CTO and co-founder of TensorZero, talks about the use of LLM gateways for managing interactions between client-side applications and various AI models. He highlights the benefits of using such a gateway, including standardized communication, credential management, and potential features like request-response caching and audit logging. The conversation also explores TensorZero's architecture and functionality in optimizing AI applications by managing structured data inputs and outputs, as well as the challenges and opportunities in automating prompt generation and maintaining interaction history for optimization purposes. Announcements Hello and welcome to the AI Engineering Podcast, your guide to the fast-moving world of building scalable and maintainable AI systems Seamless data integration into AI applications often falls short, leading many to adopt RAG methods, which come with high costs, complexity, and limited scalability. Cognee offers a better solution with its open-source semantic memory engine that automates data ingestion and storage, creating dynamic knowledge graphs from your data. Cognee enables AI agents to understand the meaning of your data, resulting in accurate responses at a lower cost. Take full control of your data in LLM apps without unnecessary overhead. Visit aiengineeringpodcast.com/cognee to learn more and elevate your AI apps and agents.&nbsp; Your host is Tobias Macey and today I'm interviewing Viraj Mehta about the purpose of an LLM gateway and his work on TensorZero Interview Introduction How did you get involved in machine learning? What is an LLM gateway? What purpose does it serve in an AI application architecture? What are some of the different features and capabilitie…","meta_description":"Summary In this episode of the AI Engineering podcast Viraj Mehta, CTO and co-founder of TensorZero, talks about the use of LLM gateways for managing inte…","key_points":[],"chapters":[],"topics":[],"duration_seconds":3785,"processing_state":"failed","actions":[{"name":"request_transcript","method":"POST","url":"https://stenobird.com/v1/public/podcasts/ai-engineering-podcast/episodes/optimize-your-ai-applications-automatically-with-the-tensorzero-llm-gateway/transcription-requests","description":"Idempotently request low-priority transcript generation for this episode."},{"name":"read_markdown","method":"GET","url":"https://stenobird.com/podcast/ai-engineering-podcast/optimize-your-ai-applications-automatically-with-the-tensorzero-llm-gateway.md","description":"Read the agent-friendly Markdown representation of this episode resource."}]}}