{"podcast":{"title":"Generative AI Group Podcast","slug":"generative-ai-group-podcast-7342905","podcast_index_feed_id":7342905,"rss_url":"https://github.com/sanand0/generative-ai-group/releases/download/main/podcast.xml","website_url":"https://github.com/sanand0/generative-ai-group","image_url":null,"author":null,"episode_count":74,"summary":"Weekly audio summaries of the Generative AI Group discussions.","last_synced_at":null,"page_url":"https://stenobird.com/podcast/generative-ai-group-podcast-7342905"},"episode":{"title":"Week of 2026-05-03","slug":"week-of-2026-05-03","published_at":"2026-05-03T00:00:00+00:00","page_url":"https://stenobird.com/podcast/generative-ai-group-podcast-7342905/week-of-2026-05-03","show_page_url":"https://stenobird.com/podcast/generative-ai-group-podcast-7342905","url":"https://github.com/sanand0/generative-ai-group/releases/download/main/podcast-2026-05-03.mp3","audio_url":"https://github.com/sanand0/generative-ai-group/releases/download/main/podcast-2026-05-03.mp3","summary":"Alex: Hello and welcome to The Generative AI Group Digest for the week of 03 May 2026! Maya: We're Alex and Maya. Alex: [excited] Big week in the group. We’ve got production questions, cloud inference, voice quality, model battles, agent stacks, vibe coding, and some very real “what is actually working?” stories. Maya: Exactly. And a lot of the thread was less about hype and more about what breaks in real life. Let’s start with the one that felt most practical. Alex: Nirant asked a great question about summarizing device datasheets in JSON, XML, and TXT with LLMs. He wanted best practices for chunking versus hierarchical approaches, factual accuracy, and prompt structure for production. Maya: That’s a classic production problem. The key is that technical documents are not just “long text.” They have structure, fields, relationships, and little details that matter. Alex: Jacob Singh pointed Nirant to PageIndex, saying it “has worked well” for a few cases. That’s interesting because PageIndex is built around structured document retrieval and navigation, which is often better than blindly chopping everything into chunks. Maya: Right. For non-technical listeners: chunking means splitting the document into pieces; hierarchical summarization means summarizing small pieces first, then combining those summaries into a bigger one. For datasheets, hierarchy usually wins when the document has sections, tables, and repeated patterns. Alex: And the big production lesson is: don’t ask the model to “summarize everything.” Instead, extract by schema first, then summarize from verified fields. That reduces hallucinations because the model is working from grounded data. Maya: Exactly. If you need a reliable summary, use a structured output like JSON with fixed keys: product name, key sp…","meta_description":"Alex: Hello and welcome to The Generative AI Group Digest for the week of 03 May 2026! Maya: We're Alex and Maya. Alex: [excited] Big week in the group. W…","key_points":[],"chapters":[],"topics":[],"duration_seconds":null,"processing_state":"not_requested","actions":[{"name":"request_transcript","method":"POST","url":"https://stenobird.com/v1/public/podcasts/generative-ai-group-podcast-7342905/episodes/week-of-2026-05-03/transcription-requests","description":"Idempotently request low-priority transcript generation for this episode."},{"name":"read_markdown","method":"GET","url":"https://stenobird.com/podcast/generative-ai-group-podcast-7342905/week-of-2026-05-03.md","description":"Read the agent-friendly Markdown representation of this episode resource."}]}}