Episode

Automated Reasoning to Prevent LLM Hallucination with Byron Cook - #712

Podcast
The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)
Published
Dec 9, 2024
Duration seconds
3408
Processing state
failed
Canonical source
https://twimlai.com/podcast/twimlai/automated-reasoning-to-prevent-llm-hallucination/
Audio
https://pscrb.fm/rss/p/traffic.megaphone.fm/MLN7157710802.mp3?updated=1733775633
JSON
/v1/public/podcasts/twiml-ai-podcast/episodes/automated-reasoning-to-prevent-llm-hallucination-with-byron-cook-712
Markdown
/podcast/twiml-ai-podcast/automated-reasoning-to-prevent-llm-hallucination-with-byron-cook-712.md

Actions

  • POST https://stenobird.com/v1/public/podcasts/twiml-ai-podcast/episodes/automated-reasoning-to-prevent-llm-hallucination-with-byron-cook-712/transcription-requests
    Idempotently request low-priority transcript generation for this episode.
  • GET https://stenobird.com/podcast/twiml-ai-podcast/automated-reasoning-to-prevent-llm-hallucination-with-byron-cook-712.md
    Read the agent-friendly Markdown representation of this episode resource.

Summary

Today, we're joined by Byron Cook, VP and distinguished scientist in the Automated Reasoning Group at AWS, to dig into the underlying technology behind the newly announced Automated Reasoning Checks feature of Amazon Bedrock Guardrails. Automated Reasoning Checks uses mathematical proofs to help LLM users safeguard against hallucinations. We explore recent advancements in the field of automated reasoning and some of the ways it is applied, both broadly and across AWS, where it is used to enhance security, cryptography, virtualization, and more. We discuss how the new feature helps users generate, refine, validate, and formalize policies, and how those policies can be deployed alongside LLM applications to ensure the accuracy of generated text. Finally, Byron shares the benchmarks they’ve applied, the use of techniques like ‘constrained coding’ and ‘backtracking,’ and the future co-evolution of automated reasoning and generative AI. The complete show notes for this episode can be found at https://twimlai.com/go/712.