Episode

Program Aided Language Models

Podcast
Data Skeptic
Published
Nov 13, 2023
Duration seconds
1940
Processing state
failed
Canonical source
https://dataskeptic.com/blog/episodes/2023/program-aided-language-models
Audio
https://pscrb.fm/rss/p/mgln.ai/e/35/traffic.libsyn.com/secure/dataskeptic/program-aided-language-models.mp3?dest-id=201630
JSON
/v1/public/podcasts/data-skeptic/episodes/program-aided-language-models
Markdown
/podcast/data-skeptic/program-aided-language-models.md

Actions

  • POST https://stenobird.com/v1/public/podcasts/data-skeptic/episodes/program-aided-language-models/transcription-requests
    Idempotently request low-priority transcript generation for this episode.
  • GET https://stenobird.com/podcast/data-skeptic/program-aided-language-models.md
    Read the agent-friendly Markdown representation of this episode resource.

Summary

We are joined by Aman Madaan and Shuyan Zhou. They are both PhD students at the Language Technology Institute at Carnegie Mellon University. They join us to discuss their latest published paper, PAL: Program-aided Language Models. Aman and Shuyan started by sharing how the application of LLMs has evolved. They talked about the performance of LLMs on arithmetic tasks in contrast to coding tasks. Aman introduced their PAL model and how it helps LLMs improve at arithmetic tasks. He shared examples of the tasks PAL was tested on. Shuyan discussed how PAL's performance was evaluated using Big Bench hard tasks. They discussed the kind of mistakes LLMs tend to make and how the PAL's model circumvents these limitations. They also discussed how these developments in LLMS can improve kids learning. Rounding up, Aman discussed the CoCoGen project, a project that enables NLP tasks to be converted to graphs. Shuyan and Aman shared their next research steps. Follow Shuyan on Twitter @shuyanzhxyc. Follow Aman on @aman_madaan.