Episode

Rise of the AI PC & local LLMs

Podcast
Practical AI
Published
Jun 4, 2024
Duration seconds
2135
Processing state
failed
Canonical source
https://share.transistor.fm/s/66087536
Audio
https://pscrb.fm/rss/p/dts.podtrac.com/redirect.mp3/media.transistor.fm/66087536/5f42ce2d.mp3
JSON
/v1/public/podcasts/practical-ai/episodes/rise-of-the-ai-pc-local-llms
Markdown
/podcast/practical-ai/rise-of-the-ai-pc-local-llms.md

Actions

  • POST https://stenobird.com/v1/public/podcasts/practical-ai/episodes/rise-of-the-ai-pc-local-llms/transcription-requests
    Idempotently request low-priority transcript generation for this episode.
  • GET https://stenobird.com/podcast/practical-ai/rise-of-the-ai-pc-local-llms.md
    Read the agent-friendly Markdown representation of this episode resource.
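
A minimal sketch of how the two actions above could be prepared from a client, using only the Python standard library. The URLs and HTTP methods come from the list above; the helper names (`transcription_request`, `markdown_request`), the empty POST body, and the absence of authentication headers are assumptions made for illustration, not documented behavior of the service. The functions only build the requests and do not send them.

```python
from urllib.request import Request

BASE_URL = "https://stenobird.com"


def transcription_request(podcast: str, episode: str) -> Request:
    """Build the POST that asks for low-priority transcript generation.

    Assumption: the endpoint accepts an empty body and needs no auth header.
    """
    url = (
        f"{BASE_URL}/v1/public/podcasts/{podcast}"
        f"/episodes/{episode}/transcription-requests"
    )
    return Request(url, data=b"", method="POST")


def markdown_request(podcast: str, episode: str) -> Request:
    """Build the GET for the Markdown representation of the episode."""
    url = f"{BASE_URL}/podcast/{podcast}/{episode}.md"
    return Request(url, method="GET")


if __name__ == "__main__":
    for req in (
        transcription_request("practical-ai", "rise-of-the-ai-pc-local-llms"),
        markdown_request("practical-ai", "rise-of-the-ai-pc-local-llms"),
    ):
        # Print the method and URL instead of performing network I/O.
        print(req.get_method(), req.full_url)
```

Passing either object to `urllib.request.urlopen` would send it. Because the POST is described as idempotent, repeating it for the same episode should be safe, though the shape of the response is not specified here.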

Summary

We’ve seen a rise in interest recently and a number of major announcements related to local LLMs and AI PCs. NVIDIA, Apple, and Intel are getting into this along with models like the Phi family from Microsoft. In this episode, we dig into local AI tooling, frameworks, and optimizations to help you navigate this AI niche, and we talk about how this might impact AI adoption in the longer term.

Sponsors:

  • Ladder Life Insurance – 100% digital: no doctors, no needles, no paperwork. Don’t put it off until the very last minute to get term coverage life insurance through Ladder. Find out if you’re instantly approved. They’re rated A and A plus. Life insurance costs more as you age, so now’s the time to cross it off your list.
  • Neo4j – Is your code getting dragged down by JOINs and long query times? The problem might be your database. Try simplifying the complex with graphs. Stop asking relational databases to do more than they were made for. Graphs work well for use cases with lots of data connections like supply chain, fraud detection, real-time analytics, and genAI. With Neo4j, you can code in your favorite programming language and against any driver. Plus, it’s easy to integrate into your tech stack.
  • Fly.io – The home of Changelog.com. Deploy your apps and databases close to your users. In minutes you can run your Ruby, Go, Node, Deno, Python, or Elixir app (and databases!) all over the world. No ops required. Learn more at fly.io/changelog and check out the speedrun in their docs.

Featuring:

  • Chris Benson – Website, GitHub, LinkedIn, X
  • Daniel Whitenack – Website, GitHub, X

Show Notes:

  • Ollama
  • LM Studio
  • llama.cpp
  • OpenVINO
  • MLPerf client working group
  • Article - 5 top small language models
  • GPTQ article
  • Article - Which quantization method is right for you

Upcoming Events: Regis…