Episode

Chris Albon — ML Models and Infrastructure at Wikimedia

Podcast
Gradient Dissent: Conversations on AI
Published
Sep 23, 2021
Duration seconds
3375
Processing state
failed
Canonical source
http://wandb.me/gd-chris-albon
Audio
https://podcasts.captivate.fm/media/7356cf35-46ee-450f-9004-2ddca241df33/gd-chris-albon-v2-1.mp3
JSON
/v1/public/podcasts/gradient-dissent/episodes/chris-albon-ml-models-and-infrastructure-at-wikimedia
Markdown
/podcast/gradient-dissent/chris-albon-ml-models-and-infrastructure-at-wikimedia.md

Actions

  • POST https://stenobird.com/v1/public/podcasts/gradient-dissent/episodes/chris-albon-ml-models-and-infrastructure-at-wikimedia/transcription-requests
    Idempotently request low-priority transcript generation for this episode.
  • GET https://stenobird.com/podcast/gradient-dissent/chris-albon-ml-models-and-infrastructure-at-wikimedia.md
    Read the agent-friendly Markdown representation of this episode resource.

Summary

In this episode we're joined by Chris Albon, Director of Machine Learning at the Wikimedia Foundation. Lukas and Chris talk about Wikimedia's approach to content moderation, what it's like to work in a place so transparent that even internal chats are public, how Wikimedia uses machine learning (spoiler: they do a lot of models to help editors), and why they're switching to Kubeflow and Docker. Chris also shares how his focus on outcomes has shaped his career and his approach to technical interviews. Show notes: http://wandb.me/gd-chris-albon --- Connect with Chris: - Twitter: https://twitter.com/chrisalbon - Website: https://chrisalbon.com/ --- Timestamps: 0:00 Intro 1:08 How Wikimedia approaches moderation 9:55 Working in the open and embracing humility 16:08 Going down Wikipedia rabbit holes 20:03 How Wikimedia uses machine learning 27:38 Wikimedia's ML infrastructure 42:56 How Chris got into machine learning 46:43 Machine Learning Flashcards and technical interviews 52:10 Low-power models and MLOps 55:58 Outro