Episode

Neural Network Pruning and Training with Jonathan Frankle at MosaicML

Podcast
Gradient Dissent: Conversations on AI
Published
Apr 4, 2023
Duration seconds
3720
Processing state
failed
Canonical source
https://wandb.ai/site/resources/podcast
Audio
https://podcasts.captivate.fm/media/a616f9bd-2927-4712-945d-ca3ff22e73b8/WEIGHTS-Jonathan-Frankle-V2.mp3
JSON
/v1/public/podcasts/gradient-dissent/episodes/neural-network-pruning-and-training-with-jonathan-frankle-at-mosaicml
Markdown
/podcast/gradient-dissent/neural-network-pruning-and-training-with-jonathan-frankle-at-mosaicml.md

Actions

  • POST https://stenobird.com/v1/public/podcasts/gradient-dissent/episodes/neural-network-pruning-and-training-with-jonathan-frankle-at-mosaicml/transcription-requests
    Idempotently request low-priority transcript generation for this episode.
  • GET https://stenobird.com/podcast/gradient-dissent/neural-network-pruning-and-training-with-jonathan-frankle-at-mosaicml.md
    Read the agent-friendly Markdown representation of this episode resource.

Summary

Jonathan Frankle , Chief Scientist at MosaicML and Assistant Professor of Computer Science at Harvard University, joins us on this episode. With comprehensive infrastructure and software tools, MosaicML aims to help businesses train complex machine-learning models using their own proprietary data. We discuss: - Details of Jonathan’s Ph.D. dissertation which explores his “Lottery Ticket Hypothesis.” - The role of neural network pruning and how it impacts the performance of ML models. - Why transformers will be the go-to way to train NLP models for the foreseeable future. - Why the process of speeding up neural net learning is both scientific and artisanal. - What MosaicML does, and how it approaches working with clients. - The challenges for developing AGI. - Details around ML training policy and ethics. - Why data brings the magic to customized ML models. - The many use cases for companies looking to build customized AI models. Jonathan Frankle - https://www.linkedin.com/in/jfrankle/ Resources: - https://mosaicml.com/ - The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks Thanks for listening to the Gradient Dissent podcast, brought to you by Weights & Biases. If you enjoyed this episode, please leave a review to help get the word out about the show. And be sure to subscribe so you never miss another insightful conversation. #OCR #DeepLearning #AI #Modeling #ML