Episode
Stop Feeding Garbage Data To Your ML Models, Clean It Up With Galileo
- Podcast
- AI Engineering Podcast
- Published
- Jul 14, 2022
- Duration seconds
- 2824
- Processing state
failed
Actions
POST https://stenobird.com/v1/public/podcasts/ai-engineering-podcast/episodes/stop-feeding-garbage-data-to-your-ml-models-clean-it-up-with-galileo/transcription-requests
Idempotently request low-priority transcript generation for this episode.GET https://stenobird.com/podcast/ai-engineering-podcast/stop-feeding-garbage-data-to-your-ml-models-clean-it-up-with-galileo.md
Read the agent-friendly Markdown representation of this episode resource.
Summary
Summary Machine learning is a force multiplier that can generate an outsized impact on your organization. Unfortunately, if you are feeding your ML model garbage data, then you will get orders of magnitude more garbage out of it. The team behind Galileo experienced that pain for themselves and have set out to make data management and cleaning for machine learning a first class concern in your workflow. In this episode Vikram Chatterji shares the story of how Galileo got started and how you can use their platform to fix your ML data so that you can get back to the fun parts. Announcements Hello and welcome to the Machine Learning Podcast, the podcast about machine learning and how to bring it from idea to delivery. Predibase is a low-code ML platform without low-code limits. Built on top of our open source foundations of Ludwig and Horovod, our platform allows you to train state-of-the-art ML and deep learning models on your datasets at scale. Our platform works on text, images, tabular, audio and multi-modal data using our novel compositional model architecture. We allow users to operationalize models on top of the modern data stack, through REST and PQL – an extension of SQL that puts predictive power in the hands of data practitioners. Go to themachinelearningpodcast.com/predibase today to learn more and try it out! Do you wish you could use artificial intelligence to drive your business the way Big Tech does, but don’t have a money printer? Graft is a cloud-native platform that aims to make the AI of the 1% accessible to the 99%. Wield the most advanced techniques for unlocking the value of data, including text, images, video, audio, and graphs. No machine learning skills required, no team to hire, and no infrastructure to build or maintain. For more information on…