Episode

Navigating Common Pitfalls in Data Science: Lessons from Pierpaolo Hipolito - ML 183

Podcast
Adventures in Machine Learning
Published
Jan 24, 2025
Duration seconds
3308
Processing state
processed
Canonical source
https://topenddevs.com/podcasts/adventures-in-machine-learning/episodes/navigating-common-pitfalls-in-data-science-lessons-from-pierpaolo-hipolito-ml-183-366d825b-a921-4796-b5eb-2e54be5d2139
Audio
https://dts.podtrac.com/redirect.mp3/api.spreaker.com/download/episode/63721179/ml_183.mp3
JSON
/v1/public/podcasts/adventures-in-machine-learning/episodes/navigating-common-pitfalls-in-data-science-lessons-from-pierpaolo-hipolito-ml-183
Markdown
/podcast/adventures-in-machine-learning/navigating-common-pitfalls-in-data-science-lessons-from-pierpaolo-hipolito-ml-183.md

Actions

  • POST https://stenobird.com/v1/public/podcasts/adventures-in-machine-learning/episodes/navigating-common-pitfalls-in-data-science-lessons-from-pierpaolo-hipolito-ml-183/transcription-requests
    Idempotently request low-priority transcript generation for this episode.
  • GET https://stenobird.com/podcast/adventures-in-machine-learning/navigating-common-pitfalls-in-data-science-lessons-from-pierpaolo-hipolito-ml-183.md
    Read the agent-friendly Markdown representation of this episode resource.

Summary

Welcome to another insightful episode of Top End Devs, where we delve into the fascinating world of machine learning and data science. In this episode, host Charles Max Wood is joined by special guest Pierpaolo Hipolito, a data scientist at the SAS Institute in the UK. Together, they explore the intriguing paradoxes of data science, discussing how these paradoxes can impact the accuracy of machine learning models and providing insights on how to mitigate them. Pierpaolo shares his expertise on causal reasoning in machine learning, drawing from his master's research and contributions to Towards Data Science and other notable publications. He elaborates on the complexities of data modeling during the early stages of the COVID-19 pandemic, highlighting the use of simulation and synthetic data to address data sparsity. Throughout the conversation, the focus remains on the importance of understanding the underlying system being modeled, the role of feature engineering, and strategies for avoiding common pitfalls in data science. Whether you are a seasoned data scientist or just starting out, this episode offers valuable perspectives on enhancing the reliability and interpretability of your machine learning models. Tune in for a deep dive into the paradoxes of data science, practical advice on feature interaction, and the importance of accurate data representation in achieving meaningful insights. Become a supporter of this podcast: https://www.spreaker.com/podcast/adventures-in-machine-learning--6102041/support .