Episode
The Challenges of Data Processing On Kubernetes - A look at Spark, Flink, Dask, and Ray // Holden Karau (DoK Day North America 2022)
- Podcast
- Data on Kubernetes Community
- Published
- Oct 31, 2022
- Duration seconds
- 1209
- Processing state
failed
Actions
POST https://stenobird.com/v1/public/podcasts/data-on-kubernetes-community/episodes/the-challenges-of-data-processing-on-kubernetes-a-look-at-spark-flink-dask-and-ray-holden-karau-dok-day-north-america-2022/transcription-requests
Idempotently request low-priority transcript generation for this episode.GET https://stenobird.com/podcast/data-on-kubernetes-community/the-challenges-of-data-processing-on-kubernetes-a-look-at-spark-flink-dask-and-ray-holden-karau-dok-day-north-america-2022.md
Read the agent-friendly Markdown representation of this episode resource.
Summary
From the DoK Day North America 2022 (https://youtu.be/YWTa-DiVljY) ABSTRACT This talk will go through both the improvements that have been made in Kubernetes for batch analytic workloads as well as some of the current pain experienced by users and developers moving their workloads to Kube. In this talk you will learn about how we “cheated” back in the YARN and Mesos days to make things go fast, why Kubernetes doesn’t like those cheats, and what some alternatives are.