{"podcast":{"title":"Data Skeptic","slug":"data-skeptic","podcast_index_feed_id":587881,"rss_url":"https://dataskeptic.libsyn.com/rss","website_url":"https://dataskeptic.com","image_url":"https://static.libsyn.com/p/assets/0/e/4/b/0e4bd71bb64c6e45/DS_-_New_Logo_assets_-_JL_DS_Logo_Stacked_-_Color_2.jpg","author":"Kyle Polich","episode_count":601,"summary":"The Data Skeptic Podcast features interviews and discussion of topics related to data science, statistics, machine learning, artificial intelligence and the like, all from the perspective of applying critical thinking and the scientific method to evaluate the veracity of claims and efficacy of approaches.","last_synced_at":null,"page_url":"https://stenobird.com/podcast/data-skeptic"},"episode":{"title":"Music Playlist Recommendations","slug":"music-playlist-recommendations","published_at":"2025-10-29T14:00:00+00:00","page_url":"https://stenobird.com/podcast/data-skeptic/music-playlist-recommendations","show_page_url":"https://stenobird.com/podcast/data-skeptic","url":"https://dataskeptic.com/blog/episodes/2025/music-playlist-recommendations","audio_url":"https://pscrb.fm/rss/p/mgln.ai/e/35/traffic.libsyn.com/secure/dataskeptic/rebecca-with-ads.mp3?dest-id=201630","summary":"Algorithmic recommendation systems often perpetuate popularity bias, favoring mainstream hits over niche discovery. This episode explores how multimodal frameworks and semantic datasets can leverage human-centric language to create fairer, more nuanced music discovery.","meta_description":"Explore how the LARP framework and Music Semantics dataset use Reddit discourse to fix bias and improve music playlist recommendations.","key_points":["Main idea: Recommender systems act as modern gatekeepers, often creating popularity bias that suppresses diverse musical discovery","Failure mode: Standard audio features like BPM or tempo fail to capture the 'atmospheric' or 'situational' ways humans actually describe music","Practical takeaway: The LARP framework uses contrastive learning to align text and audio representations for better playlist continuation","Main idea: The Music Semantics dataset uses scraped Reddit discussions to capture organic, granular human contexts like 'songs for a breakup.'","Future direction: The next frontier involves conversational interfaces where users can provide explicit, granular feedback on specific acoustic elements"],"chapters":[{"start_ms":60000,"title":"Researching Fairness in Recommenders","summary":"Rebecca discusses her background in music and how she applies fairness research to graph-based and dynamic recommender systems."},{"start_ms":495000,"title":"Defining Algorithmic Unfairness","summary":"An exploration of the different types of bias in recommendation engines and the mathematical formalization of fairness."},{"start_ms":750000,"title":"The LARP Multimodal Framework","summary":"Introduction to the LARP model, which uses contrastive learning to align audio features with textual representations."},{"start_ms":1465000,"title":"Capturing Human Music Semantics","summary":"How scraping Reddit allows researchers to understand the atmospheric and situational language people use to describe music."},{"start_ms":1705000,"title":"The Music Semantics Dataset","summary":"Details on the taxonomy of music descriptions and the availability of the dataset on Hugging Face."},{"start_ms":2410000,"title":"Evaluating Recommendation Accuracy","summary":"A look at the methodology for testing playlist continuation by masking songs and measuring overlap."},{"start_ms":2645000,"title":"The Future of Music Discovery","summary":"Discussing the 'platformization' of music and the potential for conversational, high-granularity feedback loops."}],"topics":["Recommender Systems","Algorithmic Fairness","Multimodal Machine Learning","Music Information Retrieval","Natural Language Processing","Contrastive Learning","Popularity Bias","Dataset Engineering"],"duration_seconds":3149,"processing_state":"processed","actions":[{"name":"request_transcript","method":"POST","url":"https://stenobird.com/v1/public/podcasts/data-skeptic/episodes/music-playlist-recommendations/transcription-requests","description":"Idempotently request low-priority transcript generation for this episode."},{"name":"read_markdown","method":"GET","url":"https://stenobird.com/podcast/data-skeptic/music-playlist-recommendations.md","description":"Read the agent-friendly Markdown representation of this episode resource."}]}}