Episode
Convert Your Unstructured Data To Embedding Vectors For More Efficient Machine Learning With Towhee
- Podcast
- AI Engineering Podcast
- Published
- Sep 21, 2022
- Duration seconds
- 3114
- Processing state
failed
Actions
POST https://stenobird.com/v1/public/podcasts/ai-engineering-podcast/episodes/convert-your-unstructured-data-to-embedding-vectors-for-more-efficient-machine-learning-with-towhee/transcription-requests
Idempotently request low-priority transcript generation for this episode.GET https://stenobird.com/podcast/ai-engineering-podcast/convert-your-unstructured-data-to-embedding-vectors-for-more-efficient-machine-learning-with-towhee.md
Read the agent-friendly Markdown representation of this episode resource.
Summary
Summary Data is one of the core ingredients for machine learning, but the format in which it is understandable to humans is not a useful representation for models. Embedding vectors are a way to structure data in a way that is native to how models interpret and manipulate information. In this episode Frank Liu shares how the Towhee library simplifies the work of translating your unstructured data assets (e.g. images, audio, video, etc.) into embeddings that you can use efficiently for machine learning, and how it fits into your workflow for model development. Announcements Hello and welcome to the Machine Learning Podcast, the podcast about machine learning and how to bring it from idea to delivery. Building good ML models is hard, but testing them properly is even harder. At Deepchecks, they built an open-source testing framework that follows best practices, ensuring that your models behave as expected. Get started quickly using their built-in library of checks for testing and validating your model’s behavior and performance, and extend it to meet your specific needs as your model evolves. Accelerate your machine learning projects by building trust in your models and automating the testing that you used to do manually. Go to themachinelearningpodcast.com/deepchecks today to get started! Your host is Tobias Macey and today I’m interviewing Frank Liu about how to use vector embeddings in your ML projects and how Towhee can reduce the effort involved Interview Introduction How did you get involved in machine learning? Can you describe what Towhee is and the story behind it? What is the problem that Towhee is aimed at solving? What are the elements of generating vector embeddings that pose the greatest challenge or require the most effort? Once you have an embedding, what…