logo
episode-header-image
Sep 2021
47m 34s

From notebooks to Netflix scale with Met...

CHANGELOG MEDIA
About this episode

As you start developing an AI/ML based solution, you quickly figure out that you need to run workflows. Not only that, you might need to run those workflows across various kinds of infrastructure (including GPUs) at scale. Ville Tuulos developed Metaflow while working at Netflix to help data scientists scale their work. In this episode, Ville tells us a bit more about Metaflow, his new book on data science infrastructure, and his approach to helping scale ML/AI work.

Discuss on Changelog News

Changelog++ members save 2 minutes on this episode because they made the ads disappear. Join today!

Sponsors

  • RudderStack – Smart customer data pipeline made for developers. RudderStack is the smart customer data pipeline. Connect your whole customer data stack. Warehouse-first, open source Segment alternative.
  • SignalWire – Build what’s next in communications with video, voice, and messaging APIs powered by elastic cloud infrastructure. Try it today at signalwire.com and use code AI for $25 in developer credit.
  • Fastly – Our bandwidth partner. Fastly powers fast, secure, and scalable digital experiences. Move beyond your content delivery network to their powerful edge cloud platform. Learn more at fastly.com

Featuring

Notes and Links

Something missing or broken? PRs welcome!

Up next
Aug 22
Git with your friends (remastered) (Changelog & Friends #107)
Our Changelog & Friends proof-of-concept with Mat Ryer has been remastered! Now with full-length video on YouTube. Originally recorded: 2023-02-08 Mat joins us for some good conversation about some Git tooling that's been on our radar. We speculate, we discuss, we laugh, and Mat ... Show More
1h 47m
Aug 21
From Chef to System Initiative (remastered) (Changelog Interviews #655)
The epic show with Adam Jacob has been remastered! Now with full-length video on YouTube. Adam goes solo with Adam Jacob for an epic pod into his journey to get to System Initiative. From SysAdmin at 8 years old, to discovering Linux and working for Mom-and-pop ISPs, to open sour ... Show More
2h 22m
Aug 18
Cursor’s problem isn't just Cursor's problem (Changelog News #157)
Cursor has a big problem, Alireza Bashiri thinks plaintext beats todo apps, Manish built an offline AI workspace, OverType is a WYSIWYG markdown editor that's just a textarea, and sshrc lets you bring your config with you to remote machines. 
7m 37s
Recommended Episodes
Nov 2021
AI-generated code with OpenAI Codex
Recently, GitHub released Copilot, which is an amazing AI pair programmer powered by OpenAI’s Codex model. In this episode, Natalie Pistunovich tells us all about Codex and helps us understand where it fits in our development workflow. We also discuss MLOps and how AI is influenc ... Show More
46m 37s
Aug 2020
Building the world's most popular data science platform
Everyone working in data science and AI knows about Anaconda and has probably “conda” installed something. But how did Anaconda get started and what are they working on now? Peter Wang, CEO of Anaconda and creator of PyData and popular packages like Bokeh and DataShader, joins us ... Show More
59m 13s
Oct 2022
How To Bring Agile Practices To Your Data Projects
Summary Agile methodologies have been adopted by a majority of teams for building software applications. Applying those same practices to data can prove challenging due to the number of systems that need to be included to implement a complete feature. In this episode Shane Gibson ... Show More
1h 12m
Oct 2023
Reducing The Barrier To Entry For Building Stream Processing Applications With Decodable
Summary Building streaming applications has gotten substantially easier over the past several years. Despite this, it is still operationally challenging to deploy and maintain your own stream processing infrastructure. Decodable was built with a mission of eliminating all of the ... Show More
1h 8m
Oct 2018
Using Notebooks As The Unifying Layer For Data Roles At Netflix with Matthew Seal - Episode 54
Summary Jupyter notebooks have gained popularity among data scientists as an easy way to do exploratory analysis and build interactive reports. However, this can cause difficulties when trying to move the work of the data scientist into a more standard production environment, due ... Show More
40m 55s
Mar 2024
AI vs software devs
Daniel and Chris are out this week, so we’re bringing you conversations all about AI’s complicated relationship to software developers from other Changelog pods: JS Party, Go Time & The Changelog.Join the discussionChangelog++ members save 2 minutes on this episode because they m ... Show More
57 m
Jun 2024
Rise of the AI PC & local LLMs
We’ve seen a rise in interest recently and a number of major announcements related to local LLMs and AI PCs. NVIDIA, Apple, and Intel are getting into this along with models like the Phi family from Microsoft. In this episode, we dig into local AI tooling, frameworks, and optimiz ... Show More
35m 35s
Dec 2018
How to Build Your IT Team
Ian sat down with Dion Hinchcliffe, VP and Principal Analyst at Constellation Research. Together they discussed the value of low-code tools, how to lead digital transformation and the best tips to building your IT team. IT Visionaries is brought to you by The Lightning Platform b ... Show More
38m 22s
Sep 2018
AI, Cloud Computing, and Leadership
On today’s episode of IT Visionaries, we are joined by Japjit Tulsi, the CTO of Carta. In his 20 year career, Japjit has led engineering teams at Google, Microsoft, and eBay. He’s helped build products like Google Analytics and, most recently, ShopBot, eBay’s AI tool which combin ... Show More
33m 33s
Apr 2024
Establish A Single Source Of Truth For Your Data Consumers With A Semantic Layer
Summary Maintaining a single source of truth for your data is the biggest challenge in data engineering. Different roles and tasks in the business need their own ways to access and analyze the data in the organization. In order to enable this use case, while maintaining a single ... Show More
56m 23s