logo
episode-header-image
Jul 2018
38m 20s

Dev Ops for Data Science

Kyle Polich
About this episode

We revisit the 2018 Microsoft Build in this episode, focusing on the latest ideas in DevOps. Kyle interviews Cloud Developer Advocates Damien Brady, Paige Bailey, and Donovan Brown to talk about DevOps and data science and databases.

For a data scientist, what does it even mean to “build”? Packaging and deployment are things that a data scientist doesn't normally have to consider in their day-to-day work. The process of making an AI app is usually divided into two streams of work: data scientists building machine learning models and app developers building the application for end users to consume.

DevOps includes all the parties involved in getting the application deployed and maintained and thinking about all the phases that follow and precede their part of the end solution. So what does DevOps mean for data science? Why should you adopt DevOps best practices?

In the first half, Paige and Damian share their views on what DevOps for data science would look like and how it can be introduced to provide continuous integration, delivery, and deployment of data science models. In the second half, Donovan and Damian talk about the DevOps life cycle of putting a database under version control and carrying out deployments through a release pipeline.

Up next
Jul 6
The Network Diversion Problem
In this episode, Professor Pål Grønås Drange from the University of Bergen, introduces the field of Parameterized Complexity - a powerful framework for tackling hard computational problems by focusing on specific structural aspects of the input. This framework allows researchers ... Show More
46m 14s
Jun 28
Complex Dynamic in Networks
In this episode, we learn why simply analyzing the structure of a network is not enough, and how the dynamics - the actual mechanisms of interaction between components - can drastically change how information or influence spreads. Our guest, Professor Baruch Barzel of Bar-Ilan Un ... Show More
56 m
Jun 22
Github Network Analysis
In this episode we'll discuss how to use Github data as a network to extract insights about teamwork. Our guest, Gabriel Ramirez, manager of the notifications team at GitHub, will show how to apply network analysis to better understand and improve collaboration within his enginee ... Show More
36m 46s
Recommended Episodes
Mar 2023
DevOps is the Philosophy, Platform is the Practice | Humanitec's Kaspar von Grünberg
"DevOps is dead." Well, not exactly. But the DevOps methodology of "you build it, you run it" has been failing development teams for years. On this week's episode of Dev Interrupted, we sit down with Kaspar von Grünberg, founder & CEO of Humanitec. Listen as Kaspar explains ... Show More
35m 57s
Jan 2022
Making Agile work for data science
Data scientists and engineers don’t always play well together. Data scientists will plan out a solution, carefully build models, test them in notebooks, then throw that solution over the wall to engineering. Implementing that solution can take months.Historically, the data scienc ... Show More
20m 52s
Jun 2019
Datanauts 166: Can You Hire ‘DevOps’?
Matt Stratton beams aboard the Datanauts starship to share his opinions and experiences with DevOps. Is DevOps a role you can hire for, or a culture you create? If it's the later, how do you get started, what are the impacts, and how do you iterate? The post Datanauts 166: Can Yo ... Show More
1h 6m
Feb 2018
DevOps_Tear Down That Wall
As the race to deliver applications ramps up, the wall between development and operations comes crashing down. When it does, those on both sides learn to work together like never before. But what is DevOps, really? Developer guests, including Microsoft’s Scott Hanselman and Cindy ... Show More
24m 42s
Oct 2022
2022 State of DevOps Report with Nathen Harvey and Derek DeBellis
On the show this week, we’re talking updated DevOps practices for 2022 with hosts Stephanie Wong and Chloe Condon and our guests Nathen Harvey and Derek DeBellis. Nathen and Derek start the show with a thorough discussion of DORA, the research program dedicated to helping organiz ... Show More
44m 7s
Oct 2023
Reducing The Barrier To Entry For Building Stream Processing Applications With Decodable
Summary Building streaming applications has gotten substantially easier over the past several years. Despite this, it is still operationally challenging to deploy and maintain your own stream processing infrastructure. Decodable was built with a mission of eliminating all of the ... Show More
1h 8m
Oct 2022
How To Bring Agile Practices To Your Data Projects
Summary Agile methodologies have been adopted by a majority of teams for building software applications. Applying those same practices to data can prove challenging due to the number of systems that need to be included to implement a complete feature. In this episode Shane Gibson ... Show More
1h 12m
Jun 2021
Lessons Learned From The Pipeline Data Engineering Academy
Summary Data Engineering is a broad and constantly evolving topic, which makes it difficult to teach in a concise and effective manner. Despite that, Daniel Molnar and Peter Fabian started the Pipeline Academy to do exactly that. In this episode they reflect on the lessons that t ... Show More
1h 11m
Jan 2022
Academics and Data Science Innovation with Dr. David Bader, Distinguished Professor and Director, Institute for Data Science, New Jersey Institute of Technology
The data science field is expanding because so many businesses and other institutions require skilled workers who can manage data as well as provide insights. Companies and students are clamoring for more academic programs. There is great need, but academic institutions are still ... Show More
39m 32s