logo
episode-header-image
Jul 2018
38m 20s

Dev Ops for Data Science

Kyle Polich
About this episode

We revisit the 2018 Microsoft Build in this episode, focusing on the latest ideas in DevOps. Kyle interviews Cloud Developer Advocates Damien Brady, Paige Bailey, and Donovan Brown to talk about DevOps and data science and databases.

For a data scientist, what does it even mean to "build"? Packaging and deployment are things that a data scientist doesn't normally have to consider in their day-to-day work. The process of making an AI app is usually divided into two streams of work: data scientists building machine learning models and app developers building the application for end users to consume.

DevOps includes all the parties involved in getting the application deployed and maintained and thinking about all the phases that follow and precede their part of the end solution. So what does DevOps mean for data science? Why should you adopt DevOps best practices?

In the first half, Paige and Damian share their views on what DevOps for data science would look like and how it can be introduced to provide continuous integration, delivery, and deployment of data science models. In the second half, Donovan and Damian talk about the DevOps life cycle of putting a database under version control and carrying out deployments through a release pipeline.

Up next
Nov 23
Designing Recommender Systems for Digital Humanities
<p>In this episode of Data Skeptic, we explore the fascinating intersection of recommender systems and digital humanities with guest Florian Atzenhofer-Baumgartner, a PhD student at Graz University of Technology. Florian is working on <a href= "http://monasterium.net/">Monasteriu ... Show More
36m 48s
Nov 13
DataRec Library for Reproducible in Recommend Systems
<p>In this episode of Data Skeptic's Recommender Systems series, host Kyle Polich explores DataRec, a new Python library designed to bring reproducibility and standardization to recommender systems research. Guest Alberto Carlo Maria Mancino, a postdoc researcher from Politecnico ... Show More
32m 48s
Nov 5
Shilling Attacks on Recommender Systems
In this episode of Data Skeptic's Recommender Systems series, Kyle sits down with Aditya Chichani, a senior machine learning engineer at Walmart, to explore the darker side of recommendation algorithms. The conversation centers on shilling attacks—a form of manipulation where mal ... Show More
34m 48s
Recommended Episodes
Mar 2023
DevOps is the Philosophy, Platform is the Practice | Humanitec's Kaspar von Grünberg
<p>&quot;DevOps is dead.&quot;<br/><br/>Well, not exactly. But the DevOps methodology of &quot;you build it, you run it&quot; has been failing development teams for years.<br/><br/>On this week&apos;s episode of Dev Interrupted, we sit down with Kaspar von Grünberg, founder &amp; ... Show More
35m 57s
Jan 2022
Making Agile work for data science
<p>Data scientists and engineers don’t always play well together. Data scientists will plan out a solution, carefully build models, test them in notebooks, then throw that solution over the wall to engineering. Implementing that solution can take months.</p><p>Historically, the d ... Show More
20m 52s
Jun 2019
Datanauts 166: Can You Hire ‘DevOps’?
Matt Stratton beams aboard the Datanauts starship to share his opinions and experiences with DevOps. Is DevOps a role you can hire for, or a culture you create? If it's the later, how do you get started, what are the impacts, and how do you iterate? The post Datanauts 166: Can Yo ... Show More
1h 6m
Feb 2018
DevOps_Tear Down That Wall
As the race to deliver applications ramps up, the wall between development and operations comes crashing down. When it does, those on both sides learn to work together like never before. But what is DevOps, really? Developer guests, including Microsoft’s Scott Hanselman and Cindy ... Show More
24m 42s
Oct 2022
2022 State of DevOps Report with Nathen Harvey and Derek DeBellis
<p><span style="font-weight: 400;">On the show this week, we're talking updated DevOps practices for 2022 with hosts</span> <a href="https://twitter.com/stephr_wong" target="_blank" rel= "noopener"><span style="font-weight: 400;">Stephanie Wong</span></a> <span style="font-weight ... Show More
44m 7s
Oct 2023
Reducing The Barrier To Entry For Building Stream Processing Applications With Decodable
<h2>Summary</h2> <p>Building streaming applications has gotten substantially easier over the past several years. Despite this, it is still operationally challenging to deploy and maintain your own stream processing infrastructure. Decodable was built with a mission of eliminat ... Show More
1h 8m
Oct 2022
How To Bring Agile Practices To Your Data Projects
<div class="wp-block-jetpack-markdown"><h2>Summary</h2> <p>Agile methodologies have been adopted by a majority of teams for building software applications. Applying those same practices to data can prove challenging due to the number of systems that need to be included to implem ... Show More
1h 12m
Jun 2021
Lessons Learned From The Pipeline Data Engineering Academy
<div class="wp-block-jetpack-markdown"><h2>Summary</h2> <p>Data Engineering is a broad and constantly evolving topic, which makes it difficult to teach in a concise and effective manner. Despite that, Daniel Molnar and Peter Fabian started the Pipeline Academy to do exactly that ... Show More
1h 11m
Jan 2022
Academics and Data Science Innovation with Dr. David Bader, Distinguished Professor and Director, Institute for Data Science, New Jersey Institute of Technology
<p>The data science field is expanding because so many businesses and other institutions require skilled workers who can manage data as well as provide insights. Companies and students are clamoring for more academic programs. There is great need, but academic institutions are st ... Show More
39m 32s