logo
episode-header-image
Jan 2022
20m 52s

Making Agile work for data science

The Stack Overflow Podcast
About this episode

Data scientists and engineers don’t always play well together. Data scientists will plan out a solution, carefully build models, test them in notebooks, then throw that solution over the wall to engineering. Implementing that solution can take months.

Historically, the data science team has been purely science-driven. Work on methodologies, prove out something that they wanted to achieve, and then hand it over to the engineering organization. That could take many months.

Over the past three to five years, they’ve been moving their engineering and data science operations onto the cloud as part of an overall Agile transformation and a move from being sales-led to being product-led. With most of their solutions migrated over, they decided that along with modernizing their infrastructure, they wanted to modernize their legacy systems, add new functions and scientific techniques, and take advantage of new technologies to scale and meet the demand coming their way. 

While all of the rituals and the rigor of Agile didn't always facilitate the more open-ended nature of the data science work at 84.51°, having both data science and engineering operating in a similar tech stack has been a breath of fresh air. Working cross-functionally has shortened the implementation delay. At the same time, being closer to the engineering side of the house has given the data science team a better sense of how to fit their work into the pipeline. 

Getting everyone on the same tech stack had a side effect. Between the increasing complexity of the projects, geographic diversity of the folks on these projects, a rise in remote work, and continued growth, locating experts became harder. But with everyone working in the same tech, more people could answer questions and become SMEs. 

Of course, we’d be remiss if we didn’t tell you that 84.51° was asking and answering questions on Stack Overflow for Teams. It was helpful when Chris and Michael no longer had to call on the SMEs they knew by name but could suddenly draw more experts out of the woodwork by asking a question. Check out this episode for insights on data science, agile, and building a great knowledge base for a large, increasingly distributed engineering org.

Up next
Jul 8
Attention isn’t all we need; we need ownership too
NEAR is the blockchain for AI, enabling AI agents to transact freely across networks.Connect with Illia on LinkedIn and X, and read the original Transformers paper that Illia co-authored in 2017.Today’s shoutout goes to Populous badge winner Adi Lester for answering the question ... Show More
36m 32s
Jul 4
Why call one API when you can use GraphQL to call them all?
Apollo GraphQL lets you orchestrate APIs with a composable, declarative, self-service model. Apollo's MCP Server is now available.Connect with Matt on LinkedIn.Today we’re shouting out a Famous Question badge winner, user jkfe, for their question How to hide/show thymeleaf fields ... Show More
25m 45s
Jul 1
Programming problems that seem easy, but aren't, featuring Jon Skeet
Jon Skeet, for those not in the know, is legendary here at Stack Overflow. He even got his own Chuck Norris Facts-style jokes. Jon has graced the podcast before in the early days on episodes 4, 72, and 123.He’s so good at answering Stack Overflow questions that he appeared at Sta ... Show More
32m 34s
Recommended Episodes
Feb 2023
#127 How Data Scientists Can Thrive in Consulting
The most common application for data science is to solve problems within your own organization, and as professionals become more data literate, they rely less and less on others to solve their problems and unlock professional growth and career advancement.But in the world of cons ... Show More
42m 41s
Jul 2018
Dev Ops for Data Science
We revisit the 2018 Microsoft Build in this episode, focusing on the latest ideas in DevOps. Kyle interviews Cloud Developer Advocates Damien Brady, Paige Bailey, and Donovan Brown to talk about DevOps and data science and databases. For a data scientist, what does it even mean t ... Show More
38m 20s
Apr 2021
Opendoor’s Ian Wong on Disrupting the Real Estate Industry with Data-Driven Digital Transformation
“Garbage in, garbage out.” It’s a philosophy every data leader is familiar with. Your algorithms and models are only as good as the data you put in them -- so how do you ensure the data you are leveraging is reliable and trustworthy? Joining Cindi today is Opendoor Co-founder and ... Show More
37m 55s
Jan 2022
Academics and Data Science Innovation with Dr. David Bader, Distinguished Professor and Director, Institute for Data Science, New Jersey Institute of Technology
The data science field is expanding because so many businesses and other institutions require skilled workers who can manage data as well as provide insights. Companies and students are clamoring for more academic programs. There is great need, but academic institutions are still ... Show More
39m 32s
Feb 2023
#571: AWS Data Lab
AWS Data Lab helps customers shave months off of their development timelines by providing an engagement that pairs their teams of builders with dedicated AWS technical resources - helping them make architectural decisions faster, remove technical roadblocks, build with confidence ... Show More
31m 2s
Nov 2015
Data Science for Making the World a Better Place
There's a good chance that great data science is going on close to you, and that it's going toward making your city, state, country, and planet a better place. Not all the data science questions being tackled out there are about finding the sleekest new algorithm or billion-dolla ... Show More
9m 31s
Oct 2023
Reducing The Barrier To Entry For Building Stream Processing Applications With Decodable
Summary Building streaming applications has gotten substantially easier over the past several years. Despite this, it is still operationally challenging to deploy and maintain your own stream processing infrastructure. Decodable was built with a mission of eliminating all of the ... Show More
1h 8m
Jul 2021
Exploring The Design And Benefits Of The Modern Data Stack
Summary We have been building platforms and workflows to store, process, and analyze data since the earliest days of computing. Over that time there have been countless architectures, patterns, and "best practices" to make that task manageable. With the growing popularity of clou ... Show More
49m 2s
Mar 2023
#131 How the Aviation Industry Leverages Data Science
Data leaders play a critical role in driving innovation and growth in various industries, and this is particularly true in highly regulated industries such as aviation. In such industries, data leaders face unique challenges and opportunities, working to balance the need for inno ... Show More
35m 53s
Oct 2022
How To Bring Agile Practices To Your Data Projects
Summary Agile methodologies have been adopted by a majority of teams for building software applications. Applying those same practices to data can prove challenging due to the number of systems that need to be included to implement a complete feature. In this episode Shane Gibson ... Show More
1h 12m