Jul 2024
48m 44s

#225 The Full Stack Data Scientist with ...

DATACAMP
About this episode

The role of the data scientist is changing. Some organizations are splitting the role into more narrowly focused jobs, while others are broadening it. The latter approach, known as the Full Stack Data Scientist, is derived from the concept of a full stack software engineer, with this role often including software engineering tasks. In particular, one of the key functions of a full stack data scientist is to take machine learning models and get them into production inside software. So, what separates projects from production?

Savin Goyal is the Co-Founder & CTO at Outerbounds. He is also the creator of Metaflow, the open source machine learning management platform. Previously, Savin worked as a Software Engineer at Netflix and LinkedIn.

In the episode, Richie and Savin explore the definition of production in data science, steps to move from internal projects to production, the lifecycle of a machine learning project, success stories in data science, challenges in quality control, Metaflow, scalability and robustness in production, AI and MLOps, advice for organizations, and much more.
