Jul 2024
48m 44s

#225 The Full Stack Data Scientist with ...

DATACAMP
About this episode

The role of the data scientist is changing. Some organizations are splitting the role into more narrowly focused jobs, while others are broadening it. The latter approach, known as the Full Stack Data Scientist, is derived from the concept of a full stack software engineer, with this role often including software engineering tasks. In particular, one of the key functions of a full stack data scientist is to take machine learning models and get them into production inside software. So, what separates projects from production?

Savin Goyal is the Co-Founder & CTO at Outerbounds. In addition to his work at Outerbounds, Savin is the creator of the open-source machine learning management platform Metaflow. Previously, Savin worked as a Software Engineer at Netflix and LinkedIn.

In the episode, Richie and Savin explore the definition of production in data science, steps to move from internal projects to production, the lifecycle of a machine learning project, success stories in data science, challenges in quality control, Metaflow, scalability and robustness in production, AI and MLOps, advice for organizations, and much more.
