logo
episode-header-image
Nov 2023
51m 39s

#162 Scaling Data Engineering in Retail ...

DATACAMP
About this episode

Poor data engineering is like building a shaky foundation for a house—it leads to unreliable information, wasted time and money, and even legal problems, making everything less dependable and more troublesome in our digital world. In the retail industry specifically, data engineering is particularly important for managing and analyzing large volumes of sales, inventory, and customer data, enabling better demand forecasting, inventory optimization, and personalized customer experiences. It helps retailers make informed decisions, streamline operations, and remain competitive in a rapidly evolving market. Insight and frameworks learned from data engineering practices can be applied to a multitude of people and problems, and in turn, learning from someone who has been at the forefront of data engineering is invaluable.  

Mohammad Sabah is SVP of Engineering and Data at Thrive Market, and was appointed to this role in 2018. He joined the company from The Honest Company where he served as VP of Engineering & Chief Data Scientist. Sabah joined The Honest Company following its acquisition of Insnap, which he co-founded in 2015. Over the course of his career, Sabah has held various data science and engineering roles at companies including Facebook, Workday, Netflix, and Yahoo!

In the episode, Richie and Mo explore the importance of using AI to identify patterns and proactively address common errors, the use of tools like dbt and SODA for data pipeline abstraction and stakeholder involvement in data quality, data governance and data quality as foundations for strong data engineering, validation layers at each step of the data pipeline to ensure data quality, collaboration between data analysts and data engineers for holistic problem-solving and reusability of patterns, ownership mentality in data engineering and much more. 

Links from the show:


Up next
Nov 17
#332 How to Build AI Your Users Can Trust with David Colwell, VP of AI & ML at Tricentis
<p>The relationship between data governance and AI quality is more critical than ever. As organizations rush to implement AI solutions, many are discovering that without proper data hygiene and testing protocols, they're building on shaky foundations. How do you ensure your AI sy ... Show More
1h 5m
Nov 12
#331 The Future of Data & AI Education Just Arrived with Jonathan Cornelissen & Yusuf Saber
The future of education is being reshaped by AI-powered personalization. Traditional online learning platforms offer static content that doesn't adapt to individual needs, but new technologies are creating truly interactive experiences that respond to each learner's context, pace ... Show More
58m 24s
Nov 10
#330 Harnessing AI to Help Humanity with Professor Sandy Pentland, HAI Fellow at Stanford, Co-founder of MIT Media Lab
Data storytelling isn't just about presenting numbers—it's about creating shared wisdom that drives better decision-making. In our increasingly polarized world, we often miss that most people actually have reasonable views hidden behind the loudest voices. But how can technology ... Show More
55m 37s
Recommended Episodes
Jan 2024
SingleStore CEO on High-Speed Database Currents
Enterprise data architecture is highly complex, databases deeply fragmented and demand for high-speed information flows continues to grow. In this edition of the Tech Disruptors podcast, SingleStore CEO Raj Verma joins Sunil Rajgopal, Bloomberg Intelligence senior software analys ... Show More
47m 26s
Sep 2020
Vertafore's Chad Hawkinson on Cloud Data Security and Streamlining Workflows
<p>Joining Cindi today is <a href="https://www.linkedin.com/in/chad-h-6b41872/">Chad Hawkinson</a>, the Chief Product and Data Officer at <a href="https://www.vertafore.com/">Vertafore</a>, the leader in creating modern insurance technology. A seasoned data and analytics guru, Ch ... Show More
53m 58s
Sep 2018
Data Engineering
If you’re a data scientist, you know how important it is to keep your data orderly, clean, moving smoothly between different systems, well-documented… there’s a ton of work that goes into building and maintaining databases and data pipelines. This job, that of owner and maintaine ... Show More
16m 22s
Apr 2021
Opendoor’s Ian Wong on Disrupting the Real Estate Industry with Data-Driven Digital Transformation
<p>“Garbage in, garbage out.” It’s a philosophy every data leader is familiar with. Your algorithms and models are only as good as the data you put in them -- so how do you ensure the data you are leveraging is reliable and trustworthy? </p><p>Joining Cindi today is Opendoor Co-f ... Show More
37m 55s
Dec 2021
Making the Turn from Data Inventory to Helpful Information with Mara Reiff, the Chief Data Officer of FreshBooks
<p>If data is in a pool that only keeps getting deeper as data inventory is accounted for, when is the exact moment for a business leader to jump in to do something with all the accumulated information? Leaders who care about data appreciate that it’s necessary to take stock befo ... Show More
32m 50s
Jun 2021
Buying and Selling Homes Algorithmically with Opendoor’s VP of Research and Data Science, Kushal Chakrabarti
<p>For many people, the process of buying and selling a home will undoubtedly be the most difficult decisions they will make in their lifetime. Is the price you’re paying for your home fair? Is the price you’re selling your home for an adequate sale price? For a long time, realto ... Show More
32m 26s
Jan 2024
S5E1 Dora Boussias | Data Architecture, Situational Awareness, & Growth
<p><a target="_blank" href="https://www.buzzsprout.com/twilio/text_messages/995575/open_sms">Send us a text</a></p><p>Aaron Moncur interviews Dora Boussias about her career journey and leadership approach, with a focus on how she leads with empathy, transparency, and inclusivenes ... Show More
53m 13s
Feb 2021
ThoughtSpot’s Cindi Howson on Chief Data Officer Success Strategies
<p>Much like a roller coaster, 2020 was full of many loops, twists, and turns. From accelerated digital transformations to expedited migrations to the cloud, you were asked to do it all— often with far less time and resources. Through it all, <i>The Data Chief</i> was right there ... Show More
38m 3s