Sep 2018
16m 22s

Data Engineering

Ben Jaffe and Katie Malone
About this episode
If you’re a data scientist, you know how important it is to keep your data orderly, clean, moving smoothly between different systems, well-documented… there’s a ton of work that goes into building and maintaining databases and data pipelines. This job, that of owner and maintainer of the data being used for analytics, is often the realm of data engineers. Fr ...
Up next
Jul 2020
So long, and thanks for all the fish
All good things must come to an end, including this podcast. This is the last episode we plan to release, and it doesn’t cover data science; it’s mostly reminiscing, thanking our wonderful audience (that’s you!), and marveling at how this thing that started out as a side project g ...
35m 44s
Jul 2020
A Reality Check on AI-Driven Medical Assistants
The data science and artificial intelligence community has made amazing strides in the past few years to algorithmically automate portions of the healthcare process. This episode looks at two computer vision algorithms, one that diagnoses diabetic retinopathy and another that cla ...
14m
Jul 2020
A Data Science Take on Open Policing Data
A few weeks ago, we put out a call for data scientists interested in issues of race and racism, or people studying those topics with data science methods, to get in touch and come talk to our audience about their work. This week we’re excited to bring on Tod ...
23m 44s
Recommended Episodes
Apr 2021
Moving Machine Learning Into The Data Pipeline at Cherre
Summary: Most of the time when you think about a data pipeline or ETL job what comes to mind is a purely mechanistic progression of functions that move data from point A to point B. Sometimes, however, one of those transformations is actually a full-fledged machine learning projec ...
48m 5s
Nov 2023
#162 Scaling Data Engineering in Retail with Mohammad Sabah, SVP of Engineering & Data at Thrive Market
Poor data engineering is like building a shaky foundation for a house: it leads to unreliable information, wasted time and money, and even legal problems, making everything less dependable and more troublesome in our digital world. In the retail industry specifically, data enginee ...
51m 39s
Aug 2023
Unpacking The Seven Principles Of Modern Data Pipelines
Summary: Data pipelines are the core of every data product, ML model, and business intelligence dashboard. If you're not careful you will end up spending all of your time on maintenance and fire-fighting. The folks at Rivery distilled the seven principles of modern data pipeli ...
47m 3s
Nov 2021
Data Quality Starts At The Source
Summary: The most important gauge of success for a data platform is the level of trust in the accuracy of the information that it provides. In order to build and maintain that trust it is necessary to invest in defining, monitoring, and enforcing data quality metrics. In this epis ...
58m 55s
Feb 2020
Data Modeling That Evolves With Your Business Using Data Vault
Summary: Designing the structure for your data warehouse is a complex and challenging process. As businesses deal with a growing number of sources and types of information that they need to integrate, they need a data modeling strategy that provides them with flexibility and speed ...
1h 6m
Jun 2020
Bringing Business Analytics To End Users With GoodData
Summary: The majority of analytics platforms are focused on use internal to an organization by business stakeholders. As the availability of data increases and overall literacy in how to interpret it and take action improves there is a growing need to bring business intelligence u ...
52m 24s
Apr 2022
#83 Empowering the Modern Data Analyst
As data volumes grow and become ever more complex, the role of the data analyst has never been more important. At the disposal of the modern data analyst are tools that reduce time to insight and increase collaboration. However, as the tools of a data analyst evolve, so do the ...
37m 1s