logo
episode-header-image
Feb 2023
20m 27s

Shorten the distance between production ...

The Stack Overflow Podcast
About this episode
Modern networked applications generate a lot of data, and every business wants to make the most of that data. Most of the time, that means moving production data through some transformation process to get it ready for the analytics process. But what if you could have in-app analytics? What if you could generate insights directly from production data? On thi ... Show More
Up next
Today
Building a global engineering team (plus AI agents) with Netlify
In this episode of Leaders of Code, Stack Overflow’s Chief of Product and Technology, Jody Bailey, sits down with Dana Lawson, CTO at Netlify. Dana shares her insights on leading a lean, globally distributed engineering team that powers 5% of the internet. The conversation touche ... Show More
29s
Mar 17
Keeping the lights on for open source
Ryan sits down with Chainguard CEO Dan Lorenc to chat about how his team is keeping the foundation of the internet—open source projects—alive by forking archived but widely-used repos to provide security maintenance and dependency upgrades. They also discuss open source’s sustain ... Show More
29m 6s
Mar 13
Open source for awkward robots
Ryan is joined by Jan Liphardt, CEO and co-founder of OpenMind, to chat about the rapidly evolving world of humanoid robotics and what it means for humans, why OpenMind is building an open source operating system for robots that processes logic in natural language, and how puttin ... Show More
30m 38s
Recommended Episodes
Jun 2021
A Candid Exploration Of Timeseries Data Analysis With InfluxDB
<div class="wp-block-jetpack-markdown"><h2>Summary</h2> <p>While the overall concept of timeseries data is uniform, its usage and applications are far from it. One of the most demanding applications of timeseries data is for application and server monitoring due to the problem o ... Show More
1h 6m
Nov 2022
Analyze Massive Data At Interactive Speeds With The Power Of Bitmaps Using FeatureBase
<div class="wp-block-jetpack-markdown"><h2>Summary</h2> <p>The most expensive part of working with massive data sets is the work of retrieving and processing the files that contain the raw information. FeatureBase (formerly Pilosa) avoids that overhead by converting the data int ... Show More
59m 25s
Aug 2022
An Exploration Of The Expectations, Ecosystem, and Realities Of Real-Time Data Applications
<div class="wp-block-jetpack-markdown"><h2>Summary</h2> <p>Data has permeated every aspect of our lives and the products that we interact with. As a result, end users and customers have come to expect interactions and updates with services and analytics to be fast and up to date ... Show More
1h 6m
Apr 2021
Moving Machine Learning Into The Data Pipeline at Cherre
<div class="wp-block-jetpack-markdown"><h2>Summary</h2> <p>Most of the time when you think about a data pipeline or ETL job what comes to mind is a purely mechanistic progression of functions that move data from point A to point B. Sometimes, however, one of those transformation ... Show More
48m 5s
Aug 2022
Collecting And Retaining Contextual Metadata For Powerful And Effective Data Discovery
<div class="wp-block-jetpack-markdown"><h2>Summary</h2> <p>Data is useless if it isn&#8217;t being used, and you can&#8217;t use it if you don&#8217;t know where it is. Data catalogs were the first solution to this problem, but they are only helpful if you know what you are look ... Show More
53m 24s
Jun 2021
Accelerating ML Training And Delivery With In-Database Machine Learning
<div class="wp-block-jetpack-markdown"><h2>Summary</h2> <p>When you build a machine learning model, the first step is always to load your data. Typically this means downloading files from object storage, or querying a database. To speed up the process, why not build the model in ... Show More
1h 5m
Dec 2019
Building The Materialize Engine For Interactive Streaming Analytics In SQL
<div class="wp-block-jetpack-markdown"><h2>Summary</h2> <p>Transactional databases used in applications are optimized for fast reads and writes with relatively simple queries on a small number of records. Data warehouses are optimized for batched writes and complex analytical qu ... Show More
48m 7s
Oct 2023
Reducing The Barrier To Entry For Building Stream Processing Applications With Decodable
<h2>Summary</h2> <p>Building streaming applications has gotten substantially easier over the past several years. Despite this, it is still operationally challenging to deploy and maintain your own stream processing infrastructure. Decodable was built with a mission of eliminat ... Show More
1h 8m
Nov 2021
Exploring Processing Patterns For Streaming Data Integration In Your Data Lake
<div class="wp-block-jetpack-markdown"><h2>Summary</h2> <p>One of the perennial challenges posed by data lakes is how to keep them up to date as new data is collected. With the improvements in streaming engines it is now possible to perform all of your data integration in near r ... Show More
52m 53s
Apr 2024
Establish A Single Source Of Truth For Your Data Consumers With A Semantic Layer
<h2>Summary</h2> <p>Maintaining a single source of truth for your data is the biggest challenge in data engineering. Different roles and tasks in the business need their own ways to access and analyze the data in the organization. In order to enable this use case, while mainta ... Show More
56m 23s