logo
episode-header-image
Feb 2022
24m 33s

Column by your name: The analytics datab...

The Stack Overflow Podcast
About this episode
These days, every company looking at analyzing their data for insights has a data pipeline setup. Many companies have a fast production database, often a NoSQL or key-value store, that goes through a data pipeline.The pipeline process performs some sort of extract-transform-load process on it, then routes it to a larger data store that the analytics tools ca ... Show More
Up next
May 1
Time is a construct but it can still break your software
Ryan welcomes Jason Williams, senior software engineer at Bloomberg and the creator of Rust-based JavaScript engine Boa, to the show to dive into why date and time handling in JavaScript is so difficult and how the Temporal proposal aims to fix it. They explore the current flaws ... Show More
35m 38s
Apr 28
Your LLM issues are really data issues
Ryan welcomes Harsha Chintalapani, co-founder and CTO at Collate and co-creator of Open Metadata, to the show to discuss why AI and LLMs struggle with real-time, structured production data. They explore how schema changes, inconsistent definitions (like “customer”), and weak gove ... Show More
31m 34s
Apr 24
Lights, camera, open source!
Ryan is joined on the show by Cult.Repo producers Emma Tracey and Josiah Mcgarvie to discuss making documentaries about open-source software and the people behind the major technologies that uphold the internet. They explore why open-source projects and the people who maintain th ... Show More
25m 33s
Recommended Episodes
Nov 2022
Analyze Massive Data At Interactive Speeds With The Power Of Bitmaps Using FeatureBase
<div class="wp-block-jetpack-markdown"><h2>Summary</h2> <p>The most expensive part of working with massive data sets is the work of retrieving and processing the files that contain the raw information. FeatureBase (formerly Pilosa) avoids that overhead by converting the data int ... Show More
59m 25s
May 2022
A Multipurpose Database For Transactions And Analytics To Simplify Your Data Architecture With Singlestore
<div class="wp-block-jetpack-markdown"><h2>Summary</h2> <p>A large fraction of data engineering work involves moving data from one storage location to another in order to support different access and query patterns. Singlestore aims to cut down on the number of database engines ... Show More
41m 22s
Feb 2020
Data Modeling That Evolves With Your Business Using Data Vault
<div class="wp-block-jetpack-markdown"><h2>Summary</h2> <p>Designing the structure for your data warehouse is a complex and challenging process. As businesses deal with a growing number of sources and types of information that they need to integrate, they need a data modeling st ... Show More
1h 6m
Sep 2018
Data Engineering
If you’re a data scientist, you know how important it is to keep your data orderly, clean, moving smoothly between different systems, well-documented… there’s a ton of work that goes into building and maintaining databases and data pipelines. This job, that of owner and maintaine ... Show More
16m 22s
Oct 2021
On Graph Databases | The Backend Engineering Show
<p>I get a lot of emails asking me to talk about graph databases, so I want to start researching them, but I wanted to give you guys the framework of how I think about any databases to defuse any “magic” that might be there.</p> <p>In this video, I discuss what constrains a datab ... Show More
22m 27s
Jun 2021
Accelerating ML Training And Delivery With In-Database Machine Learning
<div class="wp-block-jetpack-markdown"><h2>Summary</h2> <p>When you build a machine learning model, the first step is always to load your data. Typically this means downloading files from object storage, or querying a database. To speed up the process, why not build the model in ... Show More
1h 5m
Aug 2022
An Exploration Of The Expectations, Ecosystem, and Realities Of Real-Time Data Applications
<div class="wp-block-jetpack-markdown"><h2>Summary</h2> <p>Data has permeated every aspect of our lives and the products that we interact with. As a result, end users and customers have come to expect interactions and updates with services and analytics to be fast and up to date ... Show More
1h 6m
Jun 2020
Bringing Business Analytics To End Users With GoodData
<div class="wp-block-jetpack-markdown"><h2>Summary</h2> <p>The majority of analytics platforms are focused on use internal to an organization by business stakeholders. As the availability of data increases and overall literacy in how to interpret it and take action improves ther ... Show More
52m 24s
Apr 2022
#83 Empowering the Modern Data Analyst
As data volumes grow and become ever-more complex, the role of the data analyst has never been more important. At the disposal of the modern data analyst, are tools that reduce time to insight, and increase collaboration. However, as the tools of a data analyst evolve, so do the ... Show More
37m 1s
Jan 2024
SingleStore CEO on High-Speed Database Currents
Enterprise data architecture is highly complex, databases deeply fragmented and demand for high-speed information flows continues to grow. In this edition of the Tech Disruptors podcast, SingleStore CEO Raj Verma joins Sunil Rajgopal, Bloomberg Intelligence senior software analys ... Show More
47m 26s