Aug 2021
48m 39s

Prepare Your Unstructured Data For Machi...

Tobias Macey
About this episode
Up next
Jan 25
Logical First, Physical Second: A Pragmatic Path to Trusted Data
Summary: In this episode of the Data Engineering Podcast, Jamie Knowles, Product Director for ER/Studio, talks about data architecture and its importance in driving business meaning. He discusses how data architecture should start with business meaning, not just physical schemas, a ...
40m 50s
Jan 18
Your Data, Your Lake: How Observe Uses Iceberg and Streaming ETL for Observability
Summary: In this episode, Jacob Leverich, cofounder and CTO of Observe, talks about applying lakehouse architectures to observability workloads. Jacob discusses Observe’s decision to leverage cloud-native warehousing and open table formats for scale and cost efficiency. He digs int ...
1h 12m
Jan 12
Semantic Operators Meet Dataframes: Building Context for Agents with FENIC
Summary: In this episode, Kostas Pardalis talks about Fenic, an open-source, PySpark-inspired dataframe engine designed to bring LLM-powered semantics into reliable data engineering workflows. Kostas shares why today’s data infrastructure assumptions (BI-first, expert-operated, CP ...
56m 42s
Recommended Episodes
Mar 2022
Bayesian Machine Learning with Ravin Kumar (Ep. 191)
This is one episode where passions for math, statistics, and computers merge. I have a very interesting conversation with Ravin, a data scientist at Google, where he uses data to inform decisions. He has previously worked at Sweetgreen, designing systems that would b ...
31m 12s
Nov 2021
Time Plus Data Equals Efficiency with Paul Dix, the Founder and CTO of InfluxData and the Creator of InfluxDB
If the topic of databases is brought up to certain people, their eyes may glaze over. But if that happens, it is because they just don’t know the awesome power of databases. Data can be valuable but only if it is contextualized, and time is an extremely relevant aspec ...
36m 4s
Feb 2023
Shorten the distance between production data and insight
Modern networked applications generate a lot of data, and every business wants to make the most of that data. Most of the time, that means moving production data through some transformation process to get it ready for the analytics process. But what if you could have in-app an ...
20m 27s
Aug 2018
The Future of Computing
In this episode, we are joined by Alex Wright-Gladstein, CEO and co-founder of Ayar Labs. Ayar Labs has developed new electronic-photonic integrated circuits that move data using light instead of electricity. Alex shares exciting insights around the future of computing ...
29m 8s
Mar 2021
Solving the World's Biggest Problems at Scale, with WekaIO President, Ken Grohe
The No. 1 feature of technology is storage. Ok, so that’s not true. But, it’s one of the most critical pieces of hardware that enables software to function. How fast, how easy, and how much data can be accessed and leveraged inside of applications plays a critical part in tech ...
48m 5s
Mar 2022
Mining the Golden Age of Data with Tableau’s CEO & President Mark Nelson
Mark Nelson (https://www.linkedin.com/in/markthomasnelson/) is the President and CEO of Tableau (https://www.tableau.com/), a company dedicated to democratizing analytics and putting data back in the hands of consumers. But while this digital pioneer ma ...
36m 32s
Jun 2024
Making ETL pipelines a thing of the past
RelationalAI’s first big partner is Snowflake (https://relational.ai/resources/introducing-first-ai-coprocessor), meaning customers can now start using their data with GenAI without worrying about the privacy, security, and governance hassle that wo ...
26m 13s
Jun 2022
Using AI to Supercharge Data-Driven Applications with Zilliz
Theo is in the interviewer’s chair for this episode as Frank Liu from Zilliz joins the show to talk about how AI and machine learning are making it possible for developers to understand and extract more value from unstructured data such as text, audio, images, video, and more. Tr ...
20m