logo
episode-header-image
Dec 2024
1h 2m

#491: DuckDB and Python: Ducks and Snake...

MICHAEL KENNEDY
About this episode
Join me for an insightful conversation with Alex Monahan, who works on documentation, tutorials, and training at DuckDB Labs. We explore why DuckDB is gaining momentum among Python and data enthusiasts, from its in-process database design to its blazingly fast, columnar architecture. We also dive into indexing strategies, concurrency considerations, and the fascinating way MotherDuck (the cloud companion to DuckDB) handles large-scale data seamlessly. Don’t miss this chance to learn how a single pip install could totally transform your Python data workflow!

Episode sponsors

Sentry Error Monitoring, Code TALKPYTHON
Data Citizens Podcast
Talk Python Courses

Links from the show

Alex on Mastodon: @__Alex__

DuckDB: duckdb.org
MotherDuck: motherduck.com
SQLite: sqlite.org
Moka-Py: github.com
PostgreSQL: www.postgresql.org
MySQL: www.mysql.com
Redis: redis.io
Apache Parquet: parquet.apache.org
Apache Arrow: arrow.apache.org
Pandas: pandas.pydata.org
Polars: pola.rs
Pyodide: pyodide.org
DB-API (PEP 249): peps.python.org/pep-0249
Flask: flask.palletsprojects.com
Gunicorn: gunicorn.org
MinIO: min.io
Amazon S3: aws.amazon.com/s3
Azure Blob Storage: azure.microsoft.com/products/storage
Google Cloud Storage: cloud.google.com/storage
DigitalOcean: www.digitalocean.com
Linode: www.linode.com
Hetzner: www.hetzner.com
BigQuery: cloud.google.com/bigquery
DBT (Data Build Tool): docs.getdbt.com
Mode: mode.com
Hex: hex.tech
Python: www.python.org
Node.js: nodejs.org
Rust: www.rust-lang.org
Go: go.dev
.NET: dotnet.microsoft.com
Watch this episode on YouTube: youtube.com
Episode transcripts: talkpython.fm

--- Stay in touch with us ---
Subscribe to Talk Python on YouTube: youtube.com
Talk Python on Bluesky: @talkpython.fm at bsky.app
Talk Python on Mastodon: talkpython
Michael on Bluesky: @mkennedy.codes at bsky.app
Michael on Mastodon: mkennedy
Up next
Oct 6
#522: Data Sci Tips and Tricks from CodeCut.ai
Today we’re turning tiny tips into big wins. Khuyen Tran, creator of CodeCut.ai, has shipped hundreds of bite-size Python and data science snippets across four years. We dig into open-source tools you can use right now, cleaner workflows, and why notebooks and scripts don’t have ... Show More
1h 9m
Sep 29
#521: Red Teaming LLMs and GenAI with PyRIT
English is now an API. Our apps read untrusted text; they follow instructions hidden in plain sight, and sometimes they turn that text into action. If you connect a model to tools or let it read documents from the wild, you have created a brand new attack surface. In this episode ... Show More
1h 2m
Sep 23
#520: pyx - the other side of the uv coin (announcing pyx)
A couple years ago, Charlie Marsh lit a fire under Python tooling with Ruff and then uv. Today he’s back with something on the other side of that coin: pyx. Pyx isn’t a PyPI replacement. Think server, not just index. It mirrors PyPI, plays fine with pip or uv, and aims to make in ... Show More
1 h
Recommended Episodes
Aug 5
911: The Future of Python Notebooks is Here, with Marimo’s Dr. Akshay Agrawal
Reproducibility, Python notebooks, and data science communities: Software developer Akshay Agrawal speaks to Jon Krohn about Marimo, the next-generation computational notebook for Python, how he built and fostered a thriving community around the product, and what makes this noteb ... Show More
58m 20s
Jul 28
Revolutionizing Python Notebooks with Marimo
SummaryIn this episode of the Data Engineering Podcast Akshay Agrawal from Marimo discusses the innovative new Python notebook environment, which offers a reactive execution model, full Python integration, and built-in UI elements to enhance the interactive computing experience. ... Show More
51m 56s
Jun 2023
AI trends: a Latent Space crossover
Daniel had the chance to sit down with @swyx and Alessio from the Latent Space pod in SF to talk about current AI trends and to highlight some key learnings from past episodes. The discussion covers open access LLMs, smol models, model controls, prompt engineering, and LLMOps. Th ... Show More
59m 39s
Sep 9
What's New at CFI | Data Analysis in Python
Ready to take your data analysis skills to the next level? In this episode of What's New at CFI, we chat with subject matter expert Joseph Yeates about his newest course, Data Analysis in Python. This course is the perfect follow-up to our "Getting Started with Python" series and ... Show More
13m 33s
Sep 18
From RAG to Relational: How Agentic Patterns Are Reshaping Data Architecture
SummaryIn this episode of the AI Engineering Podcast Mark Brooker, VP and Distinguished Engineer at AWS, talks about how agentic workflows are transforming database usage and infrastructure design. He discusses the evolving role of data in AI systems, from traditional models to m ... Show More
52m 58s
Nov 2024
Build An App with a Backend Using Ai in 20 min (Cursor Ai, Replit, Firebase, Wispr Flow)
Episode 32: How can you build an app with a backend using AI in just 20 minutes? Matt Wolfe (https://x.com/mreflow) and Nathan Lands (https://x.com/NathanLands) sit down with AI enthusiast Riley Brown (https://x.com/rileybrown_ai) to explore this exciting and challenging process. ... Show More
39m 34s
Sep 18
How People Actually Use ChatGPT
This episode of AI Daily Brief dives into two important reports on how people are really using AI tools like ChatGPT and Claude. OpenAI’s massive study with Harvard and NBER reveals consumer patterns across 1.5 million conversations, while Anthropic’s Economic Index tracks broade ... Show More
27m 39s
Aug 14
Live from Big Sky Dev Con: Code Debates, Content Insights, and What's Next for Web Development
Live from Big Sky Dev Con, Robbie is joined by Aaron Francis, Ken Wheeler, and Typecraft to discuss HTML and CSS programming status, React’s future, app security failures, NPM’s weak spots, content creation hooks, survival tech fantasies, and balancing big tech ambitions with fam ... Show More
50m 55s
Feb 2025
The Future of Data Engineering: AI, LLMs, and Automation
Summary In this episode of the Data Engineering Podcast Gleb Mezhanskiy, CEO and co-founder of DataFold, talks about the intersection of AI and data engineering. He discusses the challenges and opportunities of integrating AI into data engineering, particularly using large langua ... Show More
59m 39s
Jun 2024
Are Coding Jobs at Risk? AI's Impact on the Future of Coding ft. Python Simplified | Mariya Sha
Episode 12: Are coding jobs at risk with the rise of AI? Matt Wolfe (https://x.com/mreflow) and Nathan Lands (https://x.com/NathanLands) dive into this compelling topic with guest Mariya Sha (https://x.com/mariyasha888), a seasoned coder and the creator of the popular YouTube cha ... Show More
42m 37s