logo
episode-header-image
May 2023
1h 8m

675: Pandas for Data Analysis and Visual...

Jon Krohn
About this episode
Wrangling data in Pandas, when to use Pandas, Matplotlib or Seaborn, and why you should learn to create Python packages: Jon Krohn speaks with guest Stefanie Molin, author of Hands-On Data Analysis with Pandas. This episode is brought to you by Posit, the open-source data science company, and by AWS Inferentia. Interested in sponsoring a SuperDataScience Pod ... Show More
Up next
Nov 21
942: Odds of AGI by 2040? LEAP Expert Forecasts and Workforce Implications
What’s on the horizon for AI? Jon Krohn wades through opinions from more than experts, curated by the Longitudinal Expert AI Panel (LEAP), about what we can expect from the industry. From estimates on AI-assisted workers through energy consumption to AI performance in highly skil ... Show More
6m 13s
Nov 18
941: Multi-Agent Human Societies, with Dr. Vijoy Pandey
Vijoy Pandey imagines a bold new society in which agents and humans make scientific discoveries and complete physical tasks together, and he tells Jon Krohn about his work at AGNTCY, Cisco’s open-source platform for the Internet of Agents. Listen to the episode to hear Vijoy Pand ... Show More
1h 6m
Nov 14
940: In Case You Missed It in October 2025
Jon Krohn curates a selection of clips from the month that was. Hear from the orchestrators of an expanding AI universe in this episode of In Case You Missed It, with news, views and groundbreaking ideas from Sheamus McGovern, Jerry Yurchisin, Stephanie Hare, Larissa Schneider, a ... Show More
43m 27s
Recommended Episodes
Aug 2024
#474: Python Performance for Data Science
See the full show notes for this episode on the website at <a href="https://talkpython.fm/474">talkpython.fm/474</a> 
1h 8m
Feb 2025
#495: OSMnx: Python and OpenStreetMap
See the full show notes for this episode on the website at <a href="https://talkpython.fm/495">talkpython.fm/495</a> 
1h 1m
Jul 2024
#471: Learning and teaching Pandas
See the full show notes for this episode on the website at <a href="https://talkpython.fm/471">talkpython.fm/471</a> 
1h 4m
Sep 9
What's New at CFI | Data Analysis in Python
Ready to take your data analysis skills to the next level? In this episode of What's New at CFI, we chat with subject matter expert Joseph Yeates about his newest course, Data Analysis in Python. This course is the perfect follow-up to our "Getting Started with Python" series and ... Show More
13m 33s
Dec 2024
#489: Anaconda Toolbox for Excel and more with Peter Wang
See the full show notes for this episode on the website at <a href="https://talkpython.fm/489">talkpython.fm/489</a> 
1h 9m
Dec 2024
#491: DuckDB and Python: Ducks and Snakes living together
See the full show notes for this episode on the website at <a href="https://talkpython.fm/491">talkpython.fm/491</a> 
1h 2m
Jul 2024
120: Don’t Learn Python as a Data Analyst (Learn This Instead)
<p>Although Python is talked about a lot in the data world, if you are aiming for your first data analyst role, I don’t think you should learn it.</p> <p>It takes too much time, it’s hard to learn, and it’s hard to use.</p> <p>In this episode, I’ll dive into more of the specifics ... Show More
8m 26s
Jul 2025
Revolutionizing Python Notebooks with Marimo
SummaryIn this episode of the Data Engineering Podcast Akshay Agrawal from Marimo discusses the innovative new Python notebook environment, which offers a reactive execution model, full Python integration, and built-in UI elements to enhance the interactive computing experience. ... Show More
51m 56s
Mar 2025
NVIDIA RAPIDS and Open Source ML Acceleration with Chris Deotte and Jean-Francois Puget
<p>NVIDIA RAPIDS is an open-source suite of GPU-accelerated data science and AI libraries. It leverages CUDA and significantly enhances the performance of core Python frameworks including Polars, pandas, scikit-learn and NetworkX. Chris Deotte is a Senior Data Scientist at NVIDIA ... Show More
42m 6s
May 2018
MLA 002 Numpy & Pandas
<div> <p>NumPy enables efficient storage and vectorized computation on large numerical datasets in RAM by leveraging contiguous memory allocation and low-level C/Fortran libraries, drastically reducing memory footprint compared to native Python lists. Pandas, built on top of NumP ... Show More
18m 10s