logo
episode-header-image
May 2023
1h 8m

675: Pandas for Data Analysis and Visual...

Jon Krohn
About this episode
Wrangling data in Pandas, when to use Pandas, Matplotlib or Seaborn, and why you should learn to create Python packages: Jon Krohn speaks with guest Stefanie Molin, author of Hands-On Data Analysis with Pandas. This episode is brought to you by Posit, the open-source data science company, and by AWS Inferentia. Interested in sponsoring a SuperDataScience Pod ... Show More
Up next
Jan 30
962: Wharton Prof Ethan Mollick on Why Your AI Strategy Is Already Obsolete
Bestselling author of Co-Intelligence: Living and Working with AI Ethan Mollick speaks to Jon Krohn about just how much US firms have to gain from a willingness to adopt and experiment with AI, as well as the reality behind AI use among employees and the frontier models set to su ... Show More
12m 23s
Jan 27
961: Distributed Artificial Superintelligence, with Dr. Vijoy Pandey
Dr. Vijoy Pandey returns to the show to talk to Jon Krohn about Cisco’s work to advance medicine and mitigate the impact of climate change with distributed artificial super-intelligence. Dr. Vijoy Pandey believes in a future where humans and AI agents work together to tackle our ... Show More
1h 9m
Jan 23
960: In Case You Missed It in December 2025
For 2026’s first episode of In Case You Missed It (ICYMI), Jon Krohn selects 6 clips from December for a wide-ranging look at the current state of AI in business and beyond. Hear from Joel Beasley (Episode 945), Jeff Li (Episode 947), Sandy Pentland (Episode 949), Josh Clemm (Epi ... Show More
40m 45s
Recommended Episodes
Aug 2024
#474: Python Performance for Data Science
See the full show notes for this episode on the website at <a href="https://talkpython.fm/474">talkpython.fm/474</a> 
1h 8m
Feb 2025
#495: OSMnx: Python and OpenStreetMap
See the full show notes for this episode on the website at <a href="https://talkpython.fm/495">talkpython.fm/495</a> 
1h 1m
Jul 2024
#471: Learning and teaching Pandas
See the full show notes for this episode on the website at <a href="https://talkpython.fm/471">talkpython.fm/471</a> 
1h 4m
Sep 2025
What's New at CFI | Data Analysis in Python
Ready to take your data analysis skills to the next level? In this episode of What's New at CFI, we chat with subject matter expert Joseph Yeates about his newest course, Data Analysis in Python. This course is the perfect follow-up to our "Getting Started with Python" series and ... Show More
13m 33s
Dec 2024
#489: Anaconda Toolbox for Excel and more with Peter Wang
See the full show notes for this episode on the website at <a href="https://talkpython.fm/489">talkpython.fm/489</a> 
1h 9m
Dec 2024
#491: DuckDB and Python: Ducks and Snakes living together
See the full show notes for this episode on the website at <a href="https://talkpython.fm/491">talkpython.fm/491</a> 
1h 2m
Jul 2024
120: Don’t Learn Python as a Data Analyst (Learn This Instead)
<p>Although Python is talked about a lot in the data world, if you are aiming for your first data analyst role, I don’t think you should learn it.</p> <p>It takes too much time, it’s hard to learn, and it’s hard to use.</p> <p>In this episode, I’ll dive into more of the specifics ... Show More
9m 32s
Jul 2025
Revolutionizing Python Notebooks with Marimo
SummaryIn this episode of the Data Engineering Podcast Akshay Agrawal from Marimo discusses the innovative new Python notebook environment, which offers a reactive execution model, full Python integration, and built-in UI elements to enhance the interactive computing experience. ... Show More
51m 56s
Mar 2025
NVIDIA RAPIDS and Open Source ML Acceleration with Chris Deotte and Jean-Francois Puget
<p>NVIDIA RAPIDS is an open-source suite of GPU-accelerated data science and AI libraries. It leverages CUDA and significantly enhances the performance of core Python frameworks including Polars, pandas, scikit-learn and NetworkX. Chris Deotte is a Senior Data Scientist at NVIDIA ... Show More
42m 6s
May 2018
MLA 002 Numpy & Pandas
<div> <p>NumPy enables efficient storage and vectorized computation on large numerical datasets in RAM by leveraging contiguous memory allocation and low-level C/Fortran libraries, drastically reducing memory footprint compared to native Python lists. Pandas, built on top of NumP ... Show More
18m 10s