logo
episode-header-image
Feb 2025
1h 6m

863: TabPFN: Deep Learning for Tabular D...

Jon Krohn
About this episode

Jon Krohn talks tabular data with Frank Hutter, Professor of Artificial Intelligence at Universität Freiburg in Germany. Despite the great steps that deep learning has made in analysing images, audio, and natural language, tabular data has remained its insurmountable obstacle. In this episode, Frank Hutter details the path he has found around this obstacle even with limited data by using a ground-breaking transformer architecture. Named TabPFN, this approach is vastly outperforming other architectures, as testified by a write up of TabPFN’s capabilities in Nature. Frank talks about his work on version 2 of TabPFN, the architecture’s cross-industry applicability, and how TabPFN is able to return accurate results with synthetic data.


This episode is brought to you by ODSC, the Open Data Science Conference. Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@superdatascience.com for sponsorship information.


In this episode you will learn:

  • (05:57) All about the TabPFN architecture 
  • (21:27) Use cases for Bayesian inference
  • (35:07) On getting published in Nature
  • (44:03) How TabPFN handles time series data
  • (51:52) All about Prior Labs


Additional materials: www.superdatascience.com/863

Up next
Today
930: In Case You Missed It in September 2025
Jon Krohn’s highlights from this month of interviews focus on ways to future-proof your career, looking at the hardware that will get you the most mileage, the emerging roles that are well worth a look, and the developments in AI that will endure in a field constantly testing the ... Show More
37m 25s
Oct 7
929: Dragon Hatchling: The Missing Link Between Transformers and the Brain, with Adrian Kosowski
Breaking news: Jon Krohn welcomes Adrian Kosowski to the show to talk about the groundbreaking research happening at Pathway. Adrian and his team demonstrate how they have brought attention in AI closer to the way the brain functions, creating, in essence, a “massively parallel s ... Show More
1h 14m
Oct 3
928: The “Lethal Trifecta”: Can AI Agents Ever Be Safe?
Prompt injections, malicious code, and AI agents: In this week’s Five-Minute Friday, Jon Krohn looks into the current security weaknesses found in AI systems. A structural vulnerability that The Economist dubs a “lethal trifecta” could cause havoc for AI users, unless we take the ... Show More
5m 55s
Recommended Episodes
Aug 26
From Academia to Industry: Bridging Data Engineering Challenges
SummaryIn this episode of the Data Engineering Podcast Professor Paul Groth, from the University of Amsterdam, talks about his research on knowledge graphs and data engineering. Paul shares his background in AI and data management, discussing the evolution of data provenance and ... Show More
50m 54s
Jan 2025
Breaking Down Data Silos: AI and ML in Master Data Management
Summary In this episode of the Data Engineering Podcast Dan Bruckner, co-founder and CTO of Tamr, talks about the application of machine learning (ML) and artificial intelligence (AI) in master data management (MDM). Dan shares his journey from working at CERN to becoming a data ... Show More
57m 30s
Oct 2022
AI Today Podcast: Applying CPMAI in the Real World, Interview with Andrew Stone, Maximus
It’s one thing for us to talk about the Cognitive Project Management for AI (CPMAI) Methodology and the benefits it can bring to managers running AI and advanced data projects, but hearing directly how individuals are applying the CPMAI Methodology can be incredibly valuable. In ... Show More
47m 26s
Feb 2025
The Future of Data Engineering: AI, LLMs, and Automation
Summary In this episode of the Data Engineering Podcast Gleb Mezhanskiy, CEO and co-founder of DataFold, talks about the intersection of AI and data engineering. He discusses the challenges and opportunities of integrating AI into data engineering, particularly using large langua ... Show More
59m 39s
Aug 2024
Launching the Fastest AI Inference Solution with Cerebras Systems CEO Andrew Feldman
In this episode of Gradient Dissent, Andrew Feldman, CEO of Cerebras Systems, joins host Lukas Biewald to discuss the latest advancements in AI inference technology. They explore Cerebras Systems' groundbreaking new AI inference product, examining how their wafer-scale chips are ... Show More
53m 14s
Aug 2023
Cuttlefish Model Tuning
Hongyi Wang, a Senior Researcher at the Machine Learning Department at Carnegie Mellon University, joins us. His research is in the intersection of systems and machine learning. He discussed his research paper, Cuttlefish: Low-Rank Model Training without All the Tuning, on today’ ... Show More
27m 8s
Jan 2025
Smart Talks with IBM: How Infrastructure is Powering the Age of AI
In this episode of Smart Talks with IBM, Malcolm Gladwell speaks with Ric Lewis, IBM’s Senior Vice President of Infrastructure.  They discuss how hardware capability has enabled the matrix math required to run large language models. Furthermore, they delve into some creative exam ... Show More
55m 34s
Feb 2025
#495: OSMnx: Python and OpenStreetMap
On this episode, I’m joined by Dr. Jeff Boeing, an assistant professor at the University of Southern California whose research spans urban planning, spatial analysis, and data science. We explore why OpenStreetMap is such a powerful source of global map data—and how Jeff’s Python ... Show More
1h 1m
Feb 2025
π0: A Foundation Model for Robotics with Sergey Levine - #719
Today, we're joined by Sergey Levine, associate professor at UC Berkeley and co-founder of Physical Intelligence, to discuss π0 (pi-zero), a general-purpose robotic foundation model. We dig into the model architecture, which pairs a vision language model (VLM) with a diffusion-ba ... Show More
52m 30s
Jul 16
Can AI Accelerate Science? Dr. Andy Beam on AI’s Next Frontier
Dr. Andy Beam has trained models, mentored scientists, and used data to quantify the value of treatments. In this episode of NEJM AI Grand Rounds, Raj Manrai turns the table on his co-host, reflecting on how Andy’s childhood misdiagnosis, and the failure of human recall, revealed ... Show More
1h 7m