logo
episode-header-image
Yesterday
46m 37s

When AI Hears Thunder But Misses the Fea...

FRANCESCO GADALETA
About this episode

Sanjoy Chowdhury reveals AI's hidden weakness: while systems can see objects and hear sounds perfectly, they can't reason across senses like humans do. His research at University of Maryland College Park, including the Meerkat model and AVTrustBench, exposes why AI recognizes worried faces and thunder separately but fails to connect them—and what this means for self-driving cars and medical AI.

 

Sponsors

This episode is proudly sponsored by Amethix Technologies. At the intersection of ethics and engineering, Amethix creates AI systems that don’t just function—they adapt, learn, and serve. With a focus on dual-use innovation, Amethix is shaping a future where intelligent machines extend human capability, not replace it. Discover more at https://amethix.com

 

This episode is brought to you by Intrepid AI. From drones to satellites, Intrepid AI gives engineers and defense innovators the tools to prototype, simulate, and deploy autonomous systems with confidence. Whether it's in the sky, on the ground, or in orbit—if it's intelligent and mobile, Intrepid helps you build it. Learn more at intrepid.ai

 

Resources:

  1. The first audio-visual LLM with fine-grained understanding: Meerkat: Audio-Visual Large Language Model for Grounding in Space and Time (Accepted at ECCV 2024)
  2. Benchmark for evaluating the robustness to adversarial attacks, compositional reasoning: AVTrustBench: Assessing and Enhancing Reliability and Robustness in Audio-Visual LLMs (Accepted at ICCV 2025)
  3. First audio-visual reasoning evaluation benchmark and test time reasoning distillation pipeline AURELIA: Test-time Reasoning Distillation in Audio-Visual LLMs Accepted at ICCV 2025
  4. For a detailed list of Sanjoy's work, please visit his webpage: https://schowdhury671.github.io/
Up next
Sep 16
Why VCs Are Funding $100M Remote Control Toys (Ep. 290)
This episode exposes the uncomfortable truth: most defense tech startups are just software engineers cosplaying as military innovators, creating fragmented solutions that Pentagon doesn't need. Not now, at least. References War On The Rocks: https://warontherocks.com/2025/08/ukra ... Show More
39m 56s
Aug 29
How Hacker Culture Died (Ep. 289)
A nostalgic dive into the rise and fall of true hacker culture - from MIT's curious tinkerers to today's hustle-obsessed "founders." Plus, why IRC was peak internet and what we lost when convenience killed community. For anyone who misses when coding was about elegance, not exits ... Show More
44m 57s
Aug 5
Robots Suck (But It’s Not Their Fault) (Ep. 288)
We were promised robot butlers and got Roombas that cry under the couch. In this brutally honest (and slightly hilarious) episode, Francesco dives into why the robot revolution fizzled, why your dishwasher still needs you, and how robotics became more YouTube circus than househol ... Show More
16m 16s
Recommended Episodes
Feb 2017
MLG 001 Introduction
Show notes: ocdevel.com/mlg/1. MLG teaches the fundamentals of machine learning and artificial intelligence. It covers intuition, models, math, languages, frameworks, etc. Where your other ML resources provide the trees, I provide the forest. Consider MLG your syllabus, with high ... Show More
8m 11s
Apr 2017
Feature Processing for Text Analytics
It seems like every day there's more and more machine learning problems that involve learning on text data, but text itself makes for fairly lousy inputs to machine learning algorithms.  That's why there are text vectorization algorithms, which re-format text data so it's ready f ... Show More
17m 28s
Feb 2017
MLG 004 Algorithms - Intuition
Try a walking desk while studying ML or working on your projects! Overview of machine learning algorithms. Infer/predict, error/loss, train/learn. Supervised, unsupervised, reinforcement learning. ocdevel.com/mlg/4 for notes and resources 
22m 56s
Mar 2017
MLG 010 Languages & Frameworks
Try a walking desk while studying ML or working on your projects! Languages & frameworks comparison. Languages: Python, R, MATLAB/Octave, Julia, Java/Scala, C/C++. Frameworks: Hadoop/Spark, Deeplearning4J, Theano, Torch, TensorFlow. ocdevel.com/mlg/10 for notes and resources 
44m 36s
Jun 2024
SE Radio 622: Wolf Vollprecht on Python Tooling in Rust
Wolf Vollprecht, the CEO and founder of Prefix.dev, speaks with host Gregory M. Kapfhammer about how to implement Python tools, such as package managers, in the Rust programming language. They discuss the challenges associated with building Python infrastructure tooling in Python ... Show More
55m 10s
Jul 2023
AI Today Podcast: AI Glossary Series – Automated Machine Learning (AutoML)
In this episode of the AI Today podcast hosts Kathleen Walch and Ron Schmelzer define the term Automated Machine Learning (AutoML), explain how this term relate to AI and why it’s important to know about them. Show Notes: FREE Intro to CPMAI mini course CPMAI Training and Certifi ... Show More
9m 11s
Jul 2023
AI Today Podcast: AI Glossary Series – Data Science Notebooks, Jupyter, Colab
In this episode of the AI Today podcast hosts Kathleen Walch and Ron Schmelzer define the terms Data Science Notebooks, Jupyter, Colab, explain how these terms relate to AI and why it’s important to know about them. Show Notes: FREE Intro to CPMAI mini course CPMAI Training and C ... Show More
11 m
Mar 2017
MLG 009 Deep Learning
Try a walking desk while studying ML or working on your projects! Deep learning and neural networks. How to stack our logisitic regression units into a multi-layer perceptron. ocdevel.com/mlg/9 for notes and resources 
51m 28s
Nov 2024
SE Radio 641: Catherine Nelson on Machine Learning in Data Science
Catherine Nelson, author of the new O’Reilly book, Software Engineering for Data Scientists, discusses the collaboration between data scientists and software engineers -- an increasingly common pairing on machine learning and AI projects. Host Philip Winston speaks with Nelson ab ... Show More
48m 19s
Aug 2024
#474: Python Performance for Data Science
Python performance has come a long way in recent times. And it's often the data scientists, with their computational algorithms and large quantities of data, who care the most about this form of performance. It's great to have Stan Seibert back on the show to talk about Python's ... Show More
1h 8m