Apr 8
51m 45s

Teaching LLMs to Self-Reflect with Reinf...

Sam Charrington
About this episode
Today, we're joined by Maohao Shen, PhD student at MIT, to discuss his paper, “Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search.” We dig into how Satori leverages reinforcement learning to improve language model reasoning, enabling model self-reflection, self-correction, and exploration of alternative ...
Up next
Yesterday
Distilling Transformers and Diffusion Models for Robust Edge Use Cases with Fatih Porikli - #738
Today, we're joined by Fatih Porikli, senior director of technology at Qualcomm AI Research, for an in-depth look at several of Qualcomm's accepted papers and demos featured at this year’s CVPR conference. We start with “DiMA: Distilling Multi-modal Large Language Models for Auton ...
1 h
Jun 24
Building the Internet of Agents with Vijoy Pandey - #737
Today, we're joined by Vijoy Pandey, SVP and general manager at Outshift by Cisco, to discuss a foundational challenge for the enterprise: how do we make specialized agents from different vendors collaborate effectively? As companies like Salesforce, Workday, and Microsoft all dev ...
56m 13s
Jun 17
LLMs for Equities Feature Forecasting at Two Sigma with Ben Wellington - #736
Today, we're joined by Ben Wellington, deputy head of feature forecasting at Two Sigma. We dig into the team’s end-to-end approach to leveraging AI in equities feature forecasting, covering how they identify and create features, collect and quantify historical data, and build pre ...
59m 31s
Recommended Episodes
Aug 2023
Cuttlefish Model Tuning
Hongyi Wang, a Senior Researcher at the Machine Learning Department at Carnegie Mellon University, joins us. His research is at the intersection of systems and machine learning. He discussed his research paper, Cuttlefish: Low-Rank Model Training without All the Tuning, on today’ ...
27m 8s
Apr 2022
How The Pros Train (And What We Can Learn From It)
From top-level cyclists to world-class runners and speedskaters, pro training regimes offer a fascinating look into what it takes to count yourself among the world's best. Recent research among top-level athletes is also leading us to question entrenched training methods as sport ...
54m 53s
Oct 2024
Dr. Checcucci and Ass. Prof. Puliatti discuss new tech and standardisation trends in surgical training
In 'Episode 3' of the series "Voices of tomorrow's urologists (ESRU): Journeying through Europe's resident perspectives", Dr. Enrico Checcucci (IT) and Assoc. Prof. Stefano Puliatti (IT) discuss "Surgical training unveiled: Exploring new tech and standardisati ...
15m 8s
Nov 2024
How to Improve at Learning Using Neuroscience & AI | Dr. Terry Sejnowski
In this episode, my guest is Dr. Terry Sejnowski, Ph.D., professor of computational neurobiology at the Salk Institute for Biological Studies. He is world-renowned for exploring how our brain processes and stores information and, with that understanding, for developing tools that ...
2h 34m
Jan 2025
Erik Bernhardsson on Creating Tools That Make AI Feel Effortless
Today on No Priors, Elad chats with Erik Bernhardsson, founder and CEO of Modal Labs, a platform simplifying ML workflows by providing a serverless infrastructure designed to streamline deployment, scaling, and development for AI engineers. Erik talks about his early work on Spot ...
23m 36s
Nov 2024
AI and the Future of Math, with DeepMind’s AlphaProof Team
In this week’s episode of No Priors, Sarah and Elad sit down with the Google DeepMind team behind AlphaProof, a new reinforcement learning-based system for formal math reasoning that recently reached a silver-medal standard in solving International Mathematical Olympiad problems. ...
39m 21s
Feb 2025
Grok 3: The New AI Challenger
In this episode, Jaeden discusses the launch of Grok 3, the latest AI model from xAI, highlighting its capabilities, training methods, and performance benchmarks compared to competitors like OpenAI's ChatGPT. He shares personal experiences using Grok 3, including its reasoni ...
16m 45s
Jan 2025
AI vs. Human Educators: Comparing Learning Outcomes in Teaching Videos
In this episode of The TOEFL Speaking Prep Podcast for the AI Era, we explore groundbreaking research comparing AI and human educators in creating educational content. We dive into a study that pits AI-generated teaching videos against human-made ones, analyzing how ...
13m 19s
Feb 2025
863: TabPFN: Deep Learning for Tabular Data (That Actually Works!), with Prof. Frank Hutter
Jon Krohn talks tabular data with Frank Hutter, Professor of Artificial Intelligence at Universität Freiburg in Germany. Despite the great steps that deep learning has made in analysing images, audio, and natural language, tabular data has remained its insurmountable obstacle. In ...
1h 6m
Feb 2025
LLMs and Graphs Synergy
In this episode, Garima Agrawal, a senior researcher and AI consultant, brings her years of experience in data science and artificial intelligence. Listeners will learn about the evolving role of knowledge graphs in augmenting large language models (LLMs) for domain-specific task ...
34m 47s