logo
episode-header-image
Mar 2024
3h 12m

Sholto Douglas & Trenton Bricken - How t...

Dwarkesh Patel
About this episode

Had so much fun chatting with my good friends Trenton Bricken and Sholto Douglas on the podcast.

No way to summarize it, except: 

This is the best context dump out there on how LLMs are trained, what capabilities they're likely to soon have, and what exactly is going on inside them.

You would be shocked how much of what I know about this field, I've learned just from talking with them.

To the extent that you've enjoyed my other AI interviews, now you know why.

So excited to put this out. Enjoy! I certainly did :)

Watch on YouTube. Listen on Apple PodcastsSpotify, or any other podcast platform. 

There's a transcript with links to all the papers the boys were throwing down - may help you follow along.

Timestamps

(00:00:00) - Long contexts

(00:16:12) - Intelligence is just associations

(00:32:35) - Intelligence explosion & great researchers

(01:06:52) - Superposition & secret communication

(01:22:34) - Agents & true reasoning

(01:34:40) - How Sholto & Trenton got into AI research

(02:07:16) - Are feature spaces the wrong way to think about intelligence?

(02:21:12) - Will interp actually work on superhuman models

(02:45:05) - Sholto’s technical challenge for the audience

(03:03:57) - Rapid fire



Get full access to Dwarkesh Podcast at www.dwarkeshpatel.com/subscribe
Up next
Jul 10
Stephen Kotkin — How Stalin Became the Most Powerful Dictator in History
The Stephen Kotkin episode. Kotkin is arguably the world’s foremost expert on Joseph Stalin and has written a massive 2-volume biography on him (with a 3rd volume in the works).No other individual had more of a profound impact on the 20th century than Stalin. He held the power of ... Show More
2h 12m
Jul 3
Why I don’t think AGI is right around the corner
I’ve had a lot of discussions on my podcast where we haggle out timelines to AGI. Some guests think it’s 20 years away - others 2 years. Here’s an audio version of where my thoughts stand as of June 2025. If you want to read the original post, you can check it out here. Get full ... Show More
14m 1s
Jun 2
Why I don’t think AGI is right around the corner
I’ve had a lot of discussions on my podcast where we haggle out timelines to AGI. Some guests think it’s 20 years away - others 2 years. Here’s where my thoughts stand as of June 2025. Get full access to Dwarkesh Podcast at www.dwarkesh.com/subscribe 
14m 1s
Recommended Episodes
May 2023
The AI Will See You Now: Exploring Biomedical AI and Google’s Med-PaLM2 With Karan Singhal
What if AI could revolutionize healthcare with advanced language learning models? Sarah and Elad welcome Karan Singhal, Staff Software Engineer at Google Research, who specializes in medical AI and the development of MedPaLM2. On this episode, Karan emphasizes the importance of s ... Show More
42m 48s
May 17
OpenAI whistleblower Daniel Kokotajlo on superintelligence and existential risk of AI
How much could our relationship with technology change by 2027? In the last few years, new artificial intelligence tools like ChatGPT and DeepSeek have transformed how we think about work, creativity, even intelligence itself. But tech experts are ringing alarm bells that powerfu ... Show More
38m 16s
Aug 2019
AI, Robot
Forget what sci-fi has told you about superintelligent robots that are uncannily human-like; the reality is more prosaic. Inside DeepMind’s robotics laboratory, Hannah explores what researchers call ‘embodied AI’: robot arms that are learning tasks like picking up plastic bricks, ... Show More
32m 33s
Sep 2024
The Frontier of Spatial Intelligence with Fei-Fei Li
Fei-Fei Li and Justin Johnson are pioneers in AI. While the world has only recently witnessed a surge in consumer AI, our guests have long been laying the groundwork for innovations that are transforming industries today.In this episode, a16z General Partner Martin Casado joins F ... Show More
44m 40s
Mar 2025
Apple's Siri-ous Problem + How Starlink Took Over the World + Is AI Making Us Dumb?
This week, as the long-promised new Siri faces increasing delays, we explore why Apple seems to be falling even further behind in artificial intelligence. Then, the New York Times reporter Adam Satariano joins us to explain how Elon Musk’s satellite internet provider Starlink too ... Show More
1h 2m
Sep 2024
The Road to Autonomous Intelligence with Andrej Karpathy
Andrej Karpathy joins Sarah and Elad in this week of No Priors. Andrej, who was a founding team member of OpenAI and former Senior Director of AI at Tesla, needs no introduction. In this episode, Andrej discusses the evolution of self-driving cars, comparing Tesla and Waymo’s app ... Show More
44m 16s
Apr 2024
777: Generative AI in Practice, with Bernard Marr
Generative AI is reshaping our world, and Bernard Marr, world-renowned futurist and best-selling author, joins Jon Krohn to guide us through this transformation. In this episode, Bernard shares his insights on how AI is transforming industries, revolutionizing daily life, and add ... Show More
1h 8m
Sep 2024
Human Data is Key to AI: Alex Wang from Scale AI
What if the key to unlocking AI's full potential lies not just in algorithms or compute, but in data? In this episode, a16z General Partner David George sits down with Alex Wang, founder and CEO of Scale AI, to discuss the crucial role of "frontier data" in advancing artificial i ... Show More
30m 56s
May 8
Industry Roundup #4: O3 & O4-mini, LLama 4’s Rocky Release & Google’s Agent Ecosystem
Welcome to DataFramed Industry Roundups! In this series of episodes, Adel & Richie sit down to discuss the latest and greatest in data & AI. In this episode, we touch upon the launch of OpenAI’s O3 and O4-mini models, Meta’s rocky release of Llama 4, Google’s new agent tooling ec ... Show More
44m 14s
Jun 15
AI PM Crash Course: Prototyping → Observability → Evals + Prompt Engineering vs RAG vs Fine-Tuning
Every PM has to build AI features these days. And with that means a completely new skill set:- AI prototyping- Observability, Akin to Telemetry- AI Evals: The New PRD for AI PMs- RAG v Fine-Tuning v Prompt Engineering- Working with AI EngineersSo, in today’s episode, I bring you ... Show More
2h 4m