logo
episode-header-image
Feb 2025
2h 14m

Jeff Dean & Noam Shazeer — 25 years at G...

Dwarkesh Patel
About this episode

This week I welcome on the show two of the most important technologists ever, in any field.

Jeff Dean is Google's Chief Scientist, and through 25 years at the company, has worked on basically the most transformative systems in modern computing: from MapReduce, BigTable, Tensorflow, AlphaChip, to Gemini.

Noam Shazeer invented or co-invented all the main architectures and techniques that are used for modern LLMs: from the Transformer itself, to Mixture of Experts, to Mesh Tensorflow, to Gemini and many other things.

We talk about their 25 years at Google, going from PageRank to MapReduce to the Transformer to MoEs to AlphaChip – and maybe soon to ASI.

My favorite part was Jeff's vision for Pathways, Google’s grand plan for a mutually-reinforcing loop of hardware and algorithmic design and for going past autoregression. That culminates in us imagining *all* of Google-the-company, going through one huge MoE model.

And Noam just bites every bullet: 100x world GDP soon; let’s get a million automated researchers running in the Google datacenter; living to see the year 3000.Watch on Youtube; listen on Apple Podcasts or Spotify.

Sponsors

Scale partners with major AI labs like Meta, Google Deepmind, and OpenAI. Through Scale’s Data Foundry, labs get access to high-quality data to fuel post-training, including advanced reasoning capabilities. If you’re an AI researcher or engineer, learn about how Scale’s Data Foundry and research lab, SEAL, can help you go beyond the current frontier at scale.com/dwarkesh

Curious how Jane Street teaches their new traders? They use Figgie, a rapid-fire card game that simulates the most exciting parts of markets and trading. It’s become so popular that Jane Street hosts an inter-office Figgie championship every year. Download from the app store or play on your desktop at figgie.com

Meter wants to radically improve the digital world we take for granted. They’re developing a foundation model that automates network management end-to-end. To do this, they just announced a long-term partnership with Microsoft for tens of thousands of GPUs, and they’re recruiting a world class AI research team. To learn more, go to meter.com/dwarkesh

To sponsor a future episode, visit dwarkeshpatel.com/p/advertise

Timestamps

00:00:00 - Intro

00:02:44 - Joining Google in 1999

00:05:36 - Future of Moore's Law

00:10:21 - Future TPUs

00:13:13 - Jeff’s undergrad thesis: parallel backprop

00:15:10 - LLMs in 2007

00:23:07 - “Holy s**t” moments

00:29:46 - AI fulfills Google’s original mission

00:34:19 - Doing Search in-context

00:38:32 - The internal coding model

00:39:49 - What will 2027 models do?

00:46:00 - A new architecture every day?

00:49:21 - Automated chip design and intelligence explosion

00:57:31 - Future of inference scaling

01:03:56 - Already doing multi-datacenter runs

01:22:33 - Debugging at scale

01:26:05 - Fast takeoff and superalignment

01:34:40 - A million evil Jeff Deans

01:38:16 - Fun times at Google

01:41:50 - World compute demand in 2030

01:48:21 - Getting back to modularity

01:59:13 - Keeping a giga-MoE in-memory

02:04:09 - All of Google in one model

02:12:43 - What’s missing from distillation

02:18:03 - Open research, pros and cons

02:24:54 - Going the distance



Get full access to Dwarkesh Podcast at www.dwarkesh.com/subscribe
Up next
Nov 12
Satya Nadella — How Microsoft is preparing for AGI
<p>As part of this interview, Satya Nadella gave Dylan Patel (founder of <a target="_blank" href="https://semianalysis.com/">SemiAnalysis</a>) and me an exclusive first-look at their brand-new Fairwater 2 datacenter.</p><p>Microsoft is building multiple Fairwaters, each of which ... Show More
1h 27m
Oct 31
Sarah Paine – How Russia sabotaged China's rise
In this lecture, military historian Sarah Paine explains how Russia—and specifically Stalin—completely derailed China’s rise, slowing them down for over a century.This lecture was particularly interesting to me because, in my opinion, the Chinese Civil War is 1 of the top 3 most ... Show More
1h 30m
Oct 17
Andrej Karpathy — AGI is still a decade away
The Andrej Karpathy episode.During this interview, Andrej explains why reinforcement learning is terrible (but everything else is much worse), why AGI will just blend into the previous ~2.5 centuries of 2% GDP growth, why self driving took so long to crack, and what he sees as th ... Show More
2h 25m
Recommended Episodes
Apr 2024
Measuring The Speed of AI Through Benchmarks
<p dir="ltr">David Kanter, Executive Director at MLCommons, discusses the work they're doing with MLPerf Benchmarks, creating the world's first industry standard approach to measuring AI speed and safety. He also shares ways they're testing AI and LLMs for harm, to measure—and, o ... Show More
31m 45s
Jul 2025
Inside Google's AI Lab: Drug Discovery, World AI Model & AlphaEvolve
Want the ultimate guide to Google's Gemini? Get it here: https://clickhubspot.com/evt Episode 68: How is Google DeepMind pushing the boundaries of AI to tackle drug discovery, robotics, and even autonomous AI agents? Matt Wolfe (https://x.com/mreflow) sits down with DeepMind CEO ... Show More
17m 29s
Oct 6
Google: The AI Company
Google faces the greatest innovator's dilemma in history. They invented the Transformer — the breakthrough technology powering every modern AI system from ChatGPT to Claude (and, of course, Gemini). They employed nearly all the top AI talent: Ilya Sutskever, Geoff Hinton, Demis H ... Show More
4h 6m
Oct 2
When Will AI Make Scientific Discoveries?
Today’s AI Daily Brief asks when artificial intelligence will begin making real scientific discoveries. We look at Periodic Labs, which just raised more than $300 million to build AI scientists and autonomous labs for physics and chemistry, and Thinking Machines, which is creatin ... Show More
24m 25s
Sep 2024
Decoding Google Gemini with Jeff Dean
Professor Hannah Fry is joined by Jeff Dean, one of the most legendary figures in computer science and chief scientist of Google DeepMind and Google Research. Jeff was instrumental to the field in the late 1990s, writing the code that transformed Google from a small startup into ... Show More
53m 2s
Sep 23
How Microsoft is Fixing the Biggest AI Agent Problem
Want the guide to create AI Agents? get it here: https://clickhubspot.com/fhc Episode 77: Are we nearing a future where AI agents can autonomously tackle our biggest challenges—while remaining efficient, safe, and truly aligned with human goals? Matt Wolfe (https://x.com/mreflow) ... Show More
30m 8s
Nov 2024
NVIDIA's Jensen Huang on AI Chip Design, Scaling Data Centers, and his 10-Year Bets
In this week’s episode of No Priors, Sarah and Elad sit down with Jensen Huang, CEO of NVIDIA, for the second time to reflect on the company’s extraordinary growth over the past year. Jensen discusses AI’s takeover of datacenters and NVIDIA’s rapid development of x.AI’s superclus ... Show More
36m 53s
Mar 2025
How AI is saving billions of years of human research time | Max Jaderberg
<p>Can AI compress the years long research time of a PhD into seconds? Research scientist Max Jaderberg explores how “AI analogs” simulate real-world lab work with staggering speed and scale, unlocking new insights on protein folding and drug discovery. Drawing on his experience ... Show More
19m 15s
Jan 2025
#229 Mitesh Agrawal: Why Lambda Labs' AI Cloud Is a Game-Changer for Developers
<p dir="ltr">This episode is sponsored by Netsuite by Oracle, the number one cloud financial system, streamlining accounting, financial management, inventory, HR, and more.</p> <p><strong> </strong></p> <p dir="ltr">NetSuite is offering a one-of-a-kind flexible financing program. ... Show More
56m 7s
Dec 2024
Harvard Releases AI Training Dataset, Google Releases Gemini 2.0, and Two New Types of Infinity
We&#39;re experimenting and would love to hear from you!In today&apos;s episode of Discover Daily, we begin with a development for artificial intelligence research. Harvard University has unveiled a comprehensive AI training dataset, marking a significant step forward in democrat ... Show More
10m 21s