logo
episode-header-image
Sep 2024
49m 13s

Jim Fan on Nvidia’s Embodied AI Lab and ...

Sequoia Capital
About this episode

AI researcher Jim Fan has had a charmed career. He was OpenAI’s first intern before he did his PhD at Stanford with “godmother of AI,” Fei-Fei Li. He graduated into a research scientist position at Nvidia and now leads its Embodied AI “GEAR” group. The lab’s current work spans foundation models for humanoid robots to agents for virtual worlds.


Jim describes a three-pronged data strategy for robotics, combining internet-scale data, simulation data and real world robot data. He believes that in the next few years it will be possible to create a “foundation agent” that can generalize across skills, embodiments and realities—both physical and virtual. He also supports Jensen Huang’s idea that “Everything that moves will eventually be autonomous.”


Hosted by: Stephanie Zhan and Sonya Huang, Sequoia Capital


Mentioned in this episode:

  • World of Bits: Early OpenAI project Jim worked on as an intern with Andrej Karpathy. Part of a bigger initiative called Universe
  • Fei-Fei Li: Jim’s PhD advisor at Stanford who founded the ImageNet project in 2010 that revolutionized the field of visual recognition, led the Stanford Vision Lab and just launched her own AI startup, World Labs
  • Project GR00T: Nvidia’s “moonshot effort” at a robotic foundation model, premiered at this year’s GTC
  • Thinking Fast and Slow: Influential book by Daniel Kahneman that popularized some of his teaching from behavioral economics
  • Jetson Orin chip: The dedicated series of edge computing chips Nvidia is developing to power Project GR00T
  • Eureka: Project by Jim’s team that trained a five finger robot hand to do pen spinning
  • MineDojo: A project Jim did when he first got to Nvidia that developed a platform for general purpose agents in the game of Minecraft. Won NeurIPS 2022 Outstanding Paper Award
  • ADI: artificial dog intelligence
  • Mamba: Selective State Space Models, an alternative architecture to Transformers that Jim is interested in (original paper here)


00:00 Introduction

01:35 Jim’s journey to embodied intelligence

04:53 The GEAR Group

07:32 Three kinds of data for robotics

10:32 A GPT-3 moment for robotics

16:05 Choosing the humanoid robot form factor

19:37 Specialized generalists

21:59 GR00T gets its own chip

23:35 Eureka and Issac Sim

25:23 Why now for robotics?

28:53 Exploring virtual worlds

36:28 Implications for games

39:13 Is the virtual world in service of the physical world?

42:10 Alternative architectures to Transformers

44:15 Lightning round

Up next
Aug 5
Vercel CEO Guillermo Rauch: Building the Generative Web with AI
Vercel CEO Guillermo Rauch has spent years obsessing over reducing the friction between having an idea and getting it online. Now with AI, he's achieving something even more ambitious: making software creation accessible to anyone with a keyboard. Guillermo explains how v0 has gr ... Show More
1 h
Jul 30
OpenAI’s IMO Team on Why Models Are Finally Solving Elite-Level Math
In just two months, a scrappy three-person team at OpenAI sprinted to fulfill what the entire AI field has been chasing for years—gold-level performance on the International Mathematical Olympiad problems. Alex Wei, Sheryl Hsu and Noam Brown discuss their unique approach using ge ... Show More
30m 10s
Jul 22
OpenAI Just Released ChatGPT Agent, Its Most Powerful Agent Yet
Isa Fulford, Casey Chu, and Edward Sun from OpenAI's ChatGPT agent team reveal how they combined Deep Research and Operator into a single, powerful AI agent that can perform complex, multi-step tasks lasting up to an hour. By giving the model access to a virtual computer with tex ... Show More
37m 36s
Recommended Episodes
Nov 2024
AI and the Future of Math, with DeepMind’s AlphaProof Team
In this week’s episode of No Priors, Sarah and Elad sit down with the Google DeepMind team behind AlphaProof, a new reinforcement learning-based system for formal math reasoning that recently reached a silver-medal standard in solving International Mathematical Olympiad problems. ... Show More
39m 21s
Jul 22
AI Just Achieved Something No One Thought it Would Until Years From Now
An experimental reasoning model from OpenAI and Deep Thinking model from Gemini just achieved a Gold Medal performance at the International Math Olympiad. In both cases, the models solved 5 out of 6 IMO problems without any external tools, using pure mathematical reasoning that r ... Show More
26m 5s
Aug 2024
The Zoom Election + Google DeepMind's Math Olympiad + HatGPT! Olympics Edition
This week, with hundreds of thousands of people joining online political rallies for Kamala Harris, we discuss whether 2024 is suddenly becoming the Zoom election, and what that means for both parties’ political organizing. Then, Pushmeet Kohli, a computer scientist at Google Dee ... Show More
1h 1m
Feb 2025
OpenAI researcher on why soft skills are the future of work | Karina Nguyen (Research at OpenAI, ex-Anthropic)
Karina Nguyen leads research at OpenAI, where she’s been pivotal in developing groundbreaking products like Canvas, Tasks, and the o1 language model. Before OpenAI, Karina was at Anthropic, where she led post-training and evaluation work for Claude 3 models, created a document up ... Show More
1h 14m
Feb 2025
Scaling AI: Building the Right AI Team
You’re smart. You know your business. But do you know how to build the right AI team? It’s harder than it looks, and the old playbook won’t cut it. In this episode, host Courtney Baker is joined by CEO David DeWolf, Chief Product & Technology Officer Mohan Rao, and NordLight CEO ... Show More
33m 21s
Apr 2024
Applying CPMAI Methodology in the real world: Interview with George Fountain, Booz Allen Hamilton (BAH) [AI Today Podcast]
Companies of all sizes in every industry are looking to see how Artificial Intelligence (AI), machine learning (ML), and cognitive technology projects can provide them a competitive edge. They want to provide efficiencies and improve ROI in today’s competitive landscape. As a res ... Show More
13m 4s
Jul 20
Anthropic co-founder on quitting OpenAI, AGI predictions, $100M talent wars, 20% unemployment, and the nightmare scenarios keeping him up at night | Ben Mann
Benjamin Mann is a co-founder of Anthropic, an AI startup dedicated to building aligned, safety-first AI systems. Prior to Anthropic, Ben was one of the architects of GPT-3 at OpenAI. He left OpenAI driven by the mission to ensure that AI benefits humanity. In this episode, Ben o ... Show More
1h 14m
Jul 22
Are World Models the Key to AGI?
A groundbreaking Harvard study trained AI on 10 million solar systems and found it perfectly predicted orbits but completely failed to understand gravity, raising questions about whether LLMs can develop true world models. While companies pour billions into scaling, Meta's Yann L ... Show More
21m 28s
Feb 2025
AI won't plateau — if we give it time to think | Noam Brown
To get smarter, traditional AI models rely on exponential increases in the scale of data and computing power. Noam Brown, a leading research scientist at OpenAI, presents a potentially transformative shift in this paradigm. He reveals his work on OpenAI's new o1 model, which focu ... Show More
13m 28s
Apr 2025
Inside monday.com’s transformation: radical transparency, impact over output, and their path to $1B ARR | Daniel Lereya (Chief Product and Technology Officer)
Daniel Lereya, the Chief Product and Technology Officer at monday.com, shares how he and his team realized they were being outpaced by competitors and how that realization completely transformed how they operate and allowed them to build a global powerhouse, doing over $1 billion ... Show More
1h 32m