Aug 2024
46m 51s

Genie: Generative Interactive Environmen...

Sam Charrington
About this episode

Today, we're joined by Ashley Edwards, a member of technical staff at Runway, to discuss Genie: Generative Interactive Environments, a system for creating ‘playable’ video environments for training deep reinforcement learning (RL) agents at scale in a completely unsupervised manner. We explore the motivations behind Genie, the challenges of data acquisition for RL, and Genie’s ability to learn world models from videos without explicit action data, enabling seamless interaction and frame prediction. Ashley walks us through Genie’s core components (the latent action model, the video tokenizer, and the dynamics model) and explains how they work together to predict future frames in video sequences. We discuss the model architecture, training strategies, and benchmarks used, as well as the application of spatiotemporal transformers and MaskGIT techniques for efficient token prediction and representation. Finally, we touch on Genie’s practical implications, how it compares with other video generation models such as Sora, and potential future directions in video generation and diffusion models.


The complete show notes for this episode can be found at https://twimlai.com/go/696.
