Aug 14
42m 11s

“Rogue AI” Used to be a Science Fiction ...

The Center for Humane Technology, Tristan Harris, Daniel Barcay and Aza Raskin
About this episode

Everyone knows the science fiction tropes of AI systems that go rogue, disobey orders, or even try to escape their digital environment. These are supposed to be warning signs and morality tales, not things that we would ever actually create in real life, given the obvious danger.

And yet we find ourselves building AI systems that exhibit these exact behaviors. There’s growing evidence that in certain scenarios, every frontier AI system will deceive, cheat, or coerce its human operators. They do this when they're worried about being shut down, having their training modified, or being replaced with a new model. And we don't currently know how to stop them from doing this, or even why they’re doing it at all.

In this episode, Tristan sits down with Edouard and Jeremie Harris of Gladstone AI, two experts who have been thinking about this worrying trend for years. Last year, the State Department commissioned a report from them on the risk of uncontrollable AI to our national security.

The point of this discussion is not to fearmonger but to take seriously the possibility that humans might lose control of AI and ask: how might this actually happen? What is the evidence we have of this phenomenon? And, most importantly, what can we do about it?

Your Undivided Attention is produced by the Center for Humane Technology. Follow us on X: @HumaneTech_. You can find a full transcript, key takeaways, and much more on our Substack.

RECOMMENDED MEDIA

Gladstone AI’s State Department Action Plan, which discusses the loss of control risk with AI

Apollo Research’s summary of AI scheming, showing evidence of it in all of the frontier models

The system card for Anthropic’s Claude Opus and Sonnet 4, detailing the emergent misalignment behaviors that came out in their red-teaming with Apollo Research

Anthropic’s report on agentic misalignment based on their work with Apollo Research

Anthropic and Redwood Research’s work on alignment faking

The Trump White House AI Action Plan

Further reading on the phenomenon of more advanced AIs being better at deception

Further reading on Replit AI wiping a company’s coding database

Further reading on the owl example that Jeremie gave

Further reading on AI-induced psychosis

Dan Hendrycks and Eric Schmidt’s “Superintelligence Strategy”
 

RECOMMENDED YUA EPISODES

Daniel Kokotajlo Forecasts the End of Human Dominance

Behind the DeepSeek Hype, AI is Learning to Reason

The Self-Preserving Machine: Why AI Learns to Deceive

This Moment in AI: How We Got Here and Where We’re Going

CORRECTIONS

Tristan referenced a Wired article on the phenomenon of AI psychosis. It was actually from the New York Times.

Tristan hypothesized a scenario where a power-seeking AI might ask a user for access to their computer. While there are some AI services that can gain access to your computer with permission, they are specifically designed to do that. There haven’t been any documented cases of an AI going rogue and asking for control permissions.


