logo
episode-header-image
May 29
32m 41s

How Googlebot crawls the web

Google
About this episode

In this episode of Search Off the Record, Martin and Gary from the Google Search Relations team take a deep dive into how Googlebot and web crawling work—past, present, and future. Through their humorous and thoughtful conversation, they explore how crawling evolved from the early days of the internet, when scripts could index a chunk of the web from a single homepage, to the more complex and considerate systems used today. They discuss the basics of what a crawler is, how tools like cURL or Wget relate, and how policies like robots.txt ensure crawlers play nice with web infrastructure.

 

The conversation also covers Google's internal shift to unified infrastructure for all crawling needs, highlighting how different teams moved from separate crawlers to a shared system that enforces consistent policies. They explain why some fetches bypass robots.txt (like user-initiated actions) and the rising impact of automated traffic from new products and AI agents. With a nod to initiatives like Common Crawl, the episode ends with a look at the road ahead, acknowledging growing internet congestion but remaining optimistic about the web’s capacity to adapt.

Resources:

Episode transcript → https://goo.gle/sotr092-transcript 

 

Listen to more Search Off the Record → https://goo.gle/sotr-yt

Subscribe to Google Search Channel → https://goo.gle/SearchCentral

 

Search Off the Record is a podcast series that takes you behind the scenes of Google Search with the Search Relations team.

 

#SOTRpodcast #SEO #SearchOfTheRecord

 

Speakers: Martin Splitt, Gary Illyes

Products Mentioned: Googlebot, Search 

Up next
Jun 26
Demystifying SEO for developers
Developers often have misconceptions about SEO. In this episode of Search Off the Record, John Mueller and Martin Splitt clarify what matters and what doesn't. They cover areas including: Optimizing themes for SEO, The indexing API, and common HTML mistakes. This podcast delivers ... Show More
33m 23s
Jun 12
What SEOs should know about devs
In this candid episode of Search Off the Record, Gary Illyes and Martin Splitt from the Google Search team peel back the curtain to reveal a more human side of their work and challenges. Instead of diving deep into technical concepts, they reflect on their personal journeys—touch ... Show More
31m 13s
May 15
Debugging the Internet: HTTP, TCP, and You
In this episode of Search Off the Record, Gary Illyes and Martin Splitt from the Google Search team dive deep into the foundations of how the web works—specifically HTTP, TCP, UDP, and newer technologies like QUIC and HTTP/3. The two reflect on how even experienced web profession ... Show More
33m 25s
Recommended Episodes
Feb 2025
How AI Search Is Changing The SEO Industry
Founder at Knowatoa, Michael Buckbee, discusses how AI search technologies like ChatGPT and Perplexity are revolutionizing the SEO industry by uncovering new ranking opportunities for brands. In this episode, Michael shares his perspectives on: impact of Google's AI-driven search ... Show More
14m 30s
Apr 23
Will Traditional Keyword Research Become Obsolete In An AI Search World?
Keyword research faces significant evolution in the AI search era. Chris Andrew, co-founder and CEO of Scrunch AI, explains how traditional keyword targeting must adapt as search fragments across multiple AI platforms. He outlines how single keywords now expand into countless pro ... Show More
3m 36s
Jul 2024
SEO 2.0: How to Trick Google and Rank AI Content ft. Greg Isenberg
Episode 16: How is AI transforming the future of SEO and job markets? Matt Wolfe (https://x.com/mreflow) and Nathan Lands (https://x.com/NathanLands) are joined by innovator Greg Isenberg (https://x.com/gregisenberg), founder of Late Checkout and Boring Marketer. Greg hosts “The ... Show More
49m 14s
Aug 2024
Why Google search isn’t going anywhere anytime soon (from The Next Wave)
Is Google's dominance in search engines at risk with the rise of generative AI models? In this episode from The Next Wave, a podcast we think you'll like, hosts Matt Wolfe and Nathan Lands dive in to the conversation with Bilawal. They explore the potential challenges facing Goog ... Show More
50m 24s
Dec 2024
The Insane Inefficiencies of AI Crawlers - How to Get Shown on ChatGPT
E536: AI crawlers for generative AI companies like OpenAI or Anthropic or even Meta are REALLY inefficient. Crazy inefficiencies compared to Googlebot. And this means there’s actually a decent chance Generative AI can’t even access your pages and posts to learn off them. If the b ... Show More
19m 22s
Nov 2024
Google's View Of AI Content Revisited
CEO at Originailty.ai, Jonathan Gillham, revisits Google's view of AI content. In this episode, Jonathan shares his perspectives on: exploring the impact of AI on SEO and content marketing and the role of detection technology in understanding AI's authenticity. Show NotesConnect ... Show More
23m 42s
Apr 24
Optimizing For Traditional Search Engines Vs AI Platforms
SEO professionals face a strategic dilemma between traditional search and AI platforms. Chris Andrew, Co-founder and CEO of Scrunch AI, explains why optimizing for both simultaneously is possible by focusing on human-centered content. He reveals how AI models access content in re ... Show More
4m 56s
Aug 2024
How Google's Generative Experience Reduces Site Traffic
Krista Brea, Senior SEO Manager at Morningstar, talks about Google's generative experience and its impact on site traffic. Although Google's generative experience hasn't been officially released, the company has been quietly preparing for AI integration with the recent removal of ... Show More
28m 33s
Oct 2024
The TED AI Show: Is Google’s reign over? The future of AI search w/ Perplexity CEO Aravind Srinivas
Whether finding a restaurant or fact-checking a new claim, search engines are one of the main avenues we use to navigate the world. So why are modern engines so clunky and frustrating – and how is AI already changing the infrastructure we use to access information on the internet ... Show More
35m 51s
Apr 21
How To Win Customers In An AI Search-Driven World
AI search is transforming how consumers discover products and services. Chris Andrew, CEO of Scrunch AI, explains how businesses can optimize for AI-powered search experiences that prioritize answers over links. He details strategies for monitoring AI crawler activity, tracking c ... Show More
27m 48s