E1032: Breaking down new research analyzing 1.4 million ChatGPT prompts to understand a simple but important question: why does ChatGPT cite some pages and ignore others?
This study, conducted by Ahrefs, offers one of the clearest looks yet at how ChatGPT selects sources, what influences citation likelihood, and how you can position your content to be included.
We walk through the key findings, what they mean in practice, and how they connect to real SEO strategy.
What you'll learn in this episode:
- How ChatGPT retrieves dozens of sources but cites only about half
- The role of titles, snippets, and URLs before your page is even opened
- Why semantic relevance to "fan-out queries" is one of the strongest ranking factors
- What fan-out queries are and how to find them yourself
- Why most cited sources come from traditional search results (and what that means for SEO)
- The surprising role of Reddit: heavily used for context, rarely cited
- Citation breakdown across sources like search, news, Reddit, YouTube, and academia
- Why natural language URLs and keyword alignment increase your chances of being cited
- The relationship between content freshness and citation likelihood
- Why older, more established pages often beat newer ones within the same query
- How news content is treated differently, with freshness acting as a tiebreaker
- Why SEO landing pages and product pages are among the most cited content types
We also cover a practical method to uncover the exact queries ChatGPT uses behind the scenes, and how to use those insights to structure your content.
If you're trying to get your site, product pages, or content cited in AI-generated answers, this episode gives you a clear framework based on real data.
⭐️ Why ChatGPT Cites One Page Over Another (Study of 1.4M Prompts) - https://ahrefs.com/blog/why-chatgpt-cites-pages/
🚀 Edward's SEO Articles - https://edwardsturm.com/articles/search-engine-optimization/
💎 Compact Keywords, my SEO course: get paying customers through SEO, with clear step-by-step video breakdowns and SEO templates to copy and adapt for your products and services - https://compactkeywords.com/
00:00 Why Citations Vary
00:34 Gatekeeping Before Reading
01:35 Study Setup and Goals
02:02 Where Sources Come From
04:54 Reddit Not Credited!?
05:31 Semantic Scoring and Titles
07:30 Find ChatGPT Fan-Out Queries
09:05 Freshness Versus Relevance
11:19 What It Means to Be Citable
11:55 SEO Pages Win Citations
15:19 Wrap Up and Goodbye
The Edward Show. Your daily search engine optimization podcast: https://edwardsturm.com/the-edward-show/
#generativeengineoptimization #answerengineoptimization #searchengineoptimization #seo