Sep 22
1h 13m

P-Values: Are we using a flawed statisti...

Regina Nuzzo and Kristin Sainani
About this episode

P-values show up in almost every scientific paper, yet they’re one of the most misunderstood ideas in statistics. In this episode, we break from our usual journal-club format to unpack what a p-value really is, why researchers have fought about it for a century, and how that famous 0.05 cutoff became enshrined in science. Along the way, we share stories from our own papers—from a Nature feature that helped reshape the debate to a statistical sleuthing project that uncovered a faulty method in sports science. The result: a behind-the-scenes look at how one statistical tool has shaped the culture of science itself.


Statistical topics

  • Bayesian statistics
  • Confidence intervals 
  • Effect size vs. statistical significance
  • Fisher’s conception of p-values
  • Frequentist perspective
  • Magnitude-Based Inference (MBI)
  • Multiple testing / multiple comparisons
  • Neyman-Pearson hypothesis testing framework
  • P-hacking
  • Posterior probabilities
  • Preregistration and registered reports
  • Prior probabilities
  • P-values
  • Researcher degrees of freedom
  • Significance thresholds (p < 0.05)
  • Simulation-based inference
  • Statistical power 
  • Statistical significance
  • Transparency in research 
  • Type I error (false positive)
  • Type II error (false negative)
  • Winner’s Curse


Methodological morals

  • “If p-values tell us the probability the null is true, then octopuses are psychic.”
  • “Statistical tools don't fool us, blind faith in them does.”
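
To make the first moral concrete, here is a minimal Python sketch. All the numbers are illustrative assumptions rather than figures taken from the episode: an octopus that picks 8 of 8 match winners, a null hypothesis of coin-flip guessing, a "psychic" alternative that is always right, and a hypothetical one-in-a-million prior that any given octopus is psychic. It computes a one-sided p-value and, separately, the posterior probability of the null under a toy two-hypothesis Bayes model, to show that the two are different quantities.

```python
from math import comb


def binomial_p_value(successes, trials, p_null=0.5):
    """One-sided p-value: P(X >= successes) assuming the null (pure guessing)."""
    return sum(
        comb(trials, k) * p_null**k * (1 - p_null) ** (trials - k)
        for k in range(successes, trials + 1)
    )


def binomial_likelihood(successes, trials, p):
    """Probability of exactly `successes` correct picks at hit rate p."""
    return comb(trials, successes) * p**successes * (1 - p) ** (trials - successes)


def posterior_null(successes, trials, prior_psychic, p_psychic=1.0, p_null=0.5):
    """P(null | data) via Bayes' rule in a toy model with only two hypotheses."""
    like_null = binomial_likelihood(successes, trials, p_null)
    like_psychic = binomial_likelihood(successes, trials, p_psychic)
    weight_null = like_null * (1 - prior_psychic)
    weight_psychic = like_psychic * prior_psychic
    return weight_null / (weight_null + weight_psychic)


# Illustrative assumptions (not from the episode): 8 of 8 correct picks,
# and a one-in-a-million prior that the octopus is psychic.
p_value = binomial_p_value(successes=8, trials=8)
posterior = posterior_null(successes=8, trials=8, prior_psychic=1e-6)

print(f"p-value:               {p_value:.5f}")
print(f"P(null is true | data): {posterior:.5f}")
```

Under these assumptions the p-value is about 0.004, well below 0.05, yet the posterior probability that the octopus was just guessing stays above 0.999. The p-value answers "how surprising is this data if the null is true?", not "how likely is the null given this data?"; the second question also depends on the prior.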


References


Kristin and Regina’s online courses: 

Demystifying Data: A Modern Approach to Statistical Understanding  

Clinical Trials: Design, Strategy, and Analysis 

Medical Statistics Certificate Program  

Writing in the Sciences 

Epidemiology and Clinical Research Graduate Certificate Program 

Programs that we teach in:

Epidemiology and Clinical Research Graduate Certificate Program 


Find us on:

Kristin - LinkedIn & Twitter/X

Regina - LinkedIn & ReginaNuzzo.com

  • (00:00) - Intro & claim of the episode
  • (01:00) - Why p-values matter in science
  • (02:44) - What is a p-value? (ESP guessing game)
  • (06:47) - Big vs. small p-values (psychic octopus example)
  • (08:29) - Significance thresholds and the 0.05 rule
  • (09:00) - Regina’s Nature paper on p-values
  • (11:32) - Misconceptions about p-values
  • (13:18) - Fisher vs. Neyman-Pearson (history & feud)
  • (16:26) - Botox analogy and type I vs. type II errors
  • (19:41) - Dating app analogies for false positives/negatives
  • (22:02) - How the 0.05 cutoff got enshrined
  • (23:46) - Misinterpretations: statistical vs. practical significance
  • (25:22) - Effect size, sample size, and “statistically discernible”
  • (25:51) - P-hacking and researcher degrees of freedom
  • (28:52) - Transparency, preregistration, and open science
  • (29:58) - The 0.05 cutoff trap (p = 0.049 vs 0.051)
  • (30:24) - The biggest misinterpretation: what p-values actually mean
  • (32:35) - Paul the psychic octopus (worked example)
  • (35:05) - Why Bayesian statistics differ
  • (38:55) - Why aren’t we all Bayesian? (probability wars)
  • (40:11) - The ASA p-value statement (behind the scenes)
  • (42:22) - Key principles from the ASA white paper
  • (43:21) - Wrapping up Regina’s paper
  • (44:39) - Kristin’s paper on sports science (MBI)
  • (47:16) - What MBI is and how it spread
  • (49:49) - How Kristin got pulled in (Christie Aschwanden & FiveThirtyEight)
  • (53:11) - Critiques of MBI and “Bayesian monster” rebuttal
  • (55:20) - Spreadsheet autopsies (Welsh & Knight)
  • (57:11) - Cherry juice example (why MBI misleads)
  • (59:28) - Rebuttals and smoke & mirrors from MBI advocates
  • (01:02:01) - Winner’s Curse and small samples
  • (01:02:44) - Twitter fights & “establishment statistician”
  • (01:05:02) - Cult-like following & Matrix red pill analogy
  • (01:07:12) - Wrap-up

