Sep 2015
13m 22s

[MINI] Sample Sizes

Kyle Polich
About this episode

Several factors matter when selecting an appropriate sample size and when dealing with small samples. The most important questions concern representativeness: how well does your sample represent the total population and capture all of its variance?

Linhda and Kyle talk through a few examples, including elections, picking an Airbnb, produce selection, and home shopping, as cases in which the number of observations one has matters more or less depending on how complex the underlying system is.
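
As a rough illustration of why more variable systems call for more observations, here is a minimal sketch that is not taken from the episode. It assumes a simple random sample, a known population standard deviation, and the usual normal approximation for estimating a mean. The function name `required_sample_size` and the numbers used are hypothetical, chosen only for the example.

```python
import math


def required_sample_size(sigma, margin, z=1.96):
    """Smallest n such that z * sigma / sqrt(n) <= margin.

    sigma  -- assumed population standard deviation (illustrative input)
    margin -- desired half-width of the confidence interval
    z      -- normal critical value; 1.96 corresponds to roughly 95% confidence
    """
    return math.ceil((z * sigma / margin) ** 2)


# Same target precision, two hypothetical populations with different spread.
low_variance = required_sample_size(sigma=1.0, margin=0.5)
high_variance = required_sample_size(sigma=5.0, margin=0.5)

print(low_variance, high_variance)
```

Under these assumptions, a population with five times the standard deviation needs roughly twenty-five times as many observations for the same margin of error. The formula says nothing about whether the sample is representative, which the description above singles out as the more important question.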
