Nov 2017
25m 20s

The Kaggle Survey on Data Science

Ben Jaffe and Katie Malone
About this episode
Want to know what's going on in data science these days? There's no better way than to analyze a survey with over 16,000 responses that was recently released by Kaggle. Kaggle asked practicing and aspiring data scientists about themselves, their tools, how they find jobs, what they find challenging about their jobs, and many other questions. Then Kaggle relea ...
Up next
Mar 15
The Bitter Lesson
Every AI builder knows the anxiety: you spend months engineering prompts, tuning pipelines, and chaining calls together, then a new model drops and half your work evaporates overnight. It turns out researchers have been wrestling with this exact dynamic for 30 years, and they ke ...
19m 17s
Mar 9
From Atari to ChatGPT: How AI Learned to Follow Instructions
From Atari to ChatGPT: How AI Learned to Follow Instructions by Ben Jaffe and Katie Malone 
25m 53s
Mar 2
It's RAG time: Retrieval-Augmented Generation
Today we are going to talk about the feature with the worst acronym in generative AI: RAG, or Retrieval Augmented Generation. If you've ever used something like "Chat with My Docs," if you have an internal AI chatbot that has access to your company's documents, or you've created ...
17m 14s
Recommended Episodes
May 2023
#139 How Data Scientists Can Thrive in the FMCG Industry
A lot of the time when we walk into a supermarket, we don't necessarily think about the impact data science had in getting these products on shelves. However, as you'll learn in today's episode, it's safe to say there's a myriad of applications for data science in the FMCG indus ...
42m 10s
Nov 2021
Data Quality Starts At The Source
Summary: The most important gauge of success for a data platform is the level of trust in the accuracy of the information that it provides. In order to build and maintain that trust, it is necessary to invest in defining, monitori ...
58m 55s
Aug 2022
Collecting And Retaining Contextual Metadata For Powerful And Effective Data Discovery
Summary: Data is useless if it isn't being used, and you can't use it if you don't know where it is. Data catalogs were the first solution to this problem, but they are only helpful if you know what you are look ...
53m 24s
Nov 2021
Business Intelligence Beyond The Dashboard With ClicData
Summary: Business intelligence is often equated with a collection of dashboards that show various charts and graphs representing data for an organization. What is overlooked in that characterization is the level of complexity and ...
1h 2m
Sep 2020
Chapter 1: What is Data Science?
What actually *is* data science, and what does a data scientist do? What kind of backgrounds do data scientists come from and what skills do you need to be one? In this episode we start with the basics: declaring once and for all what is data science anyway and exploring how th ...
55m 8s
Jun 2021
Buying and Selling Homes Algorithmically with Opendoor’s VP of Research and Data Science, Kushal Chakrabarti
For many people, buying and selling a home will undoubtedly be the most difficult decisions they will make in their lifetime. Is the price you're paying for your home fair? Is the price you're selling your home for an adequate sale price? For a long time, realto ...
32m 26s
Apr 2024
Establish A Single Source Of Truth For Your Data Consumers With A Semantic Layer
Summary: Maintaining a single source of truth for your data is the biggest challenge in data engineering. Different roles and tasks in the business need their own ways to access and analyze the data in the organization. In order to enable this use case, while mainta ...
56m 23s