logo
episode-header-image
Dec 2017
20m 37s

[MINI] Parallel Algorithms

Kyle Polich
About this episode

When computers became commodity hardware and storage became incredibly cheap, we entered the era of so-call "big" data. Most definitions of big data will include something about not being able to process all the data on a single machine. Distributed computing is required for such large datasets.

Getting an algorithm to run on data spread out over a variety of different machines introduced new challenges for designing large-scale systems. First, there are concerns about the best strategy for spreading that data over many machines in an orderly fashion. Resolving ambiguity or disagreements across sources is sometimes required.

This episode discusses how such algorithms related to the complexity class NC.

Up next
May 1
Student Spotlight: Aaron Payne, Data Analyst
Aaron Payne, an MBA student at Georgia Tech studying business analytics and a Senior Insights Analyst at Chick-fil-A, joins Kyle Polich to talk about turning analytics into decisions that matter. They unpack a real-world forecasting project with Comfama in Colombia, including mes ... Show More
25m 59s
Apr 25
The Future is Agentic in Recommender Systems
Kyle Polich sits down with Yashar Deldjoo, research scientist and Associate Professor at the Polytechnic University of Bari, to explore how recommender systems have evolved and why trustworthiness matters. They unpack key dimensions of responsible AI, including robustness to adve ... Show More
49m 25s
Mar 27
Book Ratings and Recommendations
Goodreads star ratings can be misleading as measures of "book quality," and research from Hannes Rosenbusch suggests that for many professionally published books, differences between readers often matter more than differences between books. The episode also explores how to model ... Show More
39m 19s
Recommended Episodes
Jul 2020
A Reality Check on AI-Driven Medical Assistants
The data science and artificial intelligence community has made amazing strides in the past few years to algorithmically automate portions of the healthcare process. This episode looks at two computer vision algorithms, one that diagnoses diabetic retinopathy and another that cla ... Show More
14 m
Sep 2018
Data Engineering
If you’re a data scientist, you know how important it is to keep your data orderly, clean, moving smoothly between different systems, well-documented… there’s a ton of work that goes into building and maintaining databases and data pipelines. This job, that of owner and maintaine ... Show More
16m 22s
Jul 2023
Tech and AI: 8. The Algorithm
At its simplest, an algorithm is a sequence of step-by-step instructions designed to give a result. They are the building blocks of every computer program and are there to ensure every digital device gives the right results on request. For example, when we type a search query int ... Show More
13m 54s
Sep 2020
Marcello La Rocca on Algorithms and Data Structures
tail spinning
1h 2m
Jan 2022
Making Agile work for data science
<p>Data scientists and engineers don’t always play well together. Data scientists will plan out a solution, carefully build models, test them in notebooks, then throw that solution over the wall to engineering. Implementing that solution can take months.</p><p>Historically, the d ... Show More
20m 52s
Feb 2021
The promise of quantum computers | Matt Langione
What if microparticles could help us solve the world's biggest problems in a matter of minutes? That's the promise -- and magic -- of quantum computers, says Matt Langione. Speaking next to IBM's quantum computer, he explains how these machines solve complex challenges like devel ... Show More
12m 49s
May 2020
How Important are algorithm and data structures in backend engineering?
<p>Algorithms &amp; Data Structures are critical to Backend Engineering however it really depends on what kind of application and infrastructure you are building. In this video I want to go through the following &nbsp;&nbsp;1 Backend Engineers are two types - Integrating Existing ... Show More
13m 29s