logo
episode-header-image
Dec 2017
20m 37s

[MINI] Parallel Algorithms

Kyle Polich
About this episode

When computers became commodity hardware and storage became incredibly cheap, we entered the era of so-call "big" data. Most definitions of big data will include something about not being able to process all the data on a single machine. Distributed computing is required for such large datasets.

Getting an algorithm to run on data spread out over a variety of different machines introduced new challenges for designing large-scale systems. First, there are concerns about the best strategy for spreading that data over many machines in an orderly fashion. Resolving ambiguity or disagreements across sources is sometimes required.

This episode discusses how such algorithms related to the complexity class NC.

Up next
Mar 27
Book Ratings and Recommendations
Goodreads star ratings can be misleading as measures of "book quality," and research from Hannes Rosenbusch suggests that for many professionally published books, differences between readers often matter more than differences between books. The episode also explores how to model ... Show More
39m 19s
Mar 10
Disentanglement and Interpretability in Recommender Systems
30m 33s
Feb 27
Collective Altruism in Recommender Systems
Ekaterina (Kat) Fedorova from MIT EECS joins us to discuss strategic learning in recommender systems—what happens when users collectively coordinate to game recommendation algorithms. Kat's research reveals surprising findings: algorithmic "protest movements" can paradoxically he ... Show More
54m 35s
Recommended Episodes
Jul 2020
A Reality Check on AI-Driven Medical Assistants
The data science and artificial intelligence community has made amazing strides in the past few years to algorithmically automate portions of the healthcare process. This episode looks at two computer vision algorithms, one that diagnoses diabetic retinopathy and another that cla ... Show More
14 m
Sep 2018
Data Engineering
If you’re a data scientist, you know how important it is to keep your data orderly, clean, moving smoothly between different systems, well-documented… there’s a ton of work that goes into building and maintaining databases and data pipelines. This job, that of owner and maintaine ... Show More
16m 22s
Jul 2023
Tech and AI: 8. The Algorithm
At its simplest, an algorithm is a sequence of step-by-step instructions designed to give a result. They are the building blocks of every computer program and are there to ensure every digital device gives the right results on request. For example, when we type a search query int ... Show More
13m 54s
Sep 2020
Marcello La Rocca on Algorithms and Data Structures
tail spinning
1h 2m
Jan 2022
Making Agile work for data science
<p>Data scientists and engineers don’t always play well together. Data scientists will plan out a solution, carefully build models, test them in notebooks, then throw that solution over the wall to engineering. Implementing that solution can take months.</p><p>Historically, the d ... Show More
20m 52s
Feb 2021
The promise of quantum computers | Matt Langione
What if microparticles could help us solve the world's biggest problems in a matter of minutes? That's the promise -- and magic -- of quantum computers, says Matt Langione. Speaking next to IBM's quantum computer, he explains how these machines solve complex challenges like devel ... Show More
12m 49s
May 2020
How Important are algorithm and data structures in backend engineering?
<p>Algorithms &amp; Data Structures are critical to Backend Engineering however it really depends on what kind of application and infrastructure you are building. In this video I want to go through the following &nbsp;&nbsp;1 Backend Engineers are two types - Integrating Existing ... Show More
13m 29s