logo
episode-header-image
Mar 2022
23m 39s

Understanding Aggregate Functions Perfor...

Hussein Nasser
About this episode

Aggregate functions like Count, max, min, avg performance really depends on how did you tune your database for that kind of workload. Let us discuss this.

0:00 Intro

1:22 SELECT COUNT(*)

4:30 SELECT AVG(A)

5:15 SELECT MAX(A)

8:00 Best case scenario

11:30 Clustering

14:00 Clustering Sequential Writes

17:19 Clustering Random Writes

20:30 Summary

Up next
Nov 24
CPU and Kernel Page Faults
<p>Page faults occurs when the process tries to access a memory that isn’t backed by a physical page kernel raises a fault which loads a page. It happens on first access, stack expansion, COW, swap and much more. However it comes with a cost. </p><p><br /></p><p>In this episode o ... Show More
48m 37s
Oct 31
Amazon US-EAST-1 Outage in Details
On October 19 2025 AWS experienced an outage that lasted over a day, 10 days later we finally got the root cause analysis and we know exactly what caused the DNS to fail0:00 Summary 5:30 How did Dynamo lost its DNS?13:41 EC2 Errors 16:16 Network Load Balancer ErrorsRCA here https ... Show More
24m 26s
Oct 17
Graceful shutdown in HTTP
There are cases where the backend may need to close the connection to prevent unexpected situations, prevent bad actors or simply just free up resources. Closing a connection gracefully allows clients and backends to clean up and finish any pending requests. In this episode of th ... Show More
25m 49s
Recommended Episodes
Mar 2020
Easier Stream Processing On Kafka With ksqlDB
<div class="wp-block-jetpack-markdown"><h2>Summary</h2> <p>Building applications on top of unbounded event streams is a complex endeavor, requiring careful integration of multiple disparate systems that were engineered in isolation. The ksqlDB project was created to address this ... Show More
43m 36s
Sep 2023
Episode 486: High Performance PostgreSQL for Rails with Andrew Atkinson
If you love taking about databases, this is the episode for you. Ahead of the launch of his new book, High Performance PostgreSQL for Rails, Andrew Atkinson joined the show with special guest co-host, Pat Bair, to talk about why he wrote a book, why he focused on PostgreSQL and h ... Show More
31m 29s
Jun 2021
A Candid Exploration Of Timeseries Data Analysis With InfluxDB
<div class="wp-block-jetpack-markdown"><h2>Summary</h2> <p>While the overall concept of timeseries data is uniform, its usage and applications are far from it. One of the most demanding applications of timeseries data is for application and server monitoring due to the problem o ... Show More
1h 6m
Aug 2022
Tech job interview support group
Struggling through the tech job interview process? We feel you! On this episode, Amal, Nick & Amelia get together to discuss the various ways the interview process disappoints, share their own interview stories, and suggest ways we can improve the process for everyone. Discuss on ... Show More
1h 4m
Dec 2019
Building The Materialize Engine For Interactive Streaming Analytics In SQL
<div class="wp-block-jetpack-markdown"><h2>Summary</h2> <p>Transactional databases used in applications are optimized for fast reads and writes with relatively simple queries on a small number of records. Data warehouses are optimized for batched writes and complex analytical qu ... Show More
48m 7s
Jun 2021
Accelerating ML Training And Delivery With In-Database Machine Learning
<div class="wp-block-jetpack-markdown"><h2>Summary</h2> <p>When you build a machine learning model, the first step is always to load your data. Typically this means downloading files from object storage, or querying a database. To speed up the process, why not build the model in ... Show More
1h 5m
Nov 2022
Analyze Massive Data At Interactive Speeds With The Power Of Bitmaps Using FeatureBase
<div class="wp-block-jetpack-markdown"><h2>Summary</h2> <p>The most expensive part of working with massive data sets is the work of retrieving and processing the files that contain the raw information. FeatureBase (formerly Pilosa) avoids that overhead by converting the data int ... Show More
59m 25s
May 2024
Erik Darling and Kendra Little Rate SQL Server Perf Tuning Techniques
Erik Darling joins Kendra Little to rate different SQL Server Performance Tuning Techniques in episode 81 of the Dear SQL DBA podcast. We share our opinions of... (deep breath)   Recompile hints, Query Store hints and plan forcing, CTEs, Resource Governor, the legacy cardinality ... Show More
1h 4m
Dec 2022
#564: [INTRODUCING] Amazon Athena for Apache Spark
Amazon Athena for Apache Spark enables you to run Apache Spark interactive analytics quicker than ever before, without the need to plan for, configure, and manage resources. In this episode, Raj Devnath (Sr. Product Manager) and Anthony Virtuoso (Sr. Principal Engineer) join Simo ... Show More
19m 57s
May 2023
The ORMazing show
Nick & KBall sit down with the brilliant Stephen Haberman to discuss all things ORMs! 💻🔍 From the advantages and disadvantages of ORMs in general, to delving into the intricacies of his innovative project Joist, which brings a fresh, idiomatic, ActiveRecord-esque approach to Ty ... Show More
1h 12m