Jul 2021
20m 22s

Scaling CPU-intensive Backends - The Bac...

Hussein Nasser
About this episode

🧑‍🏫 Courses I Teach

https://husseinnasser.com/courses


In this episode, I discuss methods for scaling CPU-bound (CPU-intensive) workloads on the backend. First I cover scaling the workload on a single machine so that it takes full advantage of all the machine's resources, then I cover scaling horizontally across multiple machines.

0:00 Intro

1:00 What do I mean by Scaling

3:20 CPU-Intensive/Bound Workload

6:00 Effective Scaling CPU-Bound Backends in Single Machine

12:00 How Hyperthreading can be useful

15:00 Horizontally Scale to multiple Machines

SO_REUSEPORT https://lwn.net/Articles/542629/

@Gary Explains Hyperthreading https://www.youtube.com/watch?v=mSZpDF-zUoI
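
As a rough illustration of the single-machine approach, here is a minimal Python sketch that assumes Linux and starts one worker process per logical CPU, with every worker listening on the same port through `SO_REUSEPORT` (the option covered in the LWN article linked above). The names `make_listener`, `cpu_bound_work`, `worker`, and `start_workers` are hypothetical, and the sum-of-squares loop is only a stand-in for a real CPU-intensive handler; this is not necessarily the setup described in the episode.

```python
import multiprocessing
import os
import socket


def make_listener(port: int, host: str = "127.0.0.1") -> socket.socket:
    """Create a TCP listening socket that shares its port via SO_REUSEPORT."""
    sock = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
    # SO_REUSEPORT must be set before bind() on every socket sharing the port.
    sock.setsockopt(socket.SOL_SOCKET, socket.SO_REUSEPORT, 1)
    sock.bind((host, port))
    sock.listen(128)
    return sock


def cpu_bound_work(n: int) -> int:
    """Hypothetical stand-in for a CPU-intensive request handler."""
    return sum(i * i for i in range(n))


def worker(port: int) -> None:
    """Accept connections on the shared port and do CPU-heavy work for each."""
    listener = make_listener(port)
    while True:
        conn, _ = listener.accept()
        with conn:
            result = cpu_bound_work(1_000_000)
            conn.sendall(f"{os.getpid()}: {result}\n".encode())


def start_workers(port: int) -> list[multiprocessing.Process]:
    """Start one worker process per logical CPU reported by the OS."""
    count = os.cpu_count() or 1
    procs = [
        multiprocessing.Process(target=worker, args=(port,), daemon=True)
        for _ in range(count)
    ]
    for proc in procs:
        proc.start()
    return procs
```

Calling `start_workers(8080)` from a script's `if __name__ == "__main__":` block (the port number is an arbitrary example) would leave the kernel to distribute incoming connections among the workers' sockets. Note that `os.cpu_count()` reports logical CPUs, so on a machine with hyperthreading it counts hardware threads rather than physical cores; whether one worker per logical CPU is the right number depends on the workload.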


Become a Member on YouTube

https://www.youtube.com/channel/UC_ML5xP23TOWKUcc-oAE_Eg/join


🔥 Members Only Content

https://www.youtube.com/playlist?list=UUMO_ML5xP23TOWKUcc-oAE_Eg


Support my work on PayPal

https://bit.ly/33ENps4

