logo
episode-header-image
Apr 2024
53m 41s

#98 - Service Levels 101 feat. Alex Ewer...

Tobias Schlottke - alphalist CTO Podcast
About this episode
SLIs,SLOs, SLAs - The WHY, WHAT, and HOW

Embrace the Site Reliability Mindset with Alex Ewerlöf, Sr. Staff Engineer @ Volvo Cars 🚗 and SRE thought leader. Understand how the different aspects of site reliability work together 🪢 and when you need a DevOps vs. Platform Team vs. SRE. Find out how to set your own Service Level Indicators (SLI), Service Level Objectives (SLO), and Service Level Agreements (SLA), as laid out by the creator of the Service Level Calculator. Listen to find out:

  • SRE 👷 vs. DevOps ⚒️ vs. Platform Engineering🏗️
  • Service Level 101: How to set SLIs🚦, SLOs 🎯and SLAs 🤝
  • Washing Machine 🧺🫧 vs. Laundry Room🫧🏘️: The Challenges of Standardization
  • OnCall🎧⚠️: Centralised vs. Team-based?
  • The unique software challenges of the automotive industry.

Listen here

BROUGHT TO YOU BY DoiT

Timestamps:

(00:00:00) Intro (00:04:04) Who is Alex (00:05:14) The Nerd Journey: From Assembling Computers to SRE (00:06:58) The Evolution of a Site Reliability Engineer (00:07:55) SRE vs. DevOps vs. Platform Engineering (00:08:32) Washing Machine vs. Laundry Room: The Challenge of Standardization in Large Organizations (00:10:56) Platform vs. SRE (if you are not Google) (00:12:37) Common Platform without Premature Standardisation (00:13:55) Software in Car Companies (00:14:40) Volvo's Strategic Shift Towards In-House Software Development (00:15:06) Swedish Tech Scene: Why Volvo now has offices in Stockholm (00:16:53) Central Platform for Software in the Automotive Industry? (00:18:35) Role of CSO: Chief Software Officer (00:20:09) When the platform is the Product (e.g. Cars) (00:21:42) Implementing Service Levels: A Guide to SLIs, SLOs, and SLAs (00:22:20) What is SLI (Service Level Indicator) (00:22:40) What is SLO (Service Level Objective) (00:23:20) What is SLA (Service Level Agreement) (00:24:24) Leveraging SLOs for autonomous teams in complex products (00:26:08) Relationship between SLI and other Engineering Metrics (00:28:04) Getting Started with Service Levels (00:28:48) STEP 1. Understand SLI/SLOs (00:29:10) STEP 2. Workshop: Identify your Service and What Matters (00:30:02) Setting up SLIs and Alerting for SLIs (00:31:02) STEP 3. Calculate your SLI (00:31:44) STEP 4. Empower team with Good On-Call Practices (00:32:17) Getting Buy-In Across the Organisation (00:33:33) The Significance of Choosing the Right SLIs (00:34:06) Ways to Measure Availability (00:35:43) On-Call Management: Team vs. Centralized Approaches (00:38:22) Downtime of External Vendors (00:40:47) Why SLI needs to come from consumers (00:43:01) SLOs: Setting Realistic SLOs and Avoiding Common Pitfalls (00:44:05) Meaning of 'Objective' differs in OKR and SLO (00:46:18) Financial Incentive to Fail Less? (00:48:43) Biggest Mistakes (00:49:30) No Blame Game - Public Metrics Need Cultural Fit (00:51:50) Advice to Younger Self (00:53:01) Stay Curious (00:53:30) Don't Confuse Confidence with Competence (00:54:47) Outro

About Alex Ewerlöf #

Alex Ewerlof is a Snr. Staff Engineer at Volvo Cars and a Site Reliability Engineering thought leader. He is the author of the Reliability Engineering Mindset, creator of Service Level Calculator and regularly shares insights on his website AlexEwerlof.com. Get 40% off his book with this coupon code.

About our Sponsor

DoiT

As the cloud landscape has evolved, so have its challenges. The shift from adopting to optimising public cloud infrastructure has forced born-in-the-cloud digital natives to grapple not only with growing technical complexities but also the intricacies of cost management and evolving best practices.

DoiT addresses these challenges head-on with an intelligent product portfolio and market-leading cloud expertise that equips engineering and finance teams to understand cloud costs in the context of their business, maximise savings with minimal effort, and make costs more predictable.

Need help making sense of (and optimising) your cloud costs? Set up a call with a DoiT expert to learn about gaining access to DoiT’s FinOps and infrastructure specialists More here

Up next
Jul 10
#125 - Two CTO Dinosaurs vs. Today's Tech Hype with Raz Shuty // CTO @ auxmoney
What happens when two experienced CTOs sit down to debunk the latest tech trends? Raz Schweiger-Shuty, CTO at auxmoney, joins Tobi for an unfiltered discussion about the hypes, myths, and wastes of resources that plague modern tech companies. After taking over a 17-year-old finte ... Show More
1h 3m
Jun 27
#124 - The Path to AGI: Inside poolside’s AI Model Factory for Code with Eiso Kant
How do you build a foundation model that can write code at a human level? Eiso Kant (CTO & co-founder, Poolside) reveals the technical architecture, distributed team strategies, and reinforcement learning breakthroughs powering one of Europe’s most ambitious AI startups. Learn ho ... Show More
1h 3m
Jun 12
#123 - From Nokia to AI-IoT: Engineering the Physical World with Bernd Groß // CEO @ Cumulocity
The physical world is becoming digital—and it requires fundamentally different technical architecture than traditional IT systems. Bernd Groß leads technical leaders through the evolution from enterprise software to industrial IoT, where real-time data from 30,000 wind turbines a ... Show More
1h 3m
Recommended Episodes
Aug 2023
2481: Zenoss - From Cloud to AI: The Evolution of IT Infrastructure
In today's episode of Tech Talks Daily, I sit down with Trent Fitz, a seasoned veteran in the tech space with over two decades of leadership experience, especially in cloud computing, AI, and cybersecurity. As Chief Product Officer of Zenoss, Trent has been at the forefront of te ... Show More
27m 4s
Oct 2023
Leveraging FinOps to Scale a Startup
Anish Bishen (Chief Data Architect @Sliide), Jay Rawal (Head of DevOps @Sliide), Ieva Jonaityte (TAM @DoIT) talk about scaling infrastructure and data services at a rapidly growing startup, with FinOps enabled.  SHOW: 761CLOUD NEWS OF THE WEEK - http://bit.ly/cloudcast-cnotw NEW ... Show More
38m 54s
Feb 2019
Machine Learning In The Enterprise
Summary Machine learning is a class of technologies that promise to revolutionize business. Unfortunately, it can be difficult to identify and execute on ways that it can be used in large companies. Kevin Dewalt founded Prolego to help Fortune 500 companies build, launch, and mai ... Show More
48m 19s
Aug 2020
MBSE – Orchestrate Your Technical Program
‘Product complexity’ is the new buzzword of the past few years. With electrification and with more and more software and autonomy, you have thousands of interfaces and interactions between all these systems and components - and companies need to acquire different skill sets to ma ... Show More
18m 7s
Aug 2023
Understanding SRE
Vlad is Head of Research and Development at Siemens Healthineers, the healthcare arm of tech conglomerate Siemens. He wrote about SRE on our blog here.His book, Establishing SRE Foundations: A Step-by-Step Guide to Introducing Site Reliability Engineering in Software Delivery Org ... Show More
25m 8s
Dec 2018
How to Build Your IT Team
Ian sat down with Dion Hinchcliffe, VP and Principal Analyst at Constellation Research. Together they discussed the value of low-code tools, how to lead digital transformation and the best tips to building your IT team. IT Visionaries is brought to you by The Lightning Platform b ... Show More
38m 22s
Mar 2020
What exactly is "data science" these days? (Practical AI #80)
Matt Brems from General Assembly joins us to explain what “data science” actually means these days and how that has changed over time. He also gives us some insight into how people are going about data science education, how AI fits into the data science workflow, and how to diff ... Show More
48m 40s
Feb 2022
MLOps in Go
MLOps is an increasingly popular topic that is no longer just a subset of DevOps. Go is a great choice for infrastructure. What role does Go play in MLOps? Discuss on Changelog News Changelog++ members save 4 minutes on this episode because they made the ads disappear. Join today ... Show More
45m 17s
May 2024
#393 - Renaud Visage - Eventbrite, Slate.vc - De l’API à l’IPO : le français derrière Eventbrite
En 2006, Renaud Visage crée Eventbrite, la première plateforme mondiale d'événementiel en ligne. 1990, San Francisco. La vague du digital déferle sur la côte Ouest et Renaud Visage est alors jeune diplômé d’ingénierie environnementale. Il se prend de passion pour le développement ... Show More
2h 12m
Jan 2024
The engineering mindset | Will Larson (Carta, Stripe, Uber, Calm, Digg)
Will Larson is Chief Technology Officer at Carta. Prior to joining Carta, he was the CTO at Calm and held engineering leadership roles at Stripe, Uber, and Digg. He is the author of two foundational engineering career books, An Elegant Puzzle and Staff Engineer, and The Engineeri ... Show More
1h 16m