About this episode

Sadaf Khan joins Evan and Russell to discuss Service Reliability Engineering in the Azure engineering group.

 

Media file: https://azpodcast.blob.core.windows.net/episodes/Episode504.mp3

YouTube: https://www.youtube.com/watch?v=QNGdTnb1W90&t=1684s

 

Key Topics:

  • Azure Reliability SRE: Evan introduced the episode's focus on Azure reliability SRE and mentioned a special guest, Sadaf, who would provide insights on the topic. 0:19
  • Azure Storage Public Preview Feature: Russell discussed a new public preview feature for Azure Storage that allows customers to manage planned failovers, enhancing the service's reliability. 1:10
  • Virtual Machine Scale Set Update: Russell highlighted an update to Virtual Machine Scale Sets that allows mixing different instance types in a single scale set, improving flexibility and scalability. 1:38
  • Azure API Management Workspace: Russell introduced a new feature in Azure API Management that gives teams more autonomy in managing and publishing APIs. 2:08
  • NetApp Files Storage Update: Russell mentioned the general availability of cool access for Azure NetApp Files storage, allowing for more cost-effective data storage based on access patterns. 2:40
  • Redis Cache Update: Russell discussed a new tier for Redis Cache that supports larger enterprises with increased memory and compute capabilities. 3:02
  • Azure Red Hat OpenShift Update: Russell shared an update on Azure Red Hat OpenShift, which now supports up to 250 nodes, significantly increasing scalability. 3:29
  • SRE Role and Impact: Sadaf explained the role of SRE in improving service reliability and quality, detailing their engagement model with various Azure services. 4:52
  • SRE Engagement and Resistance: Sadaf shared insights on the initial resistance faced from service teams during SRE engagements and how trust is built over time to allow for more impactful changes. 7:49
  • SRE's Approach to Service Improvement: Sadaf outlined the SRE team's structured approach to service improvement, focusing on fundamentals, service health, operational efficiency, and scalability. 10:51
  • AI Initiatives in SRE: Sadaf discussed the SRE team's initiatives in leveraging AI to analyze incident data and generate insights, aiming to reduce the cognitive load on engineers. 30:27