DevOps

SRE Principles

SLIs, SLOs, SLAs, error budgets, toil reduction, incident management, on-call, blameless postmortems

24 interview questions · Senior
1

What is an SLI (Service Level Indicator) in SRE?

Answer

An SLI (Service Level Indicator) is a quantitative metric that measures a specific aspect of the service level provided to users. Typical SLIs include availability (uptime), latency (response time), error rate, or throughput. These indicators are objectively measured by monitoring systems and serve as the foundation for defining SLOs. For example, an availability SLI could be the percentage of successful HTTP requests (2xx codes) out of total requests.
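As a rough illustration of the availability example above, the sketch below computes the share of 2xx responses in a batch of requests. The function name `availability_sli` and the sample status codes are hypothetical; a real SLI would normally be derived from monitoring data rather than an in-memory list.

```python
def availability_sli(status_codes):
    """Return the percentage of requests that ended with a 2xx status code."""
    if not status_codes:
        # An SLI over zero requests is undefined, so fail loudly.
        raise ValueError("no requests observed in the measurement window")
    successful = sum(1 for code in status_codes if 200 <= code < 300)
    return 100.0 * successful / len(status_codes)


# Hypothetical sample: 1000 requests, two of which returned server errors.
sample = [200] * 997 + [500, 503, 204]
sli = availability_sli(sample)
print(f"availability SLI: {sli:.2f}%")
```

Treating only 2xx responses as successful follows the definition in the answer; some teams count every non-5xx response as a success instead, which is a definitional choice to settle before setting a target.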

2

What is the main difference between an SLO and an SLA?

Answer

An SLO (Service Level Objective) is an internal service level target defined by the team to guide SRE efforts, with no legal consequences. An SLA (Service Level Agreement) is a formal contract with the client that includes consequences (refunds, penalties) if targets are not met. The SLO is typically stricter than the SLA to create a safety buffer and avoid SLA violations. For example, an SLO of 99.9% with an SLA of 99.5% provides a margin of safety.
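The safety buffer can be sketched as a three-way classification of a measured availability. The helper `classify_availability` and its labels are illustrative assumptions; the default targets are the 99.9% SLO and 99.5% SLA from the example above.

```python
def classify_availability(measured, slo=99.9, sla=99.5):
    """Place a measured availability (in percent) relative to the SLO and the SLA."""
    if slo < sla:
        # The internal target is expected to be at least as strict as the contract.
        raise ValueError("SLO should not be looser than the SLA")
    if measured >= slo:
        return "within SLO"
    if measured >= sla:
        # Inside the buffer: the internal target is missed, the contract is not.
        return "SLO missed, SLA still met"
    return "SLA breached"


for value in (99.95, 99.7, 99.2):
    print(value, "->", classify_availability(value))
```

The middle outcome is the point of the buffer: the team gets an internal warning while there are still no contractual consequences.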

3

What is an error budget in SRE?

Answer

An error budget is the acceptable amount of failure or unavailability for a service over a given period. It is calculated as the difference between 100% and the SLO. For example, with an SLO of 99.9%, the error budget is 0.1%, or approximately 43 minutes of downtime in a 30-day month. The error budget balances innovation against reliability: as long as budget remains, the team can deploy new features quickly. Once it is exhausted, the focus shifts to stability and releases are postponed.
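The arithmetic can be checked with a short sketch. It assumes a 30-day window and a purely time-based budget; the function names are hypothetical, and the release rule is a simplification of the policy described above.

```python
def error_budget_minutes(slo_percent, window_days=30):
    """Downtime allowed by the SLO over the window, in minutes."""
    window_minutes = window_days * 24 * 60
    return window_minutes * (100.0 - slo_percent) / 100.0


def releases_allowed(slo_percent, downtime_minutes, window_days=30):
    """True while the downtime recorded so far is still below the budget."""
    return downtime_minutes < error_budget_minutes(slo_percent, window_days)


budget = error_budget_minutes(99.9)
print(f"budget for a 99.9% SLO over 30 days: {budget:.1f} minutes")
print("releases allowed after 10 minutes of downtime:", releases_allowed(99.9, 10.0))
print("releases allowed after 50 minutes of downtime:", releases_allowed(99.9, 50.0))
```

With a 99.9% SLO, the budget comes to about 43.2 minutes, so 10 minutes of downtime leaves room for releases and 50 minutes does not.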

4

How do you calculate the remaining error budget for a service?

5

What should you do when a service's error budget is exhausted?
