DevOps

SRE Principles

SLIs, SLOs, SLAs, error budgets, toil reduction, incident management, on-call, blameless postmortems

24 คำถามสัมภาษณ์·
Senior
1

What is an SLI (Service Level Indicator) in SRE?

คำตอบ

An SLI (Service Level Indicator) is a quantitative metric that measures a specific aspect of the service level provided to users. Typical SLIs include availability (uptime), latency (response time), error rate, or throughput. These indicators are objectively measured by monitoring systems and serve as the foundation for defining SLOs. For example, an availability SLI could be the percentage of successful HTTP requests (2xx codes) out of total requests.

2

What is the main difference between an SLO and an SLA?

คำตอบ

An SLO (Service Level Objective) is an internal service level target defined by the team to guide SRE efforts, with no legal consequences. An SLA (Service Level Agreement) is a formal contract with the client that includes consequences (refunds, penalties) if targets are not met. The SLO is typically stricter than the SLA to create a safety buffer and avoid SLA violations. For example, an SLO of 99.9% with an SLA of 99.5% provides a margin of safety.

3

What is an error budget in SRE?

คำตอบ

An error budget is the acceptable amount of failure or unavailability for a service over a given period. It is calculated as the difference between 100% and the SLO. For example, with an SLO of 99.9%, the error budget is 0.1% (approximately 43 minutes of downtime per month). This error budget allows balancing innovation and reliability: as long as budget remains, the team can deploy new features quickly. If exhausted, focus must shift to stability and releases should be postponed.

4

How to calculate the remaining error budget for a service?

5

What to do when a service's error budget is exhausted?

+21 คำถามสัมภาษณ์

หัวข้อสัมภาษณ์ DevOps อื่นๆ

Version Control & Git

Junior
20 คำถาม

Linux Fundamentals

Junior
22 คำถาม

Shell Scripting & Bash

Mid-Level
20 คำถาม

Networking Basics

Junior
22 คำถาม

Docker Fundamentals

Junior
24 คำถาม

CI/CD Fundamentals

Junior
18 คำถาม

GitHub Actions

Mid-Level
22 คำถาม

GitLab CI/CD

Mid-Level
22 คำถาม

Jenkins

Mid-Level
22 คำถาม

Kubernetes Basics

Mid-Level
26 คำถาม

Kubernetes Networking

Mid-Level
24 คำถาม

Kubernetes Advanced

Mid-Level
24 คำถาม

Ingress & API Gateway

Mid-Level
20 คำถาม

Terraform Basics

Mid-Level
22 คำถาม

Terraform Advanced

Mid-Level
22 คำถาม

Ansible & Configuration Management

Mid-Level
20 คำถาม

AWS Essentials

Mid-Level
26 คำถาม

Azure Fundamentals

Mid-Level
22 คำถาม

GCP Fundamentals

Mid-Level
22 คำถาม

Monitoring & Prometheus

Mid-Level
22 คำถาม

Logging & ELK Stack

Mid-Level
20 คำถาม

Alerting & Incident Response

Mid-Level
20 คำถาม

Cloud Identity & Secrets

Mid-Level
22 คำถาม

CI/CD Pipeline Security

Mid-Level
20 คำถาม

Helm & Kubernetes

Mid-Level
20 คำถาม

Runtime & Cluster Security

Senior
24 คำถาม

Container Supply Chain Security

Senior
22 คำถาม

Service Mesh & Istio

Senior
24 คำถาม

GitOps & ArgoCD

Senior
22 คำถาม

Progressive Delivery

Senior
20 คำถาม

Distributed Observability

Senior
22 คำถาม

Disaster Recovery & Backup

Senior
20 คำถาม

Performance Optimization

Senior
22 คำถาม

Cloud Cost Optimization

Senior
20 คำถาม

Chaos Engineering

Senior
20 คำถาม

Platform Engineering

Senior
22 คำถาม

เชี่ยวชาญ DevOps สำหรับการสัมภาษณ์ครั้งถัดไป

เข้าถึงคำถามทั้งหมด flashcards แบบทดสอบเทคนิค แบบฝึกหัด code review และตัวจำลองสัมภาษณ์

เริ่มใช้ฟรี