
Kubernetes - Production and Scaling
Helm charts, HPA/VPA, StatefulSets, PersistentVolumes, RBAC, GKE, resource limits, Prometheus/Grafana, health probes
1What is the main role of Helm in a Kubernetes ecosystem?
What is the main role of Helm in a Kubernetes ecosystem?
Answer
Helm is the package manager for Kubernetes. It allows defining, installing, and upgrading complex applications through charts, which are collections of templated YAML files. Helm simplifies deployment by managing dependencies, versions, and configurations in a reproducible way.
2What is the fundamental difference between a Deployment and a StatefulSet?
What is the fundamental difference between a Deployment and a StatefulSet?
Answer
A StatefulSet guarantees a stable and persistent identity for each pod (network name, storage) while a Deployment treats pods as interchangeable. StatefulSets are essential for stateful applications like databases where each instance must retain its identity and data across restarts.
3How does the Horizontal Pod Autoscaler (HPA) work to adjust the number of replicas?
How does the Horizontal Pod Autoscaler (HPA) work to adjust the number of replicas?
Answer
HPA monitors pod metrics (CPU, memory, or custom metrics) via the Metrics Server and automatically adjusts the number of replicas to maintain target utilization. It calculates the ratio between current and target usage, then scales up or down accordingly with cooldown periods to avoid thrashing.
What is the difference between HPA (Horizontal Pod Autoscaler) and VPA (Vertical Pod Autoscaler)?
What is the role of a PersistentVolume (PV) and PersistentVolumeClaim (PVC) in Kubernetes?
+17 interview questions
Other Data Engineering interview topics
Linux & Shell - Fundamentals
Git & GitHub - Fundamentals
Advanced Python for Data Engineering
Docker - Fundamentals
Google Cloud Platform - Fundamentals
CI/CD and Code Quality
Docker Compose
FastAPI - Data APIs
Advanced SQL for Data Engineering
Data Lake - Architecture and Ingestion
BigQuery for Data Engineering
PostgreSQL - Administration
Data Modeling for Data Engineering
Fivetran & Airbyte - Data Ingestion
dbt - Fundamentals
Apache Airflow - Fundamentals
Kubernetes - Fundamentals
dbt - Advanced Features
ETL / ELT / ETLT Patterns
Apache Airflow - Advanced
Airflow + dbt - Pipeline Orchestration
PySpark - Large-Scale Processing
Google Pub/Sub - Data Streaming
Apache Beam & Dataflow
Terraform - Infrastructure as Code
NoSQL Databases
Modern Data Architecture
Monitoring and Observability
IAM and Data Security
Master Data Engineering for your next interview
Access all questions, flashcards, technical tests, code review exercises and interview simulators.
Start for free