
Docker - Fundamentals
Dockerfile, images, containers, volumes, networks, registries, multi-stage builds, best practices
1What is a Docker container?
What is a Docker container?
Answer
A Docker container is a runnable instance of a Docker image. It encapsulates an application and all its dependencies in an isolated and portable environment. Unlike virtual machines, containers share the host system's kernel, making them much lighter and faster to start. Each container has its own filesystem, network stack, and isolated processes.
2What is a Docker image?
What is a Docker image?
Answer
A Docker image is a read-only template that contains the instructions for creating a container. It includes the application code, libraries, dependencies, tools, and files needed for execution. Images are built from a Dockerfile and are composed of stacked layers. Each Dockerfile instruction creates a new layer, enabling efficient sharing and reuse of common layers between images.
3What is the purpose of a Dockerfile?
What is the purpose of a Dockerfile?
Answer
A Dockerfile is a text file containing a series of instructions to automatically build a Docker image. Each instruction (FROM, RUN, COPY, etc.) creates a layer in the final image. The Dockerfile allows you to declaratively and reproducibly define an application's runtime environment. It ensures the image will be identical on every build, facilitating deployment and team collaboration.
Which Dockerfile instruction specifies the base image?
What is the difference between COPY and ADD instructions in a Dockerfile?
+22 interview questions
Other Data Engineering interview topics
Linux & Shell - Fundamentals
Git & GitHub - Fundamentals
Advanced Python for Data Engineering
Google Cloud Platform - Fundamentals
CI/CD and Code Quality
Docker Compose
FastAPI - Data APIs
Advanced SQL for Data Engineering
Data Lake - Architecture and Ingestion
BigQuery for Data Engineering
PostgreSQL - Administration
Data Modeling for Data Engineering
Fivetran & Airbyte - Data Ingestion
dbt - Fundamentals
Apache Airflow - Fundamentals
Kubernetes - Fundamentals
dbt - Advanced Features
ETL / ELT / ETLT Patterns
Apache Airflow - Advanced
Airflow + dbt - Pipeline Orchestration
PySpark - Large-Scale Processing
Google Pub/Sub - Data Streaming
Apache Beam & Dataflow
Kubernetes - Production and Scaling
Terraform - Infrastructure as Code
NoSQL Databases
Modern Data Architecture
Monitoring and Observability
IAM and Data Security
Master Data Engineering for your next interview
Access all questions, flashcards, technical tests, code review exercises and interview simulators.
Start for free