
Git & GitHub - Fundamentals
Branching, merge, rebase, pull requests, gitflow, .gitignore, hooks, CI/CD workflows
1Which command initializes a new Git repository in an existing directory?
Which command initializes a new Git repository in an existing directory?
Answer
The git init command creates a new Git repository in the current directory by initializing the .git subdirectory containing all files needed for versioning. This command is the first step to start tracking changes in an existing project.
2What is the difference between git clone and git init?
What is the difference between git clone and git init?
Answer
git init creates a new empty repository in the current directory, while git clone copies an existing remote repository to the local machine, including the full commit history and branches. git clone also automatically sets up the connection with the remote repository (origin).
3Which command creates a new branch and switches to it in a single operation?
Which command creates a new branch and switches to it in a single operation?
Answer
The command git checkout -b branch_name creates a new branch and immediately switches to it. Since Git 2.23, git switch -c branch_name can also be used and is more explicit. These commands avoid doing git branch followed by git checkout.
What is the main difference between git merge and git rebase?
Why should rebasing commits already pushed to a shared branch be avoided?
+17 interview questions
Other Data Engineering interview topics
Linux & Shell - Fundamentals
Advanced Python for Data Engineering
Docker - Fundamentals
Google Cloud Platform - Fundamentals
CI/CD and Code Quality
Docker Compose
FastAPI - Data APIs
Advanced SQL for Data Engineering
Data Lake - Architecture and Ingestion
BigQuery for Data Engineering
PostgreSQL - Administration
Data Modeling for Data Engineering
Fivetran & Airbyte - Data Ingestion
dbt - Fundamentals
Apache Airflow - Fundamentals
Kubernetes - Fundamentals
dbt - Advanced Features
ETL / ELT / ETLT Patterns
Apache Airflow - Advanced
Airflow + dbt - Pipeline Orchestration
PySpark - Large-Scale Processing
Google Pub/Sub - Data Streaming
Apache Beam & Dataflow
Kubernetes - Production and Scaling
Terraform - Infrastructure as Code
NoSQL Databases
Modern Data Architecture
Monitoring and Observability
IAM and Data Security
Master Data Engineering for your next interview
Access all questions, flashcards, technical tests, code review exercises and interview simulators.
Start for free