Question 1

Which command initializes a new Git repository in an existing directory?

Accepted Answer

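The git init command creates a new Git repository in the current directory by creating a .git subdirectory that holds the repository metadata (object database, refs, and configuration). It is the first step toward tracking changes in an existing project: the existing files are left untouched and stay untracked until they are staged with git add and committed.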
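As a minimal sketch of that first step, the session below runs git init in a throwaway directory and checks the result. The temporary directory and the file name notes.txt are made up for illustration and are not part of the original answer.

```shell
# Hypothetical project directory, created only for this illustration.
project_dir="$(mktemp -d)"
cd "$project_dir"

# An existing file, to show that git init leaves working files alone.
echo "draft" > notes.txt

# Initialize the repository; -q suppresses the informational output.
git init -q

# The repository metadata lives in the new .git subdirectory.
ls -d .git

# notes.txt is listed as untracked until it is added and committed.
git status --short
```

Running git init again in the same directory is safe: it does not overwrite the history already stored in .git.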

Question 2

What is the difference between git clone and git init?

Accepted Answer

git init creates a new, empty repository in the current directory, with no history and no remote configured. git clone copies an existing repository to the local machine, including its full commit history and branches by default. git clone also configures a remote named origin that points at the source repository and checks out its default branch, so fetching and pushing work without further setup.
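The contrast can be sketched in a scratch directory. This is an illustration under stated assumptions: the directory names source_repo and cloned_repo and the committer identity are hypothetical, and a local path stands in for the remote URL that a real clone would normally use.

```shell
# Hypothetical scratch area; all paths below are illustrative.
work_dir="$(mktemp -d)"
cd "$work_dir"

# 1) git init: a brand-new repository with no remote configured.
git init -q source_repo
git -C source_repo -c user.name="Example" -c user.email="example@example.com" \
    commit -q --allow-empty -m "Initial commit"
# Prints nothing, because no remote exists yet.
git -C source_repo remote

# 2) git clone: copy an existing repository, history included.
#    A local path is used here in place of a remote URL.
git clone -q source_repo cloned_repo

# The commit made in source_repo is present in the clone.
git -C cloned_repo log --oneline

# The clone has a remote named origin pointing back at source_repo.
git -C cloned_repo remote -v
```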

Question 3

Which command creates a new branch and switches to it in a single operation?

Accepted Answer

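The command git checkout -b branch_name creates a new branch and immediately switches to it. Since Git 2.23, git switch -c branch_name does the same thing and is more explicit, because git switch deals only with branches whereas git checkout is also used to restore files. Either form replaces the two-step sequence git branch branch_name followed by git checkout branch_name.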
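A short sketch of both forms, run in a scratch repository. The branch names feature-a, feature-b, and feature-c and the committer identity are hypothetical, and the fallback on the git switch line is an assumption added so the example also runs on Git versions older than 2.23.

```shell
# Hypothetical scratch repository with one commit to branch from.
repo_dir="$(mktemp -d)"
cd "$repo_dir"
git init -q
git -c user.name="Example" -c user.email="example@example.com" \
    commit -q --allow-empty -m "Initial commit"

# Classic form: create feature-a and switch to it in one step.
git checkout -q -b feature-a
git symbolic-ref --short HEAD

# Newer form (Git 2.23+); falls back to checkout -b on older versions.
git switch -q -c feature-b 2>/dev/null || git checkout -q -b feature-b
git symbolic-ref --short HEAD

# Two-step equivalent that the single commands replace.
git branch feature-c
git checkout -q feature-c
git symbolic-ref --short HEAD
```

Each git symbolic-ref --short HEAD call prints the branch that is currently checked out, which makes the switch visible after every step.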

Git & GitHub - Fundamentals

What is the main difference between git merge and git rebase?

Why should rebasing commits already pushed to a shared branch be avoided?
