1
Linux๊ณผ Shell: ํ์ ๋ช
๋ น์ด, bash ์คํฌ๋ฆฝํ
, ๊ถํ, cron jobs
2
Git๊ณผ GitHub: branching, merge, rebase, pull requests, CI/CD ์ํฌํ๋ก
3
๊ณ ๊ธ Python: OOP, ๋ฐ์ฝ๋ ์ดํฐ, ์ ๋๋ ์ดํฐ, ์ปจํ
์คํธ ๋งค๋์ , typing, async/await
4
CI/CD: linting (Ruff, Pylint), packaging (Poetry), tests, GitHub Actions, pipelines
5
Docker: Dockerfile, ์ด๋ฏธ์ง, ์ปจํ
์ด๋, volumes, networks, multi-stage builds
6
Docker Compose: ๋ฉํฐ ์ปจํ
์ด๋ ์๋น์ค, ์์กด์ฑ, healthchecks, ๋ก์ปฌ ์ค์ผ์คํธ๋ ์ด์
7
FastAPI: ๋ผ์ฐํธ, Pydantic ๋ชจ๋ธ, ์์กด์ฑ, middleware, ๋ฐฐํฌ
8
๊ณ ๊ธ SQL: window functions, CTEs, ๋ถ์ ์ฟผ๋ฆฌ, ์ต์ ํ, ์ธ๋ฑ์ฑ
9
BigQuery: ์๋ฒ๋ฆฌ์ค ์ํคํ
์ฒ, ํํฐ์
๋, ํด๋ฌ์คํฐ๋ง, ๋น์ฉ, UDFs, federated queries
10
PostgreSQL: ์ค์ , ๋ณต์ , ์ธ๋ฑ์ฑ (B-tree, GIN, GiST), VACUUM, EXPLAIN ANALYZE
11
๋ฐ์ดํฐ ๋ชจ๋ธ๋ง: ์คํ ์คํค๋ง, ํฉํธ/๋๋ฉ์
ํ
์ด๋ธ, ์ ๊ทํ, SCD, data vault
12
ELT vs ETL vs ETLT: ํจํด, ํธ๋ ์ด๋์คํ, ์ํคํ
์ฒ ์ ํ
13
Fivetran๊ณผ Airbyte: ์ปค๋ฅํฐ, ๋๊ธฐํ ๋ชจ๋, CDC, ์คํค๋ง ์งํ
14
dbt: models, sources, refs, tests, snapshots, incremental models, Jinja macros
15
Apache Airflow: DAGs, operators, sensors, XCom, connections, pools, ํ์คํฌ ์์กด์ฑ
16
PySpark: RDD vs DataFrame, ๋ณํ, ์ก์
, ํํฐ์
๋, broadcast variables
17
์คํธ๋ฆฌ๋ฐ: Pub/Sub (topics, subscriptions), Apache Beam (PCollections, transforms, windowing), Dataflow
18
Kubernetes: pods, deployments, services, ingress, ConfigMaps, Secrets, Helm, scaling
19
Terraform: providers, resources, state, modules, plan/apply, infrastructure as code
20
IAM๊ณผ ๋ณด์: ์ต์ ๊ถํ ์์น, service accounts, GCP ์ญํ
21
NoSQL ๋ฐ์ดํฐ๋ฒ ์ด์ค: GraphDB (Neo4j), Document DBs (MongoDB, Firestore), Wide Column (Cassandra, Bigtable)
22
๋ฐ์ดํฐ ์ํคํ
์ฒ: Data Lake vs Data Warehouse vs Data Lakehouse, Data Mesh, Data Contracts
23
๋ชจ๋ํฐ๋ง๊ณผ ๊ด์ธก์ฑ: ๋ก๊น
, ๋ฉํธ๋ฆญ, ์๋ฆผ, SLA/SLO/SLI, ๋ฐ์ดํฐ ํ์ง ์ฒดํฌ