Data Science & ML

์ง€๋„ ๋จธ์‹ ๋Ÿฌ๋‹: ๋ถ„๋ฅ˜

๋กœ์ง€์Šคํ‹ฑ ํšŒ๊ท€, KNN, SVM, ํ‰๊ฐ€์ง€ํ‘œ (accuracy, precision, recall, F1, ROC-AUC), ์ž„๊ณ„๊ฐ’

24 ๋ฉด์ ‘ ์งˆ๋ฌธยท
Mid-Level
1

์ง€๋„ ๋ถ„๋ฅ˜ ์•Œ๊ณ ๋ฆฌ์ฆ˜์˜ ์ฃผ์š” ๋ชฉ์ ์€ ๋ฌด์—‡์ž…๋‹ˆ๊นŒ?

๋‹ต๋ณ€

์ง€๋„ ๋ถ„๋ฅ˜๋Š” ๋ ˆ์ด๋ธ”์ด ์ง€์ •๋œ ๋ฐ์ดํ„ฐ๋กœ๋ถ€ํ„ฐ ํ•™์Šตํ•˜์—ฌ ์ž…๋ ฅ features์—์„œ ์นดํ…Œ๊ณ ๋ฆฌ ๋˜๋Š” ํด๋ž˜์Šค(์ด์‚ฐ ๋ณ€์ˆ˜)๋ฅผ ์˜ˆ์ธกํ•˜๋Š” ๊ฒƒ์„ ๋ชฉํ‘œ๋กœ ํ•ฉ๋‹ˆ๋‹ค. ์—ฐ์† ๊ฐ’์„ ์˜ˆ์ธกํ•˜๋Š” ํšŒ๊ท€์™€ ๋‹ฌ๋ฆฌ, ๋ถ„๋ฅ˜๋Š” ๊ฐ ๊ด€์ธก์น˜๋ฅผ ์‚ฌ์ „ ์ •์˜๋œ ํด๋ž˜์Šค(์ด์ง„ ๋˜๋Š” ๋‹ค์ค‘ ํด๋ž˜์Šค)์— ํ• ๋‹นํ•ฉ๋‹ˆ๋‹ค.

2

๋กœ์ง€์Šคํ‹ฑ ํšŒ๊ท€๊ฐ€ ์˜ˆ์ธก์„ ํ™•๋ฅ ๋กœ ๋ณ€ํ™˜ํ•˜๊ธฐ ์œ„ํ•ด ์‚ฌ์šฉํ•˜๋Š” ์ˆ˜ํ•™ ํ•จ์ˆ˜๋Š” ๋ฌด์—‡์ž…๋‹ˆ๊นŒ?

๋‹ต๋ณ€

sigmoid ํ•จ์ˆ˜(๋˜๋Š” ๋กœ์ง€์Šคํ‹ฑ ํ•จ์ˆ˜)๋Š” ์ž„์˜์˜ ์‹ค์ˆ˜ ๊ฐ’์„ 0๊ณผ 1 ์‚ฌ์ด์˜ ํ™•๋ฅ ๋กœ ๋ณ€ํ™˜ํ•ฉ๋‹ˆ๋‹ค. sigma(z) = 1/(1+e^(-z))๋กœ ์ •์˜๋ฉ๋‹ˆ๋‹ค. ์ด ํ•จ์ˆ˜๋ฅผ ํ†ตํ•ด ์ถœ๋ ฅ์„ ์–‘์„ฑ ํด๋ž˜์Šค์— ์†ํ•  ํ™•๋ฅ ๋กœ ํ•ด์„ํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.

3

๋กœ์ง€์Šคํ‹ฑ ํšŒ๊ท€ ๋ชจ๋ธ์—์„œ ๊ณ„์ˆ˜๋Š” ๋ฌด์—‡์„ ๋‚˜ํƒ€๋ƒ…๋‹ˆ๊นŒ?

๋‹ต๋ณ€

๋กœ์ง€์Šคํ‹ฑ ํšŒ๊ท€ ๊ณ„์ˆ˜๋Š” ํ•ด๋‹น feature์˜ ๋‹จ์œ„ ๋ณ€ํ™”๋‹น log-odds์˜ ๋ณ€ํ™”๋ฅผ ๋‚˜ํƒ€๋ƒ…๋‹ˆ๋‹ค. ์–‘์ˆ˜ ๊ณ„์ˆ˜๋Š” ์–‘์„ฑ ํด๋ž˜์Šค์˜ ํ™•๋ฅ ์„ ์ฆ๊ฐ€์‹œํ‚ค๊ณ , ์Œ์ˆ˜ ๊ณ„์ˆ˜๋Š” ๊ฐ์†Œ์‹œํ‚ต๋‹ˆ๋‹ค. ๊ณ„์ˆ˜์˜ ์ง€์ˆ˜๋Š” odds ratio๋ฅผ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค.

4

K-Nearest Neighbors (KNN) ์•Œ๊ณ ๋ฆฌ์ฆ˜์€ ๋ถ„๋ฅ˜์—์„œ ์–ด๋–ป๊ฒŒ ์ž‘๋™ํ•ฉ๋‹ˆ๊นŒ?

5

KNN ์•Œ๊ณ ๋ฆฌ์ฆ˜์—์„œ k ๊ฐ’์„ ์„ ํƒํ•˜๋Š” ๊ฒƒ์˜ ์˜ํ–ฅ์€ ๋ฌด์—‡์ž…๋‹ˆ๊นŒ?

+21 ๋ฉด์ ‘ ์งˆ๋ฌธ

๊ธฐํƒ€ Data Science & ML ๋ฉด์ ‘ ์ฃผ์ œ

Python ๊ธฐ์ดˆ

Junior
25๊ฐœ ์งˆ๋ฌธ

Python ๊ฐ์ฒด ์ง€ํ–ฅ ํ”„๋กœ๊ทธ๋ž˜๋ฐ

Junior
20๊ฐœ ์งˆ๋ฌธ

Python ๋ฐ์ดํ„ฐ ๊ตฌ์กฐ

Junior
20๊ฐœ ์งˆ๋ฌธ

Git ๊ธฐ์ดˆ

Junior
18๊ฐœ ์งˆ๋ฌธ

SQL ๊ธฐ์ดˆ

Junior
20๊ฐœ ์งˆ๋ฌธ

NumPy ๊ธฐ์ดˆ

Junior
22๊ฐœ ์งˆ๋ฌธ

Pandas ๊ธฐ์ดˆ

Junior
22๊ฐœ ์งˆ๋ฌธ

Jupyter & Google Colab

Junior
16๊ฐœ ์งˆ๋ฌธ

SQL Joins ๋ฐ ๊ณ ๊ธ‰ ์ฟผ๋ฆฌ

Mid-Level
22๊ฐœ ์งˆ๋ฌธ

Pandas ๊ณ ๊ธ‰

Mid-Level
24๊ฐœ ์งˆ๋ฌธ

Matplotlib & Seaborn์„ ํ™œ์šฉํ•œ ์‹œ๊ฐํ™”

Mid-Level
20๊ฐœ ์งˆ๋ฌธ

Plotly๋กœ ๋งŒ๋“œ๋Š” ์ธํ„ฐ๋ž™ํ‹ฐ๋ธŒ ์‹œ๊ฐํ™”

Mid-Level
18๊ฐœ ์งˆ๋ฌธ

๊ธฐ์ˆ  ํ†ต๊ณ„

Mid-Level
20๊ฐœ ์งˆ๋ฌธ

์ถ”๋ก  ํ†ต๊ณ„ํ•™

Mid-Level
24๊ฐœ ์งˆ๋ฌธ

Web Scraping

Mid-Level
18๊ฐœ ์งˆ๋ฌธ

BigQuery & Cloud Data

Mid-Level
18๊ฐœ ์งˆ๋ฌธ

Feature Engineering

Mid-Level
22๊ฐœ ์งˆ๋ฌธ

์ง€๋„ ๋จธ์‹ ๋Ÿฌ๋‹: ํšŒ๊ท€

Mid-Level
24๊ฐœ ์งˆ๋ฌธ

๊ฒฐ์ • ํŠธ๋ฆฌ ๋ฐ ์•™์ƒ๋ธ”

Mid-Level
24๊ฐœ ์งˆ๋ฌธ

๋น„์ง€๋„ ML

Mid-Level
22๊ฐœ ์งˆ๋ฌธ

ML ํŒŒ์ดํ”„๋ผ์ธ ๋ฐ ๊ฒ€์ฆ

Mid-Level
22๊ฐœ ์งˆ๋ฌธ

์‹œ๊ณ„์—ด ๋ฐ ์˜ˆ์ธก

Mid-Level
22๊ฐœ ์งˆ๋ฌธ

Deep Learning ๊ธฐ์ดˆ

Senior
24๊ฐœ ์งˆ๋ฌธ

TensorFlow & Keras

Senior
22๊ฐœ ์งˆ๋ฌธ

CNN ๋ฐ ์ด๋ฏธ์ง€ ๋ถ„๋ฅ˜

Senior
24๊ฐœ ์งˆ๋ฌธ

RNN ๋ฐ ์‹œํ€€์Šค

Senior
22๊ฐœ ์งˆ๋ฌธ

Transformers ๋ฐ Attention

Senior
24๊ฐœ ์งˆ๋ฌธ

NLP ๋ฐ Hugging Face

Senior
24๊ฐœ ์งˆ๋ฌธ

GenAI ๋ฐ LangChain

Senior
24๊ฐœ ์งˆ๋ฌธ

MLOps ๋ฐ ๋ฐฐํฌ

Senior
24๊ฐœ ์งˆ๋ฌธ

๋‹ค์Œ ๋ฉด์ ‘์„ ์œ„ํ•ด Data Science & ML์„ ๋งˆ์Šคํ„ฐํ•˜์„ธ์š”

๋ชจ๋“  ์งˆ๋ฌธ, flashcards, ๊ธฐ์ˆ  ํ…Œ์ŠคํŠธ, ์ฝ”๋“œ ๋ฆฌ๋ทฐ ์—ฐ์Šต, ๋ฉด์ ‘ ์‹œ๋ฎฌ๋ ˆ์ดํ„ฐ์— ์ ‘๊ทผํ•˜์„ธ์š”.

๋ฌด๋ฃŒ๋กœ ์‹œ์ž‘ํ•˜๊ธฐ