1
Python: types, data structures, OOP, decorators, generators, context managers
2
NumPy: arrays, broadcasting, indexing, vectorized operations, linear algebra
3
Pandas: DataFrames, Series, indexing, groupby, merge, pivot, time series
4
SQL: SELECT, JOIN, GROUP BY, window functions, CTEs, query optimization
5
Visualization: Matplotlib (figures, axes, subplots), Seaborn (statistical plots), Plotly (interactive)
6
Statistics: distributions, hypothesis testing, confidence intervals, regression
7
Feature Engineering: encoding, scaling, feature selection, feature creation
8
Supervised ML: linear/logistic regression, trees, Random Forest, XGBoost, metrics
9
Unsupervised ML: K-Means, hierarchical clustering, PCA, t-SNE
10
ML Pipeline: train/test split, cross-validation, hyperparameter tuning, overfitting
11
Deep Learning: perceptrons, backpropagation, activation functions, optimizers, loss functions
12
CNN: convolutions, pooling, architectures (ResNet, VGG), transfer learning
13
RNN/LSTM: sequences, vanishing gradient, attention mechanism, Transformers
14
NLP: tokenization, embeddings, word2vec, BERT, LLM fine-tuning
15
MLOps: versioning (MLflow), containerization (Docker), API (FastAPI), monitoring
16
Cloud: Google Cloud (Compute, Storage, BigQuery), GPU training, Vertex AI
17
AI Ethics: bias, explainability (SHAP, LIME), fairness, GDPR