CE6143 - Introduction to Data Science (Fall 2024)

Lecture language: English

Meeting time

Location

Staff

Recommended textbooks

Supplementary materials

Grading

Slides

Progress (subject to change)

WeekDateContentExercise
19/101. Overview of the course
29/17Holiday (mid autumn festival)
39/241. Introduction to ML
2. KNN
3. k-means
4. Distance measures
Project proposal (group) (due: 9/30 23:59:59)
410/11. Entropy
2. Decision tree
510/81. Decision tree
2. Matrix derivatives
610/15Linear regression and regularization (Lasso, Ridge, Elastic-net)
710/221. Logistic regression and gradient ascent
2. Evaluation metrics for binary classification, multi-class classification, and multi-label classification
810/291. ROC curve vs PR curve
2. Entropy, cross-entropy, and KL-divergence
3. Practical concerns on traditional machine learning
1. Progress report (due: 10/28 23:59:59)
2. Kaggle Competition 1 begins (due: 11/18 23:59:59)
911/5Ensemble learning
1011/121. Gradient boosting machines
2. Linear SVM
1111/191. Kernel SVM
2. Regularized linear regression and classification
3. Linear SVM with poly-2 terms vs. polynomial kernel SVM
Kaggle Competition 2 begins (due: 12/9 23:59:59)
1211/261. Multi-layer perceptron
2. Convolutional neural network
1312/31. Convolutional neural network
2. Recurrent neural network
1412/10Word2Vec, Prod2Vec, Behavior2Vec, and contrastive learning
1512/17Transformer and Large Language Model
1612/24Associated learningFinal project due: 12/30 23:59:59
1712/31Flexible learning week:Invited talk (speaker will be announced)
181/7Flexible learning week: Explainable AI and ethics