CE6143 - Introduction to Data Science (Fall 2025)

Lecture language: English

Meeting time

Location

Staff

Recommended textbooks

Supplementary materials

Grading

Slides

Progress (subject to change)

WeekDateContentExercise
19/21. Overview of the course
29/91. Introduction to ML
2. KNN
3. k-means
4. Distance measures
Project proposal (group) (due: 9/15 23:59:59)
39/161. Entropy
2. Decision tree
49/231. Decision tree
2. Matrix derivatives
59/30Linear regression and regularization (Lasso, Ridge, Elastic-net)
610/71. Logistic regression and gradient ascent
2. Evaluation metrics for binary classification, multi-class classification, and multi-label classification
710/141. ROC curve vs PR curve
2. Entropy, cross-entropy, and KL-divergence
3. Practical concerns on traditional machine learning
810/21Ensemble learning1. Progress report (due: 10/20 23:59:59)
2. Kaggle Competition begins (due: 11/10 23:59:59)
910/281. Gradient boosting machines
2. Linear SVM
1011/41. Kernel SVM
2. Regularized linear regression and classification
3. Linear SVM with poly-2 terms vs. polynomial kernel SVM
1111/111. Multi-layer perceptron
2. Convolutional neural network
1211/181. Convolutional neural network
2. Recurrent neural network
1311/25Word2Vec, Transformer and Large Language ModelMini-ml implementation and report (due: 11/24 23:59:59)
1412/2Associated learning
1512/9Constrastive learning, supervised contrastive learning and SCPL
1612/16Diffusion modelFinal project (due: 12/15 23:59:59)