Machine Learning

COMP 5212, Spring 2026
The Hong Kong University of Science and Technology
Junxian He

Course description: this is an advanced machine learning course to provide a broad introduction to machine learning, covering foundational machine learning concepts, theory, algorithms, and applications. The topics include supervised learning (logistic regression, linear models, classification, SVM, kernal methods, naive Bayes), unsupervised learning (clustering, expectation maximization, graphical models, HMM), and reinforcement learning. It will include some optimization/learning topics (gradient descent, SGD, MLE and MAP estimation) as well. Certain aspects of deep learning are introduced(neural network basics, architectures, VAEs, GANs). The last lectures will give a general introduction to techniques in large language models.

Course Information

Course Logistics (Grading, Policy, etc.)

Time and Location

Lectures: Tue, Thu 4:30 PM - 5:50 PM at Room 4619 (Lift 31-32)

Course Staff and Office Hours

Instructor: Junxian He
TA1: Kashun Shum (ksshumab@connect.ust.hk)
TA2: Junlong Li (jlini@cse.ust.hk)
TA3: Hongxiang Li (hlihg@connect.ust.hk)

Contact and Communication

Canvas is the main platform for communication about technical questions of lecture contents and homeworks.

Prerequisites and Materials

Students are required to be familiar with probability and linear algebra, and able to program well in Python. The CMU 10701 self-assessment exam is a good resource to roughly give you a notion of the background that is required for this course.

This course does not have a textbook, but here are some helpful materials:
Stanford CS229 Notes
Probability Review
The Matrix Cookbook
Linear Algebra Review

Lecture Schedule

The lecture schedule below is tentative and subject to change. Slides will be posted here when available.

SlidesDateTopicReadingsAssignments
Lecture 003/02 TueIntroduction  
Lecture 105/02 ThuMath basics  
Lecture 210/02 TueLinear Regression  
Lecture 312/02 ThuLogistic regression, Exponential Family  
 17/02 TueLunar New Year’s Day (Public Holiday) — No class  
 19/02 ThuThe third day of Lunar New Year (Public Holiday) — No class  
Lecture 424/02 TueGeneralized linear models, Kernel MethodsSection 3 of Notes 
Lecture 526/02 ThuSVMSection 5, 6 of Notes 
Lecture 603/03 TueGenerative ModelsSection 4 of Notes 
Lecture 705/03 ThuNaive Bayes, MLE, MAPSection 4.2 of Notes, Sec 2.1-2.3.6 of Mitchell Ch2, Section 4.2, 4.6 of probml 
Lecture 810/03 TueGeneralization, bias-variance tradeoffSection 8.1-8.2 of Notes 
Lecture 912/03 ThuClustering, EMSection 10, 11.1, 11.2 of Notes 
Lecture 1017/03 TueExpectation MaximizationSection 11 of Notes 
Lecture 1119/03 ThuPCASection 12 of Notes 
Lecture 1224/03 TueProbabilistic Graphical ModelsKevin Murphy’s Tutorial 
Lecture 1326/03 ThuHMMChapter 8, Speech and Language Processing 
Lecture 1431/03 TueMid-term exam  
Lecture 1502/04 ThuHMMChapter 8, Speech and Language Processing 
 07/04 TueMid-term Break (03/04–08/04)  
Lecture 1609/04 ThuNeural Networks, backpropagationSection 7.4 of Notes 
Lecture 1714/04 TueNeural architectures  
Lecture 1816/04 ThuTransformer, Variational autoencoderThe Illustrated Transformer, The Annotated Transformer 
Lecture 1921/04 TueVariational autoencoderAuto-Encoding Variational Bayes 
Lecture 2023/04 ThuGANs, Reinforcement LearningGenerative Adversarial Networks, OpenAI Spinning Up in RL 
Lecture 2128/04 TueLarge language models  
Lecture 2230/04 ThuLLM Agents  
Lecture 2305/05 TueLLM Agents  
Lecture 2407/05 ThuWrap-up / Review