Machine Learning

COMP 5212, Spring 2024
The Hong Kong University of Science and Technology
Junxian He

Course description: this is an advanced machine learning course to provide a broad introduction to machine learning, covering foundational machine learning concepts, theory, algorithms, and applications. The topics include supervised learning (logistic regression, linear models, classification, SVM, kernal methods, naive Bayes), unsupervised learning (clustering, expectation maximization, graphical models, HMM), and reinforcement learning. It will include some optimization/learning topics (gradient descent, SGD, MLE and MAP estimation) as well. Certain aspects of deep learning are introduced(neural network basics, architectures, VAEs, GANs). The last lectures will give a general introduction to techniques in large language models.

Course Information

Course Logistics (Grading, Policy, etc.)

Time and Location

Lectures: Wed, Fri 3:00 PM - 4:20 PM at Lecture Theater E

Course Staff and Office Hours

Instructor: Junxian He. Office Hour: Wed 4:30 PM - 5:30 PM at Room 3512
TA1: Kashun Shum. Office Hour: Wed 2:00 PM - 3:00 PM outside Room 3657
TA2: Chi Xu. Office Hour: Fri 6:00 PM - 7:00 PM at Room 4033

Contact and Communication

Canvas is the main platform for communication about technical questions of lecture contents and homeworks. Please do not directly email the teaching staff on technical questions which may easily get lost. The teaching staff will try to answer questions on Canvas in a prompt manner. Some rules:

  • All announcements and communications will happen over Canvas
  • Please utilize the Canvas Discussion page to ask questions. There are pre-existed discussion topics initiated by the staff such as lecture questions and each homework. Please post your questions in the respective thread, you are also encouraged to answer others’ questions if you know the answer.
  • While we allow students to optionally post anonymous questions and answers on Canvas, please do not post anything inappropriate as anonymous users – in such cases, we will turn off the anonymous option.
  • If you have non-technical, private matters, please email the teaching staff directly.
  • Please consult the Course Logistics page before asking logistical questions.
  • For longer discussions, please come to office hours.

Prerequisites and Materials

Students are required to be familiar with probability and linear algebra, and able to program well in Python. The CMU 10701 self-assessment exam is a good resource to roughly give you a notion of the background that is required for this course.

This course does not have a textbook, but here are some helpful materials:
Stanford CS229 Notes
Probability Review
The Matrix Cookbook
Linear Algebra Review

Lecture Schedule

The lecture schedule below is tentative and subject to change.

SlidesDateTopicReadingsAssignments
Lecture 031/01 WedIntroduction  
Lecture 102/02 FriMath basics  
Lecture 2, draft207/02 WedLinear Regression  
Lecture 3, draft309/02 FriLogistic regression, Exponential Family  
Lecture 4, draft414/02 WedGeneralized linear models, Kernel MethodsSection 3 of Notes 
Lecture 5, draft516/02 FriKernel methods, SVMSection 5 of Notes 
Lecture 6, draft621/02 WedSVMSection 6 of Notes 
Lecture 7, draft723/02 FriSVMSection 6 of NotesHW1 Out
Lecture 8, draft828/02 WedGenerative ModelsSection 4 of Notes 
Lecture 9, draft901/03 FriNaive Bayes, MLE, MAPSection 4.2 of Notes, Sec 2.1-2.3.6 of Mitchell Ch2, Section 4.2, 4.6 of probml 
Lecture 10, draft1006/03 WedGeneralization, bias-variance tradeoffSection 8.1-8.2 of Notes 
Lecture 11, draft1108/03 FriClustering, EMSection 10, 11.1, 11.2 of NotesHW1 Due
Lecture 12, draft1213/03 WedExpectation MaximizationSection 11 of NotesHW2 Out
Lecture 13, draft1315/03 FriPCASection 12 of Notes 
Lecture 1420/03 Wedmid-term exam  
Lecture 15, draft1522/03 FriProbabilistic Graphical ModelsKevin Murphy’s Tutorial 
Lecture 16, draft1627/03 WedHMMChapter 8, Speech and Language Processing 
 28/03 Thu  HW2 Due
 29/03 FriGood Friday holiday  
 30/03 Sat  Programming HW Out
 03/04 Wedmid-term break  
 05/04 Frimid-term break  
Lecture 17, draft1710/04 WedHMMChapter 8, Speech and Language Processing 
Lecture 18, draft1812/04 FriNeural Networks, backpropagationSection 7.4 of Notes 
Lecture 19, draft1917/04 WedNeural architectures HW3 Out
Lecture 20, draft2019/04 FriTransformer, Variational autoencoderThe Illustrated Transformer
The Annotated Transformer
 
Lecture 21, draft2124/04 WedVariational autoencoderAuto-Encoding Variational Bayes 
Lecture 2226/04 FriGANs, Reinforcement Learning  
 30/04 Tue  HW3 Due
 01/05 WedLabor day  
Lecture 2303/05 FriLanguage models Programming HW Due
Lecture 2408/05 WedPretraining  
Lecture 2510/05 FriLarge language models