Machine Learning
COMP 5212, Fall 2024
The Hong Kong University of Science and Technology
Junxian He
Course description: this is an advanced machine learning course to provide a broad introduction to machine learning, covering foundational machine learning concepts, theory, algorithms, and applications. The topics include supervised learning (logistic regression, linear models, classification, SVM, kernal methods, naive Bayes), unsupervised learning (clustering, expectation maximization, graphical models, HMM), and reinforcement learning. It will include some optimization/learning topics (gradient descent, SGD, MLE and MAP estimation) as well. Certain aspects of deep learning are introduced(neural network basics, architectures, VAEs, GANs). The last lectures will give a general introduction to techniques in large language models.
Course Information
Course Logistics (Grading, Policy, etc.)
Time and Location
Lectures: Tue, Thu 1:30 PM - 2:50 PM at LG3008, Lift 10-12
Course Staff and Office Hours
Instructor: Junxian He. Office Hour: Tue 5 PM - 6 PM at Room 3512
TA1: Yuzhen Huang. Office Hour: Wed 9 AM - 10AM at the common place of SENG (near Lift 27/28, Room 2580)
TA2: Jinghan Zhang. Office Hour: Fri 6 PM - 7 PM at the common place of SENG (near Lift 27/28, Room 2580)
Contact and Communication
Canvas is the main platform for communication about technical questions of lecture contents and homeworks. Please do not directly email the teaching staff on technical questions which may easily get lost. The teaching staff will try to answer questions on Canvas in a prompt manner. Some rules:
- All announcements and communications will happen over Canvas
- Please utilize the Canvas Discussion page to ask questions. There are pre-existed discussion topics initiated by the staff such as lecture questions and each homework. Please post your questions in the respective thread, you are also encouraged to answer others’ questions if you know the answer.
- While we allow students to optionally post anonymous questions and answers on Canvas, please do not post anything inappropriate as anonymous users – in such cases, we will turn off the anonymous option.
- If you have non-technical, private matters, please email the teaching staff directly.
- Please consult the Course Logistics page before asking logistical questions.
- For longer discussions, please come to office hours.
Prerequisites and Materials
Students are required to be familiar with probability and linear algebra, and able to program well in Python. The CMU 10701 self-assessment exam is a good resource to roughly give you a notion of the background that is required for this course.
This course does not have a textbook, but here are some helpful materials:
Stanford CS229 Notes
Probability Review
The Matrix Cookbook
Linear Algebra Review
Lecture Schedule
The lecture schedule below is tentative and subject to change.
Slides | Date | Topic | Readings | Assignments |
---|---|---|---|---|
Lecture 0 | 03/09 Tue | Introduction | ||
Lecture 1, annotated1 | 05/09 Thu | Math basics | ||
Lecture 2, annotated2 | 10/09 Tue | Linear Regression | ||
Lecture 3, annotated3 | 12/09 Thu | Logistic regression, Exponential Family | HW1 Out | |
Lecture 4, annotated4 | 19/09 Thu | Generalized linear models, Kernel Methods | Section 3 of Notes | |
Lecture 5, annotated5 | 23/09 Mon | SVM | Section 5 of Notes | |
Lecture 6, annotated6 | 24/09 Tue | SVM | Section 6 of Notes | |
Lecture 7, annotated7 | 26/09 Thu | SVM | Section 6 of Notes | |
01/10 Tue | The National Day Holiday | HW1 Due on 02/10 | ||
Lecture 8, annotated8 | 03/10 Thu | Generative Models | Section 4 of Notes | |
Lecture 9, annotated9 | 08/10 Tue | Naive Bayes, MLE, MAP | Section 4.2 of Notes, Sec 2.1-2.3.6 of Mitchell Ch2, Section 4.2, 4.6 of probml | |
Lecture 10, annotated10 | 10/10 Thu | Generalization, bias-variance tradeoff | Section 8.1-8.2 of Notes | |
Lecture 11, annotated11 | 15/10 Tue | Clustering, EM | Section 10, 11.1, 11.2 of Notes | HW2 Out |
Lecture 12, annotated12 | 17/10 Thu | Expectation Maximization | Section 11 of Notes | |
Lecture 13, annotated13 | 22/10 Tue | PCA | Section 12 of Notes | |
Lecture 14 | 24/10 Thu | Mid-term exam | ||
Lecture 15, annotated15 | 29/10 Tue | Probabilistic Graphical Models | Kevin Murphy’s Tutorial | HW2 Due |
Lecture 16, annotated16 | 31/10 Thu | HMM | Chapter 8, Speech and Language Processing | |
Lecture 17, annotated17 | 05/11 Tue | HMM | Chapter 8, Speech and Language Processing | |
Lecture 18, annotated18 | 07/11 Thu | Neural Networks, backpropagation | Section 7.4 of Notes | HW3 out on 08/11 |
Lecture 19, annotated19 | 12/11 Tue | Neural architectures | Coding assignment out on 10/11 | |
14/11 Thu | Lecture Cancelled due to Typhoon | |||
Lecture 20, annotated20 | 19/11 Tue | Transformer, Variational autoencoder | The Illustrated Transformer, The Annotated Transformer | |
Lecture 21 | 21/11 Thu | Variational autoencoder | Auto-Encoding Variational Bayes | |
Lecture 22 | 26/11 Tue | GANs, Reinforcement Learning | Generative Adversarial Networks, OpenAI Spinning Up in RL | HW3 Due |
Lecture 23 | 28/11 Thu | Large language models | ||
8/12 | Coding Assignment Due | |||
14/12 | Final Exam |