Machine Learning
CS 5350/CS 6350
Spring 2007
![]() |
Machine Learning
CS 5350/CS 6350 Spring 2007
|
![]() |
The field of machine learning attempts to build algorithms that are able to
discover and exploit patterns in data. These techniques have led to
significant advances in many fields, including stock trading, robotics, machine
translation, computer vision, medicine, etc. This course covers the basics of
machine learning (supervised and unsupervised learning: essentially, learning
with and without a teacher) as well as some more advanced, recent research
topics. A good, brief overview of the field is available here.
The catalog lists CS 3510 as a prerequisite; this can be waived if you have
or can quickly acquire
reasonable programming skills (in C, Matlab, Java, R, Perl, whatever; talk
to me) or if you are a graduate student. There will be a fair amount of math
in this class, but all I'll really expect you to know coming in is how to take
multivariate derivatives. Some assignments will probably go quicker if you
have some background in continuous math (probability and/or linear algebra),
but we'll cover in class all you need to know there. See also Reading 0 for some more background.
Your grade will be determined by your performance in the following areas:
homework, midterm exam, course
project and final exam. There are five assignments, which frequently involve both
programming and written aspects. There is one midterm exam, just preceding
spring break, and a final exam. There is a large course project, the topic of which is largely
of your choosing. Teams of two to three are allowed for the project, but
everything else will be done individually.
Each homework is worth 1 point (for a total of 5 points). The midterm is worth
4 points. The project is worth 4 points if you're enrolled in 5350 and 6
points if you're enrolled in 6350. The final exam is worth 6 points if you're
enrolled in 5350 and 4 points if you're enrolled in 6350. One additional point
is withheld for class participation, etc. There will be limited opportunity
for extra credit on some
homework assignments, midterm and final. I do not curve, but do adjust cutoffs
based on overall performance (though the cutoffs for 5350 and 6350 will be
determined separately).
The textbook will be the new book by Chris Bishop, Pattern
Recognition and Machine Learning (ISBN 0387310738). Other recommended (but not required) books:
Information Theory, Inference and Learning Algorithms by David MacKay (ISBN 0521642981)
Machine Learning by Tom Mitchell (ISBN 0070428077)
An Introduction to Computational Learning Theory by Michael Kearns and Umesh Vazirani (ISBN 0262111934)
Elements of Statistical Learning by Trevor Hastie, Robert Tibshirani and Jerome Friedman (ISBN 0387952845)
| Date | Topics | Readings | HW | Notes |
| 09 Jan |
What is machine learning?
Learning theory, Bias, Overfitting |
- | 0 out | ![]() |
| 11 Jan |
Math refresher
Statistic, linear algebra and calculus |
1.1-1.4, [0] | - | ![]() |
| 16 Jan |
Linear models for regression
Least-squares techniques Bias/variance trade-off |
3.1-3.2 | 0 due | ![]() |
| 18 Jan |
Linear models for classification
Logistic regression, naive Bayes |
4.1-4.3, [1] | 1 out | ![]() |
| 23 Jan |
Linear models for classification (cont'd)
Perceptron algorithm, linear SVMs |
7.1.1-7.1.3, [2] |
- | ![]() |
| 25 Jan |
Linear models (cont'd)
Linear SVMs, VC dimension |
6.1-6.2, [2] | - | (same) |
| 30 Jan |
Non-linear models
Nearest neighbors, decision trees |
[3] | - | ![]() |
| 01 Feb |
Non-linear models
Non-linear SVMs, Kernels |
[4] | 1 due 2 out |
![]() |
| 06 Feb |
Combining models
Boosting, bagging and reductions |
14.2, 14.3, [6], [7] |
- | ![]() |
| 08 Feb |
Feature selection
Discussion of projects |
[8] | - | ![]() |
| 13 Feb |
Clustering
K-means, agglomerative |
9.1, [9] | - | ![]() |
| 15 Feb |
Clustering (cont'd)
Mixtures of Gaussians |
9.2, [10] | 2 due 3 out |
![]() |
| 20 Feb | Expectation maximization | 9.4, [11] | - | ![]() |
| 22 Feb |
Expectation maximization (cont'd)
Semi-supervised learning |
[12] | - | ![]() |
| 27 Feb |
Low-dimensional representations
Principle component analysis Locally linear embedding |
12.1, [13] | - | ![]() |
| 01 Mar |
Sequence labeling
Hidden Markov Models |
13.1-13.2 | - | ![]() |
| 06 Mar |
Sequence labeling and beyond
Maximum Entropy Markov Models Conditional Random Fields |
[14], [15] | - | ![]() |
| 08 Mar |
Sequence labeling and beyond (cont'd)
Structured Perceptron Search-based Structured Prediction |
[16], [18] | 3 due 4 out |
![]() |
| 13 Mar | Catch-up | - | PP due | - |
| 15 Mar | MIDTERM | - | - | - |
| 20 Mar | Spring break | - | - | - |
| 22 Mar | Spring break | - | - | - |
| 27 Mar |
Introduction to Bayesian learning
Probability distributions, graphical models |
1.5, 2.1-2.4, 8.1 |
- | ![]() |
| 29 Mar |
Cancelled
(Hal out of town) |
- | - | - |
| 03 Apr |
Inference
Exact inference, basic sampling |
8.2, 8.4.1-8.4.2 |
4 due 5 out |
![]() |
| 05 Apr |
Inference (cont'd)
Markov Chain Monte Carlo Latent Dirichlet allocation |
11.1-11.3, [19], [20] |
- | ![]() |
| 10 Apr |
Classification and regression
Bayesian linear/logistic regression Laplace approximation |
3.3, 4.4-4.5 | - | ![]() |
| 12 Apr | Catch-up | - | - | - |
| 17 Apr |
Project presentations
Virost; Hansen/Oh/Zhu; Kim/Rai; Lanka |
- | - | - |
| 19 Apr |
Project presentations
Gilbert/Bresee; Tandon; Valentine/Alfeld; Ha/Santos/Wang |
- | - | - |
| 24 Apr |
Project presentations
Gerber; Abbasi/Quist; Hetrick/Milyavskaya; Andrade |
- | 5 due PW due |
- |
Several of the topics we will discuss are not covered in sufficient depth in
PRML. These are provided instead in tutorial and research papers, listed
here. These readings are required.