CMSC 422
Introduction to Machine Learning
Spring, 2025
MW 2:00-3:15, CSIC 1115
Introduction
to Machine Learning |
This
course will introduce the basic principles of Machine Learning. These include simple but useful machine
learning algorithms and the mathematics underlying these methods. Recently, Large Language Models have had a
large and growing impact on our lives.
In this class, we will discuss some of the fundamental ideas that
underlie these models, providing a basic understanding of how they operate. We will also look at some of the implications
to society of large scale machine learning. |
Assignments Students will
be assigned seven problem sets and also receive a
number of in-class assessments. Problem
Sets: The schedule
for problem sets is given below. These
will involve implementing algorithms and pencil and paper exercises. Students may make use of LLMs for
assistance and may work with other students.
Students are expected to work through each assignment, and to
understand all pieces of code and all written solutions that they turn in. Students will have two weeks to complete coding
assignments (which may also contain written problems), and one week for
assignments that are purely written. All
problem sets are due at start of class (2pm) on the due date. Two week assignments will count for twice as much as one
week assignments in grading. Problem
sets will count overall for 16½ % of the final grade. Quizzes: There will be seven short (ten minute)
quizzes given in class. These quizzes
will be given on the date that each problem set is due, and
will address the material in that problem set. There will be no partial credit given on
quizzes. These will count for 11% of
the final grade. Midterm
and Final: An
in-class midterm will be given (see schedule below) and a final exam will be
given at the date, time and location set by the university schedule (see
schedule below). Some portions
of assignments and exams may be listed as extra credit. This means that all grades will be assigned
as usual, ignoring extra credit. After
grades are determined, extra credit work may be used to raise students’ grades. |
Texts and Discussion Readings will be given
in the schedule below. We will use the
following texts, along with some additional pointers. CIML - A Course in Machine Learning, by Hal Daume. It is available freely here.
Or here. ISL – An Introduction to Statistical Learning, by
James, et al. It is freely available here. Neural Networks and Machine Learning, by Michael Nielsen. It is freely available here. We will be using Piazza for discussion: https://piazza.com/umd/spring2025/cmsc4220201 |
Disability Support Any student eligible for
and requesting reasonable academic accommodations due to a disability is requested
to provide, to the instructor in office hours, a letter of accommodation from
the Office of Disability Support Services (DSS) within the first TWO
weeks of the semester. |
Academic Integrity In this course you are responsible
for both the University’s Code of Academic
Integrity and the University of Maryland
Guidelines for Acceptable Use of Computing Resources. Any
evidence of unacceptable use of computer accounts or unauthorized cooperation
on tests, quizzes and assignments will be submitted to the Student Honor
Council, which could result in an XF for the course, suspension, or expulsion
from the University. Any work that you hand in must be your own work. Any sources that you draw from, including other students or LLMs, should be appropriately acknowledged. Plagiarism is a serious offense, and will not be tolerated. This should not be an issue with assignments; you just need to acknowledge any sources that you use. |
Anti-Harassment The open exchange of ideas,
the freedom of thought and expression, and respectful scientific debate are
central to the aims and goals of this course. These require a community and
an environment that recognizes the inherent worth of every person and group,
that fosters dignity, understanding, and mutual respect, and that embraces
diversity. Harassment and hostile behavior are unwelcome in any part of this
course. This includes: speech or behavior that intimidates, creates
discomfort, or interferes with a person’s participation or opportunity for
participation in the course. We aim for this course to be an environment
where harassment in any form does not happen, including but not limited to:
harassment based on race, gender, religion, age, color, national origin,
ancestry, disability, sexual orientation, or gender identity. Harassment
includes degrading verbal comments, deliberate intimidation, stalking,
harassing photography or recording, inappropriate physical contact, and
unwelcome sexual attention. Please contact an instructor or CS staff member
if you have questions or if you feel you are the victim of harassment (or
otherwise witness harassment of others). |
Course evaluations We welcome your
suggestions for improving this class, please don’t hesitate to share it with
the instructor or the TAs during the semester! You will also be asked to give
feedback using the CourseEvalUM system
at the end of the semester. Your feedback will help us make the course
better. |
|
Personnel |
|||||
|
Instructor |
Teaching Assistants |
|||
Name |
David Jacobs |
Jiaye Wu |
Bhuvanesh Murali |
Haowen Yu |
Naitri Rajyaguru |
Office |
IRB 4240 |
|
|
|
|
Email |
djacobs@cs |
Please post
on Piazza |
|||
Office Hours |
Friday 9-11,
IRB 4240 |
Monday 3:15-5:15 AVW 4140 |
Tuesday, 2-4 |
Wednesday,
10-12 |
Thursday,
9:30-11:30 |
Homework |
|||
|
Date Assigned |
Date Due |
Topic |
PS1 |
Feb. 3 |
Feb. 10 |
Decision
Trees |
PS2 |
Feb. 12 |
Feb. 24 |
K-means -
Implementation |
PS3 |
Feb. 24 |
March 3 |
Perceptrons, Naïve Bayes |
PS4 |
March 3 |
March 24 |
Logistic
Regression - Implementation |
PS5 |
March 31 |
April 14 |
Neural Networks,
Backpropagation, Gradient Descent - Implementation |
PS6 |
April 14 |
April 28 |
Reinforcement
Learning - Implementation |
PS7 |
April 28 |
May 5 |
SVM, Learning
Theory |
Class
Schedule
This
schedule is tentative.
|
Topic |
Assigned
Reading |
Additional
resources |
1. 1/27 |
Introduction |
|
|
2. 1/29 |
Decision
Trees, Limits of Learning |
CIML, Chapter
1, 2 |
|
3. 2/3 |
Decision
Trees cont’d, ensemble learning. |
|
|
4. 2/5 |
Geometry
Review |
Dot
product notes
TBP (To be Posted)
|
|
5. 2/10 |
Nearest Neighbor
(NN), K-NN, K-Means -Quiz |
CIML, Chapter
3 |
|
6. 2/12 |
Snow Day |
|
|
7. 2/17 |
Perceptrons, multi-variable calculus, gradient
descent. |
CIML, 4-4.4 |
|
8. 2/19 |
Perceptrons, cont’d. Practicalities
of Machine Learning |
CIML, Chapter 5
|
|
9. 2/24 |
Probability -
Quiz |
||
10. 2/26 |
Probabilistic
Models |
CIML 9-9.5 |
|
11. 3/3 |
Linear and
Logistic Regression - Quiz |
ISL 3.1, 4.3
Part II of Notes
|
|
12. 3/5 |
Logistic
Regression, cont’d. |
|
|
13. 3/10 |
Neural
Networks |
CIML 10.1, Neural Networks and Deep Learning Chapter 1 |
|
14. 3/12 |
Backpropagation
|
CIML
10.2, Neural Networks and Deep Learning Chapter 2. |
|
Spring Break |
|||
15. 3/24 |
Next Word
Prediction - Quiz |
Prediction and Entropy of
Printed English, by Shannon, Bell System Technical Journal. |
|
16. 3/26 |
Midterm |
|
|
17. 3/31 |
Going over
Midterm |
|
. |
18. 4/2 |
Transformers |
Attention is all you need, Vaswani et al., Neurips
2017. |
Formal Algorithms for Transformers
by Phuong and Hutter, Arxiv. I highly recommend this for a clear and
precise description of Transformers, but it is not for the faint of heart. |
19. 4/7 |
Reinforcement
Learning |
Reinforcement Learning,
an Introduction, Sutton and Barto Understanding
Chapters 3 and 6 is important, but reading 4 and 5 will probably help with 6. Chapter 1 is fun and quick to read. |
|
20. 4/9 |
Large
Language Models |
|
Training language models to follow instructions with human feedback LLaMA: Open and Efficient Foundation
Language Models Language
Models are Few-Shot Learners
|
21. 4/14 |
Data - Quiz |
TBP |
|
22. 4/16 |
Learning
Theory |
CIML 12 |
|
23. 4/21 |
Kernel
Methods |
CIML 11-11.3 |
|
24. 4/23 |
Support
Vector Machines |
CIML 11.4-11.6
|
|
25. 4/28 |
Kernel SVMs - Quiz |
|
|
26. 4/30 |
Bias and
Fairness |
TBP |
|
27. 5/5 |
TBD, Maybe
Image/Video Generation. - Quiz |
|
|
28. 5/7 |
TBD |
|
|
29. 5/12 |
Conclusions |
|
|
5/19 4-6pm |
Final Exam |
|