CMSC5728: Decision Analysis and Game Theory
General Expectations:
Student/Faculty Expectations on Teaching and Learning
Teacher:
Prof. John C.S. Lui
This is a graduate level course which covers theory on decision science.
There are several main topics I plan to cover, they are:
(a) Multi-armed bandit theory;
(b) Game theory;
(c) Reinforcement learning theory.
I like to emphasize that course
is mathematical and algorithmic in nature.
I will introduce a lot of concepts, show the mathematical proves,
and present the physical meanings and applications.
Students are expected to follow and understand my lecture,
and also do a lot of readings and do some programming (via Python).
Important reminder:
Students
are expected to attend the lecutre,
read the leture notes and understand them,
spend time to read resources on the Internet,
do the homework,
do the programming assignments,..etc,
so to keep pace with this course.
Teaching Assistants
Reference:
Course Grades:
- Written Homework and/or Programming Assignment: 50%
- Exam: 50%
Important note:
Students need to get at least 30% in the final exam to pass, independent
of their performance in programming exercises.
IMPORTANT REMINDERS !!!!!!
- Final Examination will be on December 6th, 2022. 7:00 pm till 9:00 pm.
Venue will be YIA-502 .
Also note that there will be NO MAKE-UP EXAM. So make sure
to attend the final examination.
Policies:
- No late homework, programming assignments or projects will be accepted;
Outline for the course:
(Note: I usually prepare more materials
than we can cover in a semester. I will leave those materials I can't
cover to students as a self-learning tool.)
- Introduction to topics on decision science
- Introduction to Game theory
- Two-player game zero-sum games
- Dominance stratey
- Saddle point
- Mixed strategy
- Minimax theorem
- Two-player game non-zero sum games
- Concept of equilibrium (Nash Equilibrium)
- Cournot Model of Duopoly
- Dynamic Games
- Kuhn's Theorem
- Concept of Subgame
- Subgame Perfect Nash Equilibrium
- Games with continuous strategy space
- Stackelberg Games
- Introduciton to Coalition and Cooperative Games
- Auctions
- Mechanism Design
- Stochastic multi-armed bandit (MAB)
- UCB algorithms and regret bound
- Thompson Sampling and its application to MAB
- Adersarial Bandits
- Linear Bandits
- Contextual Bandit
- MAB application: Dynamic Pricing, networking, crowdsourcing and multi-path protocols
- Markov Decision Process
- ...etc
Lecture Notes
(Lecture Notes are available at CUHK Blackboard (https://blackboard.cuhk.edu.hk/))
-
Please refer to the CUHK Blackboard
Written homework and programming assignment
-
Please go to the "Blackboard" to access the specification.