Fall 2023 Op Research - Computer Science E4529 section 001

Reinforcement Learning

Call Number	12388
Day & Time Location	F 10:10am-12:40pm 517 Hamilton Hall
Points	3
Grading Mode	Standard
Approvals Required	None
Instructor	Shipra Agrawal
Type	LECTURE
Method of Instruction	In-Person
Course Description	Markov Decision Processes (MDP) and Reinforcement Learning (RL) problems. Reinforcement Learning algorithms including Q-learning, policy gradient methods, actor-critic method. Reinforcement learning while doing exploration-exploitation dilemma, multi-armed bandit problem. Monte Carlo Tree Search methods, Distributional, Multi-agent, and Causal Reinforcement Learning.
Web Site	Vergil
Department	Industrial Engineering and Operations Research
Enrollment	58 students (60 max) as of 9:05PM Wednesday, April 1, 2026
Subject	Op Research - Computer Science
Number	E4529
Section	001
Division	School of Engineering and Applied Science: Graduate
Open To	Engineering:Undergraduate, Engineering:Graduate, GSAS
Section key	20233ORCS4529E001