Fall 2024 Op Research - Computer Science E4529 section 001

Reinforcement Learning

Call Number	14542
Day & Time Location	MW 1:10pm-2:25pm 330 Uris Hall
Points	3
Grading Mode	Standard
Approvals Required	None
Instructor	Shipra Agrawal
Type	LECTURE
Method of Instruction	In-Person
Course Description	Markov Decision Processes (MDP) and Reinforcement Learning (RL) problems. Reinforcement Learning algorithms including Q-learning, policy gradient methods, actor-critic method. Reinforcement learning while doing exploration-exploitation dilemma, multi-armed bandit problem. Monte Carlo Tree Search methods, Distributional, Multi-agent, and Causal Reinforcement Learning.
Web Site	Vergil
Department	Industrial Engineering and Operations Research
Enrollment	59 students (60 max) as of 11:44PM Monday, June 16, 2025
Subject	Op Research - Computer Science
Number	E4529
Section	001
Division	School of Engineering and Applied Science: Graduate
Open To	Business, Engineering:Undergraduate, Engineering:Graduate
Section key	20243ORCS4529E001