CSCI-B659 - Spring 2016
When: Tuesday and Thursday 4:00pm-5:15pm
Where: Swain West, Room 103
Office Hours (instructor)
Wednesday 2pm-4pm, or by appointment
Office Hours (AIs)
Reinforcement learning is a framework for modeling an autonomous agent's interaction with an unknown world. The agent's objective is to learn the effects of its actions and to modify its policy in order to maximize future reward. The study of reinforcement learning emphasizes a learning approach to artificial intelligence. Unlike supervised learning, the agent is not explicitly told the correct answers (labels); rather, an RL agent must learn only from reward and trial-and-error interaction with the world. This general framework has been used to optimize helicopter flight, schedule elevators, and achieve superhuman performance in many games (e.g., Backgammon, Go, and Atari). Ideas from reinforcement learning have also been used to explain learning in animals and to model dopamine activity in the human brain.
This course provides an introduction to some of the foundational ideas on which modern reinforcement learning is built, including Markov decision processes, value functions, Monte Carlo estimation, dynamic programming, temporal difference learning, eligibility traces, and function approximation. This course will develop an intuitive understanding of these concepts (taking the agent’s perspective), while also focusing on the mathematical theory of reinforcement learning. Programming assignments and projects will require implementing and testing complete decision making systems.
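To make the agent-environment framework concrete, here is a minimal sketch of the interaction loop described above, using a toy two-state environment invented purely for illustration (the states, actions, and rewards are not from the course materials):

```python
import random

# A toy two-state environment: the agent is in state 0 or 1.
# Action 1 ("move") flips the state; action 0 ("stay") keeps it.
# The agent earns reward 1 only by staying in state 1.
def step(state, action):
    next_state = 1 - state if action == 1 else state
    reward = 1.0 if (state == 1 and action == 0) else 0.0
    return next_state, reward

def run_episode(policy, horizon=10, seed=0):
    """The agent-environment loop: observe the state, act, receive a reward."""
    rng = random.Random(seed)
    state, total = 0, 0.0
    for _ in range(horizon):
        action = policy(state, rng)
        state, reward = step(state, action)
        total += reward
    return total

# The optimal policy: move to state 1, then stay forever.
optimal = lambda s, rng: 1 if s == 0 else 0
print(run_episode(optimal))  # 9.0: one step to reach state 1, then reward on every remaining step
```

The course covers methods for *learning* such a policy from reward alone, rather than hand-coding it as done here.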
The objective of this course is twofold. The first is to prepare you for conducting research in reinforcement learning. The second is to provide you with the required knowledge to apply reinforcement learning techniques to novel applications.
Topics to be covered:
- Overview of reinforcement learning: the agent environment framework, successes of reinforcement learning
- Bandit problems and online learning
- Markov decision processes
- Returns, and value functions
- Solution methods: dynamic programming
- Solution methods: Monte Carlo learning
- Solution methods: Temporal difference learning
- Eligibility traces
- Value function approximation
- Models and planning (table lookup case)
- Case studies: successful examples of RL systems
- Frontiers of RL research
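As a small taste of the bandit problems listed above, the following is an illustrative sketch (not code from the course) of an epsilon-greedy agent balancing exploration and exploitation on a k-armed bandit with Gaussian rewards:

```python
import random

def epsilon_greedy_bandit(true_means, steps=5000, epsilon=0.1, seed=0):
    """Epsilon-greedy action selection with incremental sample-average estimates."""
    rng = random.Random(seed)
    k = len(true_means)
    estimates = [0.0] * k  # estimated value of each arm
    counts = [0] * k       # number of times each arm was pulled
    total_reward = 0.0
    for _ in range(steps):
        # Explore with probability epsilon, otherwise exploit the best estimate.
        if rng.random() < epsilon:
            action = rng.randrange(k)
        else:
            action = max(range(k), key=lambda a: estimates[a])
        # Gaussian reward with unit variance around the arm's true mean.
        reward = rng.gauss(true_means[action], 1.0)
        counts[action] += 1
        # Incremental update of the sample average for the chosen arm.
        estimates[action] += (reward - estimates[action]) / counts[action]
        total_reward += reward
    return estimates, total_reward

estimates, total = epsilon_greedy_bandit([0.1, 0.5, 0.9])
print(max(range(3), key=lambda a: estimates[a]))  # the agent should identify arm 2 as best
```

The incremental update used here reappears throughout the course as the basic form of temporal difference updates.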
This course will rely on basic statistics (e.g., probability distributions and expected values) and basic linear algebra (e.g., inner products). You should be able to program in some language (e.g., Python, C).
Grading:
- 50% from five assignments
- 10% thought questions
- 10% project proposal
- 30% final project
Text and resources:
Required: Reinforcement Learning: An Introduction, by Richard S. Sutton and Andrew G. Barto
Supplemental: Algorithms for Reinforcement learning, by Csaba Szepesvari
(both freely available online)
Late Policy and Academic Honesty
Assignments may be done in groups, but you must clearly state whom you collaborated with and the nature of the collaboration. All sources used for problem solutions must be acknowledged, e.g., websites, books, research papers, personal communication with people, etc. For example:
I worked with Sally on questions 4 and 5. It was Sally's idea to use larger tile sizes in the experiment, but I coded the experiment myself.
Every student must write their own code and conduct their own experiments. No data or results sharing.
The project can be done in pairs, with no restrictions on what is shared. Each pair will submit one report, and both members will receive the same grade.
Academic honesty is taken seriously; for detailed information see the Indiana University Code of Student Rights, Responsibilities, and Conduct.