CPS841/CP8309 (Winter 2017): Reinforcement Learning



The following textbook is an excellent introduction to real-life probability concepts: Data Management 12 Student Edition by Wayne Erdman, Maria Rosa Cruiscuolo, Roland Meisel, David Petro, Jacob Speijer, Wendy Telford, McGraw-Hill Education, ISBN 9781259256363. In particular, read Chapter 1 "Introduction to Probability". Do all exercises from sections 1.4 and 1.5 in Chapter 1 to make sure you have a solid understanding of mutually exclusive events, independent and dependent events, and conditional probability.

Google DeepMind has developed AlphaGo, a successful computer program that plays the game of Go.

The following technical papers describe techniques underlying the recent success of the AlphaGo program. Some of the concepts and ideas behind these techniques will be introduced in this new course.
Human-level control through deep reinforcement learning, published in Nature, vol 518 (N7540), pages 529-533, Feb 26, 2015.
Mastering the game of Go with deep neural networks and tree search, published in Nature, vol 529 (N7587), pages 484-489, Jan 28, 2016.

The White House, October 12, 2016: The Administration’s Report on the Future of Artificial Intelligence.

The White House, December 20, 2016: The Executive Office of the President's Report on Artificial Intelligence, Automation, and the Economy.

Mikhail Soutchanski