Intelligent control through learning and optimization

AMATH / CSE 579

Winter 2012: SLN 20029, MW 3:30-4:50, MGH 284




Emo Todorov

Office: Guggenheim 415h
Email: todorov@cs.washington.edu


Course Description

Design of near-optimal controllers for complex dynamical systems, using analytical techniques, machine learning, and optimization. Topics from deterministic and stochastic optimal control, reinforcement learning and dynamic programming, numerical optimization in the context of control, and robotics. Prerequisite: vector calculus, linear algebra, and Matlab. Recommended: differential equations, stochastic processes, and optimization.

Lecture slides

Lecture 1: Introduction

Lecture 2: Markov Decision Processes and Bellman Equations

Lecture 3: Controlled Diffusions and Hamilton-Jacobi-Bellman Equations

Lecture 4: Linear-Quadratic-Gaussian (LQG) Controllers and Kalman Filters

Lecture 5: Pontryagin's Maximum Principle

Lecture 6: Trajectory-based Optimization

Lecture 7: Linearly-Solvable Stochastic Optimal Control Problems

Homework

Homework 1, due Feb 3

Homework 2, due Feb 29

Default Final Project

Project description, due Mar 16

Code

MDP solver: all problem formulations and algorithms

Acrobot dynamics

Animation of acrobot dynamics

Lecture slides from 2010

Inverse Optimal Control

Applications to biological movement

General Readings

A. Barto and R. Sutton (1998) Reinforcement learning: An introduction (online book)

E. Todorov (2006) Optimal control theory (book chapter)

D. Bertsekas (2008) Dynamic programming (lecture slides)

R. Tedrake (2009) Underactuated robotics: Learning, planning and control (lecture notes)

B. Van Roy (2004) Approximate dynamic programming (lecture notes)

P. Abbeel (2009) Advanced robotics (lecture slides)

Lecture-specific Readings

Lecture 6:
Todorov and Li (2005) A generalized iterative LQG method for locally-optimal feedback control of constrained nonlinear stochastic systems. In proceedings of ACC

Lecture 7:
Todorov (2009) Efficient computation of optimal actions. PNAS 106: 11478-11483

Krishnamurthy and Todorov (2010) Inverse optimal control with linearly-solvable MDPs. In proceedings of ICML

Muico, Lee, Popovic and Popovic (2009) Contact-aware nonlinear control of dynamics characters. In proceedings of SIGGRAPH

Wampler and Popovic (2009) Optimal gait and form for animal locomotion. In proceedings of SIGGRAPH

Bertsekas (2010) Approximate dynamic programming. In Dynamic Programming and Optimal Control, vol 2, 3rd ed

Todorov (2004) Optimality principles in sensorimotor control. Nature Neuroscience