ESE 680, Fall 2019 – Schedule and Course Materials

Course Materials

There is no textbook for the course. Our discussions will be guided by papers, monographs, and lecture notes that are available online. The following incomplete list will grow:

Additional Resources

Schedule (subject to change)

Logistics

System Identification

  • Lecture 2: Aug 29

    • System identification 1: Identification of Linear-Time-Invariant systems

  • Lecture 5: Sep 10

    • System identification 4: student led discussion, full state single trajectory case

  • Lecture 6: Sep 12

    • System identification 5: student led discussion, partial state single trajectory case

Control of Uncertain Systems

  • Lecture 7: Sep 17

    • Control of Uncertain Systems 1: introduction to optimal/robust control, modeling uncertainty, small gain theorem

  • Lecture 8: Sep 19

    • Control of Uncertain Systems 2: small gain theorem, a very brief introduction to the structured singular value (mu) and integral quadratic constraints (IQCs)

  • Lecture 9: Sep 24

    • Control of Uncertain Systems 3: student led presentation on fundamental limits of robust control

  • Lecture 10: Sep 26

    • Control of Uncertain Systems 4: student led presentation on model (in)validation from a robust control perspective

  • Additional Resources:

    • Sanjay Lall's Engr201a course at Stanford

    • [ZhouDoyleGlover] Robust and Optimal Control, Zhou, Doyle, and Glover

    • [ZhouDoyle] Essentials of Robust Control, Zhou and Doyle

Model-based control of learned systems

  • Lecture 13: Oct 08

    • Model-based control of learned systems 3: student led presentation on Optimism in the Face of Uncertainty (OFU) for LQR

    • Paper: AbbasiYadkoriSzepesvari11

    • Presenter: Shaoru Chen

  • Lecture 14: Oct 15

    • Model-based control of learned systems 4: student led presentation on Thompson Sampling for LQR

    • Paper: AbeilleLazaric18

    • Presenter: Rebecca Li

Learning Theory

  • Lecture 15: Oct 17

    • Learning theory 1: Empirical Risk Minimization and Uniform Convergence

      • Reading: Chapters 2-4 ShalevSchwartzAndBenDavid,

      • Methods/Algorithms: ERM, uniform convergence for bounded loss functions and finite hypothesis classes, and bounded and Lipschitz loss functions and compact hypothesis classes.

      • Scribe: Shuo Li, lecture notes

  • Lecture 16: Oct 22

    • Learning theory 2: Algorithmic Stability and Stochastic Gradient Descent

  • Lecture 17: Oct 24

    • Learning theory 3: student led presentation on Rademacher and Gaussian Complexities for Risk Bounds

  • Lecture 18: Oct 29

    • Learning theory 4: student led presentation on Smoothness, Low Noise, and Fast Rates

Model Free Methods

  • Lecture 20: Nov 05

    • Model free methods 2: student led presentation on if good representations are sufficient for efficient reinforcement living.

  • Lecture 22: Nov 12

    • Model free methods 4: student led presentation on the gap between model-free and model-based methods for LQR

Safe Learning and Control

  • Lecture 25: Nov 21

    • Safe learning and control 3: student led presentation on control barrier functions

  • Lecture 26: Nov 26

    • Safe learning and control 4: student led presentation on safe reinforcement learning with stability guarantees

Final Project Presentations

  • Lecture 27: Dec 03

    • Final project presentations 1

  • Lecture 28: Dec 05

    • Final project presentations 2