ESE 680, Fall 2019 – Schedule and Course Materials
Course Materials
There is no textbook for the course. Our discussions will be guided by papers, monographs, and lecture notes that are available online. The following incomplete list will grow:
[Recht19] A Tour of Reinforcement Learning: The View from Continuous Control, Recht 2019
[Viberg95] Subspace-based Methods for the Identification of Linear Time-invariant Systems, Viberg 1995
[MatniAndTu19] A Tutorial on Concentration Bounds for System Identification, Matni and Tu, 2019
[Rigollet] Lecture Notes on High-Dimensional Statistics, Rigollet
[DeanEtAl17] On the Sample Complexity of the Linear Quadratic Regulator, Dean, Mania, Matni, Recht, and Tu, 2017
[Wasserman] CMU Stats 705, Lecture 13: The Boostrap, Wasserman
[SarkarAndRakhlin19] Near optimal finite time identification of arbitrary linear dynamical systems, Sarkar and Rakhlin, 2019
[SimchowitzEtAl18] Learning Without Mixing: Towards A Sharp Analysis of Linear System Identification, Simchowitz, Mania, Tu, Jordan, and Recht, 2018
[OymakAndOzay19] Non-asymptotic Identification of LTI Systems from a Single Trajectory, Oymak and Ozay, 2019
[TsiamisAndPappas19] Finite Sample Analysis of Stochastic System Identification, Tsiamis and Pappas, 2019
[DFT] Feedback Control Theory, Doyle, Francis, and Tannenbaum
[Lall] Stanford Engr210a, Lecture 17: LFTs and robustness, Lall
[LessardRechtPackard16] Analysis and design of optimization algorithms via integral quadratic constraints, Lessard, Recth, and Packard, 2016
[Jonsson] Lecture Notes on Integral Quadratic Constraints, Jonsson
[LeongDoyle16] Understanding Robust Control Theory Via Stick Balancing, Leong and Doyle, 2016
[LeongDoyle17] Effects of Delays, Poles, and Zeros on Time Domain Waterbed Tradeoffs and Oscillations, Leong and Doyle, 2017
[SmithDoyle92] Model Validation: A Connection between Robust Control and System Identification, Smith and Doyle, 1992
[PoollaEtAl94] A Time-Domain Approach to Model Validation, Poolla, Khargonekar, Tikku, Krause, and Nagpal, 1994
[Prajna05] Barrier Certificates for nonlinear model validation
[AndersonEtAl19] System Level Synthesis, Anderson, Doyle, Low, and Matni, 2019
[MatniEtAl19] From self-tuning regulators to reinforcement learning and back again, Matni, Proutiere, Rantzer, and Tu, 2019
[DannLattimoreBrunskill17] Unifying PAC and Regret: Uniform PAC Bounds for
Episodic Reinforcement Learning, Dann, Lattimore, and Brunskill, 2017
[AbbasiYadkoriSzepesvari11] Regret Bounds for the Adaptive Control of Linear Quadratic Systems, Abbasi-Yadkori and Szepesari, 2011
[AbeilleLazaric18] Improved Regret Bounds for Thompson Sampling in Linear Quadratic Control Problems, Abeille and Lazaric, 2018
[BousquetElisseeff02] Stability and Generalization
[HardtRechtSinger16] Train faster, generalize better: Stability of stochastic gradient descent, Hardt, Recht, and Singer, 2016
[BartlettMendelson02] Rademacher and Gaussian Complexities: Risk Bounds and Structural Results
[SrebroSridharanTewari10] Smoothness, Low Noise, and Fast Rates, Srebro, Sridharan, and Tewari, 2010
[SuttonBarto] Reinforcement Learning Sutton and Barto, 2017
[BradtkeYdstieBarto94] Adaptive linear quadratic control using policy iteration, Bradtke, Ydstie, and Barto, 1994
[DuEtAl2019] Is a Good Representation Sufficient for Sample Efficient Reinforcement Learning?, Du, Kakade, Wang, and Yang, 2019
[FazelGeKakadeMesbahi2019] Global Convergence of Policy Gradient
Methods for the Linear Quadratic Regulator Fazel, Ge, Kakade, and Mesbahi, 2019
[TuRecht2018] The Gap Between Model-Based and Model-Free Methods on the Linear Quadratic Regulator: An Asymptotic Viewpoint, Tu and Recht, 2018
[AmesEtAl19] Control Barrier Functions: Theory and Applications, Ames, Coogan, Egerstedt, Notomista, Sreenath, and Tabuada, 2019
[BerkenkampEtAl17] Safe Model-based Reinforcement Learning with Stability Guarantees, Berkenkamp, Turchetta, Schoellig, and Krause, 2017
[FazlyabRobeyMorariPappas19] Efficient and Accurate Estimation of Lipschitz Constants for Deep Neural Networks Fazlyab, Robey, Hassani, Morari, and Pappas, 2019
[FazlyabMorariPappas19a] Probabilistic Verification and Reachability Analysis of Neural Networks via Semidefinite Programming, Fazlyab, Morari, and Pappas, 2019
[FazlyabMorariPappas19b] Safety Verification and Robustness Analysis of Neural Networks via Quadratic Constraints and Semidefinite Programming Fazlyab, Morari, and Pappas, 2019
Additional Resources
[Ljung] System Identification: Theory for the User, Ljung (survey paper)
[Vershynin] High-Dimensional Probability, Verhsynin
Sanjay Lall's Engr201a Robust Control course at Stanford
[ZhouDoyleGlover] Robust and Optimal Control, Zhou, Doyle, and Glover
[ZhouDoyle] Essentials of Robust Control, Zhou and Doyle
[DullerudPaganini] A course in robust control: a convex approach, Dullerud and Paganini
[RussoEtAl17] A Tutorial on Thompson Sampling, Russo et al, 2017
[Ioannou] Robust Adaptive Control, Ioannou, 1995
[ShalevSchwartzAndBenDavid] Understanding Machine Learning: from Theory to Algorithms
[BousquetEtAl] Introduction to Statistical Learning Theory
[CuckerSmale01] On the Mathematical Foundations of Learning
MIT's 9.520 Statistical Learning Theory and Applications
[Bertsekas] Reinforcement Learning and Optimal Control
[BersekasTsitsiklis] Neuro-dynamic Programming
[KrauthTuRecht2019] Finite-time Analysis of Approximate Policy Iteration for the Linear Quadratic Regulator, Krauth, Tu, and Recth, 2019
[GaussianProcesses] Gaussian Processes for Machine Learning, Rasmussen and Williams, 2006.
Schedule (subject to change)
Logistics
System Identification
Control of Uncertain Systems
Additional Resources:
Sanjay Lall's Engr201a course at Stanford
[ZhouDoyleGlover] Robust and Optimal Control, Zhou, Doyle, and Glover
[ZhouDoyle] Essentials of Robust Control, Zhou and Doyle
Model-based control of learned systems
Lecture 12: 0ct 03
Model-based control of learned systems 2: PAC, regret, and beyond, and what do we know about learning to control the linear quadratic regulator
Learning Theory
Model Free Methods
Safe Learning and Control
Final Project Presentations
|