Data-Enabled Predictive Control : In the Shallows of the DeePC

Size: px
Start display at page:

Download "Data-Enabled Predictive Control : In the Shallows of the DeePC"

Transcription

1 Data-Enabled Predictive Control : In the Shallows of the DeePC Florian Do rfler Automatic Control Laboratory, ETH Zu rich

2 Acknowledgements Linbin Huang Jeremy Coulson Brain-storming: P. Mohajerin Esfahani, B. Recht, R. Smith, B. Bamieh, and M. Morari John Lygeros 1/34

3 Big, deep, intelligent and so on unprecedented availability of computation, storage, and data theoretical advances in optimization, statistics, and machine learning... and big-data frenzy increasing importance of data-centric methods in all of science / engineering Make up your own opinion, but machine learning works too well to be ignored. 2/34

4 Feedback our central paradigm actuation making a difference to the world physical world sensing making sense of the world automation and control information technology inference and data science 3/34

5 Control in a data-rich world ever-growing trend in CS and robotics: data-driven control by-passing models canonical problem: black/gray-box system control based on I/O samples u 2 y 2 Q: Why give up physical modeling and reliable model-based algorithms? Data-driven control is viable alternative when models are too complex to be useful (e.g., fluid dynamics & building automation) first-principle models are not conceivable (e.g., human-in-the-loop & perception) modeling & system ID is too cumbersome (e.g., robotics & power applications) u 1 y 1 data-driven control Central promise: It is often easier to learn control policies directly from data, rather than learning a model. Example: PID 4/34

6 Snippets from the literature 1. reinforcement learning / or stochastic adaptive control / or approximate dynamic programming with key mathematical challenges (approximate/neuro) DP to learn approx. value/q-function or optimal policy (stochastic) function approximation exploration-exploitation trade-offs and practical limitations inefficiency: computation & samples complex and fragile algorithms safe real-time exploration ø suitable for physical control systems with real-time & safety constraints? action unknown system reinforcement learning control estimate reward observation 5/34

7 Snippets from the literature cont d 2. gray-box safe learning & control robust conservative & complex control adaptive hard & asymptotic performance y? contemporary learning algorithms (e.g., MPC + Gaussian processes / RL) non-conservative, optimal, & safe ø limited applicability: need a-priori safety u robust/adaptive control 3. Sequential system ID + control ID with uncertainty quantification followed by robust control design recent finite-sample & end-to-end ID + control pipelines out-performing RL ø ID seeks best but not most useful model ø easier to learn policies than models u 2 y 2 u 1 y 1 +? 6/34

8 Key take-aways claim: easier to learn controllers from data rather than models data-driven approach is no silver bullet (see previous ø) predictive models are preferable over data (even approximate) models are tidied-up, compressed, & de-noised representations model-based methods vastly out-perform model-agnostic ones ø deadlock? a useful ML insight: non-parametric methods are often preferable over parametric ones (e.g., basis functions vs. kernels) build a predictive & non-parametric model directly from raw data? 7/34

9 Colorful idea u 1 = 1 u 2 = u 3 = = 0 x 0 =0 y 1 y 3 y 5 y 2 y 4 y 6 y 7 If you had the impulse response of a LTI system, then... can build state-space system identification (Kalman-Ho realization)... but can also build predictive model directly from raw data : u future (t) y future (t) = [ y 1 y 2 y 3... ] u future (t 1) u future (t 2). model predictive control from data: dynamic matrix control (DMC) today: can we do so with arbitrary, finite, and corrupted I/O samples? 8/34

10 Contents I. Data-Enabled Predictive Control (DeePC): Basic Idea J. Coulson, J. Lygeros, and F. Dörfler. Data-Enabled Predictive Control: In the Shallows of the DeePC. arxiv.org/abs/ II. From Heuristics & Numerical Promises to Theorems J. Coulson, J. Lygeros, and F. Dörfler. Regularized and Distributionally Robust Data-Enabled Predictive Control. arxiv.org/abs/ III. Application: End-to-End Automation in Energy Systems L. Huang, J. Coulson, J. Lygeros, and F. Dörfler. Data-Enabled Predictive Control for Grid-Connected Power Converters. arxiv.org/abs/

11 Preview complex 2-area power system: large (n 10 2 ), nonlinear, noisy, stiff, & with input constraints control objective: damping of inter-area oscillations via HVDC but without model control ( no control ) collect data control seek method that works reliably, can be efficiently implemented, & certifiable automating ourselves 9/34

12 Behavioral view on LTI systems Definition: A discrete-time dynamical system is a 3-tuple (Z 0, W, B) where (i) Z 0 is the discrete-time axis, (ii) W is a signal space, and (iii) B W Z 0 is the behavior. Definition: The dynamical system (Z 0, W, B) is (i) linear if W is a vector space & B is a subspace of W Z 0, (ii) time-invariant if B σb, where σw t = w t+1, and y (iii) complete if B is closed W is finite dimensional. In the remainder we focus on discrete-time LTI systems. u 10/34

13 Behavioral view cont d B = set of trajectories in W Z 0 & BT is restriction to t [0, T ] A system (Z 0, W, B) is controllable if any two trajectories w 1, w 2 B can be patched with a trajectory w B T. w w 2 w T I/O : B = B u B y where B u = (R m ) Z 0 and B y (R p ) Z 0 are the spaces of input and output signals w = col(u, y) B different parametric representations: state space, kernel, image,... kernel representation (ARMA) : B = col(u, y) (R m+p ) Z 0 s.t. b 0 u + b 1 σu + + b n σ n u + a 0 y + a 1 σy +... a n σ n y = 0 11/34

14 LTI systems and matrix time series foundation of state-space subspace system ID & signal recovery algorithms u(t) u 1 u 3 u 2 u 4 u 5 u 6 u 7 t y(t) y 4 y 1 y 3 y 5 y 2 y 6 y 7 t ( ) u(t), y(t) satisfy recursive difference equation b 0 u t +b 1 u t b n u t+n + a 0 y t +a 1 y t a n y t+n = 0 (ARMA / kernel representation) under assumptions [ b 0 a 0 b 1 a 1... b n a n ] spans left nullspace of Hankel matrix (collected from data) ( u1 y 1 ) ( u2 y 2 ) ( u3 y 3 ) (u T L+1 ) y T L+1 ( u2 y 2 ) ( u3 y 3 ) ( u4 y 4 ). H L ( u y ) = ( u3 y 3 ) ( u4 y 4 ) ( u5 y 5 ) ( u L yl ) ( u T y T ) 12/34

15 The Fundamental Lemma Definition : The signal u = col(u 1,..., u T ) R mt is persistently exciting of order L if H L (u) = u 1 u T L u L u T is of full row rank, i.e., if the signal is sufficiently rich and long (T L + 1 ml). Fundamental lemma [Willems et al, 05] : Let T, t Z >0, Consider a controllable LTI system (Z 0, R m+p, B), and a T -sample long trajectory col(u d, y d ) B T, where u is persistently exciting of order t + n (prediction span + # states). Then colspan (H t ( u y )) = B t. 13/34

16 Cartoon of Fundamental Lemma u(t) y(t) y 3 u 4 y u 1 u 3 u 7 y 4 1 y 5 y 7 u 5 u 2 u 6 t y 2 t y 6 persistently exciting controllable LTI sufficiently many samples ( u1 y 1 ) ( u2 y 2 ) ( u3 y 3 )... x k+1 =Ax k + Bu k ( u2 y 2 ) ( u3 y 3 ) ( u4 y 4 )... colspan y k =Cx k + Du ( u3 y 3 ) ( u4 y 4 ) ( u5 y 5 )... k }{{}}{{} parametric state-space model non-parametric model from raw data all trajectories constructible from finitely many previous trajectories 14/34

17 Data-driven simulation [Markovsky & Rapisarda 08] Problem : predict future output y R p T future based on input signal u R m T future to predict forward past data col(u d, y d ) B Tdata to form Hankel matrix Assume: B controllable & u d persistently exciting of order T future + n Solution: given (u 1,..., u Tfuture ) compute g & (y 1,..., y Tfuture ) from u d 1 u d 2 u d T N+1 u u d T future u d T future +1 u d. T u Tfuture g = y1 d y2 d yt d N+1 y yt d future yt d future +1 yt d y Tfuture Issue: predicted output is not unique need to set initial conditions! 15/34

18 Refined problem : predict future output y R p T future based on initial trajectory col(u ini, y ini ) R (m+p)t ini input signal u R m T future past data col(u d, y d ) B Tdata to estimate initial x ini to predict forward to form Hankel matrix Assume: B controllable & u d persist. exciting of order T ini +T future +n Solution: given (u 1,..., u Tfuture ) & col(u ini, y ini ) compute g & (y 1,..., y Tfuture ) from if T ini lag of system, then y is unique U p Y p U f g = Y f u ini y ini u y u d 1 u d T T future T ini [ ] Up u d T u d U ini T T future f u d T ini +1 u d T T future u d T ini +T u d future T y1 d yt d T future T ini [ ] Yp yt d y d Y ini T T future f yt d ini +1 yt d T future yt d ini +T y d future T 16/34

19 Output Model Predictive Control The canonical receding-horizon MPC optimization problem : minimize u, x, y T future 1 k=0 y k r t+k 2 Q + u k 2 R subject to x k+1 = Ax k + Bu k, k {0,..., T future 1}, y k = Cx k + Du k, k {0,..., T future 1}, x k+1 = Ax k + Bu k, k { T ini 1,..., 1}, y k = Cx k + Du k, k { T ini 1,..., 1}, u k U, k {0,..., T future 1}, y k Y, k {0,..., T future 1} quadratic cost with R 0, Q 0 & ref. r model for prediction over k [0, T future 1] model for estimation (many variations) hard operational or safety constraints For a deterministic LTI plant and an exact model of the plant, MPC is the gold standard of control : safe, optimal, tracking,... 17/34

20 Data-Enabled Predictive Control DeePC uses non-parametric and data-based Hankel matrix time series as prediction/estimation model inside MPC optimization problem: minimize g, u, y subject to T future 1 k=0 U p Y p U f Y f y k r t+k 2 Q + u k 2 R g = u ini y ini u y, u k U, k {0,..., T future 1}, y k Y, k {0,..., T future 1} quadratic cost with R 0, Q 0 & ref. r non-parametric model for prediction and estimation hard operational or safety constraints Hankel matrix with T ini + T future rows from past data [ ] [ ] Up = H U Tini +T f future (u d Yp ) and = H Y Tini +T f future (y d ) collected offline (could be adapted online) past T ini lag samples (u ini, y ini ) for x ini estimation updated online 18/34

21 Correctness for LTI Systems Theorem: Consider a controllable LTI system and the DeePC & MPC optimization problems with persistently exciting data of order T ini +T future +n. Then the feasible sets of DeePC & MPC coincide. Corollary: If U, Y are convex, then also the trajectories coincide. Aerial robotics case study : 19/34

22 Thus, MPC carries over to DeePC... at least in the nominal case. Beyond LTI, what about measurement noise, corrupted past data, and nonlinearities?

23 Noisy real-time measurements minimize g, u, y subject to T future 1 k=0 U p Y p U f Y f y k r t+k 2 Q + u k 2 R + λ y σ y 1 g = u ini y ini u y 0 + σ y 0, 0 u k U, k {0,..., T future 1}, y k Y, k {0,..., T future 1} Solution : add slack to ensure feasibility with l 1 -penalty for λ y sufficiently large σ y 0 only if constraint infeasible c.f. sensitivity analysis over randomized sims cost average cost average constraint violations duration violations (s)20 20/34

24 Hankel matrix corrupted by noise minimize g, u, y subject to T future 1 k=0 U p Y p U f Y f y k r t+k 2 Q + u k 2 R + λ g g 1 g = u ini y ini u y, u k U, k {0,..., T future 1}, y k Y, k {0,..., T future 1} Solution : add a l 1 -penalty on g intuition: l 1 sparsely selects {Hankel matrix columns} = {past trajectories} = {motion primitives} c.f. sensitivity analysis over randomized sims cost average cost average constraint violations duration violations (s)20 21/34

25 Towards nonlinear systems... Idea : lift nonlinear system to large/ -dimensional bi-/linear system Carleman, Volterra, Fliess, Koopman, Sturm-Liouville methods exploit size rather than nonlinearity and find features in data exploit size, collect more data, & build a larger Hankel matrix regularization singles out relevant features / basis functions DeePC 3 2 x DeePC ydeepc 2 zdeepc x ref 1 yref 1 zref 0.5 Constraints m case study : regularization for g and σy s 22/34

26 recall the central promise : it is easier to learn control policies directly from data, rather than learning a model

27 Comparison to system ID + MPC Setup : nonlinear stochastic quadcopter model with full state info DeePC + l 1 -regularization for g and σ y MPC : system ID via prediction error method + nominal MPC 3 DeePC 5 MPC 2 m x DeePC y DeePC z DeePC 4 x ref y ref z ref Constraints single fig-8 run m x MPC y MPC z MPC x ref y ref z ref Constraints s s Number of simulations Cost DeePC System ID + MPC random sims Number of simulations Constraint Violations DeePC System ID + MPC Cost Duration constraints violated 23/34

28 from heuristics & numerical promises to theorems

29 Robust problem formulation 1. the nominal problem (without g-regularization) minimize g, u, y subject to T future 1 k=0 Û p Ŷ p Û f g = Ŷ f y k r t+k 2 Q + u k 2 R + λ y σ y 1 u ini ŷ ini u y 0 + σ y 0, 0 u k U, k {0,..., T future 1} where denotes measured & thus possibly corrupted data ) 2. an abstraction of this problem (Ûf minimize g G f g, Ŷfg 1 + λ y Ŷ p g ŷ ini { } where G = g : Û p g = u ini & Ûfg U 24/34

30 ) 3. a further abstraction minimize ( ξ, g G c g = minimize g G E P [c (ξ, g)] { } with G = g : Û p g = u ini & Ûfg U, measured ξ ) = (Ŷp, Ŷf, ŷ ini, & P = δ ξ denotes the empirical distribution from which we obtained ξ 4. the solution g of the above problem gives poor out-of-sample performance for the problem we really want to solve: E P [c (ξ, g )] where P is the unknown probability distribution of ξ 5. distributionally robust formulation inf g G sup E Q [c (ξ, g)] Q B ɛ( P ) where the ambiguity set B ɛ ( P ) is an ɛ-wasserstein ball centered at P : { } B ɛ ( P ) = P : inf ξ ξ W dπ ɛ where Π has marginals ˆP and P Π 25/34

31 5. distributionally robust formulation inf g G sup E Q [c (ξ, g)] Q B ɛ( P ) where the ambiguity set B ɛ ( P ) is an ɛ-wasserstein ball centered at P : { } B ɛ ( P ) = P : inf ξ ξ W dπ ɛ where Π has marginals ˆP and P Π Theorem : Under minor technical conditions: inf g G ) sup E Q [c (ξ, g)] min ( ξ, c g + ɛλ y g Q B ɛ( P W g G ) Cor: l -robustness in trajectory space l 1 -regularization of DeePC Proof uses methods by Kuhn & Esfahani: semi-infinite problem becomes finite after marginalization & for discrete worst case Cost cost ǫ 26/34

32 Relation to system ID & MPC 1. regularized DeePC problem minimize g, u U, y Y f(u, y) + λ g g 2 2 subject to U p Y p U f Y f g = u ini y ini u y 2. standard model-based MPC (ARMA parameterization) minimize u U, y Y subject to f(u, y) u ini y = K y ini u 3. subspace ID y = Y f g where g = g (u ini, y ini, u) solves arg min g g 2 2 U p u ini subject to Y p g = y ini U f u 4. equivalent prediction error ID d u inij minimize K j yd j K d y inij u d j u ini y = K y ini = Y f g u 2 27/34

33 subsequent ID & MPC minimize u U, y Y subject to f(u, y) u ini y = K y ini u where K solves u inij arg min K j y j K y inij u j regularized DeePC minimize g, u U, y Y f(u, y) + λ g g 2 2 subject to U p Y p U f g = Y f 2 u ini y ini u y minimize u U, y Y subject to f(u, y) [ ] y = u where g solves [ Yf U f ] g arg min g 2 2 g subject to U p Y p g = u ini y ini U f u feasible set of ID & MPC feasible set for DeePC DeePC MPC + λ g ID easier to learn control policies from data rather than models 28/34

34 application: end-to-end automation in energy systems

35 Grid-connected converter control Task: control converter (nonlinear, noisy & constrained) without a model of the grid, line, passives, or inner loops '"$ '"# '"! DeePC!"& open!"% loop!"!!"#!"$!"%!"& '"! '"# '"$ Three-Phase VSC * Uabc Iabc LF LCL Vabc Igabc CF Lg Line Rg Uabc AC Grid Power Part!"#!"'!"! DeePC dq abc Current Control Loop θ ω PLL PI Vq )!"' open )!"# loop!"!!"#!"$!"%!"& '"! '"# '"$ * Ud * Uq PI PI Id! "! " ref Id ref Iq Vd Vq Id Iq abc dq Vabc Iabc #"! '"( '"! DeePC Iq u1 u2 y1 y2 y3 Control Part!"( open!"! loop!"!!"#!"$!"%!"& '"! '"# '"$ DeePC tracking constant dq-frame references subject to constraints open loop inject noise collect data activate DeePC 29/34

36 Effect of regularizations DeePC time-domain cost!"#$%$&'#$()*+(,# DeePC Sys ID + MPC λ g DeePC time-domain cost = k y k r k 2 Q + u k 2 R (closed-loop measurements)! " Optimization cost = k y k r k 2 Q + u k 2 R +λ g g 2 (closed-loop measurements) 30/34

37 DeePC time-domain cost PEM-MPC time-domain c Data length T ini = 40, T future = 30 Sys ID + MPC T ini '"$ '"# '"!!"&!"%!"!!"#!"$!"%!"& '"!!"#!"'!"! (!"' (!"#!"!!"#!"$!"%!"& '"! works like a charm for T large, but card(g) = T T ini T future + 1 (possibly?) prohibitive on µdsp T #"! '") '"!!")!"!!"!!"#!"$!"%!"& '"! 31/34

38 Power system case study extrapolation from previous case study: const. voltage grid control complex 2-area power system: large (n 10 2 ), nonlinear, noisy, stiff, & with input constraints ( no control ) control objective: damping of inter-area oscillations via HVDC real-time closed-loop MPC & DeePC become prohibitive (on laptop) choose T, T ini, and T future wisely 32/34

39 time-domain cost PEM-MPC time-domain cost Choice of time constants!')(!'(!!'"( T ini = 200, T future = 80! " # $ % &! &" &# choose T sufficiently large short horizon T future 10 T ini 10 estimates sufficiently rich model complexity!')(!'(!!'"( T ini = 10, T future = 10! " # $ % &! &" &#!')(!'(!!'"( T ini = 5, T future = 10! " # $ % &! &" &# T ini time-domain cost = k y k r k 2 Q + u k 2 R (closed-loop measurements) 33/34

40 Summary & conclusions fundamental lemma from behavioral systems matrix time series serves as predictive model data-enabled predictive control (DeePC) X certificates for deterministic LTI systems X distributional robustness via regularizations -1 * Uabc X outperforms ID + MPC in optimization metric I abc Vabc LF Three-Phase VSC I gabc adaptive extensions, explicit policies,... applications to building automation, bio, etc. Why have these powerful ideas not been mixed long before? Uabc Rg AC Grid LCL Power Part θ abc certificates for nonlinear & stochastic setup Line Lg CF -0.2 dq ω Id Ud* PI! " PI "! Vq PLL Current Control Loop Uq* PI Vd Idref abc Vabc Vq Iqref Id Iq Iq I abc dq Control Part u1 u2 y1 y2 y3 Willems 07: [MPC] has perhaps too little system theory and too much brute force computation in it. The other side often proclaims behavioral systems theory is beautiful but did not prove utterly useful 34/34

ELEC system identification workshop. Subspace methods

ELEC system identification workshop. Subspace methods 1 / 33 ELEC system identification workshop Subspace methods Ivan Markovsky 2 / 33 Plan 1. Behavioral approach 2. Subspace methods 3. Optimization methods 3 / 33 Outline Exact modeling Algorithms 4 / 33

More information

Data-driven signal processing

Data-driven signal processing 1 / 35 Data-driven signal processing Ivan Markovsky 2 / 35 Modern signal processing is model-based 1. system identification prior information model structure 2. model-based design identification data parameter

More information

EE C128 / ME C134 Feedback Control Systems

EE C128 / ME C134 Feedback Control Systems EE C128 / ME C134 Feedback Control Systems Lecture Additional Material Introduction to Model Predictive Control Maximilian Balandat Department of Electrical Engineering & Computer Science University of

More information

Sparsity in system identification and data-driven control

Sparsity in system identification and data-driven control 1 / 40 Sparsity in system identification and data-driven control Ivan Markovsky This signal is not sparse in the "time domain" 2 / 40 But it is sparse in the "frequency domain" (it is weighted sum of six

More information

Subspace Identification

Subspace Identification Chapter 10 Subspace Identification Given observations of m 1 input signals, and p 1 signals resulting from those when fed into a dynamical system under study, can we estimate the internal dynamics regulating

More information

Learning Model Predictive Control for Iterative Tasks: A Computationally Efficient Approach for Linear System

Learning Model Predictive Control for Iterative Tasks: A Computationally Efficient Approach for Linear System Learning Model Predictive Control for Iterative Tasks: A Computationally Efficient Approach for Linear System Ugo Rosolia Francesco Borrelli University of California at Berkeley, Berkeley, CA 94701, USA

More information

Overview of the Seminar Topic

Overview of the Seminar Topic Overview of the Seminar Topic Simo Särkkä Laboratory of Computational Engineering Helsinki University of Technology September 17, 2007 Contents 1 What is Control Theory? 2 History

More information

Robotics. Control Theory. Marc Toussaint U Stuttgart

Robotics. Control Theory. Marc Toussaint U Stuttgart Robotics Control Theory Topics in control theory, optimal control, HJB equation, infinite horizon case, Linear-Quadratic optimal control, Riccati equations (differential, algebraic, discrete-time), controllability,

More information

Two well known examples. Applications of structured low-rank approximation. Approximate realisation = Model reduction. System realisation

Two well known examples. Applications of structured low-rank approximation. Approximate realisation = Model reduction. System realisation Two well known examples Applications of structured low-rank approximation Ivan Markovsky System realisation Discrete deconvolution School of Electronics and Computer Science University of Southampton The

More information

A Globally Stabilizing Receding Horizon Controller for Neutrally Stable Linear Systems with Input Constraints 1

A Globally Stabilizing Receding Horizon Controller for Neutrally Stable Linear Systems with Input Constraints 1 A Globally Stabilizing Receding Horizon Controller for Neutrally Stable Linear Systems with Input Constraints 1 Ali Jadbabaie, Claudio De Persis, and Tae-Woong Yoon 2 Department of Electrical Engineering

More information

Stochastic Tube MPC with State Estimation

Stochastic Tube MPC with State Estimation Proceedings of the 19th International Symposium on Mathematical Theory of Networks and Systems MTNS 2010 5 9 July, 2010 Budapest, Hungary Stochastic Tube MPC with State Estimation Mark Cannon, Qifeng Cheng,

More information

Identification and Control of Mechatronic Systems

Identification and Control of Mechatronic Systems Identification and Control of Mechatronic Systems Philadelphia University, Jordan NATO - ASI Advanced All-Terrain Autonomous Systems Workshop August 15 24, 2010 Cesme-Izmir, Turkey Overview Mechatronics

More information

Partially Observable Markov Decision Processes (POMDPs)

Partially Observable Markov Decision Processes (POMDPs) Partially Observable Markov Decision Processes (POMDPs) Sachin Patil Guest Lecture: CS287 Advanced Robotics Slides adapted from Pieter Abbeel, Alex Lee Outline Introduction to POMDPs Locally Optimal Solutions

More information

Robotics. Dynamics. Marc Toussaint U Stuttgart

Robotics. Dynamics. Marc Toussaint U Stuttgart Robotics Dynamics 1D point mass, damping & oscillation, PID, dynamics of mechanical systems, Euler-Lagrange equation, Newton-Euler recursion, general robot dynamics, joint space control, reference trajectory

More information

A software package for system identification in the behavioral setting

A software package for system identification in the behavioral setting 1 / 20 A software package for system identification in the behavioral setting Ivan Markovsky Vrije Universiteit Brussel 2 / 20 Outline Introduction: system identification in the behavioral setting Solution

More information

ELEC4631 s Lecture 2: Dynamic Control Systems 7 March Overview of dynamic control systems

ELEC4631 s Lecture 2: Dynamic Control Systems 7 March Overview of dynamic control systems ELEC4631 s Lecture 2: Dynamic Control Systems 7 March 2011 Overview of dynamic control systems Goals of Controller design Autonomous dynamic systems Linear Multi-input multi-output (MIMO) systems Bat flight

More information

Introduction to System Identification and Adaptive Control

Introduction to System Identification and Adaptive Control Introduction to System Identification and Adaptive Control A. Khaki Sedigh Control Systems Group Faculty of Electrical and Computer Engineering K. N. Toosi University of Technology May 2009 Introduction

More information

Modeling and Control Overview

Modeling and Control Overview Modeling and Control Overview D R. T A R E K A. T U T U N J I A D V A N C E D C O N T R O L S Y S T E M S M E C H A T R O N I C S E N G I N E E R I N G D E P A R T M E N T P H I L A D E L P H I A U N I

More information

Optimal Polynomial Control for Discrete-Time Systems

Optimal Polynomial Control for Discrete-Time Systems 1 Optimal Polynomial Control for Discrete-Time Systems Prof Guy Beale Electrical and Computer Engineering Department George Mason University Fairfax, Virginia Correspondence concerning this paper should

More information

I. D. Landau, A. Karimi: A Course on Adaptive Control Adaptive Control. Part 9: Adaptive Control with Multiple Models and Switching

I. D. Landau, A. Karimi: A Course on Adaptive Control Adaptive Control. Part 9: Adaptive Control with Multiple Models and Switching I. D. Landau, A. Karimi: A Course on Adaptive Control - 5 1 Adaptive Control Part 9: Adaptive Control with Multiple Models and Switching I. D. Landau, A. Karimi: A Course on Adaptive Control - 5 2 Outline

More information

Basics of reinforcement learning

Basics of reinforcement learning Basics of reinforcement learning Lucian Buşoniu TMLSS, 20 July 2018 Main idea of reinforcement learning (RL) Learn a sequential decision policy to optimize the cumulative performance of an unknown system

More information

Analysis and synthesis: a complexity perspective

Analysis and synthesis: a complexity perspective Analysis and synthesis: a complexity perspective Pablo A. Parrilo ETH ZürichZ control.ee.ethz.ch/~parrilo Outline System analysis/design Formal and informal methods SOS/SDP techniques and applications

More information

Theory in Model Predictive Control :" Constraint Satisfaction and Stability!

Theory in Model Predictive Control : Constraint Satisfaction and Stability! Theory in Model Predictive Control :" Constraint Satisfaction and Stability Colin Jones, Melanie Zeilinger Automatic Control Laboratory, EPFL Example: Cessna Citation Aircraft Linearized continuous-time

More information

RECURSIVE SUBSPACE IDENTIFICATION IN THE LEAST SQUARES FRAMEWORK

RECURSIVE SUBSPACE IDENTIFICATION IN THE LEAST SQUARES FRAMEWORK RECURSIVE SUBSPACE IDENTIFICATION IN THE LEAST SQUARES FRAMEWORK TRNKA PAVEL AND HAVLENA VLADIMÍR Dept of Control Engineering, Czech Technical University, Technická 2, 166 27 Praha, Czech Republic mail:

More information

Geodesic Density Tracking with Applications to Data Driven Modeling

Geodesic Density Tracking with Applications to Data Driven Modeling Geodesic Density Tracking with Applications to Data Driven Modeling Abhishek Halder and Raktim Bhattacharya Department of Aerospace Engineering Texas A&M University College Station, TX 77843-3141 American

More information

AN IDENTIFICATION ALGORITHM FOR ARMAX SYSTEMS

AN IDENTIFICATION ALGORITHM FOR ARMAX SYSTEMS AN IDENTIFICATION ALGORITHM FOR ARMAX SYSTEMS First the X, then the AR, finally the MA Jan C. Willems, K.U. Leuven Workshop on Observation and Estimation Ben Gurion University, July 3, 2004 p./2 Joint

More information

Selected Examples of CONIC DUALITY AT WORK Robust Linear Optimization Synthesis of Linear Controllers Matrix Cube Theorem A.

Selected Examples of CONIC DUALITY AT WORK Robust Linear Optimization Synthesis of Linear Controllers Matrix Cube Theorem A. . Selected Examples of CONIC DUALITY AT WORK Robust Linear Optimization Synthesis of Linear Controllers Matrix Cube Theorem A. Nemirovski Arkadi.Nemirovski@isye.gatech.edu Linear Optimization Problem,

More information

Distributed and Real-time Predictive Control

Distributed and Real-time Predictive Control Distributed and Real-time Predictive Control Melanie Zeilinger Christian Conte (ETH) Alexander Domahidi (ETH) Ye Pu (EPFL) Colin Jones (EPFL) Challenges in modern control systems Power system: - Frequency

More information

State Estimation of Linear and Nonlinear Dynamic Systems

State Estimation of Linear and Nonlinear Dynamic Systems State Estimation of Linear and Nonlinear Dynamic Systems Part I: Linear Systems with Gaussian Noise James B. Rawlings and Fernando V. Lima Department of Chemical and Biological Engineering University of

More information

A structured low-rank approximation approach to system identification. Ivan Markovsky

A structured low-rank approximation approach to system identification. Ivan Markovsky 1 / 35 A structured low-rank approximation approach to system identification Ivan Markovsky Main message: system identification SLRA minimize over B dist(a,b) subject to rank(b) r and B structured (SLRA)

More information

Partially Observable Markov Decision Processes (POMDPs) Pieter Abbeel UC Berkeley EECS

Partially Observable Markov Decision Processes (POMDPs) Pieter Abbeel UC Berkeley EECS Partially Observable Markov Decision Processes (POMDPs) Pieter Abbeel UC Berkeley EECS Many slides adapted from Jur van den Berg Outline POMDPs Separation Principle / Certainty Equivalence Locally Optimal

More information

A missing data approach to data-driven filtering and control

A missing data approach to data-driven filtering and control 1 A missing data approach to data-driven filtering and control Ivan Markovsky Abstract In filtering, control, and other mathematical engineering areas it is common to use a model-based approach, which

More information

ECE557 Systems Control

ECE557 Systems Control ECE557 Systems Control Bruce Francis Course notes, Version.0, September 008 Preface This is the second Engineering Science course on control. It assumes ECE56 as a prerequisite. If you didn t take ECE56,

More information

Introduction to Support Vector Machines

Introduction to Support Vector Machines Introduction to Support Vector Machines Hsuan-Tien Lin Learning Systems Group, California Institute of Technology Talk in NTU EE/CS Speech Lab, November 16, 2005 H.-T. Lin (Learning Systems Group) Introduction

More information

Optimal Transport in Risk Analysis

Optimal Transport in Risk Analysis Optimal Transport in Risk Analysis Jose Blanchet (based on work with Y. Kang and K. Murthy) Stanford University (Management Science and Engineering), and Columbia University (Department of Statistics and

More information

Approximate dynamic programming for stochastic reachability

Approximate dynamic programming for stochastic reachability Approximate dynamic programming for stochastic reachability Nikolaos Kariotoglou, Sean Summers, Tyler Summers, Maryam Kamgarpour and John Lygeros Abstract In this work we illustrate how approximate dynamic

More information

Predictive Control of Gyroscopic-Force Actuators for Mechanical Vibration Damping

Predictive Control of Gyroscopic-Force Actuators for Mechanical Vibration Damping ARC Centre of Excellence for Complex Dynamic Systems and Control, pp 1 15 Predictive Control of Gyroscopic-Force Actuators for Mechanical Vibration Damping Tristan Perez 1, 2 Joris B Termaat 3 1 School

More information

Simple criteria for controller performance monitoring

Simple criteria for controller performance monitoring Simple criteria for controller performance monitoring Kostas Tsakalis (ASU) and Sachi Dash (Honeywell) Introduction/motivation Performance monitoring philosophy Simple criteria based on small-gain arguments

More information

Optimal Control. McGill COMP 765 Oct 3 rd, 2017

Optimal Control. McGill COMP 765 Oct 3 rd, 2017 Optimal Control McGill COMP 765 Oct 3 rd, 2017 Classical Control Quiz Question 1: Can a PID controller be used to balance an inverted pendulum: A) That starts upright? B) That must be swung-up (perhaps

More information

Distirbutional robustness, regularizing variance, and adversaries

Distirbutional robustness, regularizing variance, and adversaries Distirbutional robustness, regularizing variance, and adversaries John Duchi Based on joint work with Hongseok Namkoong and Aman Sinha Stanford University November 2017 Motivation We do not want machine-learned

More information

Adaptive Nonlinear Model Predictive Control with Suboptimality and Stability Guarantees

Adaptive Nonlinear Model Predictive Control with Suboptimality and Stability Guarantees Adaptive Nonlinear Model Predictive Control with Suboptimality and Stability Guarantees Pontus Giselsson Department of Automatic Control LTH Lund University Box 118, SE-221 00 Lund, Sweden pontusg@control.lth.se

More information

Complexity of stochastic branch and bound methods for belief tree search in Bayesian reinforcement learning

Complexity of stochastic branch and bound methods for belief tree search in Bayesian reinforcement learning Complexity of stochastic branch and bound methods for belief tree search in Bayesian reinforcement learning Christos Dimitrakakis Informatics Institute, University of Amsterdam, Amsterdam, The Netherlands

More information

Iterative Learning Control Analysis and Design I

Iterative Learning Control Analysis and Design I Iterative Learning Control Analysis and Design I Electronics and Computer Science University of Southampton Southampton, SO17 1BJ, UK etar@ecs.soton.ac.uk http://www.ecs.soton.ac.uk/ Contents Basics Representations

More information

MPC Infeasibility Handling

MPC Infeasibility Handling MPC Handling Thomas Wiese, TU Munich, KU Leuven supervised by H.J. Ferreau, Prof. M. Diehl (both KUL) and Dr. H. Gräb (TUM) October 9, 2008 1 / 42 MPC General MPC Strategies 2 / 42 Linear Discrete-Time

More information

ESC794: Special Topics: Model Predictive Control

ESC794: Special Topics: Model Predictive Control ESC794: Special Topics: Model Predictive Control Nonlinear MPC Analysis : Part 1 Reference: Nonlinear Model Predictive Control (Ch.3), Grüne and Pannek Hanz Richter, Professor Mechanical Engineering Department

More information

Introduction to Reinforcement Learning. CMPT 882 Mar. 18

Introduction to Reinforcement Learning. CMPT 882 Mar. 18 Introduction to Reinforcement Learning CMPT 882 Mar. 18 Outline for the week Basic ideas in RL Value functions and value iteration Policy evaluation and policy improvement Model-free RL Monte-Carlo and

More information

On a Data Assimilation Method coupling Kalman Filtering, MCRE Concept and PGD Model Reduction for Real-Time Updating of Structural Mechanics Model

On a Data Assimilation Method coupling Kalman Filtering, MCRE Concept and PGD Model Reduction for Real-Time Updating of Structural Mechanics Model On a Data Assimilation Method coupling, MCRE Concept and PGD Model Reduction for Real-Time Updating of Structural Mechanics Model 2016 SIAM Conference on Uncertainty Quantification Basile Marchand 1, Ludovic

More information

Empirical Risk Minimization

Empirical Risk Minimization Empirical Risk Minimization Fabrice Rossi SAMM Université Paris 1 Panthéon Sorbonne 2018 Outline Introduction PAC learning ERM in practice 2 General setting Data X the input space and Y the output space

More information

A Tour of Reinforcement Learning The View from Continuous Control. Benjamin Recht University of California, Berkeley

A Tour of Reinforcement Learning The View from Continuous Control. Benjamin Recht University of California, Berkeley A Tour of Reinforcement Learning The View from Continuous Control Benjamin Recht University of California, Berkeley trustable, scalable, predictable Control Theory! Reinforcement Learning is the study

More information

MIT 9.520/6.860, Fall 2017 Statistical Learning Theory and Applications. Class 19: Data Representation by Design

MIT 9.520/6.860, Fall 2017 Statistical Learning Theory and Applications. Class 19: Data Representation by Design MIT 9.520/6.860, Fall 2017 Statistical Learning Theory and Applications Class 19: Data Representation by Design What is data representation? Let X be a data-space X M (M) F (M) X A data representation

More information

Real-Time Feasibility of Nonlinear Predictive Control for Semi-batch Reactors

Real-Time Feasibility of Nonlinear Predictive Control for Semi-batch Reactors European Symposium on Computer Arded Aided Process Engineering 15 L. Puigjaner and A. Espuña (Editors) 2005 Elsevier Science B.V. All rights reserved. Real-Time Feasibility of Nonlinear Predictive Control

More information

ELEC system identification workshop. Behavioral approach

ELEC system identification workshop. Behavioral approach 1 / 33 ELEC system identification workshop Behavioral approach Ivan Markovsky 2 / 33 The course consists of lectures and exercises Session 1: behavioral approach to data modeling Session 2: subspace identification

More information

Efficient robust optimization for robust control with constraints Paul Goulart, Eric Kerrigan and Danny Ralph

Efficient robust optimization for robust control with constraints Paul Goulart, Eric Kerrigan and Danny Ralph Efficient robust optimization for robust control with constraints p. 1 Efficient robust optimization for robust control with constraints Paul Goulart, Eric Kerrigan and Danny Ralph Efficient robust optimization

More information

Hidden Markov Models (HMM) and Support Vector Machine (SVM)

Hidden Markov Models (HMM) and Support Vector Machine (SVM) Hidden Markov Models (HMM) and Support Vector Machine (SVM) Professor Joongheon Kim School of Computer Science and Engineering, Chung-Ang University, Seoul, Republic of Korea 1 Hidden Markov Models (HMM)

More information

Technical work in WP2 and WP5

Technical work in WP2 and WP5 Technical work in WP2 and WP5 UNIZG-FER Mato Baotić, Branimir Novoselnik, Jadranko Matuško, Mario Vašak, Andrej Jokić Aachen, October 9, 2015 This project has received funding from the European Union s

More information

IDENTIFICATION FOR CONTROL

IDENTIFICATION FOR CONTROL IDENTIFICATION FOR CONTROL Raymond A. de Callafon, University of California San Diego, USA Paul M.J. Van den Hof, Delft University of Technology, the Netherlands Keywords: Controller, Closed loop model,

More information

Gaussian Estimation under Attack Uncertainty

Gaussian Estimation under Attack Uncertainty Gaussian Estimation under Attack Uncertainty Tara Javidi Yonatan Kaspi Himanshu Tyagi Abstract We consider the estimation of a standard Gaussian random variable under an observation attack where an adversary

More information

Target Tracking Using Double Pendulum

Target Tracking Using Double Pendulum Target Tracking Using Double Pendulum Brian Spackman 1, Anusna Chakraborty 1 Department of Electrical and Computer Engineering Utah State University Abstract: This paper deals with the design, implementation

More information

Event-Triggered Decentralized Dynamic Output Feedback Control for LTI Systems

Event-Triggered Decentralized Dynamic Output Feedback Control for LTI Systems Event-Triggered Decentralized Dynamic Output Feedback Control for LTI Systems Pavankumar Tallapragada Nikhil Chopra Department of Mechanical Engineering, University of Maryland, College Park, 2742 MD,

More information

D(s) G(s) A control system design definition

D(s) G(s) A control system design definition R E Compensation D(s) U Plant G(s) Y Figure 7. A control system design definition x x x 2 x 2 U 2 s s 7 2 Y Figure 7.2 A block diagram representing Eq. (7.) in control form z U 2 s z Y 4 z 2 s z 2 3 Figure

More information

CHAPTER 5 ROBUSTNESS ANALYSIS OF THE CONTROLLER

CHAPTER 5 ROBUSTNESS ANALYSIS OF THE CONTROLLER 114 CHAPTER 5 ROBUSTNESS ANALYSIS OF THE CONTROLLER 5.1 INTRODUCTION Robust control is a branch of control theory that explicitly deals with uncertainty in its approach to controller design. It also refers

More information

Location Prediction of Moving Target

Location Prediction of Moving Target Location of Moving Target Department of Radiation Oncology Stanford University AAPM 2009 Outline of Topics 1 Outline of Topics 1 2 Outline of Topics 1 2 3 Estimation of Stochastic Process Stochastic Regression:

More information

THIS paper deals with robust control in the setup associated

THIS paper deals with robust control in the setup associated IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL 50, NO 10, OCTOBER 2005 1501 Control-Oriented Model Validation and Errors Quantification in the `1 Setup V F Sokolov Abstract A priori information required for

More information

Nonlinear Model Predictive Control Tools (NMPC Tools)

Nonlinear Model Predictive Control Tools (NMPC Tools) Nonlinear Model Predictive Control Tools (NMPC Tools) Rishi Amrit, James B. Rawlings April 5, 2008 1 Formulation We consider a control system composed of three parts([2]). Estimator Target calculator Regulator

More information

Autonomous Mobile Robot Design

Autonomous Mobile Robot Design Autonomous Mobile Robot Design Topic: Guidance and Control Introduction and PID Loops Dr. Kostas Alexis (CSE) Autonomous Robot Challenges How do I control where to go? Autonomous Mobile Robot Design Topic:

More information

Denis ARZELIER arzelier

Denis ARZELIER   arzelier COURSE ON LMI OPTIMIZATION WITH APPLICATIONS IN CONTROL PART II.2 LMIs IN SYSTEMS CONTROL STATE-SPACE METHODS PERFORMANCE ANALYSIS and SYNTHESIS Denis ARZELIER www.laas.fr/ arzelier arzelier@laas.fr 15

More information

Artificial Intelligence

Artificial Intelligence Artificial Intelligence Dynamic Programming Marc Toussaint University of Stuttgart Winter 2018/19 Motivation: So far we focussed on tree search-like solvers for decision problems. There is a second important

More information

Dynamic measurement: application of system identification in metrology

Dynamic measurement: application of system identification in metrology 1 / 25 Dynamic measurement: application of system identification in metrology Ivan Markovsky Dynamic measurement takes into account the dynamical properties of the sensor 2 / 25 model of sensor as dynamical

More information

On Input Design for System Identification

On Input Design for System Identification On Input Design for System Identification Input Design Using Markov Chains CHIARA BRIGHENTI Masters Degree Project Stockholm, Sweden March 2009 XR-EE-RT 2009:002 Abstract When system identification methods

More information

Further Results on Model Structure Validation for Closed Loop System Identification

Further Results on Model Structure Validation for Closed Loop System Identification Advances in Wireless Communications and etworks 7; 3(5: 57-66 http://www.sciencepublishinggroup.com/j/awcn doi:.648/j.awcn.735. Further esults on Model Structure Validation for Closed Loop System Identification

More information

State Estimation using Moving Horizon Estimation and Particle Filtering

State Estimation using Moving Horizon Estimation and Particle Filtering State Estimation using Moving Horizon Estimation and Particle Filtering James B. Rawlings Department of Chemical and Biological Engineering UW Math Probability Seminar Spring 2009 Rawlings MHE & PF 1 /

More information

Feedback Control of Turbulent Wall Flows

Feedback Control of Turbulent Wall Flows Feedback Control of Turbulent Wall Flows Dipartimento di Ingegneria Aerospaziale Politecnico di Milano Outline Introduction Standard approach Wiener-Hopf approach Conclusions Drag reduction A model problem:

More information

A RECEDING HORIZON CONTROL FOR LIFTING PING-PONG BALL. Sumiko Majima and Keii Chou

A RECEDING HORIZON CONTROL FOR LIFTING PING-PONG BALL. Sumiko Majima and Keii Chou A RECEDING HORIZON CONROL FOR LIFING PING-PONG BALL Sumiko Majima and Keii Chou Graduate School of Systems and Information Engineering, University of sukuba, sukuba, Ibaraki, Japan Abstract: his paper

More information

Prashant Mhaskar, Nael H. El-Farra & Panagiotis D. Christofides. Department of Chemical Engineering University of California, Los Angeles

Prashant Mhaskar, Nael H. El-Farra & Panagiotis D. Christofides. Department of Chemical Engineering University of California, Los Angeles HYBRID PREDICTIVE OUTPUT FEEDBACK STABILIZATION OF CONSTRAINED LINEAR SYSTEMS Prashant Mhaskar, Nael H. El-Farra & Panagiotis D. Christofides Department of Chemical Engineering University of California,

More information

Lecture 8 Receding Horizon Temporal Logic Planning & Finite-State Abstraction

Lecture 8 Receding Horizon Temporal Logic Planning & Finite-State Abstraction Lecture 8 Receding Horizon Temporal Logic Planning & Finite-State Abstraction Ufuk Topcu Nok Wongpiromsarn Richard M. Murray AFRL, 26 April 2012 Contents of the lecture: Intro: Incorporating continuous

More information

Neural Map. Structured Memory for Deep RL. Emilio Parisotto

Neural Map. Structured Memory for Deep RL. Emilio Parisotto Neural Map Structured Memory for Deep RL Emilio Parisotto eparisot@andrew.cmu.edu PhD Student Machine Learning Department Carnegie Mellon University Supervised Learning Most deep learning problems are

More information

Optimal Transport Methods in Operations Research and Statistics

Optimal Transport Methods in Operations Research and Statistics Optimal Transport Methods in Operations Research and Statistics Jose Blanchet (based on work with F. He, Y. Kang, K. Murthy, F. Zhang). Stanford University (Management Science and Engineering), and Columbia

More information

FIR Filters for Stationary State Space Signal Models

FIR Filters for Stationary State Space Signal Models Proceedings of the 17th World Congress The International Federation of Automatic Control FIR Filters for Stationary State Space Signal Models Jung Hun Park Wook Hyun Kwon School of Electrical Engineering

More information

Path Integral Stochastic Optimal Control for Reinforcement Learning

Path Integral Stochastic Optimal Control for Reinforcement Learning Preprint August 3, 204 The st Multidisciplinary Conference on Reinforcement Learning and Decision Making RLDM203 Path Integral Stochastic Optimal Control for Reinforcement Learning Farbod Farshidian Institute

More information

FRTN 15 Predictive Control

FRTN 15 Predictive Control Department of AUTOMATIC CONTROL FRTN 5 Predictive Control Final Exam March 4, 27, 8am - 3pm General Instructions This is an open book exam. You may use any book you want, including the slides from the

More information

Reinforcement Learning

Reinforcement Learning Reinforcement Learning Lecture 6: RL algorithms 2.0 Alexandre Proutiere, Sadegh Talebi, Jungseul Ok KTH, The Royal Institute of Technology Objectives of this lecture Present and analyse two online algorithms

More information

Distributionally Robust Optimization with ROME (part 1)

Distributionally Robust Optimization with ROME (part 1) Distributionally Robust Optimization with ROME (part 1) Joel Goh Melvyn Sim Department of Decision Sciences NUS Business School, Singapore 18 Jun 2009 NUS Business School Guest Lecture J. Goh, M. Sim (NUS)

More information

Overview of sparse system identification

Overview of sparse system identification Overview of sparse system identification J.-Ch. Loiseau 1 & Others 2, 3 1 Laboratoire DynFluid, Arts et Métiers ParisTech, France 2 LIMSI, Université d Orsay CNRS, France 3 University of Washington, Seattle,

More information

Theory of Vibrations in Stewart Platforms

Theory of Vibrations in Stewart Platforms Theory of Vibrations in Stewart Platforms J.M. Selig and X. Ding School of Computing, Info. Sys. & Maths. South Bank University London SE1 0AA, U.K. (seligjm@sbu.ac.uk) Abstract This article develops a

More information

ORIGINS OF STOCHASTIC PROGRAMMING

ORIGINS OF STOCHASTIC PROGRAMMING ORIGINS OF STOCHASTIC PROGRAMMING Early 1950 s: in applications of Linear Programming unknown values of coefficients: demands, technological coefficients, yields, etc. QUOTATION Dantzig, Interfaces 20,1990

More information

Control Strategies for Microgrids

Control Strategies for Microgrids Control Strategies for Microgrids Ali Mehrizi-Sani Assistant Professor School of Electrical Engineering and Computer Science Washington State University Graz University of Technology Thursday, November

More information

Robotics. Dynamics. University of Stuttgart Winter 2018/19

Robotics. Dynamics. University of Stuttgart Winter 2018/19 Robotics Dynamics 1D point mass, damping & oscillation, PID, dynamics of mechanical systems, Euler-Lagrange equation, Newton-Euler, joint space control, reference trajectory following, optimal operational

More information

Industrial Model Predictive Control

Industrial Model Predictive Control Industrial Model Predictive Control Emil Schultz Christensen Kongens Lyngby 2013 DTU Compute-M.Sc.-2013-49 Technical University of Denmark DTU Compute Matematiktovet, Building 303B, DK-2800 Kongens Lyngby,

More information

Probabilistic Graphical Models

Probabilistic Graphical Models Probabilistic Graphical Models Brown University CSCI 2950-P, Spring 2013 Prof. Erik Sudderth Lecture 12: Gaussian Belief Propagation, State Space Models and Kalman Filters Guest Kalman Filter Lecture by

More information

Semidefinite and Second Order Cone Programming Seminar Fall 2012 Project: Robust Optimization and its Application of Robust Portfolio Optimization

Semidefinite and Second Order Cone Programming Seminar Fall 2012 Project: Robust Optimization and its Application of Robust Portfolio Optimization Semidefinite and Second Order Cone Programming Seminar Fall 2012 Project: Robust Optimization and its Application of Robust Portfolio Optimization Instructor: Farid Alizadeh Author: Ai Kagawa 12/12/2012

More information

LQR, Kalman Filter, and LQG. Postgraduate Course, M.Sc. Electrical Engineering Department College of Engineering University of Salahaddin

LQR, Kalman Filter, and LQG. Postgraduate Course, M.Sc. Electrical Engineering Department College of Engineering University of Salahaddin LQR, Kalman Filter, and LQG Postgraduate Course, M.Sc. Electrical Engineering Department College of Engineering University of Salahaddin May 2015 Linear Quadratic Regulator (LQR) Consider a linear system

More information

Reinforcement learning

Reinforcement learning Reinforcement learning Based on [Kaelbling et al., 1996, Bertsekas, 2000] Bert Kappen Reinforcement learning Reinforcement learning is the problem faced by an agent that must learn behavior through trial-and-error

More information

Mathematical Methods for Data Analysis

Mathematical Methods for Data Analysis Mathematical Methods for Data Analysis Massimiliano Pontil Istituto Italiano di Tecnologia and Department of Computer Science University College London Massimiliano Pontil Mathematical Methods for Data

More information

Momentum-centric whole-body control and kino-dynamic motion generation for floating-base robots

Momentum-centric whole-body control and kino-dynamic motion generation for floating-base robots Momentum-centric whole-body control and kino-dynamic motion generation for floating-base robots Alexander Herzog The Movement Generation and Control Group (Ludovic Righetti) Conflicting tasks & constraints

More information

Control Systems I. Lecture 7: Feedback and the Root Locus method. Readings: Jacopo Tani. Institute for Dynamic Systems and Control D-MAVT ETH Zürich

Control Systems I. Lecture 7: Feedback and the Root Locus method. Readings: Jacopo Tani. Institute for Dynamic Systems and Control D-MAVT ETH Zürich Control Systems I Lecture 7: Feedback and the Root Locus method Readings: Jacopo Tani Institute for Dynamic Systems and Control D-MAVT ETH Zürich November 2, 2018 J. Tani, E. Frazzoli (ETH) Lecture 7:

More information

Here represents the impulse (or delta) function. is an diagonal matrix of intensities, and is an diagonal matrix of intensities.

Here represents the impulse (or delta) function. is an diagonal matrix of intensities, and is an diagonal matrix of intensities. 19 KALMAN FILTER 19.1 Introduction In the previous section, we derived the linear quadratic regulator as an optimal solution for the fullstate feedback control problem. The inherent assumption was that

More information

Gramians based model reduction for hybrid switched systems

Gramians based model reduction for hybrid switched systems Gramians based model reduction for hybrid switched systems Y. Chahlaoui Younes.Chahlaoui@manchester.ac.uk Centre for Interdisciplinary Computational and Dynamical Analysis (CICADA) School of Mathematics

More information

NONLINEAR AND ADAPTIVE (INTELLIGENT) SYSTEMS MODELING, DESIGN, & CONTROL A Building Block Approach

NONLINEAR AND ADAPTIVE (INTELLIGENT) SYSTEMS MODELING, DESIGN, & CONTROL A Building Block Approach NONLINEAR AND ADAPTIVE (INTELLIGENT) SYSTEMS MODELING, DESIGN, & CONTROL A Building Block Approach P.A. (Rama) Ramamoorthy Electrical & Computer Engineering and Comp. Science Dept., M.L. 30, University

More information

Low-rank approximation and its applications for data fitting

Low-rank approximation and its applications for data fitting Low-rank approximation and its applications for data fitting Ivan Markovsky K.U.Leuven, ESAT-SISTA A line fitting example b 6 4 2 0 data points Classical problem: Fit the points d 1 = [ 0 6 ], d2 = [ 1

More information

ECE7850 Lecture 8. Nonlinear Model Predictive Control: Theoretical Aspects

ECE7850 Lecture 8. Nonlinear Model Predictive Control: Theoretical Aspects ECE7850 Lecture 8 Nonlinear Model Predictive Control: Theoretical Aspects Model Predictive control (MPC) is a powerful control design method for constrained dynamical systems. The basic principles and

More information