Lecture 20: Riccati Equations and Least Squares Feedback Control

Similar documents
SUFFICIENT CONDITIONS FOR EXISTENCE SOLUTION OF LINEAR TWO-POINT BOUNDARY PROBLEM IN MINIMIZATION OF QUADRATIC FUNCTIONAL

An Introduction to Malliavin calculus and its applications

10. State Space Methods

Chapter 3 Boundary Value Problem

3.1.3 INTRODUCTION TO DYNAMIC OPTIMIZATION: DISCRETE TIME PROBLEMS. A. The Hamiltonian and First-Order Conditions in a Finite Time Horizon

IMPLICIT AND INVERSE FUNCTION THEOREMS PAUL SCHRIMPF 1 OCTOBER 25, 2013

Matrix Versions of Some Refinements of the Arithmetic-Geometric Mean Inequality

Hamilton- J acobi Equation: Explicit Formulas In this lecture we try to apply the method of characteristics to the Hamilton-Jacobi equation: u t

Sliding Mode Extremum Seeking Control for Linear Quadratic Dynamic Game

Application of a Stochastic-Fuzzy Approach to Modeling Optimal Discrete Time Dynamical Systems by Using Large Scale Data Processing

Asymptotic instability of nonlinear differential equations

EXERCISES FOR SECTION 1.5

EE363 homework 1 solutions

System of Linear Differential Equations

( ) ( ) if t = t. It must satisfy the identity. So, bulkiness of the unit impulse (hyper)function is equal to 1. The defining characteristic is

2.160 System Identification, Estimation, and Learning. Lecture Notes No. 8. March 6, 2006

t is a basis for the solution space to this system, then the matrix having these solutions as columns, t x 1 t, x 2 t,... x n t x 2 t...

Finish reading Chapter 2 of Spivak, rereading earlier sections as necessary. handout and fill in some missing details!

The Linear Quadratic Regulator on Time Scales

Linear Quadratic Regulator (LQR) - State Feedback Design

DISCRETE GRONWALL LEMMA AND APPLICATIONS

ODEs II, Lecture 1: Homogeneous Linear Systems - I. Mike Raugh 1. March 8, 2004

Announcements: Warm-up Exercise:

POSITIVE AND MONOTONE SYSTEMS IN A PARTIALLY ORDERED SPACE

SZG Macro 2011 Lecture 3: Dynamic Programming. SZG macro 2011 lecture 3 1

Hybrid Control and Switched Systems. Lecture #3 What can go wrong? Trajectories of hybrid systems

Intermediate Differential Equations Review and Basic Ideas

Tracking Adversarial Targets

Course Notes for EE227C (Spring 2018): Convex Optimization and Approximation

t 2 B F x,t n dsdt t u x,t dxdt

EE 315 Notes. Gürdal Arslan CLASS 1. (Sections ) What is a signal?

Technical Report Doc ID: TR March-2013 (Last revision: 23-February-2016) On formulating quadratic functions in optimization models.

4. Advanced Stability Theory

Convergence of the Neumann series in higher norms

2. Nonlinear Conservation Law Equations

F This leads to an unstable mode which is not observable at the output thus cannot be controlled by feeding back.

Chapter 6. Systems of First Order Linear Differential Equations

Utility maximization in incomplete markets

Essential Microeconomics : OPTIMAL CONTROL 1. Consider the following class of optimization problems

MATH 128A, SUMMER 2009, FINAL EXAM SOLUTION

Math Week 14 April 16-20: sections first order systems of linear differential equations; 7.4 mass-spring systems.

Math Final Exam Solutions

STATE-SPACE MODELLING. A mass balance across the tank gives:

Chapter 2. First Order Scalar Equations

BU Macro BU Macro Fall 2008, Lecture 4

Problemas das Aulas Práticas

FEEDBACK NULL CONTROLLABILITY OF THE SEMILINEAR HEAT EQUATION

LAPLACE TRANSFORM AND TRANSFER FUNCTION

Physics 235 Chapter 2. Chapter 2 Newtonian Mechanics Single Particle

Heat kernel and Harnack inequality on Riemannian manifolds

On-line Adaptive Optimal Timing Control of Switched Systems

Guest Lectures for Dr. MacFarlane s EE3350 Part Deux

AN OPTIMAL CONTROL PROBLEM FOR SPLINES ASSOCIATED TO LINEAR DIFFERENTIAL OPERATORS

Ordinary Differential Equations

dt = C exp (3 ln t 4 ). t 4 W = C exp ( ln(4 t) 3) = C(4 t) 3.

Ann. Funct. Anal. 2 (2011), no. 2, A nnals of F unctional A nalysis ISSN: (electronic) URL:

(1) (2) Differentiation of (1) and then substitution of (3) leads to. Therefore, we will simply consider the second-order linear system given by (4)

MATH 5720: Gradient Methods Hung Phan, UMass Lowell October 4, 2018

Recursive Least-Squares Fixed-Interval Smoother Using Covariance Information based on Innovation Approach in Linear Continuous Stochastic Systems

Hamilton Jacobi equations

15. Vector Valued Functions

An Introduction to Backward Stochastic Differential Equations (BSDEs) PIMS Summer School 2016 in Mathematical Finance.

Optimal Path Planning for Flexible Redundant Robot Manipulators

Math 10B: Mock Mid II. April 13, 2016

CERTAIN CLASSES OF SOLUTIONS OF LAGERSTROM EQUATIONS

NEW APPROACH TO DIFFERENTIAL EQUATIONS WITH COUNTABLE IMPULSES

Oscillation of an Euler Cauchy Dynamic Equation S. Huff, G. Olumolode, N. Pennington, and A. Peterson

Hamilton- J acobi Equation: Weak S olution We continue the study of the Hamilton-Jacobi equation:

THE BELLMAN PRINCIPLE OF OPTIMALITY

t dt t SCLP Bellman (1953) CLP (Dantzig, Tyndall, Grinold, Perold, Anstreicher 60's-80's) Anderson (1978) SCLP

( ) = Q 0. ( ) R = R dq. ( t) = I t

Linear Response Theory: The connection between QFT and experiments

Robust estimation based on the first- and third-moment restrictions of the power transformation model

CONTRIBUTION TO IMPULSIVE EQUATIONS

ON QUANTIZATION AND COMMUNICATION TOPOLOGIES IN MULTI-VEHICLE RENDEZVOUS 1. Karl Henrik Johansson Alberto Speranzon,2 Sandro Zampieri

arxiv: v1 [math.fa] 9 Dec 2018

Optimal Consumption and Investment Portfolio in Jump markets. Optimal Consumption and Portfolio of Investment in a Financial Market with Jumps

Math-Net.Ru All Russian mathematical portal

CHAPTER 2 Signals And Spectra

The Asymptotic Behavior of Nonoscillatory Solutions of Some Nonlinear Dynamic Equations on Time Scales

Section 3.5 Nonhomogeneous Equations; Method of Undetermined Coefficients

An introduction to the theory of SDDP algorithm

1 Subdivide the optimization horizon [t 0,t f ] into n s 1 control stages,

non -negative cone Population dynamics motivates the study of linear models whose coefficient matrices are non-negative or positive.

Module 4: Time Response of discrete time systems Lecture Note 2

Backward stochastic dynamics on a filtered probability space

L p -L q -Time decay estimate for solution of the Cauchy problem for hyperbolic partial differential equations of linear thermoelasticity

Designing Information Devices and Systems I Spring 2019 Lecture Notes Note 17

Logic in computer science

T L. t=1. Proof of Lemma 1. Using the marginal cost accounting in Equation(4) and standard arguments. t )+Π RB. t )+K 1(Q RB

Exercises: Similarity Transformation

This document was generated at 1:04 PM, 09/10/13 Copyright 2013 Richard T. Woodward. 4. End points and transversality conditions AGEC

5. Response of Linear Time-Invariant Systems to Random Inputs

Physics 1402: Lecture 22 Today s Agenda

MA 214 Calculus IV (Spring 2016) Section 2. Homework Assignment 1 Solutions

BOUNDED VARIATION SOLUTIONS TO STURM-LIOUVILLE PROBLEMS

Introduction to Probability and Statistics Slides 4 Chapter 4

References are appeared in the last slide. Last update: (1393/08/19)

Inventory Analysis and Management. Multi-Period Stochastic Models: Optimality of (s, S) Policy for K-Convex Objective Functions

Lectures 29 and 30 BIQUADRATICS AND STATE SPACE OP AMP REALIZATIONS. I. Introduction

Transcription:

34-5 LINEAR SYSTEMS Lecure : Riccai Equaions and Leas Squares Feedback Conrol 5.6.4 Sae Feedback via Riccai Equaions A recursive approach in generaing he marix-valued funcion W ( ) equaion for i for he purpose of realizing he opimal conrol law in real ime. is o find a differenial Thus he marix differenial equaion can be approximaed by a difference equaion which can be solved numerically. Theorem: (properies of he backward conrollabiliy Grammian) The backward conrollabiliy Grammian W ( has he following properies for : (a) W ( is symmeric (b) W ( is posiive semidefinie d W ( ) = AW ( ) ( ) + W ( ) A ( ) BB ( ) ( ) W ( ) = θn n d (c) (d) W ( saisfies he equaion W( ) = W( ) +Φ( ) W( ) Φ ( ) Similar o Theorem on properies of he forward conrollabiliy Grammian in L8. Noe: The forward and backward conrollabiliy Grammians are relaed as follows: W ( ) =Φ( ) M ( ) Φ ( ) M ( ) =Φ( ) W ( ) Φ ( ) (5.7) Proposiion: The inverse of he backward conrollabiliy Grammian W equaion: ( ) saisfies he marix differenial Riccai d W ( A () W ( W ( A () W ( B () B () W = + ( d W ( ) = (5.73) L- /9

34-5 LINEAR SYSTEMS Differeniae: d d W ( ) = W ( ) W ( W ( ) d d Remarks: (a) One way of generaing W ( ) in real ime is o solve recursively a difference equaion approximaing he differenial Riccai Equaion (5.73) subjec o he iniial condiion W (. (b) The opimal conrol for he ransfer o he origin ( x ) ( ) is he sae feedback u = B W x. op () () ( () 5.7 Leas Squares Sae Feedback Conrol The leas-squares feedback conrol problem is defined as follows. Given: A sysem x () = A () x () + Bu () () x ( ) = x A se Σ x ( ) o which x ( belongs A cos funcion I N() u() J ( x u ) u ( ) x ( ) d x ( ) Qx( ) [ ] N () L() + x() The leas squares problem is ha of finding a conrol law is minimized subjec o he consrains. u U ad such ha he cos J ( x u ) [ ] This problem was parly solved for he minimum-norm inpu bu here we add he sae in he cos funcion in order o minimize i as well. The leas-squares problem is classified ino: (i) (ii) Free-end-poin problem Fixed-end-poin problem. 5.7. Free-End-Poin Problem The free-end-poin problem is defined as follows. Given: A sysem x () = A () x () + Bu () () x ( ) = x L- /9

34-5 LINEAR SYSTEMS A cos funcion [ ] = + + J ( x u ) u ( ) u( ) x ( ) L( ) x( ) d x ( ) Qx( ) L () L = () Q= Q ( N () = θ in he general problem definiion) Find a conrol law u U ad such ha he cos J ( x u ) is minimized subjec o he consrains. [ ] Noe ha because L () and Q are no assumed o be nonnegaive i is no generally rue ha a minimum exiss for J ( x u ). However in many conrol applicaions hese condiions will hold. [ ] I urns ou ha he deerminaion of condiions under which a minimum exiss as well as he acual calculaion of he minimizing conrol can be made o depend upon he soluion of he following Riccai equaion: P () A = () P () P () A () + PBB () () () P () L () (5.74) Since his equaion is nonlinear i is no clear ha for a given iniial condiion P a soluion will exis. Moreover even if soluions do exis for some ime inervals hey may fail o exis over longer inervals because of he possibiliy of having a finie escape ime. Example: (finie escape ime) Suppose ha he Riccai equaion is scalar and has he form p p p p ( ) = an( ) p( π /) = + () = () + () = Thus p () does no exis over he inerval. Noaion: The soluion o he Riccai equaion (5.74) will be denoed as Π (; P (or P ()) meaning ha is value a ime is Π ( ; P ) = P and is value a ime is generaed from Π ( ; P ) = P. Lemma: Suppose P () = P () and P () on he inerval [ ]. Then for he sysem x () = Ax () () + Bu () () we have: B ( ) P( ) u () u x d x P x = PB () () P () A() P () P () A () x () + + () () () () (). L- 3/9

34-5 LINEAR SYSTEMS If x() is a differeniable rajecory and if P() is a differeniable marix-valued funcion of ime hen: d x () Px () () d x () Px () () d = = + + x() Px () () x() P () x () x() Px () () x() Px () () d { } A () x () + Bu () () P () x () + x () { A() P () P () A () + PBB () () () P () L ()} x () + x() () () () () () P Ax Bu B ( ) P( ) u () u () x () d () () () () () () () x () PB P + A P + P A { + } d Theorem: Suppose L () = L() Q= Q and here exiss over [ ] a soluion o he Riccai differenial equaion P () A = () P () P () A () + PBB () () () P () L () P ( = Q hen: (a) There exiss a conrol u() which minimizes J ( x u op[ ]) subjec o he consrains x () = A () x () + Bu () () x ( ) = x (b) The minimum value of he cos funcion is ( op[ ] ) = Π( ; J x u x Q x (c) The opimal feedback conrol law is uop () =B() Π(; Q ) x() (d) The opimal open-loop conrol law is uop[ ]() =B() Π(; Q ) Φ ( ) x where Φ( ) is he sae ransiion marix for x () = A () BB () () Π(; Q x (). To ease he noaion le's use P () insead of Π (; Q o denoe he soluion of he Riccai equaion. From he lemma we have he ideniy for x () P() x() which we use in he cos funcion: L- 4/9

34-5 LINEAR SYSTEMS J [ ] ( x u ) I u( ) u () x () d + x ( ) Qx( ) L ( ) x ( ) I u( ) u x d + x P x + x P x L ( ) x ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) I u( ) u () x () L ( ) d x ( ) B ( ) P( ) u () + u ( ) x ( ) d+ x ( ) P( ) x( ) PB () () P () A() P () P () A () x () + + () () u () I B P u () x () d () () () () () () () () x () + x ( ) P( ) x( ) PB P + A P + P A + L I B () P() u () u () x () d x ( ) P( ) x( ) + PB () () PBB () () () P () x () = u () + B() Px () () d+ x( ) P ( ) x ( ) From his las equaliy we conclude ha he inpu ha minimizes he cos funcion is u () = B () P() x() which proves (a) and (c) and for which he cos funcion reaches is op minimum op ( op[ ] ) ( ; J x u = x Π Q x proving (b). For (d) noice ha he opimal rajecory x is given by he closed-loop equaion: x op () = A() B() B () Π(; Q ) xop (). Thus if Φ ( ) is he sae ransiion marix for he closed-loop sysem we have x () ( ) x( ) () = () () Φ ( ) ( ) is he opimal open-loop op policy. =Φ and hence uop B P x Remarks: Suppose ha L () = which amouns o weighing only he conrol signal in he cos funcion. Then he Riccai equaion reduces o P () A = () P () P () A () + PBB () () () P (). L- 5/9

34-5 LINEAR SYSTEMS Hence P() saisfies he same equaion as he inverse of he backward conrollabiliy Grammian W ( ). Furhermore i can be shown ha provided he inverse exiss. Π ( Q ; = W ( +Φ( Q Φ ( Theorem: Assume Q = Q. Le W ( be he backward conrollabiliy Grammian of x () = A () x () + Bu () () x ( ) = x. If he marix W ( ) +Φ( ) Q Φ ( ) is inverible [ ] hen here exiss a conrol ha minimizes [ ] = + J ( x u ) u ( ) u( ) d x ( ) Qx( ). Follows from he previous remark. 5.7. Fixed-End-Poin Problem The fixed-end-poin problem is defined as follows. Given: A sysem x () = A () x () + Bu () () x ( ) = x x ( = x A cos funcion ( N () Find a conrol law J ( x u[ ] ) u ( ) u( ) + x ( ) L( ) x( ) d L () = L () Q= Q = θ in he general problem definiion) u U ad such ha he cos J ( x u ) is minimized subjec o he consrains. [ ] This is an exension of he conrollabiliy problem sudied earlier in which a weigh on he sae is added. However here we discuss he soluion from he opimal conrol poin of view. If he erminal sae is required o belong o a cerain se insead of a fixed vecor he mehodology can be modified o accoun for his requiremen. In general calculus of variaions is he subjec ha deals wih such opimizaion problems. From Secion 5.6 we found an expression for he minimum-norm conrol: [ ] u () = B() Φ ( ) W ( ) Φ( ) x x (5.75) op which leads o he following lemma. L- 6/9

34-5 LINEAR SYSTEMS Lemma: Le W ( be he backward conrollabiliy Grammian of x () = Ax () () + Bu () (). If uop () is any conrol of he form uop () =B() Φ ( ) ξ where vecor ξ is a soluion of [ ] W( ) ξ= Φ( ) x x (5.76) hen he conrol uop () drives he sae from ( x ) ( x ). If u () is any oher conrol wih he same propery i.e. ( x ) ( x ) hen op = op op L u u () u () d u u() d. (5.77) Moreover if W ( is nonsingular hen op L[ ] op op [ ] [ ] u = u () u () d = x Φ( ) x W ( ) x Φ( ) x Theorem:. (5.78) Suppose ha L () = L () and ha here exiss a symmeric marix P ( such ha he soluion Π (; P ( ) ) of he differenial Riccai equaion P () A = () P () P () A () + PBB () () () P () L () (5.79) holds. Then (a) A differeniable rajecory x() defined on he inerval [ ] and saisfying x () = A () x () + Bu () () x ( ) = x x ( ) = x minimizes he cos J ( x u[ ] ) u ( ) u( ) + x ( ) L( ) x( ) d (5.8) if and only if i minimizes he coss ( [ ]) ( ) ( ) = L[ ] J x u v vd v (5.8 subjec o he consrains x () = A () BB () () Π (; P ( ) ) x () + Bv () () x ( ) = x x ( ) = x (5.8) L- 7/9

34-5 LINEAR SYSTEMS (b) Along any sae rajecory saisfying he boundary condiions we have J x u J x u x P x x P x (5.83) ( [ ] ) ( [ ] ) = + Π( ; ( ( From he Lemma in 5.7. we have J [ ] ( x u ) I u( ) u () x () d L ( ) x ( ) I u( ) u () x () d L ( ) x ( ) B ( ) P( ) u () + u () x () d PB () () P () A() P () P () A () x () x () P() x() + + () () u () I B P u () x () d x () P() x() PB () () P () A() P () P () A () L () x () + + + I B () P() u () u () x () d PB () () PBB () () () P () x () x () P() x() = u () + B() Px () () d+ x P ( ) x x P ( ) x Leing v (): = u () + B () Px () () (which proves (b)) we obain he equivalen problem of minimizing he norm of v subjec o he consrain x () = A () x () + B () v () B() Px () () = A () BB () () P () x () + Bv ()() x ( ) = x x ( ) = x (5.84) The soluion of which is given by he above lemma i.e. v () =B() Φ ( ) W ( ) Φ( ) x x (or oher sae feedback forms obained) op [ ] where he sae ransiion marix and he backward conrollabiliy Grammian correspond o he sysem (5.84) wih he "new A marix" A () BB () () P (). Once v () is compued u () can be solved for. op op L- 8/9

34-5 LINEAR SYSTEMS Noe: The relaionship beween v () and u () is shown in he block diagram below. v () - + u () + B () s I + x () A () B () Π(; P( ) ) L- 9/9