GRASP in Switching Input Optimal Control Synthesis

Size: px
Start display at page:

Download "GRASP in Switching Input Optimal Control Synthesis"

Transcription

1 MIC th Metaheuristics International Conference 381 GRASP in Switching Input Optimal Control Synthesis Paola Festa Giancarlo Raiconi Dept. of Mathematics and Computer Science, University of Salerno Via S. Allende, Baronissi (SA), Italy 1 Switching input control In this paper, several optimal control problems are introduced, referred to the switching input control scheme. The system is characterized by several control variables and at any time only one of them can be acted. At any time, the control decision involves the input variable to be activated and its value. Under suitable assumption about both the finite time and the infinite time but periodic case, the optimal control can be computed as standard feedback law, once the switching policy is defined. The problem turns in a high dimension discrete problem and a search procedure can be used to solve it. Following the line used in the companion problem of optimal selection of output variables for filtering purpose [1], here we apply the meta-heuristic Greedy Randomized Search Procedure with good results. The system is modeled as linear invariant discrete time, deteristic or stochastic. In the deteristic case, we suppose that the state if fully observed: x(k +1) = Ax(k)+b(k)u(k) (1) x(0) = x 0,x(k) R n,k =0, 1, 2..., A R n n,b(k) R n (2) In the stochastic case, the state can be either fully observed or partially observed with possible output error: x(k +1) = Ax(k)+b(k)u(k)+w(k) (3) y(k) = Cx(k)+v(k) (4) E(x(0)) = 0, E(x(0)x T (0)) = P 0 R n n (5) w(k) R n,e(w(k)) = 0, E(w(k)w T (j)) = Qδ kj (6) C R p n,v(k) R p,e(v(k)) = 0, E(v(k)v T (j)) = Rδ kj (7) E(w(k)x T (0)) = 0,E(v(k)x T (0)) = 0, E(v(k)w T (j)) = 0, (8) where P 0,Qare assigned symmetric semidefinite n n covariance matrices and R is a symmetric definite positive p p covariance matrix. In all cases, the term b(k)u(k) represents the switching control term. In particular, b(k) R n represents the input variable to be activated at time k. It is a vector that at any time can assume only a value in a finite preassigned set b(k) b 1,b 2,..., b q},b i R n. The real scalar variable u(k) represents the value of the input variable itself. An alternative description of the adopted control scheme can be given by introducing binary control variables α i (k),i =1, 2,..., q. In this case, the control action consists of detering at any time k which ones among the α i -s have to be set to 1 and the value of the control u. q q b(k)u(k) = α i (k)b i u(k); α i (k) 0, 1}, α i (k) =1k =0, 1, 2... (9) i=1 i=1

2 382 MIC th Metaheuristics International Conference 2 Finite time optimal control The optimality criterion is a quadratic error state added to proper control energy function. In the finite time deteristic case, the problem is } N 1 J = u 2 (k)+x T (k)mx(k)+x T (N)W N x(n) (10) k =0, 1,..., N 1 under constraints defined by equations (1),(2),(9). Note that fixed symmetric semidefinite positive n n matrices are involved. The problem can be easily transformed into a combinatorial optimization problem by first imizing with respect to the control variable u(.) and then with respect switching variables α s. Once the sequence α i (k),k =0, 1,...N 1 is fixed, the problem is a classical linear quadratic control problem for the time varying system (1)-(2). For any k the optimal control is given in the following feedback form. û(k) =b T (k)w (k)x(k), (11) where W (k) satisfies the discrete time Riccati equation: W (k) =A T W (k +1)A + N (AT W (k +1)A + N)b T (k)b(k)(a T W (k +1))A + N) 1+b T (k)(a T (12) W (k +1)A + N)b(k) with endpoint condition W (N) =W N. The corresponding cost function is given by J = u(.) J(α, u) =J(α, û) =xt (0)W (0)x(0). (13) Summarizing, the problem is reduced now to the following discrete one. α i (k) k =0, 1, 2,..., N 1 J = x T 0 W (0)x 0 } under constraints (9). Since such a formulation requires the aprioriknowledge of the initial state, it is not suitable for on-line applications, where the initial state is usually unknown. In these cases, it is useful to change the cost function to a scalar measure of the symmetric semidefinite positive matrix W (0) such as the trace or the deterant. To imize such indexes is equivalent to maximize quality indexes introduced by Muller and Weber [6] for controllability of deteristic systems. Indeed, by defining as in [6] the scalar function of eigenvalues of the symmetric semidefinite positive matrix W, µ s (W )= ( n i=1 1 n λs i (W )) 1 s,wehave: J 1 = µ 1 (W 1 n )= Tr(W ),J 0 = µ 0 (W 1 ) = lim µ 0 (W 1 )= W 1 n. s Finite time stochastic optimal control In the stochastic case, the system is described by equation (3) and the optimization criterion is equivalent to (7). k =0, 1,..., N 1 E(J) =E [ N 1 u 2 (k)+x T (k)mx(k)+x T (N)W N x(n) Given any valid sequence of α-s, the optimal control is always expressed by formulas (11),(12) and the criterion value is given by ( ) J = N 1 E [J(α, u)] = E [J(α, û)] = Tr P u(.) 0 W (0) + W (k)q. (15) ]} (14)

3 MIC th Metaheuristics International Conference 383 In the presence of incomplete state information it can be used the separation principle, which is valid under further hypothesis of Gaussian disturbances. In this case, û(.) can be expressed as function of the optimal Kalman filter estimate of the state û(k) =b T (k)w (k) x(k), and the value J of optimality criterion depends also from the covariance of filter Discrete formulation Even in the stochastic case, the final problem consists of imizing a function J or J of 0 1variables α i (k),i=1, 2,..p, k =0, 1,...N 1 subject to constraints (9). If the time horizon is very long, the size of the solution space (q N ) can be very big. Moreover, computation of performance index is somewhat costly in terms of computational effort, so an optimization method able to find an approximate solution with a relatively small number of function evaluations seems to be adequate. A further characteristic observed in the behavior of the particular problem is the presence of many local ima in such a way that a classical local search procedure cannot give good results. Our approach is to use a randomized multi-start heuristic algorithm in order to limit the number of function evaluations. 3 Infinite time horizon problems (periodic) In the stochastic case, the optimal performance index (11) diverges when N and two possible approaches can be followed: either to introduce a discount factor 0 <β<1 in the index : k =0, 1,... lim N [ N E(J) =E β k ( u 2 (k)+x T (k)mx(k) )]} (16) or to consider the average cost per stage: k =0, 1,..., lim N [ E(J) N = 1 N 1 N E u 2 (k)+x T (k)mx(k) ]} (17) Whatever method we chose, it can be shown that the limits on (16) and (17) converge under suitable hypotheses on system matrices and on the sequence α i (k) (i.e. the sequence is such that the system A, b(k) is stabilizable in time varying sense[2]). Therefore, at least in principle, the same reasoning used for the finite time case can be extended to infinite time case. Unfortunately, such an approach seems not applicable in practice because of two issues: 1. The sequence α i (k) is infinite, so it does not make any sense to find imum with any search procedure. 2. Since the optimal sequence α i (k) is infinite, it is not practically possible to embed the optimal switching sequence in the feedback controller. From a practical point of view, it is much more attractive to have a periodic switching policy, that can be easily embedded in the controller programg. Moreover, choosing a sufficiently long period, the performance index related to any non periodic sequence can be strictly approached. The formulation of the infinite time periodic optimal control problem, (stochastic problem with complete state information) is the following. k =0, 1,..., N 1 lim m 1 N E mn 1 k=(m 1)N u 2 (k)+x T (k)mx(k) (18)

4 384 MIC th Metaheuristics International Conference It is easy to show that if the periodic system A, b(k),k =0, 1, 2,,,N 1 is stabilizable [3], then it holds that Assug as teral condition any symmetric semidefinite positive matrix W N,thesolution of equation (12) converges to a periodic symmetric definite positive matrix sequence W (k) as k, independently from W N ; W (k) satisfies Riccati equation with periodic two point boundary conditions W (0) = W (N); The average cost per stage computed with the optimal control corresponding to the steady state matrix û(k) =b T (k)w (k)x(k) isgivenby J p (α) = lim m 1 N E mn 1 k=(m 1)N u 2 (k)+x T (k)mx(k) = 1 N N 1 Tr(W (k)q). (19) Even in this case, the problem is reduced to imize a function with respect to variables α s under constraints (9). In this case, to compute a function is more complicated because of the splitted boundary conditions. In fact, the most straightforward way to compute the periodic function W (k) is to iterate the backward Riccati equation starting from a semidefinite positive matrix until convergence to a periodic solution is attained. In most cases, this occurs in relatively few iterations, but if the sequence A, b(k) is scarcely controllable, convergence can be extremely slow (if A, b(k) is not detectable equation does not converge at all). Fortunately, in such cases the matrix solution has a growing value of trace and such phenomenon can be used in search procedure to stop computation, saving a lot of time. In the incomplete information case, the situation is even more complex because function value depends also from the steady state value of the covariance of the filter and then a function evaluation involves iteration until convergence of the backward Riccati equation for controller and of the forward Riccati equation for the filter. 4 GRASP A Greedy Randomized Adaptive Search Procedure (GRASP) is a metaheuristic for finding approximate solutions for difficult combinatorial problems. GRASP is a multistart method characterized by two phases: a construction phase and a local search also known as local improvement phase. During the construction phase a feasible solution is iteratively constructed. One element at time is randomly chosen from a Restricted Candidate List (RCL), whose elements are sorted according to some greedy criterion, and added to the building admissible solution. As the found solution could not be locally optimal with respect to the adopted neighborhood definition, the local search phase tries to improve it. These two phases are iterated and the best solution found is kept as an approximation of the optimal one. GRASP has been proposed in 1989 by Feo and Resende in [4]. Since 1989 numerous papers on the basic aspects of GRASP, as well as enhancements to the basic metaheuristic have been appeared in the literature. GRASP has been applied to a wide range of combinatorial optimization problems, ranging from scheduling and routing to drawing and turbine balancing, as reported in a very recent annotated bibliography of the GRASP literature[5], due to Festa and Resende. Greedy construction phase: At eachgrasp constructionphase a starting point for the local search procedure is built by following the first N steps of the sequential greedy procedure. Any step of Riccati equation iteration can be spilt into two steps: W (k) =A T W (k +1)A + N L(k, b(k)) L(k, b(k)) = (AT W (k+1)a+n)b(k)b T (k)(a T W (k+1)a+n) 1+b T (k)(a T W (k+1)a+n)b(k). The greedy criterion is to maximize the trace of term L, which is subtracted to the quadratic cost matrix. GRASP construction phase paradigm requires to choose at any step a member of the

5 MIC th Metaheuristics International Conference 385 RCL defined on the basis of the adopted greedy criterion. In our case, the complete procedure to choose the randomized greedy initial α for a fixed N is the following construction phase: 1 choose a large value for W N, for example 1 ɛ I,set0<λ<1 put k = N 2 for i =1, 2,..., q, compute c i = Tr(L, b i )andsetc m =c i },c M =maxc i } 3 choose at random j 1, 2,..., q} such that c M λ(c M c m ) c j,setα l (k 1) = δ jl 4 if k>1, compute W (k 1), put k = k 1, and go to step 2, else exit with the matrix α ls } full. Local search phase: Onceastartingpointα is found, it must be verified if it is locally optimal and possibly an improving solution must be searched in a prescribed neighborhood of α. For our purpose, a distance (α, η) must be introduced in order to measure the similarity of two different candidate solutions α, η. The definition of depends on the physical meaning of relevant parameters. We used the Hamg distance defined as H (α, η) = 1 2 q i=1 N j=1 α i(j) η i (j). In our application, H measures the number of different columns between matrices α and β. Actually, as well as other similarly defined metrics, H compares two strings of length N and is unable to recognize equivalent sequences 1, but this is not a drawback when used for a local search procedure. For designing an efficient local search phase, the size of the neighborhood must be not too large, because this phase is performed iteratively starting from a great number of initial values. The whole GRASP algorithm pseudo-code is reported in the following. 1 Read systems data. Put: it =0andĴ equal to a large number; 2 Build an initial GRASP sequence α 0 ; 3 Find. of J on a neighborhood of α 0.Put:α arg = α (α, α 0 ) s} J(α)},J = J(α ); 4 put it = it +1ifJ < Ĵ put Ĵ = J, α = α ; 5 if it = maxit stop and return the solution α else go to Step 2. Numerical experiments carried out by applying the proposed metaheuristic based algorithm have given very good results, compared with a standard local search method which suffers to fall in local ima. References [1] Festa P., Raiconi G.: Using GRASP for choosing best periodic observation strategy in stochastic systems filtering. to appear on: Cooperative Control and Optimization, P.Pardalos&Eds.Kluwer A.P [2] Anderson B.D.O., Moore J.B.: Detectability and stabilizability of time varying discrete time linear systems, SIAM Journal on Control and Optimization. 19, (1981), pp [3] Bittanti S., Colaneri P.: De Nicolao G.: The difference periodic Riccati equation for the periodic prediction problem. IEEE Transaction on Automatic Control, 33, (1988), pp [4] Feo T.A. Resende M.G.C.: A probabilistic heuristic for computationally difficult set covering problems. Operation Research Letters, 8, (1989), pp [5] Festa P., Resende M.G.C: GRASP: A bibliogaphy, to appear on: Essays and surveys on metaheuristic, P. Hansen, C. C. Rebeiro (eds.), Kluwer A. P., 2001 [6] Muller P.C., Weber H. I.: Analysis and optimization of certain qualities of controllobility and observability for linear dynamical systems, Automatica, 8, (1972), In our application, two sequences are equivalent if cyclically shifted, i.e.: β ij = α ij+ν, j =1,.., N ν and β in ν+j = α ij j =1, 2,..ν.

6

Optimal control and estimation

Optimal control and estimation Automatic Control 2 Optimal control and estimation Prof. Alberto Bemporad University of Trento Academic year 2010-2011 Prof. Alberto Bemporad (University of Trento) Automatic Control 2 Academic year 2010-2011

More information

Linear-Quadratic Optimal Control: Full-State Feedback

Linear-Quadratic Optimal Control: Full-State Feedback Chapter 4 Linear-Quadratic Optimal Control: Full-State Feedback 1 Linear quadratic optimization is a basic method for designing controllers for linear (and often nonlinear) dynamical systems and is actually

More information

A Study of the Duality between Kalman Filters and LQR Problems

A Study of the Duality between Kalman Filters and LQR Problems Purdue University Purdue e-pubs Department of Electrical and Computer Engineering Technical Reports Department of Electrical and Computer Engineering 11-3-2016 A Study of the Duality between Kalman Filters

More information

Stochastic Tube MPC with State Estimation

Stochastic Tube MPC with State Estimation Proceedings of the 19th International Symposium on Mathematical Theory of Networks and Systems MTNS 2010 5 9 July, 2010 Budapest, Hungary Stochastic Tube MPC with State Estimation Mark Cannon, Qifeng Cheng,

More information

HYBRIDIZATIONS OF GRASP WITH PATH-RELINKING FOR THE FAR FROM MOST STRING PROBLEM

HYBRIDIZATIONS OF GRASP WITH PATH-RELINKING FOR THE FAR FROM MOST STRING PROBLEM HYBRIDIZATIONS OF GRASP WITH PATH-RELINKING FOR THE FAR FROM MOST STRING PROBLEM DANIELE FERONE, PAOLA FESTA, AND MAURICIO G.C. RESENDE Abstract. Among the sequence selection and comparison problems, the

More information

Multi-objective Quadratic Assignment Problem instances generator with a known optimum solution

Multi-objective Quadratic Assignment Problem instances generator with a known optimum solution Multi-objective Quadratic Assignment Problem instances generator with a known optimum solution Mădălina M. Drugan Artificial Intelligence lab, Vrije Universiteit Brussel, Pleinlaan 2, B-1050 Brussels,

More information

An Empirical Algorithm for Relative Value Iteration for Average-cost MDPs

An Empirical Algorithm for Relative Value Iteration for Average-cost MDPs 2015 IEEE 54th Annual Conference on Decision and Control CDC December 15-18, 2015. Osaka, Japan An Empirical Algorithm for Relative Value Iteration for Average-cost MDPs Abhishek Gupta Rahul Jain Peter

More information

IMPROVED MPC DESIGN BASED ON SATURATING CONTROL LAWS

IMPROVED MPC DESIGN BASED ON SATURATING CONTROL LAWS IMPROVED MPC DESIGN BASED ON SATURATING CONTROL LAWS D. Limon, J.M. Gomes da Silva Jr., T. Alamo and E.F. Camacho Dpto. de Ingenieria de Sistemas y Automática. Universidad de Sevilla Camino de los Descubrimientos

More information

Decentralized LQG Control of Systems with a Broadcast Architecture

Decentralized LQG Control of Systems with a Broadcast Architecture Decentralized LQG Control of Systems with a Broadcast Architecture Laurent Lessard 1,2 IEEE Conference on Decision and Control, pp. 6241 6246, 212 Abstract In this paper, we consider dynamical subsystems

More information

GRASP A speedy introduction

GRASP A speedy introduction GRASP A speedy introduction thst@man..dtu.dk DTU-Management Technical University of Denmark 1 GRASP GRASP is an abbreviation for Greedy Randomized Adaptive Search Procedure. It was invented by Feo and

More information

Value and Policy Iteration

Value and Policy Iteration Chapter 7 Value and Policy Iteration 1 For infinite horizon problems, we need to replace our basic computational tool, the DP algorithm, which we used to compute the optimal cost and policy for finite

More information

Lecture 5: Control Over Lossy Networks

Lecture 5: Control Over Lossy Networks Lecture 5: Control Over Lossy Networks Yilin Mo July 2, 2015 1 Classical LQG Control The system: x k+1 = Ax k + Bu k + w k, y k = Cx k + v k x 0 N (0, Σ), w k N (0, Q), v k N (0, R). Information available

More information

Linear Quadratic Zero-Sum Two-Person Differential Games Pierre Bernhard June 15, 2013

Linear Quadratic Zero-Sum Two-Person Differential Games Pierre Bernhard June 15, 2013 Linear Quadratic Zero-Sum Two-Person Differential Games Pierre Bernhard June 15, 2013 Abstract As in optimal control theory, linear quadratic (LQ) differential games (DG) can be solved, even in high dimension,

More information

Square-Root Algorithms of Recursive Least-Squares Wiener Estimators in Linear Discrete-Time Stochastic Systems

Square-Root Algorithms of Recursive Least-Squares Wiener Estimators in Linear Discrete-Time Stochastic Systems Proceedings of the 17th World Congress The International Federation of Automatic Control Square-Root Algorithms of Recursive Least-Squares Wiener Estimators in Linear Discrete-Time Stochastic Systems Seiichi

More information

6.231 DYNAMIC PROGRAMMING LECTURE 9 LECTURE OUTLINE

6.231 DYNAMIC PROGRAMMING LECTURE 9 LECTURE OUTLINE 6.231 DYNAMIC PROGRAMMING LECTURE 9 LECTURE OUTLINE Rollout algorithms Policy improvement property Discrete deterministic problems Approximations of rollout algorithms Model Predictive Control (MPC) Discretization

More information

Temporal-Difference Q-learning in Active Fault Diagnosis

Temporal-Difference Q-learning in Active Fault Diagnosis Temporal-Difference Q-learning in Active Fault Diagnosis Jan Škach 1 Ivo Punčochář 1 Frank L. Lewis 2 1 Identification and Decision Making Research Group (IDM) European Centre of Excellence - NTIS University

More information

Contents lecture 5. Automatic Control III. Summary of lecture 4 (II/II) Summary of lecture 4 (I/II) u y F r. Lecture 5 H 2 and H loop shaping

Contents lecture 5. Automatic Control III. Summary of lecture 4 (II/II) Summary of lecture 4 (I/II) u y F r. Lecture 5 H 2 and H loop shaping Contents lecture 5 Automatic Control III Lecture 5 H 2 and H loop shaping Thomas Schön Division of Systems and Control Department of Information Technology Uppsala University. Email: thomas.schon@it.uu.se,

More information

A Comparison of Fixed Point Designs And Time-varying Observers for Scheduling Repetitive Controllers

A Comparison of Fixed Point Designs And Time-varying Observers for Scheduling Repetitive Controllers Proceedings of the 46th IEEE Conference on Decision and Control New Orleans, LA, USA, Dec. 12-14, 27 A Comparison of Fixed Point Designs And Time-varying Observers for Scheduling Repetitive Controllers

More information

Linear Riccati Dynamics, Constant Feedback, and Controllability in Linear Quadratic Control Problems

Linear Riccati Dynamics, Constant Feedback, and Controllability in Linear Quadratic Control Problems Linear Riccati Dynamics, Constant Feedback, and Controllability in Linear Quadratic Control Problems July 2001 Ronald J. Balvers Douglas W. Mitchell Department of Economics Department of Economics P.O.

More information

Structured State Space Realizations for SLS Distributed Controllers

Structured State Space Realizations for SLS Distributed Controllers Structured State Space Realizations for SLS Distributed Controllers James Anderson and Nikolai Matni Abstract In recent work the system level synthesis (SLS) paradigm has been shown to provide a truly

More information

An LQ R weight selection approach to the discrete generalized H 2 control problem

An LQ R weight selection approach to the discrete generalized H 2 control problem INT. J. CONTROL, 1998, VOL. 71, NO. 1, 93± 11 An LQ R weight selection approach to the discrete generalized H 2 control problem D. A. WILSON², M. A. NEKOUI² and G. D. HALIKIAS² It is known that a generalized

More information

Denis ARZELIER arzelier

Denis ARZELIER   arzelier COURSE ON LMI OPTIMIZATION WITH APPLICATIONS IN CONTROL PART II.2 LMIs IN SYSTEMS CONTROL STATE-SPACE METHODS PERFORMANCE ANALYSIS and SYNTHESIS Denis ARZELIER www.laas.fr/ arzelier arzelier@laas.fr 15

More information

Positive semidefinite matrix approximation with a trace constraint

Positive semidefinite matrix approximation with a trace constraint Positive semidefinite matrix approximation with a trace constraint Kouhei Harada August 8, 208 We propose an efficient algorithm to solve positive a semidefinite matrix approximation problem with a trace

More information

Linear-Quadratic-Gaussian (LQG) Controllers and Kalman Filters

Linear-Quadratic-Gaussian (LQG) Controllers and Kalman Filters Linear-Quadratic-Gaussian (LQG) Controllers and Kalman Filters Emo Todorov Applied Mathematics and Computer Science & Engineering University of Washington Winter 204 Emo Todorov (UW) AMATH/CSE 579, Winter

More information

5. Simulated Annealing 5.1 Basic Concepts. Fall 2010 Instructor: Dr. Masoud Yaghini

5. Simulated Annealing 5.1 Basic Concepts. Fall 2010 Instructor: Dr. Masoud Yaghini 5. Simulated Annealing 5.1 Basic Concepts Fall 2010 Instructor: Dr. Masoud Yaghini Outline Introduction Real Annealing and Simulated Annealing Metropolis Algorithm Template of SA A Simple Example References

More information

Neural Networks and Deep Learning

Neural Networks and Deep Learning Neural Networks and Deep Learning Professor Ameet Talwalkar November 12, 2015 Professor Ameet Talwalkar Neural Networks and Deep Learning November 12, 2015 1 / 16 Outline 1 Review of last lecture AdaBoost

More information

Reinforcement Learning

Reinforcement Learning Reinforcement Learning March May, 2013 Schedule Update Introduction 03/13/2015 (10:15-12:15) Sala conferenze MDPs 03/18/2015 (10:15-12:15) Sala conferenze Solving MDPs 03/20/2015 (10:15-12:15) Aula Alpha

More information

Research Article Indefinite LQ Control for Discrete-Time Stochastic Systems via Semidefinite Programming

Research Article Indefinite LQ Control for Discrete-Time Stochastic Systems via Semidefinite Programming Mathematical Problems in Engineering Volume 2012, Article ID 674087, 14 pages doi:10.1155/2012/674087 Research Article Indefinite LQ Control for Discrete-Time Stochastic Systems via Semidefinite Programming

More information

A COMPUTATIONAL APPROACH FOR OPTIMAL PERIODIC OUTPUT FEEDBACK CONTROL

A COMPUTATIONAL APPROACH FOR OPTIMAL PERIODIC OUTPUT FEEDBACK CONTROL A COMPUTATIONAL APPROACH FOR OPTIMAL PERIODIC OUTPUT FEEDBACK CONTROL A Varga and S Pieters DLR - Oberpfaffenhofen German Aerospace Research Establishment Institute for Robotics and System Dynamics POB

More information

Optimal Distributed Lainiotis Filter

Optimal Distributed Lainiotis Filter Int. Journal of Math. Analysis, Vol. 3, 2009, no. 22, 1061-1080 Optimal Distributed Lainiotis Filter Nicholas Assimakis Department of Electronics Technological Educational Institute (T.E.I.) of Lamia 35100

More information

Approximate dynamic programming for stochastic reachability

Approximate dynamic programming for stochastic reachability Approximate dynamic programming for stochastic reachability Nikolaos Kariotoglou, Sean Summers, Tyler Summers, Maryam Kamgarpour and John Lygeros Abstract In this work we illustrate how approximate dynamic

More information

Approximate active fault detection and control

Approximate active fault detection and control Approximate active fault detection and control Jan Škach Ivo Punčochář Miroslav Šimandl Department of Cybernetics Faculty of Applied Sciences University of West Bohemia Pilsen, Czech Republic 11th European

More information

Time Varying Optimal Control with Packet Losses.

Time Varying Optimal Control with Packet Losses. Time Varying Optimal Control with Packet Losses. Bruno Sinopoli, Luca Schenato, Massimo Franceschetti, Kameshwar Poolla, Shankar S. Sastry Department of Electrical Engineering and Computer Sciences University

More information

Linear-quadratic-Gaussian control of linear system with Rayleigh flat fading channel

Linear-quadratic-Gaussian control of linear system with Rayleigh flat fading channel 30 11 2013 11 DOI: 10.7641/CTA.2013.21325 Control Theory & Applications Vol. 30 No. 11 Nov. 2013, ( ;, 310027) :. (LQG),.,,,.,,,.. : LQG ; ; ; : TP273 : A Linear-quadratic-Gaussian control of linear system

More information

Linear Discrimination Functions

Linear Discrimination Functions Laurea Magistrale in Informatica Nicola Fanizzi Dipartimento di Informatica Università degli Studi di Bari November 4, 2009 Outline Linear models Gradient descent Perceptron Minimum square error approach

More information

Linear Systems. Manfred Morari Melanie Zeilinger. Institut für Automatik, ETH Zürich Institute for Dynamic Systems and Control, ETH Zürich

Linear Systems. Manfred Morari Melanie Zeilinger. Institut für Automatik, ETH Zürich Institute for Dynamic Systems and Control, ETH Zürich Linear Systems Manfred Morari Melanie Zeilinger Institut für Automatik, ETH Zürich Institute for Dynamic Systems and Control, ETH Zürich Spring Semester 2016 Linear Systems M. Morari, M. Zeilinger - Spring

More information

Prediction, filtering and smoothing using LSCR: State estimation algorithms with guaranteed confidence sets

Prediction, filtering and smoothing using LSCR: State estimation algorithms with guaranteed confidence sets 2 5th IEEE Conference on Decision and Control and European Control Conference (CDC-ECC) Orlando, FL, USA, December 2-5, 2 Prediction, filtering and smoothing using LSCR: State estimation algorithms with

More information

SIMULATION OF TURNING RATES IN TRAFFIC SYSTEMS

SIMULATION OF TURNING RATES IN TRAFFIC SYSTEMS SIMULATION OF TURNING RATES IN TRAFFIC SYSTEMS Balázs KULCSÁR István VARGA Department of Transport Automation, Budapest University of Technology and Economics Budapest, H-, Bertalan L. u. 2., Hungary e-mail:

More information

IEOR 265 Lecture 14 (Robust) Linear Tube MPC

IEOR 265 Lecture 14 (Robust) Linear Tube MPC IEOR 265 Lecture 14 (Robust) Linear Tube MPC 1 LTI System with Uncertainty Suppose we have an LTI system in discrete time with disturbance: x n+1 = Ax n + Bu n + d n, where d n W for a bounded polytope

More information

EL2520 Control Theory and Practice

EL2520 Control Theory and Practice EL2520 Control Theory and Practice Lecture 8: Linear quadratic control Mikael Johansson School of Electrical Engineering KTH, Stockholm, Sweden Linear quadratic control Allows to compute the controller

More information

Disturbance Attenuation Properties for Discrete-Time Uncertain Switched Linear Systems

Disturbance Attenuation Properties for Discrete-Time Uncertain Switched Linear Systems Disturbance Attenuation Properties for Discrete-Time Uncertain Switched Linear Systems Hai Lin Department of Electrical Engineering University of Notre Dame Notre Dame, IN 46556, USA Panos J. Antsaklis

More information

Linear Riccati Dynamics, Constant Feedback, and Controllability in Linear Quadratic Control Problems

Linear Riccati Dynamics, Constant Feedback, and Controllability in Linear Quadratic Control Problems Linear Riccati Dynamics, Constant Feedback, and Controllability in Linear Quadratic Control Problems July 2001 Revised: December 2005 Ronald J. Balvers Douglas W. Mitchell Department of Economics Department

More information

A Hybrid Data Mining Metaheuristic for the p-median Problem

A Hybrid Data Mining Metaheuristic for the p-median Problem A Hybrid Data Mining Metaheuristic for the p-median Problem Alexandre Plastino Erick R. Fonseca Richard Fuchshuber Simone de L. Martins Alex A. Freitas Martino Luis Said Salhi Abstract Metaheuristics represent

More information

OPTIMAL FUSION OF SENSOR DATA FOR DISCRETE KALMAN FILTERING Z. G. FENG, K. L. TEO, N. U. AHMED, Y. ZHAO, AND W. Y. YAN

OPTIMAL FUSION OF SENSOR DATA FOR DISCRETE KALMAN FILTERING Z. G. FENG, K. L. TEO, N. U. AHMED, Y. ZHAO, AND W. Y. YAN Dynamic Systems and Applications 16 (2007) 393-406 OPTIMAL FUSION OF SENSOR DATA FOR DISCRETE KALMAN FILTERING Z. G. FENG, K. L. TEO, N. U. AHMED, Y. ZHAO, AND W. Y. YAN College of Mathematics and Computer

More information

A Stable Block Model Predictive Control with Variable Implementation Horizon

A Stable Block Model Predictive Control with Variable Implementation Horizon American Control Conference June 8-,. Portland, OR, USA WeB9. A Stable Block Model Predictive Control with Variable Implementation Horizon Jing Sun, Shuhao Chen, Ilya Kolmanovsky Abstract In this paper,

More information

Stabilization of Discrete-Time Switched Linear Systems: A Control-Lyapunov Function Approach

Stabilization of Discrete-Time Switched Linear Systems: A Control-Lyapunov Function Approach Stabilization of Discrete-Time Switched Linear Systems: A Control-Lyapunov Function Approach Wei Zhang 1, Alessandro Abate 2 and Jianghai Hu 1 1 School of Electrical and Computer Engineering, Purdue University,

More information

Enlarged terminal sets guaranteeing stability of receding horizon control

Enlarged terminal sets guaranteeing stability of receding horizon control Enlarged terminal sets guaranteeing stability of receding horizon control J.A. De Doná a, M.M. Seron a D.Q. Mayne b G.C. Goodwin a a School of Electrical Engineering and Computer Science, The University

More information

Vehicle Routing and Scheduling. Martin Savelsbergh The Logistics Institute Georgia Institute of Technology

Vehicle Routing and Scheduling. Martin Savelsbergh The Logistics Institute Georgia Institute of Technology Vehicle Routing and Scheduling Martin Savelsbergh The Logistics Institute Georgia Institute of Technology Vehicle Routing and Scheduling Part II: Algorithmic Enhancements Handling Practical Complexities

More information

Adaptive State Feedback Nash Strategies for Linear Quadratic Discrete-Time Games

Adaptive State Feedback Nash Strategies for Linear Quadratic Discrete-Time Games Adaptive State Feedbac Nash Strategies for Linear Quadratic Discrete-Time Games Dan Shen and Jose B. Cruz, Jr. Intelligent Automation Inc., Rocville, MD 2858 USA (email: dshen@i-a-i.com). The Ohio State

More information

Predictive Iterative Learning Control using Laguerre Functions

Predictive Iterative Learning Control using Laguerre Functions Milano (Italy) August 28 - September 2, 211 Predictive Iterative Learning Control using Laguerre Functions Liuping Wang Eric Rogers School of Electrical and Computer Engineering, RMIT University, Victoria

More information

Here represents the impulse (or delta) function. is an diagonal matrix of intensities, and is an diagonal matrix of intensities.

Here represents the impulse (or delta) function. is an diagonal matrix of intensities, and is an diagonal matrix of intensities. 19 KALMAN FILTER 19.1 Introduction In the previous section, we derived the linear quadratic regulator as an optimal solution for the fullstate feedback control problem. The inherent assumption was that

More information

Nonlinear Model Predictive Control Tools (NMPC Tools)

Nonlinear Model Predictive Control Tools (NMPC Tools) Nonlinear Model Predictive Control Tools (NMPC Tools) Rishi Amrit, James B. Rawlings April 5, 2008 1 Formulation We consider a control system composed of three parts([2]). Estimator Target calculator Regulator

More information

Chapter 2 Event-Triggered Sampling

Chapter 2 Event-Triggered Sampling Chapter Event-Triggered Sampling In this chapter, some general ideas and basic results on event-triggered sampling are introduced. The process considered is described by a first-order stochastic differential

More information

QUALIFYING EXAM IN SYSTEMS ENGINEERING

QUALIFYING EXAM IN SYSTEMS ENGINEERING QUALIFYING EXAM IN SYSTEMS ENGINEERING Written Exam: MAY 23, 2017, 9:00AM to 1:00PM, EMB 105 Oral Exam: May 25 or 26, 2017 Time/Location TBA (~1 hour per student) CLOSED BOOK, NO CHEAT SHEETS BASIC SCIENTIFIC

More information

OUTPUT CONSENSUS OF HETEROGENEOUS LINEAR MULTI-AGENT SYSTEMS BY EVENT-TRIGGERED CONTROL

OUTPUT CONSENSUS OF HETEROGENEOUS LINEAR MULTI-AGENT SYSTEMS BY EVENT-TRIGGERED CONTROL OUTPUT CONSENSUS OF HETEROGENEOUS LINEAR MULTI-AGENT SYSTEMS BY EVENT-TRIGGERED CONTROL Gang FENG Department of Mechanical and Biomedical Engineering City University of Hong Kong July 25, 2014 Department

More information

Piecewise Constant Prediction

Piecewise Constant Prediction Piecewise Constant Prediction Erik Ordentlich Information heory Research Hewlett-Packard Laboratories Palo Alto, CA 94304 Email: erik.ordentlich@hp.com Marcelo J. Weinberger Information heory Research

More information

Flow Shop and Job Shop Models

Flow Shop and Job Shop Models Outline DM87 SCHEDULING, TIMETABLING AND ROUTING Lecture 11 Flow Shop and Job Shop Models 1. Flow Shop 2. Job Shop Marco Chiarandini DM87 Scheduling, Timetabling and Routing 2 Outline Resume Permutation

More information

On Finding Optimal Policies for Markovian Decision Processes Using Simulation

On Finding Optimal Policies for Markovian Decision Processes Using Simulation On Finding Optimal Policies for Markovian Decision Processes Using Simulation Apostolos N. Burnetas Case Western Reserve University Michael N. Katehakis Rutgers University February 1995 Abstract A simulation

More information

DESIGNING A KALMAN FILTER WHEN NO NOISE COVARIANCE INFORMATION IS AVAILABLE. Robert Bos,1 Xavier Bombois Paul M. J. Van den Hof

DESIGNING A KALMAN FILTER WHEN NO NOISE COVARIANCE INFORMATION IS AVAILABLE. Robert Bos,1 Xavier Bombois Paul M. J. Van den Hof DESIGNING A KALMAN FILTER WHEN NO NOISE COVARIANCE INFORMATION IS AVAILABLE Robert Bos,1 Xavier Bombois Paul M. J. Van den Hof Delft Center for Systems and Control, Delft University of Technology, Mekelweg

More information

The ϵ-capacity of a gain matrix and tolerable disturbances: Discrete-time perturbed linear systems

The ϵ-capacity of a gain matrix and tolerable disturbances: Discrete-time perturbed linear systems IOSR Journal of Mathematics (IOSR-JM) e-issn: 2278-5728, p-issn: 2319-765X. Volume 11, Issue 3 Ver. IV (May - Jun. 2015), PP 52-62 www.iosrjournals.org The ϵ-capacity of a gain matrix and tolerable disturbances:

More information

Pattern Recognition Approaches to Solving Combinatorial Problems in Free Groups

Pattern Recognition Approaches to Solving Combinatorial Problems in Free Groups Contemporary Mathematics Pattern Recognition Approaches to Solving Combinatorial Problems in Free Groups Robert M. Haralick, Alex D. Miasnikov, and Alexei G. Myasnikov Abstract. We review some basic methodologies

More information

Cover Page. The handle holds various files of this Leiden University dissertation

Cover Page. The handle  holds various files of this Leiden University dissertation Cover Page The handle http://hdl.handle.net/1887/39637 holds various files of this Leiden University dissertation Author: Smit, Laurens Title: Steady-state analysis of large scale systems : the successive

More information

Reachability metrics for bilinear complex networks

Reachability metrics for bilinear complex networks Reachability metrics for bilinear complex networks Yingbo Zhao and Jorge Cortés Abstract Controllability metrics based on the controllability Gramian have been widely used in linear control theory, and

More information

1 The independent set problem

1 The independent set problem ORF 523 Lecture 11 Spring 2016, Princeton University Instructor: A.A. Ahmadi Scribe: G. Hall Tuesday, March 29, 2016 When in doubt on the accuracy of these notes, please cross chec with the instructor

More information

Optimal Control of Linear Systems with Stochastic Parameters for Variance Suppression: The Finite Time Case

Optimal Control of Linear Systems with Stochastic Parameters for Variance Suppression: The Finite Time Case Optimal Control of Linear Systems with Stochastic Parameters for Variance Suppression: The inite Time Case Kenji ujimoto Soraki Ogawa Yuhei Ota Makishi Nakayama Nagoya University, Department of Mechanical

More information

Performance Analysis of Distributed Tracking with Consensus on Noisy Time-varying Graphs

Performance Analysis of Distributed Tracking with Consensus on Noisy Time-varying Graphs Performance Analysis of Distributed Tracking with Consensus on Noisy Time-varying Graphs Yongxiang Ruan, Sudharman K. Jayaweera and Carlos Mosquera Abstract This paper considers a problem of distributed

More information

Adaptive Dual Control

Adaptive Dual Control Adaptive Dual Control Björn Wittenmark Department of Automatic Control, Lund Institute of Technology Box 118, S-221 00 Lund, Sweden email: bjorn@control.lth.se Keywords: Dual control, stochastic control,

More information

LMI Methods in Optimal and Robust Control

LMI Methods in Optimal and Robust Control LMI Methods in Optimal and Robust Control Matthew M. Peet Arizona State University Lecture 02: Optimization (Convex and Otherwise) What is Optimization? An Optimization Problem has 3 parts. x F f(x) :

More information

Tilburg University. An equivalence result in linear-quadratic theory van den Broek, W.A.; Engwerda, Jacob; Schumacher, Hans. Published in: Automatica

Tilburg University. An equivalence result in linear-quadratic theory van den Broek, W.A.; Engwerda, Jacob; Schumacher, Hans. Published in: Automatica Tilburg University An equivalence result in linear-quadratic theory van den Broek, W.A.; Engwerda, Jacob; Schumacher, Hans Published in: Automatica Publication date: 2003 Link to publication Citation for

More information

McGill University Department of Electrical and Computer Engineering

McGill University Department of Electrical and Computer Engineering McGill University Department of Electrical and Computer Engineering ECSE 56 - Stochastic Control Project Report Professor Aditya Mahajan Team Decision Theory and Information Structures in Optimal Control

More information

Christopher Watkins and Peter Dayan. Noga Zaslavsky. The Hebrew University of Jerusalem Advanced Seminar in Deep Learning (67679) November 1, 2015

Christopher Watkins and Peter Dayan. Noga Zaslavsky. The Hebrew University of Jerusalem Advanced Seminar in Deep Learning (67679) November 1, 2015 Q-Learning Christopher Watkins and Peter Dayan Noga Zaslavsky The Hebrew University of Jerusalem Advanced Seminar in Deep Learning (67679) November 1, 2015 Noga Zaslavsky Q-Learning (Watkins & Dayan, 1992)

More information

A NUMERICAL METHOD TO SOLVE A QUADRATIC CONSTRAINED MAXIMIZATION

A NUMERICAL METHOD TO SOLVE A QUADRATIC CONSTRAINED MAXIMIZATION A NUMERICAL METHOD TO SOLVE A QUADRATIC CONSTRAINED MAXIMIZATION ALIREZA ESNA ASHARI, RAMINE NIKOUKHAH, AND STEPHEN L. CAMPBELL Abstract. The problem of maximizing a quadratic function subject to an ellipsoidal

More information

CONSTRAINED MODEL PREDICTIVE CONTROL ON CONVEX POLYHEDRON STOCHASTIC LINEAR PARAMETER VARYING SYSTEMS. Received October 2012; revised February 2013

CONSTRAINED MODEL PREDICTIVE CONTROL ON CONVEX POLYHEDRON STOCHASTIC LINEAR PARAMETER VARYING SYSTEMS. Received October 2012; revised February 2013 International Journal of Innovative Computing, Information and Control ICIC International c 2013 ISSN 1349-4198 Volume 9, Number 10, October 2013 pp 4193 4204 CONSTRAINED MODEL PREDICTIVE CONTROL ON CONVEX

More information

Lecture 2: Discrete-time Linear Quadratic Optimal Control

Lecture 2: Discrete-time Linear Quadratic Optimal Control ME 33, U Berkeley, Spring 04 Xu hen Lecture : Discrete-time Linear Quadratic Optimal ontrol Big picture Example onvergence of finite-time LQ solutions Big picture previously: dynamic programming and finite-horizon

More information

dynamical zeta functions: what, why and what are the good for?

dynamical zeta functions: what, why and what are the good for? dynamical zeta functions: what, why and what are the good for? Predrag Cvitanović Georgia Institute of Technology November 2 2011 life is intractable in physics, no problem is tractable I accept chaos

More information

An Even Order Symmetric B Tensor is Positive Definite

An Even Order Symmetric B Tensor is Positive Definite An Even Order Symmetric B Tensor is Positive Definite Liqun Qi, Yisheng Song arxiv:1404.0452v4 [math.sp] 14 May 2014 October 17, 2018 Abstract It is easily checkable if a given tensor is a B tensor, or

More information

FIR Filters for Stationary State Space Signal Models

FIR Filters for Stationary State Space Signal Models Proceedings of the 17th World Congress The International Federation of Automatic Control FIR Filters for Stationary State Space Signal Models Jung Hun Park Wook Hyun Kwon School of Electrical Engineering

More information

Optimal Control. McGill COMP 765 Oct 3 rd, 2017

Optimal Control. McGill COMP 765 Oct 3 rd, 2017 Optimal Control McGill COMP 765 Oct 3 rd, 2017 Classical Control Quiz Question 1: Can a PID controller be used to balance an inverted pendulum: A) That starts upright? B) That must be swung-up (perhaps

More information

Networked Control Systems: Estimation and Control over Lossy Networks

Networked Control Systems: Estimation and Control over Lossy Networks Noname manuscript No. (will be inserted by the editor) Networked Control Systems: Estimation and Control over Lossy Networks João P. Hespanha Alexandre R. Mesquita the date of receipt and acceptance should

More information

A comparison of numerical methods for the. Solution of continuous-time DSGE models. Juan Carlos Parra Alvarez

A comparison of numerical methods for the. Solution of continuous-time DSGE models. Juan Carlos Parra Alvarez A comparison of numerical methods for the solution of continuous-time DSGE models Juan Carlos Parra Alvarez Department of Economics and Business, and CREATES Aarhus University, Denmark November 14, 2012

More information

Linear Quadratic Zero-Sum Two-Person Differential Games

Linear Quadratic Zero-Sum Two-Person Differential Games Linear Quadratic Zero-Sum Two-Person Differential Games Pierre Bernhard To cite this version: Pierre Bernhard. Linear Quadratic Zero-Sum Two-Person Differential Games. Encyclopaedia of Systems and Control,

More information

The norms can also be characterized in terms of Riccati inequalities.

The norms can also be characterized in terms of Riccati inequalities. 9 Analysis of stability and H norms Consider the causal, linear, time-invariant system ẋ(t = Ax(t + Bu(t y(t = Cx(t Denote the transfer function G(s := C (si A 1 B. Theorem 85 The following statements

More information

CCP Estimation. Robert A. Miller. March Dynamic Discrete Choice. Miller (Dynamic Discrete Choice) cemmap 6 March / 27

CCP Estimation. Robert A. Miller. March Dynamic Discrete Choice. Miller (Dynamic Discrete Choice) cemmap 6 March / 27 CCP Estimation Robert A. Miller Dynamic Discrete Choice March 2018 Miller Dynamic Discrete Choice) cemmap 6 March 2018 1 / 27 Criteria for Evaluating Estimators General principles to apply when assessing

More information

ESC794: Special Topics: Model Predictive Control

ESC794: Special Topics: Model Predictive Control ESC794: Special Topics: Model Predictive Control Discrete-Time Systems Hanz Richter, Professor Mechanical Engineering Department Cleveland State University Discrete-Time vs. Sampled-Data Systems A continuous-time

More information

ECE7850 Lecture 7. Discrete Time Optimal Control and Dynamic Programming

ECE7850 Lecture 7. Discrete Time Optimal Control and Dynamic Programming ECE7850 Lecture 7 Discrete Time Optimal Control and Dynamic Programming Discrete Time Optimal control Problems Short Introduction to Dynamic Programming Connection to Stabilization Problems 1 DT nonlinear

More information

Data Mining. Dimensionality reduction. Hamid Beigy. Sharif University of Technology. Fall 1395

Data Mining. Dimensionality reduction. Hamid Beigy. Sharif University of Technology. Fall 1395 Data Mining Dimensionality reduction Hamid Beigy Sharif University of Technology Fall 1395 Hamid Beigy (Sharif University of Technology) Data Mining Fall 1395 1 / 42 Outline 1 Introduction 2 Feature selection

More information

RECURSIVE ESTIMATION AND KALMAN FILTERING

RECURSIVE ESTIMATION AND KALMAN FILTERING Chapter 3 RECURSIVE ESTIMATION AND KALMAN FILTERING 3. The Discrete Time Kalman Filter Consider the following estimation problem. Given the stochastic system with x k+ = Ax k + Gw k (3.) y k = Cx k + Hv

More information

Education in Linear System Theory with the Geometric Approach Tools

Education in Linear System Theory with the Geometric Approach Tools Education in Linear System Theory with the Geometric Approach Tools Giovanni Marro DEIS, University of Bologna, Italy GAS Workshop - Sept 24, 2010 Politecnico di Milano G. Marro (University of Bologna)

More information

Chapter 1. Introduction. 1.1 Problem statement and examples

Chapter 1. Introduction. 1.1 Problem statement and examples Chapter 1 Introduction 1.1 Problem statement and examples In this course we will deal with optimization problems. Such problems appear in many practical settings in almost all aspects of science and engineering.

More information

Parameter Estimation in a Moving Horizon Perspective

Parameter Estimation in a Moving Horizon Perspective Parameter Estimation in a Moving Horizon Perspective State and Parameter Estimation in Dynamical Systems Reglerteknik, ISY, Linköpings Universitet State and Parameter Estimation in Dynamical Systems OUTLINE

More information

Relaxations and Randomized Methods for Nonconvex QCQPs

Relaxations and Randomized Methods for Nonconvex QCQPs Relaxations and Randomized Methods for Nonconvex QCQPs Alexandre d Aspremont, Stephen Boyd EE392o, Stanford University Autumn, 2003 Introduction While some special classes of nonconvex problems can be

More information

Discrete-Time H Gaussian Filter

Discrete-Time H Gaussian Filter Proceedings of the 17th World Congress The International Federation of Automatic Control Discrete-Time H Gaussian Filter Ali Tahmasebi and Xiang Chen Department of Electrical and Computer Engineering,

More information

STATE ESTIMATION IN COORDINATED CONTROL WITH A NON-STANDARD INFORMATION ARCHITECTURE. Jun Yan, Keunmo Kang, and Robert Bitmead

STATE ESTIMATION IN COORDINATED CONTROL WITH A NON-STANDARD INFORMATION ARCHITECTURE. Jun Yan, Keunmo Kang, and Robert Bitmead STATE ESTIMATION IN COORDINATED CONTROL WITH A NON-STANDARD INFORMATION ARCHITECTURE Jun Yan, Keunmo Kang, and Robert Bitmead Department of Mechanical & Aerospace Engineering University of California San

More information

CS181 Midterm 2 Practice Solutions

CS181 Midterm 2 Practice Solutions CS181 Midterm 2 Practice Solutions 1. Convergence of -Means Consider Lloyd s algorithm for finding a -Means clustering of N data, i.e., minimizing the distortion measure objective function J({r n } N n=1,

More information

Riccati difference equations to non linear extended Kalman filter constraints

Riccati difference equations to non linear extended Kalman filter constraints International Journal of Scientific & Engineering Research Volume 3, Issue 12, December-2012 1 Riccati difference equations to non linear extended Kalman filter constraints Abstract Elizabeth.S 1 & Jothilakshmi.R

More information

Reinforcement Learning. Introduction

Reinforcement Learning. Introduction Reinforcement Learning Introduction Reinforcement Learning Agent interacts and learns from a stochastic environment Science of sequential decision making Many faces of reinforcement learning Optimal control

More information

Exponential Moving Average Based Multiagent Reinforcement Learning Algorithms

Exponential Moving Average Based Multiagent Reinforcement Learning Algorithms Exponential Moving Average Based Multiagent Reinforcement Learning Algorithms Mostafa D. Awheda Department of Systems and Computer Engineering Carleton University Ottawa, Canada KS 5B6 Email: mawheda@sce.carleton.ca

More information

Iterative Methods for Solving A x = b

Iterative Methods for Solving A x = b Iterative Methods for Solving A x = b A good (free) online source for iterative methods for solving A x = b is given in the description of a set of iterative solvers called templates found at netlib: http

More information

Bayesian Networks Inference with Probabilistic Graphical Models

Bayesian Networks Inference with Probabilistic Graphical Models 4190.408 2016-Spring Bayesian Networks Inference with Probabilistic Graphical Models Byoung-Tak Zhang intelligence Lab Seoul National University 4190.408 Artificial (2016-Spring) 1 Machine Learning? Learning

More information

Auxiliary signal design for failure detection in uncertain systems

Auxiliary signal design for failure detection in uncertain systems Auxiliary signal design for failure detection in uncertain systems R. Nikoukhah, S. L. Campbell and F. Delebecque Abstract An auxiliary signal is an input signal that enhances the identifiability of a

More information