Zhongping Jiang Tengfei Liu

Size: px
Start display at page:

Download "Zhongping Jiang Tengfei Liu"

Transcription

1 Zhongpng Jang Tengfe Lu 4

2 F.L. Lews, NAI Moncref-O Donnell Char, UTA Research Insttute (UTARI) The Unversty of Texas at Arlngton, USA and Qan Ren Consultng Professor, State Key Laboratory of Synthetcal Automaton for Process Industres, Northeastern Unversty, Shenyang, Chna Output Regulaton of Heterogeneous MAS- Renforcement Learnng Soluton for Unknown Systems Supported by : Chna Qan Ren Program, NEU Chna Educaton Mnstry Project 111 (No.B08015) US NSF ONR Work of Reza Modares and S. Nageshrao and Zuo Shan wth Davd Song and Dr. A. Davoud Talk avalable onlne at

3 Output Synchronzaton of Heterogeneous MAS 7

4 Heterogeneous Mult Agents = + x A x B u y = C x leader node x 0 Leader z y = S z = Rz Output regulaton error () t y () t y () t 0 0 Output regulator equatons AP + BG = PS C P = R Dynamcs are dfferent, state dmensons can be dfferent o/p reg eqs capture the common core of all the agents dynamcs And defne a synchronzaton manfold 8

5 Heterogeneous Mult Agents = + x A x B u y = C x Leader z = S z y = Rz Output regulator equatons P + G = P A B S C P = R Trackng error x 0 Output regulaton error () t y () t y () t 0 0 s the nserton map of S n A Dynamcs are dfferent, state dmensons can be dfferent o/p reg eqs capture the common core of all the agents dynamcs And defne a synchronzaton manfold 9

6 Two Control Methods Output regulator equatons AP + BG = PS C P = R Control Method #1 é N ù z = Sz + c a ( z - z ) + g ( z -z ) j j 0 êå ë ú j = 1 û u = K ( x -P z ) +G z = K x + ( G-K P ) z º K x + K z Control Method #2- more ntrgung Local neghborhood output trackng error compensator e a y y g y y y 0. jn j j z Fz Ge y u K x H z ether u K x H z K x ( K ) z Must know agent and leader s dynamcs S,R Or, assume p copy n compensator Then K, H are ndependent

7 Data drven Adaptve Soluton of o/p Reg Eqs n Real Tme Output regulator equatons AP + BG = PS C P = R Solve o/p regulator eqs onlne usng measured data by Renforcement Learnng

8 Optmal Output Synchronzaton of Heterogeneous MAS Usng Off-polcy IRL Nageshrao, Modares, Lopes, Babuska, Lews MAS = + x A x B u y = C x Optmal Tracker Problem Augmented Systems T T T n Xt () = éx() t z ù ê Î 0 ë úû + p z y = S z 0 0 = Rz 0 0 X = T X + B u T B 1 1 Leader éa 0 ù éb ù, = = 0 S 0 êë úû êë úû Performance ndex - g ( t t ) - T T T = ò + t 1 1 T = X () t P X () t VX ( ( t)) e X ( C QC KWK) Xdt Control u = K x + K z = K X 1 2 0

9 Optmal Tracker Soluton by Renforcement Learnng Tracker ARE K = [ K, K ] =-W B P -1 T T P TP P C QC P B W B P T T -1 T + - g + - = Algorthm 1. On-polcy IRL State-feedback algorthm Polcy Evaluaton Solve IRL Bellman equaton t+ t -gdt ( t) T k T k d -g t- T =- ò - - t 0 0 e X ( t d t ) P X ( t d t ) X ( t ) P X ( t ) e ( y y ) Q ( y y ) d t Polcy Update K = [ K, K ] =-W B P k + 1 k + 1 k T k Theorem Algorthm 1 converges to the soluton to the ARE Bellman equaton s solved usng RLS or batch LS It requres a Persstence of Exctaton (PE) condton that may be hard to satsfy Must know B1 14

10 On polcy RL Target polcy: The polcy that we are learnng about. Behavor polcy: The polcy that generates actons and behavor Ref. Target and behavor polcy System Target polcy and behavor polcy are the same 15

11 Off polcy RL Humans can learn optmal polces whle actually applyng suboptmal polces Target polcy Ref. Behavor Polcy System Target polcy and behavor polcy are dfferent 16

12 Off Polcy RL Tracker dynamcs X = TX + B u 1 Rewrte as k k k X = ( T + B K ) X + B ( u - K X ) º T X + B ( u -K X ) Now the Bellman equaton becomes t+ dt -gdt ( ) T k T k -g t-t T ( + d ) ( + d )- () () =- ( ) ( ) ò - - t t 0 0 t+ dt -g ( t- t) k T k ò ( - ) t e X t t P X t t X t P X t e y y Q y y d 1 Extra term contanng K k+ Algorthm 2. Off-polcy IRL Data-based algorthm e u K X W K X dt k k+ 1 Iterate on ths equaton and solve for P, K smultaneously at each step Note about probng nose If u K k k = X + e then ( u - K X ) = e Do not have to know any dynamcs agent = + x A x B u y = C x Or leader z y = S z = Rz

13 Theorem Off polcy Algorthm 2converges to the soluton to the ARE T P TP P C QC P B W B P T T -1 T + - g + - = Theorem o/p reg eq soluton ép P ù = ê ë úû Let P ê êp P AP + BG = PS Then the soluton to the output regulator equatons 1 Is gven by ( - P =-P ) P G = ( - K K P ) P C P = R Do not have to know the Agent dynamcs or the leader s dynamcs (S,R) 18

14 Observer for Leader s State and Dynamcs To avod knowledge of leader s state n u = K x + K z = K X Use adaptve observer for leader s state é N ù z = Sˆ z + c a ( z - z ) + g ( z -z ) j j 0 êå ë ú j = 1 û ˆ é N ù S =-G ( I Äz ) a ( z - z ) + g ( z -z ) vec S q êå j j 0 ë ú j = 1 û Then use control éx ù ˆ u = K x + K z º K X º K 1 2 z ê ë úû So ths s the Control Method #1 Do not have to know the leader s dynamcs (S,R) 19

15

16 Two Control Methods Output regulator equatons AP + BG = PS C P = R We just found a Controller #1 usng data-based control é N ù z = Sz + c a ( z - z ) + g ( z -z ) j j 0 êå ë ú j = 1 û u = K ( x -P z ) +G z = K x + ( G-K P ) z º K x + K z Now we seek a Controller usng Method #2- more ntrgung Local neghborhood output trackng error e a y y g y y y 0. jn j j compensator z Fz Ge y u Kx Hz ether u K x H z K x ( K ) z Or, assume p copy n compensator Then K, H are ndependent Do not want to solve o/p reg eqs or know any dynamcs

17 Overall Dynamcs Structure Agents Leader Compensator Control Input Local neghborhood output error Global form x 1 e dag{ }( ) d { } g L G I p dag C x

18 Global Dynamcs A dag{ A } x z H dag{ }( LG) I d 1 g p 31

19 Problems to Get Local Desgn Procedure 34

20

21 39

22 Heterogeneous Mult agent Dynamcs Leader Dynamcs Output Regulaton Problem 40

23 Local Neghborhood Output Error Compensator 1 local state feedback Compensator 2 local output feedback Compensator 3 no local system feedback 41

24 Local Systems wth Interactons from Neghbors Rewrte local o/p error Local Dynamcs Interacton terms x A BK BH x = z GC F z 0 x j 0 aj Cj 0 gr 0, jn G z j G x C 0 R0. z 42

25 State Regulaton Errors x X 0 o/p reg eqs AX+ GR XS. CX R Error dynamcs A G a C jn j j j A BK BH GC F A BK BH GC F G jn j j j Gz a C C Local transfer functons 1 T s C s A G 43

26 State Regulaton Errors Error Dynamcs Global Form 44

27 Man Theorem Local Desgn Interacton Small Gan Condton 46

28 Local Desgn Procedure z u 1 ˆ A Bu 1 Gz C w Eˆ ˆ w Closed loop Systems Open loop Systems A G a C jn j j j A BK BH G a j N jcj j GC F A BK BH Gz GC F A 0 B K H G z GC F 0 A 0 B K G z. GC F 0 dag K I p Then problem s formulated as ˆ ˆ A Bu 1 Gz, C Statc State feedback u K 1 48

29 Optmal Local Desgn Algorthm H nfnty desgn 49

30 Off polcy RL Algorthm to Solve Output Regulaton for Heterogeneous MAS T T t e t t P t t t P t T t 2 tt t t t t T T e Q K RK d e z z d t tt t t t 1 T T t 2 1 T e 2 K R K K d e 2 z z z d. t t T Ths s a standard parameter ID equaton from Adaptve Control It s lnear n the unknown parameters Solve usng Batch LS or RLS to get the updated values P, K, z 1 1 Ths solves the o/p regulaton problem for heterogeneous MAS Wthout solvng o/p reg equatons and wthout knowng any agent or leader s dynamcs 51

31

32 55

33

Off-policy Reinforcement Learning for Robust Control of Discrete-time Uncertain Linear Systems

Off-policy Reinforcement Learning for Robust Control of Discrete-time Uncertain Linear Systems Off-polcy Renforcement Learnng for Robust Control of Dscrete-tme Uncertan Lnear Systems Yonglang Yang 1 Zhshan Guo 2 Donald Wunsch 3 Yxn Yn 1 1 School of Automatc and Electrcal Engneerng Unversty of Scence

More information

F.L. Lewis, NAI. Talk available online at Supported by : NSF AFOSR Europe ONR Marc Steinberg US TARDEC

F.L. Lewis, NAI. Talk available online at  Supported by : NSF AFOSR Europe ONR Marc Steinberg US TARDEC F.L. Lewis, NAI Moncrief-O Donnell Chair, UTA Research Institute (UTARI) The University of Texas at Arlington, USA and Qian Ren Consulting Professor, State Key Laboratory of Synthetical Automation for

More information

F.L. Lewis. New Developments in Integral Reinforcement Learning: Continuous-time Optimal Control and Games

F.L. Lewis. New Developments in Integral Reinforcement Learning: Continuous-time Optimal Control and Games F.L. Lewis National Academy of Inventors Moncrief-O Donnell Chair, UTA Research Institute (UTARI) The University of Texas at Arlington, USA and Qian Ren Consulting Professor, State Key Laboratory of Synthetical

More information

A new Approach for Solving Linear Ordinary Differential Equations

A new Approach for Solving Linear Ordinary Differential Equations , ISSN 974-57X (Onlne), ISSN 974-5718 (Prnt), Vol. ; Issue No. 1; Year 14, Copyrght 13-14 by CESER PUBLICATIONS A new Approach for Solvng Lnear Ordnary Dfferental Equatons Fawz Abdelwahd Department of

More information

Optimal Synchronization and Games on Graphs

Optimal Synchronization and Games on Graphs Optmal Synchronzaton and Games on Graphs F.L. Lews Moncref-O Donnell Endowed Char Head, Controls & Sensors Group UA Research Insttute (UARI) he Unversty of exas at Arlngton Optmal Desgn for Synchronzaton

More information

CIS526: Machine Learning Lecture 3 (Sept 16, 2003) Linear Regression. Preparation help: Xiaoying Huang. x 1 θ 1 output... θ M x M

CIS526: Machine Learning Lecture 3 (Sept 16, 2003) Linear Regression. Preparation help: Xiaoying Huang. x 1 θ 1 output... θ M x M CIS56: achne Learnng Lecture 3 (Sept 6, 003) Preparaton help: Xaoyng Huang Lnear Regresson Lnear regresson can be represented by a functonal form: f(; θ) = θ 0 0 +θ + + θ = θ = 0 ote: 0 s a dummy attrbute

More information

Resilient Control in Cooperative and Adversarial Multi-agent Networks

Resilient Control in Cooperative and Adversarial Multi-agent Networks Reslent Control n Cooperatve and Adversaral Mult-agent Networks F.L. Lews Natonal Academy of Inventors Moncref-O Donnell Char, UA Research Insttute (UARI) he Unversty of exas at Arlngton, USA And Qan Ren

More information

Lyapunov Design for Controls

Lyapunov Design for Controls F.L. Lewis, NAI Moncrief-O Donnell Chair, UTA Research Institute (UTARI) The University of Texas at Arlington, USA and Foreign Professor, Chongqing University, China Supported by : China Qian Ren Program,

More information

Dynamic Programming. Preview. Dynamic Programming. Dynamic Programming. Dynamic Programming (Example: Fibonacci Sequence)

Dynamic Programming. Preview. Dynamic Programming. Dynamic Programming. Dynamic Programming (Example: Fibonacci Sequence) /24/27 Prevew Fbonacc Sequence Longest Common Subsequence Dynamc programmng s a method for solvng complex problems by breakng them down nto smpler sub-problems. It s applcable to problems exhbtng the propertes

More information

Hashing. Alexandra Stefan

Hashing. Alexandra Stefan Hashng Alexandra Stefan 1 Hash tables Tables Drect access table (or key-ndex table): key => ndex Hash table: key => hash value => ndex Man components Hash functon Collson resoluton Dfferent keys mapped

More information

Controller Design for Networked Control Systems in Multiple-packet Transmission with Random Delays

Controller Design for Networked Control Systems in Multiple-packet Transmission with Random Delays Appled Mehans and Materals Onlne: 03-0- ISSN: 66-748, Vols. 78-80, pp 60-604 do:0.408/www.sentf.net/amm.78-80.60 03 rans eh Publatons, Swtzerland H Controller Desgn for Networed Control Systems n Multple-paet

More information

Distributed Cooperative Control of AC Micro-grids

Distributed Cooperative Control of AC Micro-grids 4 Dstrbuted Cooperatve Control of AC Mcro-grds 6 F.L. Lews, NAI Moncref-O Donnell Char, UTA Research Insttute (UTARI) The Unversty of Texas at Arlngton, USA and Supported by : ONR US NSF Chna Qan Ren Program,

More information

Dynamic Systems on Graphs

Dynamic Systems on Graphs Prepared by F.L. Lews Updated: Saturday, February 06, 200 Dynamc Systems on Graphs Control Graphs and Consensus A network s a set of nodes that collaborates to acheve what each cannot acheve alone. A network,

More information

Joint Energy Management and Resource Allocation in Rechargable Sensor Networks

Joint Energy Management and Resource Allocation in Rechargable Sensor Networks Jont Energy Management and Resource Allocaton n Rechargable Sensor Networks Ren-Shou Lu, Prasun Snha and C. Emre Koksal Department of CSE and ECE The Oho State Unversty Envronmental Energy Harvestng Many

More information

Lecture 14: Bandits with Budget Constraints

Lecture 14: Bandits with Budget Constraints IEOR 8100-001: Learnng and Optmzaton for Sequental Decson Makng 03/07/16 Lecture 14: andts wth udget Constrants Instructor: Shpra Agrawal Scrbed by: Zhpeng Lu 1 Problem defnton In the regular Mult-armed

More information

Erratum: A Generalized Path Integral Control Approach to Reinforcement Learning

Erratum: A Generalized Path Integral Control Approach to Reinforcement Learning Journal of Machne Learnng Research 00-9 Submtted /0; Publshed 7/ Erratum: A Generalzed Path Integral Control Approach to Renforcement Learnng Evangelos ATheodorou Jonas Buchl Stefan Schaal Department of

More information

Lecture Notes on Linear Regression

Lecture Notes on Linear Regression Lecture Notes on Lnear Regresson Feng L fl@sdueducn Shandong Unversty, Chna Lnear Regresson Problem In regresson problem, we am at predct a contnuous target value gven an nput feature vector We assume

More information

Approximate optimal cooperative decentralized control for consensus in a topological network of agents with uncertain nonlinear dynamics

Approximate optimal cooperative decentralized control for consensus in a topological network of agents with uncertain nonlinear dynamics Approxmate optmal cooperatve decentralzed control for consensus n a topologcal network of agents wth uncertan nonlnear dynamcs Rushkesh Kamalapurkar, Huyen Dnh, Patrck Walters, and Warren Dxon arxv:34.3479v

More information

VARIOUS complex systems in social and engineering

VARIOUS complex systems in social and engineering 418 IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, VOL. 25, NO. 2, FEBRUARY 2014 Decentralzed Stablzaton for a Class of Contnuous-Tme Nonlnear Interconnected Systems Usng Onlne Learnng Optmal

More information

Outline. Communication. Bellman Ford Algorithm. Bellman Ford Example. Bellman Ford Shortest Path [1]

Outline. Communication. Bellman Ford Algorithm. Bellman Ford Example. Bellman Ford Shortest Path [1] DYNAMIC SHORTEST PATH SEARCH AND SYNCHRONIZED TASK SWITCHING Jay Wagenpfel, Adran Trachte 2 Outlne Shortest Communcaton Path Searchng Bellmann Ford algorthm Algorthm for dynamc case Modfcatons to our algorthm

More information

The Tangential Force Distribution on Inner Cylinder of Power Law Fluid Flowing in Eccentric Annuli with the Inner Cylinder Reciprocating Axially

The Tangential Force Distribution on Inner Cylinder of Power Law Fluid Flowing in Eccentric Annuli with the Inner Cylinder Reciprocating Axially Open Journal of Flud Dynamcs, 2015, 5, 183-187 Publshed Onlne June 2015 n ScRes. http://www.scrp.org/journal/ojfd http://dx.do.org/10.4236/ojfd.2015.52020 The Tangental Force Dstrbuton on Inner Cylnder

More information

Tracking with Kalman Filter

Tracking with Kalman Filter Trackng wth Kalman Flter Scott T. Acton Vrgna Image and Vdeo Analyss (VIVA), Charles L. Brown Department of Electrcal and Computer Engneerng Department of Bomedcal Engneerng Unversty of Vrgna, Charlottesvlle,

More information

Errors for Linear Systems

Errors for Linear Systems Errors for Lnear Systems When we solve a lnear system Ax b we often do not know A and b exactly, but have only approxmatons  and ˆb avalable. Then the best thng we can do s to solve ˆx ˆb exactly whch

More information

Modeling and Simulation of a Hexapod Machine Tool for the Dynamic Stability Analysis of Milling Processes. C. Henninger, P.

Modeling and Simulation of a Hexapod Machine Tool for the Dynamic Stability Analysis of Milling Processes. C. Henninger, P. Smpack User Meetng 27 Modelng and Smulaton of a Heapod Machne Tool for the Dynamc Stablty Analyss of Mllng Processes C. Hennnger, P. Eberhard Insttute of Engneerng project funded by the DFG wthn the framework

More information

Introduction. - The Second Lyapunov Method. - The First Lyapunov Method

Introduction. - The Second Lyapunov Method. - The First Lyapunov Method Stablty Analyss A. Khak Sedgh Control Systems Group Faculty of Electrcal and Computer Engneerng K. N. Toos Unversty of Technology February 2009 1 Introducton Stablty s the most promnent characterstc of

More information

DO NOT DO HOMEWORK UNTIL IT IS ASSIGNED. THE ASSIGNMENTS MAY CHANGE UNTIL ANNOUNCED.

DO NOT DO HOMEWORK UNTIL IT IS ASSIGNED. THE ASSIGNMENTS MAY CHANGE UNTIL ANNOUNCED. EE 539 Homeworks Sprng 08 Updated: Tuesday, Aprl 7, 08 DO NOT DO HOMEWORK UNTIL IT IS ASSIGNED. THE ASSIGNMENTS MAY CHANGE UNTIL ANNOUNCED. For full credt, show all work. Some problems requre hand calculatons.

More information

Neuro-Adaptive Design II:

Neuro-Adaptive Design II: Lecture 37 Neuro-Adaptve Desgn II: A Robustfyng Tool for Any Desgn Dr. Radhakant Padh Asst. Professor Dept. of Aerospace Engneerng Indan Insttute of Scence - Bangalore Motvaton Perfect system modelng s

More information

The Two-scale Finite Element Errors Analysis for One Class of Thermoelastic Problem in Periodic Composites

The Two-scale Finite Element Errors Analysis for One Class of Thermoelastic Problem in Periodic Composites 7 Asa-Pacfc Engneerng Technology Conference (APETC 7) ISBN: 978--6595-443- The Two-scale Fnte Element Errors Analyss for One Class of Thermoelastc Problem n Perodc Compostes Xaoun Deng Mngxang Deng ABSTRACT

More information

CHAPTER 5 NUMERICAL EVALUATION OF DYNAMIC RESPONSE

CHAPTER 5 NUMERICAL EVALUATION OF DYNAMIC RESPONSE CHAPTER 5 NUMERICAL EVALUATION OF DYNAMIC RESPONSE Analytcal soluton s usually not possble when exctaton vares arbtrarly wth tme or f the system s nonlnear. Such problems can be solved by numercal tmesteppng

More information

Concurrent learning-based online approximate feedback-nash equilibrium solution of N-player nonzero-sum differential games

Concurrent learning-based online approximate feedback-nash equilibrium solution of N-player nonzero-sum differential games Concurrent learnng-based onlne approxmate feedbac-nash equlbrum soluton of N-player nonzero-sum dfferental games Rushesh Kamalapurar, Justn Klotz, and arren E. Dxon arxv:30.38v [cs.sy] Oct 03 Abstract

More information

Changing Topology and Communication Delays

Changing Topology and Communication Delays Prepared by F.L. Lews Updated: Saturday, February 3, 00 Changng Topology and Communcaton Delays Changng Topology The graph connectvty or topology may change over tme. Let G { G, G,, G M } wth M fnte be

More information

The equation of motion of a dynamical system is given by a set of differential equations. That is (1)

The equation of motion of a dynamical system is given by a set of differential equations. That is (1) Dynamcal Systems Many engneerng and natural systems are dynamcal systems. For example a pendulum s a dynamcal system. State l The state of the dynamcal system specfes t condtons. For a pendulum n the absence

More information

Lecture 3 January 31, 2017

Lecture 3 January 31, 2017 CS 224: Advanced Algorthms Sprng 207 Prof. Jelan Nelson Lecture 3 January 3, 207 Scrbe: Saketh Rama Overvew In the last lecture we covered Y-fast tres and Fuson Trees. In ths lecture we start our dscusson

More information

Dynamic Programming 4/5/12. Dynamic programming. Fibonacci numbers. Fibonacci: a first attempt. David Kauchak cs302 Spring 2012

Dynamic Programming 4/5/12. Dynamic programming. Fibonacci numbers. Fibonacci: a first attempt. David Kauchak cs302 Spring 2012 Dynamc Programmng Davd Kauchak cs32 Sprng 212 Dynamc programmng l One of the most mportant algorthm tools! l Very common ntervew queston l Method for solvng problems where optmal solutons can be defned

More information

Neuro-Adaptive Design - I:

Neuro-Adaptive Design - I: Lecture 36 Neuro-Adaptve Desgn - I: A Robustfyng ool for Dynamc Inverson Desgn Dr. Radhakant Padh Asst. Professor Dept. of Aerospace Engneerng Indan Insttute of Scence - Bangalore Motvaton Perfect system

More information

Classification as a Regression Problem

Classification as a Regression Problem Target varable y C C, C,, ; Classfcaton as a Regresson Problem { }, 3 L C K To treat classfcaton as a regresson problem we should transform the target y nto numercal values; The choce of numercal class

More information

Appendix for Causal Interaction in Factorial Experiments: Application to Conjoint Analysis

Appendix for Causal Interaction in Factorial Experiments: Application to Conjoint Analysis A Appendx for Causal Interacton n Factoral Experments: Applcaton to Conjont Analyss Mathematcal Appendx: Proofs of Theorems A. Lemmas Below, we descrbe all the lemmas, whch are used to prove the man theorems

More information

Support Vector Machines. Vibhav Gogate The University of Texas at dallas

Support Vector Machines. Vibhav Gogate The University of Texas at dallas Support Vector Machnes Vbhav Gogate he Unversty of exas at dallas What We have Learned So Far? 1. Decson rees. Naïve Bayes 3. Lnear Regresson 4. Logstc Regresson 5. Perceptron 6. Neural networks 7. K-Nearest

More information

n α j x j = 0 j=1 has a nontrivial solution. Here A is the n k matrix whose jth column is the vector for all t j=0

n α j x j = 0 j=1 has a nontrivial solution. Here A is the n k matrix whose jth column is the vector for all t j=0 MODULE 2 Topcs: Lnear ndependence, bass and dmenson We have seen that f n a set of vectors one vector s a lnear combnaton of the remanng vectors n the set then the span of the set s unchanged f that vector

More information

Canonical transformations

Canonical transformations Canoncal transformatons November 23, 2014 Recall that we have defned a symplectc transformaton to be any lnear transformaton M A B leavng the symplectc form nvarant, Ω AB M A CM B DΩ CD Coordnate transformatons,

More information

Discriminative classifier: Logistic Regression. CS534-Machine Learning

Discriminative classifier: Logistic Regression. CS534-Machine Learning Dscrmnatve classfer: Logstc Regresson CS534-Machne Learnng robablstc Classfer Gven an nstance, hat does a probablstc classfer do dfferentl compared to, sa, perceptron? It does not drectl predct Instead,

More information

CLASSICAL optimal control problems are formulated in

CLASSICAL optimal control problems are formulated in IEEE/CAA JOURNAL OF AUOMAICA SINICA, VOL., NO. 3, JULY 39 Concurrent Learnng-based Approxmate Feedbac-Nash Equlbrum Soluton of N-player Nonzero-sum Dfferental Games Rushesh Kamalapurar Justn R. Klotz arren

More information

Robust observed-state feedback design. for discrete-time systems rational in the uncertainties

Robust observed-state feedback design. for discrete-time systems rational in the uncertainties Robust observed-state feedback desgn for dscrete-tme systems ratonal n the uncertantes Dmtr Peaucelle Yosho Ebhara & Yohe Hosoe Semnar at Kolloquum Technsche Kybernetk, May 10, 016 Unversty of Stuttgart

More information

Lecture 13 APPROXIMATION OF SECOMD ORDER DERIVATIVES

Lecture 13 APPROXIMATION OF SECOMD ORDER DERIVATIVES COMPUTATIONAL FLUID DYNAMICS: FDM: Appromaton of Second Order Dervatves Lecture APPROXIMATION OF SECOMD ORDER DERIVATIVES. APPROXIMATION OF SECOND ORDER DERIVATIVES Second order dervatves appear n dffusve

More information

Report on Image warping

Report on Image warping Report on Image warpng Xuan Ne, Dec. 20, 2004 Ths document summarzed the algorthms of our mage warpng soluton for further study, and there s a detaled descrpton about the mplementaton of these algorthms.

More information

Integrals and Invariants of Euler-Lagrange Equations

Integrals and Invariants of Euler-Lagrange Equations Lecture 16 Integrals and Invarants of Euler-Lagrange Equatons ME 256 at the Indan Insttute of Scence, Bengaluru Varatonal Methods and Structural Optmzaton G. K. Ananthasuresh Professor, Mechancal Engneerng,

More information

Modelli Clamfim Equazioni differenziali 22 settembre 2016

Modelli Clamfim Equazioni differenziali 22 settembre 2016 CLAMFIM Bologna Modell 1 @ Clamfm Equazon dfferenzal 22 settembre 2016 professor Danele Rtell danele.rtell@unbo.t 1/22? Ordnary Dfferental Equatons A dfferental equaton s an equaton that defnes a relatonshp

More information

Lecture 10 Support Vector Machines II

Lecture 10 Support Vector Machines II Lecture 10 Support Vector Machnes II 22 February 2016 Taylor B. Arnold Yale Statstcs STAT 365/665 1/28 Notes: Problem 3 s posted and due ths upcomng Frday There was an early bug n the fake-test data; fxed

More information

PHYS 705: Classical Mechanics. Canonical Transformation II

PHYS 705: Classical Mechanics. Canonical Transformation II 1 PHYS 705: Classcal Mechancs Canoncal Transformaton II Example: Harmonc Oscllator f ( x) x m 0 x U( x) x mx x LT U m Defne or L p p mx x x m mx x H px L px p m p x m m H p 1 x m p m 1 m H x p m x m m

More information

LocalReliableControlforLinearSystems with Saturating Actuators

LocalReliableControlforLinearSystems with Saturating Actuators LocalRelableControlforLnearSystems wth Saturatng Actuators Shoudong Huang James Lam Bng Chen Dept. of Engneerng Dept. of Mechancal Engneerng Dept. of Mathematcs Australan Natonal Unversty Unversty of Hong

More information

Additional Codes using Finite Difference Method. 1 HJB Equation for Consumption-Saving Problem Without Uncertainty

Additional Codes using Finite Difference Method. 1 HJB Equation for Consumption-Saving Problem Without Uncertainty Addtonal Codes usng Fnte Dfference Method Benamn Moll 1 HJB Equaton for Consumpton-Savng Problem Wthout Uncertanty Before consderng the case wth stochastc ncome n http://www.prnceton.edu/~moll/ HACTproect/HACT_Numercal_Appendx.pdf,

More information

VQ widely used in coding speech, image, and video

VQ widely used in coding speech, image, and video at Scalar quantzers are specal cases of vector quantzers (VQ): they are constraned to look at one sample at a tme (memoryless) VQ does not have such constrant better RD perfomance expected Source codng

More information

6.3.4 Modified Euler s method of integration

6.3.4 Modified Euler s method of integration 6.3.4 Modfed Euler s method of ntegraton Before dscussng the applcaton of Euler s method for solvng the swng equatons, let us frst revew the basc Euler s method of numercal ntegraton. Let the general from

More information

FTCS Solution to the Heat Equation

FTCS Solution to the Heat Equation FTCS Soluton to the Heat Equaton ME 448/548 Notes Gerald Recktenwald Portland State Unversty Department of Mechancal Engneerng gerry@pdx.edu ME 448/548: FTCS Soluton to the Heat Equaton Overvew 1. Use

More information

Department of Statistics University of Toronto STA305H1S / 1004 HS Design and Analysis of Experiments Term Test - Winter Solution

Department of Statistics University of Toronto STA305H1S / 1004 HS Design and Analysis of Experiments Term Test - Winter Solution Department of Statstcs Unversty of Toronto STA35HS / HS Desgn and Analyss of Experments Term Test - Wnter - Soluton February, Last Name: Frst Name: Student Number: Instructons: Tme: hours. Ads: a non-programmable

More information

Operating conditions of a mine fan under conditions of variable resistance

Operating conditions of a mine fan under conditions of variable resistance Paper No. 11 ISMS 216 Operatng condtons of a mne fan under condtons of varable resstance Zhang Ynghua a, Chen L a, b, Huang Zhan a, *, Gao Yukun a a State Key Laboratory of Hgh-Effcent Mnng and Safety

More information

Generative classification models

Generative classification models CS 675 Intro to Machne Learnng Lecture Generatve classfcaton models Mlos Hauskrecht mlos@cs.ptt.edu 539 Sennott Square Data: D { d, d,.., dn} d, Classfcaton represents a dscrete class value Goal: learn

More information

Transfer Functions. Convenient representation of a linear, dynamic model. A transfer function (TF) relates one input and one output: ( ) system

Transfer Functions. Convenient representation of a linear, dynamic model. A transfer function (TF) relates one input and one output: ( ) system Transfer Functons Convenent representaton of a lnear, dynamc model. A transfer functon (TF) relates one nput and one output: x t X s y t system Y s The followng termnology s used: x y nput output forcng

More information

PHYS 705: Classical Mechanics. Calculus of Variations II

PHYS 705: Classical Mechanics. Calculus of Variations II 1 PHYS 705: Classcal Mechancs Calculus of Varatons II 2 Calculus of Varatons: Generalzaton (no constrant yet) Suppose now that F depends on several dependent varables : We need to fnd such that has a statonary

More information

Hierarchical State Estimation Using Phasor Measurement Units

Hierarchical State Estimation Using Phasor Measurement Units Herarchcal State Estmaton Usng Phasor Measurement Unts Al Abur Northeastern Unversty Benny Zhao (CA-ISO) and Yeo-Jun Yoon (KPX) IEEE PES GM, Calgary, Canada State Estmaton Workng Group Meetng July 28,

More information

Analysis of Dynamic Cross Response between Spindles in a Dual Spindle Type Multi-Functional Turning Machine

Analysis of Dynamic Cross Response between Spindles in a Dual Spindle Type Multi-Functional Turning Machine Journal of Power and Energy Engneerng, 2013, 1, 20-24 http://dx.do.org/10.4236/jpee.2013.17004 Publshed Onlne December 2013 (http://www.scrp.org/journal/jpee) Analyss of Dynamc Cross Response between Spndles

More information

risk and uncertainty assessment

risk and uncertainty assessment Optmal forecastng of atmospherc qualty n ndustral regons: rsk and uncertanty assessment Vladmr Penenko Insttute of Computatonal Mathematcs and Mathematcal Geophyscs SD RAS Goal Development of theoretcal

More information

Discriminative classifier: Logistic Regression. CS534-Machine Learning

Discriminative classifier: Logistic Regression. CS534-Machine Learning Dscrmnatve classfer: Logstc Regresson CS534-Machne Learnng 2 Logstc Regresson Gven tranng set D stc regresson learns the condtonal dstrbuton We ll assume onl to classes and a parametrc form for here s

More information

3.2. Cournot Model Cournot Model

3.2. Cournot Model Cournot Model Matlde Machado Assumptons: All frms produce an homogenous product The market prce s therefore the result of the total supply (same prce for all frms) Frms decde smultaneously how much to produce Quantty

More information

One-sided finite-difference approximations suitable for use with Richardson extrapolation

One-sided finite-difference approximations suitable for use with Richardson extrapolation Journal of Computatonal Physcs 219 (2006) 13 20 Short note One-sded fnte-dfference approxmatons sutable for use wth Rchardson extrapolaton Kumar Rahul, S.N. Bhattacharyya * Department of Mechancal Engneerng,

More information

Using T.O.M to Estimate Parameter of distributions that have not Single Exponential Family

Using T.O.M to Estimate Parameter of distributions that have not Single Exponential Family IOSR Journal of Mathematcs IOSR-JM) ISSN: 2278-5728. Volume 3, Issue 3 Sep-Oct. 202), PP 44-48 www.osrjournals.org Usng T.O.M to Estmate Parameter of dstrbutons that have not Sngle Exponental Famly Jubran

More information

Other NN Models. Reinforcement learning (RL) Probabilistic neural networks

Other NN Models. Reinforcement learning (RL) Probabilistic neural networks Other NN Models Renforcement learnng (RL) Probablstc neural networks Support vector machne (SVM) Renforcement learnng g( (RL) Basc deas: Supervsed dlearnng: (delta rule, BP) Samples (x, f(x)) to learn

More information

Modelli Clamfim Equazioni differenziali 7 ottobre 2013

Modelli Clamfim Equazioni differenziali 7 ottobre 2013 CLAMFIM Bologna Modell 1 @ Clamfm Equazon dfferenzal 7 ottobre 2013 professor Danele Rtell danele.rtell@unbo.t 1/18? Ordnary Dfferental Equatons A dfferental equaton s an equaton that defnes a relatonshp

More information

Adaptive Consensus Control of Multi-Agent Systems with Large Uncertainty and Time Delays *

Adaptive Consensus Control of Multi-Agent Systems with Large Uncertainty and Time Delays * Journal of Robotcs, etworkng and Artfcal Lfe, Vol., o. (September 04), 5-9 Adaptve Consensus Control of Mult-Agent Systems wth Large Uncertanty and me Delays * L Lu School of Mechancal Engneerng Unversty

More information

FE REVIEW OPERATIONAL AMPLIFIERS (OP-AMPS)( ) 8/25/2010

FE REVIEW OPERATIONAL AMPLIFIERS (OP-AMPS)( ) 8/25/2010 FE REVEW OPERATONAL AMPLFERS (OP-AMPS)( ) 1 The Op-amp 2 An op-amp has two nputs and one output. Note the op-amp below. The termnal labeled l wth the (-) sgn s the nvertng nput and the nput labeled wth

More information

Outline and Reading. Dynamic Programming. Dynamic Programming revealed. Computing Fibonacci. The General Dynamic Programming Technique

Outline and Reading. Dynamic Programming. Dynamic Programming revealed. Computing Fibonacci. The General Dynamic Programming Technique Outlne and Readng Dynamc Programmng The General Technque ( 5.3.2) -1 Knapsac Problem ( 5.3.3) Matrx Chan-Product ( 5.3.1) Dynamc Programmng verson 1.4 1 Dynamc Programmng verson 1.4 2 Dynamc Programmng

More information

Queueing Networks II Network Performance

Queueing Networks II Network Performance Queueng Networks II Network Performance Davd Tpper Assocate Professor Graduate Telecommuncatons and Networkng Program Unversty of Pttsburgh Sldes 6 Networks of Queues Many communcaton systems must be modeled

More information

Adjoint Methods of Sensitivity Analysis for Lyapunov Equation. Boping Wang 1, Kun Yan 2. University of Technology, Dalian , P. R.

Adjoint Methods of Sensitivity Analysis for Lyapunov Equation. Boping Wang 1, Kun Yan 2. University of Technology, Dalian , P. R. th World Congress on Structural and Multdscplnary Optmsaton 7 th - th, June 5, Sydney Australa Adjont Methods of Senstvty Analyss for Lyapunov Equaton Bopng Wang, Kun Yan Department of Mechancal and Aerospace

More information

Dynamic Programming. Lecture 13 (5/31/2017)

Dynamic Programming. Lecture 13 (5/31/2017) Dynamc Programmng Lecture 13 (5/31/2017) - A Forest Thnnng Example - Projected yeld (m3/ha) at age 20 as functon of acton taken at age 10 Age 10 Begnnng Volume Resdual Ten-year Volume volume thnned volume

More information

The Study of Teaching-learning-based Optimization Algorithm

The Study of Teaching-learning-based Optimization Algorithm Advanced Scence and Technology Letters Vol. (AST 06), pp.05- http://dx.do.org/0.57/astl.06. The Study of Teachng-learnng-based Optmzaton Algorthm u Sun, Yan fu, Lele Kong, Haolang Q,, Helongang Insttute

More information

Note: Please use the actual date you accessed this material in your citation.

Note: Please use the actual date you accessed this material in your citation. MIT OpenCourseWare http://ocw.mt.edu 6.13/ESD.13J Electromagnetcs and Applcatons, Fall 5 Please use the followng ctaton format: Markus Zahn, Erch Ippen, and Davd Staeln, 6.13/ESD.13J Electromagnetcs and

More information

Zeros and Zero Dynamics for Linear, Time-delay System

Zeros and Zero Dynamics for Linear, Time-delay System UNIVERSITA POLITECNICA DELLE MARCHE - FACOLTA DI INGEGNERIA Dpartmento d Ingegnerua Informatca, Gestonale e dell Automazone LabMACS Laboratory of Modelng, Analyss and Control of Dynamcal System Zeros and

More information

A Simple Inventory System

A Simple Inventory System A Smple Inventory System Lawrence M. Leems and Stephen K. Park, Dscrete-Event Smulaton: A Frst Course, Prentce Hall, 2006 Hu Chen Computer Scence Vrgna State Unversty Petersburg, Vrgna February 8, 2017

More information

COMPARISON OF SOME RELIABILITY CHARACTERISTICS BETWEEN REDUNDANT SYSTEMS REQUIRING SUPPORTING UNITS FOR THEIR OPERATIONS

COMPARISON OF SOME RELIABILITY CHARACTERISTICS BETWEEN REDUNDANT SYSTEMS REQUIRING SUPPORTING UNITS FOR THEIR OPERATIONS Avalable onlne at http://sck.org J. Math. Comput. Sc. 3 (3), No., 6-3 ISSN: 97-537 COMPARISON OF SOME RELIABILITY CHARACTERISTICS BETWEEN REDUNDANT SYSTEMS REQUIRING SUPPORTING UNITS FOR THEIR OPERATIONS

More information

LECTURE 9 CANONICAL CORRELATION ANALYSIS

LECTURE 9 CANONICAL CORRELATION ANALYSIS LECURE 9 CANONICAL CORRELAION ANALYSIS Introducton he concept of canoncal correlaton arses when we want to quantfy the assocatons between two sets of varables. For example, suppose that the frst set of

More information

Introduction to Vapor/Liquid Equilibrium, part 2. Raoult s Law:

Introduction to Vapor/Liquid Equilibrium, part 2. Raoult s Law: CE304, Sprng 2004 Lecture 4 Introducton to Vapor/Lqud Equlbrum, part 2 Raoult s Law: The smplest model that allows us do VLE calculatons s obtaned when we assume that the vapor phase s an deal gas, and

More information

Module 3 LOSSY IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur

Module 3 LOSSY IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur Module 3 LOSSY IMAGE COMPRESSION SYSTEMS Verson ECE IIT, Kharagpur Lesson 6 Theory of Quantzaton Verson ECE IIT, Kharagpur Instructonal Objectves At the end of ths lesson, the students should be able to:

More information

LA PRISE DE CALAIS. çoys, çoys, har - dis. çoys, dis. tons, mantz, tons, Gas. c est. à ce. C est à ce. coup, c est à ce

LA PRISE DE CALAIS. çoys, çoys, har - dis. çoys, dis. tons, mantz, tons, Gas. c est. à ce. C est à ce. coup, c est à ce > ƒ? @ Z [ \ _ ' µ `. l 1 2 3 z Æ Ñ 6 = Ð l sl (~131 1606) rn % & +, l r s s, r 7 nr ss r r s s s, r s, r! " # $ s s ( ) r * s, / 0 s, r 4 r r 9;: < 10 r mnz, rz, r ns, 1 s ; j;k ns, q r s { } ~ l r mnz,

More information

Chapter 2 A Class of Robust Solution for Linear Bilevel Programming

Chapter 2 A Class of Robust Solution for Linear Bilevel Programming Chapter 2 A Class of Robust Soluton for Lnear Blevel Programmng Bo Lu, Bo L and Yan L Abstract Under the way of the centralzed decson-makng, the lnear b-level programmng (BLP) whose coeffcents are supposed

More information

Hidden Markov Models

Hidden Markov Models Hdden Markov Models Namrata Vaswan, Iowa State Unversty Aprl 24, 204 Hdden Markov Model Defntons and Examples Defntons:. A hdden Markov model (HMM) refers to a set of hdden states X 0, X,..., X t,...,

More information

Polynomial Regression Models

Polynomial Regression Models LINEAR REGRESSION ANALYSIS MODULE XII Lecture - 6 Polynomal Regresson Models Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur Test of sgnfcance To test the sgnfcance

More information

ECE559VV Project Report

ECE559VV Project Report ECE559VV Project Report (Supplementary Notes Loc Xuan Bu I. MAX SUM-RATE SCHEDULING: THE UPLINK CASE We have seen (n the presentaton that, for downlnk (broadcast channels, the strategy maxmzng the sum-rate

More information

% & 5.3 PRACTICAL APPLICATIONS. Given system, (49) , determine the Boolean Function, , in such a way that we always have expression: " Y1 = Y2

% & 5.3 PRACTICAL APPLICATIONS. Given system, (49) , determine the Boolean Function, , in such a way that we always have expression:  Y1 = Y2 5.3 PRACTICAL APPLICATIONS st EXAMPLE: Gven system, (49) & K K Y XvX 3 ( 2 & X ), determne the Boolean Functon, Y2 X2 & X 3 v X " X3 (X2,X)", n such a way that we always have expresson: " Y Y2 " (50).

More information

Research on the Fuzzy Control for Vehicle Semi-active Suspension. Xiaoming Hu 1, a, Wanli Li 1,b

Research on the Fuzzy Control for Vehicle Semi-active Suspension. Xiaoming Hu 1, a, Wanli Li 1,b Advanced Materals Research Onlne: 0-0- ISSN: -9, Vol., pp -9 do:0.0/www.scentfc.net/amr.. 0 Trans Tech Publcatons, Swterland Research on the Fuy Control for Vehcle Sem-actve Suspenson Xaomng Hu, a, Wanl

More information

Systems of Equations (SUR, GMM, and 3SLS)

Systems of Equations (SUR, GMM, and 3SLS) Lecture otes on Advanced Econometrcs Takash Yamano Fall Semester 4 Lecture 4: Sstems of Equatons (SUR, MM, and 3SLS) Seemngl Unrelated Regresson (SUR) Model Consder a set of lnear equatons: $ + ɛ $ + ɛ

More information

Lagrange Multipliers Kernel Trick

Lagrange Multipliers Kernel Trick Lagrange Multplers Kernel Trck Ncholas Ruozz Unversty of Texas at Dallas Based roughly on the sldes of Davd Sontag General Optmzaton A mathematcal detour, we ll come back to SVMs soon! subject to: f x

More information

IV. Performance Optimization

IV. Performance Optimization IV. Performance Optmzaton A. Steepest descent algorthm defnton how to set up bounds on learnng rate mnmzaton n a lne (varyng learnng rate) momentum learnng examples B. Newton s method defnton Gauss-Newton

More information

Review: Fit a line to N data points

Review: Fit a line to N data points Revew: Ft a lne to data ponts Correlated parameters: L y = a x + b Orthogonal parameters: J y = a (x ˆ x + b For ntercept b, set a=0 and fnd b by optmal average: ˆ b = y, Var[ b ˆ ] = For slope a, set

More information

Time-Varying Systems and Computations Lecture 6

Time-Varying Systems and Computations Lecture 6 Tme-Varyng Systems and Computatons Lecture 6 Klaus Depold 14. Januar 2014 The Kalman Flter The Kalman estmaton flter attempts to estmate the actual state of an unknown dscrete dynamcal system, gven nosy

More information

3 Riccati Design for Synchronization of Continuous-Time Systems

3 Riccati Design for Synchronization of Continuous-Time Systems 3 Rccat Desgn for Synchronzaton of Contnuous-Tme Systems Ths chapter studes cooperatve trackng control of mult-agent dynamcal systems nterconnected by a fxed communcaton graph topology. Each agent or node

More information

November 5, 2002 SE 180: Earthquake Engineering SE 180. Final Project

November 5, 2002 SE 180: Earthquake Engineering SE 180. Final Project SE 8 Fnal Project Story Shear Frame u m Gven: u m L L m L L EI ω ω Solve for m Story Bendng Beam u u m L m L Gven: m L L EI ω ω Solve for m 3 3 Story Shear Frame u 3 m 3 Gven: L 3 m m L L L 3 EI ω ω ω

More information

Boise State University Department of Electrical and Computer Engineering ECE 212L Circuit Analysis and Design Lab

Boise State University Department of Electrical and Computer Engineering ECE 212L Circuit Analysis and Design Lab Bose State Unersty Department of Electrcal and omputer Engneerng EE 1L rcut Analyss and Desgn Lab Experment #8: The Integratng and Dfferentatng Op-Amp rcuts 1 Objectes The objectes of ths laboratory experment

More information

Mechanics Physics 151

Mechanics Physics 151 Mechancs hyscs 151 Lecture Canoncal Transformatons (Chater 9) What We Dd Last Tme Drect Condtons Q j Q j = = j, Q, j, Q, Necessary and suffcent j j for Canoncal Transf. = = j Q, Q, j Q, Q, Infntesmal CT

More information

Implicit Integration Henyey Method

Implicit Integration Henyey Method Implct Integraton Henyey Method In realstc stellar evoluton codes nstead of a drect ntegraton usng for example the Runge-Kutta method one employs an teratve mplct technque. Ths s because the structure

More information

Chapter 12. Ordinary Differential Equation Boundary Value (BV) Problems

Chapter 12. Ordinary Differential Equation Boundary Value (BV) Problems Chapter. Ordnar Dfferental Equaton Boundar Value (BV) Problems In ths chapter we wll learn how to solve ODE boundar value problem. BV ODE s usuall gven wth x beng the ndependent space varable. p( x) q(

More information