VARIOUS complex systems in social and engineering

Size: px
Start display at page:

Download "VARIOUS complex systems in social and engineering"

Transcription

1 418 IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, VOL. 25, NO. 2, FEBRUARY 2014 Decentralzed Stablzaton for a Class of Contnuous-Tme Nonlnear Interconnected Systems Usng Onlne Learnng Optmal Control Approach Derong Lu, Fellow, IEEE, Dng Wang, and Honglang L Abstract In ths paper, usng a neural-network-based onlne learnng optmal control approach, a novel decentralzed control strategy s developed to stablze a class of contnuoustme nonlnear nterconnected large-scale systems. Frst, optmal controllers of the solated subsystems are desgned wth cost functons reflectng the bounds of nterconnectons. Then, t s proven that the decentralzed control strategy of the overall system can be establshed by addng approprate feedback gans to the optmal control polces of the solated subsystems. Next, an onlne polcy teraton algorthm s presented to solve the Hamlton Jacob Bellman equatons related to the optmal control problem. Through constructng a set of crtc neural networks, the cost functons can be obtaned approxmately, followed by the control polces. Furthermore, the dynamcs of the estmaton errors of the crtc networks are verfed to be unformly and ultmately bounded. Fnally, a smulaton example s provded to llustrate the effectveness of the present decentralzed control scheme. Index Terms Adaptve dynamc programmng, decentralzed control, large-scale systems, neural networks, nonlnear nterconnected systems, optmal control, polcy teraton, renforcement learnng. I. INTRODUCTION VARIOUS complex systems n socal and engneerng areas, such as ecosystems, transportaton systems, and power systems, are consdered as large-scale systems. Generally speakng, a large-scale system s comprsed of several subsystems wth obvous nterconnectons, whch leads to the ncreasng dffculty of analyss and synthess when usng classcal centralzed control technques. Bakule [1] ponted out wth smlar results that t s, therefore, necessary to partton the desgn ssue of the overall system nto manageable subproblems. Then, the overall plant s no longer controlled by a sngle controller but by an array of ndependent controllers that all together represent a decentralzed controller. Therefore, the decentralzed control has been a control of choce for large-scale systems because t s computatonally effcent to Manuscrpt receved January 5, 2013; revsed May 6, 2013; accepted July 25, Date of publcaton September 16, 2013; date of current verson January 10, Ths work was supported n part by the Natonal Natural Scence Foundaton of Chna under Grants , , and , and n part by the Early Career Development Award of SKLMCCS. The actng Edtor-n-Chef who handled the revew of ths paper was Danl Prokhorov. The authors are wth The State Key Laboratory of Management and Control for Complex Systems, Insttute of Automaton, Chnese Academy of Scences, Bejng , Chna (e-mal: derong.lu@a.ac.cn; dng.wang@a.ac.cn; honglang.l@a.ac.cn). Color versons of one or more of the fgures n ths paper are avalable onlne at Dgtal Object Identfer /TNNLS formulate control law that use only locally avalable subsystem states or outputs [2]. Actually, consderable attenton has been pad to the decentralzed stablzaton of large-scale systems durng the last several decades [3] [7]. As prevously mentoned, a decentralzed strategy conssts of some nonnteractng local controllers correspondng to the solated subsystems, not the overall system. Thus, n many stuatons, the desgn of the solated subsystems s a matter of great sgnfcance. In [8], t was shown that the decentralzed control of the nterconnected system was related to the optmal control of the solated subsystems. Therefore, the optmal control method can be employed to facltate the desgn process of the decentralzed control strategy. However, n [8], the cost functons of the solated subsystems were not chosen as the general forms, not to menton that the detaled procedure was not gven. For ths reason, n ths paper, by employng the onlne polcy teraton algorthm, we wll nvestgate the decentralzed stablzaton problem usng neural-network-based learnng optmal control approach. The optmal control of nonlnear system often leads to solvng the Hamlton Jacob Bellman (HJB) equaton nstead of the Rccat equaton of the lnear case. Though dynamc programmng s a useful technque to solve the optmzaton and optmal control problems, n many cases, t s computatonally dffcult to apply t because of the curse of dmensonalty. Fortunately, based on functon approxmators, such as neural networks, adaptve (or approxmate) dynamc programmng (ADP) was proposed by Werbos [9], [10] as an alternatve method to solve the optmal control problems forward-n-tme. There are several synonyms used for ADP, ncludng adaptve dynamc programmng [11] [15], approxmate dynamc programmng [16] [18], neuro-dynamc programmng [19], neural dynamc programmng [20], adaptve crtc desgns [21], and renforcement learnng [22]. In the recent years, great efforts have been made to ADP and related research n theory and applcatons. Numerous excellent results have been obtaned that greatly promotes the development of relevant dscplnes [23] [40]. In lght of [13], the ADP technque s closely related to renforcement learnng when engagng n the research of feedback control. In general, value and polcy teratons are fundamental algorthms for the renforcement learnng-based ADP n optmal control. Polcy teraton starts wth a stablzng control, whereas value teraton cannot always guarantee the stablty of control durng the mplementaton process. Al-Tamm et al. [18], Zhang et al. [23], and Lu et al. [26] X 2013 IEEE. Personal use s permtted, but republcaton/redstrbuton requres IEEE permsson. See for more nformaton.

2 LIU et al.: ONLINE LEARNING OPTIMAL CONTROL APPROACH 419 studed the optmal control problem of dscrete-tme nonlnear systems usng value teraton algorthm. Specfcally, polcy teraton represents a class of algorthms contanng two basc teratons,.e., polcy evaluaton and polcy mprovement [41] [46]. Abu-Khalaf and Lews [41] derved an offlne optmal control scheme for nonlnear systems wth saturatng actuators. Then, Vrabe and Lews [42] and Vamvoudaks and Lews [43] used onlne polcy teraton algorthm to study the nfnte horzon optmal control of contnuous-tme nonlnear systems, respectvely. The former was performed based on the sequental updates of two neural networks, namely, crtc network and acton network, whereas n the latter, the two networks were traned smultaneously. Recently, Lu et al. [44] extended the polcy teraton algorthm to nonlnear optmal control problem wth unknown nternal dynamcs and dscounted cost functon. Besdes, Bhasn et al. [46] constructed an actor-crtc-dentfer archtecture to deal wth the nfnte horzon optmal control of uncertan nonlnear systems, characterzed by the ntroducton of a robust dynamc neural network. In ths paper, we employ the onlne polcy teraton algorthm to tackle the decentralzed control of a class of nonlnear nterconnected systems. To desgn the decentralzed control scheme of the overall system, the optmal controllers of the solated subsystems are desgned at frst wth the cost functons modfed to account for the nterconnectons. Then, the decentralzed control strategy can be establshed by addng approprate feedback gans to the local optmal control polces. Next, the onlne polcy teraton algorthm s developed to solve the HJB equatons related to the optmal control by constructng and tranng some crtc networks. It s shown that the approxmate closed-form expressons of the optmal control polces are avalable. Hence, there s no need to buld acton networks. Addtonally, the unform ultmate boundedness (UUB) of the dynamcs of the weght estmaton errors s analyzed usng the Lyapunov approach. Remarkably, consderng the effectveness of ADP and renforcement learnng technques n solvng the nonlnear optmal control problem, the decentralzed control approach establshed here s natural and convenent. More mportantly, t can be employed to stablze a broad class of nonlnear large-scale systems. Ths paper s organzed as follows. In Secton II, the decentralzed control problem of the large-scale system s descrbed. In Secton III, the optmal control of solated subsystems s presented n the framework of HJB equatons, based on whch, the decentralzed control strategy can be developed. In Secton IV, the onlne polcy teraton algorthm s ntroduced to solve the HJB equatons wth convergence analyss. In addton, crtc networks are constructed for facltatng the mplementaton of onlne algorthm. The UUB of the dynamcs of the weght estmaton errors s proved as well. In Secton V, an example s gven to demonstrate the effectveness of the establshed approach. In Secton VI, concludng remarks are provded. II. PROBLEM STATEMENT In ths paper, we study a class of contnuous-tme nonlnear large-scale systems composed of N nterconnected subsystems Fg. 1. Structural dagram of the decentralzed control problem of the nterconnected system. descrbed by ẋ (t) = f (x (t)) + g (x (t)) ( ū (x (t)) + Z (x(t)) ) = 1, 2,...,N (1) where x (t) R n and ū (x (t)) R m are the state and control vectors of the th subsystem, respectvely. In large-scale system (1), x = [ x1 T x 2 T... xn T ] T R n denotes the overall state, where n = N =1 n. Correspondngly, x 1, x 2,..., x N are called local states, whereas ū 1 (x 1 ), ū 2 (x 2 ),..., ū N (x N ) are local controls. Note that for subsystem, f (x ), g (x ),and g (x ) Z (x) represent the nonlnear nternal dynamcs, nput gan matrx, and nterconnected term, respectvely. Let x (0) = x 0 be the ntal state of the th subsystem, = 1, 2,..., N. Addtonally, we let the followng assumptons hold throughout ths paper. Assumpton 1: The state vector x = 0 s the equlbrum of the th subsystem, = 1, 2,...,N. Assumpton 2: The functons f ( ) and g ( ) are dfferentable n ther arguments wth f (0) = 0, where = 1, 2,...,N. Assumpton 3: The feedback control vector ū (x ) = 0 when x = 0, where = 1, 2,...,N. Let R R m m, = 1, 2,...,N, be symmetrc postve defnte matrces. Then, we denote Z (x) = R 1/2 Z (x), where Z (x) R m, = 1, 2,...,N, are bounded as follows: Z (x) ρ j h j (x j ), = 1, 2,...,N. (2) j=1 In (2), ρ j are nonnegatve constants and h j (x j ) are postve semdefnte functons wth, j = 1, 2,...,N. If we defne h (x ) = max{h 1 (x ), h 2 (x ),...,h N (x )}, = 1, 2,...,N, then (2) can be formulated as Z (x) λ j h j (x j ), = 1, 2,...,N (3) j=1 where λ j ρ j h j (x j )/h j (x j ),, j = 1, 2,...,N, arealso nonnegatve constants. When dealng wth the decentralzed control problem, we am at fndng N control polces ū 1 (x 1 ), ū 2 (x 2 ),..., ū N (x N ) to stablze the large-scale system (1). It s mportant to note that n the control par (ū 1 (x 1 ), ū 2 (x 2 ),...,ū N (x N )), ū (x ) s only a functon of the correspondng local state, namely x, where = 1, 2,..., N. The schematc dagram of the decentralzed control problem s shown n Fg. 1.

3 420 IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, VOL. 25, NO. 2, FEBRUARY 2014 III. DECENTRALIZED CONTROLLER DESIGN VIA OPTIMAL CONTROL SCHEME In ths secton, we nvestgate the methodology for decentralzed controller desgn. Two sectons are ncluded n ths part. In the frst secton, the optmal control of the solated subsystems s descrbed under the framework of HJB equatons, whereas n the second secton, the decentralzed control strategy can be constructed based on the optmal control polces. A. Optmal Control and the HJB Equatons Now, we consder the N solated subsystems correspondng to (1) that are gven by ẋ (t) = f (x (t)) + g (x (t))u (x (t)), = 1, 2,...,N. (4) For the th solated subsystem, we further assume that f + g u s Lpschtz contnuous on a set n R n contanng the orgn, and the subsystem s controllable n the sense that there exsts a contnuous control polcy on that asymptotcally stablzes the subsystem. In ths paper, to deal wth the nfnte horzon optmal control problem, we have to fnd the control polces u (x ), = 1, 2,...,N, whch mnmze the local cost functons { J (x 0 ) = Q 2 (x (τ)) + u T (x (τ))r u (x (τ)) } dτ 0 = 1, 2,...,N (5) where Q (x ), = 1, 2,...,N, are postve defnte functons satsfyng h (x ) Q (x ), = 1, 2,...,N. (6) Based on optmal control theory, here, the desgned feedback controls must not only stablze the subsystems on, = 1, 2,...,N, but also guarantee that the cost functons (5) are fnte. In other words, the control polces must be admssble. Below s the defnton of admssble control. Defnton 1: Consder the solated subsystem, a control polcy μ (x ) s defned as admssble wth respect to (5) on, denoted by μ ( ),fμ (x ) s contnuous on, μ (0) = 0, u (x ) = μ (x ) stablzes (4) on,and J (x 0 ) s fnte for all x 0. For any set of admssble control polces μ ( ), = 1, 2,...,N, f the assocated cost functons { V (x 0 ) = Q 2 (x (τ)) + μ T (x (τ))r μ (x (τ)) } dτ 0 = 1, 2,...,N (7) are contnuously dfferentable, then the nfntesmal versons of (7) are the so-called nonlnear Lyapunov equatons 0 = Q 2 (x ) + μ T (x )R μ (x ) + ( V (x )) T ( f (x ) + g (x )μ (x )), = 1, 2,...,N (8) wth V (0) = 0, = 1, 2,...,N. In (8), the terms V (x ), = 1, 2,...,N, denote the partal dervatves of the local cost functons V (x ) wth respect to local states x,.e., V (x ) = V (x )/ x,where = 1, 2,...,N. Defne the Hamltonan functons of the N solated subsystems as follows: H (x,μ, V (x )) = Q 2 (x ) + μ T (x )R μ (x ) + ( V (x )) T ( f (x ) + g (x )μ (x )) (9) where = 1, 2,...,N. The optmal cost functons of the N solated subsystems can be formulated as J (x { 0) = mn Q 2 (x (τ)) + μ T (x (τ))r μ ( ) 0 μ (x (τ)) } dτ, = 1, 2,...,N. (10) In vew of optmal control theory, the optmal cost functons J (x ), = 1, 2,...,N, satsfy the HJB equatons 0 = mn H (x,μ, J (x )), = 1, 2,...,N (11) μ ( ) where J (x ) = J (x )/ x, = 1, 2,...,N. Assume that the mnma on the rght hand sde of (11) exst and are unque. Then, the optmal control polces for the N solated subsystems are u (x ) = arg mn H (x,μ, J (x )) μ ( ) = 1 2 R 1 g T (x ) J (x ), = 1, 2,...,N. (12) Substtutng the optmal control polces (12) nto the nonlnear Lyapunov equatons (8), we can obtan the formulaton of the HJB equatons n terms of J (x ), = 1, 2,...,N, as follows: 0 = Q 2 (x ) + ( J (x )) T f (x ) 1 4 ( J (x )) T g (x)r 1 g T (x ) J (x ) (13) wth J (0) = 0and = 1, 2,...,N. Remark 1: The formulas developed n (12) dsplay an array of closed-form expressons of the optmal control polces, whch obvates the need to search for the optmal control polces va optmzaton process. However, the knowledge of J (x ), = 1, 2,...,N, s requred, whch mples the mportance of the solutons of HJB equatons. B. Decentralzed Control Strategy Accordng to (12), we have expressed the optmal control polces,.e., u 1 (x 1), u 2 (x 2),..., u N (x N ), for the N solated subsystems (4). In the followng, we wll show that by proportonally ncreasng some local feedback gans, a stablzng decentralzed control scheme can be establshed for the nterconnected system (1). Now, we gve the followng lemma, ndcatng how the feedback gans can be added, to guarantee the asymptotc stablty of the solated subsystems. Lemma 1: Consder the solated subsystems (4), the feedback controls ū (x ) = π u (x ) = 1 2 π R 1 g T (x ) J (x ), = 1, 2,...,N (14)

4 LIU et al.: ONLINE LEARNING OPTIMAL CONTROL APPROACH 421 ensure that the N closed-loop solated subsystems are asymptotcally stable for all π 1/2, where = 1, 2,...,N. Proof: The lemma can be proved by showng J (x ), = 1, 2,..., N, are Lyapunov functons. Frst of all, n lght of (10), we can fnd that J (x )>0foranyx = 0 and J (x ) = 0whenx = 0, whch mples that J (x ), = 1, 2,..., N, are postve defnte functons. Next, the dervatves of J (x ), = 1, 2,...,N, along the correspondng trajectores of the closed-loop solated subsystems are gven by J (x ) = ( J (x )) T ẋ = ( J (x )) T ( f (x ) + g (x )ū (x )) (15) where = 1, 2,..., N. Then, by addng and subtractng (1/2)( J (x )) T g (x )u (x ) to (15) and consderng (12) (14), we have J (x ) = ( J (x )) T f (x ) 1 4 ( J (x )) T g (x )R 1 ) ( J (x )) T ( π g (x )R 1 g T (x ) J (x ) g T (x ) J (x ) = Q 2 (x ) 1 ( π 1 ) R 1/2 g T (x ) J 2 2 (x ) 2 (16) where = 1, 2,..., N. Observng (16), we can obtan that J (x ) < 0 for all π 1/2 and x = 0, where = 1, 2,...,N. Therefore, the condtons for Lyapunov local stablty theory are satsfed and the proof s completed. Remark 2: Lemma 1 reveals that any feedback controls ū (x ), = 1, 2,...,N, can ensure the asymptotc stablty of the closed-loop solated subsystems as long as π 1/2, = 1, 2,...,N. However, only when π = 1, = 1, 2,...,N, the feedback controls are optmal. In fact, smlar results have been gven n [47] [49], showng that the optmal controls u (x ), = 1, 2,...,N, are robust n the sense that they have nfnte gan margns. Now, we present the man theorem of ths paper based on that the acqured decentralzed control strategy can be establshed. Theorem 1: For nterconnected system (1), there exst N postve numbers π > 0, = 1, 2,...,N, such that for any π π, = 1, 2,...,N, the feedback controls developed by (14) ensure that the closed-loop nterconnected system s asymptotcally stable. In other words, the control par (ū 1 (x 1 ), ū 2 (x 2 ),..., ū N (x N )) s the decentralzed control strategy of large-scale system (1). Proof: In accordance wth Lemma 1, we observe that J (x ), = 1, 2,...,N, are all Lyapunov functons. Here, we select a composte Lyapunov functon gven by L(x) = =1 θ J (x ) (17) where θ, = 1, 2,...,N, are arbtrary postve constants. Takng the tme dervatve of L(x) along the trajectores of the closed-loop nterconnected system, we can obtan L(x) = = =1 =1 θ J (x ) θ { ( J (x )) T ( f (x ) + g (x )ū (x )) + ( J (x )) T g (x ) Z (x) }. (18) Then, takng (3), (6), and (16) nto consderaton, (18) can be turned nto the followng form: L(x) θ {Q 2 (x ) =1 + 1 ( π 1 ) R 1/2 g T 2 2 (x ) J (x ) } ( J (x )) T g (x )R 1/2 Z (x) θ {Q 2 (x ) =1 Here, we denote + 1 ( π 1 ) ( J 2 2 ( J (x )) T g (x )R 1/2 2 (x )) T g (x )R 1/2 j=1 2 } λ j Q j (x j ). (19) = dag{θ 1,θ 2,..., θ N } (20) λ 11 λ 12 λ 1N λ 21 λ 22 λ 2N =..... (21). λ N1 λ N2 λ NN and { ( 1 = dag π 1 1 ), 1 ( π 2 1 ),..., 1 ( π N 1 )}. (22) Therefore, through ntroducng a 2N-dmensonal vector Q 1 (x 1 ) Q 2 (x 2 ). Q N (x N ) ξ = ( J 1 (x 1)) T g 1 (x 1 )R 1/2 1 (23) ( J2 (x 2)) T g 2 (x 2 )R 1/2 2. ( JN (x N )) T g N (x N )R 1/2 N

5 422 IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, VOL. 25, NO. 2, FEBRUARY 2014 we can transform (19) to the followng compact form: L(x) ξ T 1 2 T 1 ξ 2 ξ T A ξ. (24) Accordng to (24), suffcently large π, = 1, 2,...,N, can be chosen such that the matrx A s postve defnte. That s to say, there exst π, = 1, 2,...,N, sothatany π π, = 1, 2,...,N, are large enough to guarantee the postve defnteness of A. Then, we have L(x) <0. Therefore, the closed-loop nterconnected system s asymptotcally stable under the acton of the control par (ū 1 (x 1 ), ū 2 (x 2 ),...,ū N (x N )). The proof s completed. Clearly, the focal pont of desgnng the decentralzed control strategy becomes to derve the optmal controllers for the N solated subsystems on the bass of Theorem 1. Then, we should put our emphass on solvng the HJB equatons, whch yet, s regarded as a dffcult task [12], [13]. Hence, n the followng, we wll employ a more pragmatc approach to obtan the approxmate solutons based on onlne polcy teraton algorthm and neural network technques. IV. NEURAL-NETWORK-BASED LEARNING OPTIMAL CONTROL OF THE ISOLATED SUBSYSTEMS USING ONLINE POLICY ITERATION ALGORITHM Three sectons are ncluded here. In the frst secton, the onlne polcy algorthm s ntroduced to tackle the optmal control problem of the solated subsystems, whereas the neural network mplementaton process s gven n the second secton. The stablty proof of the dynamcs of the estmaton errors s developed n the last secton. A. Onlne Polcy Iteraton Algorthm and Its Convergence Here, the onlne polcy teraton algorthm s ntroduced to solve the HJB equatons. The polcy teraton algorthm conssts of polcy evaluaton based on (8) and polcy mprovement based on (12), as shown n [22]. Specfcally, ts teraton procedure can be descrbed as follows. Step 1: Choose a small postve number ɛ. Letp = 0and V (0) (x ) = 0, where = 1, 2,...,N. Then, start wth N ntal admssble control polces μ (0) 1 (x 1), μ (0) 2 (x 2),..., μ (0) N (x N ). Step 2: Based on the control polces μ (p) (x ), = 1, 2,...,N, solve the followng nonlnear Lyapunov equatons ( ) 0 = Q 2 (x ) + μ (p) T (x ) R μ (p) (x ) ( ) + V (p+1) T ( ) (x ) f (x ) + g (x )μ (p) (x ) (25) wth V (p+1) (0) = 0and = 1, 2,...,N. Step 3: Update the control polces va μ (p+1) (x ) = 1 2 R 1 where = 1, 2,...,N. g T (x ) V (p+1) (x ) (26) Step 4: If V (p+1) (x ) V (p) (x ) ɛ, = 1, 2,...,N, stop and obtan the approxmate optmal controls of the N solated subsystems; else, let p = p + 1and go back to Step 2. Note that N ntal admssble control polces are requred n the above algorthm. In the followng, we present the convergence analyss of the onlne polcy teraton algorthm for the solated subsystems. Theorem 2: Consder the N solated subsystems (4), gven N ntal admssble control polces μ (0) 1 (x 1), μ (0) 2 (x 2),..., μ (0) N (x N ). Then, usng the polcy teraton algorthm establshed n (25) and (26), the cost functons and control polces converge to the optmal ones as p,.e., V (p) (x ) J (x ) and μ (p) (x ) u (x ) as p,where = 1, 2,...,N. Proof: Frst, we consder the subsystem. Accordng to [41] and [44], when gven an ntal admssble control polcy μ (0) (x ),wehaveμ (p) (x ) ( ) for any p 0. Addtonally, for any ζ>0, there exsts an nteger p 0,such that for any p p 0, the formulas sup V (p) (x ) J (x ) <ζ (27) x and sup μ (p) (x ) u (x ) <ζ (28) x hold smultaneously. Next, we consder the N solated subsystems. When gven μ (0) 1 (x 1), μ (0) 2 (x 2),..., μ (0) N (x N ), whereμ (0) (x ) s the ntal admssble control polcy correspondng to the th subsystem, we can acqure that μ (p) (x ) ( ) for any p 0, where = 1, 2,..., N. In addton, we denote p 0 = max{p 01, p 02,...,p 0N }. Thus, we can conclude that for any ζ > 0, there exsts an nteger p 0, such that for any p p 0, (27) and (28) are true wth = 1, 2,...,N. In other words, the algorthm wll converge to the optmal cost functons and optmal control polces of the N solated subsystems. The proof s completed. B. Implementaton Procedure va Neural Networks For the N solated subsystems, assume that the cost functons V (x ), = 1, 2,...,N, are contnuously dfferentable. Then, accordng to the unversal approxmaton property of neural networks, V (x ) can be reconstructed by a sngle-layer neural network on a compact set as V (x ) = ω T c σ c(x ) + ε c (x ), = 1, 2,...,N (29) where ω c R l s the deal weght, σ c (x ) R l s the actvaton functon, l s the number of neurons n the hdden layer, and ε c (x ) s the approxmaton error of the th neural network, = 1, 2,...,N. The dervatves of the cost functons wth respect to ther state vectors are formulated as V (x ) = ( σ c (x )) T ω c + ε c (x ), = 1, 2,...,N (30)

6 LIU et al.: ONLINE LEARNING OPTIMAL CONTROL APPROACH 423 where σ c (x ) = σ c (x )/ x R l n and ε c (x ) = ε c (x )/ x R n are the gradent of the actvaton functon and approxmaton error of the th neural network, respectvely, = 1, 2,...,N. Based on (30), the Lyapunov equatons (8) becomes 0 = Q 2 (x ) +μ T R μ + ( ω T c σ c(x ) + ( ε c (x )) T ) ẋ (31) where = 1, 2,...,N. For the th neural network, = 1, 2,...,N, assume that the neural network weght vector ω c, the gradent σ c (x ), and the approxmaton error ε c (x ) and ts dervatve ε c (x ) are all bounded on the compact set. In addton, accordng to [43], we have ε c (x ) 0and ε c (x ) 0asl, where = 1, 2,...,N. Because the deal weghts are unknown, N crtc neural networks can be bult to approxmate the cost functons as follows: ˆV (x ) =ˆω T c σ c(x ), = 1, 2,...,N (32) where ˆω c, = 1, 2,...,N, s the estmated weghts. Here, σ c (x ), = 1, 2,...,N, s selected such that ˆV (x )>0for any x = 0and ˆV (x ) = 0whenx = 0. Smlarly, the dervatves of the approxmate cost functons wth respect to the state vectors can be expressed by ˆV (x ) = ( σ c (x )) T ˆω c, = 1, 2,...,N (33) where ˆV (x ) = ˆV (x )/ x, = 1, 2,...,N. Then, the approxmate Hamltonan functons can be expressed as H (x,μ, ˆω c ) = Q 2 (x ) + μ T R μ +ˆω T c σ c(x )ẋ = e c, = 1, 2,...,N. (34) For the purpose of tranng the crtc networks of the solated subsystems, t s desred to desgn ˆω c, = 1, 2,...,N, to mnmze the followng objectve functons: E c = 1 2 et c e c, = 1, 2,...,N. (35) The standard steepest descent algorthm s ntroduced to tune the crtc networks, then ther weghts are updated through [ ] Ec ˆω c = α c, = 1, 2,...,N (36) ˆω c where α c > 0, = 1, 2,...,N, s the learnng rates of the crtc networks. On the other hand, based on (30), the Hamltonan functons take the followng forms: H (x,μ,ω c ) = Q 2 (x ) + μ T R μ + ω T c σ c(x )ẋ = e ch, = 1, 2,...,N (37) where e ch = ( ε c (x )) T ẋ, = 1, 2,...,N, s the resdual errors because of the neural network approxmaton. Denote δ = σ c (x )ẋ, = 1, 2,...,N. We assume that there exst N postve constants δ M, = 1, 2,...,N, such that δ δ M, = 1, 2,...,N. (38) In addton, we defne the weght estmaton errors of the crtc networks as ω c = ω c ˆω c,where = 1, 2,...,N. Then, combnng (34) wth (37) yelds e ch e c = ω T c δ, = 1, 2,...,N. (39) Therefore, the dynamcs of the weght estmaton errors can be gven as follows: ω c = α c (e ch ω c T δ )δ, = 1, 2,...,N. (40) Incdentally, the persstency of exctaton condton s requred to tune the th crtc network to guarantee that δ δ m, where δ m, = 1, 2,...,N, are postve constants. Thus, a set of probng noses wll be added to the solated subsystems to satsfy the condton n practce. When mplementng the onlne polcy teraton algorthm, to accomplsh the polcy mprovement, we should obtan the control polces that mnmze the current cost functons. Hence, accordng to (12) and (30), we have μ (x ) = 1 2 R 1 g T (x ) V (x ) = 1 2 R 1 g T (x ) ( ( σ c (x )) T ω c + ε c (x ) ) (41) where = 1, 2,..., N. Correspondngly, the approxmate control polces can be obtaned by ˆμ (x ) = 1 2 R 1 g T (x ) ˆV (x ) = 1 2 R 1 g T (x )( σ c (x )) T ˆω c (42) where = 1, 2,...,N. Remark 3: Accordng to (42), t s obvous to observe that the approxmate control polces of the N solated subsystems can be derved drectly based on the traned crtc networks. Therefore, unlke the tradtonal actor-crtc archtecture, the acton neural networks are not requred any more. C. Stablty Analyss When consderng the crtc networks, the weght estmaton dynamcs are UUB as descrbed n the followng theorem. Theorem 3: For the N solated subsystems (4), the weght update laws for tunng the crtc networks are gven by (36). Then, the dynamcs of the weght estmaton errors of the crtc networks are UUB. Proof: Choose N Lyapunov functon canddates descrbed as follows: L (t) = 1 tr ( ω c T α ω ) c, = 1, 2,...,N. (43) c The tme dervatves of the Lyapunov functons L (t), = 1, 2,...,N, along the trajectores of the error dynamcs (40) are L (t) = 2 tr ( ω T α ω ) c c c = 2 tr ( ω c T α α ( c ech ω c T δ ) ) δ (44) c

7 424 IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, VOL. 25, NO. 2, FEBRUARY 2014 Fg. 2. Convergence of the weght vector of the crtc network 1 (ω ac11, ω ac12,andω ac13 represent ˆω c11, ˆω c12,and ˆω c13, respectvely). Fg D plot of the approxmaton error of the cost functon of solated subsystem 1,.e., J1 (x 1) ˆV 1 (x 1 ). Fg. 3. Convergence of the weght vector of the crtc network 2 (ω ac21, ω ac22,andω ac23 represent ˆω c21, ˆω c22,and ˆω c23, respectvely). Fg D plot of the approxmaton error of the control polcy of solated subsystem 1,.e., u 1 (x 1) ˆμ 1 (x 1 ). where = 1, 2,...,N. After some basc manpulatons, t yelds L (t) (2 α c ) ω T c δ ech 2 (45) α c where = 1, 2,..., N. In vew of the Cauchy Schwarz nequalty and (38), we can conclude that L (t) <0 as long as 0 <α c < 2 e 2 (46) ch ω c > α c (2 α c )δ 2 M where = 1, 2,..., N. In accordance wth the Lyapunov stablty theory, we obtan that the dynamcs of the weght estmaton errors of the crtc networks are all UUB. Meanwhle, the norms of the weght estmaton errors are bounded as well. The proof s completed. Remark 4: Let ˆū (x ) = π ˆμ (x ), where ˆμ (x ), = 1, 2,..., N, are obtaned by (42). Accordng to the selectons of the actvaton functons of the crtc networks, we can easly fnd that the approxmate optmal cost functons ˆV (x ), = 1, 2,...,N, are also Lyapunov functons. Furthermore, smlar to the proof of Theorem 1, we have L(x) ξ T A ξ + e,where e s the sum of the approxmaton errors. Hence, we can conclude that based on the approxmate optmal control polces ˆμ (x ), = 1, 2,...,N, the developed control par ( ˆū 1 (x 1 ), ˆū 2 (x 2 ),..., ˆū N (x N )) can ensure the UUB of the state trajectores of the closed-loop nterconnected system. It s n ths sense that we accomplsh the desgn of the decentralzed control scheme by adoptng the learnng optmal control approach based on onlne polcy teraton algorthm. Remark 5: Note that the controller presented here s a decentralzed stablzaton one. Though the optmal decentralzed controller of nterconnected systems has been studed before [50], n ths paper, we am at developng a novel decentralzed control strategy based on ADP. How to extend the present results to the desgn of optmal decentralzed control for nonlnear nterconnected systems s part of our future research.

8 LIU et al.: ONLINE LEARNING OPTIMAL CONTROL APPROACH 425 Fg D plot of the approxmaton error of the cost functon of solated subsystem 2,.e., J2 (x 2) ˆV 2 (x 2 ). Fg D plot of the approxmaton error of the control polcy of solated subsystem 2,.e., u 2 (x 2) ˆμ 2 (x 2 ). V. SIMULATION STUDY A smulaton example s provded n ths secton to show the applcablty of the decentralzed control strategy establshed n ths paper. Consder the followng contnuous-tme nonlnear largescale system consstng of two nterconnected subsystems: [ ] x ẋ 1 = 11 + x x x x 12 (cos(2x 11 ) + 2) 2 [ ] 0 ( ) + ū cos(2x 11 ) (x 1 )+(x 11+x 22) sn x12 2 cos(0.5x 21) [ ] x ẋ 2 = 22 x x x21 2 x 22 [ ] 0 ( + ū x 2 (x 2 ) + 0.5(x 12 + x 22 ) cos ( ) ) e x2 21 (47) 21 where x 1 =[x 11 x 12 ] T R 2 and ū 1 (x 1 ) R are the state and control varables of subsystem 1, and x 2 = [x 21 x 22 ] T R 2 and ū 2 (x 2 ) R are the state and control varables of subsystem 2. Let R 1 = R 2 = I, wherei denotes the dentty matrx wth sutable dmenson. Addtonally, let h 1 (x 1 ) = x 1 and h 2 (x 2 ) = x 22. Then, we fnd that Z 1 (x) and Z 2 (x) wth x =[x1 T x 2 T ]T are upper bounded as n (3). For example, we can select λ 11 = λ 12 = 1andλ 21 = λ 22 = 1/2. To desgn the decentralzed controller of nterconnected system (47), we frst deal wth the optmal control problem of two solated subsystems. Here, we choose Q 1 (x 1 ) = x 1 and Q 2 (x 2 ) = x 22. Hence, the cost functons of the optmal control problem are { J 1 (x 10 ) = x x ut 1 u } 1 dτ (48) and J 2 (x 20 ) = 0 0 { x u2 T u } 2 dτ. (49) We adopt the onlne polcy teraton algorthm to tackle the optmal control problem, where two crtc networks are constructed to approxmate the cost functons. We denote the weght vectors of the two crtc networks as Fg. 8. State trajectory of subsystem 1 under the acton of the decentralzed control strategy (π 1 ˆμ 1 (x 1 ), π 2 ˆμ 2 (x 2 )). ˆω c1 = [ˆω c11 ˆω c12 ˆω c13 ] T and ˆω c2 = [ˆω c21 ˆω c22 ˆω c23 ] T. Durng the smulaton process, the ntal weghts of the crtc networks are chosen randomly n [0, 2]. In addton, the actvaton functons of the two crtc networks are chosen as σ c1 (x 1 ) =[x 2 11 x 11x 12 x 2 12 ]T and σ c2 (x 2 ) =[x 2 21 x 21x 22 x 2 22 ]T. Besdes, let the learnng rates of the crtc networks be α c1 = α c2 = 0.1 and the ntal states of the two solated subsystems be x 10 = x 20 =[1 1] T. Durng the mplementaton process of the onlne polcy teraton algorthm, for each solated subsystem, we add a probng nose to satsfy the persstency of exctaton condton. We can observe that the convergence results of the weghts have occurred after 750 and 180 s, respectvely. Then, the probng sgnals are turned off. Actually, the weghts of the crtc networks converge to and ˆω c1 =[ ] T (50) ˆω c2 =[ ] T (51) that are shown n Fgs. 2 and 3.

9 426 IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, VOL. 25, NO. 2, FEBRUARY 2014 The cost functons can be approxmated by constructng several crtc networks and the expressons of the control polces can be obtaned drectly. In addton, the dynamcs of the estmaton errors are proved to be UUB. Smulaton study s presented to demonstrate the valdty of the decentralzed control strategy n the end. REFERENCES Fg. 9. State trajectory of subsystem 2 under the acton of the decentralzed control strategy (π 1 ˆμ 1 (x 1 ), π 2 ˆμ 2 (x 2 )). Based on the convergent weghts ˆω c1 and ˆω c2, we can obtan the approxmate optmal cost functon and control polcy for each solated subsystem, namely, ˆV 1 (x 1 ), ˆμ 1 (x 1 ), ˆV 2 (x 2 ),and ˆμ 2 (x 2 ). In comparson, for the method proposed n [43], the optmal cost functon and control polcy of solated subsystem 1 are J1 (x 1) = 0.5x x 12 2 and u 1 (x 1) = (cos(2x 11 ) + 2)x 12, respectvely. Smlarly, the optmal cost functon and control polcy of solated subsystem 2 are J2 (x 2) = x x 22 2 and u 2 (x 2) = x 21 x 22. Therefore, for solated subsystem 1, the error between the optmal cost functon and the approxmate one s shown n Fg. 4. In addton, the error between the optmal control polcy and the approxmate verson s shown n Fg. 5. It s clear to see that both the approxmaton errors are close to zero, whch verfes the good performance of the onlne learnng algorthm. When regardng the solated subsystem 2, we obtan the same smulaton results shown n Fgs. 6 and 7. Next, by choosng θ 1 = θ 2 = 1andπ 1 = π 2 = 2, we can guarantee the postve defnteness of the matrx A. Thus, (π 1 ˆμ 1 (x 1 ), π 2 ˆμ 2 (x 2 )) s the decentralzed control strategy of the orgnal nterconnected system (47). Here, we apply the decentralzed control scheme to controlled plant (47) for 40 s and obtan the evoluton processes of the state trajectores shown n Fgs. 8 and 9. Through zoomng n the state trajectores near the zero, t s demonstrated that the state trajectores of the closed-loop system are UUB. Obvously, these smulaton results authentcate the valdty of the decentralzed control approach developed n ths paper. VI. CONCLUSION In ths paper, a novel decentralzed control strategy s developed to deal wth the stablzaton problem of a class of contnuous-tme nonlnear large-scale systems usng onlne polcy teraton algorthm. Intally, the optmal controllers of the solated subsystems are desgned. Then, t s shown that the decentralzed control strategy of the overall system can be establshed by addng feedback gans to the obtaned optmal control polces. In addton, the onlne polcy teraton algorthm s ntroduced to solve the HJB equatons teratvely. [1] L. Bakule, Decentralzed control: An overvew, Annu. Rev. Control, vol. 32, no. 1, pp , Apr [2] D. D. Sljak and A. I. Zecevc, Control of large-scale systems: Beyond decentralzed feedback, Annu. Rev. Control, vol. 29, no. 2, pp , Dec [3] J. Lavae, Decentralzed mplementaton of centralzed controllers for nterconnected systems, IEEE Trans. Autom. Control, vol. 57, no. 7, pp , Jul [4] H. F. Grp, A. Saber, and T. A. Johansen, Observers for nterconnected nonlnear and lnear systems, Automatca, vol. 48, no. 7, pp , Jul [5] S. Mehraeen, S. Jagannathan, and M. L. Crow, Power system stablzaton usng adaptve neural network-based dynamc surface control, IEEE Trans. Power Syst., vol. 26, no. 2, pp , May [6] K. Kals, J. Lan, and S. H. Zak, Decentralzed dynamc output feedback control of nonlnear nterconnected systems, IEEE Trans. Autom. Control, vol. 55, no. 8, pp , Aug [7] Z. G. Hou, M. M. Gupta, P. N. Nkforuk, M. Tan, and L. Cheng, A recurrent neural network for herarchcal control of nterconnected dynamc systems, IEEE Trans. Neural Netw., vol. 18, no. 2, pp , Mar [8] A. Saber, On optmalty of decentralzed control for a class of nonlnear nterconnected systems, Automatca, vol. 24, no. 1, pp , Jan [9] P. J. Werbos, Advanced forecastng methods for global crss warnng and models of ntellgence, n Proc. General Syst., Jun. 1977, pp [10] P. J. Werbos, Approxmate dynamc programmng for real-tme control and neural modelng, n Handbook of Intellgent Control: Neural, Fuzzy, and Adaptve Approaches, D.A.WhteandD.A.Sofge,Eds. New York, NY, USA: Van Nostrand Renhold, 1992, ch. 13. [11] H. Zhang, D. Lu, Y. Luo, and D. Wang, Adaptve Dynamc Programmng for Control: Algorthms and Stablty. London, U.K.: Sprnger-Verlag, [12] F. Y. Wang, H. Zhang, and D. Lu, Adaptve dynamc programmng: An ntroducton, IEEE Comput. Intell. Mag., vol. 4, no. 2, pp , May [13] F. L. Lews and D. Vrabe, Renforcement learnng and adaptve dynamc programmng for feedback control, IEEE Crcuts Syst. Mag., vol. 9, no. 3, pp , Jul [14] J. Fu, H. He, and X. Zhou, Adaptve learnng and control for MIMO system based on adaptve dynamc programmng, IEEE Trans. Neural Netw., vol. 22, no. 7, pp , Jul [15] F. Y. Wang, N. Jn, D. Lu, and Q. We, Adaptve dynamc programmng for fnte-horzon optmal control of dscrete-tme nonlnear systems wth ε-error bound, IEEE Trans. Neural Netw., vol. 22, no. 1, pp , Jan [16] F. L. Lews and D. Lu, Renforcement Learnng and Approxmate Dynamc Programmng for Feedback Control. New York, NY, USA: Wley, [17] S. N. Balakrshnan, J. Dng, and F. L. Lews, Issues on stablty of ADP feedback controllers for dynamc systems, IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 38, no. 4, pp , Aug [18] A. Al-Tamm, F. L. Lews, and M. Abu-Khalaf, Dscrete-tme nonlnear HJB soluton usng approxmate dynamc programmng: Convergence proof, IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 38, no. 4, pp , Aug [19] D. P. Bertsekas, M. L. Homer, D. A. Logan, S. D. Patek, and N. R. Sandell, Mssle defense and nterceptor allocaton by neurodynamc programmng, IEEE Trans. Syst., Man, Cybern. A, Syst. Humans, vol. 30, no. 1, pp , Jan [20] J. S and Y. T. Wang, On-lne learnng control by assocaton and renforcement, IEEE Trans. Neural Netw., vol. 12, no. 2, pp , Mar [21] D. V. Prokhorov and D. C. Wunsch, Adaptve crtc desgns, IEEE Trans. Neural Netw., vol. 8, no. 5, pp , Sep

10 LIU et al.: ONLINE LEARNING OPTIMAL CONTROL APPROACH 427 [22] R. S. Sutton and A. G. Barto, Renforcement Learnng: An Introducton. Cambrdge, MA, USA: MIT Press, [23] H. Zhang, Y. Luo, and D. Lu, Neural-network-based near-optmal control for a class of dscrete-tme affne nonlnear systems wth control constrants, IEEE Trans. Neural Netw., vol. 20, no. 9, pp , Sep [24] H. Zhang, Q. We, and D. Lu, An teratve adaptve dynamc programmng method for solvng a class of nonlnear zero-sum dfferental games, Automatca, vol. 47, no. 1, pp , Jan [25] D. Wang, D. Lu, and Q. We, Fnte-horzon neuro-optmal trackng control for a class of dscrete-tme nonlnear systems usng adaptve dynamc programmng approach, Neurocomputng, vol. 78, no. 1, pp , Feb [26] D. Lu, D. Wang, D. Zhao, Q. We, and N. Jn, Neural-network-based optmal control for a class of unknown dscrete-tme nonlnear systems usng globalzed dual heurstc programmng, IEEE Trans. Autom. Sc. Eng., vol. 9, no. 3, pp , Jul [27] D. Lu, H. L, and D. Wang, H control of unknown dscretetme nonlnear systems wth control constrants usng adaptve dynamc programmng, n Proc. Int. Jont Conf. Neural Netw., Jun. 2012, pp [28] D. Lu, D. Wang, and X. Yang, An teratve adaptve dynamc programmng algorthm for optmal control of unknown dscrete-tme nonlnear systems wth constraned nputs, Inf. Sc., vol. 220, no. 20, pp , Jan [29] T. Derks and S. Jagannathan, Onlne optmal control of affne nonlnear dscrete-tme systems wth unknown nternal dynamcs by usng tmebased polcy update, IEEE Trans. Neural Netw. Learn. Syst., vol. 23, no. 7, pp , Jul [30] Y. Jang and Z. P. Jang, Robust adaptve dynamc programmng for large-scale systems wth an applcaton to multmachne power systems, IEEE Trans. Crcuts Syst. II, Exp. Brefs, vol. 59, no. 10, pp , Oct [31] H. Xu, S. Jagannathan, and F. L. Lews, Stochastc optmal control of unknown lnear networked control system n the presence of random delays and packet losses, Automatca, vol. 48, no. 6, pp , Jun [32] S. Mehraeen and S. Jagannathan, Decentralzed optmal control of a class of nterconnected nonlnear dscrete-tme systems by usng onlne Hamlton-Jacob-Bellman formulaton, IEEE Trans. Neural Netw., vol. 22, no. 11, pp , Nov [33] J. W. Park, R. G. Harley, and G. K. Venayagamoorthy, Decentralzed optmal neuro-controllers for generaton and transmsson devces n an electrc power network, Eng. Appl. Artf. Intell., vol. 18, no. 1, pp , Feb [34] J. Lang, G. K. Venayagamoorthy, and R. G. Harley, Wde-area measurement based dynamc stochastc optmal power flow control for smart grds wth hgh varablty and uncertanty, IEEE Trans. Smart Grd, vol. 3, no. 1, pp , Mar [35] H. N. Wu and B. Luo, Neural network based onlne smultaneous polcy update algorthm for solvng the HJI equaton n nonlnear H control, IEEE Trans. Neural Netw. Learn. Syst., vol. 23, no. 12, pp , Dec [36] S. G. Khan, G. Herrmann, F. L. Lews, T. Ppe, and C. Melhush, Renforcement learnng and optmal adaptve control: An overvew and mplementaton examples, Annu. Rev. Contorl, vol. 36, no. 1, pp , Apr [37] Z. N, H. He, and J. Wen, Adaptve learnng n trackng control based on the dual crtc network desgn, IEEE Trans. Neural Netw. Learn. Syst., vol. 24, no. 6, pp , Jun [38] X. Xu, Z. Hou, C. Lan, and H. He, Onlne learnng control usng adaptve crtc desgns wth sparse kernel machnes, IEEE Trans. Neural Netw. Learn. Syst., vol. 24, no. 5, pp , May [39] Y. Jang and Z. P. Jang, Computatonal adaptve optmal control for contnuous-tme lnear systems wth completely unknown dynamcs, Automatca, vol. 48, no. 10, pp , Oct [40] A. Heydar and S. N. Balakrshnan, Fnte-horzon control-constraned nonlnear optmal control usng sngle network adaptve crtcs, IEEE Trans. Neural Netw. Learn. Syst., vol. 24, no. 1, pp , Jan [41] M. Abu-Khalaf and F. L. Lews, Nearly optmal control laws for nonlnear systems wth saturatng actuators usng a neural network HJB approach, Automatca, vol. 41, no. 5, pp , May [42] D. Vrabe and F. L. Lews, Neural network approach to contnuoustme drect adaptve optmal control for partally unknown nonlnear systems, Neural Netw., vol. 22, no. 3, pp , Apr [43] K. G. Vamvoudaks and F. L. Lews, Onlne actor-crtc algorthm to solve the contnuous-tme nfnte horzon optmal control problem, Automatca, vol. 46, no. 5, pp , May [44] D. Lu, X. Yang, and H. L, Adaptve optmal control for a class of contnuous-tme affne nonlnear systems wth unknown nternal dynamcs, n Neural Computng and Applcatons. New York, NY, USA: Sprnger-Verlag, Nov [45] S. Bhasn, M. Johnson, and W. E. Dxon, A model-free robust polcy teraton algorthm for optmal control of nonlnear systems, n Proc. 49th IEEE Conf. Decson Control, Dec. 2010, pp [46] S. Bhasn, R. Kamalapurkar, M. Johnson, K. G. Vamvoudaks, F. L. Lews, and W. E. Dxon, A novel actor crtc dentfer archtecture for approxmate optmal control of uncertan nonlnear systems, Automatca, vol. 49, no. 1, pp , Jan [47] S. T. Glad, On the gan margn of nonlnear and optmal regulators, IEEE Trans. Autom. Control, vol. 29, no. 7, pp , Jul [48] J. N. Tstskls and M. Athans, Guaranteed robustness propertes of multvarable nonlnear stochastc optmal regulators, IEEE Trans. Autom. Control, vol. 29, no. 8, pp , Aug [49] R. W. Beard, G. N. Sards, and J. T. Wen, Galerkn approxmatons of the generalzed Hamlton-Jacob-Bellman equaton, Automatca, vol. 33, no. 12, pp , Dec [50] D. D. Sljak, Decentralzed Control of Complex Systems. Boston, MA, USA: Academc, Derong Lu (S 91 M 94 SM 96 F 05) receved the B.S. degree n mechancal engneerng from the East Chna Insttute of Technology (now Nanjng Unversty of Scence and Technology), Nanjng, Chna, n 1982, the M.S. degree n automatc control theory and applcatons from the Insttute of Automaton, Chnese Academy of Scences, Bejng, Chna, n 1987, and the Ph.D. degree n electrcal engneerng from the Unversty of Notre Dame, Notre Dame, IN, USA, n He was a Product Desgn Engneer wth Chna North Industres Corporaton, Jln, Chna, from 1982 to He was an Instructor wth the Graduate School of the Chnese Academy of Scences, Bejng, from 1987 to He was a Staff Fellow wth the General Motors Research and Development Center, Warren, MI, USA, from 1993 to He was an Assstant Professor wth the Department of Electrcal and Computer Engneerng, Stevens Insttute of Technology, Hoboken, NJ, USA, from 1995 to He joned the Unversty of Illnos at Chcago, Chcago, IL, USA, n 1999, and became a Full Professor of electrcal and computer engneerng and computer scence n He was selected for the 100 Talents Program by the Chnese Academy of Scences n He has publshed 14 books (sx research monographs and eght edted volumes). Dr. Lu was an Assocate Edtor of Automatca from 2006 to He serves as an Assocate Edtor of Neurocomputng, Internatonal Journal of Neural Systems, Soft Computng, Neural Computng and Applcatons, Journal of Control Scence and Engneerng, andscence n Chna Seres F: Informaton Scences. He was an elected member of the Board of Governors of the Internatonal Neural Network Socety from 2010 to He s a Governng Board Member of Asa Pacfc Neural Network Assembly. He was a member of the Conference Edtoral Board of the IEEE Control Systems Socety from 1995 to 2000, an Assocate Edtor of the IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS-I: FUNDAMENTAL THEORY AND APPLICATIONS from 1997 to 1999, the IEEE TRANSACTIONS ON SIGNAL PROCESSING from 2001 to 2003, the IEEE TRANSACTIONS ON NEURAL NETWORKS from 2004 to 2009, the IEEE Computatonal Intellgence Magazne from 2006 to 2009, and the IEEE Crcuts and Systems Magazne from 2008 to 2009, and the Letters Edtor of the IEEE TRANSACTIONS ON NEURAL NETWORKS from 2006 to He was the Foundng Edtor of the IEEE Computatonal Intellgence Socety s Electronc Letter from 2004 to Currently, he s the Edtor-n-Chef of the IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS and an Assocate Edtor of the IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY and the IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS. He s the General Char of the 2014 IEEE World Congress on Computatonal Intellgence, Bejng, Chna. He was an elected AdCom member of the IEEE Computatonal Intellgence Socety from 2006 to He s the Char of IEEE CIS Bejng Chapter. He receved the Mchael J. Brck Fellowshp from the Unversty of Notre Dame n 1990, the Harvey N. Davs Dstngushed Teachng Award from Stevens Insttute of Technology n 1997, the Faculty Early Career Development (CAREER) Award from the Natonal Scence Foundaton n 1999, the Unversty Scholar Award from the Unversty of Illnos n 2006, and the Overseas Outstandng Young Scholar Award from the Natonal Natural Scence Foundaton of Chna n He s a member of Eta Kappa Nu and a fellow of the INNS.

11 428 IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, VOL. 25, NO. 2, FEBRUARY 2014 Dng Wang receved the B.S. degree n mathematcs from the Zhengzhou Unversty of Lght Industry, Zhengzhou, Chna, the M.S. degree n operatons research and cybernetcs from Northeastern Unversty, Shenyang, Chna, and the Ph.D. degree n control theory and control engneerng from the Insttute of Automaton, Chnese Academy of Scences, Bejng, Chna, n 2007, 2009, and 2012, respectvely. He s currently an Assstant Professor wth The State Key Laboratory of Management and Control for Complex Systems, Insttute of Automaton, Chnese Academy of Scences. Hs current research nterests nclude adaptve dynamc programmng, neural networks and learnng systems, and complex systems and ntellgent control. Honglang L receved the B.S. degree n mechancal engneerng and automaton from the Bejng Unversty of Posts and Telecommuncatons, Bejng, Chna, n He s currently pursung the Ph.D. degree wth The State Key Laboratory of Management and Control for Complex Systems, Insttute of Automaton, Chnese Academy of Scences, Bejng. He s wth the Unversty of Chnese Academy of Scences, Bejng. Hs current research nterests nclude machne learnng, neural networks, renforcement learnng, adaptve dynamc programmng, and game theory.

Off-policy Reinforcement Learning for Robust Control of Discrete-time Uncertain Linear Systems

Off-policy Reinforcement Learning for Robust Control of Discrete-time Uncertain Linear Systems Off-polcy Renforcement Learnng for Robust Control of Dscrete-tme Uncertan Lnear Systems Yonglang Yang 1 Zhshan Guo 2 Donald Wunsch 3 Yxn Yn 1 1 School of Automatc and Electrcal Engneerng Unversty of Scence

More information

THE GUARANTEED COST CONTROL FOR UNCERTAIN LARGE SCALE INTERCONNECTED SYSTEMS

THE GUARANTEED COST CONTROL FOR UNCERTAIN LARGE SCALE INTERCONNECTED SYSTEMS Copyrght 22 IFAC 5th rennal World Congress, Barcelona, Span HE GUARANEED COS CONROL FOR UNCERAIN LARGE SCALE INERCONNECED SYSEMS Hroak Mukadan Yasuyuk akato Yoshyuk anaka Koch Mzukam Faculty of Informaton

More information

Improved delay-dependent stability criteria for discrete-time stochastic neural networks with time-varying delays

Improved delay-dependent stability criteria for discrete-time stochastic neural networks with time-varying delays Avalable onlne at www.scencedrect.com Proceda Engneerng 5 ( 4456 446 Improved delay-dependent stablty crtera for dscrete-tme stochastc neural networs wth tme-varyng delays Meng-zhuo Luo a Shou-mng Zhong

More information

The Study of Teaching-learning-based Optimization Algorithm

The Study of Teaching-learning-based Optimization Algorithm Advanced Scence and Technology Letters Vol. (AST 06), pp.05- http://dx.do.org/0.57/astl.06. The Study of Teachng-learnng-based Optmzaton Algorthm u Sun, Yan fu, Lele Kong, Haolang Q,, Helongang Insttute

More information

EEE 241: Linear Systems

EEE 241: Linear Systems EEE : Lnear Systems Summary #: Backpropagaton BACKPROPAGATION The perceptron rule as well as the Wdrow Hoff learnng were desgned to tran sngle layer networks. They suffer from the same dsadvantage: they

More information

Wavelet chaotic neural networks and their application to continuous function optimization

Wavelet chaotic neural networks and their application to continuous function optimization Vol., No.3, 04-09 (009) do:0.436/ns.009.307 Natural Scence Wavelet chaotc neural networks and ther applcaton to contnuous functon optmzaton Ja-Ha Zhang, Yao-Qun Xu College of Electrcal and Automatc Engneerng,

More information

ON A DETERMINATION OF THE INITIAL FUNCTIONS FROM THE OBSERVED VALUES OF THE BOUNDARY FUNCTIONS FOR THE SECOND-ORDER HYPERBOLIC EQUATION

ON A DETERMINATION OF THE INITIAL FUNCTIONS FROM THE OBSERVED VALUES OF THE BOUNDARY FUNCTIONS FOR THE SECOND-ORDER HYPERBOLIC EQUATION Advanced Mathematcal Models & Applcatons Vol.3, No.3, 2018, pp.215-222 ON A DETERMINATION OF THE INITIAL FUNCTIONS FROM THE OBSERVED VALUES OF THE BOUNDARY FUNCTIONS FOR THE SECOND-ORDER HYPERBOLIC EUATION

More information

Neuro-Adaptive Design - I:

Neuro-Adaptive Design - I: Lecture 36 Neuro-Adaptve Desgn - I: A Robustfyng ool for Dynamc Inverson Desgn Dr. Radhakant Padh Asst. Professor Dept. of Aerospace Engneerng Indan Insttute of Scence - Bangalore Motvaton Perfect system

More information

Numerical Heat and Mass Transfer

Numerical Heat and Mass Transfer Master degree n Mechancal Engneerng Numercal Heat and Mass Transfer 06-Fnte-Dfference Method (One-dmensonal, steady state heat conducton) Fausto Arpno f.arpno@uncas.t Introducton Why we use models and

More information

Introduction. - The Second Lyapunov Method. - The First Lyapunov Method

Introduction. - The Second Lyapunov Method. - The First Lyapunov Method Stablty Analyss A. Khak Sedgh Control Systems Group Faculty of Electrcal and Computer Engneerng K. N. Toos Unversty of Technology February 2009 1 Introducton Stablty s the most promnent characterstc of

More information

LOW BIAS INTEGRATED PATH ESTIMATORS. James M. Calvin

LOW BIAS INTEGRATED PATH ESTIMATORS. James M. Calvin Proceedngs of the 007 Wnter Smulaton Conference S G Henderson, B Bller, M-H Hseh, J Shortle, J D Tew, and R R Barton, eds LOW BIAS INTEGRATED PATH ESTIMATORS James M Calvn Department of Computer Scence

More information

The Minimum Universal Cost Flow in an Infeasible Flow Network

The Minimum Universal Cost Flow in an Infeasible Flow Network Journal of Scences, Islamc Republc of Iran 17(2): 175-180 (2006) Unversty of Tehran, ISSN 1016-1104 http://jscencesutacr The Mnmum Unversal Cost Flow n an Infeasble Flow Network H Saleh Fathabad * M Bagheran

More information

Adaptive sliding mode reliable excitation control design for power systems

Adaptive sliding mode reliable excitation control design for power systems Acta Technca 6, No. 3B/17, 593 6 c 17 Insttute of Thermomechancs CAS, v.v.. Adaptve sldng mode relable exctaton control desgn for power systems Xuetng Lu 1, 3, Yanchao Yan Abstract. In ths paper, the problem

More information

Approximate optimal cooperative decentralized control for consensus in a topological network of agents with uncertain nonlinear dynamics

Approximate optimal cooperative decentralized control for consensus in a topological network of agents with uncertain nonlinear dynamics Approxmate optmal cooperatve decentralzed control for consensus n a topologcal network of agents wth uncertan nonlnear dynamcs Rushkesh Kamalapurkar, Huyen Dnh, Patrck Walters, and Warren Dxon arxv:34.3479v

More information

MMA and GCMMA two methods for nonlinear optimization

MMA and GCMMA two methods for nonlinear optimization MMA and GCMMA two methods for nonlnear optmzaton Krster Svanberg Optmzaton and Systems Theory, KTH, Stockholm, Sweden. krlle@math.kth.se Ths note descrbes the algorthms used n the author s 2007 mplementatons

More information

General viscosity iterative method for a sequence of quasi-nonexpansive mappings

General viscosity iterative method for a sequence of quasi-nonexpansive mappings Avalable onlne at www.tjnsa.com J. Nonlnear Sc. Appl. 9 (2016), 5672 5682 Research Artcle General vscosty teratve method for a sequence of quas-nonexpansve mappngs Cuje Zhang, Ynan Wang College of Scence,

More information

The Order Relation and Trace Inequalities for. Hermitian Operators

The Order Relation and Trace Inequalities for. Hermitian Operators Internatonal Mathematcal Forum, Vol 3, 08, no, 507-57 HIKARI Ltd, wwwm-hkarcom https://doorg/0988/mf088055 The Order Relaton and Trace Inequaltes for Hermtan Operators Y Huang School of Informaton Scence

More information

Interactive Bi-Level Multi-Objective Integer. Non-linear Programming Problem

Interactive Bi-Level Multi-Objective Integer. Non-linear Programming Problem Appled Mathematcal Scences Vol 5 0 no 65 3 33 Interactve B-Level Mult-Objectve Integer Non-lnear Programmng Problem O E Emam Department of Informaton Systems aculty of Computer Scence and nformaton Helwan

More information

Power law and dimension of the maximum value for belief distribution with the max Deng entropy

Power law and dimension of the maximum value for belief distribution with the max Deng entropy Power law and dmenson of the maxmum value for belef dstrbuton wth the max Deng entropy Bngy Kang a, a College of Informaton Engneerng, Northwest A&F Unversty, Yanglng, Shaanx, 712100, Chna. Abstract Deng

More information

Neuro-Adaptive Design II:

Neuro-Adaptive Design II: Lecture 37 Neuro-Adaptve Desgn II: A Robustfyng Tool for Any Desgn Dr. Radhakant Padh Asst. Professor Dept. of Aerospace Engneerng Indan Insttute of Scence - Bangalore Motvaton Perfect system modelng s

More information

FUZZY GOAL PROGRAMMING VS ORDINARY FUZZY PROGRAMMING APPROACH FOR MULTI OBJECTIVE PROGRAMMING PROBLEM

FUZZY GOAL PROGRAMMING VS ORDINARY FUZZY PROGRAMMING APPROACH FOR MULTI OBJECTIVE PROGRAMMING PROBLEM Internatonal Conference on Ceramcs, Bkaner, Inda Internatonal Journal of Modern Physcs: Conference Seres Vol. 22 (2013) 757 761 World Scentfc Publshng Company DOI: 10.1142/S2010194513010982 FUZZY GOAL

More information

Multigradient for Neural Networks for Equalizers 1

Multigradient for Neural Networks for Equalizers 1 Multgradent for Neural Netorks for Equalzers 1 Chulhee ee, Jnook Go and Heeyoung Km Department of Electrcal and Electronc Engneerng Yonse Unversty 134 Shnchon-Dong, Seodaemun-Ku, Seoul 1-749, Korea ABSTRACT

More information

NONCOOPERATIVE game theory [1] [3] can be used

NONCOOPERATIVE game theory [1] [3] can be used IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, VOL. 6, NO. 8, AUGUST 15 1645 Approxmate N-Player Nonzero-Sum Game Soluton for an Uncertan Contnuous Nonlnear System Marcus Johnson, Rushkesh

More information

A Local Variational Problem of Second Order for a Class of Optimal Control Problems with Nonsmooth Objective Function

A Local Variational Problem of Second Order for a Class of Optimal Control Problems with Nonsmooth Objective Function A Local Varatonal Problem of Second Order for a Class of Optmal Control Problems wth Nonsmooth Objectve Functon Alexander P. Afanasev Insttute for Informaton Transmsson Problems, Russan Academy of Scences,

More information

Control of Uncertain Bilinear Systems using Linear Controllers: Stability Region Estimation and Controller Design

Control of Uncertain Bilinear Systems using Linear Controllers: Stability Region Estimation and Controller Design Control of Uncertan Blnear Systems usng Lnear Controllers: Stablty Regon Estmaton Controller Desgn Shoudong Huang Department of Engneerng Australan Natonal Unversty Canberra, ACT 2, Australa shoudong.huang@anu.edu.au

More information

Yong Joon Ryang. 1. Introduction Consider the multicommodity transportation problem with convex quadratic cost function. 1 2 (x x0 ) T Q(x x 0 )

Yong Joon Ryang. 1. Introduction Consider the multicommodity transportation problem with convex quadratic cost function. 1 2 (x x0 ) T Q(x x 0 ) Kangweon-Kyungk Math. Jour. 4 1996), No. 1, pp. 7 16 AN ITERATIVE ROW-ACTION METHOD FOR MULTICOMMODITY TRANSPORTATION PROBLEMS Yong Joon Ryang Abstract. The optmzaton problems wth quadratc constrants often

More information

Erratum: A Generalized Path Integral Control Approach to Reinforcement Learning

Erratum: A Generalized Path Integral Control Approach to Reinforcement Learning Journal of Machne Learnng Research 00-9 Submtted /0; Publshed 7/ Erratum: A Generalzed Path Integral Control Approach to Renforcement Learnng Evangelos ATheodorou Jonas Buchl Stefan Schaal Department of

More information

Concurrent learning-based online approximate feedback-nash equilibrium solution of N-player nonzero-sum differential games

Concurrent learning-based online approximate feedback-nash equilibrium solution of N-player nonzero-sum differential games Concurrent learnng-based onlne approxmate feedbac-nash equlbrum soluton of N-player nonzero-sum dfferental games Rushesh Kamalapurar, Justn Klotz, and arren E. Dxon arxv:30.38v [cs.sy] Oct 03 Abstract

More information

A Fast Computer Aided Design Method for Filters

A Fast Computer Aided Design Method for Filters 2017 Asa-Pacfc Engneerng and Technology Conference (APETC 2017) ISBN: 978-1-60595-443-1 A Fast Computer Aded Desgn Method for Flters Gang L ABSTRACT *Ths paper presents a fast computer aded desgn method

More information

Kernel Methods and SVMs Extension

Kernel Methods and SVMs Extension Kernel Methods and SVMs Extenson The purpose of ths document s to revew materal covered n Machne Learnng 1 Supervsed Learnng regardng support vector machnes (SVMs). Ths document also provdes a general

More information

A Hybrid Variational Iteration Method for Blasius Equation

A Hybrid Variational Iteration Method for Blasius Equation Avalable at http://pvamu.edu/aam Appl. Appl. Math. ISSN: 1932-9466 Vol. 10, Issue 1 (June 2015), pp. 223-229 Applcatons and Appled Mathematcs: An Internatonal Journal (AAM) A Hybrd Varatonal Iteraton Method

More information

Lecture Notes on Linear Regression

Lecture Notes on Linear Regression Lecture Notes on Lnear Regresson Feng L fl@sdueducn Shandong Unversty, Chna Lnear Regresson Problem In regresson problem, we am at predct a contnuous target value gven an nput feature vector We assume

More information

Stability analysis for class of switched nonlinear systems

Stability analysis for class of switched nonlinear systems Stablty analyss for class of swtched nonlnear systems The MIT Faculty has made ths artcle openly avalable. Please share how ths access benefts you. Your story matters. Ctaton As Publshed Publsher Shaker,

More information

Chapter 5. Solution of System of Linear Equations. Module No. 6. Solution of Inconsistent and Ill Conditioned Systems

Chapter 5. Solution of System of Linear Equations. Module No. 6. Solution of Inconsistent and Ill Conditioned Systems Numercal Analyss by Dr. Anta Pal Assstant Professor Department of Mathematcs Natonal Insttute of Technology Durgapur Durgapur-713209 emal: anta.bue@gmal.com 1 . Chapter 5 Soluton of System of Lnear Equatons

More information

Errors for Linear Systems

Errors for Linear Systems Errors for Lnear Systems When we solve a lnear system Ax b we often do not know A and b exactly, but have only approxmatons  and ˆb avalable. Then the best thng we can do s to solve ˆx ˆb exactly whch

More information

PARTICIPATION FACTOR IN MODAL ANALYSIS OF POWER SYSTEMS STABILITY

PARTICIPATION FACTOR IN MODAL ANALYSIS OF POWER SYSTEMS STABILITY POZNAN UNIVE RSITY OF TE CHNOLOGY ACADE MIC JOURNALS No 86 Electrcal Engneerng 6 Volodymyr KONOVAL* Roman PRYTULA** PARTICIPATION FACTOR IN MODAL ANALYSIS OF POWER SYSTEMS STABILITY Ths paper provdes a

More information

CHAPTER III Neural Networks as Associative Memory

CHAPTER III Neural Networks as Associative Memory CHAPTER III Neural Networs as Assocatve Memory Introducton One of the prmary functons of the bran s assocatve memory. We assocate the faces wth names, letters wth sounds, or we can recognze the people

More information

Linear Approximation with Regularization and Moving Least Squares

Linear Approximation with Regularization and Moving Least Squares Lnear Approxmaton wth Regularzaton and Movng Least Squares Igor Grešovn May 007 Revson 4.6 (Revson : March 004). 5 4 3 0.5 3 3.5 4 Contents: Lnear Fttng...4. Weghted Least Squares n Functon Approxmaton...

More information

Time-Varying Systems and Computations Lecture 6

Time-Varying Systems and Computations Lecture 6 Tme-Varyng Systems and Computatons Lecture 6 Klaus Depold 14. Januar 2014 The Kalman Flter The Kalman estmaton flter attempts to estmate the actual state of an unknown dscrete dynamcal system, gven nosy

More information

Adaptive Consensus Control of Multi-Agent Systems with Large Uncertainty and Time Delays *

Adaptive Consensus Control of Multi-Agent Systems with Large Uncertainty and Time Delays * Journal of Robotcs, etworkng and Artfcal Lfe, Vol., o. (September 04), 5-9 Adaptve Consensus Control of Mult-Agent Systems wth Large Uncertanty and me Delays * L Lu School of Mechancal Engneerng Unversty

More information

Additional Codes using Finite Difference Method. 1 HJB Equation for Consumption-Saving Problem Without Uncertainty

Additional Codes using Finite Difference Method. 1 HJB Equation for Consumption-Saving Problem Without Uncertainty Addtonal Codes usng Fnte Dfference Method Benamn Moll 1 HJB Equaton for Consumpton-Savng Problem Wthout Uncertanty Before consderng the case wth stochastc ncome n http://www.prnceton.edu/~moll/ HACTproect/HACT_Numercal_Appendx.pdf,

More information

CLASSICAL optimal control problems are formulated in

CLASSICAL optimal control problems are formulated in IEEE/CAA JOURNAL OF AUOMAICA SINICA, VOL., NO. 3, JULY 39 Concurrent Learnng-based Approxmate Feedbac-Nash Equlbrum Soluton of N-player Nonzero-sum Dfferental Games Rushesh Kamalapurar Justn R. Klotz arren

More information

The lower and upper bounds on Perron root of nonnegative irreducible matrices

The lower and upper bounds on Perron root of nonnegative irreducible matrices Journal of Computatonal Appled Mathematcs 217 (2008) 259 267 wwwelsevercom/locate/cam The lower upper bounds on Perron root of nonnegatve rreducble matrces Guang-Xn Huang a,, Feng Yn b,keguo a a College

More information

Parameter Estimation for Dynamic System using Unscented Kalman filter

Parameter Estimation for Dynamic System using Unscented Kalman filter Parameter Estmaton for Dynamc System usng Unscented Kalman flter Jhoon Seung 1,a, Amr Atya F. 2,b, Alexander G.Parlos 3,c, and Klto Chong 1,4,d* 1 Dvson of Electroncs Engneerng, Chonbuk Natonal Unversty,

More information

Controller Design of High Order Nonholonomic System with Nonlinear Drifts

Controller Design of High Order Nonholonomic System with Nonlinear Drifts Internatonal Journal of Automaton and Computng 6(3, August 9, 4-44 DOI:.7/s633-9-4- Controller Desgn of Hgh Order Nonholonomc System wth Nonlnear Drfts Xu-Yun Zheng Yu-Qang Wu Research Insttute of Automaton,

More information

A Fuzzy-Neural Adaptive Iterative Learning Control for Freeway Traffic Flow Systems

A Fuzzy-Neural Adaptive Iterative Learning Control for Freeway Traffic Flow Systems Proceedngs of the Internatonal MultConference of Engneers and Computer Scentsts 016 Vol I, IMECS 016, March 16-18, 016, Hong Kong A Fuzzy-Neural Adaptve Iteratve Learnng Control for Freeway Traffc Flow

More information

Supporting Information

Supporting Information Supportng Informaton The neural network f n Eq. 1 s gven by: f x l = ReLU W atom x l + b atom, 2 where ReLU s the element-wse rectfed lnear unt, 21.e., ReLUx = max0, x, W atom R d d s the weght matrx to

More information

Convexity preserving interpolation by splines of arbitrary degree

Convexity preserving interpolation by splines of arbitrary degree Computer Scence Journal of Moldova, vol.18, no.1(52), 2010 Convexty preservng nterpolaton by splnes of arbtrary degree Igor Verlan Abstract In the present paper an algorthm of C 2 nterpolaton of dscrete

More information

Appendix B. The Finite Difference Scheme

Appendix B. The Finite Difference Scheme 140 APPENDIXES Appendx B. The Fnte Dfference Scheme In ths appendx we present numercal technques whch are used to approxmate solutons of system 3.1 3.3. A comprehensve treatment of theoretcal and mplementaton

More information

Lecture 10 Support Vector Machines II

Lecture 10 Support Vector Machines II Lecture 10 Support Vector Machnes II 22 February 2016 Taylor B. Arnold Yale Statstcs STAT 365/665 1/28 Notes: Problem 3 s posted and due ths upcomng Frday There was an early bug n the fake-test data; fxed

More information

College of Computer & Information Science Fall 2009 Northeastern University 20 October 2009

College of Computer & Information Science Fall 2009 Northeastern University 20 October 2009 College of Computer & Informaton Scence Fall 2009 Northeastern Unversty 20 October 2009 CS7880: Algorthmc Power Tools Scrbe: Jan Wen and Laura Poplawsk Lecture Outlne: Prmal-dual schema Network Desgn:

More information

Lecture 12: Discrete Laplacian

Lecture 12: Discrete Laplacian Lecture 12: Dscrete Laplacan Scrbe: Tanye Lu Our goal s to come up wth a dscrete verson of Laplacan operator for trangulated surfaces, so that we can use t n practce to solve related problems We are mostly

More information

Feature Selection: Part 1

Feature Selection: Part 1 CSE 546: Machne Learnng Lecture 5 Feature Selecton: Part 1 Instructor: Sham Kakade 1 Regresson n the hgh dmensonal settng How do we learn when the number of features d s greater than the sample sze n?

More information

On a direct solver for linear least squares problems

On a direct solver for linear least squares problems ISSN 2066-6594 Ann. Acad. Rom. Sc. Ser. Math. Appl. Vol. 8, No. 2/2016 On a drect solver for lnear least squares problems Constantn Popa Abstract The Null Space (NS) algorthm s a drect solver for lnear

More information

1 Convex Optimization

1 Convex Optimization Convex Optmzaton We wll consder convex optmzaton problems. Namely, mnmzaton problems where the objectve s convex (we assume no constrants for now). Such problems often arse n machne learnng. For example,

More information

The Synchronous 8th-Order Differential Attack on 12 Rounds of the Block Cipher HyRAL

The Synchronous 8th-Order Differential Attack on 12 Rounds of the Block Cipher HyRAL The Synchronous 8th-Order Dfferental Attack on 12 Rounds of the Block Cpher HyRAL Yasutaka Igarash, Sej Fukushma, and Tomohro Hachno Kagoshma Unversty, Kagoshma, Japan Emal: {garash, fukushma, hachno}@eee.kagoshma-u.ac.jp

More information

Zhongping Jiang Tengfei Liu

Zhongping Jiang Tengfei Liu Zhongpng Jang Tengfe Lu 4 F.L. Lews, NAI Moncref-O Donnell Char, UTA Research Insttute (UTARI) The Unversty of Texas at Arlngton, USA and Qan Ren Consultng Professor, State Key Laboratory of Synthetcal

More information

Some modelling aspects for the Matlab implementation of MMA

Some modelling aspects for the Matlab implementation of MMA Some modellng aspects for the Matlab mplementaton of MMA Krster Svanberg krlle@math.kth.se Optmzaton and Systems Theory Department of Mathematcs KTH, SE 10044 Stockholm September 2004 1. Consdered optmzaton

More information

STAT 309: MATHEMATICAL COMPUTATIONS I FALL 2018 LECTURE 16

STAT 309: MATHEMATICAL COMPUTATIONS I FALL 2018 LECTURE 16 STAT 39: MATHEMATICAL COMPUTATIONS I FALL 218 LECTURE 16 1 why teratve methods f we have a lnear system Ax = b where A s very, very large but s ether sparse or structured (eg, banded, Toepltz, banded plus

More information

Chapter 2 A Class of Robust Solution for Linear Bilevel Programming

Chapter 2 A Class of Robust Solution for Linear Bilevel Programming Chapter 2 A Class of Robust Soluton for Lnear Blevel Programmng Bo Lu, Bo L and Yan L Abstract Under the way of the centralzed decson-makng, the lnear b-level programmng (BLP) whose coeffcents are supposed

More information

Module 3 LOSSY IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur

Module 3 LOSSY IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur Module 3 LOSSY IMAGE COMPRESSION SYSTEMS Verson ECE IIT, Kharagpur Lesson 6 Theory of Quantzaton Verson ECE IIT, Kharagpur Instructonal Objectves At the end of ths lesson, the students should be able to:

More information

APPENDIX A Some Linear Algebra

APPENDIX A Some Linear Algebra APPENDIX A Some Lnear Algebra The collecton of m, n matrces A.1 Matrces a 1,1,..., a 1,n A = a m,1,..., a m,n wth real elements a,j s denoted by R m,n. If n = 1 then A s called a column vector. Smlarly,

More information

Solving Nonlinear Differential Equations by a Neural Network Method

Solving Nonlinear Differential Equations by a Neural Network Method Solvng Nonlnear Dfferental Equatons by a Neural Network Method Luce P. Aarts and Peter Van der Veer Delft Unversty of Technology, Faculty of Cvlengneerng and Geoscences, Secton of Cvlengneerng Informatcs,

More information

For now, let us focus on a specific model of neurons. These are simplified from reality but can achieve remarkable results.

For now, let us focus on a specific model of neurons. These are simplified from reality but can achieve remarkable results. Neural Networks : Dervaton compled by Alvn Wan from Professor Jtendra Malk s lecture Ths type of computaton s called deep learnng and s the most popular method for many problems, such as computer vson

More information

DETERMINATION OF TEMPERATURE DISTRIBUTION FOR ANNULAR FINS WITH TEMPERATURE DEPENDENT THERMAL CONDUCTIVITY BY HPM

DETERMINATION OF TEMPERATURE DISTRIBUTION FOR ANNULAR FINS WITH TEMPERATURE DEPENDENT THERMAL CONDUCTIVITY BY HPM Ganj, Z. Z., et al.: Determnaton of Temperature Dstrbuton for S111 DETERMINATION OF TEMPERATURE DISTRIBUTION FOR ANNULAR FINS WITH TEMPERATURE DEPENDENT THERMAL CONDUCTIVITY BY HPM by Davood Domr GANJI

More information

On the Multicriteria Integer Network Flow Problem

On the Multicriteria Integer Network Flow Problem BULGARIAN ACADEMY OF SCIENCES CYBERNETICS AND INFORMATION TECHNOLOGIES Volume 5, No 2 Sofa 2005 On the Multcrtera Integer Network Flow Problem Vassl Vasslev, Marana Nkolova, Maryana Vassleva Insttute of

More information

A New Evolutionary Computation Based Approach for Learning Bayesian Network

A New Evolutionary Computation Based Approach for Learning Bayesian Network Avalable onlne at www.scencedrect.com Proceda Engneerng 15 (2011) 4026 4030 Advanced n Control Engneerng and Informaton Scence A New Evolutonary Computaton Based Approach for Learnng Bayesan Network Yungang

More information

DUE: WEDS FEB 21ST 2018

DUE: WEDS FEB 21ST 2018 HOMEWORK # 1: FINITE DIFFERENCES IN ONE DIMENSION DUE: WEDS FEB 21ST 2018 1. Theory Beam bendng s a classcal engneerng analyss. The tradtonal soluton technque makes smplfyng assumptons such as a constant

More information

Natural Language Processing and Information Retrieval

Natural Language Processing and Information Retrieval Natural Language Processng and Informaton Retreval Support Vector Machnes Alessandro Moschtt Department of nformaton and communcaton technology Unversty of Trento Emal: moschtt@ds.untn.t Summary Support

More information

An identification algorithm of model kinetic parameters of the interfacial layer growth in fiber composites

An identification algorithm of model kinetic parameters of the interfacial layer growth in fiber composites IOP Conference Seres: Materals Scence and Engneerng PAPER OPE ACCESS An dentfcaton algorthm of model knetc parameters of the nterfacal layer growth n fber compostes o cte ths artcle: V Zubov et al 216

More information

NON-CENTRAL 7-POINT FORMULA IN THE METHOD OF LINES FOR PARABOLIC AND BURGERS' EQUATIONS

NON-CENTRAL 7-POINT FORMULA IN THE METHOD OF LINES FOR PARABOLIC AND BURGERS' EQUATIONS IJRRAS 8 (3 September 011 www.arpapress.com/volumes/vol8issue3/ijrras_8_3_08.pdf NON-CENTRAL 7-POINT FORMULA IN THE METHOD OF LINES FOR PARABOLIC AND BURGERS' EQUATIONS H.O. Bakodah Dept. of Mathematc

More information

Hongyi Miao, College of Science, Nanjing Forestry University, Nanjing ,China. (Received 20 June 2013, accepted 11 March 2014) I)ϕ (k)

Hongyi Miao, College of Science, Nanjing Forestry University, Nanjing ,China. (Received 20 June 2013, accepted 11 March 2014) I)ϕ (k) ISSN 1749-3889 (prnt), 1749-3897 (onlne) Internatonal Journal of Nonlnear Scence Vol.17(2014) No.2,pp.188-192 Modfed Block Jacob-Davdson Method for Solvng Large Sparse Egenproblems Hongy Mao, College of

More information

Asymptotics of the Solution of a Boundary Value. Problem for One-Characteristic Differential. Equation Degenerating into a Parabolic Equation

Asymptotics of the Solution of a Boundary Value. Problem for One-Characteristic Differential. Equation Degenerating into a Parabolic Equation Nonl. Analyss and Dfferental Equatons, ol., 4, no., 5 - HIKARI Ltd, www.m-har.com http://dx.do.org/.988/nade.4.456 Asymptotcs of the Soluton of a Boundary alue Problem for One-Characterstc Dfferental Equaton

More information

IV. Performance Optimization

IV. Performance Optimization IV. Performance Optmzaton A. Steepest descent algorthm defnton how to set up bounds on learnng rate mnmzaton n a lne (varyng learnng rate) momentum learnng examples B. Newton s method defnton Gauss-Newton

More information

Support Vector Machines. Vibhav Gogate The University of Texas at dallas

Support Vector Machines. Vibhav Gogate The University of Texas at dallas Support Vector Machnes Vbhav Gogate he Unversty of exas at dallas What We have Learned So Far? 1. Decson rees. Naïve Bayes 3. Lnear Regresson 4. Logstc Regresson 5. Perceptron 6. Neural networks 7. K-Nearest

More information

A MODIFIED METHOD FOR SOLVING SYSTEM OF NONLINEAR EQUATIONS

A MODIFIED METHOD FOR SOLVING SYSTEM OF NONLINEAR EQUATIONS Journal of Mathematcs and Statstcs 9 (1): 4-8, 1 ISSN 1549-644 1 Scence Publcatons do:1.844/jmssp.1.4.8 Publshed Onlne 9 (1) 1 (http://www.thescpub.com/jmss.toc) A MODIFIED METHOD FOR SOLVING SYSTEM OF

More information

Perfect Competition and the Nash Bargaining Solution

Perfect Competition and the Nash Bargaining Solution Perfect Competton and the Nash Barganng Soluton Renhard John Department of Economcs Unversty of Bonn Adenauerallee 24-42 53113 Bonn, Germany emal: rohn@un-bonn.de May 2005 Abstract For a lnear exchange

More information

Report on Image warping

Report on Image warping Report on Image warpng Xuan Ne, Dec. 20, 2004 Ths document summarzed the algorthms of our mage warpng soluton for further study, and there s a detaled descrpton about the mplementaton of these algorthms.

More information

Design and Optimization of Fuzzy Controller for Inverse Pendulum System Using Genetic Algorithm

Design and Optimization of Fuzzy Controller for Inverse Pendulum System Using Genetic Algorithm Desgn and Optmzaton of Fuzzy Controller for Inverse Pendulum System Usng Genetc Algorthm H. Mehraban A. Ashoor Unversty of Tehran Unversty of Tehran h.mehraban@ece.ut.ac.r a.ashoor@ece.ut.ac.r Abstract:

More information

Application of B-Spline to Numerical Solution of a System of Singularly Perturbed Problems

Application of B-Spline to Numerical Solution of a System of Singularly Perturbed Problems Mathematca Aeterna, Vol. 1, 011, no. 06, 405 415 Applcaton of B-Splne to Numercal Soluton of a System of Sngularly Perturbed Problems Yogesh Gupta Department of Mathematcs Unted College of Engneerng &

More information

Estimating the Fundamental Matrix by Transforming Image Points in Projective Space 1

Estimating the Fundamental Matrix by Transforming Image Points in Projective Space 1 Estmatng the Fundamental Matrx by Transformng Image Ponts n Projectve Space 1 Zhengyou Zhang and Charles Loop Mcrosoft Research, One Mcrosoft Way, Redmond, WA 98052, USA E-mal: fzhang,cloopg@mcrosoft.com

More information

Complement of Type-2 Fuzzy Shortest Path Using Possibility Measure

Complement of Type-2 Fuzzy Shortest Path Using Possibility Measure Intern. J. Fuzzy Mathematcal rchve Vol. 5, No., 04, 9-7 ISSN: 30 34 (P, 30 350 (onlne Publshed on 5 November 04 www.researchmathsc.org Internatonal Journal of Complement of Type- Fuzzy Shortest Path Usng

More information

Generalized Linear Methods

Generalized Linear Methods Generalzed Lnear Methods 1 Introducton In the Ensemble Methods the general dea s that usng a combnaton of several weak learner one could make a better learner. More formally, assume that we have a set

More information

Stabilizing decentralized model predictive control of nonlinear systems

Stabilizing decentralized model predictive control of nonlinear systems Automatca 42 (26) 23 236 www.elsever.com/locate/automatca Techncal communque Stablzng decentralzed model predctve control of nonlnear systems L. Magn a,, R. Scattoln b a Dpartmento d Informatca e Sstemstca,

More information

Lyapunov-Razumikhin and Lyapunov-Krasovskii theorems for interconnected ISS time-delay systems

Lyapunov-Razumikhin and Lyapunov-Krasovskii theorems for interconnected ISS time-delay systems Proceedngs of the 19th Internatonal Symposum on Mathematcal Theory of Networks and Systems MTNS 2010 5 9 July 2010 Budapest Hungary Lyapunov-Razumkhn and Lyapunov-Krasovsk theorems for nterconnected ISS

More information

On an Extension of Stochastic Approximation EM Algorithm for Incomplete Data Problems. Vahid Tadayon 1

On an Extension of Stochastic Approximation EM Algorithm for Incomplete Data Problems. Vahid Tadayon 1 On an Extenson of Stochastc Approxmaton EM Algorthm for Incomplete Data Problems Vahd Tadayon Abstract: The Stochastc Approxmaton EM (SAEM algorthm, a varant stochastc approxmaton of EM, s a versatle tool

More information

Controller Design for Networked Control Systems in Multiple-packet Transmission with Random Delays

Controller Design for Networked Control Systems in Multiple-packet Transmission with Random Delays Appled Mehans and Materals Onlne: 03-0- ISSN: 66-748, Vols. 78-80, pp 60-604 do:0.408/www.sentf.net/amm.78-80.60 03 rans eh Publatons, Swtzerland H Controller Desgn for Networed Control Systems n Multple-paet

More information

princeton univ. F 17 cos 521: Advanced Algorithm Design Lecture 7: LP Duality Lecturer: Matt Weinberg

princeton univ. F 17 cos 521: Advanced Algorithm Design Lecture 7: LP Duality Lecturer: Matt Weinberg prnceton unv. F 17 cos 521: Advanced Algorthm Desgn Lecture 7: LP Dualty Lecturer: Matt Wenberg Scrbe: LP Dualty s an extremely useful tool for analyzng structural propertes of lnear programs. Whle there

More information

Solutions HW #2. minimize. Ax = b. Give the dual problem, and make the implicit equality constraints explicit. Solution.

Solutions HW #2. minimize. Ax = b. Give the dual problem, and make the implicit equality constraints explicit. Solution. Solutons HW #2 Dual of general LP. Fnd the dual functon of the LP mnmze subject to c T x Gx h Ax = b. Gve the dual problem, and make the mplct equalty constrants explct. Soluton. 1. The Lagrangan s L(x,

More information

4DVAR, according to the name, is a four-dimensional variational method.

4DVAR, according to the name, is a four-dimensional variational method. 4D-Varatonal Data Assmlaton (4D-Var) 4DVAR, accordng to the name, s a four-dmensonal varatonal method. 4D-Var s actually a drect generalzaton of 3D-Var to handle observatons that are dstrbuted n tme. The

More information

Uniqueness of Weak Solutions to the 3D Ginzburg- Landau Model for Superconductivity

Uniqueness of Weak Solutions to the 3D Ginzburg- Landau Model for Superconductivity Int. Journal of Math. Analyss, Vol. 6, 212, no. 22, 195-114 Unqueness of Weak Solutons to the 3D Gnzburg- Landau Model for Superconductvty Jshan Fan Department of Appled Mathematcs Nanjng Forestry Unversty

More information

ISSN: ISO 9001:2008 Certified International Journal of Engineering and Innovative Technology (IJEIT) Volume 3, Issue 1, July 2013

ISSN: ISO 9001:2008 Certified International Journal of Engineering and Innovative Technology (IJEIT) Volume 3, Issue 1, July 2013 ISSN: 2277-375 Constructon of Trend Free Run Orders for Orthogonal rrays Usng Codes bstract: Sometmes when the expermental runs are carred out n a tme order sequence, the response can depend on the run

More information

Deriving the X-Z Identity from Auxiliary Space Method

Deriving the X-Z Identity from Auxiliary Space Method Dervng the X-Z Identty from Auxlary Space Method Long Chen Department of Mathematcs, Unversty of Calforna at Irvne, Irvne, CA 92697 chenlong@math.uc.edu 1 Iteratve Methods In ths paper we dscuss teratve

More information

Chapter - 2. Distribution System Power Flow Analysis

Chapter - 2. Distribution System Power Flow Analysis Chapter - 2 Dstrbuton System Power Flow Analyss CHAPTER - 2 Radal Dstrbuton System Load Flow 2.1 Introducton Load flow s an mportant tool [66] for analyzng electrcal power system network performance. Load

More information

Online Classification: Perceptron and Winnow

Online Classification: Perceptron and Winnow E0 370 Statstcal Learnng Theory Lecture 18 Nov 8, 011 Onlne Classfcaton: Perceptron and Wnnow Lecturer: Shvan Agarwal Scrbe: Shvan Agarwal 1 Introducton In ths lecture we wll start to study the onlne learnng

More information

Study on Active Micro-vibration Isolation System with Linear Motor Actuator. Gong-yu PAN, Wen-yan GU and Dong LI

Study on Active Micro-vibration Isolation System with Linear Motor Actuator. Gong-yu PAN, Wen-yan GU and Dong LI 2017 2nd Internatonal Conference on Electrcal and Electroncs: echnques and Applcatons (EEA 2017) ISBN: 978-1-60595-416-5 Study on Actve Mcro-vbraton Isolaton System wth Lnear Motor Actuator Gong-yu PAN,

More information

Decentralized Adaptive Regulation for Nonlinear Systems with iiss Inverse Dynamics

Decentralized Adaptive Regulation for Nonlinear Systems with iiss Inverse Dynamics Vol. 3, No. ACTA AUTOMATICA SINICA February, 8 Decentralzed Adaptve Regulaton for Nonlnear Systems wth ISS Inverse Dynamcs YAN Xue-Hua, XIE Xue-Jun, LIU Ha-Kuan Abstract Ths paper consders the decentralzed

More information

The Quadratic Trigonometric Bézier Curve with Single Shape Parameter

The Quadratic Trigonometric Bézier Curve with Single Shape Parameter J. Basc. Appl. Sc. Res., (3541-546, 01 01, TextRoad Publcaton ISSN 090-4304 Journal of Basc and Appled Scentfc Research www.textroad.com The Quadratc Trgonometrc Bézer Curve wth Sngle Shape Parameter Uzma

More information

The Expectation-Maximization Algorithm

The Expectation-Maximization Algorithm The Expectaton-Maxmaton Algorthm Charles Elan elan@cs.ucsd.edu November 16, 2007 Ths chapter explans the EM algorthm at multple levels of generalty. Secton 1 gves the standard hgh-level verson of the algorthm.

More information

Research Article Green s Theorem for Sign Data

Research Article Green s Theorem for Sign Data Internatonal Scholarly Research Network ISRN Appled Mathematcs Volume 2012, Artcle ID 539359, 10 pages do:10.5402/2012/539359 Research Artcle Green s Theorem for Sgn Data Lous M. Houston The Unversty of

More information