Research Article An Output-Recurrent-Neural-Network-Based Iterative Learning Control for Unknown Nonlinear Dynamic Plants

Similar documents
A Discrete Robust Adaptive Iterative Learning Control for a Class of Nonlinear Systems with Unknown Control Direction

1348 IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS PART B: CYBERNETICS, VOL. 34, NO. 3, JUNE 2004

Fuzzy control of a class of multivariable nonlinear systems subject to parameter uncertainties: model reference approach

Research Article Weather Forecasting Using Sliding Window Algorithm

Research Article Convex Polyhedron Method to Stability of Continuous Systems with Two Additive Time-Varying Delay Components

On the Current Error Based Sampled-data Iterative Learning Control with Reduced Memory Capacity

ARTIFICIAL NEURAL NETWORK WITH HYBRID TAGUCHI-GENETIC ALGORITHM FOR NONLINEAR MIMO MODEL OF MACHINING PROCESSES

Adaptive Predictive Observer Design for Class of Uncertain Nonlinear Systems with Bounded Disturbance

Research Article Partial Pole Placement in LMI Region

Research Article Calculation for Primary Combustion Characteristics of Boron-Based Fuel-Rich Propellant Based on BP Neural Network

Research Article The Application of Baum-Welch Algorithm in Multistep Attack

Correspondence should be addressed to Chien-Yu Lu,

A Sliding Mode Control based on Nonlinear Disturbance Observer for the Mobile Manipulator

Research Article Doppler Velocity Estimation of Overlapping Linear-Period-Modulated Ultrasonic Waves Based on an Expectation-Maximization Algorithm

Research Article Mean Square Stability of Impulsive Stochastic Differential Systems

IN THIS PAPER, we consider a class of continuous-time recurrent

APPLICATION OF RADIAL BASIS FUNCTION NEURAL NETWORK, TO ESTIMATE THE STATE OF HEALTH FOR LFP BATTERY

Research Article Simplified Robotics Joint-Space Trajectory Generation with a via Point Using a Single Polynomial

Artificial Neural Network : Training

Observer-based sampled-data controller of linear system for the wave energy converter

An Iteration-Domain Filter for Controlling Transient Growth in Iterative Learning Control

Research Article Design of PDC Controllers by Matrix Reversibility for Synchronization of Yin and Yang Chaotic Takagi-Sugeno Fuzzy Henon Maps

Initial condition issues on iterative learning control for non-linear systems with time delay

An artificial neural networks (ANNs) model is a functional abstraction of the

Research Article Existence of Periodic Positive Solutions for Abstract Difference Equations

Information Exchange in Multi-rover SLAM

Research Article Travel-Time Difference Extracting in Experimental Study of Rayleigh Wave Acoustoelastic Effect

An Approach of Robust Iterative Learning Control for Uncertain Systems

ADAPTIVE INVERSE CONTROL BASED ON NONLINEAR ADAPTIVE FILTERING. Information Systems Lab., EE Dep., Stanford University

Synchronization of Chaotic Systems via Active Disturbance Rejection Control

Prediction-based adaptive control of a class of discrete-time nonlinear systems with nonlinear growth rate

Neural Network Control of Robot Manipulators and Nonlinear Systems

Distributed Adaptive Synchronization of Complex Dynamical Network with Unknown Time-varying Weights

An Adaptive Iterative Learning Control for Robot Manipulator in Task Space

Takagi Sugeno Fuzzy Sliding Mode Controller Design for a Class of Nonlinear System

Results on stability of linear systems with time varying delay

GAIN SCHEDULING CONTROL WITH MULTI-LOOP PID FOR 2- DOF ARM ROBOT TRAJECTORY CONTROL

Lyapunov Stability of Linear Predictor Feedback for Distributed Input Delays

RECENTLY, many artificial neural networks especially

Robust Observer for Uncertain T S model of a Synchronous Machine

Research Article Technical Note on Q, r, L Inventory Model with Defective Items

Design and Stability Analysis of Single-Input Fuzzy Logic Controller

Research Letter An Algorithm to Generate Representations of System Identification Errors

Research Article Mathematical Model and Cluster Synchronization for a Complex Dynamical Network with Two Types of Chaotic Oscillators

LECTURE # - NEURAL COMPUTATION, Feb 04, Linear Regression. x 1 θ 1 output... θ M x M. Assumes a functional form

ECE Introduction to Artificial Neural Network and Fuzzy Systems

Research Article The Microphone Feedback Analogy for Chatter in Machining

Research Article New Oscillation Criteria for Second-Order Neutral Delay Differential Equations with Positive and Negative Coefficients

H-infinity Model Reference Controller Design for Magnetic Levitation System

Research Article Energy Reduction with Anticontrol of Chaos for Nonholonomic Mobile Robot System

Gain Scheduling Control with Multi-loop PID for 2-DOF Arm Robot Trajectory Control

A FUZZY NEURAL NETWORK MODEL FOR FORECASTING STOCK PRICE

Research Article An Analysis of the Quality of Repeated Plate Load Tests Using the Harmony Search Algorithm

Chaos suppression of uncertain gyros in a given finite time

STOCHASTIC STABILITY FOR MODEL-BASED NETWORKED CONTROL SYSTEMS

OVER THE past 20 years, the control of mobile robots has

Research Article Emissivity Measurement of Semitransparent Textiles

Adaptive Control of a Class of Nonlinear Systems with Nonlinearly Parameterized Fuzzy Approximators

Research Article GA-Based Fuzzy Sliding Mode Controller for Nonlinear Systems

Research Article Propagation Characteristics of Oblique Incident Terahertz Wave in Nonuniform Dusty Plasma

Simultaneous state and input estimation with partial information on the inputs

Adaptive Robust Tracking Control of Robot Manipulators in the Task-space under Uncertainties

The Design of Sliding Mode Controller with Perturbation Estimator Using Observer-Based Fuzzy Adaptive Network

AFAULT diagnosis procedure is typically divided into three

Research Article A New Type of Magnetic Actuator Capable of Wall-Climbing Movement Using Inertia Force

( t) Identification and Control of a Nonlinear Bioreactor Plant Using Classical and Dynamical Neural Networks

Research Article Influence of the Parameterization in the Interval Solution of Elastic Beams

A New Approach for Solving Dual Fuzzy Nonlinear Equations Using Broyden's and Newton's Methods

Static Output Feedback Controller for Nonlinear Interconnected Systems: Fuzzy Logic Approach

Pattern Classification

CONTROL OF ROBOT CAMERA SYSTEM WITH ACTUATOR S DYNAMICS TO TRACK MOVING OBJECT

Serious limitations of (single-layer) perceptrons: Cannot learn non-linearly separable tasks. Cannot approximate (learn) non-linear functions

PATTERN RECOGNITION FOR PARTIAL DISCHARGE DIAGNOSIS OF POWER TRANSFORMER

A DELAY-DEPENDENT APPROACH TO DESIGN STATE ESTIMATOR FOR DISCRETE STOCHASTIC RECURRENT NEURAL NETWORK WITH INTERVAL TIME-VARYING DELAYS

This is the published version.

Research Article Identifying a Global Optimizer with Filled Function for Nonlinear Integer Programming

ARTIFICIAL NEURAL NETWORK PART I HANIEH BORHANAZAD

Temperature control using neuro-fuzzy controllers with compensatory operations and wavelet neural networks

CHATTERING-FREE SMC WITH UNIDIRECTIONAL AUXILIARY SURFACES FOR NONLINEAR SYSTEM WITH STATE CONSTRAINTS. Jian Fu, Qing-Xian Wu and Ze-Hui Mao

Acceleration of Levenberg-Marquardt method training of chaotic systems fuzzy modeling

Using Neural Networks for Identification and Control of Systems

Research Article Modified T-F Function Method for Finding Global Minimizer on Unconstrained Optimization

Available online at ScienceDirect. Procedia Computer Science 22 (2013 )

Research Article A Two-Step Matrix-Free Secant Method for Solving Large-Scale Systems of Nonlinear Equations

A TSK-Type Quantum Neural Fuzzy Network for Temperature Control

Research Article Adaptive Neural Gradient Descent Control for a Class of Nonlinear Dynamic Systems with Chaotic Phenomenon

Research Article On the Security of a Novel Probabilistic Signature Based on Bilinear Square Diffie-Hellman Problem and Its Extension

On Node-Fault-Injection Training of an RBF Network

Research Article On the Stability Property of the Infection-Free Equilibrium of a Viral Infection Model

Adaptive iterative learning control for robot manipulators: Experimental results $

International Journal of Innovative Research in Advanced Engineering (IJIRAE) ISSN: Issue 12, Volume 4 (December 2017)

Power flow analysis by Artificial Neural Network

Repetitive control mechanism of disturbance rejection using basis function feedback with fuzzy regression approach

Research Article A New Fractional Integral Inequality with Singularity and Its Application

Analysis of Adaptation Law of the Robust Evolving Cloud-based Controller

Parameterized Joint Densities with Gaussian Mixture Marginals and their Potential Use in Nonlinear Robust Estimation

Research Article Band Structure Engineering in 2D Photonic Crystal Waveguide with Rhombic Cross-Section Elements

Short Term Load Forecasting Based Artificial Neural Network

Application of Artificial Neural Networks in Evaluation and Identification of Electrical Loss in Transformers According to the Energy Consumption

Observer Based Friction Cancellation in Mechanical Systems

Transcription:

Journal of Control Science and Engineering Volume 2012, Article D 545731, 9 pages doi:10.1155/2012/545731 Research Article An Output-Recurrent-Neural-Networ-Based terative Learning Control for Unnown Nonlinear Dynamic Plants Ying-Chung Wang and Chiang-Ju Chien Department of Electronic Engineering, Huafan University, Shihding, New Taipei City 223, Taiwan Correspondence should be addressed to Chiang-Ju Chien, cc@cc.hfu.edu.tw Received 31 July 2011; Revised 9 November 2011; Accepted 1 December 2011 Academic Editor: saac Chairez Copyright 2012 Y.-C. Wang and C.-J. Chien. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original wor is properly cited. We present a design method for iterative learning control system by using an output recurrent neural networ ORNN. Two ORNNs are employed to design the learning control structure. The first ORNN, which is called the output recurrent neural controller ORNC, is used as an iterative learning controller to achieve the learning control obective. To guarantee the convergence of learning error, some information of plant sensitivity is required to design a suitable adaptive law for the ORNC. Hence, a second ORNN, which is called the output recurrent neural identifier ORN, is used as an identifier to provide the required information. All the weights of ORNC and ORN will be tuned during the control iteration and identification process, respectively, in order to achieve a desired learning performance. The adaptive laws for the weights of ORNC and ORN and the analysis of learning performances are determined via a Lyapunov lie analysis. t is shown that the identification error will asymptotically converge to zero and repetitive output tracing error will asymptotically converge to zero except the initial resetting error. 1. ntroduction terative learning control LC system has become one of the most effective control strategies in dealing with repeated tracing control of nonlinear plants. The LC system improves the control performance by a self-tuning process in the traditional PD-type LC algorithms for linear plants or affine nonlinear plants with nonlinearities satisfying global Lipschitz continuous condition [1 3]. Recently, the LC strategies combined with other control methodologies such as observer-based iterative learning control [4], adaptive iterative learning control [5], robust iterative learning control [6], or adaptive robust iterative learning control [7], have been widely studied in order to extend the applications to more general class of nonlinear systems. However, more and more restrictions are required in theory to develop these learning controllers. Among these LC algorithms, PDtype LC algorithms are still attractive to engineers since they are simple and effective for real implementations and industry applications. A main problem of the PD-type LC algorithms is that a sufficient condition required to guarantee learning stability and convergence will depend on plant s input/output coupling function matrix. n general, it is hard to design the learning gain if the nonlinear dynamic plant is highly nonlinear and unnown. n order to get the input/output coupling function matrix, the LC using a neural or fuzzy system to solve the learning gain implementation problem can be found in [8, 9]. A neural networ or a fuzzy system was used to approximate the inverse of plant s input/output coupling function matrix. The inverse function matrix is claimed to be an optimal choice of the learning gain from a convergent condition point of view. As the nonlinear system is assumed to be unnown, some offline adaptive mechanisms are applied to update the networ parameters in order to approximate the ideal optimal learning gain. Actually, for control of unnown nonlinear systems, neural-networ-based controller has become an important strategy in the past two decades. Multilayer neural networs, recurrentneuralnetworsanddynamicneuralnetwor[10 16] were used for the design of adaptive controllers. On the other hand, fuzzy logic system, fuzzy neural networ,

2 Journal of Control Science and Engineering recurrent fuzzy neural networs and dynamic fuzzy neural networ were also a popular tool for the design of adaptive controllers [17 22].These concepts have also been applied to the design of adaptive iterative learning control of nonlinear plants [23 25]. However, few LC wors were developed for general unnown nonlinear plants, especially nonaffine nonlinear plants. As the authors can understand, a realtime recurrent networ RTRN was developed in [26] for real-time learning control of general unnown nonlinear plants. But unfortunately, their learning algorithm depends on the generalized inverse of weight matrix in the RTRN. f the generalized inverse of weight matrix does not exist, the learning control scheme is not implementable. n this paper, we consider the design of an iterative learning controller for a class of unnown nonlinear dynamic plants. Motivated by our previous wor in [27], an improved version of an identifier-based iterative learning controller is proposed by using an output recurrent neural networ ORNN. Two ORNNs are used to design an ORNN-based iterative learning control system. The proposed ORNNbased LC system includes an ORNN controller ORNC and an ORNN identifier ORN. The ORNC is used as an iterative learning controller to achieve the repetitive tracing control obective. The weights of ORNC are tuned via adaptive laws determined by a Lyapunov-lie analysis. n order to realize the adaptive laws and guarantee the convergence of learning error, some information of the unnown plant sensitivity is required for the design of adaptive laws. Hence, the ORN is then applied as an identifier to provide the required information from plant sensitivity. n a similar way, the weights of ORN are tuned via some adaptive laws determined by a Lyapunov-lie analysis. Both of the proposed ORNC and ORN update their networ weights along the control iteration and identification process, respectively. This ORNN-based LC system can be used to execute a repetitive control tas of a general nonlinear plant. t is shown that the identification error will asymptotically converge to zero and repetitive output tracing error will asymptotically converge to zero except the initial resetting error. This paper is organized as follows. The structure of ORNN is introduced in Section 2. n Section 3, we present the design of ORNC and ORN for the ORNN-based LC system. The adaptation laws are derived and the learning performance is guaranteed based on a Lyapunov-lie analysis. To illustrate the effectiveness of the proposed LC system, a numerical example is used in Section 4 for computer simulation. Finally a conclusion is made in Section 5. n the subsequent discussions, the following notations will be used in all the sections. i z denotes the absolute value of a function z. ii v = v v denotes the usual Euclidean norm of a vector v = [v 1,..., v n ] R n. iii A =max 1 i n { m =1 a i } denotes the norm of a matrix A ={a i } R n m. D O 3 x 1 x n Figure 1: Structure of the ORNN. 2. The Output Recurrent Neural Networ Layer 3 output layer Layer 2 hidden layer Layer 1 input layer n this paper, two ORNNs are used to design an iterative learning control system. The structure of the ORNN is shown in Figure 1, which comprises an input layer, a hidden layer, and an output layer. i Layer 1 nput Layer: Each node in this layer represents an input variable, which only transmits input value to the next layer directly. For the ith input node, i = 1,..., n +1, net 1 x i, i = 1,..., n i = D [ O 3], i = n +1 1 O 1 i = f 1 i net 1 i = net 1 i, where x i, i = 1,..., n represents the ith external input signal to the ith node of layer 1, and D[O 3 ]denotes the delay of ORNN output O 3 whcih can be further defined as x n+1 = D[O 3 ]. ii Layer 2 Hidden Layer: Each node in this layer performs an activation function whose inputs come from input layer. For the lth hidden node, a sigmoid function is adopted here such that the lth node, l = 1,..., M will be represented as O 2 l = f 2 l net 2 n+1 l = V il x 2 net 2 i=1 l = i, 1 1+exp net 2 l, 2 where x 2 i = O 1 i, V il is the connective weight between the input layer and the hidden layer, M is the number of neuron in the hidden layer.

Journal of Control Science and Engineering 3 iii Layer 3 Output Layer: Each node in this layer represents an output node, which computes the overall output as the summation of all input signals from hidden layer. For the output node, M net 3 = w l x 3 l, l=1 3 O 3 = f 3 net 3 = net 3, where x 3 l = O 2 l and w l is the connective weight between the hidden layer and the output layer. r ORNC u 1 M Reference model A.L. ORN A.L. y d t +1 + u Dynamic y t +1 plant sgny u y u,max e c t +1 e t +1 ^y t +1 + Let n denotes the dimension of input vector X = [x 1,..., x n ] R n 1 of nonlinear function f X andm denotes the number of neurons in the hidden layer, the ORNN which performs as an approximator of the nonlinear function f X is now described in a matrix form as follows: O 3 D [ O 3], X, W, V = W O 2 V X a, 4 where W R M 1 and V R n+1 M are outputhidden wight matrix and hidden-input weight matrix, respectively, X R n 1 is the external input vector, X a [X, D[O 3 ]] R n+1 1 is the augmented neural input vector, and D[O 3 ] denotes the delay of ORNN output O 3. The activation function vector is defined as O 2 V X a [O 2 1 V1 X a,..., O 2 M VMX a ] R M 1 where V = [V 1, V 2,..., V M ]withv l R n+1 1 being the lth column vector, and O 2 l Vl X a 1/1+exp Vl X a R, l = 1,..., M is a sigmoid function. 3. Design of Output-Recurrent- Neural- Networ-Based terative Learning Control System n this paper, we consider an unnown nonlinear dynamic plant which can perform a given tas repeatedly over a finite time sequence t ={0,..., N} as follows: y t +1 = f y,..., y t n +1, u, 5 where Z + denotes the index of control iteration number and t = {0,..., N} denotes the time index. The signals y andu R are the system output and input, respectively. f : R n+1 R is the unnown continuous function, n represents the respective output delay order. Given a specified desired traectory y d, t {0,..., N}, the control obective is to design an output-recurrent-neuralnetwor-based iterative learning control system such that when control iteration number is large enough, y d y will converge to some small positive error tolerance bounds for all t {0,..., N} even if there exists an initial resetting error. Here the initial resetting error means that y d 0 y 0 for all 1. To achieve the control obective, an iterative learning control system based on ORNN design is proposed in Figure 2. nthis figure, D denotes the delay in time domain and M denotes the memory in control iteration domain. y Figure 2: Bloc diagram of the ORNN-based LC system. Before we state the design steps of the proposed control structure, some assumptions on the unnown nonlinear system and desired traectories are given as follows. A1 The nonlinear dynamic plant is a relaxed system whose input u andoutputy are related by y = 0forallt {,..., 1}. A2 There exists a bounded unnown upper bounding function y u,max = max y u such that 0 < y u y u,max, where the factor y u = y t + 1/ u represents the sensitivity of the plant with respect to its input. A3 The reference model is designed to generate the bounded desired traectory y d t + 1 = f d y d,..., y d t n +1,r, which is based on a specified bounded reference input r with f d : R n+1 R being a continuous function. The design of the ORNN-based iterative learning control system is divided into two parts. 3.1. Part 1: Design of ORNC and Corresponding Adaptive Laws. Based on the assumptions on the nonlinear plant 5, we define a tracing error e c atth control iteration as follows: D e c = y d y. 6 t is noted that there exist bounded constants ε c, Z + such that the initial value of e c will satisfy ec0 = εc. 7

4 Journal of Control Science and Engineering The difference of e c between two successive iterations can be computed as [28] Δect +1 = ec t +1 ect +1 = y t +1 y t +1 y t +1 u u u y u u u. The ORNN is used to design an ORNC in order to achieve the iterative learning control obective. Let n c be the dimension of the external input vector Xc = [r, y, u 1, ] R nc 1 and M c denote the number of neurons in hidden layer of the ORNC. The ORNC which performs as an iterative learning controller is described in a matrix form as follows: [ ] u = O c 3 D O 3 c, Xc, Wc, Vc = Wc O c 2 V c Xca 9, where Wc R Mc 1 and Vc R nc+1 Mc are outputhidden wight matrix and hidden-input weight matrix to be tuned via some suitable adaptive laws, respectively, and Xca [Xc, D[O c 3 ]] R nc+1 1 is the augmented neural input vector. For the sae of convenience, we define O c 2 Vc Xca O c 2. Now substituting 9 into 8, we will have Δect +1 = yu Wc O c 2 Wc O c 2. 10 For simplicity, we define ΔXca = Xca Xca, ΔWc = Wc Wc, ΔVc = Vc Vc. After adding and subtracting Wc O c 2 to10, we can find that Δect +1 = yuδw c O c 2 yuw c O c 2 O 2 11 c. 8 nvestigating the second term in the right hand side of 11 by using the mean-value theorem, we have O 2 c O 2 c = O c 2 V c Xca Vc Xca = O c 2 V c Xca Vc Xca +Vc Xca Vc Xca = O c 2 ΔV c Xca + Vc ΔXca, 12 where O c 2 = diag[o 2 c,1,..., O 2 c,m c ] R Mc Mc with O 2 c,l do 2 c,l Zc,l /dz c,l Zc,l, Z c,l has a value between V c,l Xca andvc,l Xca, l = 1,..., M c. Nowifwesubstitute12 into 11, we will have Δect +1 = yuδw c O 2 c yuw c O c 2 ΔV c Xca + Vc ΔXca. 13 The adaptation algorithms for weights Wc andvc of ORNC at next + 1th control iteration to guarantee the error convergence are given as follows: Wc = Wc + sgn yu ect +1O c 2, 14 y u,max M c Vc = Vc X ca ΔXca Vc X 2, 15 where y u,max is defined in assumption A2. f we substitute adaptation laws 14and15 into 13,we can find that e c t +1 = ect +1 ect +1 y u O 2 c O 2 c. y u,max M c ca 16 Theorem 1. Consider the nonlinear plant 5 which satisfies assumptions A1 A3. The proposed ORNC 9 and adaptation laws 14 and 15 will ensure the asymptotic convergence of tracing error as control iteration approaches infinity. Proof. Let us choose a discrete-type Lyapunov function as E ct +1 = 1 2 e ct +1 2, 17 then the change of Lyapunov function is ΔEct +1 = Ec t +1 Ect +1 = 1 [ e c t +1 2 ect +1 2 ]. 2 18 Taing norms on 16, it yields ec t +1 = e y u O 2 2 c ct +1 1 y u,max M c < ect +1 19 for iteration 1. This further implies that Ect +1 > 0, ΔEct +1 < 0, for all t {0,..., N} for 1. Using Lyapunov stability of Ect +1> 0, ΔEct +1< 0and7, the tracing error ec will satisfy lim e { c = ε c, t = 0 20 0, t 0. This proves Theorem 1.

Journal of Control Science and Engineering 5 Remar 2. f the plant sensitivity y u is completely nown so that sgny u and y u,max are available, then the control obective can be achieved by using the adaptation algorithms 14 and15. However, the plant sensitivity y u isin general unnown or only partially nown. n part 2, we will design an ORNN-based identifier ORN to estimate the unnown plant sensitivity y u and then provide the sign function and upper bounding function of y u for adaptation algorithms of ORNC. 3.2. Part 2: Design of ORN and Corresponding Adaptive Laws. After each control iteration, the ORN subsequently begins to perform identification process. The trained ORN will then provide the approximated plant sensitivity to the ORNC to start the next control iteration. We would lie to emphasize that the ORN only identifies the nonlinear plant after each control iteration. This concept is quite different from traditional control tass [29] and very important to the proposed ORNN-based LC structure. The structure of ORNN is further applied to design an ORN to identify the nonlinear plant after the th control iteration. The identification process is stated as follows. After each trial of controlling the nonlinear system, we collect the input output data u andy, t = 0, 1,..., N + 1 as the training data for the identifier. When discussing the identification, we omit the control iteration index and introduce a new identification iteration index Z + to represent the number of identification process. That is, the notation for the training data u, y and the ORN output ŷ, are simplified as u, y, and ŷ, respectively. For the ORN, let n be the dimension of external input vector X = [u, y] R n 1 and M denote the number of neurons in hidden layer of the ORN. The ORN which performs as an iterative learning identifier for nonlinear plant 5 isnowdescribedinamatrixformas follows: [ ] ŷ t +1 = O 3 D O 3, Xa, W, V = W O 2 V Xa 21, where W R M 1 and V n +1 M R are outputhidden wight matrix and hidden-input weight matrix to be tuned via some suitable adaptive laws, respectively, and Xa [X, D[O 3 ]] R n+1 1 is the augmented neural input vector. For the sae of convenience, we define O 2 V Xa O 2. Based on the assumptions on the nonlinear plant 5, we define an identification error e atth identification processasfollows: e = y ŷ. 22 The difference of e between two successive identification process can be computed as Δe t +1 = e +1 t +1 e t +1 = ŷ +1 t +1 ŷ t +1. 23 Now substituting 21 into 23, we will have Δe t +1 = W +1 O 2 +1 W O 2. 24 For simplicity, we define ΔXa = Xa +1 Xa, ΔW = W +1 W, ΔV = V +1 V. After adding and subtracting W O 2 +1 to24, we can find Δe t +1 = ΔW O 2 +1 W O 2 +1 O 2 25. nvestigating the second term in the right hand side of 25 by using the mean-value theorem, we can derive O 2 +1 O 2 = O 2 V +1 Xa +1 V Xa = O 2 V +1 Xa +1 V Xa +1 +V Xa +1 V Xa = O 2 ΔV Xa +1 + V ΔXa, 26 where O 2 = diag[o 2,1,..., O 3,M ] R M M with O 2,l do 2,l Z,l /dz,l, Z Z,l,l hasavalue between V,l +1 Xa +1 andv,l Xa, l = 1,..., M. Nowifwesubstitute26 into 25, we will have Δe t +1 = ΔW O 2 +1 W O 2 ΔV Xa +1 + V ΔXa. 27 The adaptation algorithms for weights W +1 and V +1 ofornatnext + 1th identification process are given as follows: = W + e t +1O 2 W +1 V +1 = V X+1 a M +1 ΔXa V X +1 2. f we substitute adaptation laws 28 into 27,we have a 28 +1 e +1 t +1 = e t +1 e t +1 O2 O 2 +1. M 29 Theorem 3. Consider the nonlinear dynamic plant 5 which satisfies assumptions A1 A3. The proposed ORN 21 and adaptation laws 28 will ensure that the asymptotic convergenceofidentificationerrorisguaranteedasthenumbers of identification approach infinity.

6 Journal of Control Science and Engineering Proof. Let us choose a discrete-type Lyapunov function as E t +1 = 1 2 e t +1 2, t {0,..., N}, 30 then we can derive the change of Lyapunov function as ΔE t +1 = E +1 t +1 E t +1 = 1 2 [ e +1 t +1 2 e t +1 2 ]. Taing norms on 29, we have e +1 t +1 = e t +1 1 < e t +1 O 2 +1 2 M 31 32 for iteration 1. This implies that E t +1> 0, ΔE t + 1 < 0, for all t {0,..., N} for 1, and hence the identification error e will satisfy lim e =0, for all t {0, 1,..., N}.ThisprovesTheorem3. Remar 4. The ORNN is a promising tool for identification because it can approximate any well-behaved nonlinear function to any desired accuracy. This good function approximation is applied to estimate the unnown plant sensitivity in this paper. The plant sensitivity y u in 8 can be approximated as follows: y u y t +1 u ŷ t +1 u. 33 Note that the index in the identifier output ŷ isremoved once the identification process stops. Applying the chain rule to 21, it yields ŷ t +1 u = O3 u Also from 21, we have M = w,l O2,l u l=1. O 2,l u = f 2,l M ŷ t +1 O 2,l = l=1 O 2,l u net 2,l 34 net 2,l u. 35 Since the inputs to ORN are u, y andd[o 3 ], we further have Thus, net 2,l = V,1lu + V + V,3lD [ O 3,2ly ]. 36 net 2,l u = V 37,1l. From 34, 35 and37, we obtain ŷ u = ŷ t +1 u M = w,l f 2,l net 2,l l=1 V,1l, 38 where 0 < f 2,l net 2,l < 0.5. f we define w max l w,l,and V,1 max l V,1l, then ŷu w M f 2,l net 2,l V,1 M w V 39 2,1 ŷu,max. The sign function and upper bounding function of plant sensitivity after finishing the identification process at th control iteration can be obtained as follows: sgn y u = sgn ŷ u y u,max = max { ŷ u,max, ŷ 1 u,max }. 40 t is noted that we do not need the exact plant sensitivity y u for the design of adaptive law 14. Even though there may exist certain approximation error between y u andŷ u, we can still guarantee the convergence of learning error since only a upper bounding function is required. Also note that the value of sgny u +1 or 1 can be easily determined from the identification result. 4. Simulation Example n this section, we use the proposed ORNN-based LC to iteratively control an unnown non-bbo nonlinear dynamic plant [26, 29]. The difference equation of the nonlinear dynamic plant is given as y t +1 = 0.2 y 2 +0.2y t 1 +0.4sin 0.5 y t 1 + y cos 0.5 y t 1 + y +1.2u, 41 where y is the system output, u is the control input. The reference model is chosen as y d t +1 = 0.6y d + r, y d 0 = 0, 42 where r = sin2πt/25 + sin2πt/10 is a bounded reference input. The control obective is to force y to trac the desired traectory y d as close as possible over a finite time interval t {1,..., 200} except the initial point. The networ weight adaptation for the ORN and ORNC is designed according to 14, 15, and 28, respectively. n the ORNC, we set Wc R 2 1 and Vc R 4 2, that is, only two hidden nodes in layer 2 are used to construct the ORNC. n a similar way, we let W R 2 1

Journal of Control Science and Engineering 7 10 0 10 8 10 5 10 9 10 10 10 10 1 2 3 4 5 6 7 8 9 10 a 4 10 11 1 2 3 4 5 b 4 2 2 0 2 4 0 50 100 150 200 c 2 0 2 4 0 50 100 150 200 d 0 2 4 0 50 100 150 200 e Figure 3: a max t {1,...,200} ec versus control iteration. bmax t {0,...,200} e 10, versus identification process at the 10th control iteration. c y 10 dotted line and y d solid line versus time t at the 10th control iteration. d ŷ 10 dotted line and y 10 solid line versus time t at the 10th control iteration. e u 10 at the 10th control iteration versus time t. and V R 3 2, that is, only two hidden nodes in layer 2 are used to set up the ORN. For simplicity, all the initial conditions of ORNC parameters are set to be 0 at the first control iteration. n addition, the initial ORN parameters are set to be 0 at the first identification process which begins after the first control iteration. We assume that the plant initial condition satisfies y 0 = 2 + randn where randn is a generator of random number with normal distribution, mean = 0andvariance= 1. To study the effects of learning performances, we first show the maximum value of tracing error ec, t {1,..., 200} with respect to control iteration in Figure 3a. tisnoted that ec0 is omitted in calculating the maximum value of tracing error since it is not controllable. The identification error at 10th control iteration e 10, with respect to identification process is shown in Figure 3b. According to the simulation results, it is clear that the asymptotic convergence proved in Theorems 1 and 3 is achieved. Since a reasonable tracing performance is almost observed at the 10th control iteration, the traectories between the desired output y d and plant output y 10 at the 10th control iteration are shown to demonstrate the control performance in Figure 3c. Figure 3d shows the comparison between the identification result of ŷ 10 and the plant output y 10. The nice identification result enables the ORN to provide the required information for the design of ORNC. Finally, the bounded control input u 10 is plotted in Figure 3e. 5. Conclusion For controlling a repeatable nonaffine nonlinear dynamic plant, we propose an output-recurrent-neural-networbased iterative learning control system in this paper. The control structure consists of an ORNC used as an iterative learning controller and an ORN used as an identifier.

8 Journal of Control Science and Engineering The ORNC is the main controller utilized to achieve the repetitive control tas. The ORN is an auxiliary component utilized to provide some useful information from plant sensitivity for the design of ORNC s adaptive laws. All the networ weights of ORNC and ORN will be tuned during the control iteration and identification process so that no prior plant nowledge is required. The adaptive laws for the weights of ORNC and ORN and the analysis of learning performances are determined via a Lyapunov-lie analysis. We show that if the ORN can provide the nowledge of plant sensitivity for ORNC, then output tracing error will asymptotically converge to zero except an initial resetting error. We also show that the obective of identification can be achieved by the ORN if the number of identifications is large enough. Acnowledgment This wor is supported by National Science Council, R.O.C., under Grant NSC99-2221-E-211-011-MY2. References [1] K. L. Moore and J. X. Xu, Special issue on iterative learning control, nternational Journal of Control, vol. 73, no. 10, pp. 819 823, 2000. [2]D.A.Bristow,M.Tharayil,andA.G.Alleyne, Asurveyof iterative learning, EEE Control Systems Magazine, vol. 26, no. 3, pp. 96 114, 2006. [3] H. S. Ahn, Y. Q. Chen, and K. L. Moore, terative learning control: brief survey and categorization, EEE Transactions on Systems, Man and Cybernetics. Part C, vol. 37, no. 6, pp. 1099 1121, 2007. [4] J. X. Xu and J. Xu, Observer based learning control for a class of nonlinear systems with time-varying parametric uncertainties, EEE Transactions on Automatic Control, vol. 49, no. 2, pp. 275 281, 2004. [5]. Rotariu, M. Steinbuch, and R. Ellenbroe, Adaptive iterative learning control for high precision motion systems, EEE Transactions on Control Systems Technology, vol. 16, no. 5, pp. 1075 1082, 2008. [6] A. Tayebi, S. Abdul, M. B. Zaremba, and Y. Ye, Robust iterative learning control design: application to a robot manipulator, EEE/ASME Transactions on Mechatronics, vol. 13, no. 5, pp. 608 613, 2008. [7] J. X. Xu and B. Viswanathan, Adaptive robust iterative learning control with dead zone scheme, Automatica, vol. 36, no. 1, pp. 91 99, 2000. [8] J.Y. Choi and H. J. Par, Neural-based iterative learning control for unnown systems, in Proceedings of the 2nd Asian Control Conference, pp. 243 246, Seoul, Korea, 1997. [9] C. J. Chien, A sampled-data iterative learning control using fuzzy networ design, nternational Journal of Control, vol. 73, no. 10, pp. 902 913, 2000. [10] J. H. Par, S. H. Huh, S. H. Kim, S. J. Seo, and G. T. Par, Direct adaptive controller for nonaffine nonlinear systems using self-structuring neural networs, EEE Transactions on Neural Networs, vol. 16, no. 2, pp. 414 422, 2005. [11] J. S. Wang and Y. P. Chen, A fully automated recurrent neural networ for unnown dynamic system identification and control, EEE Transactions on Circuits and Systems, vol. 53, no. 6, pp. 1363 1372, 2006. [12] C. F. Hsu, C. M. Lin, and T. T. Lee, Wavelet adaptive bacstepping control for a class of nonlinear systems, EEE Transactions on Neural Networs, vol. 17, no. 5, pp. 1 9, 2006. [13] C. M. Lin, L. Y. Chen, and C. H. Chen, RCMAC hybrid control for MMO uncertain nonlinear systems using slidingmode technology, EEE Transactions on Neural Networs, vol. 18, no. 3, pp. 708 720, 2007. [14] Z. G. Hou, M. M. Gupta, P. N. Niiforu, M. Tan, and L. Cheng, A recurrent neural networ for hierarchical control of interconnected dynamic systems, EEE Transactions on Neural Networs, vol. 18, no. 2, pp. 466 481, 2007. [15] Z. Liu, R. E. Torres, N. Patel, and Q. Wang, Further development of input-to-state stabilizing control for dynamic neural networ systems, EEE Transactions on Systems, Man, and Cybernetics Part A: Systems and Humans,vol.38,no.6,pp. 1425 1433, 2008. [16] Z. Liu, S. C. Shih, and Q. Wang, Global robust stabilizing control for a dynamic neural networ system, EEE Transactions on Systems, Man, and Cybernetics Part A: Systems and Humans, vol. 39, no. 2, pp. 426 436, 2009. [17] Y. G. Leu, W. Y. Wang, and T. T. Lee, Observer-based direct adaptive fuzzy-neural control for nonaffine nonlinear systems, EEE Transactions on Neural Networs, vol. 16, no. 4, pp. 853 861, 2005. [18] S. Labiod and T. M. Guerra, Adaptive fuzzy control of a class of SSO nonaffine nonlinear systems, Fuzzy Sets and Systems, vol. 158, no. 10, pp. 1126 1137, 2007. [19] B. Chen, X. Liu, and S. Tong, Adaptive fuzzy output tracing control of MMO nonlinear uncertain systems, EEE Transactions on Fuzzy Systems, vol. 15, no. 2, pp. 287 300, 2007. [20] Y. J. Liu and W. Wang, Adaptive fuzzy control for a class of uncertain nonaffne nonlinear system, nformation Sciences, vol. 177, no. 18, pp. 3901 3917, 2007. [21] R. J. Wai and C. M. Liu, Design of dynamic petri recurrent fuzzy neural networ and its application to path-tracing control of nonholonomic mobile robot, EEE Transactions on ndustrial Electronics, vol. 56, no. 7, pp. 2667 2683, 2009. [22] C. H. Lee, Y. C. Lee, and F. Y. Chang, A dynamic fuzzy neural system design via hybridization of EM and PSO algorithms, AENG nternational Journal of Computer Science, vol. 37, no. 3, 2010. [23] W. G. Seo, B. H. Par, and J. S. Lee, Adaptive fuzzy learning control for a class of nonlinear dynamic systems, nternational Journal of ntelligent Systems, vol. 15, no. 12, pp. 1157 1175, 2000. [24] C. J. Chien and L. C. Fu, terative learning control of nonlinear systems using neural networ design, Asian Journal of Control, vol. 4, no. 1, pp. 21 29, 2002. [25] C. J. Chien, A combined adaptive law for fuzzy iterative learning control of nonlinear systems with varying control tass, EEE Transactions on Fuzzy Systems, vol.16,no.1,pp. 40 51, 2008. [26] T. W.S. Chow and Y. Fang, A recurrent neural-networbased real-time learning control strategy applying to nonlinear systems with unnown dynamics, EEE Transactions on ndustrial Electronics, vol. 45, no. 1, pp. 151 161, 1998. [27] Y. C. Wang, C. J. Chien, and D. T. Lee, An output recurrent fuzzy neural networ based iterative learning control for nonlinear systems, in Proceedings of the EEE nternational

Journal of Control Science and Engineering 9 Conference on Fuzzy Systems, pp. 1563 1569, Hong Kong, 2008. [28] Y. C. Wang and C. C. Teng, Output recurrent fuzzy neural networs based model reference control for unnown nonlinear systems, nternational Journal of Fuzzy Systems, vol. 6, no. 1, pp. 28 37, 2004. [29] C. C. Ku and K. Y. Lee, Diagonal recurrent neural networs for dynamic systems control, EEE Transactions on Neural Networs, vol. 6, no. 1, pp. 144 156, 1995.

nternational Journal of Rotating Machinery Engineering Journal of The Scientific World Journal nternational Journal of Distributed Sensor Networs Journal of Sensors Journal of Control Science and Engineering Advances in Civil Engineering Submit your manuscripts at Journal of Journal of Electrical and Computer Engineering Robotics VLS Design Advances in OptoElectronics nternational Journal of Navigation and Observation Chemical Engineering Active and Passive Electronic Components Antennas and Propagation Aerospace Engineering Volume 2010 nternational Journal of nternational Journal of nternational Journal of Modelling & Simulation in Engineering Shoc and Vibration Advances in Acoustics and Vibration