Asymptotic Tracking for Uncertain Dynamic Systems Via a Multilayer Neural Network Feedforward and RISE Feedback Control Structure

Similar documents
THE control of systems with uncertain nonlinear dynamics

Optimal Control of Uncertain Nonlinear Systems

LYAPUNOV-BASED ROBUST AND ADAPTIVE CONTROL OF NONLINEAR SYSTEMS USING A NOVEL FEEDBACK STRUCTURE

ADAPTIVE FORCE AND MOTION CONTROL OF ROBOT MANIPULATORS IN CONSTRAINED MOTION WITH DISTURBANCES

Combined NN/RISE-based Asymptotic Tracking Control of a 3 DOF Robot Manipulator

OVER THE past 20 years, the control of mobile robots has

AROTORCRAFT-BASED unmanned aerial vehicle

PERIODIC signals are commonly experienced in industrial

Adaptive Robust Tracking Control of Robot Manipulators in the Task-space under Uncertainties

Lyapunov Stability of Linear Predictor Feedback for Distributed Input Delays

Observer Based Output Feedback Tracking Control of Robot Manipulators

Adaptive and Robust Controls of Uncertain Systems With Nonlinear Parameterization

Adaptive set point control of robotic manipulators with amplitude limited control inputs* E. Zergeroglu, W. Dixon, A. Behal and D.

Adaptive Control of a Class of Nonlinear Systems with Nonlinearly Parameterized Fuzzy Approximators

Tracking Control of Robot Manipulators with Bounded Torque Inputs* W.E. Dixon, M.S. de Queiroz, F. Zhang and D.M. Dawson

A Sliding Mode Controller Using Neural Networks for Robot Manipulator

Neural Network-Based Adaptive Control of Robotic Manipulator: Application to a Three Links Cylindrical Robot

A Discrete Robust Adaptive Iterative Learning Control for a Class of Nonlinear Systems with Unknown Control Direction

NEURAL NETWORKS (NNs) play an important role in

RBF Neural Network Adaptive Control for Space Robots without Speed Feedback Signal

IN recent years, controller design for systems having complex

Robust Gain Scheduling Synchronization Method for Quadratic Chaotic Systems With Channel Time Delay Yu Liang and Horacio J.

IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 58, NO. 5, MAY invertible, that is (1) In this way, on, and on, system (3) becomes

1348 IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS PART B: CYBERNETICS, VOL. 34, NO. 3, JUNE 2004

Deadzone Compensation in Motion Control Systems Using Neural Networks

A Sliding Mode Control based on Nonlinear Disturbance Observer for the Mobile Manipulator

IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 50, NO. 5, MAY Bo Yang, Student Member, IEEE, and Wei Lin, Senior Member, IEEE (1.

I. MAIN NOTATION LIST

Robust Control of Robot Manipulator by Model Based Disturbance Attenuation

458 IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, VOL. 16, NO. 3, MAY 2008

Adaptive Robust Control for Servo Mechanisms With Partially Unknown States via Dynamic Surface Control Approach

H State-Feedback Controller Design for Discrete-Time Fuzzy Systems Using Fuzzy Weighting-Dependent Lyapunov Functions

LYAPUNOV-BASED CONTROL OF SATURATED AND TIME-DELAYED NONLINEAR SYSTEMS

CHATTERING-FREE SMC WITH UNIDIRECTIONAL AUXILIARY SURFACES FOR NONLINEAR SYSTEM WITH STATE CONSTRAINTS. Jian Fu, Qing-Xian Wu and Ze-Hui Mao

Filter Design for Linear Time Delay Systems

Adaptive Predictive Observer Design for Class of Uncertain Nonlinear Systems with Bounded Disturbance

Adaptive motion/force control of nonholonomic mechanical systems with affine constraints

Global robust output feedback tracking control of robot manipulators* W. E. Dixon, E. Zergeroglu and D. M. Dawson

Neural Network Control of Robot Manipulators and Nonlinear Systems

A composite adaptive output feedback tracking controller for robotic manipulators* E. Zergeroglu, W. Dixon, D. Haste, and D.

Nonlinear PD Controllers with Gravity Compensation for Robot Manipulators

RECENTLY, many artificial neural networks especially

Robust fuzzy control of an active magnetic bearing subject to voltage saturation

Riccati difference equations to non linear extended Kalman filter constraints

Tracking Control of a Class of Differential Inclusion Systems via Sliding Mode Technique

ELECTRODYNAMIC magnetic suspension systems (EDS

Several Extensions in Methods for Adaptive Output Feedback Control

On Design of Reduced-Order H Filters for Discrete-Time Systems from Incomplete Measurements

AS A POPULAR approach for compensating external

Hover Control for Helicopter Using Neural Network-Based Model Reference Adaptive Controller

CHATTERING REDUCTION OF SLIDING MODE CONTROL BY LOW-PASS FILTERING THE CONTROL SIGNAL

AFAULT diagnosis procedure is typically divided into three

538 IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS PART B: CYBERNETICS, VOL. 32, NO. 4, AUGUST 2002

Distributed Coordinated Tracking With Reduced Interaction via a Variable Structure Approach Yongcan Cao, Member, IEEE, and Wei Ren, Member, IEEE

Adaptive Jacobian Tracking Control of Robots With Uncertainties in Kinematic, Dynamic and Actuator Models

Exponential Controller for Robot Manipulators

Delay-dependent Stability Analysis for Markovian Jump Systems with Interval Time-varying-delays

MANY adaptive control methods rely on parameter estimation

The Rationale for Second Level Adaptation

DYNAMIC NEURAL NETWORK-BASED ROBUST CONTROL METHODS FOR UNCERTAIN NONLINEAR SYSTEMS

Neural network based robust hybrid control for robotic system: an H approach

Design Artificial Nonlinear Controller Based on Computed Torque like Controller with Tunable Gain

Simultaneous state and input estimation with partial information on the inputs

Dynamic Integral Sliding Mode Control of Nonlinear SISO Systems with States Dependent Matched and Mismatched Uncertainties

Adaptive NN Control of Dynamic Systems with Unknown Dynamic Friction

Chaos Suppression in Forced Van Der Pol Oscillator

Nonlinear Tracking Control of Underactuated Surface Vessel

Set-based adaptive estimation for a class of nonlinear systems with time-varying parameters

Takagi Sugeno Fuzzy Sliding Mode Controller Design for a Class of Nonlinear System

Nonlinear Controller Design of the Inverted Pendulum System based on Extended State Observer Limin Du, Fucheng Cao

Distributed Tracking ControlforLinearMultiagent Systems With a Leader of Bounded Unknown Input

Design and Stability Analysis of Single-Input Fuzzy Logic Controller

A Novel Integral-Based Event Triggering Control for Linear Time-Invariant Systems

Simultaneous State and Fault Estimation for Descriptor Systems using an Augmented PD Observer

CONSTRAINED MODEL PREDICTIVE CONTROL ON CONVEX POLYHEDRON STOCHASTIC LINEAR PARAMETER VARYING SYSTEMS. Received October 2012; revised February 2013

MCE/EEC 647/747: Robot Dynamics and Control. Lecture 12: Multivariable Control of Robotic Manipulators Part II

EML5311 Lyapunov Stability & Robust Control Design

A Nonlinear Disturbance Observer for Robotic Manipulators

Using Neural Networks for Identification and Control of Systems

Stability of Switched Linear Hyperbolic Systems by Lyapunov Techniques

Robust Observer for Uncertain T S model of a Synchronous Machine

An Adaptive Full-State Feedback Controller for Bilateral Telerobotic Systems

HIGHER ORDER SLIDING MODES AND ARBITRARY-ORDER EXACT ROBUST DIFFERENTIATION

Small Gain Theorems on Input-to-Output Stability

A DISCRETE-TIME SLIDING MODE CONTROLLER WITH MODIFIED FUNCTION FOR LINEAR TIME- VARYING SYSTEMS

Piecewise Linear Quadratic Optimal Control

Fixed-Order Robust H Filter Design for Markovian Jump Systems With Uncertain Switching Probabilities

IN THIS PAPER, we consider a class of continuous-time recurrent

An Adaptive Iterative Learning Control for Robot Manipulator in Task Space

Design of Sliding Mode Control for Nonlinear Uncertain System

Weak Convergence of Nonlinear High-Gain Tracking Differentiator

THIS paper deals with robust control in the setup associated

An Adaptive LQG Combined With the MRAS Based LFFC for Motion Control Systems

Robust Adaptive Control of Nonholonomic Mobile Robot With Parameter and Nonparameter Uncertainties

H-infinity Model Reference Controller Design for Magnetic Levitation System

WE PROPOSE a new approach to robust control of robot

DISTURBANCE OBSERVER BASED CONTROL: CONCEPTS, METHODS AND CHALLENGES

Optimal Design of CMAC Neural-Network Controller for Robot Manipulators

Initial Shift Issues on Discrete-Time Iterative Learning Control With System Relative Degree

UDE-based Dynamic Surface Control for Strict-feedback Systems with Mismatched Disturbances

Transcription:

80 IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 53, NO. 9, OCTOBER 008 [7] H. Gao, T. Chen, and T. Chai, Passivity and passification for networked control systems, SIAM J. Control and Optimiz., vol. 46, no. 4, pp. 99 3, 007. [8] H. Gao, J. Lam, and C. Wang, Robust energy-to-peak filter design for stochastic time-delay systems, Syst. & Control Lett., vol. 55, no., pp. 0, 006. [9] L. Guo and H. Wang, Fault detection and diagnosis for general stochastic systems using B-spline expansions and nonlinear filters, IEEE Trans. Circuits Syst. I, vol. 5, no. 8, pp. 644 65, Aug. 005. [0] L. Guo, F. Yang, and J. Fang, Multiobjective filtering for nonlinear time-delay systems with nonzero initial conditions based on convex optimizations, Circuits, Syst., Signal Process., vol. 5, no. 5, pp. 59 607, 006. [] A. Germani, C. Manes, and P. Palumbo, Filtering of stochastic nonlinear differential systems via a carleman approximation approach, IEEE Trans. Autom. Control, vol. 5, no., pp. 66 7, Nov. 007. [] R. Z. Has minskii, Stochastic Stability of Differential Equations. Alphen, The Netherlands: Sijtjoff and Noordhoff, 980. [3] Y. C. Ji and W. B. Gao, Nonlinear H control and estimation of optimal H gain, Syst. & Control Lett., vol. 4, pp. 3 33, 995. [4] X. Mao, Stochastic Differential Equations and Their Applications. Chichester, U.K.: Horwood Publisher, 997. [5] M. Mahmoud, Y. Shi, and H. Nounou, Resilient observer-based control of uncertain time-delay systems, Int. J. Innovative Comput., Inform., Control, vol. 3, no., pp. 407 48, 007. [6] S. K. Nguang and P. Shi, Nonlinear H filtering of sampled-data systems, Automatica, vol. 36, pp. 303 30, 000. [7] S. K. Nguang and P. Shi, H filtering design for uncertain nonlinear systems under sampled measurements, Int. J. Syst. Sci., vol. 3, pp. 889 898, 00. [8] S. Sarkka, On unscented Kalman filtering for state estimation of continuous-time nonlinear systems, IEEE Trans. Autom. Control, vol. 5, no. 9, pp. 63 64, Sep. 007. [9] U. Shaked and N. Berman, H nonlinear filtering of discrete-time processes, IEEE Trans. Signal Process., vol. 43, no. 9, pp. 05 09, Sep. 995. [0] P. Shi, M. Mahmoud, S. K. Nguang, and A. Ismail, Robust filtering for jumping systems with mode-dependent delays, Signal Process., vol. 86, pp. 40 5, 006. [] B. Sinopoli, L. Schenato, M. Franceschetti, K. Poolla, M. I. Jordan, and S. S. Sastry, Kalman filtering with intermittent observations, IEEE Trans. Autom. Control, vol. 49, no. 9, pp. 453 464, Sep. 004. [] A. J. Van der Schaft, L -gain analysis of nonlinear systems and nonlinear state feedback H control, IEEE Trans. Autom.Control, vol. 37, no. 6, pp. 770 784, Jun. 99. [3] Z. Wang, D. W. C. Ho, and X. Liu, Variance-constrained filtering for uncertain stochastic systems with missing measurements, IEEE Trans. Autom. Control, vol. 48, no. 7, pp. 54 58, Jul. 003. [4] Z. Wang and D. W. C. Ho, Filtering on nonlinear time-delay stochastic systems, Automatica, vol. 39, pp. 0 09, 003. [5] Z. Wang, F. Yang, D. W. C. Ho, and X. Liu, Robust H filtering for stochastic time-delay systems with missing measurements, IEEE Trans. Signal Process., vol. 54, no. 7, pp. 579 587, Jul. 006. [6] Z. Wang, Y. Liu, and X. Liu, H filtering for uncertain stochastic time-delay systems with sector-bounded nonlinearities, Automatica, vol. 44, no. 5, pp. 68 77, May 008. [7] G. Wei and H. Shu, H filtering on nonlinear stochastic systems with delay, Chaos Solitons & Fractals, vol. 33, no., pp. 663 670, 007. [8] G. Zames, Feedback and optimal sensitivity: Model reference transformation, multiplicative seminorms and approximative inverses, IEEE Trans. Autom. Control, vol. AC-6, no., pp. 30 30, Apr. 98. [9] W. Zhang, B. S. Chen, and C. S. Tseng, Robust H filtering for nonlinear stochastic systems, IEEE Trans. Signal Process., vol. 53, no., pp. 589 598, Feb. 005. [30] W. Zhang and B. S. Chen, State feedback H control for a class of nonlinear stochastic systems, SIAM J. Control and Optimiz., vol. 44, pp. 973 99, 006. Asymptotic Tracking for Uncertain Dynamic Systems Via a Multilayer Neural Network Feedforward and RISE Feedback Control Structure Parag M. Patre, William MacKunis, Kent Kaiser, and Warren E. Dixon Abstract The use of a neural network (NN) as a feedforward control element to compensate for nonlinear system uncertainties has been investigated for over a decade. Typical NN-based controllers yield uniformly ultimately bounded (UUB) stability results due to residual functional reconstruction inaccuracies and an inability to compensate for some system disturbances. Several researchers have proposed discontinuous feedback controllers (e.g., variable structure or sliding mode controllers) to reject the residual errors and yield asymptotic results. The research in this paper describes how a recently developed continuous robust integral of the sign of the error (RISE) feedback term can be incorporated with a NN-based feedforward term to achieve semi-global asymptotic tracking. To achieve this result, the typical stability analysis for the RISE method is modified to enable the incorporation of the NN-based feedforward terms, and a projection algorithm is developed to guarantee bounded NN weight estimates. Index Terms Adaptive control, asymptotic stability, Lyapunov methods, neural network, nonlinear systems, RISE feedback, robust control. I. INTRODUCTION Control researchers have extensively investigated the use of neural networks (NNs) as a feedforward control element over the last fifteen years. The focus on NN-based control methods is spawned from the ramifications of the fact that NNs are universal approximators []. That is, NNs can be used as a black-box estimator for a general class of systems. Examples include: nonlinear systems with parametric uncertainty that do not satisfy the linear-in-the-parameters assumption required in most adaptive control methods; systems with deadzones or discontinuities; and systems with backlash. Typically, NN-based controllers yield global uniformly ultimately bounded (UUB) stability results (e.g., see [] [4] for examples and reviews of literature) due to residual functional reconstruction inaccuracies and an inability to compensate for some system disturbances. Motivated by the desire to eliminate the residual steady-state errors, several researchers have obtained asymptotic tracking results by combining the NN feedforward element with discontinuous feedback methods such as variable structure controllers (VSC) (e.g., [5] and [6]) or sliding mode (SM) controllers (e.g., [6] and [7]). A clever VSC-like controller was also proposed in [8], where the controller is not initially discontinuous, but exponentially becomes discontinuous as an exogenous control element exponentially vanishes. Well known limitations of VSC and SM controllers include a requirement for infinite control bandwidth and chattering. Unfortunately, ad hoc fixes for these effects result in a loss of asymptotic stability (i.e., Manuscript received March 3, 008; revised July 3, 008. Current version published October 8, 008. This work was supported in part by the National Science Foundation under CAREER award 0547448, by the AFOSR under contract F4960-03--070, by BARD under research grant US-375-05, by the United States Israel Binational Agricultural Research and Development Fund, and by the Department of Energy under Grant DE-FG04-86NE37967 as part of the DOE University Research Program in Robotics (URPR). Recommended by Associate Editor C.-Y. Su. The authors are with the Department of Mechanical and Aerospace Engineering, University of Florida, MAE-A, 3608 Gainesville, FL 36-650 USA (e-mail: parag.patre@gmail.com; mackunis@gmail.com; michael.kaiser@eglin.af.mil; wdixon@ufl.edu). Digital Object Identifier 0.09/TAC.008.93000 008-986/$5.00 008 IEEE

IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 53, NO. 9, OCTOBER 008 8 UUB typically results). Motivated by issues associated with discontinuous controllers and the typical UUB stability result, an innovative continuous NN-based controller was recently developed in [9] to achieve partial asymptotic stability for a particular class of systems. The contribution in this paper is motivated by the question: Can a NN feedforward controller be modified by a continuous feedback element to achieve an asymptotic tracking result for a general class of systems? Despite the pervasive development of NN controllers in literature and the widespread use of NNs in industrial applications, the answer to this fundamental question has remained an open problem. To provide an answer to the fundamental motivating question, the result in this paper focuses on augmenting a multilayer NN-based feedforward method with a recently developed [0] high gain control strategy coined the robust integral of the sign of the error (RISE) in [], []. The RISE control structure is advantageous because it is a differentiable control method that can compensate for additive system disturbances and parametric uncertainties under the assumption that the disturbances are C with bounded time derivatives. Due to the advantages of the RISE control structure, a flurry of results have recently been developed (e.g., [3] [7]). A RISE feedback controller can be directly applied to yield asymptotic stability for the class of systems described in this paper. However, the RISE method is a high-gain feedback tool, and hence, clear motivation exists (as with any other feedback controller) to combine a feedforward control element with the feedback controller for potential gains such as improved transient and steady-state performance, and reduced control effort. That is, it is well accepted that a feedforward component can be used to cancel out some dynamic effects without relying on high-gain feedback. Given this motivation, some results have already been developed that combine the RISE feedback element with feedforward terms. In [8], a remark is provided regarding the use of a constant best-guess feedforward component in conjunction with the RISE method to yield a UUB result. In [] and [], the RISE feedback controller was combined with a standard gradient feedforward term for systems that satisfy the linear-in-the-parameters assumption. The experimental results in [] illustrate significant improvement in the root-mean-squared tracking error with reduced root-mean-squared control effort. However, for systems that do not satisfy the linear-in-theparameters assumption, motivation exists to combine the RISE controller with a new feedforward method such as the NN. To blend the NN and RISE methods, several technical challenges must be addressed. One (lesser) challenge is that the NN must be constructed in terms of the desired trajectory instead of the actual trajectory (i.e., a DCAL-based NN structure [8]) to remove the dependence on acceleration. The development of a DCAL-based NN structure is challenging for a multilayer NN because the adaptation law for the weights is required to be state-dependent. Straightforward application of the RISE method would yield an acceleration dependent adaptation law. One method to resolve this issue is to use a dirty derivative (as in the UUB result in [9]; see also [7]). In lieu of a dirty derivative, the result in this paper uses a Lyapunov-based stability analysis approach for the design of an adaptation law that is only velocity dependent. In comparison with the efforts in [], [], a more significant challenge arises from the fact that since a multilayer NN includes the first layer weight estimate inside of a nonlinear activation function, the previous methods (e.g., [] and []) cannot be applied. That is, because of the unique manner in which the NN weight estimates appear, the stability analysis and sufficient conditions developed in previous works are violated. Previous RISE methods have a restriction (encapsulated by a sufficient gain condition) that terms in the stability analysis that are upper bounded by a constant must also have time derivatives that are upper bounded by a constant (these terms are usually denoted by Nd (t) in RISE control literature; see [8]). The norm of the NN weight estimates can be bounded by a constant (due to a projection algorithm) but the time derivative is state-dependent (i.e., the norm of Nd (t) can be bounded by a constant but the norm of _ Nd (t) is state dependent). To address this issue, modified RISE stability analysis techniques are developed that result in modified (but not more restrictive) sufficient gain conditions. By addressing this issue through stability analysis methods, the standard NN weight adaptation law does not need to be modified. Through unique modifications to the stability analysis that enable the RISE feedback controller to be combined with the NN feedforward term, the result in this paper provides an affirmative answer for the first time to the aforementioned motivating question. Since the NN and the RISE control structures are model independent (black box) methods, the resulting controller is a universal reusable controller [8] for continuous systems. Because of the manner in which the RISE technique is blended with the NN-based feedforward method, the structure of the NN is not altered from textbook examples [4] and can be considered a somewhat modular element in the control structure. Hence, the NN weights and thresholds are automatically adjusted on-line, with no off-line learning phase required. Compared to standard adaptive controllers, the current asymptotic result does not require linearity in the parameters or the development and evaluation of a regression matrix. For systems with linear-in-the-parameters uncertainty, an adaptive feedforward controller has the desirable characteristics that the controller is continuous, can be proven to yield global asymptotic tracking, and includes the specific dynamics of the system in the feedforward path. Continuous feedback NN controllers don t include the specific dynamics in a regression matrix and have a degraded steady-state stability result (i.e., UUB tracking); however, they can be applied when the uncertainty in the system is unmodeled, cannot be linearly parameterized, or the development and implementation of a regression matrix is impractical. Sliding mode feedback NN controllers have the advantage that they can achieve global asymptotic tracking at the expense of implementing a discontinuous feedback controller (i.e., infinite bandwidth, exciting structural modes, etc.). In comparison to these controllers, the development in this paper has the advantage of asymptotic tracking with a continuous feedback controller for a general class of uncertainty; however, these advantages are at the expense of semi-global tracking instead of the typical global tracking results. II. DYNAMIC MODEL AND PROPERTIES The class of nonlinear dynamic systems considered in this paper is assumed to be modeled by the following Euler-Lagrange formulation that describes the behavior of a large class of engineering systems (e.g., robot manipulators, satellites, vehicular systems): M (q)q + Vm(q; _q)_q + G(q) +F (_q) +d (t) = (t): () In (), M (q) nn denotes the inertia matrix, Vm(q; _q) nn denotes the centripetal-coriolis matrix, G(q) n denotes the gravity vector, F (_q) n denotes friction, d (t) n denotes a general nonlinear disturbance (e.g., unmodeled effects), (t) n represents the torque input control vector, and q(t), _q(t), q(t) n denote the link position, velocity, and acceleration vectors, respectively. The subsequent development is based on the assumption that q(t) and _q(t) are measurable and that M (q), Vm(q; _q), G(q), F (_q) and d (t) are unknown. Moreover, the following properties and assumptions will be exploited in the subsequent development. Property : The inertia matrix M (q) is symmetric, positive definite, and satisfies the following inequality 8 y(t) n : m kyk y T M (q)y m(q) kyk ()

8 IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 53, NO. 9, OCTOBER 008 where m is a known positive constant, m(q) is a known positive function, and kk denotes the standard Euclidean norm. Property : If q(t); _q(t) L, then V m (q; _q), F (_q) and G(q) are bounded. Moreover, if q(t); _q(t) L, then the first and second partial derivatives of the elements of M (q), V m (q; _q), G(q) with respect to q (t) exist and are bounded, and the first and second partial derivatives of the elements of V m(q; _q), F (_q) with respect to _q(t) exist and are bounded. Property 3: The nonlinear disturbance term and its first two time derivatives are bounded, i.e., d (t) ; _ d (t) ; d (t) L. III. CONTROL OBJECTIVE The control objective is to ensure that the system tracks a desired time-varying trajectory, denoted by q d (t) n, despite uncertainties in the dynamic model. To quantify this objective, a position tracking error, denoted by e (t) n,isdefined as e q d 0 q: (3) The subsequent development is based on the assumption that the desired trajectory designed such that q (i) d (t) n (i =0; ;...; 4) exist and are bounded. To facilitate the subsequent analysis, filtered tracking errors, denoted by e (t);r(t) n, are also defined as e _e + e (4) r _e + e (5) where ; denote positive constants. The filtered tracking error r(t) is not measurable since the expression in (5) depends on q(t). IV. FEEDFORWARD NN ESTIMATION NN-based estimation methods are well suited for control systems where the dynamic model contains unstructured nonlinear disturbances as in (). The main feature that empowers NN-based controllers is the universal approximation property. Let be a compact simply connected set of +. With map f :! n n N,define ( ) as the space where f is continuous. There exist weights and thresholds such that some function f (x) n ( ) can be represented by a three-layer NN as [3], [4] f (x) =W T V T x + " (x) (6) for some given input x(t) N +. In (6), V (N +)N and W (N +)n are bounded constant ideal weight matrices for the first-to-second and second-to-third layers respectively, where N is the number of neurons in the input layer, N is the number of neurons in the hidden layer, and n is the number of neurons in the third layer. The activation function in (6) is denoted by () N +, and " (x) n is the functional reconstruction error. Note that, augmenting the input vector x(t) and activation function () by allows us to have thresholds as the first columns of the weight matrices [3], [4]. Thus, any tuning of W and V then includes tuning of thresholds as well. Remark : If " = 0, then f (x) is in the functional range of the NN. In general for any positive constant real number " N > 0, f (x) is within " N of the NN range if there exist finite hidden neurons N, and constant weights so that for all inputs in the compact set, the approximation holds with k"k < " N. For various activation functions, results such as the Stone-Weierstrass theorem indicate that any suffi- A variety of activation functions (e.g., sigmoid, hyperbolic tangent or radial basis) could be used for the control development in this paper. ciently smooth function can be approximated by a suitable large network. Therefore, the fact that the approximation error " is bounded follows from the Universal Approximation Property of the NNs []. Based on (6), the typical three-layer NN approximation for f (x) is given as [3], [4] ^f (x) ^W T ( ^V T x) (7) where ^V (t) (N +)N and ^W (t) (N +)n are subsequently designed estimates of the ideal weight matrices. The estimate mismatch for the ideal weight matrices, denoted by ~ V (t) (N +)N and ~W (t) (N +)n, are defined as ~V V 0 ^V ; ~ W W 0 ^W and the mismatch for the hidden-layer output error for a given x(t), denoted by ~(x) N +,isdefined as ~ 0 ^ = (V T x) 0 ( ^V T x): (8) Property 4: (Boundedness of the Ideal Weights) The ideal weights are assumed to exist and be bounded by known positive values so that kv k F = tr(v T V ) V B (9) kw k F = tr(w T W ) W B (0) where kk F is the Frobenius norm of a matrix, and tr () is the trace of a matrix. V. RISE FEEDBACK CONTROL DEVELOPMENT The contribution of this paper is the control development and stability analysis that illustrates how the aforementioned textbook (e.g., [4]) NN feedforward estimation strategy can be fused with a RISE feedback control method as a means to achieve an asymptotic stability result for general Euler-Lagrange systems described by (). In this section, the open-loop and closed-loop tracking error is developed for the combined control system. A. Open-Loop Error System The open-loop tracking error system can be developed by premultiplying (5) by M (q) and utilizing the expressions in (), (3), and (4) to obtain the following expression: M (q)r = f d + S + d 0 () where the auxiliary function f d (q d ; _q d ; q d ) f d M (q d )q d + V m (q d ; _q d )_q d + G(q d )+F (_q d ) () and the auxiliary function S (q; _q; q d ; _q d ; q d ) S M (q)( _e + e )+M (q) q d 0 M (q d )q d + V m (q; _q)_q 0 V m (q d ; _q d )_q d + G(q) 0 G(q d )+F (_q) 0 F (_q d ) : (3) The expression in () can be represented by a three-layer NN as f d = W T (V T x d )+" (x d ) : (4) In (4), the input x d (t) 3n+ is defined as x d (t) [ qd T (t) _q d T (t) q d T (t)] T so that N = 3n where N was

IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 53, NO. 9, OCTOBER 008 83 introduced in (6). Based on the assumption that the desired trajectory is bounded, the following inequalities hold: k" (x d )k " b ; k _" (x d ; _x d )k" b k" (x d ; _x d ; x d )k " b (5) where " b ;" b ;" b are known positive constants. B. Closed-Loop Error System Based on the open-loop error system in (), the control torque input is composed of a three-layer NN feedforward term plus the RISE feedback terms as ^f d + : (6) Specifically, the RISE feedback control term (t) [3] (t) (k s +)e (t) 0 (k s +)e (0) + 0 t [(k s +) e () + sgn(e ())]d (7) where k s; are positive constant control gains. The feedforward NN component in (6), denoted by ^f d (t) n, is generated as ^f d ^W T ( ^V T x d ): (8) The estimates for the NN weights in (8) are generated on-line (there is no off-line learning phase) as _^W proj 0 ^ 0 ^V T _x d e T (9) _^V proj 0 _x d ^ 0T ^We T (0) where 0 (N +)(N +), 0 (N +)(N +) are constant, positive definite, symmetric control gain matrices. The closed-loop tracking error system can be developed by substituting (6) into () as M (q)r = f d 0 ^f d + S + d 0 : () To facilitate the subsequent stability analysis, the time derivative of () is determined as M (q)_r = 0 M _ (q)r + f _ d 0 _^f d + S _ +_ d 0 _: () Remark : Taking the time derivative of the closed-loop error system is typical of the RISE stability analysis. In our case, the time differentiation also facilitates the design of NN weight adaptation laws instead of using the typical (as in [3] and [4]) Taylor series approximation method to obtain a linear form for the estimation error ~V. Using (4) and (8), the closed-loop error system in () can be expressed as M (q)_r = 0 M _ (q)r + W T 0 V T x d V T _x d 0 ^W _ T ( ^V T x d ) 0 ^W T 0 ( ^V T x d ) ^V _ T x d 0 ^W T 0 ( ^V T x d ) ^V T _x d + _" + _ S +_ d 0 _ (3) The use of the smooth projection algorithm in (9) and (0) is to ensure that ^W (t) and ^V (t) remain bounded. This fact will be exploited in the subsequent stability analysis. where 0 ( ^V T x) d V T x =d V T x j V x= ^V x. After adding and subtracting the terms W T ^ 0 ^V T _x d + ^W T ^ 0 V ~ T _x d to (3), the following expression can be obtained: M (q)_r = 0 M _ (q)r + ^W T ^ 0 V ~ T _x d + W ~ T ^ 0 ^V T _x d + W T 0 V T _x d 0 W T ^ 0 ^V T _x d 0 ^W T ^ 0 V ~ T _x d + S _ 0 ^W _ T ^ 0 ^W T ^ 0 ^V _ T x d +_ d + _" 0 _ (4) where the notation ^ is introduced in (8). Using the NN weight tuning laws in (9) and (0); the expression in (4) can be rewritten as M (q)_r = 0 M _ (q)r + N ~ + N 0 e 0 (k s +)r 0 sgn(e ) (5) where the fact that the time derivative of (7) is given as _(t) =(k s +)r + sgn(e ) (6) was utilized, and where the unmeasurable auxiliary terms ~N (e ;e ;r;t);n( ^W; ^V;x d ; _x d ;t) n are defined as ~N (t) 0 and In (8), N d (x d ; _x d ;t) _M (q)r 0 proj(0 ^ 0 ^V T _x d e T ) T ^ while N B ( ^W; ^V;x d ; _x d ;t) 0 ^W T ^ 0 proj(0 _x d (^ 0T ^We ) T ) T x d + _ S + e (7) N N d + N B: (8) N d W T 0 V T _x d + _" +_ d (9) n is further segregated as N B N B + N B (30) where N B ( ^W; ^V;x d ; _x d ;t) N B 0W T ^ 0 ^V T _x d 0 ^W T ^ 0 ~ V T _x d (3) and the term N B ( ^W; ^V;x d ; _x d ;t) N B ^W T ^ 0 ~ V T _x d + ~ W T ^ 0 ^V T _x d : (3) Motivation for segregating the terms in (8) is derived from the fact that the different components in (8) have different bounds. Segregating the terms as in (8) (3) facilitates the development of the NN weight update laws and the subsequent stability analysis. For example, the terms in (9) are grouped together because the terms and their time derivatives can be upper bounded by a constant and rejected by the RISE feedback, whereas the terms grouped in (30) can be upper bounded by a constant but their derivatives are state dependent. The state dependency of the time derivatives of the terms in (30) violates the assumptions given in previous RISE-based controllers (e.g., [] [7]), and requires additional consideration in the adaptation law design and stability analysis. The terms in (30) are further segregated because N B ( ^W; ^V;x d ) will be rejected by the RISE feedback, whereas N B ( ^W; ^V;x d ) will be partially rejected by the RISE feedback and partially canceled by the adaptive update law for the NN weight estimates. In a similar manner as in [3], the Mean Value Theorem can be used to develop the following upper bound: ~N (t) (kzk) kzk (33)

84 IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 53, NO. 9, OCTOBER 008 where z(t) 3 z(t) [ e T e T r T ] T (34) and the bounding function (kzk) is a positive globally invertible nondecreasing function. The following inequalities can be developed based on Property 3, (9), (0), (5), (30) (3): kn d k ; kn B k ; _N d 3 : (35) By using (9) and (0), the time derivative of N B ( ^W; ^V ;x d ) can be bounded as _N B 4 + 5 ke k : (36) In (35) and (36), i, (i =; ;...; 5) are known positive constants. VI. STABILITY ANALYSIS Theorem: The composite NN and RISE controller given in (6) (0) ensures that all system signals are bounded under closed-loop operation and that the position tracking error is regulated in the sense that ke (t)k!0ast! provided the control gain k s introduced in (7) is selected sufficiently large (see the subsequent proof), and and are selected according to the following sufficient conditions: > + + 3 + 4; > 5 (37) where i, i =; ;...; 5 are introduced in (35) (36) and is introduced in (40). Proof: Let D 3n+ be a domain containing y(t) =0, where y(t) 3n+ is defined as y(t) z T T (t) P (t) Q(t) : (38) Hence, (4) can be used to conclude that P (t) 0. The auxiliary function Q(t) in (38) is defined as Q(t) tr W ~ T 0 0 ~W + tr V ~ T 0 0 ~V : (43) Since 0 and 0 are constant, symmetric, and positive definite matrices and > 0, it is straightforward that Q(t) 0. Let V L(y; t) :D[0; )! be a continuously differentiable positive definite function defined as V L(y; t) e T e + et e + rt M (q)r + P + Q (44) which satisfies the following inequalities: U (y) V L(y; t) U (y) (45) provided the sufficient conditions introduced in (37) are satisfied. In (45), the continuous positive definite functions U (y);u (y) are defined as where ; (q) U (y) kyk ; U (y) (q) kyk (46) are defined as min f;m g ; (q) max m(q); where m, m(q) are introduced in (). After utilizing (4), (5), (5), and (6), the time derivative of (44) can be expressed as _V L(y; t) =0 ke k +e T e + r T N ~ (t) 0 (k s +)krk 0 ke k + ke k + e T + tr ~ W T 0 0 ^W T ^ 0 ~ V T _x d + ~ W T ^ 0 ^V T _x d _~W + tr ~ V T 0 0 _~V : (47) In (38), the auxiliary function P (t) is defined as Based on the fact that P (t) n i= je i(0)j0e (0) T N (0) 0 0 t L( )d (39) where the subscript i = ; ;...;n denotes the ith element of the vector, and the auxiliary function L(t) is defined as L(t) r T (N B (t)+n d (t)0 sgn(e ))+ _e (t) T N B (t)0 ke (t)k (40) where is a positive constant chosen according to the second sufficient condition in (37). The derivative P _ (t) can be expressed as P _ (t) =0L(t) =0r T (N B (t)+n d (t) 0 sgn(e )) 0 _e (t) T N B (t)+ ke (t)k : (4) Provided the sufficient conditions introduced in (37) are satisfied, the following inequality can be obtained 0 t L( )d je i (0)j0e (0) T N (0): (4) e T e kek + kek and using (9) and (0), the expression in (47) can be simplified as _V L (y; t) r T ~ N (t) 0 (k s +)krk 0 ( 0 ) ke k 0 ( 0 0 ) ke k : (48) By using (33), the expression in (48) can be further bounded as _V L (y; t) 0 3 kzk 0 k s krk 0 (kzk) krkkzk (49) where 3 minf 0 ; 0 0 ; g; hence, 3 is positive if, are chosen according to the following sufficient conditions: > ; > +: After completing the squares for the second and third term in (49), the following expression can be obtained: _V L (y; t) 0 3 kzk + (z) kzk 4k s 0U (y) (50)

IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 53, NO. 9, OCTOBER 008 85 where U (y) = c kzk, for some positive constant c, is a continuous positive semi-definite function that is defined on the following domain: D y 3n+ jkyk 0 p 3 k s : The inequalities in (45) and (50) can be used to show that V L (y; t) L in D; hence, e (t), e (t), r(t), P (t), and Q(t) L in D. Given that e (t), e (t), and r(t) L in D, standard linear analysis methods can be used to prove that _e (t); _e (t) L in D from (4) and (5). Since e (t);e (t);r(t) L in D, the assumption that q d (t), _q d (t), q d (t) exist and are bounded can be used along with (3) (5) to conclude that q(t); _q(t); q(t) L in D. Since q(t); _q(t) L in D, Property can be used to conclude that M (q), V m(q; _q), G(q), and F (_q) L in D. Therefore, from () and Property 3, we can show that (t) L in D. Given that r(t) L in D, (6) can be used to show that _(t) L in D. Since _q(t); q(t) L in D, Property can be used to show that V _ m (q; _q), _G(q), F _ (q) and M _ (q) L in D; hence, () can be used to show that _r(t) Lin D. Since _e (t); _e (t); _r(t) L in D, the definitions for U (y) and z(t) can be used to prove that U (y) is uniformly continuous in D. Let SDdenote a set defined as follows: 3 p S y(t) DjU (y(t)) < 0 3k s [0, Theor. 8.4] can now be invoked to state that : (5) c kz(t)k! 0ast!8y(0) S: (5) Based on the definition of z(t), (5) can be used to show that ke (t)k!0ast!8y(0) S: (53) REFERENCES [] K. Hornik, M. Stinchcombe, and H. White, Multilayer feedforward networks are universal approximators, Neural Netw., vol., pp. 359 366, 985. [] F. Lewis, K. Liu, and A. Yesildirek, Neural net robot controller with guaranteed tracking performance, IEEE Trans. Neural Netw., vol. 6, no. 3, pp. 703 75, May 995. [3] F. L. Lewis, Nonlinear network structures for feedback control, Asian J. Control, vol., no. 4, pp. 05 8, 999. [4] F. Lewis, J. Campos, and R. Selmic, Neuro-Fuzzy Control of Industrial Systems With Actuator Nonlinearities. Philadelphia, PA: SIAM, 00. [5] S. S. Ge, C. C. Hang, and L. C. Woon, Adaptive neural network control of robot manipulators in task space, IEEE Trans. Ind. Electron., vol. 44, no. 6, pp. 746 75, Dec. 997. [6] D. Y. Meddah, A. Benallegue, and A. R. Cherif, A neural network robust controller for a class of nonlinear MIMO systems, in Proc. IEEE Int. Conf. Robotics and Automation, Albuquerque, NM, 997, pp. 645 650. [7] M. B. Cheng and C. C. Tsai, Hybrid robust tracking control for a mobile manipulator via sliding-mode neural network, in Proc. IEEE Int. Conf. Mechatronics, Taipei, Taiwan, 005, pp. 537 54. [8] C. M. Kwan, D. M. Dawson, and F. L. Lewis, Robust adaptive control of robots using neural network: Global tracking stability, in Proc. IEEE Conf. Decision and Control, New Orleans, LA, 995, pp. 846 85. [9] T. Hayakawa, W. Haddad, and N. Hovakimyan, Neural network adaptive control for a class of nonlinear uncertain dynamical systems with asymptotic stability guarantees, IEEE Trans. Neural Netw., vol. 9, no., pp. 80 89, Jan. 008. [0] Z. Qu and J. Xu, Model-based learning controls and their comparisons using Lyapunov direct method, Asian J. Control, vol. 4, no., pp. 99 0, Mar. 00. [] P. M. Patre, W. MacKunis, C. Makkar, and W. E. Dixon, Asymptotic tracking for systems with structured and unstructured uncertainties, IEEE Trans. Control. Syst. Technol., vol. 6, no., pp. 373 379, Mar. 008. [] P. M. Patre, W. MacKunis, C. Makkar, and W. E. Dixon, Asymptotic tracking for systems with structured and unstructured uncertainties, in Proc. IEEE Conf. Decision and Control, San Diego, CA, 006, pp. 44 446. [3] B. Xian, D. M. Dawson, M. S. de Queiroz, and J. Chen, A continuous asymptotic tracking control strategy for uncertain nonlinear systems, IEEE Trans. Automat. Control, vol. 49, no. 7, pp. 06, Jul. 004. [4] W. E. Dixon, Y. Fang, D. M. Dawson, and T. J. Flynn, Range identification for perspective vision systems, IEEE Trans. Automat. Control, vol. 48, no., pp. 3 38, Dec. 003. [5] C. Makkar, G. Hu, W. G. Sawyer, and W. E. Dixon, Lyapunov-based tracking control in the presence of uncertain nonlinear parameterizable friction, IEEE Trans. Automat. Control, vol. 5, no. 0, pp. 988 994, Oct. 007. [6] Z. Cai, M. S. de Queiroz, and D. M. Dawson, Robust adaptive asymptotic tracking of nonlinear systems with additive disturbance, IEEE Trans. Automat. Control, vol. 5, no. 3, pp. 54 59, Mar. 006. [7] X. Zhang, A. Behal, D. M. Dawson, and B. Xian, Output feedback control for a class of uncertain MIMO nonlinear systems with non-symmetric input gain matrix, in Proc. IEEE Conf. Decision and Control, and the European Control Conf., Seville, Spain, 005, pp. 776 7767. [8] B. Xian, M. S. de Queiroz, and D. M. Dawson, A Continuous Control Mechanism for Uncertain Nonlinear Systems in Optimal Control, Stabilization, and Nonsmooth Analysis. Heidelberg, Germany: Springer- Verlag, 004. [9] X. Zhang, D. M. Dawson, A. Behal, and B. Xian, A Modular Controller for a Class of Uncertain MIMO Nonlinear Systems With Non- Symmetric Input Gain Matrix, College of Eng. and Sci., Control and Robotics (CRB), Clemson University, Clemson, SC, 004, Tech. Rep. CU/CRB/9/0/04/. [0] H. K. Khalil, Nonlinear Systems, 3rd ed. Upper Saddle River, NJ: Prentice-Hall, 00. 3 The region of attraction in (5) can be made arbitrarily large to include any initial conditions by increasing the control gain k (i.e., a semi-global type of stability result) [3].