Optimal Control of Uncertain Nonlinear Systems

Similar documents
THE control of systems with uncertain nonlinear dynamics

Optimal Design of CMAC Neural-Network Controller for Robot Manipulators

Adaptive Robust Tracking Control of Robot Manipulators in the Task-space under Uncertainties

ADAPTIVE FORCE AND MOTION CONTROL OF ROBOT MANIPULATORS IN CONSTRAINED MOTION WITH DISTURBANCES

Observer Based Output Feedback Tracking Control of Robot Manipulators

Lyapunov Optimizing Sliding Mode Control for Robot Manipulators

OVER THE past 20 years, the control of mobile robots has

Adaptive and Robust Controls of Uncertain Systems With Nonlinear Parameterization

AROTORCRAFT-BASED unmanned aerial vehicle

WE PROPOSE a new approach to robust control of robot

A Sliding Mode Controller Using Neural Networks for Robot Manipulator

PERIODIC signals are commonly experienced in industrial

Robust Control of Robot Manipulator by Model Based Disturbance Attenuation

Asymptotic Tracking for Uncertain Dynamic Systems Via a Multilayer Neural Network Feedforward and RISE Feedback Control Structure

LYAPUNOV theory plays a major role in stability analysis.

A NONLINEAR TRANSFORMATION APPROACH TO GLOBAL ADAPTIVE OUTPUT FEEDBACK CONTROL OF 3RD-ORDER UNCERTAIN NONLINEAR SYSTEMS

LYAPUNOV-BASED ROBUST AND ADAPTIVE CONTROL OF NONLINEAR SYSTEMS USING A NOVEL FEEDBACK STRUCTURE

NEURAL NETWORKS (NNs) play an important role in

Real-time Motion Control of a Nonholonomic Mobile Robot with Unknown Dynamics

RBF Neural Network Adaptive Control for Space Robots without Speed Feedback Signal

OPTIMAL CONTROL AND ESTIMATION

Neural Network-Based Adaptive Control of Robotic Manipulator: Application to a Three Links Cylindrical Robot

SINCE the 1959 publication of Otto J. M. Smith s Smith

EML5311 Lyapunov Stability & Robust Control Design

Piecewise Linear Quadratic Optimal Control

Tracking Control of Robot Manipulators with Bounded Torque Inputs* W.E. Dixon, M.S. de Queiroz, F. Zhang and D.M. Dawson

Riccati difference equations to non linear extended Kalman filter constraints

Global robust output feedback tracking control of robot manipulators* W. E. Dixon, E. Zergeroglu and D. M. Dawson

Adaptive Robust Control for Servo Mechanisms With Partially Unknown States via Dynamic Surface Control Approach

Decentralized Control of Nonlinear Multi-Agent Systems Using Single Network Adaptive Critics

Adaptive set point control of robotic manipulators with amplitude limited control inputs* E. Zergeroglu, W. Dixon, A. Behal and D.

Dynamic backstepping control for pure-feedback nonlinear systems

Nonlinear Tracking Control of Underactuated Surface Vessel

An Adaptive LQG Combined With the MRAS Based LFFC for Motion Control Systems

IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 50, NO. 5, MAY Bo Yang, Student Member, IEEE, and Wei Lin, Senior Member, IEEE (1.

Disturbance Attenuation for a Class of Nonlinear Systems by Output Feedback

Output Regulation of Uncertain Nonlinear Systems with Nonlinear Exosystems

Hover Control for Helicopter Using Neural Network-Based Model Reference Adaptive Controller

Pierre Bigot 2 and Luiz C. G. de Souza 3

Adaptive Predictive Observer Design for Class of Uncertain Nonlinear Systems with Bounded Disturbance

H State-Feedback Controller Design for Discrete-Time Fuzzy Systems Using Fuzzy Weighting-Dependent Lyapunov Functions

Lyapunov Stability of Linear Predictor Feedback for Distributed Input Delays

Analytic Nonlinear Inverse-Optimal Control for Euler Lagrange System

A Normal Form for Energy Shaping: Application to the Furuta Pendulum

An Optimal Tracking Approach to Formation Control of Nonlinear Multi-Agent Systems

IN recent years, controller design for systems having complex

Backstepping Control of Linear Time-Varying Systems With Known and Unknown Parameters

Control of industrial robots. Centralized control

I. MAIN NOTATION LIST

Contraction Based Adaptive Control of a Class of Nonlinear Systems

Neural Network Control of Robot Manipulators and Nonlinear Systems

458 IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, VOL. 16, NO. 3, MAY 2008

MANY adaptive control methods rely on parameter estimation

A Recurrent Neural Network for Solving Sylvester Equation With Time-Varying Coefficients

A Sliding Mode Control based on Nonlinear Disturbance Observer for the Mobile Manipulator

Robust Stabilization of Jet Engine Compressor in the Presence of Noise and Unmeasured States

Robust Adaptive Attitude Control of a Spacecraft

A Backstepping control strategy for constrained tendon driven robotic finger

Combined NN/RISE-based Asymptotic Tracking Control of a 3 DOF Robot Manipulator

Robust Output Feedback Stabilization of a Class of Nonminimum Phase Nonlinear Systems

Adaptive Control of a Class of Nonlinear Systems with Nonlinearly Parameterized Fuzzy Approximators

Design and Stability Analysis of Single-Input Fuzzy Logic Controller

A Receding Horizon Generalization of Pointwise Min-Norm Controllers

Robust Stabilization of Non-Minimum Phase Nonlinear Systems Using Extended High Gain Observers

Computing Optimized Nonlinear Sliding Surfaces

Asignificant problem that arises in adaptive control of

Robot Manipulator Control. Hesheng Wang Dept. of Automation

Attitude Regulation About a Fixed Rotation Axis

Stability of Hybrid Control Systems Based on Time-State Control Forms

The Rationale for Second Level Adaptation

Switched Systems: Mixing Logic with Differential Equations

A Discrete Robust Adaptive Iterative Learning Control for a Class of Nonlinear Systems with Unknown Control Direction

An Adaptive Full-State Feedback Controller for Bilateral Telerobotic Systems

Nonlinear PD Controllers with Gravity Compensation for Robot Manipulators

LYAPUNOV-BASED CONTROL OF SATURATED AND TIME-DELAYED NONLINEAR SYSTEMS

Introduction to centralized control

Force/Position Regulation for Robot Manipulators with. Unmeasurable Velocities and Uncertain Gravity. Antonio Loria and Romeo Ortega

Exponential Controller for Robot Manipulators

538 IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS PART B: CYBERNETICS, VOL. 32, NO. 4, AUGUST 2002

UDE-based Dynamic Surface Control for Strict-feedback Systems with Mismatched Disturbances

HIGHER ORDER SLIDING MODES AND ARBITRARY-ORDER EXACT ROBUST DIFFERENTIATION

A composite adaptive output feedback tracking controller for robotic manipulators* E. Zergeroglu, W. Dixon, D. Haste, and D.

Stable Limit Cycle Generation for Underactuated Mechanical Systems, Application: Inertia Wheel Inverted Pendulum

Adaptive NN Control of Dynamic Systems with Unknown Dynamic Friction

Introduction to centralized control

New Parametric Affine Modeling and Control for Skid-to-Turn Missiles

Indirect Model Reference Adaptive Control System Based on Dynamic Certainty Equivalence Principle and Recursive Identifier Scheme

Controlo Switched Systems: Mixing Logic with Differential Equations. João P. Hespanha. University of California at Santa Barbara.

Video 8.1 Vijay Kumar. Property of University of Pennsylvania, Vijay Kumar

Computational Geometric Uncertainty Propagation for Hamiltonian Systems on a Lie Group

IMECE NEW APPROACH OF TRACKING CONTROL FOR A CLASS OF NON-MINIMUM PHASE LINEAR SYSTEMS

DISTURBANCE OBSERVER BASED CONTROL: CONCEPTS, METHODS AND CHALLENGES

A Control Lyapunov Function Approach to Multiagent Coordination

A Novel Integral-Based Event Triggering Control for Linear Time-Invariant Systems

CHATTERING REDUCTION OF SLIDING MODE CONTROL BY LOW-PASS FILTERING THE CONTROL SIGNAL

IN THIS paper, we will consider the analysis and synthesis

IN THIS paper, we study the problem of asymptotic stabilization

Filter Design for Linear Time Delay Systems

NMT EE 589 & UNM ME 482/582 ROBOT ENGINEERING. Dr. Stephen Bruder NMT EE 589 & UNM ME 482/582

Takagi Sugeno Fuzzy Sliding Mode Controller Design for a Class of Nonlinear System

Transcription:

Proceedings of the 47th IEEE Conference on Decision and Control Cancun, Mexico, Dec. 9-11, 28 Optimal Control of Uncertain Nonlinear Systems using RISE Feedback K.Dupree,P.M.Patre,Z.D.Wilcox,andW.E.Dixon email:{kdupree, pmpatre, zibrus, wdixon}@ufl.edu Department of Mechanical and Aerospace Engineering University of Florida, Gainesville, FL 32611 WeA9.3 Abstract A Hamilton-Jacobi-Bellman optimization scheme is used along with a RISE feedback structure to minimize a quadratic performance index while the generalized coordinates of a nonlinear Euler-Lagrange system asymptotically track a desired time-varying trajectory despite general uncertainty in the dynamics, such as additive bounded disturbances and parametric uncertainty. Motivated by recent previous results that use a neural network structure to approximate the dynamics (i.e., the state space model is approximated with a residual function reconstruction error), the result in this paper uses the implicit learning capabilities of the RISE control structure to learn the dynamics asymptotically. Specifically, a Lyapunov stability analysis is performed to show that the RISE feedback term asymptotically identifies the unknown dynamics, yielding semiglobal asymptotic tracking. In addition, it is shown that the system converges to a state space system that has a quadratic performance index which has been optimized by an additional control element. Simulation results are included to demonstrate the performance of the developed controller. I. INTRODUCTION 1 Optimal control theory involves the design of controllers that can satisfy some tracking or regulation control objective while simultaneously minimizing some performance metric. Asufficient condition to solve an optimal control problem is to solve the Hamilton-Jacobi-Bellman(HJB) equation. For the special case of linear time-invariant systems, the solution to the HJB equation reduces to solving the algebraic Riccati equation. However, for general systems, the challenge is to find a value function that satisfies the HJB equation. Finding this value function has remained problematic because it requires the solution of a partial differential equation that can not be solved explicitly. One common technique in developing an optimal controller for a nonlinear system is to assume the nonlinear dynamics are exactly known, feedback linearize the system, and then apply optimal control techniques to the resulting system as in[1] [3], and others. For example, dynamic feedback linearization was used in [1] to develop a control Lyapunov function to obtain a class of optimal controllers. A review of the optimality 1 This research is supported in part by the NSF CAREER award CMS- 547448, AFOSR contract numbers F4962-3-1-381 and F4962-3-1-17, AFRL contract number FA4819-5-D-11, and by research grant No. US-3715-5 from BARD, the United States - Israel Binational Agricultural Research and Development Fund at the University of Florida. The authors would like to gratefully acknowledge the support of the Department of Energy, grant number DE-FG4-86NE37967. This work was accomplished as part of the DOE University Research Program in Robotics(URPR). of nonlinear design techniques and general results involving feedback linearization as well as Jacobian linearization and other nonlinear design techniques are provided in[4],[5]. A review of inverse optimal control is provided in[6], where the cost function is not a priori provided. Motivated by the desire to eliminate the requirement for exact knowledge of the dynamics, [7] developed one of the first results to illustrate the interaction of adaptive control with an optimal controller. Specifically, [7] first used exact feedback linearization to cancel the nonlinear dynamics and produce an optimal controller. Then, a self-optimizing adaptive controller was developed to yield global asymptotic tracking despite linear-in-the parameters uncertainty. The analysis in[7] indicated that if the parameter estimation error could somehow converge to zero, then the controller would converge to the optimal solution. Another method to compensate for system uncertainties is to employ neural networks(nn) to approximate the unknown dynamics. The universal approximation property states that a NN can identify a function up to some function reconstruction error. The use of NN versus feedback linearization allows for general uncertain systems to be examined. However this added robustness comes at the expense of reduced steady-state error (i.e., generally resulting in a uniformly ultimately bounded (UUB) result). NN controllers were developed in results such as[8] [13] to accommodate for the uncertainty in the system and to solve the HJB equation. Specifically the tracking errors are proven to be uniformly ultimately bounded (UUB) and the resulting state space system, for which the HJB optimal controller is developed, is only approximated. Our contribution arises from incorporating an optimal control elements with an implicit learning feedback control strategy developed in [14] with modifications in [15] that was later coined the Robust Integral of the Sign of the Error (RISE) method in [16], [17]. The RISE method is used to identify the system and reject disturbances, while achieving asymptotic tracking and the convergence of a control term to the optimal controller. Inspired by the previous work in [7] [13],[18],[19], a system in which all terms are assumed known(temporarily) is feedback linearized and a control law is developed based on the HJB optimization method for a given quadratic performance index. Under the assumption that parametric uncertainty and unknown bounded disturbances are present in the dynamics, the control law is modified to contain 978-1-4244-3124-3/8/$25. 28 IEEE 2154

47th IEEE CDC, Cancun, Mexico, Dec. 9-11, 28 WeA9.3 therisefeedbacktermwhichisusedtoidentifytheuncertainty. Specifically, a Lyapunov stability analysis is included to show that the RISE feedback term asymptotically identifies the unknown dynamics(yielding semi-global asymptotic tracking) provided upper bounds on the disturbances are known and the control gains are selected appropriately. As in previous literature the control law converges to the optimal law, however because our result is asymptotic rather than UUB the control law converges exactly to the optimal law II. DYNAMICMODELANDPROPERTIES The class of nonlinear dynamic systems considered in this paper is assumed to be modeled by the following Euler- Lagrange[2] formulation: () + ( ) +()+( )+ ()=() (1) In(1), ()R denotestheinertiamatrix, ( ) R denotes the centripetal-coriolis matrix, () R denotes the gravity vector, ( ) R denotes friction, () R denotes a general nonlinear disturbance (e.g., unmodeled effects), () R represents the input control vector, and (), (), () R denote the position, velocity, and acceleration vectors, respectively. The subsequent development is based on the assumption that () and () aremeasurableandthat(), ( )(), ( ) and () are unknown. Moreover, the following properties and assumptions will be exploited in the subsequent development. Property 1: The inertia matrix () is symmetric, positive definite,andsatisfiesthefollowinginequality()r : 1 kk 2 () ()kk 2 (2) where 1 R is a known positive constant, () R is a known positive function, and k k denotes the standard Euclidean norm. Property 2: The following skew-symmetric relationship is satisfied: ³ ()2 ( ) = R (3) Property 3: If () () L,then ( ), ( ) and () are bounded. Moreover, if () () L, then the first and secondpartial derivativesof the elements of (), ( ), () with respect to () exist and are bounded, and the first and second partial derivatives of the elements of ( ),( )withrespectto ()existandarebounded. Property 4: The desired trajectory is assumed to be designed such that (), (), ()... ()... () R exist, and are bounded. III. CONTROLOBJECTIVE The control objective is to ensure that the system tracks a desired time-varying trajectory, denoted by () R, despite uncertainties in the dynamic model, while minimizing a given performance index. To quantify the tracking objective, apositiontrackingerror,denotedby 1 ()R,isdefinedas 1, (4) To facilitate the subsequent analysis, filtered tracking errors, denotedby 2 (),()R,arealsodefinedas 2, 1 + 1 1 (5), 2 + 2 2 (6) where 1 R, denotes a subsequently defined positive definite, constant, gain matrix, and 2 R is a positive constant. The filtered tracking error () is not measurable sincetheexpressionin(6)dependson (). IV. OPTIMALCONTROLDESIGN In this section, a state-space model is developed based on the tracking errors in (4) and (5). Based on this model, a controller is developed that minimizes a quadratic performance index under the (temporary) assumption that the dynamics in (1), including the additive disturbance, are known. The development in this section motivates the control design in Section V, where a robust controller is developed to identify the unknown dynamics and additive disturbance. To develop a state-space model for the tracking errors in (4) and (5), the inertia matrix is premultiplied by the time derivative of(5), and substitutions are made from(1) and(4) to obtain 2 = 2 ++ (7) wherethenonlinearfunction(,,,, )R isdefined as, ( + 1 1 )+ ( + 1 1 ) (8) ++ Under the(temporary) assumption that the dynamics in(1) are known, the control input can be designed as,+ (9) where () R is an auxiliary control input that will be designed to minimize a subsequent performance index. By substituting(9)into(7)theclosed-looperrorsystemfor 2 () canbeobtainedas 2 = 2 + (1) A state-space model for (5) and (1) can now be developed as =( )+() (11) where ( ) R 2 2, () R 2,and() R 2 are defined as 1 ( ), 1 (), 1 (), 1 2 where and denote a identity matrix and matrix of zeros, respectively. The quadratic performance index 2155

47th IEEE CDC, Cancun, Mexico, Dec. 9-11, 28 WeA9.3 ()Rtobeminimizedsubjecttotheconstraintsin(11) is Z µ 1 (), 2 + 1 2 (12) In (12), R 2 2 and R are positive definite symmetric matrices to weight the influence of the states and (partial) control effort, respectively. Furthermore, the matrix canbebrokenintoblocksas: 11 = 12 12 22 As stated in [8], [9], the fact that the performance index is onlypenalizedfortheauxiliarycontrol()ispracticalsince the gravity, Coriolis, and friction compensation terms in (8) cannotbemodifiedbytheoptimaldesignphase. To facilitate the subsequent development, let () R 2 2 bedefinedas ()= (13) where R denotes a gain matrix. Based on the development in Theorem 1 of ( [8], [9]), if 1,,and, introduced in(5),(12), and(13), satisfy the following algebraic relationships = = 1 12 + 12 2 (14) 11 = 1+ 1 (15) 1 = 22 (16) then () satisfies the Riccati differential equation, and the valuefunction()r = 1 2 satisfiesthehjbequation.lemma1of([8],[9])canbeused toconcludethattheoptimalcontrol()thatminimizes(12) subjectto(11)is ()= 1 µ () = 1 2 (17) V. RISEFEEDBACKCONTROLDEVELOPMENT Ingeneral,theboundeddisturbance ()andthenonlinear dynamics given in (8) are unknown, so the controller given in(9)cannotbeimplemented.however,ifthecontrolinput contains some method to identify and cancel these effects, then ()willconvergetothestatespacemodelin(11)sothat() minimizes the respective performance index. As stated in the introduction, several results have explored this strategy using function approximation methods such as neural networks, where the tracking control errors converge to a neighborhood near the state space model yielding a type of approximate optimal controller. In this section, a control input is developed that exploits RISE feedback to identify the nonlinear effects and bounded disturbances to enable () to asymptotically converge to the state space model. To develop the control input, the error system in (6) is premultiplied by () and the expressions in (1), (4), and (5) are utilized to obtain = 2 ++ + 2 2 (18) Based on the open-loop error system in(18), the control input iscomposedoftheoptimalcontroldevelopedin(17),plusa subsequentlydesignedauxiliarycontrolterm()r as, (19) The closed-loop tracking error system can be developed by substituting(19) into(18) as = 2 ++ + 2 2 + (2) To facilitate the subsequent stability analysis the auxiliary function ( )R whichisdefinedas,( ) + ( ) +( )+( ) (21) isaddedandsubtractedto(2)toyield = 2 + + + ++ 2 2 (22) where (,,,, )R isdefinedas, (23) Thetimederivativeof(22)canbewrittenas = 1 2 + + 2 1 (24) after strategically grouping specific terms. In (24), the unmeasurable auxiliary terms ( 1 2 ), () R are defined as, 2 2 1 2 + (25) + 2 2 + 2 2 + 2 + 2 1 2, + Motivationforgroupingtermsinto ( 1 2 )and () comes from the subsequent stability analysis and the fact that themeanvaluetheorem,property3,andproperty4canbe usedtoupperboundtheauxiliarytermsas () (kk)kk (26) k k 1 2 (27) where()r 3 isdefinedas (),[ 1 2 ] (28) the bounding function (kk) R is a positive globally invertible nondecreasing function, and R (=12) denote known positive constants. Based on (24), the control term()isdesignedbasedontheriseframework(see[14] [16])as (), ( +1) 2 ()( +1) 2 () (29) + Z [( +1) 2 2 ()+( 2 ())] 2156

47th IEEE CDC, Cancun, Mexico, Dec. 9-11, 28 WeA9.3 where R are positive constant control gains. The closed loop error systems for () can now be obtained by substituting the time derivative of(29) into(24) as = 1 2 + + 2 1 (3) ( +1)( 2 ) VI. STABILITYANALYSIS Theorem 1: The controller given in(17) and(19) ensures that all system signals are bounded under closed-loop operation, and the tracking errors are regulated in the sense that k 1 ()kk 2 ()kk()k (31) Theboundednessoftheclosedloopsignalsandtheresultin (31)canbeobtainedprovidedthecontrolgain introducedin (29)is selected sufficiently large(see the subsequent stability analysis),and 1, 2 areselectedaccordingtothesufficient conditions min ( 1 ) 1 2 1 (32) 2 where min ( 1 )istheminimumeigenvalueof 1 and is selected according to the following sufficient condition: 1 + 1 2 2 (33) wherewasintroducedin(29).furthermore,()converges to an optimal controller that minimizes (12) subject to (11) provided the gain conditions given in(14)-(16) are satisfied. Remark1: The control gain 1 can not be arbitrarily selected, rather it is calculated using a Lyapunov equation solver. Its value is determined based on the value of and Thereforeandmustbechosensuchthat(32)issatisfied. Proof:LetDR 3+1 beadomaincontaining()=, where()r 3+1 isdefinedas (),[ () p ()] (34) In(34),theauxiliaryfunction()Risdefinedas Z (),k 2 ()k 2 () () wheretheauxiliaryfunction()risdefinedas () (35) (), ( ()( 2 )) (36) where R is a positive constant chosen according to the sufficient conditions in (33). As illustrated in [15], provided the sufficient conditions introduced in (33) are satisfied, the following inequality can be obtained Z () k 2 ()k 2 () () (37) Hence,(37)canbeusedtoconcludethat(). Let () :D [) R be a continuously differentiable positive definite function defined as (), 1 1 + 1 2 2 2 + 1 2 ()+ (38) which satisfies the following inequalities: 1 () () 2 () (39) provided the sufficient conditions introduced in(33) are satisfied.in(39),thecontinuouspositivedefinitefunctions 1 (), and 2 () R are defined as 1 (), 1 kk 2,and 2 (), 2 ()kk 2 where 1, 2 ()Raredefinedas 1, 1 ½ ¾ 1 2 min{1 1} 2 (),max 2 ()1 where 1, ()areintroducedin(2).aftertakingthetime derivativeof(38), ()canbeexpressedas ()=2 1 1 + 2 2 + 1 2 ()+ () + After utilizing (5), (6), (3), and substituting in for the time derivativeof(), ()canbesimplifiedasfollows: () 2 1 1 1 +2 2 1 + () (4) Basedonthefactthat ( +1+ min 1 )kk 2 2 k 2 k 2 2 1 1 2 k 1k 2 + 1 2 k 2k 2 theexpressionin(4)canbesimplifiedas () ()( +1+ min 1 )kk 2 (41) (2 min ( 1 )1)k 1 k 2 ( 2 1)k 2 k 2 Byusing(26),theexpressionin(41)canberewrittenas h i () 3 kk 2 kk 2 (kk)kkkk (42) where 3,min{2 min ( 1 )1 2 11+ min 1 }and 1 and 2 arechosenaccordingtothesufficientconditionin (32). After completing the squares for the terms inside the brackets in(42), the following expression can be obtained () 3 kk 2 + 2 (kk)kk 2 4 () (43) where ()=kk 2, for some positive constant,isa continuous, positive semi-definite function that is defined on the following domain: D, nr 3+1 kk 1³ 2 p 3 o The inequalities in (39) and (43) can be used to show that ()L in D;hence, 1 (), 2 (),and()l in D.Giventhat 1 (), 2 (),and()l ind,standardlinear analysismethodscanbeusedtoprovethat 1 (), 2 ()L indfrom(5)and(6).since 1 (), 2 (),()L ind,the assumptionthat (), (), ()existandareboundedcan beusedalongwith(4)-(6)toconcludethat(), (), () L ind.since(), ()L ind,property3canbeused toconcludethat(), ( ),(),and( )L ind. Thusfrom(1)andProperty4,wecanshowthat()L in D.Giventhat()L ind,itcanbeshownthat ()L ind.since (), ()L ind,property3canbeusedto 2157

47th IEEE CDC, Cancun, Mexico, Dec. 9-11, 28 showthat ( ), (), ()and ()L ind;hence, (3)canbeusedtoshowthat ()L in D.Since 1 (), 2 (), ()L ind,thedefinitionsfor()and()can beusedtoprovethat()isuniformlycontinuousind. LetSDdenoteasetdefinedasfollows: S, ½()D 2 (()) 1 ³ 1³ 2 p 3 2¾ (44) The region of attraction in(44) can be made arbitrarily large to include any initial conditions by increasing the control gain (i.e.,asemi-globaltypeofstabilityresult)[15].theorem 8.4of[21]cannowbeinvokedtostatethat k()k 2 ()S (45) Basedonthedefinitionof(),(45)canbeusedtoconclude theresultsin(31)()s Since()as 2 ()(see(17)),then(22)canbe used to conclude that + + as(), 2 () (46) The result in(46) indicates that the dynamics in(1) converge tothestate-spacesystemin(11).hence,()convergestoan optimal controller that minimizes(12) subject to(11) provided the gain conditions given in(14)-(16) are satisfied. VII. SIMULATIONRESULTS In order to examine the performance of the controller proposed in(19) a numerical simulation was performed. The simulation is based on the dynamics for a two-link robot: 1 1 +2 = 3 2 2 + 3 2 1 (47) 2 2 + 3 2 2 2 3 + 2 2 3 2 ( 1 + 2 ) 3 2 1 1 1 1 + + 2 2 2 1 2 where 1 =3473 2 2 =196 2 3 =242 2 1 =53 sec 2 =11 sec 2 denotes ( 2 ) 2 denotes ( 2 ) and 1 2 denote bounded disturbances defined as 1 =1sin()+15 cos(3) 2 =15 sin(2)+1cos() The desired trajectory is given as (48) 1 = 2 = 1 sin(2) (49) 2 and the initial conditions of the robot were selected as 1 () = 2 ()=143 1 () = 2 ()=286sec The weighting matrixes were chosen as 2 2 45 11 = 22 12 = 3 6 22 = 35 35 ª Tracking Error (deg) Tracking Error (deg) 5 5 1 Link 1 15 1 2 3 4 5 6 7 8 9 1 5 5 1 Link 2 15 1 2 3 4 5 6 7 8 9 1 Time (sec) Fig. 1. The tracking errors for the controller developed in(19). which using(14),(15), and(16) yielded the following values for 1,and 4 4 81 56 = 4 6 1 = 56 54 = ½ 1 35 1 35 The control gains were selected as ¾ 2 =2 =2 =75 The tracking errors from the control inputs are shown in Fig. 1 and Fig. 2, respectively. To show that the RISE feedback identifies the nonlinear effects and bounded disturbances, a plot of the difference is shown in Fig. 3. As this difference goes to zero, the dynamics in(1) converge to the state-space system in (11), and the controller becomes optimal. To test how the optimal controller performed for the unknown system compared to feedback linearized system in (11), () was calculated for each. For this calculation the contribution of RISE feedback term in (19) as well as the contribution of (,,,, ) and () in (9) are not considered. This is duetothefactthatitistheinput()thatminimizes(12).for theunknownsystem,()wascalculatedtobe4332.forthe feedbacklinearizedsystem()wascalculatedtobe441 As expected, the perfectly feedback linearized system has a lower performance index, however the difference between the twovaluesislessthan1%thesearepreliminaryresults,and further simulations and an experimental study are required for more conclusive results. VIII. CONCLUSION WeA9.3 A control scheme is developed for a class of nonlinear Euler-Lagrange systems that enables the generalized coordinates to asymptotically track a desired time-varying trajectory despite general uncertainty in the dynamics such as additive bounded disturbances and parametric uncertainty that does not have to satisfy a linear-in-the-parameters assumption. The 2158

47th IEEE CDC, Cancun, Mexico, Dec. 9-11, 28 WeA9.3 Torque (Nm) Torque (Nm) 6 4 2 2 4 6 Link 1 8 1 2 3 4 5 6 7 8 9 1 25 2 15 1 5 5 Link 2 1 1 2 3 4 5 6 7 8 9 1 Time (sec) 1 5 5 Fig. 2. The torques for the controller developed in(19) Link 1 1 1 2 3 4 5 6 7 8 9 1 2 2 4 6 8 Link 2 1 1 2 3 4 5 6 7 8 9 1 Time (sec) Fig. 3. The difference between the RISE feedback and the nonlinear effects and bounded disturbances. main contribution of this work is that a RISE feedback method is augmented with an auxiliary control term that minimizes a quadratic performance index based on a HJB optimization scheme. Like the influential work in ( [8] [13], [18], [19]) the result in this effort initially develops the HJB optimization scheme based on a partially feedback linearized state-space model assuming exact knowledge of the dynamics. However, unlike previous results that use a neural network structure to approximate the uncertain dynamics(i.e., the state space model is approximated with a residual function reconstruction error), the result in this paper uses the implicit learning capabilities of the RISE control structure to learn the uncertain dynamics asymptotically. That is, the dynamics asymptotically converge tothestate-spacesystemthatthehjboptimizationschemeis basedon.theimplicationisthattheuseoftherisefeedback structure compensates for the uncertain nonlinear dynamics yielding a state space system with a quadratic performance index that is optimized by an additional control element. REFERENCES [1] R. Freeman and P. Kokotovic, Optimal nonlinear controllers for feedback linearizable systems, in Proc. of the American Controls Conf., Jun. 1995, pp. 2722 2726. [2] Q. Lu, Y. Sun, Z. Xu, and T. Mochizuki, Decentralized nonlinear optimal excitation control, IEEE Trans. Power Syst., vol. 11, no. 4, pp. 1957 1962, Nov. 1996. [3] M.Sekoguchi,H.Konishi,M.Goto,A.Yokoyama,andQ.Lu, Nonlinear optimal control applied to STATCOM for power system stabilization, in Proc. IEEE/PES Transmission and Distribution Conference and Exhibition, Oct. 22, pp. 342 347. [4] V. Nevistic and J. A. Primbs, Constrained nonlinear optimal control: a converse HJB approach, California Institute of Technology, Pasadena, CA 91125, Tech. Rep. CIT-CDS 96-21, 1996. [5] J. A. Primbs and V. Nevistic, Optimality of nonlinear design techniques: A converse hjb approach, California Institute of Technology, Pasadena, CA 91125, Tech. Rep. CIT-CDS 96-22, 1996. [6] M. Krstic and H. Deng, Stabilization of Nonlinear Uncertain Systems. Springer, 1998. [7] R. Johansson, Quadratic optimization of motion coordination and control, IEEE Trans. Autom. Control, vol. 35, no. 11, pp. 1197 128, Nov. 199. [8] Y. Kim, F. Lewis, and D. Dawson, Intelligent optimal control of robotic manipulator using neural networks, Automatica, vol. 36, pp. 1355 1364, 2. [9] Y. Kim and F. Lewis, Optimal design of CMAC neural-network controller for robot manipulators, IEEE Trans. Syst., Man, Cybern. C, vol.3,no.1,pp.22 31,Feb.2. [1] M. Abu-Khalaf and F. Lewis, Nearly optimal HJB solution for constrained input systems using a neural network least-squares approach, in Proc. IEEE Conf. on Decision and Control, Dec. 22, pp. 943 948. [11] T. Cheng and F. Lewis, Fixed-final time constrained optimal control of nonlinear systems using neural network HJB approach, in Proc. IEEE Conf. on Decision and Control, Dec. 26, pp. 316 321. [12] T. Cheng and F. L. Lewis, Neural network solution for finite-horizon h- infinity constrained optimal control of nonlinear systems, in Proc.IEEE Intl. Conf. on Control and Automation, Jun. 27, pp. 1966 1972. [13] T. Cheng, F. Lewis, and M. Abu-Khalaf, A neural network solution for fixed-final time optimal control of nonlinear systems, Automatica, vol. 43, pp. 482 49, 27. [14] Z. Qu and J. Xu, Model-based learning controls and their comparisons using Lyapunov direct method, Asian Journal of Control,vol.4,no.1, pp. 99 11, Mar. 22. [15] B.Xian,D.M.Dawson,M.S.deQueiroz,andJ.Chen, Acontinuous asymptotic tracking control strategy for uncertain nonlinear systems, IEEE Trans. Automat. Contr., vol. 49, no. 7, pp. 126 1211, Jul. 24. [16] P.M.Patre,W.MacKunis,C.Makkar,andW.E.Dixon, Asymptotic tracking for systems with structured and unstructured uncertainties, IEEE Trans. Contr. Syst. Technol.,vol.16,no.2,pp.373 379,28. [17], Asymptotic tracking for systems with structured and unstructured uncertainties, in Proc. IEEE Conf. on Decision and Control, San Diego, CA, 26, pp. 441 446. [18] M. Abu-Khalaf, J. Huang, and F. Lewis, Nonlinear H-2/H-Infinity Constrained Feedback Control. Springer, 26. [19] F.L.Lewis,OptimalControl. JohnWiley&Sons,1986. [2]R.Ortega,A.Loría,P.J.Nicklasson,andH.J.Sira-Ramirez,Passivitybased Control of Euler-Lagrange Systems: Mechanical, Electrical and Electromechanical Applications. Springer, 1998. [21] H. K. Khalil, Nonlinear Systems. New Jersey: Prentice-Hall, 22. 2159