Lazy learning for control design

Similar documents
Machine learning techniques for decision support in anesthesia

56 INTERNATIONAL JOURNAL OF COMPUTATIONAL COGNITION ( VOL. 3, NO. 1, MARCH 2005

Gaussian Process for Internal Model Control

CSTR CONTROL USING MULTIPLE MODELS

Long-Term Prediction of Time Series by combining Direct and MIMO Strategies

ANN Control of Non-Linear and Unstable System and its Implementation on Inverted Pendulum

Adaptive Dual Control

Adaptive Fuzzy Modelling and Control for Discrete-Time Nonlinear Uncertain Systems

Introduction to System Identification and Adaptive Control

Application of Neuro Fuzzy Reduced Order Observer in Magnetic Bearing Systems

Local Modelling of Nonlinear Dynamic Systems Using Direct Weight Optimization

A STATE-SPACE NEURAL NETWORK FOR MODELING DYNAMICAL NONLINEAR SYSTEMS

Confidence Estimation Methods for Neural Networks: A Practical Comparison

Temperature control using neuro-fuzzy controllers with compensatory operations and wavelet neural networks

Review on Aircraft Gain Scheduling

10/8/2015. Control Design. Pole-placement by state-space methods. Process to be controlled. State controller

Computational Intelligence Lecture 20:Neuro-Fuzzy Systems

Neural Network Identification of Non Linear Systems Using State Space Techniques.

Model-Based Reinforcement Learning with Continuous States and Actions

Electricity Demand Forecasting using Multi-Task Learning

MODELLING OF TOOL LIFE, TORQUE AND THRUST FORCE IN DRILLING: A NEURO-FUZZY APPROACH

On-line Learning of Robot Arm Impedance Using Neural Networks

GAIN SCHEDULING CONTROL WITH MULTI-LOOP PID FOR 2- DOF ARM ROBOT TRAJECTORY CONTROL

New Concepts for the Identification of Dynamic Takagi-Sugeno Fuzzy Models

Machine learning techniques to enable closed-loop control in anesthesia

Self-tuning Control Based on Discrete Sliding Mode

Modeling and Control Overview

A CUSUM approach for online change-point detection on curve sequences

N. Sarikaya Department of Aircraft Electrical and Electronics Civil Aviation School Erciyes University Kayseri 38039, Turkey

EECE Adaptive Control

A FUZZY NEURAL NETWORK MODEL FOR FORECASTING STOCK PRICE

Lecture 9: Discrete-Time Linear Quadratic Regulator Finite-Horizon Case

Robust multi objective H2/H Control of nonlinear uncertain systems using multiple linear model and ANFIS

STRUCTURED NEURAL NETWORK FOR NONLINEAR DYNAMIC SYSTEMS MODELING

Optimal Kernel Shapes for Local Linear Regression

1348 IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS PART B: CYBERNETICS, VOL. 34, NO. 3, JUNE 2004

NONLINEAR PLANT IDENTIFICATION BY WAVELETS

Development of a Deep Recurrent Neural Network Controller for Flight Applications

The Applicability of Adaptive Control Theory to QoS Design: Limitations and Solutions

Neural Controller. Plant. Plant. Critic. evaluation signal. Reinforcement Learning Controller

ADAPTIVE NEURO-FUZZY INFERENCE SYSTEMS

Adaptive Predictive Observer Design for Class of Uncertain Nonlinear Systems with Bounded Disturbance

A Fast Approximation Algorithm for Set-Membership System Identification

inear Adaptive Inverse Control

Gain Scheduling Control with Multi-loop PID for 2-DOF Arm Robot Trajectory Control

Analysis of Fast Input Selection: Application in Time Series Prediction

An Adaptive LQG Combined With the MRAS Based LFFC for Motion Control Systems

Fast Bootstrap for Least-square Support Vector Machines

Predictive Control of a Single Link Flexible Joint Robot Based on Neural Network and Feedback Linearization

Optimal Polynomial Control for Discrete-Time Systems

CONTROL SYSTEMS, ROBOTICS, AND AUTOMATION - Vol. V - Prediction Error Methods - Torsten Söderström

Robust Observer for Uncertain T S model of a Synchronous Machine

Adaptive Robust Tracking Control of Robot Manipulators in the Task-space under Uncertainties

OPTIMIZATION OF MODEL-FREE ADAPTIVE CONTROLLER USING DIFFERENTIAL EVOLUTION METHOD

Identification and Control of Mechatronic Systems

CS545 Contents XVI. l Adaptive Control. l Reading Assignment for Next Class

Convergence of Hybrid Algorithm with Adaptive Learning Parameter for Multilayer Neural Network

Automatic Differentiation Equipped Variable Elimination for Sensitivity Analysis on Probabilistic Inference Queries

Stability Analysis of the Simplest Takagi-Sugeno Fuzzy Control System Using Popov Criterion

Algorithms for Increasing of the Effectiveness of the Making Decisions by Intelligent Fuzzy Systems

Adaptive Control of a Class of Nonlinear Systems with Nonlinearly Parameterized Fuzzy Approximators

A Boiler-Turbine System Control Using A Fuzzy Auto-Regressive Moving Average (FARMA) Model

I. D. Landau, A. Karimi: A Course on Adaptive Control Adaptive Control. Part 9: Adaptive Control with Multiple Models and Switching

Algorithm for Multiple Model Adaptive Control Based on Input-Output Plant Model

Immediate Reward Reinforcement Learning for Projective Kernel Methods

( t) Identification and Control of a Nonlinear Bioreactor Plant Using Classical and Dynamical Neural Networks

Chapter 2 Review of Linear and Nonlinear Controller Designs

Strong Robustness in Multi-Phase Adaptive Control: the basic Scheme

Adaptive State Feedback Nash Strategies for Linear Quadratic Discrete-Time Games

Applying Particle Swarm Optimization to Adaptive Controller Leandro dos Santos Coelho 1 and Fabio A. Guerra 2

Adaptive fuzzy observer and robust controller for a 2-DOF robot arm

CONTROL SYSTEMS, ROBOTICS AND AUTOMATION Vol. XVII - Analysis and Stability of Fuzzy Systems - Ralf Mikut and Georg Bretthauer

Bidirectional Representation and Backpropagation Learning

A Probabilistic Representation for Dynamic Movement Primitives

Identification of Nonlinear Dynamic Systems with Multiple Inputs and Single Output using discrete-time Volterra Type Equations

Predictive Cascade Control of DC Motor

Computation of Stabilizing PI and PID parameters for multivariable system with time delays

Indirect Model Reference Adaptive Control System Based on Dynamic Certainty Equivalence Principle and Recursive Identifier Scheme

Model Reference Adaptive Control for Multi-Input Multi-Output Nonlinear Systems Using Neural Networks

4.0 Update Algorithms For Linear Closed-Loop Systems

A Matlab/Simulink Toolbox for Inversion of Local Linear Model Trees

Design Artificial Nonlinear Controller Based on Computed Torque like Controller with Tunable Gain

NONLINEAR AND ADAPTIVE (INTELLIGENT) SYSTEMS MODELING, DESIGN, & CONTROL A Building Block Approach

4F3 - Predictive Control

Acceleration of Levenberg-Marquardt method training of chaotic systems fuzzy modeling

A Modified Incremental Principal Component Analysis for On-line Learning of Feature Space and Classifier

Nonlinear System Identification Using MLP Dr.-Ing. Sudchai Boonto

A Sliding Mode Controller Using Neural Networks for Robot Manipulator

Ensembles of Extreme Learning Machine Networks for Value Prediction

PREDICTIVE CONTROL OF NONLINEAR SYSTEMS. Received February 2008; accepted May 2008

Parameter Identification and Dynamic Matrix Control Design for a Nonlinear Pilot Distillation Column

Observer Based Friction Cancellation in Mechanical Systems

ECE Introduction to Artificial Neural Network and Fuzzy Systems

Modeling of Hysteresis Effect of SMA using Neuro Fuzzy Inference System

Virtual Reference Feedback Tuning for non-linear systems

Takagi Sugeno Fuzzy Sliding Mode Controller Design for a Class of Nonlinear System

EECE Adaptive Control

Application of Associative Matrices to Recognize DNA Sequences in Bioinformatics

A TSK-Type Quantum Neural Fuzzy Network for Temperature Control

FUZZY STABILIZATION OF A COUPLED LORENZ-ROSSLER CHAOTIC SYSTEM

Transcription:

Lazy learning for control design Gianluca Bontempi, Mauro Birattari, Hugues Bersini Iridia - CP 94/6 Université Libre de Bruxelles 5 Bruxelles - Belgium email: {gbonte, mbiro, bersini}@ulb.ac.be Abstract. This paper presents two local methods for the control of discrete-time unknown nonlinear dynamical systems, when only a limited amount of input-output data is available. The modeling procedure adopts lazy learning, a query-based approach for local modeling inspired to memory-based approximators. In the first method the lazy technique returns the forward and inverse models of the system which are used to compute the control action to take. The second is an indirect method inspired to adaptive control where the self-tuning identification module is replaced by a lazy approximator. Simulation examples of control of nonlinear systems starting from observed data are given.. Introduction The idea of local memory-based approximators as alternative to global models originated in non-parametric statistics to be later rediscovered and developed in machine learning (Bottou & Vapnik, 992). Recent work on lazy learning (a.k.a just-in-time learning) gave a new impetus to the adoption of local techniques for modeling (Atkeson et al., 997a), and control problems (Atkeson et al., 997b). In the lazy learning approach, the value of an unknown mapping is estimated giving the whole attention to the region surrounding the point ϕ q where the estimation itself is required. The procedure essentially consists of these steps: i) for each query point ϕ q, define a set of neighbors, each weighted according to some relevance criterion (e.g. the distance) ii) choose a local regression function f( ) in a restricted family of parametric functions iii) compute the regression value ˆf(ϕ q ). In (Bontempi et al., 997) we extend the classical memory-based approach with a method that automatically selects the local model configuration. To this aim, we apply tools and techniques from linear statistical analysis to nonlinear modeling problems. The most relevant one is the PRESS statistic (Myers, 99), a simple, well-founded and economical way to perform leave-one-out cross validation (Efron & Tibshirani, 993) and The work of Gianluca Bontempi was supported by the European Union TMR Grant FMBICT96692. The work of Mauro Birattari was supported by the F.I.R.S.T. program of the Région Wallonne, Belgium.

therefore to assess the performance in generalization of local linear models. Due to its short computation time which allows its intensive use, it is the key element of our approach to modeling data. In this paper we present and compare two methods for discrete-time control based on the local linear description returned by the lazy learning model. The idea of employing linear techniques in a nonlinear setting is not new in control literature but had recently a new popularity thanks to methods for combining multiple estimators and controllers in different operating regimes of the system (Murray-Smith & Johansen, 997). Gain scheduling (Shamma & Athans, 992), fuzzy inference systems (Takagy & Sugeno, 985) and local model networks (Johansen & Foss, 993), are well-known examples of control techniques for nonlinear systems inspired to linear control. The first method we propose is an example of gradient-based control system which is inspired to neural controllers and combines an inverse with a forward lazy model to select the control action. The algorithm has been introduced by (Atkeson et al., 997b) and applied to a static control task. Here we test its stability properties on two different dynamic tasks. The second controller is based on the self-tuning regulator (STR) architecture (Astrom & Wittenmark, 99) and combines discrete-time conventional control (e.g. generalized minimum variance, pole placement) with the lazy local modeling. The adoption of linear control techniques allows the local analysis of the closed-loop systems in terms of stability properties. In the next section, we will introduce the two control algorithms. In section 3. we will present some simulation results of the control of two nonlinear systems. Some final considerations are summarised in section 4. 2. Lazy learning for control design Consider a class of discrete-time dynamic systems whose equations of motion canbeexpressedinthenarmaxform: y(k) =f ( y(k ),...,u(k d),...,e(k ),... ) + e(k), () where k denotes the time, y(k) is the system output, u(k) the input, e(k) isa zero-mean disturbance term, d> is the relative degree and f( ) is some nonlinear function. Let us assume we have no physical description of the function f( ) but a limited amount of pairs [ u(k),y(k) ] is available. 2.. The lazy learning gradient-based controller The idea of the lazy gradient-based controller is to solve a one time-step horizon control problem as an optimization problem. Suppose that the system (), is required to reach at the next time time step a reference value y ref (for clarity, we assume d =). Thelazy model can be used to predict the response of the system to the control action u i : ŷ u i(k) = ˆf ( y(k ),...,u i,u(k 2),...,e(k ),... ). (2)

In addition, the linearization returned by the local description provides an estimate of the gradient of the system output dŷ u i (k) du with respect to the control i action (Atkeson et al., 997a). Thecontrol problemcan therefore beformulated as a constrained gradient based optimization problem: u opt = arg min J(u i )=arg min (y ref ŷ u i(k)) 2. (3) u i u i In order to speed up the optimization resolution, the algorithm can be initialized with the value returned from the model of the inverse dynamics: u = ˆf inv ( yref,y(k ),...,u(k 2),...,e(k ),... ). (4) The lazy gradient-based control algorithm can be described by the following steps to be repeated at each sampling period:. Initialization of the algorithm with the value u provided by the inverse mapping (4). 2. Prediction of the outcome ŷ u i of the system forced by the input u i and computation of the gradient vector dj(ui ) du. i 3. Updating of the control sequence u i u i+. The optimization step is performed by a constrained gradient based algorithm. 4. If the minimum has been reached (u i = u i+ ) goto 5 else goto 2. 5. Control action execution. 6. Updating of the database. 2.2. The Lazy learning self-tuning controller This approach combines the lazy learning identification procedure with control techniques from adaptive linear control (Astrom & Wittenmark, 99), e.g. minimum-variance and pole-placement. Let us consider a linear discrete-time process described in input-output form by the equation: A(z)y(k) =z d B(z)u(k)+C(z)e(k), (5) and suppose we want to regulate it to y ref =. The MV control problem can be stated as finding the control law which minimizes the variance of the output. The MV controlled closed loop system is stable only if B has all of its roots inside the unit circle (minimum phase). However, more complex formulations are available in the case of a tracking problem or in the case of non minimum-phase systems (Generalized MV or GMV). Pole placement design is an alternative technique to deal with non minimum-phase configurations. The procedure requires first to choose the desired closed loop pole positions and then to calculate the appropriate controller. Both these design techniques require a model linearization in the form (5). In our approach this is returned at each time step by the lazy learning model. In detail the control algorithm can be described by the following steps to be repeated at each sampling period:

5 5 2 25 3 ESANN'998 proceedings - European Symposium on Artificial Neural Networks.5.5.5 t.5 5 5 2 25 3 t Figure : Gradient-based control: a) reference (solid) and system (dotted) outputs b) control action. Linearization of the function f( ) computed by the lazy learning algorithm. 2. Derivation of the polynomials A, B, C of (5) from the linearized model. 3. Design of a MVG/PP controller for (5) which satisfies the required properties (stability, accuracy, speed... ) of the closed loop behavior. 4. Computation and execution of the control signal. 5. Updating of the database. 3. Simulation studies 3.. A lazy gradient-based control example In this simulation we consider the control of the plant described by the example.2 in (Narendra & Li, 996). We use the control algorithm described in section 2.2.. The system is represented in the minimum-phase input-output form y(k +) =f ( y(k),y(k ),y(k 2),y(k 3),u(k) ). We use an initial empty database which is updated all along the identification. The system is controlled for 3 time steps. The plot in Fig. a shows the model and the system output in the last 3 points, while the plot in Fig. b shows the control action. These results outperform those obtained by (Narendra & Li, 996) after 2,, steps of on-line adjustments and a complex architecture (4-layer feed-forward neural network). 3.2. A lazy self-tuning control example In this simulation we consider the control of the nonlinear SISO system described by the difference equation: y(k +)= y(k)y(k )y(k 2)( y(k 2) ) u(k ) + u(k). (6) +y 2 (k ) + y 2 (k 2)

.5 3 2.5 2.5.5.5 5 5 2 25 3 35 4 2 5 5 2 25 3 35 4 Figure 2: Self-tuning control: a) reference (solid) and system (dotted) outputs b) control action. The system is represented in the input-output form y(k +)=f ( y(k),y(k ),y(k 2),u(k),u(k ) ). The reference output y ref (k) is given by a periodic square wave. The lazy nonlinear algorithm is not able to control the system over this reference trajectory. On the contrary, a self-tuning regulator based on a pole placement algorithm is able to track the trajectory. We initialize the lazy learning database with a set of 5 points collected by preliminarly exciting the system with a random uniform input. The database is then updated online each time a new input-output pairs is returned by the simulated system. The plot in Fig. 2a shows the reference and the system output, while the plot in Fig. 2b shows the control action. From the local analysis of the identified system we have that the system is non minimum-phase (i.e. absolute value of the zero greater than one) when y is in the neighborhood of y = (e.g. see thetimeinterval5 ). This is indeed the region where the gradient-based controller fails to control the system by making the feedback loop unstable. 4. Final considerations We illustrated and tested two control systems, which make an extensive use of local modeling. The lazy gradient-based control system, inspired by neural control, makes use of forward and inverse approximations of the system dynamics to select the control action. Like in neuro-control, properties of stability cannot be guaranteed in a general case. However, we showed how this approach can obtain performances more accurate than neural networks even using a smaller set of training examples. The lazy self-tuning architecture adopts a linear control technique which eases the analysis in terms of stability properties and provides a useful insight in the dynamic properties of the nonlinear system. It is worthy noting how in both the approaches we made the assumption of certainty equivalence. Future developments will focus on how to deal with parameter uncertainty in local control.

References Astrom K.J. & Wittenmark B. 99. Computer-controlled Systems: Theory and Design. Prentice-Hall International Editions. Atkeson C.G., Moore A.W. & Schaal S. 997a. Locally weighted learning. Artificial Intelligence Review, ( 5), 73. Atkeson C.G., Moore A.W. & Schaal S. 997b. Locally weighted learning for control. Artificial Intelligence Review, ( 5), 75 3. Bersini H., Birattari M. & Bontempi G.. 997. Adaptive memory-based regression methods. submitted to 998 IEEE International Joint Conference on Neural Networks. Bontempi G., Birattari M. & Bersini H.. 997. Recursive lazy learning for modeling and control. submitted to Tenth European Conference On Machine Learning (ECML-98). Bottou L. & Vapnik V.N. 992. Local learning algorithms. Neural Computation, 4(6), 888 9. Efron B. & Tibshirani R.J. 993. An Introduction to the Bootstrap. New York: Chapman and Hall. Johansen T.A. & Foss B.A. 993. Constructing NARMAX models using ARMAX models. International Journal of Control, 58, 25 53. Murray-Smith R. & Johansen T.A. (eds) 997. Multiple Model Approaches to Modelling and Control. TaylorandFrancis. Myers R.H. 99. Classical and Modern Regression with Applications. Boston, MA: PWS-KENT. Narendra K.S. & Li S.M. 996. Neural Networks in Control Systems. Chap., pages 347 394 of: Paul Smolensky M.C. Mozer & Rumelhart D.E. (eds), Mathematical Perspectives on Neural Networks. Lawrence Erlbaum Associates. Shamma J.S. & Athans M. 992. Gain scheduling: Potential hazards and possible remedies. IEEE Control Systems Magazine, June, 7. Takagy T. & Sugeno M. 985. Fuzzy identification of systems and its applications to modeling and control. IEEE Transactions on System, Man and Cybernetics, 5(), 6 32.