Adaptive Inverse Control

Similar documents
inear Adaptive Inverse Control

ADAPTIVE INVERSE CONTROL BASED ON NONLINEAR ADAPTIVE FILTERING. Information Systems Lab., EE Dep., Stanford University

Adaptive Inverse Control based on Linear and Nonlinear Adaptive Filtering

Automatic Stabilization of an Unmodeled Dynamical System Final Report

Temporal Backpropagation for FIR Neural Networks

EFFICIENT LINEAR MIMO ADAPTIVE ŜP INVERSE CONTROL. Gregory L. Plett E

4.0 Update Algorithms For Linear Closed-Loop Systems

New Recursive-Least-Squares Algorithms for Nonlinear Active Control of Sound and Vibration Using Neural Networks

( t) Identification and Control of a Nonlinear Bioreactor Plant Using Classical and Dynamical Neural Networks

Recurrent neural networks with trainable amplitude of activation functions

Neural Network Identification of Non Linear Systems Using State Space Techniques.

On the Use of A Priori Knowledge in Adaptive Inverse Control

ADAPTIVE FILTER THEORY

Optimal Polynomial Control for Discrete-Time Systems

T Machine Learning and Neural Networks

THE Back-Prop (Backpropagation) algorithm of Paul

THE concept of active sound control has been known

ANN Control of Non-Linear and Unstable System and its Implementation on Inverted Pendulum

Title without the persistently exciting c. works must be obtained from the IEE

Cosc 3451 Signals and Systems. What is a system? Systems Terminology and Properties of Systems

AN ALTERNATIVE FEEDBACK STRUCTURE FOR THE ADAPTIVE ACTIVE CONTROL OF PERIODIC AND TIME-VARYING PERIODIC DISTURBANCES

Control System Design

NONLINEAR PLANT IDENTIFICATION BY WAVELETS

Relating Real-Time Backpropagation and. Backpropagation-Through-Time: An Application of Flow Graph. Interreciprocity.

Direct Method for Training Feed-forward Neural Networks using Batch Extended Kalman Filter for Multi- Step-Ahead Predictions

III. Time Domain Analysis of systems

Application of Artificial Neural Networks in Evaluation and Identification of Electrical Loss in Transformers According to the Energy Consumption

Unit 11 - Week 7: Quantitative feedback theory (Part 1/2)

Identification and Control of an Inverted Pendulum Using Neural Networks

Application of Neuro Fuzzy Reduced Order Observer in Magnetic Bearing Systems

STRUCTURED NEURAL NETWORK FOR NONLINEAR DYNAMIC SYSTEMS MODELING

Variable Learning Rate LMS Based Linear Adaptive Inverse Control *

Simon Fraser University School of Engineering Science ENSC Linear Systems Spring Instructor Jim Cavers ASB

Lecture - 1 Introduction to Adaptive Filters

Neural Network Control in a Wastewater Treatment Plant

Iterative Learning Control Analysis and Design I

Submitted to Electronics Letters. Indexing terms: Signal Processing, Adaptive Filters. The Combined LMS/F Algorithm Shao-Jen Lim and John G. Harris Co

Lecture 12. Upcoming labs: Final Exam on 12/21/2015 (Monday)10:30-12:30

Robust Controller Design for Speed Control of an Indirect Field Oriented Induction Machine Drive

Neural Dynamic Optimization for Control Systems Part II: Theory

Nonlinear System Identification Using MLP Dr.-Ing. Sudchai Boonto

Introduction to DSP Time Domain Representation of Signals and Systems

Lecture 11 FIR Filters

Filter-Generating Systems

ECE Introduction to Artificial Neural Network and Fuzzy Systems

Artificial Neural Network Simulation of Battery Performance

A STATE-SPACE NEURAL NETWORK FOR MODELING DYNAMICAL NONLINEAR SYSTEMS

NONLINEAR ECHO CANCELLATION FOR HANDS-FREE SPEAKERPHONES. Bryan S. Nollett and Douglas L. Jones

Assesment of the efficiency of the LMS algorithm based on spectral information

GENERALIZED DEFLATION ALGORITHMS FOR THE BLIND SOURCE-FACTOR SEPARATION OF MIMO-FIR CHANNELS. Mitsuru Kawamoto 1,2 and Yujiro Inouye 1

Repetitive control : Power Electronics. Applications

Recursive Neural Filters and Dynamical Range Transformers

Performance of an Adaptive Algorithm for Sinusoidal Disturbance Rejection in High Noise

y k = (a)synaptic f(x j ) link linear i/p o/p relation (b) Activation link linear i/p o/p relation

The Rationale for Second Level Adaptation

IN neural-network training, the most well-known online

Lifted approach to ILC/Repetitive Control

Trajectory tracking control and feedforward

EFFICIENT LINEAR MIMO ADAPTIVE INVERSE CONTROL

A METHOD OF ADAPTATION BETWEEN STEEPEST- DESCENT AND NEWTON S ALGORITHM FOR MULTI- CHANNEL ACTIVE CONTROL OF TONAL NOISE AND VIBRATION

Robust Adaptive Attitude Control of a Spacecraft

System Identification for the Hodgkin-Huxley Model using Artificial Neural Networks

Inverted Pendulum. Objectives

LTI Systems (Continuous & Discrete) - Basics

Process Control, 3P4 Assignment 6

Classical Dual-Inverted-Pendulum Control

On Comparison of Neural Observers for Twin Rotor MIMO System

Roundoff Noise in Digital Feedback Control Systems

Analysis and Synthesis of Single-Input Single-Output Control Systems

Introduction to Machine Learning Spring 2018 Note Neural Networks

Christian Mohr

Theory and Problems of Signals and Systems

Neural Networks Introduction

EEE 241: Linear Systems

EECE Adaptive Control

1 An Overview and Brief History of Feedback Control 1. 2 Dynamic Models 23. Contents. Preface. xiii

Lecture 19 IIR Filters

Here represents the impulse (or delta) function. is an diagonal matrix of intensities, and is an diagonal matrix of intensities.

arxiv: v1 [cs.sd] 28 Feb 2017

Artificial Neural Network

Modeling and Control Overview

Keywords- Source coding, Huffman encoding, Artificial neural network, Multilayer perceptron, Backpropagation algorithm

Study on the use of neural networks in control systems

Essence of the Root Locus Technique

I. MAIN NOTATION LIST

Discrete-Time Systems

Using Neural Networks for Identification and Control of Systems

A New Model Reference Adaptive Formulation to Estimate Stator Resistance in Field Oriented Induction Motor Drive

Hopfield Neural Network

IS NEGATIVE STEP SIZE LMS ALGORITHM STABLE OPERATION POSSIBLE?

SOME RESULTS CONCERNING FAST LINEAR MIMO ADAPTIVE INVERSE CONTROL

PROPORTIONAL-Integral-Derivative (PID) controllers

Neural Network Control of Robot Manipulators and Nonlinear Systems

Deep Feedforward Networks

Exam. 135 minutes + 15 minutes reading time

Digital Signal Processing

Chapter 7 Interconnected Systems and Feedback: Well-Posedness, Stability, and Performance 7. Introduction Feedback control is a powerful approach to o

Biomedical Signal Processing and Signal Modeling

A NEURAL NETWORK-BASED SVPWM CONTROLLER FOR A TWO-LEVEL VOLTAGE-FED INVERTER INDUCTION MOTOR DRIVE

Acceleration Feedback

Transcription:

TA1-8:30 Adaptive nverse Control Bernard Widrow Michel Bilello Stanford University Department of Electrical Engineering, Stanford, CA 94305-4055 Abstract A plant can track an input command signal if it is driven by a controller whose transfer function approximates the inverse of its transfer function. A stable inverse can be obtained even if the plant is nonminimum-phase. No direct feedback is used, except that the plant output is monitored and utilized to adapt the parameters of the controller. A model-reference inverse control system can learn to approximate a desired reference-model dynamics. Control of internal plant disturbance is accomplished with an optimal adaptive disturbance canceller. t does not affect plant dynamics, but feeds back plant disturbance in a way that minimizes disturbance power at the plant output. Similar principles can be utilized to control nonlinear systems. Neural networks are used to build a model of the plant and to construct its inverse 1 ntroduction This paper presents techniques for solving adaptive control problems by means of adaptive filtering. Many problems in adaptive control can be divided into two parts: (a) control of plant dynamics, and (b) control of plant disturbance. Very often, a single system is utilized to achieve both of these control objectives. Our approach however treats each problem separately. Control of plant dynamics can be achieved by preceding the plant with an adaptive controller whose transfer function is the inverse of that of the plant. Control of plant disturbance can be achieved by an adap- This work was sponsored by NSF under grant NSF R 91-12531, by ONR under contract no N00014-92-J-1787, and by EPR under contract RP:8010-13 tive feedback process that minimizes plant output disturbance without altering plant dynamics. The principle of control of plant dynamics can be extended to deal with nonlinear plants. n that case, tapped delay lines and neural networks are used in place of linear adaptive filters. 2 2.1 Adaptive nverse Control for Linear Plants Direct plant identification Adaptive plant modeling or identification is an important function. Fig. 1 illustrates how this can be done with an adaptive FR filter. The plant input signal is the input to the ada.ptive filter. The plant output signal is the desired response, the target signal for the filter output. The adaptive algorithm, LMS 111 or RLS [2], minimizes mean square error, causing the model P to be a best least squares match to the plant P for the given input signal and for the given set of parameters (weights) allocated to?. nput + P r Figure 1: Direct plant identification. 2.2 nverse plant identification +b Another important function is inverse plant identification. This technique is illustrated in Fig. 2. The plant input is as before. The plant output is the input to the adaptive filter. The desired 0-7803-1 206-6/93/$3.00 01993 EEE 1

response for the adaptive filter is the plant input in this case. Minimizismean square error causes the adaptive filter P-l to be a best least squares inverse to the plant P for the given imput spectrum and for the given set of weights of the adaptive filter. The adaptive algorithm attempts to make the cascade of plant and adaptive inverse behave like a unit gain. This process is often called deconvolution. Figure 3: Delayed inverse identification for a nonminimum-phase plant. laplt P Figure 2: nverse identification. For sake of argument, the plant is assumed to have poles and zeros. An inverse, if it also had poles and zeros, would need to have zeros where the plant had poles and poles where the plant had zeros. Making an inverse would be no problem except for the case of a nonminimum-phase plant. t would seem that such an inverse would need to have unstable poles, and this would be true if the inverse were causal. f the inverse could be noncausal as well as causal however, then a two-sided stable inverse would exist for all linear time-invariant plants in accord with the theory of two-sided Laplace transforms. A causal FR filter can approximate a delayed version of the two-sided plant inverse, and an adaptive FR filter can self-adjust to this function. The method is illustrated in Fig. 3. The time span of the adaptive filter (the number of weights multiplied by the sampling period) can be made adequately long so that the mean square error of the optimized inverse would be a small fraction of the plant input power. To achieve this objective with a nonminimum-phase plant, the delay A needs to be chosen appropriately. The choice is generally not critical however. The inverse filter is used as a controller in the present scheme, so that A becomes the response delay of the controlled plant. Making A small is generally desirable, but the quality of control depends upon the accuracy of the inversion process which sometimes requires A to be of the order of half the length of the adaptive filter, or less. A simulation experiment has been done to illustrate the effectiveness of the inversion process. Fig. 4 shows the impulse response of a nonminimum-phase plant having a small transport delay. Fig. 5 shows the impulse reponse of the best least squares inverse with a delay of A = 50 sample periods. Fig. 6 is a convolution of the plant and its inverse impulse response. The result is essentially a unit impulse at a delay of 50. Figure 4: mpulse response of nonminimumphase plant. A model-reference inversion process is shown in Fig. 7. A reference model is used in place of the delay of Fig. 3. Minimizing mean square error with the system of Fig. 7 causes the cascade of the plant and its model-reference inverse to approximate closely the response of a model M. Much is known about the design of model reference systems [3]. The model is chosen to give a desirable response to the overall system. Some delay may need to be incorporated into the model in order to achieve low error. 2

back. Extensive analysis will be presented in the forthcoming book by Widrow and Walach [4]. -24 i J 0 20 40 Bo 10 100 120 140 Figure 5: mpulse response of delayed inverse. Figure 8: nverse control scheme for minimumphase plants. J Figure 9: nverse control scheme for nonminimum-phase plants. Figure 6: Convolution of plant with delayed inverse. 2.3 Adaptive Control of Plant Dynamics Now having the plant inverse, it can be used as a controller to provide a driving function for the plant. This simple idea is illustrated in Fig. 8 for minimum-phase plants. Fig. 9 shows the corresponding scheme for nonminimum-phase systems. Many simulation examples have been performed, with consistently good results, as long as the plant is stable or is first stabilized by feed- Figure 7: Model-reference plant inverse. 2.4 Adaptive Plant-Disturbance cancelling The systems of Fig. 8 and Fig. 9 only control and compensate for plant dynamics. The disturbance appears at the plant output unabated. The only way that the plant output disturbance can be reduced is to obtain this disturbance from the plant output and process it, then feed it back into the plant input. The system shown in Fig. 10 does this. n Fig. 10, an exact copy of j is fed the same input signal as the plant P. The output of this f' copy is subtracted from the plant output. Assuming that p has a dynamic response essentially identical to that of the plant P, the difference in the outputs is a close estimate of the plant disturbance. This disturbance is filtered by Q and then subtracted from the plant input. The filter Q is generated by an off-line process that delivers new values of Q almost instantaneously with new values of P, which adapts continually to keep up with changes in the plant P. The filter Q is essentially the best inverse 3

1103-1 o(kh- Figure 10: Disturbance cancelling system. (without delay) of P. The synthetic disturbance used to train Q should have a spectral character like that of the plant disturbance. t is shown in the Widrow and Walach book [4] that the disturbance cancelling system of Fig. 10 adapts and converges to minimize the plant disturbance at the plant output. As such, it is an optimal linear least squares system. There is no way to further reduce the plant disturbance. The system of Fig. 10 appears to be a feedback system. However, if P is dynamically the same as P, the transfer function around the loop is zero. The transfer function from the Plant nput point to the Plant Output point is the same as that of the plant alone. Thus, the disturbance canceller does not affect the plant dynamics. Almost perfect disturbance cancellation is possible with a minimum-phase plant. With a nonminimum-phase plant, even optimal cancelling will not cancel all the disturbance. Fig. 11 and Fig. 12 show results of a plant disturbance cancellation experiment. Although the plant in this case was nonminimum-phase, the results are quite good. 3 Nonlinear nverse Control The principles of inverse control can be extended to deal with nonlinear systems. Nonlinear systems behave quite differently from their linear counterparts. For example, whereas a linear system possesses a unique inverse, nonlinear systems have only local inverses, valid only in a bounded region of the signal space. n addition, left and Figure 11: Output of undisturbed plant and output of disturbed plant, with and without disturbance canceller (canceller turned on at k = 300). Figure 12: nstantaneous power of plant output disturbance (canceller turned on at k = 300). right inverses are in general different, as in the case of multi-input multi-output linear systems. As linear adaptive filters are used to control linear plants, the "inverse" controller for nonlinear plants involves a type of recurrent neural network. The ability of multilayered neural networks to approximate nonlinear mappings over compact regions as detailed in [5] makes them useful in identifying direct and inverse models. The inverse control of nonlinear plants involves a two-stage process where a model of the plant is first constructed (identification) and second the plant model is inverted. The system is modelled through the use of a feedforward multilayered neural network fitted with tapped delay lines at its input and output and a feedback loop. With an appropriate number of hidden neurons, such a neural network can 4

represent a system of the form over a bounded region of input space. The choice of the integers n and p is part of the modelling design and follows from requirements of model accuracy. The identification scheme is founded on a standard technique, which is the nonlinear equivalent of the equation-error formulation described in [6], and is called a series-parallel model in [7]. The choice of this formulation allows the use of the standard backpropagation algorithm for training the neural network. The second step is the design of the controller. Once the neural network has been trained to perform plant identification, the controller, also implemented as a neural network, is trained to behave like the inverse of the system. The algorithm used for training the controller is a variant of the recurrent backpropagation algorithm [8]. As previously mentioned and unlike linear systems, nonlinear systems do not commute with their inverses. This restriction demands that the controller be trained upstream from the plant model and that the error signal be backpropagated through the plant model as shown in Fig. 13. units. The result of the plant identification is displayed in Fig. 15. Specifically, the outputs of the plant and plant model are compared when the same test signal is fed to their inputs. Here the test signal is chosen as a sinusoid with increasing amplitude. 1.pt Figure 14: Configuration for plant identification. P 1 Figure 13: nverse control for nonlinear systems. 3.1 Example Let's consider the nonlinear plant suggested in [7] and defined by the equation: The input signal is confined in the interval [-1,1]. Fig. 14 shows the training configuration for the plant model. t had two inputs, the external input U and the output from the real plant, and one output. t had one hidden layer with 10 Figure 15: Result of nonlinear plant identification. Training of the inverse controller C is illustrated in Fig. 16. The neural network controller had a two-tap tapped delay line as input, a hidden layer with 10 units and one output which is fed to the plant model. The error is backpropagated through the plant model using online recurrent backpropagation. The time plots of Fig. 17 show the command input fed to the trained inverse controller versus the plant output. Although there are errors, the agreement between the two signals is very good. The important thing to note is that the controller is trained to be an inverse to the plant model and not the plant itself. Consequently, good performance of the controller is contingent on building an accurate model for the plant. 5

q 171 Figure 16: Configuration for training of inverse controller. 01. 151 K. Hornik, M. Stinchcombe, and H. White. Multilayer feedforward networks are universal approximators. Neural Networks, 2359-366, 1989. [61 181 John J. Shynk. Adaptive iir filtering. EEE ASSP Magazine, April 1989. Kumpati S. Narendra and Kannan Parthasarathy. dentification and control of dynamical systems using neural networks. EEE Transactions on Neural Networks, 1(1):4-27, March 1990. R. J. Williams and D. Zipser. A learning algorithm for continually running fully recurrent neural networks. Cs Report 8805, nstitute for Cognitive Science, University of California at San Diego, La Jolla, CA 92093, October 1988. 0.6. M- 0.2. 0. 42. 4.4. 4.6. 4J. Figure 17: Performance of inverse controller. 4 conclusion Methods for adaptive control of plant dynamics and for control of plant disturbance for unknown linear plants have been described. n addition extension of control of plant dynamics to nonlinear plants using neural networks have been presented. For their proper application, the plant must be stable. An unstable plant could first be stabilized with feedback, then adaptively controlled. Details will be found in the Widrow and Walach book, which is in press. References Bernard Widrow and Samuel D. Stearns. Adaptive Signal Processing. Prentice-Hall, Englewood Cliffs, N J, 1985. Simon Haykin. Adaptive Filter Theory. Prentice-Hall, Englewood Cliffs, NJ, 2 edition, 1991.. D. Landau. Adaptive Control. The Model Reference Approach. Marcel Dekker, New York, 1979. Bernard Widrow and Eugene Walach. Adaptive nverse Control. Prentice-Hall, Englewood Cliffs, N J, 1993. (h press). 6