The Uncertainty Threshold Principle: Some Fundamental Limitations of Optimal Decision Making under Dynamic Uncertainty


Michael Athans, Richard Ku, Stanley Gershwin
(Review by Nicholas Ballard)

Introduction

The paper in question deals with the controller's ability to apply optimal control strategies, and shows that for systems that evolve with uncertainty above a quantifiable level, no optimal control strategy exists in the infinite-horizon case. The problem statement is as follows. Consider the scalar dynamical system with state $x_t$ and control $u_t$ described by the difference equation

$$x_{t+1} = a_t x_t + b_t u_t \qquad (1)$$

where the parameters $a_t$ and $b_t$ are Gaussian random variables, uncorrelated in time, whose means and variances are known:

$$E[a_t] = \bar{a}, \qquad E[a_t^2] = \Sigma_{aa} + \bar{a}^2,$$
$$E[b_t] = \bar{b}, \qquad E[b_t^2] = \Sigma_{bb} + \bar{b}^2,$$
$$E[a_t b_t] = \Sigma_{ab} + \bar{a}\bar{b}.$$

It is assumed that the system is fully observable, that is, the state $x_t$ can be measured exactly. A white-noise term may be added to the difference equation (1), as is the case for many models; however, this does not change the result for the system. The results of the paper are motivated by the minimization of the standard quadratic cost function

$$J = E\left[\sum_{t=0}^{N} \left(Q x_t^2 + R u_t^2\right)\right] \qquad (2)$$

where $Q > 0$ and $R > 0$.
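To make the setup concrete, here is a minimal simulation sketch of system (1) under a fixed linear feedback $u_t = g\, x_t$. All numerical values are assumptions of mine for illustration only; nothing here is taken from the paper itself.

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative (assumed) statistics for the white parameters of system (1).
a_bar, b_bar = 1.1, 1.0                  # means of a_t and b_t
sig_aa, sig_bb, sig_ab = 0.3, 0.1, 0.0   # variances and covariance

def simulate(x0, gain, steps):
    """Roll out x_{t+1} = a_t x_t + b_t u_t with u_t = gain * x_t."""
    cov = [[sig_aa, sig_ab], [sig_ab, sig_bb]]
    x, path = x0, [x0]
    for _ in range(steps):
        a_t, b_t = rng.multivariate_normal([a_bar, b_bar], cov)
        x = a_t * x + b_t * gain * x
        path.append(x)
    return path

print(simulate(x0=1.0, gain=-0.9, steps=10))
```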

At the time of publication of this paper, the optimal control for the minimization of (2) had already been known for some time; it is found by applying a standard dynamic programming argument to the cost function. The optimal control and optimal cost for the system are given by

$$u_t^* = G_t x_t \qquad (3)$$

$$G_t = -\frac{K_{t+1} E[a_t b_t]}{R + K_{t+1} E[b_t^2]}, \qquad J^* = K_0 x_0^2 \qquad (4)$$

where the scalars $K_t$ are related by a discrete Riccati-like recursion, solved backwards from $K_N = Q$:

$$K_t = Q + K_{t+1} E[a_t^2] - \frac{K_{t+1}^2 \left(E[a_t b_t]\right)^2}{R + K_{t+1} E[b_t^2]} \qquad (5)$$

Results

These results for the optimal control had already been published in other papers for scalar and vector systems; the true insight of this paper lies in examining the optimal cost for different values of the system parameters and the planning horizon $N$. Returning to the recursion (5) and recalling that $K_N = Q$ with $Q > 0$, we can deduce that the $K_t$ are nondecreasing backwards in time, that is, $K_t \geq K_{t+1}$; when the system exhibits random behaviour ($\Sigma_{aa} \neq 0$, $\Sigma_{bb} \neq 0$), the $K_t$ are monotonically increasing backwards in time. As such, for an increasingly large planning horizon $N$, the $K_t$ become much larger than the constants $Q$ and $R$. The evolution of (5) in time is therefore approximated by taking $K_t$ much larger than $Q$ and $R$, equivalently setting $Q = 0$ and $R = 0$ in (5):

$$K_t \approx K_{t+1} E[a_t^2] - \frac{K_{t+1}^2 \left(E[a_t b_t]\right)^2}{K_{t+1} E[b_t^2]} = K_{t+1} \left(E[a_t^2] - \frac{\left(E[a_t b_t]\right)^2}{E[b_t^2]}\right)$$
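The recursion (5) is straightforward to iterate numerically. Below is a short sketch that solves (5) backwards and records the gains from (3)–(4); the function name and the numerical values in the usage example are my own, chosen for illustration.

```python
import numpy as np

def riccati_backward(Q, R, Ea2, Eb2, Eab, N):
    """Solve recursion (5) backwards from K_N = Q.

    Ea2 = E[a_t^2], Eb2 = E[b_t^2], Eab = E[a_t b_t].
    Returns K_0..K_N and the gains G_0..G_{N-1} from (3)-(4).
    """
    K = np.empty(N + 1)
    G = np.empty(N)
    K[N] = Q
    for t in range(N - 1, -1, -1):
        G[t] = -K[t + 1] * Eab / (R + K[t + 1] * Eb2)
        K[t] = Q + K[t + 1] * Ea2 - (K[t + 1] * Eab) ** 2 / (R + K[t + 1] * Eb2)
    return K, G

# Example with a_bar=1.1, b_bar=1.0, sig_aa=0.3, sig_bb=0.1, sig_ab=0:
K, G = riccati_backward(Q=1.0, R=1.0, Ea2=0.3 + 1.1**2, Eb2=0.1 + 1.0**2,
                        Eab=1.1, N=50)
print(K[0])  # optimal cost is J* = K_0 * x_0^2
```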

This approximation leads to a new insight into the optimal control problem, so let us examine it another way:

$$K_t \approx m K_{t+1} \qquad (6)$$

$$m = \Sigma_{aa} + \bar{a}^2 - \frac{\left(\Sigma_{ab} + \bar{a}\bar{b}\right)^2}{\Sigma_{bb} + \bar{b}^2}$$

The constant $m$ is known as the uncertainty threshold parameter. As we can see from (6), if $m \geq 1$ then $K_t$ grows without bound backwards in time, and exponentially so for $m > 1$. In this case the optimal cost grows exponentially with the planning horizon:

$$J^*_N \approx Q\, m^N x_0^2$$

As we can see, for a threshold parameter greater than or equal to one the optimal cost becomes unbounded as the planning horizon grows, so the paper concludes that the optimal control for the infinite-horizon case exists if and only if the uncertainty threshold parameter satisfies $m < 1$. This is the main result of the paper, but the authors investigate another notion of optimal control over the infinite horizon. Since it has been shown that the optimal cost for a system with $m \geq 1$ is unbounded, it is suggested that the optimal control for such a system is one which minimizes the rate of growth of the cost. For such systems it is shown that the control gain reaches a steady-state value, but that the variance of the state propagates according to

$$\Sigma_{xx}(t+1) \approx m\, \Sigma_{xx}(t)$$

The variability of the state for systems with $m \geq 1$ blows up, so there is no optimal control which will either minimize the cost function or minimize the rate of growth of the cost function for such systems.
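The threshold parameter is a one-line computation, and evaluating it is a quick sanity check before attempting an infinite-horizon design. A minimal sketch, continuing the assumed statistics from the earlier Python examples:

```python
def threshold(a_bar, b_bar, sig_aa, sig_bb, sig_ab):
    """Uncertainty threshold parameter m from (6)."""
    return sig_aa + a_bar**2 - (sig_ab + a_bar * b_bar) ** 2 / (sig_bb + b_bar**2)

# Below the threshold: K_0 stays bounded as N grows, so an
# infinite-horizon optimal control exists.
print(threshold(1.1, 1.0, 0.3, 0.1, 0.0))  # 0.41 < 1

# Increasing only the parameter uncertainty sig_aa pushes the same
# system over the threshold: the optimal cost blows up with N.
print(threshold(1.1, 1.0, 1.0, 0.1, 0.0))  # 1.11 >= 1
```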

Discussion

The system model in this paper treats the state and control parameters as purely random variables with known means and variances; this model immediately seems impractical and unrealizable from a physical point of view. A more practical system model is one for which these parameters are constant but unknown, which leads to a well-defined problem in adaptive stochastic control, outside the scope of this paper. The authors have attempted to justify the choice of white parameters by citing instances in economic systems where treating unknown parameters as purely random is advantageous in order to obtain a cautious estimate of the optimal control. Nevertheless, as impractical as the model may seem, it is well defined as a stochastic control problem.

With these reservations about the system model in mind, a natural question to ask is whether the results of this paper apply to other systems. The model used in this paper does not allow any learning of the parameters $a_t$ and $b_t$, though the case where learning is possible is often of more interest in stochastic control. I have examined the uncertainty threshold principle for a system where learning is possible, in which both parameters are modeled as archetypal AR(1) processes:

$$a_t = \rho a_{t-1} + \omega_{t-1}$$

The result for this system is similar to the result for system (1), except that the existence of an optimal control in the infinite-horizon case corresponds to the stationarity condition of the AR processes. It seems to be the case that for a system whose parameters are neither completely deterministic nor completely stochastic, the uncertainty threshold principle applies (proof pending); a short simulation sketch of this AR(1) parameter model is given below.
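The sketch below simulates the AR(1) parameter model just described. The function and its numerical values are assumptions of mine for illustration; the stationarity condition $|\rho| < 1$ is the standard one for an AR(1) process.

```python
import numpy as np

rng = np.random.default_rng(1)

def ar1_path(rho, sigma, steps, a0=1.0):
    """Simulate the parameter process a_t = rho * a_{t-1} + omega_{t-1},
    with omega ~ N(0, sigma^2)."""
    a = np.empty(steps)
    a[0] = a0
    for t in range(1, steps):
        a[t] = rho * a[t - 1] + sigma * rng.standard_normal()
    return a

# |rho| < 1: the parameter process is stationary, and (per the discussion
# above) an infinite-horizon optimal control can exist.
# |rho| >= 1: the parameter process itself is nonstationary.
print(ar1_path(rho=0.8, sigma=0.2, steps=5))
```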

This is not to say that every system has an uncertainty threshold level; on the contrary, consider the popular model with which we are familiar:

$$x_{t+1} = a x_t + b u_t + \omega_t \qquad (7)$$

where the term $\omega_t$ is the only random quantity. This system can be shown to have an optimal control strategy in the infinite-horizon case irrespective of the mean and variance of $\omega_t$. This particular case shows that there are stochastic processes that cannot exceed the uncertainty threshold level; nevertheless, the principle still applies. Consequently, the system (7) can be thought of as less random than the system (1) because of the inherent uncertainty in the structure of (1). Furthermore, the fact that system (1) fails to have an optimal control in some cases whereas system (7) always has one indicates that an additive noise term $\omega_t$ is a poor substitute for modeling uncertainty in the parameters $a_t$ and $b_t$ themselves.

This paper used a scalar LQG system as an example to prove that the optimal control strategy exists for the infinite-horizon case if and only if the system's uncertainty is below a quantifiable level; however, these results are applicable to a wider range of systems not discussed in the paper. The equivalent vector LQG case is an extension of the scalar system, and the results of this paper apply immediately to the vector case, though the uncertainty threshold parameter takes a different form. Nor is the uncertainty threshold principle limited to Gaussian systems; it applies to stochastic control problems in a very broad sense.

From a philosophical point of view, the problem of optimal control is a matter of making the best decision now based on some idea of what a system will do in the future. This paper indicates that if a system evolves with uncertainty above a quantifiable level, then there is no way of making a best decision now for the rest of the system's future. This notion is critical to stochastic control, and it applies to all dynamical systems that evolve with uncertainty.
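To close with a numerical illustration of the contrast between systems (1) and (7): the Riccati recursion associated with (7) never involves $\omega_t$ at all, since additive noise only contributes a constant term to the expected cost and leaves the gain untouched. This is a minimal sketch under assumed parameter values, not the paper's own computation.

```python
def riccati_additive(Q, R, a, b, N):
    """Riccati recursion for the additive-noise system (7).

    The noise omega_t never appears below, so K_t (and hence the gain)
    is the same whatever the mean and variance of omega_t.
    """
    K = Q
    for _ in range(N):
        K = Q + a**2 * K - (a * b * K) ** 2 / (R + b**2 * K)
    return K

# Converges even for an unstable open-loop pole a = 1.1, because b != 0
# makes the deterministic pair (a, b) controllable.
print(riccati_additive(Q=1.0, R=1.0, a=1.1, b=1.0, N=200))  # ~1.77
```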
