arxiv: v1 [cs.ro] 24 May 2017

Size: px
Start display at page:

Download "arxiv: v1 [cs.ro] 24 May 2017"

Transcription

1 A Near-Otimal Searation Princile for Nonlinear Stochastic Systems Arising in Robotic Path Planning and Control Mohammadhussein Rafieisakhaei 1, Suman Chakravorty 2 and P. R. Kumar 1 arxiv: v1 [cs.ro] 24 May 2017 Abstract We consider nonlinear stochastic systems that arise in ath lanning and control of mobile robots. As is tyical of almost all nonlinear stochastic systems, the otimally solving roblem is intractable. We rovide a design aroach which yields a tractable design that is quantifiably nearotimal. We exhibit a searation rincile under a small noise assumtion consisting of the otimal oen-loo design of nominal trajectory followed by an otimal feedback law to track this trajectory, which is different from the usual effort of searating estimation from control. As a corollary, we obtain a trajectory-otimized linear quadratic regulator design for stochastic nonlinear systems with Gaussian noise. I. INTRODUCTION Practical systems are often subject to inaccuracies that we model as noise. Planning for a stochastic system requires attention to the noise structure, available models and noise levels. Many robotic systems, in articular, mobile aerial and ground robots, are equied with noisy actuators that require feedback comensation or lanning ahead for a olicy that accounts for the random erturbations. Simly ignoring the noise and lanning for the unerturbed equivalent of the stochastic system can yield crucial errors leading to the failure in reaching the end-goal, or cause the system to fall into unsafe states. In a stochastic setting, the general roblem of sequential decision-making is formulated as a Markov Decision Problem (MDP) [1], [2]. The otimal solution of the stochastic control roblem can be obtained iteratively by value or olicy iteration methods to solve the Hamilton-Jacobi- Bellman equations [2]. Excet in secial cases, such as in a linear Gaussian environment, this involves discretization of the underlying saces [3]; an aroach whose scalability faces the curse of dimensionality [4]. As a result, they require a comutation time that is rovably exonential in the state dimension, in a real number based model of comlexity, without any assumtion that P NP [5]. Many aroaches have been roosed based on their tractability. Some rely on a searate design of the deterministic trajectory from the feedback olicy. Model Predictive Control (MPC)-based methods [6], [7], robust formulations [8], [9], and other designs that relate to the Pontryagin s *This material is based uon work artially suorted by NSF under Contract Nos. CNS and Science & Technology Center Grant CCF , the U.S. Army Research Office under Contract No. W911NF , and NPRP grant NPRP from the Qatar National Research Fund, a member of Qatar Foundation. 1 M. Rafieisakhaei and P. R. Kumar are with the Deartment of Electrical and Comuter Engineering, and 2 S. Chakravorty is with the Deartment of Aerosace Engineering, Texas A&M University, College Station, Texas, USA. {mrafieis, schakrav, rk@tamu.edu} Maximum Princile [10] are some of the methods that have been successfully used as surrogate design aroaches. Another oular aroach is utilizing Differential Dynamic Programing (DDP) [11] and DDP-based variations, such as the Stochastic DDP [12], ilqr and ilqg [13]. These methods rely on local linearizations of the cost function and the dynamics to the second order and roose iterative methods that attemt to find locally-otimal solutions in a tube around a nominal trajectory [13]. In this aer, we address the nonlinear stochastic control roblem and roose an architecture under which the searate design of an otimal oen-loo control sequence and a feedback olicy is near-otimal. In articular, we show that under a small noise assumtion, the searation into globallyotimal trajectory design and a globally-otimal feedback control law holds for a fully-observed nonlinear stochastic system. This result also sheds light on the conditions under which oular design aroaches based on the Maximum Princile may be globally ɛ-otimal. We quantify the first order stochastic error for small-noise levels based on Wentzell-Freidlin large-deviations theory. We thereby determine reach to a Trajectory-otimized Linear Quadratic Regulator (T-LQR) design for fully-observed nonlinear stochastic systems under Gaussian small-noise erturbations. In short, the design can be broken into two arts: i) an oen-loo otimal control roblem that designs the nominal trajectory of the LQR controller, which resects the nonlinearities as well as state and control constraints; ii) the design of an LQR olicy around the otimized nominal trajectory. The quality of the design is rigorously rovided by the main results of the aer. The organization of the aer is as follows. Section II rovides a brief background on Wentzell-Freidlin theory [14] and investigates its imlications regarding the linearization of a stochastic system couled with the usage of the Taylor theorem. Section III defines a general stochastic control roblem for a fully-observed system. Section IV rovides the main results by first analyzing the effect of feedback comensation on the linearization error, and then roviding the state and control error roagations along with robabilistic bounds based on the theory develoed in Section II. Section IV also rovides the first-order exected error of the stochastic cost function along with the searation result. Section V introduces the T-LQR design aroach. Finally, Section VI rovides a design based on T-LQR for a non-holonomic carlike robot and rovides numerical results on the roosed aroach to design.

2 II. SMALL RANDOM PERTURBATIONS OF A NON-LINEAR SYSTEM In this section, we discuss the theoretical background regarding the small noise erturbations of general dynamical systems. In articular, we discuss Wentzell-Freidlin theory on the small noise asymtotics of a erturbed system reresented by a general Stochastic Differential Equation (SDE). We consider a time-varying system as that is required for our design. A general discussion regarding large deviations of the trajectories of a erturbed system from that of its unerturbed counterarts and related theories can be found in [14] [22]. Probability sace: We consider a robability sace {Ω, F, P } with the random variables on a measurable sace (X, B), where X is a Euclidean sace with dimension of n x, n w or a smooth manifold in these saces, and B is the corresonding σ-algebra of Borel sets. Diffusion rocess: Let us consider a dynamical system with the following equation: dx ɛ t = b(t, X ɛ t)dt + ɛdw t, X ɛ 0 = x 0, (1) where b : R R nx R nx is a uniformly Lischitz continuous function, such that: b(t 1, x 1 ) b(t 2, x 2 ) K 1 x 1 x 2, (2) where x 1, x 2 R nx, t 1, t 2 [0, K], ɛ > 0, and K 1 > 0, {w t, t 0} is a Wiener rocess on R nw. Nominal unerturbed trajectory: Such a system can result from small random erturbations of the following timevarying ODE: ẋ t = b(t, x t ), (3) with initial condition x 0 = x 0 R nx. First order Taylor exansion: Using Taylor s theorem to obtain the first order linearization of the right hand side of the above system around the trajectory {x t } K results in the following: dx ɛ t =b(t, x t )dt+a t (X ɛ t x t )dt+ɛdw t +o( X ɛ t x t ), (4) where A t = x b(t, x) t,x t is the Jacobian matrix. Accuracy of linearization: Equation (4) states that if X ɛ t x t δ for all 0 t K, then, dx ɛ t =b(t, x t )dt+a t (X ɛ t x t )dt +ɛdw t + o(δ). (5) We will use the Wentzell-Freidlin theorem to calculate the robability that the aforesaid condition holds. In order to do that, we define the action functional for the family of rocesses defined in equation (1). Action functional [14]: For [T 1, T 2 ] [0, K], the action functional is defined as: S T1,T 2 (φ) := 1 T2 2ɛ 2 φ t b(t, φ t ) 2 dt, (6) T 1 for absolutely continuous φ, and is set to be equal to + for other φ C 0K (R nx ). Note that this defines the action functional for the (ɛ-deendent) family of rocesses given by the SDE (1), uniformly on the whole sace as ɛ 0. Theorem 1. Exonential Rate of Convergence Let: D be a domain in R nx, and denote its closure by cl(d); D denote the boundary of D; H D (t, x 0 )={φ C 0K (R nx ) : φ 0 = x 0, φ t D D}. Assume D = cl(d). Then, we have the following: lim ɛ 0 ɛ2 ln P x0 {X ɛ t D}= inf S 0t(φ), (7) φ H D (t,x 0) Theorem 2. Asymtotics of the Diffusion Process: Let: D t = cl(b c δ (x t )), the closure of the comlement of a ball with radius δ > 0 around the oint x t ; and τ ɛ = Min{t : X ɛ t D t }. Then, lim ɛ 0 ɛ2 ln P x0 {τ ɛ t} = inf S 0t (φ). (8) {φ:φ 0 =x 0, φ t x t >δ} Proof of these results can be found in [14], [15]. Thus, according to Theorem 1, for a given t, the robability as ɛ 0 of X ɛ t x t δ can be calculated as in equation (7). Note that this robability tends to zero exonentially for any fixed δ > 0 as ɛ 0. Moreover, from Theorem 2, the robability that the trajectory of X ɛ ever exits the tube of radius δ round the nominal trajectory in the time interval [0, t] also goes to zero exonentially at the same rate. (This also asserts that the likely aths to ever exit in [0, t] are those exiting at time t). This rovides the validity region of the linearized equation (4) and concludes our discussion in this section. III. THE FULLY OBSERVED SYSTEM The general stochastic control roblem of interest for fully observed system can be formulated as an otimization roblem in the sace of feedback olicies. In this section, we define the system equations and ose the general roblem. Without loss of generality, we consider the discrete-time version of the systems considered in the revious section and continue our analysis on that basis. Process model: We denote the state and control by x X R nx and u U R nu, resectively. The rocess model with f : X U X is defined as: x t+1 = f(x t, u t ) + ω t, ω t N (0, Σ ωt ) (9) where {ω t } is indeendent, identically distributed (i.i.d.). Now, we ose the general stochastic control roblem [1], [23]. Problem 1. Stochastic Control Problem for Fully Observed System: Given an initial state x 0, we wish to determine an otimal or near-otimal for min E[ c π t (x t, u t ) + c π π K(x K )] s.t. x t+1 = f(x t, u t ) + ω t, (10) where the otimization is over Markov, i.e., time-varying state-feedback, olicies, π Π, with π := {π 0,, π t }, π t : X U ; and u t = π t (x t ) secifying the action taken given the state; c π t (, ) : X U R is the one-ste cost function; c π K ( ) : X R denotes the terminal cost; K is the time horizon.

3 IV. SEPARATION OF OPEN LOOP AND CLOSED LOOP DESIGNS: FULLY OBSERVED SYSTEMS In this section, we rovide the theoretical basis for our design. The analysis emloys the Taylor series exansion of the rocess model and large deviations theory. A. Preliminary Analysis We start by roviding the nominal trajectory to linearize the rocess model. Then, we discuss the feedback law and comensate the rocess model with the feedback in order to use large deviations theory. Nominal Trajectory: We use the rocess model with zero noise to roagate the initial state, x 0, with a set of unknown controls {u t }, in order to obtain a arametrization of the feasible nominal trajectories as: x t+1 = f(x t, u t ), 0 t K 1, (11) where x 0 = x 0. Linearization of the rocess model: We linearize the rocess model of equation (9) around the nominal trajectory: x t+1 =A t x t +B t ũ t +ω t +o(e x,u t ), (12) where we have: A t (x t, u t ) = x f(x, u) x, denoted by A t; t,u t B t (x t, u t ) = u f(x, u) x, denoted by B t; t,u t x t := x t x t, the state error with resect to the nominal trajectory; ũ t := u t u t, the control error; and := x t + ũ t the error. e x,u t As the control inuts change, the underlying nominal trajectory also changes, and therefore the Jacobian matrices, A t, B t, and G t change, as well. The Taylor series exansion of equation (12) is valid as e x t 0, i.e., the linearized function remains close to the linearization region. In this equation, the only factor that can drive the linearized function away from the linearization region is the noise rocess ω t. Therefore, we establish robabilistic bounds on the validity of this equation using the small noise theory of Section II. Otimization over olicy sace: A feedback law with Linear Time-Varying (LTV) gain is sufficient to control a linearized model around a nominal trajectory. Therefore, we restrict the search to feedback olicies with LTV feedback gain, Π L. In the next section, we design a Linear Quadratic Regulator olicy (LQR) as a secial case for our design. Feedback controller: Assuming the controllability of the deterministic model of the system, we suose the existence of a feedback control law with LTV feedback gain to track and stabilize the trajectory of states around the nominaldesigned trajectory. Later, we exlain in detail how to design such a law. Thus, the control action error can be exressed as: ũ t = u t u t = L t (x t x t ), (13) where L t is the linear feedback gain. It is imortant to note that although we are working with the linearized system, the original system is a nonlinear system, and the design is tailored to work for the original system. Linearized system equation comensated with feedback: Relacing the feedback law in equation (12), we obtain: x t+1 =A t x t + B t ũ t + ω t + o(e x,u t ), =(A t B t L t ) x t + ω t + o(e x t ), =D t x t + ω t + o(e x t ), (14) where D t := A t B t L t, t 1 and e x,ω t := x t denotes the linearization-based error. Comensating the original system with feedback: Let us substitute for the control action in (9) using the feedback law of (13) as follows: x t+1 = f(x t, u t ) + ω t = f(x t, u t L t (x t x t )) + ω t. Using the last equation we define g : R X X, where g(t, x) =: f(x t, u t L t (x t x t )). (15) Note that the time-deendency for g stems from the timedeendency of the feedback law. Moreover, the nominal trajectory, {x t } K, satisfies the same equation as (11): x t+1 = g(t, x t ) = f(x t, u t L t (x t x t )) = f(x t, u t ). Note that linearizing g around the nominal trajectory yields (14), which itself is equivalent to equation (12) x g(t, x) t,x t (x t x t ) = x f(x, u t L t (x x t )) x t (x t x t ) = x f(x, u) x t,u t Lt(x x t ) (x t x t ) + u f(x, u) x t,u t Lt(x x t ) (u t L t (x x t )) x x (x t x t t ) = x f(x, u) x (x t,u t x t t ) + u f(x, u) x ( L t)(x t,u t t x t ) =A t (x t x t ) + B t ( L t )(x t x t ) = D t (x t x t ). Therefore, g(t, x t ) =D t (x t x t ) + ω t + o(e x t ), as e x,ω t 0. (16) Validity of the linearization: Let us analyze the validity of (12) using the Wentzell-Freidlin theory discussed in Section II. Let us assume that the noise rocess is ω t = ɛw t, where w t is a Wiener rocess as described in Section II, and ɛ > 0. Now, for a time-varying system, the robability that the error x t is less than a given δ > 0 can be calculated using large deviations theory. In articular, the discussion in Section II holds for rocess g. However, we require the function g to satisfy a uniform Lischitz continuity condition, for which uniform Lischitz continuity of rocess model f is sufficient. This is because, if f(x 1, u 1 ) f(x 2, u 2 ) K f ( x 1 x 2 + u 1 u 2 ), where x 1, x 2 R nx, and u 1, u 2 R nu, in addition to smoothness of the nominal trajectory (which is calculated as in (11)) on the interval [0, K], and we have the Lischitz continuity of g, as well. Effect of feedback on the linearization error: Note that before alying the feedback law, equation (9) deends on both u and ω. The influence of ω can be analyzed using large deviations theory; however, it is the feedback law that

4 limits the error of linearization caused by the control actions and converts the control action error into the state error. Moreover, the feedback effectively changes the drift term of the diffusion rocess and affects the validity region s robability through the action functional. B. Main Results In this section, we quantify the overall erformance obtained from the searated design. The roofs are rovided in the aendix. Lemma 1. State Error Proagation: Let ω t = ɛw t, where w t is a Gaussian rocess as described in section II, and ɛ > 0. Let the state error be x t = x t x t for t 0. Then, for t 0 the non-recursive state error roagation, x t+1, in terms of the indeendent variables, including rocess noise at each time ste can be written as follows: t x t+1 = D ω s,tω s + o(δ), as ɛ 0, (17) where we have: D 0 := A 0 D t1:t 2 = Π t2 t=t 1 D t, t 2 t 1 0, otherwise, it is the identity matrix; D ω s,t := D s+1:t, 0 s t 1, t 1; and D ω t,t := D t+1:t = I, t 0. The following lemma follows directly by taking into account the feedback law in the result of Lemma 1. Lemma 2. Control Error Proagation: Let ω t = ɛw t, where w t is a Gaussian rocess as described in section II, and ɛ > 0. Let the control error be ũ t = u t u t for t 0. Then, for t 0 the non-recursive control error roagation, ũ t+1, in terms of the indeendent variables, including rocess noise at each time ste can be written as follows: t ũ t+1 = L ω s,t+1ω s + o(δ), as ɛ 0, where L ω s,t+1 := L t+1 Dω s,t, t 0, t s 0. Moreover, the validity region of the above equation is the same as for (17) in Lemma 1. Next, we linearize of the cost function and rovide the searation result for a fully observed system. Linearization of the cost function: Using the Taylor aroximation around the nominal trajectories of state and control actions yields J = J + (C x t x t + C u t ũt) + C x K x K + o(e x,u ), (18) J 1 where we assume that the cost function is continuously differentiable. Moreover: J := c t(x t, u t )+c K (x K ) denotes the nominal cost; J 1 := J + (Cx t x t + C u t ũt) + C x K x K is the first order aroximation of the cost function; J1 := (Cx t x t + C u t ũt) + C x K x K is the first order error in the cost by our aroximation scheme. Therefore, J1 = J 1 J ; C x t = x c t (x, u) x C u t = u c t (x, u) x ; t,u t ; t,u t C x K = xc K (x) x ; and K e x,u J 1 error. := t=1 ( x t + ũ t ) + x K is the linearization Note that since the error term is in terms of state and control at all time stes, the robability of this equation holding true is equivalent to the robability of the latest time-ste term still being in the vicinity of the nominal trajectory at that ste. Therefore, the robability that this last equation is valid can be calculated as the robability that x K δ for δ > 0, which is given by equation (7) for rocess g defined in equation (15) and using D K = cl(b c δ (x K )) in Theorem 1. As a result, all the revious stes will remain within the same tube around the nominal trajectory and the total error will still be of the order of δ. Therefore, given this robability, we have: J = J + (C x t x t + C u t ũt) + C x K x K + o(δ), (19) ɛ 0. Hence, J J 1 = o(δ) as ɛ 0 with robability given in equation (7) for t = K. Next, we rovide the main result regarding the exected first order error of the cost function. Theorem 3. First Order Cost Function Error: Let us denote the first order cost function error by J 1. Given that rocess noises are zero mean i.i.d., under a first-order aroximation for the small noise aradigm, the stochastic cost function is dominated by the nominal art of the cost function. Moreover the exected first-order error is zero. That is, E[ J 1 ] = 0. Moreover, if the rocess noise at each time ste is distributed according to a zero mean Gaussian distribution, then J 1 also has a zero mean Gaussian distribution. The above result says that the random erturbation in the stochastic running cost form the nominal is zero mean if the linearization holds. From Wentzell-Freidlin theory, we have already established that the linearization holds with a robability exonentially close to 1 as ɛ 0. Hence, this imlies that the exected stochastic cost is equal to the nominal cost with a very high robability as ɛ 0. Therefore, it follows that the oen loo nominal design can be done searately from the closed loo design, summarized bellow: Corollary 1. Searation of the Closed Loo and Oen Design Under Small Noise Based on Theorem 3, under the small noise aradigm, as ɛ 0, the design of the feedback law can be done searately from the design of the oen loo otimized trajectory. Furthermore, this result holds with a robability that exonentially tends to one as ɛ 0.

5 Remark: This result means that under a small noise assumtion and assuming the existence of a feedback law (with LTV gain, which is designed searately), the oen loo nominal trajectory of the system can be designed by relacing the stochastic equations with their nominal counterarts. This design tends to the otimal design with robability one (for the general class of Gaussian rocesses that are considered) as the intensity of noise tends to zero. Remark: It should be mentioned that while our general roblem definition has only the rocess model as dynamics, other constraints on state or control can be considered as long as they share the same smoothness roerties as the cost function. Remark: It is worth mentioning that although we have considered diffusion rocesses with additive white Gaussian noise, the theory in fact holds for a larger class of roblems. On can aeal to more general results in [15] for time-inhomogeneous diffusion rocesses with non-additive white noise. In such cases, the action functional is usually calculated through the Legendre transform. Remark: As mentioned before, although we roved the results of this section for discrete time systems, one can rove the continuous-time versions of our results. This can be done, for instance, by reducing the samling time and limiting it to zero, while utilizing results such as Fubini s theorem along with the similar conditional exectation theorem on Itô s stochastic integrals to exchange the integrations with the exectation. It should be mentioned that there also exists a discrete-time counterart of the Wentzell-Freidlin theory as rovided in [15]. Remark: Higher order designs and analysis of the cost function (or even the dynamics) are ossible using a similar aroach rovided in this aer. Remark: In Ref. [17], for a secial case of nonlinear systems where the rocess model is linear in the control variable, i.e., f(x t, u t ) = f 1 (x t ) + f 2 (x t )u t, three results are roven. The first result, concerns the ɛ-otimality of the otimal deterministic law under convexity of J in the control (i.e., v T ( u,u J)v 0, v), and additional smoothness and regularity conditions. The second result concerns the ɛ 2 - otimality of the otimal deterministic law under a stronger convexity condition of J in the control (i.e., v T ( u,u J)v c( u ) v 2, v, c( ) : R R is a monotonically nonincreasing ositive function), and some smoothness and regularity conditions. The third result concerns the ɛ-otimality of the otimal deterministic sequence under the latter condition. Our result, on the other hand, rovides the ɛ-otimality of the roosed design aroach for a broader class of rocesses f(x t, u t ) with nonlinear deendence in the control variable and more general cost functions (most imortantly, does not assume the linear deendence on the control sequence). In fact, our simulations are erformed for a car-like robot with nonlinear deendence on the control variables. V. T-LQR: TRAJECTORY-OPTIMIZED LQR In this section, we rovide a design scheme based on the theory rovided in the revious sections. This aroach aims at designing an LQR controller with an otimal nominal underlying trajectory based on the searation result of Corollary 1 and Theorem 3. As a result, we term this method as the Trajectory-otimized LQR (T-LQR). Problem 2. Trajectory Planning Problem: Solve for the otimal trajectory: min u 0: c(x t, u t ) + c K (x K ) s.t. x t+1 = f(x t, u t ), 0 t K 1, x 0 = x 0. (20a) (20b) Otimized nominal trajectory: Problem 2 is a deterministic roblem aiming for the best nominal erformance. This roblem utilizes the first order aroximation of the cost function and otimizes the underlying nominal trajectory used in the design of the feedback law. We will denote the resulting otimized nominal trajectory of roblem 2 by {x o t } K, {u o t }. Feedback control: The resulting trajectory from the otimization roblem is otimized in terms of control effort and other constraints, such as a terminal constraint. Now, using the searation result, an LQR controller is designed to track the otimized nominal trajectory. Therefore, the LQR cost is designed for the tracking error x t x o t. The resulting control olicy is a feedback olicy with LTV gain, and the evolution of x t is obtained from the original equation of the rocess model during the execution. Although we utilize an LQR controller, it is imortant to note that the searation result only assumes a linear form of feedback and other tyes of designs [24] can be used as well. Linearization of system equations: For simlicity, we denote the Jacobian matrices and every other variable associated with the otimized nominal trajectory with a suerscrit o. The Jacobians are A o t = x f(x, u) x o t,u o, and Bo t t = u f(x, u) x o t,u o. t Problem 3. LQR Problem: Given the otimized nominal trajectory as {x o t } K and {u o t }, and a lanning horizon of K > 0, solve the following LQR roblem to track the nominal trajectory: K min [(x t x o u t ) T Wt x (x t x o t ) + (ũ o t 1) T Wt u ũo t 1] 0: t=1 s.t. x o t+1 = A o t x o t + B o t u o t, 0 t K 1 (21) where ũ o t = u t u o t and W u t, W x t 0 are ositive-definite matrices. Control olicy: The resulting control olicy of roblem 3 is a feedback olicy as follows [1]: ũ o t = L o t (x t x o t ), where the linear feedback gain L o t is: L o t = (W u t + (B o t ) T P f t+1 Bo t ) 1 (B o t ) T P f t+1 Ao t, and the matrix P f t is the result of backward iteration of the dynamic Riccati equation P f t 1 = (Ao t ) T P f t A o t

6 (a) Otimized trajectory of roblem(b) A tyical ground truth trajectory 2. with noise standard deviation equal to 10% of the maximum control signal. Fig. 1. Otimized vs. a tyical execution trajectory for a car-like robot. (A o t ) T P f t B o t (W u t + (B o t ) T P f t B o t ) 1 (B o t ) T P f t A o t +W x t, which is solvable with a terminal condition P f K = Wx t. Remark: The comutations involved in roblem 2 is of the order of O(Kn 2 x) for tyically smooth dynamics for one iteration. Let us assume O(l) is the order of the number of iterations in the otimizer until convergence. The LQR olicy calculation is of order of O(Kn 3 x). Therefore, overall, the design aroach based on the searation rincile of Corollary 1 is O(lKn 2 x + Kn 3 x) for a tyical rocess model (such as our examle in the next section). The low comutational comlexity of this aroach results in fast relanning in case of deviations during execution. This renders the first scheme to be eminently imlementable for imlementation in on-line alications. Remark: For the secific class of roblems considered in [17] (see the last remark in Section IV) the design aroach of [17] requires calculation of the otimal control law through intractable dynamic rogramming. In contrast, the roosed design aroach in this aer utilizes the tractable solution of Maximum Princile roblem followed by an LQR design. Even imlementing the result of [17] through a model redictive aroach would require more comutations of at least an order of the lanning horizon (from O(K) to O(K 2 )). In such an imlementation, the online comutations of the aroach of [17] require O(lKn 2 x) calculations comared to only O(n 2 x) calculations in our algorithm. VI. EXAMPLE Let us consider a car-like four-wheel robot with rocess model [25]: v ẋ = v cos(θ), ẏ = v sin(θ), θ = tan(φ), (22) L where (x, y, θ) is the state, and (v, φ) is the control inut. We suose that, φ < φ max = π/2, v v max = 0.6, x 0 = ( 1.5, 0.5, 0), K = 20, and the time discretization eriod is 0.7. We incororate the control constraints and the terminal goal, x g = ( 0.5, 1, 0), in the cost function. Last, the initial control sequence used for the otimization is just a sequence of zero inuts. The rocess noise is additive mean zero Gaussian noise with a standard deviation equal to ɛ max t { u t 2 }. Figure 1a shows the result of the otimization roblem 2 whereas Fig. 1b shows a tyical ground truth trajectory with ɛ = 0.1. We have used MATLAB (a) Feedback-comensated system. (b) Oen-loo system. Fig. 2. Evolution of average NMSE as ɛ 0 for a feedback comensated and oen loo system with the same nominal trajectories. 2016b and its fmincon solver for simulations. In the next exeriment, we increase ɛ from to , in ste sizes of For each value of ɛ, we execute the resulting olicy 100 times and comute the average Normalized Mean Squared Error (NMSE) as: Average NMSE (%) = x x j x 2 100, (23) j=1 2 where x indicates the lanned trajectory and x j indicates the ground truth trajectory at jth exeriment. The results of this exeriment are shown in Fig. 2a, where the evolution of the average NMSE is deicted for various values of noise level ɛ. As indicated in this figure, as ɛ 0, the average NMSE tends to zero at an exonential rate, which is consistent with the theory develoed in Section II. Moreover, this figure indicates that through the feedback comensation, moderate noise levels can be tolerated, rather than just small levels. Last, Fig. 2b deicts the evolution of the average NMSE for an exeriment with the same setting as in Fig. 2a, excet that only the oen-loo lanned control sequence is alied during execution. As redicted by the theory, the error still decreases exonentially as the noise level decreases. However, the rate of convergence is about one-fifth of the revious rate. The results of Fig. 2 show that our design can be used for relatively moderate levels of noise, using the ower of feedback. Remark: In ractice, if at any oint in the execution the calculated error exceeds a threshold, very raid relanning can be triggered very fast due to the low comutational burden of the otimization roblem. VII. CONCLUSION We have resented a design aroach that searates the design of the oen-loo nominal trajectory and the closedloo feedback olicy for fully-observed nonlinear stochastic systems with Gaussian distributions. We have shown that under a small-noise assumtion, the stochastic cost function is dominated by the nominal art of the cost function and the exected first order linearization error is of mean zero. This results in a reliable raid lanning method that is rovably near-otimal. It can be used in robotic ath lanning and control, and otentially in other alications.

7 REFERENCES [1] P. R. Kumar and P. P. Varaiya, Stochastic Systems: Estimation, Identification, and Adative Control. Englewood Cliffs, NJ: Prentice- Hall, [2] D. P. Bertsekas, D. P. Bertsekas, D. P. Bertsekas, and D. P. Bertsekas, Dynamic rogramming and otimal control. Athena Scientific Belmont, MA, 1995, vol. 1, no. 2. [3] H. Kushner and P. G. Duuis, Numerical methods for stochastic control roblems in continuous time. Sringer Science & Business Media, 2013, vol. 24. [4] R. Bellman, Dynamic Programming, 1st ed. Princeton, NJ, USA: Princeton University Press, [5] C.-S. Chow and J. N. Tsitsiklis, The comlexity of dynamic rogramming, Journal of comlexity, vol. 5, no. 4, , [6] D. Mayne, Robust and stochastic mc: Are we going in the right direction? IFAC-PaersOnLine, vol. 48, no. 23,. 1 8, [7] D. Q. Mayne, Model redictive control: Recent develoments and future romise, Automatica, vol. 50, no. 12, , [8] J. N. Tsitsiklis, Comutational comlexity in markov decision theory, HERMIS-An International Journal of Comuter Mathematics and its Alications, vol. 9, , [9] Y. Le Tallec, Robust, risk-sensitive, and data-driven control of markov decision rocesses, Ph.D. dissertation, Massachusetts Institute of Technology, [10] R. E. Ko, Pontryagin maximum rincile, Mathematics in Science and Engineering, vol. 5, , [11] D. H. Jacobson and D. Q. Mayne, Differential dynamic rogramming, [12] E. Theodorou, Y. Tassa, and E. Todorov, Stochastic differential dynamic rogramming, in American Control Conference (ACC), IEEE, 2010, [13] E. Todorov and W. Li, A generalized iterative lqg method for locallyotimal feedback control of constrained nonlinear stochastic systems, in American Control Conference, Proceedings of the IEEE, 2005, [14] M. I. Freidlin and A. D. Wentzell, Random Perturbations. New York, NY: Sringer US, 1984, [15] A. D. Wentzell, Limit theorems on large deviations for Markov stochastic rocesses. Sringer Science & Business Media, 2012, vol. 38. [16] A. Dembo and O. Zeitouni, Large deviations techniques and alications. Sringer Science & Business Media, 2009, vol. 38. [17] W. H. Fleming, Stochastic control for small noise intensities, SIAM Journal on Control, vol. 9, no. 3, , [18] H. Cruz-Suárez and R. Ilhuicatzi-Roldán, Stochastic otimal control for small noise intensities: The discrete-time case, WSEAS Trans. Math., vol. 9, no. 2, , Feb [19] J. D. Perkins and R. W. H. Sargent, Nonlinear otimal stochastic control some aroximations when the noise is small. Berlin, Heidelberg: Sringer Berlin Heidelberg, 1976, [20] J. Perkins and R. Sargent, Nonlinear otimal stochastic controlsome aroximations when the noise is small, in IFIP Technical Conference on Otimization Techniques. Sringer, 1975, [21] C. J. Holland, An aroximation technique for small noise oen-loo control roblems, Otimal Control Alications and Methods, vol. 2, no. 1. [22] S. S. Varadhan and S. S. Varadhan, Large deviations and alications. SIAM, 1984, vol. 46. [23] D. Bertsekas, Dynamic Programming and Otimal Control: 3rd Ed. Athena Scientific, [24] P. Kumar et al., Control: a ersective, Automatica, vol. 50, no. 1,. 3 43, [25] S. Lavalle, Planning algorithms. Cambridge University Press, APPENDIX Proof. Lemma 1: State Error Proagation Ignoring the validity region, x t+1 =A t x t + B t ũ t + ω t = (A t B t L t ) x t + ω t =:D t x t +ω t =: D t t 0:t x 0 + D r+1:t ω r =: D ω s,tω s. r=0 Note that using the definition of x t, the initial state error is x 0 = x 0 x 0 = x 0 x 0 = 0. Likewise, the state error at time-ste 1 is x 1 = A 0 x 0 +ω 0 = ω 0. Moreover, these errors are consistent with the lemma using the definitions rovided and the indicator function notation. Now, since this equation utilizes the linearizations at all stes, its error is within o(δ), if x s δ for all s t. Moreover, the robability that equation (17) is valid (i.e., the linearizations are valid with o(δ) error for the entire trajectory u to time t) is the same as the robability that the linearization is valid on the last ste (i.e., ste t). This is due to Wentzell-Freidlin theory. Now, the robability that x t δ is given by (7) for rocess g defined in (15), and D t = cl(b δ (x t )) for Theorem 1. Therefore, as ɛ 0, the robability of x t x t δ is calculated as in equation (7), which tends exonentially to zero. Last, note that through Wentzell-Freidlin theory, the validity of linearization only deends on the aggregated effect of the random erturbations at stes rior to t, and there is no need to individually bound the noise at each ste. Proof. Lemma 2, Control Error Proagation Relacing state error in the control law: Using the result of Lemma 1, we can rewrite ũ t+1 for t 1 as follows: t t ũ t+1 = L t+1 x t+1 = L t+1 D ω s,tω s =: L ω s,t+1ω s. Note that ũ 0 = 0, and the last formula is consistent with this error using the definitions rovided in the lemma. Proof. Theorem 3, Cost Function Error Using the linearization rocess described reviously, we can write the cost function error as E[ J 1 ] = E[ (Cx t x t+c u t ũt)+c x K x K]. Utilizing the assumtion that the rocess noise is zero mean i.i.d., E[ω t ] = 0 for all t. Moreover, x 0 = 0 which follows from the fact that x 0 = x 0. Therefore, using the linearity of the exectation oerator and Lemmas 1 and 2, we can rewrite E[ J 1 ] as follows: E[ J 1 ]= (C x t E[ x t ] + C u t E[ũ t ]) + C x KE[ x K ] = t 1 C x t E[ D ω s,t 1ω s ]+ + C x KE[ D ω s,ω s ] t 1 C u t E[ L ω s,tω s ] t 1 = E[(C x D t ω s,t 1 C u t L ω s,t)ω s ]+ E[C x D K ω s,ω s ] t 1 t 1 n u K K =: E[(w s,t ) T ω s ] = ws,te[ω j s] j = 0. j=1 where w s,t := (C x D t ω s,t 1 C u t L ω s,t) T, t 1 s 0, K 1 t 0, w s,k := (C x D K ω s, )T, K 1 s 0. Moreover, w s,t := (ws,t, 1, ws,t nu ) T is a vector of the same size of ω s = (ωs, 1, ωs nu ) T.

Feedback-error control

Feedback-error control Chater 4 Feedback-error control 4.1 Introduction This chater exlains the feedback-error (FBE) control scheme originally described by Kawato [, 87, 8]. FBE is a widely used neural network based controller

More information

STABILITY ANALYSIS AND CONTROL OF STOCHASTIC DYNAMIC SYSTEMS USING POLYNOMIAL CHAOS. A Dissertation JAMES ROBERT FISHER

STABILITY ANALYSIS AND CONTROL OF STOCHASTIC DYNAMIC SYSTEMS USING POLYNOMIAL CHAOS. A Dissertation JAMES ROBERT FISHER STABILITY ANALYSIS AND CONTROL OF STOCHASTIC DYNAMIC SYSTEMS USING POLYNOMIAL CHAOS A Dissertation by JAMES ROBERT FISHER Submitted to the Office of Graduate Studies of Texas A&M University in artial fulfillment

More information

System Reliability Estimation and Confidence Regions from Subsystem and Full System Tests

System Reliability Estimation and Confidence Regions from Subsystem and Full System Tests 009 American Control Conference Hyatt Regency Riverfront, St. Louis, MO, USA June 0-, 009 FrB4. System Reliability Estimation and Confidence Regions from Subsystem and Full System Tests James C. Sall Abstract

More information

Asymptotically Optimal Simulation Allocation under Dependent Sampling

Asymptotically Optimal Simulation Allocation under Dependent Sampling Asymtotically Otimal Simulation Allocation under Deendent Samling Xiaoing Xiong The Robert H. Smith School of Business, University of Maryland, College Park, MD 20742-1815, USA, xiaoingx@yahoo.com Sandee

More information

State Estimation with ARMarkov Models

State Estimation with ARMarkov Models Deartment of Mechanical and Aerosace Engineering Technical Reort No. 3046, October 1998. Princeton University, Princeton, NJ. State Estimation with ARMarkov Models Ryoung K. Lim 1 Columbia University,

More information

Sums of independent random variables

Sums of independent random variables 3 Sums of indeendent random variables This lecture collects a number of estimates for sums of indeendent random variables with values in a Banach sace E. We concentrate on sums of the form N γ nx n, where

More information

Robust Predictive Control of Input Constraints and Interference Suppression for Semi-Trailer System

Robust Predictive Control of Input Constraints and Interference Suppression for Semi-Trailer System Vol.7, No.7 (4),.37-38 htt://dx.doi.org/.457/ica.4.7.7.3 Robust Predictive Control of Inut Constraints and Interference Suression for Semi-Trailer System Zhao, Yang Electronic and Information Technology

More information

Research Article An iterative Algorithm for Hemicontractive Mappings in Banach Spaces

Research Article An iterative Algorithm for Hemicontractive Mappings in Banach Spaces Abstract and Alied Analysis Volume 2012, Article ID 264103, 11 ages doi:10.1155/2012/264103 Research Article An iterative Algorithm for Hemicontractive Maings in Banach Saces Youli Yu, 1 Zhitao Wu, 2 and

More information

arxiv: v1 [quant-ph] 20 Jun 2017

arxiv: v1 [quant-ph] 20 Jun 2017 A Direct Couling Coherent Quantum Observer for an Oscillatory Quantum Plant Ian R Petersen arxiv:76648v quant-h Jun 7 Abstract A direct couling coherent observer is constructed for a linear quantum lant

More information

Combining Logistic Regression with Kriging for Mapping the Risk of Occurrence of Unexploded Ordnance (UXO)

Combining Logistic Regression with Kriging for Mapping the Risk of Occurrence of Unexploded Ordnance (UXO) Combining Logistic Regression with Kriging for Maing the Risk of Occurrence of Unexloded Ordnance (UXO) H. Saito (), P. Goovaerts (), S. A. McKenna (2) Environmental and Water Resources Engineering, Deartment

More information

Robustness of classifiers to uniform l p and Gaussian noise Supplementary material

Robustness of classifiers to uniform l p and Gaussian noise Supplementary material Robustness of classifiers to uniform l and Gaussian noise Sulementary material Jean-Yves Franceschi Ecole Normale Suérieure de Lyon LIP UMR 5668 Omar Fawzi Ecole Normale Suérieure de Lyon LIP UMR 5668

More information

Recursive Estimation of the Preisach Density function for a Smart Actuator

Recursive Estimation of the Preisach Density function for a Smart Actuator Recursive Estimation of the Preisach Density function for a Smart Actuator Ram V. Iyer Deartment of Mathematics and Statistics, Texas Tech University, Lubbock, TX 7949-142. ABSTRACT The Preisach oerator

More information

On split sample and randomized confidence intervals for binomial proportions

On split sample and randomized confidence intervals for binomial proportions On slit samle and randomized confidence intervals for binomial roortions Måns Thulin Deartment of Mathematics, Usala University arxiv:1402.6536v1 [stat.me] 26 Feb 2014 Abstract Slit samle methods have

More information

arxiv: v1 [physics.data-an] 26 Oct 2012

arxiv: v1 [physics.data-an] 26 Oct 2012 Constraints on Yield Parameters in Extended Maximum Likelihood Fits Till Moritz Karbach a, Maximilian Schlu b a TU Dortmund, Germany, moritz.karbach@cern.ch b TU Dortmund, Germany, maximilian.schlu@cern.ch

More information

An analytical approximation method for the stabilizing solution of the Hamilton-Jacobi equation based on stable manifold theory

An analytical approximation method for the stabilizing solution of the Hamilton-Jacobi equation based on stable manifold theory Proceedings of the 27 American Control Conference Marriott Marquis Hotel at Times Square New York City, USA, July -3, 27 ThA8.4 An analytical aroimation method for the stabilizing solution of the Hamilton-Jacobi

More information

NONLINEAR OPTIMIZATION WITH CONVEX CONSTRAINTS. The Goldstein-Levitin-Polyak algorithm

NONLINEAR OPTIMIZATION WITH CONVEX CONSTRAINTS. The Goldstein-Levitin-Polyak algorithm - (23) NLP - NONLINEAR OPTIMIZATION WITH CONVEX CONSTRAINTS The Goldstein-Levitin-Polya algorithm We consider an algorithm for solving the otimization roblem under convex constraints. Although the convexity

More information

2-D Analysis for Iterative Learning Controller for Discrete-Time Systems With Variable Initial Conditions Yong FANG 1, and Tommy W. S.

2-D Analysis for Iterative Learning Controller for Discrete-Time Systems With Variable Initial Conditions Yong FANG 1, and Tommy W. S. -D Analysis for Iterative Learning Controller for Discrete-ime Systems With Variable Initial Conditions Yong FANG, and ommy W. S. Chow Abstract In this aer, an iterative learning controller alying to linear

More information

Analysis of some entrance probabilities for killed birth-death processes

Analysis of some entrance probabilities for killed birth-death processes Analysis of some entrance robabilities for killed birth-death rocesses Master s Thesis O.J.G. van der Velde Suervisor: Dr. F.M. Sieksma July 5, 207 Mathematical Institute, Leiden University Contents Introduction

More information

Estimation of the large covariance matrix with two-step monotone missing data

Estimation of the large covariance matrix with two-step monotone missing data Estimation of the large covariance matrix with two-ste monotone missing data Masashi Hyodo, Nobumichi Shutoh 2, Takashi Seo, and Tatjana Pavlenko 3 Deartment of Mathematical Information Science, Tokyo

More information

Quantitative estimates of propagation of chaos for stochastic systems with W 1, kernels

Quantitative estimates of propagation of chaos for stochastic systems with W 1, kernels oname manuscrit o. will be inserted by the editor) Quantitative estimates of roagation of chaos for stochastic systems with W, kernels Pierre-Emmanuel Jabin Zhenfu Wang Received: date / Acceted: date Abstract

More information

Age of Information: Whittle Index for Scheduling Stochastic Arrivals

Age of Information: Whittle Index for Scheduling Stochastic Arrivals Age of Information: Whittle Index for Scheduling Stochastic Arrivals Yu-Pin Hsu Deartment of Communication Engineering National Taiei University yuinhsu@mail.ntu.edu.tw arxiv:80.03422v2 [math.oc] 7 Ar

More information

Improved Capacity Bounds for the Binary Energy Harvesting Channel

Improved Capacity Bounds for the Binary Energy Harvesting Channel Imroved Caacity Bounds for the Binary Energy Harvesting Channel Kaya Tutuncuoglu 1, Omur Ozel 2, Aylin Yener 1, and Sennur Ulukus 2 1 Deartment of Electrical Engineering, The Pennsylvania State University,

More information

Information collection on a graph

Information collection on a graph Information collection on a grah Ilya O. Ryzhov Warren Powell February 10, 2010 Abstract We derive a knowledge gradient olicy for an otimal learning roblem on a grah, in which we use sequential measurements

More information

Robust Solutions to Markov Decision Problems

Robust Solutions to Markov Decision Problems Robust Solutions to Markov Decision Problems Arnab Nilim and Laurent El Ghaoui Deartment of Electrical Engineering and Comuter Sciences University of California, Berkeley, CA 94720 nilim@eecs.berkeley.edu,

More information

Elementary Analysis in Q p

Elementary Analysis in Q p Elementary Analysis in Q Hannah Hutter, May Szedlák, Phili Wirth November 17, 2011 This reort follows very closely the book of Svetlana Katok 1. 1 Sequences and Series In this section we will see some

More information

t 0 Xt sup X t p c p inf t 0

t 0 Xt sup X t p c p inf t 0 SHARP MAXIMAL L -ESTIMATES FOR MARTINGALES RODRIGO BAÑUELOS AND ADAM OSȨKOWSKI ABSTRACT. Let X be a suermartingale starting from 0 which has only nonnegative jums. For each 0 < < we determine the best

More information

Distributed Rule-Based Inference in the Presence of Redundant Information

Distributed Rule-Based Inference in the Presence of Redundant Information istribution Statement : roved for ublic release; distribution is unlimited. istributed Rule-ased Inference in the Presence of Redundant Information June 8, 004 William J. Farrell III Lockheed Martin dvanced

More information

Algorithms for Air Traffic Flow Management under Stochastic Environments

Algorithms for Air Traffic Flow Management under Stochastic Environments Algorithms for Air Traffic Flow Management under Stochastic Environments Arnab Nilim and Laurent El Ghaoui Abstract A major ortion of the delay in the Air Traffic Management Systems (ATMS) in US arises

More information

Positivity, local smoothing and Harnack inequalities for very fast diffusion equations

Positivity, local smoothing and Harnack inequalities for very fast diffusion equations Positivity, local smoothing and Harnack inequalities for very fast diffusion equations Dedicated to Luis Caffarelli for his ucoming 60 th birthday Matteo Bonforte a, b and Juan Luis Vázquez a, c Abstract

More information

Multi-Operation Multi-Machine Scheduling

Multi-Operation Multi-Machine Scheduling Multi-Oeration Multi-Machine Scheduling Weizhen Mao he College of William and Mary, Williamsburg VA 3185, USA Abstract. In the multi-oeration scheduling that arises in industrial engineering, each job

More information

On Isoperimetric Functions of Probability Measures Having Log-Concave Densities with Respect to the Standard Normal Law

On Isoperimetric Functions of Probability Measures Having Log-Concave Densities with Respect to the Standard Normal Law On Isoerimetric Functions of Probability Measures Having Log-Concave Densities with Resect to the Standard Normal Law Sergey G. Bobkov Abstract Isoerimetric inequalities are discussed for one-dimensional

More information

LINEAR SYSTEMS WITH POLYNOMIAL UNCERTAINTY STRUCTURE: STABILITY MARGINS AND CONTROL

LINEAR SYSTEMS WITH POLYNOMIAL UNCERTAINTY STRUCTURE: STABILITY MARGINS AND CONTROL LINEAR SYSTEMS WITH POLYNOMIAL UNCERTAINTY STRUCTURE: STABILITY MARGINS AND CONTROL Mohammad Bozorg Deatment of Mechanical Engineering University of Yazd P. O. Box 89195-741 Yazd Iran Fax: +98-351-750110

More information

The non-stochastic multi-armed bandit problem

The non-stochastic multi-armed bandit problem Submitted for journal ublication. The non-stochastic multi-armed bandit roblem Peter Auer Institute for Theoretical Comuter Science Graz University of Technology A-8010 Graz (Austria) auer@igi.tu-graz.ac.at

More information

Equivalence of Wilson actions

Equivalence of Wilson actions Prog. Theor. Ex. Phys. 05, 03B0 7 ages DOI: 0.093/te/tv30 Equivalence of Wilson actions Physics Deartment, Kobe University, Kobe 657-850, Jaan E-mail: hsonoda@kobe-u.ac.j Received June 6, 05; Revised August

More information

Research Article Controllability of Linear Discrete-Time Systems with Both Delayed States and Delayed Inputs

Research Article Controllability of Linear Discrete-Time Systems with Both Delayed States and Delayed Inputs Abstract and Alied Analysis Volume 203 Article ID 97546 5 ages htt://dxdoiorg/055/203/97546 Research Article Controllability of Linear Discrete-Time Systems with Both Delayed States and Delayed Inuts Hong

More information

4. Score normalization technical details We now discuss the technical details of the score normalization method.

4. Score normalization technical details We now discuss the technical details of the score normalization method. SMT SCORING SYSTEM This document describes the scoring system for the Stanford Math Tournament We begin by giving an overview of the changes to scoring and a non-technical descrition of the scoring rules

More information

ESTIMATION OF THE OUTPUT DEVIATION NORM FOR UNCERTAIN, DISCRETE-TIME NONLINEAR SYSTEMS IN A STATE DEPENDENT FORM

ESTIMATION OF THE OUTPUT DEVIATION NORM FOR UNCERTAIN, DISCRETE-TIME NONLINEAR SYSTEMS IN A STATE DEPENDENT FORM Int. J. Al. Math. Comut. Sci. 2007 Vol. 17 No. 4 505 513 DOI: 10.2478/v10006-007-0042-z ESTIMATION OF THE OUTPUT DEVIATION NORM FOR UNCERTAIN DISCRETE-TIME NONLINEAR SYSTEMS IN A STATE DEPENDENT FORM PRZEMYSŁAW

More information

Information collection on a graph

Information collection on a graph Information collection on a grah Ilya O. Ryzhov Warren Powell October 25, 2009 Abstract We derive a knowledge gradient olicy for an otimal learning roblem on a grah, in which we use sequential measurements

More information

MATHEMATICAL MODELLING OF THE WIRELESS COMMUNICATION NETWORK

MATHEMATICAL MODELLING OF THE WIRELESS COMMUNICATION NETWORK Comuter Modelling and ew Technologies, 5, Vol.9, o., 3-39 Transort and Telecommunication Institute, Lomonosov, LV-9, Riga, Latvia MATHEMATICAL MODELLIG OF THE WIRELESS COMMUICATIO ETWORK M. KOPEETSK Deartment

More information

On the capacity of the general trapdoor channel with feedback

On the capacity of the general trapdoor channel with feedback On the caacity of the general tradoor channel with feedback Jui Wu and Achilleas Anastasooulos Electrical Engineering and Comuter Science Deartment University of Michigan Ann Arbor, MI, 48109-1 email:

More information

An Analysis of Reliable Classifiers through ROC Isometrics

An Analysis of Reliable Classifiers through ROC Isometrics An Analysis of Reliable Classifiers through ROC Isometrics Stijn Vanderlooy s.vanderlooy@cs.unimaas.nl Ida G. Srinkhuizen-Kuyer kuyer@cs.unimaas.nl Evgueni N. Smirnov smirnov@cs.unimaas.nl MICC-IKAT, Universiteit

More information

A Qualitative Event-based Approach to Multiple Fault Diagnosis in Continuous Systems using Structural Model Decomposition

A Qualitative Event-based Approach to Multiple Fault Diagnosis in Continuous Systems using Structural Model Decomposition A Qualitative Event-based Aroach to Multile Fault Diagnosis in Continuous Systems using Structural Model Decomosition Matthew J. Daigle a,,, Anibal Bregon b,, Xenofon Koutsoukos c, Gautam Biswas c, Belarmino

More information

Probability Estimates for Multi-class Classification by Pairwise Coupling

Probability Estimates for Multi-class Classification by Pairwise Coupling Probability Estimates for Multi-class Classification by Pairwise Couling Ting-Fan Wu Chih-Jen Lin Deartment of Comuter Science National Taiwan University Taiei 06, Taiwan Ruby C. Weng Deartment of Statistics

More information

Some results of convex programming complexity

Some results of convex programming complexity 2012c12 $ Ê Æ Æ 116ò 14Ï Dec., 2012 Oerations Research Transactions Vol.16 No.4 Some results of convex rogramming comlexity LOU Ye 1,2 GAO Yuetian 1 Abstract Recently a number of aers were written that

More information

SUPER-GEOMETRIC CONVERGENCE OF A SPECTRAL ELEMENT METHOD FOR EIGENVALUE PROBLEMS WITH JUMP COEFFICIENTS *

SUPER-GEOMETRIC CONVERGENCE OF A SPECTRAL ELEMENT METHOD FOR EIGENVALUE PROBLEMS WITH JUMP COEFFICIENTS * Journal of Comutational Mathematics Vol.8, No.,, 48 48. htt://www.global-sci.org/jcm doi:.48/jcm.9.-m6 SUPER-GEOMETRIC CONVERGENCE OF A SPECTRAL ELEMENT METHOD FOR EIGENVALUE PROBLEMS WITH JUMP COEFFICIENTS

More information

Uniformly best wavenumber approximations by spatial central difference operators: An initial investigation

Uniformly best wavenumber approximations by spatial central difference operators: An initial investigation Uniformly best wavenumber aroximations by satial central difference oerators: An initial investigation Vitor Linders and Jan Nordström Abstract A characterisation theorem for best uniform wavenumber aroximations

More information

STABILITY ANALYSIS TOOL FOR TUNING UNCONSTRAINED DECENTRALIZED MODEL PREDICTIVE CONTROLLERS

STABILITY ANALYSIS TOOL FOR TUNING UNCONSTRAINED DECENTRALIZED MODEL PREDICTIVE CONTROLLERS STABILITY ANALYSIS TOOL FOR TUNING UNCONSTRAINED DECENTRALIZED MODEL PREDICTIVE CONTROLLERS Massimo Vaccarini Sauro Longhi M. Reza Katebi D.I.I.G.A., Università Politecnica delle Marche, Ancona, Italy

More information

Brownian Motion and Random Prime Factorization

Brownian Motion and Random Prime Factorization Brownian Motion and Random Prime Factorization Kendrick Tang June 4, 202 Contents Introduction 2 2 Brownian Motion 2 2. Develoing Brownian Motion.................... 2 2.. Measure Saces and Borel Sigma-Algebras.........

More information

IMPROVED BOUNDS IN THE SCALED ENFLO TYPE INEQUALITY FOR BANACH SPACES

IMPROVED BOUNDS IN THE SCALED ENFLO TYPE INEQUALITY FOR BANACH SPACES IMPROVED BOUNDS IN THE SCALED ENFLO TYPE INEQUALITY FOR BANACH SPACES OHAD GILADI AND ASSAF NAOR Abstract. It is shown that if (, ) is a Banach sace with Rademacher tye 1 then for every n N there exists

More information

Mobility-Induced Service Migration in Mobile. Micro-Clouds

Mobility-Induced Service Migration in Mobile. Micro-Clouds arxiv:503054v [csdc] 7 Mar 205 Mobility-Induced Service Migration in Mobile Micro-Clouds Shiiang Wang, Rahul Urgaonkar, Ting He, Murtaza Zafer, Kevin Chan, and Kin K LeungTime Oerating after ossible Deartment

More information

Uncorrelated Multilinear Principal Component Analysis for Unsupervised Multilinear Subspace Learning

Uncorrelated Multilinear Principal Component Analysis for Unsupervised Multilinear Subspace Learning TNN-2009-P-1186.R2 1 Uncorrelated Multilinear Princial Comonent Analysis for Unsuervised Multilinear Subsace Learning Haiing Lu, K. N. Plataniotis and A. N. Venetsanooulos The Edward S. Rogers Sr. Deartment

More information

Stochastic integration II: the Itô integral

Stochastic integration II: the Itô integral 13 Stochastic integration II: the Itô integral We have seen in Lecture 6 how to integrate functions Φ : (, ) L (H, E) with resect to an H-cylindrical Brownian motion W H. In this lecture we address the

More information

Elements of Asymptotic Theory. James L. Powell Department of Economics University of California, Berkeley

Elements of Asymptotic Theory. James L. Powell Department of Economics University of California, Berkeley Elements of Asymtotic Theory James L. Powell Deartment of Economics University of California, Berkeley Objectives of Asymtotic Theory While exact results are available for, say, the distribution of the

More information

Improved Bounds on Bell Numbers and on Moments of Sums of Random Variables

Improved Bounds on Bell Numbers and on Moments of Sums of Random Variables Imroved Bounds on Bell Numbers and on Moments of Sums of Random Variables Daniel Berend Tamir Tassa Abstract We rovide bounds for moments of sums of sequences of indeendent random variables. Concentrating

More information

A Bound on the Error of Cross Validation Using the Approximation and Estimation Rates, with Consequences for the Training-Test Split

A Bound on the Error of Cross Validation Using the Approximation and Estimation Rates, with Consequences for the Training-Test Split A Bound on the Error of Cross Validation Using the Aroximation and Estimation Rates, with Consequences for the Training-Test Slit Michael Kearns AT&T Bell Laboratories Murray Hill, NJ 7974 mkearns@research.att.com

More information

Applications to stochastic PDE

Applications to stochastic PDE 15 Alications to stochastic PE In this final lecture we resent some alications of the theory develoed in this course to stochastic artial differential equations. We concentrate on two secific examles:

More information

Evaluating Circuit Reliability Under Probabilistic Gate-Level Fault Models

Evaluating Circuit Reliability Under Probabilistic Gate-Level Fault Models Evaluating Circuit Reliability Under Probabilistic Gate-Level Fault Models Ketan N. Patel, Igor L. Markov and John P. Hayes University of Michigan, Ann Arbor 48109-2122 {knatel,imarkov,jhayes}@eecs.umich.edu

More information

Various Proofs for the Decrease Monotonicity of the Schatten s Power Norm, Various Families of R n Norms and Some Open Problems

Various Proofs for the Decrease Monotonicity of the Schatten s Power Norm, Various Families of R n Norms and Some Open Problems Int. J. Oen Problems Comt. Math., Vol. 3, No. 2, June 2010 ISSN 1998-6262; Coyright c ICSRS Publication, 2010 www.i-csrs.org Various Proofs for the Decrease Monotonicity of the Schatten s Power Norm, Various

More information

Reducing Risk in Convex Order

Reducing Risk in Convex Order Reducing Risk in Convex Order Junnan He a, Qihe Tang b and Huan Zhang b a Deartment of Economics Washington University in St. Louis Camus Box 208, St. Louis MO 6330-4899 b Deartment of Statistics and Actuarial

More information

1 Extremum Estimators

1 Extremum Estimators FINC 9311-21 Financial Econometrics Handout Jialin Yu 1 Extremum Estimators Let θ 0 be a vector of k 1 unknown arameters. Extremum estimators: estimators obtained by maximizing or minimizing some objective

More information

MATH 2710: NOTES FOR ANALYSIS

MATH 2710: NOTES FOR ANALYSIS MATH 270: NOTES FOR ANALYSIS The main ideas we will learn from analysis center around the idea of a limit. Limits occurs in several settings. We will start with finite limits of sequences, then cover infinite

More information

On a class of Rellich inequalities

On a class of Rellich inequalities On a class of Rellich inequalities G. Barbatis A. Tertikas Dedicated to Professor E.B. Davies on the occasion of his 60th birthday Abstract We rove Rellich and imroved Rellich inequalities that involve

More information

Journal of Mathematical Analysis and Applications

Journal of Mathematical Analysis and Applications J. Math. Anal. Al. 44 (3) 3 38 Contents lists available at SciVerse ScienceDirect Journal of Mathematical Analysis and Alications journal homeage: www.elsevier.com/locate/jmaa Maximal surface area of a

More information

PArtially observable Markov decision processes

PArtially observable Markov decision processes Solving Continuous-State POMDPs via Density Projection Enlu Zhou, Member, IEEE, Michael C. Fu, Fellow, IEEE, and Steven I. Marcus, Fellow, IEEE Abstract Research on numerical solution methods for artially

More information

Convex Optimization methods for Computing Channel Capacity

Convex Optimization methods for Computing Channel Capacity Convex Otimization methods for Comuting Channel Caacity Abhishek Sinha Laboratory for Information and Decision Systems (LIDS), MIT sinhaa@mit.edu May 15, 2014 We consider a classical comutational roblem

More information

Mean Square Stability Analysis of Sampled-Data Supervisory Control Systems

Mean Square Stability Analysis of Sampled-Data Supervisory Control Systems 17th IEEE International Conference on Control Alications Part of 28 IEEE Multi-conference on Systems and Control San Antonio, Texas, USA, Setember 3-5, 28 WeA21 Mean Square Stability Analysis of Samled-Data

More information

1-way quantum finite automata: strengths, weaknesses and generalizations

1-way quantum finite automata: strengths, weaknesses and generalizations 1-way quantum finite automata: strengths, weaknesses and generalizations arxiv:quant-h/9802062v3 30 Se 1998 Andris Ambainis UC Berkeley Abstract Rūsiņš Freivalds University of Latvia We study 1-way quantum

More information

On Doob s Maximal Inequality for Brownian Motion

On Doob s Maximal Inequality for Brownian Motion Stochastic Process. Al. Vol. 69, No., 997, (-5) Research Reort No. 337, 995, Det. Theoret. Statist. Aarhus On Doob s Maximal Inequality for Brownian Motion S. E. GRAVERSEN and G. PESKIR If B = (B t ) t

More information

Paper C Exact Volume Balance Versus Exact Mass Balance in Compositional Reservoir Simulation

Paper C Exact Volume Balance Versus Exact Mass Balance in Compositional Reservoir Simulation Paer C Exact Volume Balance Versus Exact Mass Balance in Comositional Reservoir Simulation Submitted to Comutational Geosciences, December 2005. Exact Volume Balance Versus Exact Mass Balance in Comositional

More information

Uncertainty Modeling with Interval Type-2 Fuzzy Logic Systems in Mobile Robotics

Uncertainty Modeling with Interval Type-2 Fuzzy Logic Systems in Mobile Robotics Uncertainty Modeling with Interval Tye-2 Fuzzy Logic Systems in Mobile Robotics Ondrej Linda, Student Member, IEEE, Milos Manic, Senior Member, IEEE bstract Interval Tye-2 Fuzzy Logic Systems (IT2 FLSs)

More information

A Special Case Solution to the Perspective 3-Point Problem William J. Wolfe California State University Channel Islands

A Special Case Solution to the Perspective 3-Point Problem William J. Wolfe California State University Channel Islands A Secial Case Solution to the Persective -Point Problem William J. Wolfe California State University Channel Islands william.wolfe@csuci.edu Abstract In this aer we address a secial case of the ersective

More information

Linear diophantine equations for discrete tomography

Linear diophantine equations for discrete tomography Journal of X-Ray Science and Technology 10 001 59 66 59 IOS Press Linear diohantine euations for discrete tomograhy Yangbo Ye a,gewang b and Jiehua Zhu a a Deartment of Mathematics, The University of Iowa,

More information

Shadow Computing: An Energy-Aware Fault Tolerant Computing Model

Shadow Computing: An Energy-Aware Fault Tolerant Computing Model Shadow Comuting: An Energy-Aware Fault Tolerant Comuting Model Bryan Mills, Taieb Znati, Rami Melhem Deartment of Comuter Science University of Pittsburgh (bmills, znati, melhem)@cs.itt.edu Index Terms

More information

Analysis of Multi-Hop Emergency Message Propagation in Vehicular Ad Hoc Networks

Analysis of Multi-Hop Emergency Message Propagation in Vehicular Ad Hoc Networks Analysis of Multi-Ho Emergency Message Proagation in Vehicular Ad Hoc Networks ABSTRACT Vehicular Ad Hoc Networks (VANETs) are attracting the attention of researchers, industry, and governments for their

More information

Location of solutions for quasi-linear elliptic equations with general gradient dependence

Location of solutions for quasi-linear elliptic equations with general gradient dependence Electronic Journal of Qualitative Theory of Differential Equations 217, No. 87, 1 1; htts://doi.org/1.14232/ejqtde.217.1.87 www.math.u-szeged.hu/ejqtde/ Location of solutions for quasi-linear ellitic equations

More information

Characterizing the Behavior of a Probabilistic CMOS Switch Through Analytical Models and Its Verification Through Simulations

Characterizing the Behavior of a Probabilistic CMOS Switch Through Analytical Models and Its Verification Through Simulations Characterizing the Behavior of a Probabilistic CMOS Switch Through Analytical Models and Its Verification Through Simulations PINAR KORKMAZ, BILGE E. S. AKGUL and KRISHNA V. PALEM Georgia Institute of

More information

Generalized Coiflets: A New Family of Orthonormal Wavelets

Generalized Coiflets: A New Family of Orthonormal Wavelets Generalized Coiflets A New Family of Orthonormal Wavelets Dong Wei, Alan C Bovik, and Brian L Evans Laboratory for Image and Video Engineering Deartment of Electrical and Comuter Engineering The University

More information

Solutions of the Duffing and Painlevé-Gambier Equations by Generalized Sundman Transformation

Solutions of the Duffing and Painlevé-Gambier Equations by Generalized Sundman Transformation Solutions of the Duffing and Painlevé-Gambier Equations by Generalized Sundman Transformation D.K.K. Adjaï a, L. H. Koudahoun a, J. Akande a, Y.J.F. Komahou b and M. D. Monsia a 1 a Deartment of Physics,

More information

ON THE LEAST SIGNIFICANT p ADIC DIGITS OF CERTAIN LUCAS NUMBERS

ON THE LEAST SIGNIFICANT p ADIC DIGITS OF CERTAIN LUCAS NUMBERS #A13 INTEGERS 14 (014) ON THE LEAST SIGNIFICANT ADIC DIGITS OF CERTAIN LUCAS NUMBERS Tamás Lengyel Deartment of Mathematics, Occidental College, Los Angeles, California lengyel@oxy.edu Received: 6/13/13,

More information

Lecture 6. 2 Recurrence/transience, harmonic functions and martingales

Lecture 6. 2 Recurrence/transience, harmonic functions and martingales Lecture 6 Classification of states We have shown that all states of an irreducible countable state Markov chain must of the same tye. This gives rise to the following classification. Definition. [Classification

More information

Positive Definite Uncertain Homogeneous Matrix Polynomials: Analysis and Application

Positive Definite Uncertain Homogeneous Matrix Polynomials: Analysis and Application BULGARIA ACADEMY OF SCIECES CYBEREICS AD IFORMAIO ECHOLOGIES Volume 9 o 3 Sofia 009 Positive Definite Uncertain Homogeneous Matrix Polynomials: Analysis and Alication Svetoslav Savov Institute of Information

More information

Elements of Asymptotic Theory. James L. Powell Department of Economics University of California, Berkeley

Elements of Asymptotic Theory. James L. Powell Department of Economics University of California, Berkeley Elements of Asymtotic Theory James L. Powell Deartment of Economics University of California, Berkeley Objectives of Asymtotic Theory While exact results are available for, say, the distribution of the

More information

Fault Tolerant Quantum Computing Robert Rogers, Thomas Sylwester, Abe Pauls

Fault Tolerant Quantum Computing Robert Rogers, Thomas Sylwester, Abe Pauls CIS 410/510, Introduction to Quantum Information Theory Due: June 8th, 2016 Sring 2016, University of Oregon Date: June 7, 2016 Fault Tolerant Quantum Comuting Robert Rogers, Thomas Sylwester, Abe Pauls

More information

Design of NARMA L-2 Control of Nonlinear Inverted Pendulum

Design of NARMA L-2 Control of Nonlinear Inverted Pendulum International Research Journal of Alied and Basic Sciences 016 Available online at www.irjabs.com ISSN 51-838X / Vol, 10 (6): 679-684 Science Exlorer Publications Design of NARMA L- Control of Nonlinear

More information

MODELING THE RELIABILITY OF C4ISR SYSTEMS HARDWARE/SOFTWARE COMPONENTS USING AN IMPROVED MARKOV MODEL

MODELING THE RELIABILITY OF C4ISR SYSTEMS HARDWARE/SOFTWARE COMPONENTS USING AN IMPROVED MARKOV MODEL Technical Sciences and Alied Mathematics MODELING THE RELIABILITY OF CISR SYSTEMS HARDWARE/SOFTWARE COMPONENTS USING AN IMPROVED MARKOV MODEL Cezar VASILESCU Regional Deartment of Defense Resources Management

More information

Topic 7: Using identity types

Topic 7: Using identity types Toic 7: Using identity tyes June 10, 2014 Now we would like to learn how to use identity tyes and how to do some actual mathematics with them. By now we have essentially introduced all inference rules

More information

How to Estimate Expected Shortfall When Probabilities Are Known with Interval or Fuzzy Uncertainty

How to Estimate Expected Shortfall When Probabilities Are Known with Interval or Fuzzy Uncertainty How to Estimate Exected Shortfall When Probabilities Are Known with Interval or Fuzzy Uncertainty Christian Servin Information Technology Deartment El Paso Community College El Paso, TX 7995, USA cservin@gmail.com

More information

Developing A Deterioration Probabilistic Model for Rail Wear

Developing A Deterioration Probabilistic Model for Rail Wear International Journal of Traffic and Transortation Engineering 2012, 1(2): 13-18 DOI: 10.5923/j.ijtte.20120102.02 Develoing A Deterioration Probabilistic Model for Rail Wear Jabbar-Ali Zakeri *, Shahrbanoo

More information

Partial Identification in Triangular Systems of Equations with Binary Dependent Variables

Partial Identification in Triangular Systems of Equations with Binary Dependent Variables Partial Identification in Triangular Systems of Equations with Binary Deendent Variables Azeem M. Shaikh Deartment of Economics University of Chicago amshaikh@uchicago.edu Edward J. Vytlacil Deartment

More information

On Wald-Type Optimal Stopping for Brownian Motion

On Wald-Type Optimal Stopping for Brownian Motion J Al Probab Vol 34, No 1, 1997, (66-73) Prerint Ser No 1, 1994, Math Inst Aarhus On Wald-Tye Otimal Stoing for Brownian Motion S RAVRSN and PSKIR The solution is resented to all otimal stoing roblems of

More information

PETER J. GRABNER AND ARNOLD KNOPFMACHER

PETER J. GRABNER AND ARNOLD KNOPFMACHER ARITHMETIC AND METRIC PROPERTIES OF -ADIC ENGEL SERIES EXPANSIONS PETER J. GRABNER AND ARNOLD KNOPFMACHER Abstract. We derive a characterization of rational numbers in terms of their unique -adic Engel

More information

Yixi Shi. Jose Blanchet. IEOR Department Columbia University New York, NY 10027, USA. IEOR Department Columbia University New York, NY 10027, USA

Yixi Shi. Jose Blanchet. IEOR Department Columbia University New York, NY 10027, USA. IEOR Department Columbia University New York, NY 10027, USA Proceedings of the 2011 Winter Simulation Conference S. Jain, R. R. Creasey, J. Himmelsach, K. P. White, and M. Fu, eds. EFFICIENT RARE EVENT SIMULATION FOR HEAVY-TAILED SYSTEMS VIA CROSS ENTROPY Jose

More information

THE 3-DOF helicopter system is a benchmark laboratory

THE 3-DOF helicopter system is a benchmark laboratory Vol:8, No:8, 14 LQR Based PID Controller Design for 3-DOF Helicoter System Santosh Kr. Choudhary International Science Index, Electrical and Information Engineering Vol:8, No:8, 14 waset.org/publication/9999411

More information

Hidden Predictors: A Factor Analysis Primer

Hidden Predictors: A Factor Analysis Primer Hidden Predictors: A Factor Analysis Primer Ryan C Sanchez Western Washington University Factor Analysis is a owerful statistical method in the modern research sychologist s toolbag When used roerly, factor

More information

Preconditioning techniques for Newton s method for the incompressible Navier Stokes equations

Preconditioning techniques for Newton s method for the incompressible Navier Stokes equations Preconditioning techniques for Newton s method for the incomressible Navier Stokes equations H. C. ELMAN 1, D. LOGHIN 2 and A. J. WATHEN 3 1 Deartment of Comuter Science, University of Maryland, College

More information

Approximating min-max k-clustering

Approximating min-max k-clustering Aroximating min-max k-clustering Asaf Levin July 24, 2007 Abstract We consider the roblems of set artitioning into k clusters with minimum total cost and minimum of the maximum cost of a cluster. The cost

More information

RUN-TO-RUN CONTROL AND PERFORMANCE MONITORING OF OVERLAY IN SEMICONDUCTOR MANUFACTURING. 3 Department of Chemical Engineering

RUN-TO-RUN CONTROL AND PERFORMANCE MONITORING OF OVERLAY IN SEMICONDUCTOR MANUFACTURING. 3 Department of Chemical Engineering Coyright 2002 IFAC 15th Triennial World Congress, Barcelona, Sain RUN-TO-RUN CONTROL AND PERFORMANCE MONITORING OF OVERLAY IN SEMICONDUCTOR MANUFACTURING C.A. Bode 1, B.S. Ko 2, and T.F. Edgar 3 1 Advanced

More information

Indirect Rotor Field Orientation Vector Control for Induction Motor Drives in the Absence of Current Sensors

Indirect Rotor Field Orientation Vector Control for Induction Motor Drives in the Absence of Current Sensors Indirect Rotor Field Orientation Vector Control for Induction Motor Drives in the Absence of Current Sensors Z. S. WANG *, S. L. HO ** * College of Electrical Engineering, Zhejiang University, Hangzhou

More information

LECTURE 7 NOTES. x n. d x if. E [g(x n )] E [g(x)]

LECTURE 7 NOTES. x n. d x if. E [g(x n )] E [g(x)] LECTURE 7 NOTES 1. Convergence of random variables. Before delving into the large samle roerties of the MLE, we review some concets from large samle theory. 1. Convergence in robability: x n x if, for

More information

Automatic Generation and Integration of Equations of Motion for Linked Mechanical Systems

Automatic Generation and Integration of Equations of Motion for Linked Mechanical Systems Automatic Generation and Integration of Equations of Motion for Linked Mechanical Systems D. Todd Griffith a, John L. Junkins a, and James D. Turner b a Deartment of Aerosace Engineering, Texas A&M University,

More information