arxiv: v1 [cs.ro] 24 May 2017
|
|
- Aubrey Felicity Goodwin
- 5 years ago
- Views:
Transcription
1 A Near-Otimal Searation Princile for Nonlinear Stochastic Systems Arising in Robotic Path Planning and Control Mohammadhussein Rafieisakhaei 1, Suman Chakravorty 2 and P. R. Kumar 1 arxiv: v1 [cs.ro] 24 May 2017 Abstract We consider nonlinear stochastic systems that arise in ath lanning and control of mobile robots. As is tyical of almost all nonlinear stochastic systems, the otimally solving roblem is intractable. We rovide a design aroach which yields a tractable design that is quantifiably nearotimal. We exhibit a searation rincile under a small noise assumtion consisting of the otimal oen-loo design of nominal trajectory followed by an otimal feedback law to track this trajectory, which is different from the usual effort of searating estimation from control. As a corollary, we obtain a trajectory-otimized linear quadratic regulator design for stochastic nonlinear systems with Gaussian noise. I. INTRODUCTION Practical systems are often subject to inaccuracies that we model as noise. Planning for a stochastic system requires attention to the noise structure, available models and noise levels. Many robotic systems, in articular, mobile aerial and ground robots, are equied with noisy actuators that require feedback comensation or lanning ahead for a olicy that accounts for the random erturbations. Simly ignoring the noise and lanning for the unerturbed equivalent of the stochastic system can yield crucial errors leading to the failure in reaching the end-goal, or cause the system to fall into unsafe states. In a stochastic setting, the general roblem of sequential decision-making is formulated as a Markov Decision Problem (MDP) [1], [2]. The otimal solution of the stochastic control roblem can be obtained iteratively by value or olicy iteration methods to solve the Hamilton-Jacobi- Bellman equations [2]. Excet in secial cases, such as in a linear Gaussian environment, this involves discretization of the underlying saces [3]; an aroach whose scalability faces the curse of dimensionality [4]. As a result, they require a comutation time that is rovably exonential in the state dimension, in a real number based model of comlexity, without any assumtion that P NP [5]. Many aroaches have been roosed based on their tractability. Some rely on a searate design of the deterministic trajectory from the feedback olicy. Model Predictive Control (MPC)-based methods [6], [7], robust formulations [8], [9], and other designs that relate to the Pontryagin s *This material is based uon work artially suorted by NSF under Contract Nos. CNS and Science & Technology Center Grant CCF , the U.S. Army Research Office under Contract No. W911NF , and NPRP grant NPRP from the Qatar National Research Fund, a member of Qatar Foundation. 1 M. Rafieisakhaei and P. R. Kumar are with the Deartment of Electrical and Comuter Engineering, and 2 S. Chakravorty is with the Deartment of Aerosace Engineering, Texas A&M University, College Station, Texas, USA. {mrafieis, schakrav, rk@tamu.edu} Maximum Princile [10] are some of the methods that have been successfully used as surrogate design aroaches. Another oular aroach is utilizing Differential Dynamic Programing (DDP) [11] and DDP-based variations, such as the Stochastic DDP [12], ilqr and ilqg [13]. These methods rely on local linearizations of the cost function and the dynamics to the second order and roose iterative methods that attemt to find locally-otimal solutions in a tube around a nominal trajectory [13]. In this aer, we address the nonlinear stochastic control roblem and roose an architecture under which the searate design of an otimal oen-loo control sequence and a feedback olicy is near-otimal. In articular, we show that under a small noise assumtion, the searation into globallyotimal trajectory design and a globally-otimal feedback control law holds for a fully-observed nonlinear stochastic system. This result also sheds light on the conditions under which oular design aroaches based on the Maximum Princile may be globally ɛ-otimal. We quantify the first order stochastic error for small-noise levels based on Wentzell-Freidlin large-deviations theory. We thereby determine reach to a Trajectory-otimized Linear Quadratic Regulator (T-LQR) design for fully-observed nonlinear stochastic systems under Gaussian small-noise erturbations. In short, the design can be broken into two arts: i) an oen-loo otimal control roblem that designs the nominal trajectory of the LQR controller, which resects the nonlinearities as well as state and control constraints; ii) the design of an LQR olicy around the otimized nominal trajectory. The quality of the design is rigorously rovided by the main results of the aer. The organization of the aer is as follows. Section II rovides a brief background on Wentzell-Freidlin theory [14] and investigates its imlications regarding the linearization of a stochastic system couled with the usage of the Taylor theorem. Section III defines a general stochastic control roblem for a fully-observed system. Section IV rovides the main results by first analyzing the effect of feedback comensation on the linearization error, and then roviding the state and control error roagations along with robabilistic bounds based on the theory develoed in Section II. Section IV also rovides the first-order exected error of the stochastic cost function along with the searation result. Section V introduces the T-LQR design aroach. Finally, Section VI rovides a design based on T-LQR for a non-holonomic carlike robot and rovides numerical results on the roosed aroach to design.
2 II. SMALL RANDOM PERTURBATIONS OF A NON-LINEAR SYSTEM In this section, we discuss the theoretical background regarding the small noise erturbations of general dynamical systems. In articular, we discuss Wentzell-Freidlin theory on the small noise asymtotics of a erturbed system reresented by a general Stochastic Differential Equation (SDE). We consider a time-varying system as that is required for our design. A general discussion regarding large deviations of the trajectories of a erturbed system from that of its unerturbed counterarts and related theories can be found in [14] [22]. Probability sace: We consider a robability sace {Ω, F, P } with the random variables on a measurable sace (X, B), where X is a Euclidean sace with dimension of n x, n w or a smooth manifold in these saces, and B is the corresonding σ-algebra of Borel sets. Diffusion rocess: Let us consider a dynamical system with the following equation: dx ɛ t = b(t, X ɛ t)dt + ɛdw t, X ɛ 0 = x 0, (1) where b : R R nx R nx is a uniformly Lischitz continuous function, such that: b(t 1, x 1 ) b(t 2, x 2 ) K 1 x 1 x 2, (2) where x 1, x 2 R nx, t 1, t 2 [0, K], ɛ > 0, and K 1 > 0, {w t, t 0} is a Wiener rocess on R nw. Nominal unerturbed trajectory: Such a system can result from small random erturbations of the following timevarying ODE: ẋ t = b(t, x t ), (3) with initial condition x 0 = x 0 R nx. First order Taylor exansion: Using Taylor s theorem to obtain the first order linearization of the right hand side of the above system around the trajectory {x t } K results in the following: dx ɛ t =b(t, x t )dt+a t (X ɛ t x t )dt+ɛdw t +o( X ɛ t x t ), (4) where A t = x b(t, x) t,x t is the Jacobian matrix. Accuracy of linearization: Equation (4) states that if X ɛ t x t δ for all 0 t K, then, dx ɛ t =b(t, x t )dt+a t (X ɛ t x t )dt +ɛdw t + o(δ). (5) We will use the Wentzell-Freidlin theorem to calculate the robability that the aforesaid condition holds. In order to do that, we define the action functional for the family of rocesses defined in equation (1). Action functional [14]: For [T 1, T 2 ] [0, K], the action functional is defined as: S T1,T 2 (φ) := 1 T2 2ɛ 2 φ t b(t, φ t ) 2 dt, (6) T 1 for absolutely continuous φ, and is set to be equal to + for other φ C 0K (R nx ). Note that this defines the action functional for the (ɛ-deendent) family of rocesses given by the SDE (1), uniformly on the whole sace as ɛ 0. Theorem 1. Exonential Rate of Convergence Let: D be a domain in R nx, and denote its closure by cl(d); D denote the boundary of D; H D (t, x 0 )={φ C 0K (R nx ) : φ 0 = x 0, φ t D D}. Assume D = cl(d). Then, we have the following: lim ɛ 0 ɛ2 ln P x0 {X ɛ t D}= inf S 0t(φ), (7) φ H D (t,x 0) Theorem 2. Asymtotics of the Diffusion Process: Let: D t = cl(b c δ (x t )), the closure of the comlement of a ball with radius δ > 0 around the oint x t ; and τ ɛ = Min{t : X ɛ t D t }. Then, lim ɛ 0 ɛ2 ln P x0 {τ ɛ t} = inf S 0t (φ). (8) {φ:φ 0 =x 0, φ t x t >δ} Proof of these results can be found in [14], [15]. Thus, according to Theorem 1, for a given t, the robability as ɛ 0 of X ɛ t x t δ can be calculated as in equation (7). Note that this robability tends to zero exonentially for any fixed δ > 0 as ɛ 0. Moreover, from Theorem 2, the robability that the trajectory of X ɛ ever exits the tube of radius δ round the nominal trajectory in the time interval [0, t] also goes to zero exonentially at the same rate. (This also asserts that the likely aths to ever exit in [0, t] are those exiting at time t). This rovides the validity region of the linearized equation (4) and concludes our discussion in this section. III. THE FULLY OBSERVED SYSTEM The general stochastic control roblem of interest for fully observed system can be formulated as an otimization roblem in the sace of feedback olicies. In this section, we define the system equations and ose the general roblem. Without loss of generality, we consider the discrete-time version of the systems considered in the revious section and continue our analysis on that basis. Process model: We denote the state and control by x X R nx and u U R nu, resectively. The rocess model with f : X U X is defined as: x t+1 = f(x t, u t ) + ω t, ω t N (0, Σ ωt ) (9) where {ω t } is indeendent, identically distributed (i.i.d.). Now, we ose the general stochastic control roblem [1], [23]. Problem 1. Stochastic Control Problem for Fully Observed System: Given an initial state x 0, we wish to determine an otimal or near-otimal for min E[ c π t (x t, u t ) + c π π K(x K )] s.t. x t+1 = f(x t, u t ) + ω t, (10) where the otimization is over Markov, i.e., time-varying state-feedback, olicies, π Π, with π := {π 0,, π t }, π t : X U ; and u t = π t (x t ) secifying the action taken given the state; c π t (, ) : X U R is the one-ste cost function; c π K ( ) : X R denotes the terminal cost; K is the time horizon.
3 IV. SEPARATION OF OPEN LOOP AND CLOSED LOOP DESIGNS: FULLY OBSERVED SYSTEMS In this section, we rovide the theoretical basis for our design. The analysis emloys the Taylor series exansion of the rocess model and large deviations theory. A. Preliminary Analysis We start by roviding the nominal trajectory to linearize the rocess model. Then, we discuss the feedback law and comensate the rocess model with the feedback in order to use large deviations theory. Nominal Trajectory: We use the rocess model with zero noise to roagate the initial state, x 0, with a set of unknown controls {u t }, in order to obtain a arametrization of the feasible nominal trajectories as: x t+1 = f(x t, u t ), 0 t K 1, (11) where x 0 = x 0. Linearization of the rocess model: We linearize the rocess model of equation (9) around the nominal trajectory: x t+1 =A t x t +B t ũ t +ω t +o(e x,u t ), (12) where we have: A t (x t, u t ) = x f(x, u) x, denoted by A t; t,u t B t (x t, u t ) = u f(x, u) x, denoted by B t; t,u t x t := x t x t, the state error with resect to the nominal trajectory; ũ t := u t u t, the control error; and := x t + ũ t the error. e x,u t As the control inuts change, the underlying nominal trajectory also changes, and therefore the Jacobian matrices, A t, B t, and G t change, as well. The Taylor series exansion of equation (12) is valid as e x t 0, i.e., the linearized function remains close to the linearization region. In this equation, the only factor that can drive the linearized function away from the linearization region is the noise rocess ω t. Therefore, we establish robabilistic bounds on the validity of this equation using the small noise theory of Section II. Otimization over olicy sace: A feedback law with Linear Time-Varying (LTV) gain is sufficient to control a linearized model around a nominal trajectory. Therefore, we restrict the search to feedback olicies with LTV feedback gain, Π L. In the next section, we design a Linear Quadratic Regulator olicy (LQR) as a secial case for our design. Feedback controller: Assuming the controllability of the deterministic model of the system, we suose the existence of a feedback control law with LTV feedback gain to track and stabilize the trajectory of states around the nominaldesigned trajectory. Later, we exlain in detail how to design such a law. Thus, the control action error can be exressed as: ũ t = u t u t = L t (x t x t ), (13) where L t is the linear feedback gain. It is imortant to note that although we are working with the linearized system, the original system is a nonlinear system, and the design is tailored to work for the original system. Linearized system equation comensated with feedback: Relacing the feedback law in equation (12), we obtain: x t+1 =A t x t + B t ũ t + ω t + o(e x,u t ), =(A t B t L t ) x t + ω t + o(e x t ), =D t x t + ω t + o(e x t ), (14) where D t := A t B t L t, t 1 and e x,ω t := x t denotes the linearization-based error. Comensating the original system with feedback: Let us substitute for the control action in (9) using the feedback law of (13) as follows: x t+1 = f(x t, u t ) + ω t = f(x t, u t L t (x t x t )) + ω t. Using the last equation we define g : R X X, where g(t, x) =: f(x t, u t L t (x t x t )). (15) Note that the time-deendency for g stems from the timedeendency of the feedback law. Moreover, the nominal trajectory, {x t } K, satisfies the same equation as (11): x t+1 = g(t, x t ) = f(x t, u t L t (x t x t )) = f(x t, u t ). Note that linearizing g around the nominal trajectory yields (14), which itself is equivalent to equation (12) x g(t, x) t,x t (x t x t ) = x f(x, u t L t (x x t )) x t (x t x t ) = x f(x, u) x t,u t Lt(x x t ) (x t x t ) + u f(x, u) x t,u t Lt(x x t ) (u t L t (x x t )) x x (x t x t t ) = x f(x, u) x (x t,u t x t t ) + u f(x, u) x ( L t)(x t,u t t x t ) =A t (x t x t ) + B t ( L t )(x t x t ) = D t (x t x t ). Therefore, g(t, x t ) =D t (x t x t ) + ω t + o(e x t ), as e x,ω t 0. (16) Validity of the linearization: Let us analyze the validity of (12) using the Wentzell-Freidlin theory discussed in Section II. Let us assume that the noise rocess is ω t = ɛw t, where w t is a Wiener rocess as described in Section II, and ɛ > 0. Now, for a time-varying system, the robability that the error x t is less than a given δ > 0 can be calculated using large deviations theory. In articular, the discussion in Section II holds for rocess g. However, we require the function g to satisfy a uniform Lischitz continuity condition, for which uniform Lischitz continuity of rocess model f is sufficient. This is because, if f(x 1, u 1 ) f(x 2, u 2 ) K f ( x 1 x 2 + u 1 u 2 ), where x 1, x 2 R nx, and u 1, u 2 R nu, in addition to smoothness of the nominal trajectory (which is calculated as in (11)) on the interval [0, K], and we have the Lischitz continuity of g, as well. Effect of feedback on the linearization error: Note that before alying the feedback law, equation (9) deends on both u and ω. The influence of ω can be analyzed using large deviations theory; however, it is the feedback law that
4 limits the error of linearization caused by the control actions and converts the control action error into the state error. Moreover, the feedback effectively changes the drift term of the diffusion rocess and affects the validity region s robability through the action functional. B. Main Results In this section, we quantify the overall erformance obtained from the searated design. The roofs are rovided in the aendix. Lemma 1. State Error Proagation: Let ω t = ɛw t, where w t is a Gaussian rocess as described in section II, and ɛ > 0. Let the state error be x t = x t x t for t 0. Then, for t 0 the non-recursive state error roagation, x t+1, in terms of the indeendent variables, including rocess noise at each time ste can be written as follows: t x t+1 = D ω s,tω s + o(δ), as ɛ 0, (17) where we have: D 0 := A 0 D t1:t 2 = Π t2 t=t 1 D t, t 2 t 1 0, otherwise, it is the identity matrix; D ω s,t := D s+1:t, 0 s t 1, t 1; and D ω t,t := D t+1:t = I, t 0. The following lemma follows directly by taking into account the feedback law in the result of Lemma 1. Lemma 2. Control Error Proagation: Let ω t = ɛw t, where w t is a Gaussian rocess as described in section II, and ɛ > 0. Let the control error be ũ t = u t u t for t 0. Then, for t 0 the non-recursive control error roagation, ũ t+1, in terms of the indeendent variables, including rocess noise at each time ste can be written as follows: t ũ t+1 = L ω s,t+1ω s + o(δ), as ɛ 0, where L ω s,t+1 := L t+1 Dω s,t, t 0, t s 0. Moreover, the validity region of the above equation is the same as for (17) in Lemma 1. Next, we linearize of the cost function and rovide the searation result for a fully observed system. Linearization of the cost function: Using the Taylor aroximation around the nominal trajectories of state and control actions yields J = J + (C x t x t + C u t ũt) + C x K x K + o(e x,u ), (18) J 1 where we assume that the cost function is continuously differentiable. Moreover: J := c t(x t, u t )+c K (x K ) denotes the nominal cost; J 1 := J + (Cx t x t + C u t ũt) + C x K x K is the first order aroximation of the cost function; J1 := (Cx t x t + C u t ũt) + C x K x K is the first order error in the cost by our aroximation scheme. Therefore, J1 = J 1 J ; C x t = x c t (x, u) x C u t = u c t (x, u) x ; t,u t ; t,u t C x K = xc K (x) x ; and K e x,u J 1 error. := t=1 ( x t + ũ t ) + x K is the linearization Note that since the error term is in terms of state and control at all time stes, the robability of this equation holding true is equivalent to the robability of the latest time-ste term still being in the vicinity of the nominal trajectory at that ste. Therefore, the robability that this last equation is valid can be calculated as the robability that x K δ for δ > 0, which is given by equation (7) for rocess g defined in equation (15) and using D K = cl(b c δ (x K )) in Theorem 1. As a result, all the revious stes will remain within the same tube around the nominal trajectory and the total error will still be of the order of δ. Therefore, given this robability, we have: J = J + (C x t x t + C u t ũt) + C x K x K + o(δ), (19) ɛ 0. Hence, J J 1 = o(δ) as ɛ 0 with robability given in equation (7) for t = K. Next, we rovide the main result regarding the exected first order error of the cost function. Theorem 3. First Order Cost Function Error: Let us denote the first order cost function error by J 1. Given that rocess noises are zero mean i.i.d., under a first-order aroximation for the small noise aradigm, the stochastic cost function is dominated by the nominal art of the cost function. Moreover the exected first-order error is zero. That is, E[ J 1 ] = 0. Moreover, if the rocess noise at each time ste is distributed according to a zero mean Gaussian distribution, then J 1 also has a zero mean Gaussian distribution. The above result says that the random erturbation in the stochastic running cost form the nominal is zero mean if the linearization holds. From Wentzell-Freidlin theory, we have already established that the linearization holds with a robability exonentially close to 1 as ɛ 0. Hence, this imlies that the exected stochastic cost is equal to the nominal cost with a very high robability as ɛ 0. Therefore, it follows that the oen loo nominal design can be done searately from the closed loo design, summarized bellow: Corollary 1. Searation of the Closed Loo and Oen Design Under Small Noise Based on Theorem 3, under the small noise aradigm, as ɛ 0, the design of the feedback law can be done searately from the design of the oen loo otimized trajectory. Furthermore, this result holds with a robability that exonentially tends to one as ɛ 0.
5 Remark: This result means that under a small noise assumtion and assuming the existence of a feedback law (with LTV gain, which is designed searately), the oen loo nominal trajectory of the system can be designed by relacing the stochastic equations with their nominal counterarts. This design tends to the otimal design with robability one (for the general class of Gaussian rocesses that are considered) as the intensity of noise tends to zero. Remark: It should be mentioned that while our general roblem definition has only the rocess model as dynamics, other constraints on state or control can be considered as long as they share the same smoothness roerties as the cost function. Remark: It is worth mentioning that although we have considered diffusion rocesses with additive white Gaussian noise, the theory in fact holds for a larger class of roblems. On can aeal to more general results in [15] for time-inhomogeneous diffusion rocesses with non-additive white noise. In such cases, the action functional is usually calculated through the Legendre transform. Remark: As mentioned before, although we roved the results of this section for discrete time systems, one can rove the continuous-time versions of our results. This can be done, for instance, by reducing the samling time and limiting it to zero, while utilizing results such as Fubini s theorem along with the similar conditional exectation theorem on Itô s stochastic integrals to exchange the integrations with the exectation. It should be mentioned that there also exists a discrete-time counterart of the Wentzell-Freidlin theory as rovided in [15]. Remark: Higher order designs and analysis of the cost function (or even the dynamics) are ossible using a similar aroach rovided in this aer. Remark: In Ref. [17], for a secial case of nonlinear systems where the rocess model is linear in the control variable, i.e., f(x t, u t ) = f 1 (x t ) + f 2 (x t )u t, three results are roven. The first result, concerns the ɛ-otimality of the otimal deterministic law under convexity of J in the control (i.e., v T ( u,u J)v 0, v), and additional smoothness and regularity conditions. The second result concerns the ɛ 2 - otimality of the otimal deterministic law under a stronger convexity condition of J in the control (i.e., v T ( u,u J)v c( u ) v 2, v, c( ) : R R is a monotonically nonincreasing ositive function), and some smoothness and regularity conditions. The third result concerns the ɛ-otimality of the otimal deterministic sequence under the latter condition. Our result, on the other hand, rovides the ɛ-otimality of the roosed design aroach for a broader class of rocesses f(x t, u t ) with nonlinear deendence in the control variable and more general cost functions (most imortantly, does not assume the linear deendence on the control sequence). In fact, our simulations are erformed for a car-like robot with nonlinear deendence on the control variables. V. T-LQR: TRAJECTORY-OPTIMIZED LQR In this section, we rovide a design scheme based on the theory rovided in the revious sections. This aroach aims at designing an LQR controller with an otimal nominal underlying trajectory based on the searation result of Corollary 1 and Theorem 3. As a result, we term this method as the Trajectory-otimized LQR (T-LQR). Problem 2. Trajectory Planning Problem: Solve for the otimal trajectory: min u 0: c(x t, u t ) + c K (x K ) s.t. x t+1 = f(x t, u t ), 0 t K 1, x 0 = x 0. (20a) (20b) Otimized nominal trajectory: Problem 2 is a deterministic roblem aiming for the best nominal erformance. This roblem utilizes the first order aroximation of the cost function and otimizes the underlying nominal trajectory used in the design of the feedback law. We will denote the resulting otimized nominal trajectory of roblem 2 by {x o t } K, {u o t }. Feedback control: The resulting trajectory from the otimization roblem is otimized in terms of control effort and other constraints, such as a terminal constraint. Now, using the searation result, an LQR controller is designed to track the otimized nominal trajectory. Therefore, the LQR cost is designed for the tracking error x t x o t. The resulting control olicy is a feedback olicy with LTV gain, and the evolution of x t is obtained from the original equation of the rocess model during the execution. Although we utilize an LQR controller, it is imortant to note that the searation result only assumes a linear form of feedback and other tyes of designs [24] can be used as well. Linearization of system equations: For simlicity, we denote the Jacobian matrices and every other variable associated with the otimized nominal trajectory with a suerscrit o. The Jacobians are A o t = x f(x, u) x o t,u o, and Bo t t = u f(x, u) x o t,u o. t Problem 3. LQR Problem: Given the otimized nominal trajectory as {x o t } K and {u o t }, and a lanning horizon of K > 0, solve the following LQR roblem to track the nominal trajectory: K min [(x t x o u t ) T Wt x (x t x o t ) + (ũ o t 1) T Wt u ũo t 1] 0: t=1 s.t. x o t+1 = A o t x o t + B o t u o t, 0 t K 1 (21) where ũ o t = u t u o t and W u t, W x t 0 are ositive-definite matrices. Control olicy: The resulting control olicy of roblem 3 is a feedback olicy as follows [1]: ũ o t = L o t (x t x o t ), where the linear feedback gain L o t is: L o t = (W u t + (B o t ) T P f t+1 Bo t ) 1 (B o t ) T P f t+1 Ao t, and the matrix P f t is the result of backward iteration of the dynamic Riccati equation P f t 1 = (Ao t ) T P f t A o t
6 (a) Otimized trajectory of roblem(b) A tyical ground truth trajectory 2. with noise standard deviation equal to 10% of the maximum control signal. Fig. 1. Otimized vs. a tyical execution trajectory for a car-like robot. (A o t ) T P f t B o t (W u t + (B o t ) T P f t B o t ) 1 (B o t ) T P f t A o t +W x t, which is solvable with a terminal condition P f K = Wx t. Remark: The comutations involved in roblem 2 is of the order of O(Kn 2 x) for tyically smooth dynamics for one iteration. Let us assume O(l) is the order of the number of iterations in the otimizer until convergence. The LQR olicy calculation is of order of O(Kn 3 x). Therefore, overall, the design aroach based on the searation rincile of Corollary 1 is O(lKn 2 x + Kn 3 x) for a tyical rocess model (such as our examle in the next section). The low comutational comlexity of this aroach results in fast relanning in case of deviations during execution. This renders the first scheme to be eminently imlementable for imlementation in on-line alications. Remark: For the secific class of roblems considered in [17] (see the last remark in Section IV) the design aroach of [17] requires calculation of the otimal control law through intractable dynamic rogramming. In contrast, the roosed design aroach in this aer utilizes the tractable solution of Maximum Princile roblem followed by an LQR design. Even imlementing the result of [17] through a model redictive aroach would require more comutations of at least an order of the lanning horizon (from O(K) to O(K 2 )). In such an imlementation, the online comutations of the aroach of [17] require O(lKn 2 x) calculations comared to only O(n 2 x) calculations in our algorithm. VI. EXAMPLE Let us consider a car-like four-wheel robot with rocess model [25]: v ẋ = v cos(θ), ẏ = v sin(θ), θ = tan(φ), (22) L where (x, y, θ) is the state, and (v, φ) is the control inut. We suose that, φ < φ max = π/2, v v max = 0.6, x 0 = ( 1.5, 0.5, 0), K = 20, and the time discretization eriod is 0.7. We incororate the control constraints and the terminal goal, x g = ( 0.5, 1, 0), in the cost function. Last, the initial control sequence used for the otimization is just a sequence of zero inuts. The rocess noise is additive mean zero Gaussian noise with a standard deviation equal to ɛ max t { u t 2 }. Figure 1a shows the result of the otimization roblem 2 whereas Fig. 1b shows a tyical ground truth trajectory with ɛ = 0.1. We have used MATLAB (a) Feedback-comensated system. (b) Oen-loo system. Fig. 2. Evolution of average NMSE as ɛ 0 for a feedback comensated and oen loo system with the same nominal trajectories. 2016b and its fmincon solver for simulations. In the next exeriment, we increase ɛ from to , in ste sizes of For each value of ɛ, we execute the resulting olicy 100 times and comute the average Normalized Mean Squared Error (NMSE) as: Average NMSE (%) = x x j x 2 100, (23) j=1 2 where x indicates the lanned trajectory and x j indicates the ground truth trajectory at jth exeriment. The results of this exeriment are shown in Fig. 2a, where the evolution of the average NMSE is deicted for various values of noise level ɛ. As indicated in this figure, as ɛ 0, the average NMSE tends to zero at an exonential rate, which is consistent with the theory develoed in Section II. Moreover, this figure indicates that through the feedback comensation, moderate noise levels can be tolerated, rather than just small levels. Last, Fig. 2b deicts the evolution of the average NMSE for an exeriment with the same setting as in Fig. 2a, excet that only the oen-loo lanned control sequence is alied during execution. As redicted by the theory, the error still decreases exonentially as the noise level decreases. However, the rate of convergence is about one-fifth of the revious rate. The results of Fig. 2 show that our design can be used for relatively moderate levels of noise, using the ower of feedback. Remark: In ractice, if at any oint in the execution the calculated error exceeds a threshold, very raid relanning can be triggered very fast due to the low comutational burden of the otimization roblem. VII. CONCLUSION We have resented a design aroach that searates the design of the oen-loo nominal trajectory and the closedloo feedback olicy for fully-observed nonlinear stochastic systems with Gaussian distributions. We have shown that under a small-noise assumtion, the stochastic cost function is dominated by the nominal art of the cost function and the exected first order linearization error is of mean zero. This results in a reliable raid lanning method that is rovably near-otimal. It can be used in robotic ath lanning and control, and otentially in other alications.
7 REFERENCES [1] P. R. Kumar and P. P. Varaiya, Stochastic Systems: Estimation, Identification, and Adative Control. Englewood Cliffs, NJ: Prentice- Hall, [2] D. P. Bertsekas, D. P. Bertsekas, D. P. Bertsekas, and D. P. Bertsekas, Dynamic rogramming and otimal control. Athena Scientific Belmont, MA, 1995, vol. 1, no. 2. [3] H. Kushner and P. G. Duuis, Numerical methods for stochastic control roblems in continuous time. Sringer Science & Business Media, 2013, vol. 24. [4] R. Bellman, Dynamic Programming, 1st ed. Princeton, NJ, USA: Princeton University Press, [5] C.-S. Chow and J. N. Tsitsiklis, The comlexity of dynamic rogramming, Journal of comlexity, vol. 5, no. 4, , [6] D. Mayne, Robust and stochastic mc: Are we going in the right direction? IFAC-PaersOnLine, vol. 48, no. 23,. 1 8, [7] D. Q. Mayne, Model redictive control: Recent develoments and future romise, Automatica, vol. 50, no. 12, , [8] J. N. Tsitsiklis, Comutational comlexity in markov decision theory, HERMIS-An International Journal of Comuter Mathematics and its Alications, vol. 9, , [9] Y. Le Tallec, Robust, risk-sensitive, and data-driven control of markov decision rocesses, Ph.D. dissertation, Massachusetts Institute of Technology, [10] R. E. Ko, Pontryagin maximum rincile, Mathematics in Science and Engineering, vol. 5, , [11] D. H. Jacobson and D. Q. Mayne, Differential dynamic rogramming, [12] E. Theodorou, Y. Tassa, and E. Todorov, Stochastic differential dynamic rogramming, in American Control Conference (ACC), IEEE, 2010, [13] E. Todorov and W. Li, A generalized iterative lqg method for locallyotimal feedback control of constrained nonlinear stochastic systems, in American Control Conference, Proceedings of the IEEE, 2005, [14] M. I. Freidlin and A. D. Wentzell, Random Perturbations. New York, NY: Sringer US, 1984, [15] A. D. Wentzell, Limit theorems on large deviations for Markov stochastic rocesses. Sringer Science & Business Media, 2012, vol. 38. [16] A. Dembo and O. Zeitouni, Large deviations techniques and alications. Sringer Science & Business Media, 2009, vol. 38. [17] W. H. Fleming, Stochastic control for small noise intensities, SIAM Journal on Control, vol. 9, no. 3, , [18] H. Cruz-Suárez and R. Ilhuicatzi-Roldán, Stochastic otimal control for small noise intensities: The discrete-time case, WSEAS Trans. Math., vol. 9, no. 2, , Feb [19] J. D. Perkins and R. W. H. Sargent, Nonlinear otimal stochastic control some aroximations when the noise is small. Berlin, Heidelberg: Sringer Berlin Heidelberg, 1976, [20] J. Perkins and R. Sargent, Nonlinear otimal stochastic controlsome aroximations when the noise is small, in IFIP Technical Conference on Otimization Techniques. Sringer, 1975, [21] C. J. Holland, An aroximation technique for small noise oen-loo control roblems, Otimal Control Alications and Methods, vol. 2, no. 1. [22] S. S. Varadhan and S. S. Varadhan, Large deviations and alications. SIAM, 1984, vol. 46. [23] D. Bertsekas, Dynamic Programming and Otimal Control: 3rd Ed. Athena Scientific, [24] P. Kumar et al., Control: a ersective, Automatica, vol. 50, no. 1,. 3 43, [25] S. Lavalle, Planning algorithms. Cambridge University Press, APPENDIX Proof. Lemma 1: State Error Proagation Ignoring the validity region, x t+1 =A t x t + B t ũ t + ω t = (A t B t L t ) x t + ω t =:D t x t +ω t =: D t t 0:t x 0 + D r+1:t ω r =: D ω s,tω s. r=0 Note that using the definition of x t, the initial state error is x 0 = x 0 x 0 = x 0 x 0 = 0. Likewise, the state error at time-ste 1 is x 1 = A 0 x 0 +ω 0 = ω 0. Moreover, these errors are consistent with the lemma using the definitions rovided and the indicator function notation. Now, since this equation utilizes the linearizations at all stes, its error is within o(δ), if x s δ for all s t. Moreover, the robability that equation (17) is valid (i.e., the linearizations are valid with o(δ) error for the entire trajectory u to time t) is the same as the robability that the linearization is valid on the last ste (i.e., ste t). This is due to Wentzell-Freidlin theory. Now, the robability that x t δ is given by (7) for rocess g defined in (15), and D t = cl(b δ (x t )) for Theorem 1. Therefore, as ɛ 0, the robability of x t x t δ is calculated as in equation (7), which tends exonentially to zero. Last, note that through Wentzell-Freidlin theory, the validity of linearization only deends on the aggregated effect of the random erturbations at stes rior to t, and there is no need to individually bound the noise at each ste. Proof. Lemma 2, Control Error Proagation Relacing state error in the control law: Using the result of Lemma 1, we can rewrite ũ t+1 for t 1 as follows: t t ũ t+1 = L t+1 x t+1 = L t+1 D ω s,tω s =: L ω s,t+1ω s. Note that ũ 0 = 0, and the last formula is consistent with this error using the definitions rovided in the lemma. Proof. Theorem 3, Cost Function Error Using the linearization rocess described reviously, we can write the cost function error as E[ J 1 ] = E[ (Cx t x t+c u t ũt)+c x K x K]. Utilizing the assumtion that the rocess noise is zero mean i.i.d., E[ω t ] = 0 for all t. Moreover, x 0 = 0 which follows from the fact that x 0 = x 0. Therefore, using the linearity of the exectation oerator and Lemmas 1 and 2, we can rewrite E[ J 1 ] as follows: E[ J 1 ]= (C x t E[ x t ] + C u t E[ũ t ]) + C x KE[ x K ] = t 1 C x t E[ D ω s,t 1ω s ]+ + C x KE[ D ω s,ω s ] t 1 C u t E[ L ω s,tω s ] t 1 = E[(C x D t ω s,t 1 C u t L ω s,t)ω s ]+ E[C x D K ω s,ω s ] t 1 t 1 n u K K =: E[(w s,t ) T ω s ] = ws,te[ω j s] j = 0. j=1 where w s,t := (C x D t ω s,t 1 C u t L ω s,t) T, t 1 s 0, K 1 t 0, w s,k := (C x D K ω s, )T, K 1 s 0. Moreover, w s,t := (ws,t, 1, ws,t nu ) T is a vector of the same size of ω s = (ωs, 1, ωs nu ) T.
Feedback-error control
Chater 4 Feedback-error control 4.1 Introduction This chater exlains the feedback-error (FBE) control scheme originally described by Kawato [, 87, 8]. FBE is a widely used neural network based controller
More informationSTABILITY ANALYSIS AND CONTROL OF STOCHASTIC DYNAMIC SYSTEMS USING POLYNOMIAL CHAOS. A Dissertation JAMES ROBERT FISHER
STABILITY ANALYSIS AND CONTROL OF STOCHASTIC DYNAMIC SYSTEMS USING POLYNOMIAL CHAOS A Dissertation by JAMES ROBERT FISHER Submitted to the Office of Graduate Studies of Texas A&M University in artial fulfillment
More informationSystem Reliability Estimation and Confidence Regions from Subsystem and Full System Tests
009 American Control Conference Hyatt Regency Riverfront, St. Louis, MO, USA June 0-, 009 FrB4. System Reliability Estimation and Confidence Regions from Subsystem and Full System Tests James C. Sall Abstract
More informationAsymptotically Optimal Simulation Allocation under Dependent Sampling
Asymtotically Otimal Simulation Allocation under Deendent Samling Xiaoing Xiong The Robert H. Smith School of Business, University of Maryland, College Park, MD 20742-1815, USA, xiaoingx@yahoo.com Sandee
More informationState Estimation with ARMarkov Models
Deartment of Mechanical and Aerosace Engineering Technical Reort No. 3046, October 1998. Princeton University, Princeton, NJ. State Estimation with ARMarkov Models Ryoung K. Lim 1 Columbia University,
More informationSums of independent random variables
3 Sums of indeendent random variables This lecture collects a number of estimates for sums of indeendent random variables with values in a Banach sace E. We concentrate on sums of the form N γ nx n, where
More informationRobust Predictive Control of Input Constraints and Interference Suppression for Semi-Trailer System
Vol.7, No.7 (4),.37-38 htt://dx.doi.org/.457/ica.4.7.7.3 Robust Predictive Control of Inut Constraints and Interference Suression for Semi-Trailer System Zhao, Yang Electronic and Information Technology
More informationResearch Article An iterative Algorithm for Hemicontractive Mappings in Banach Spaces
Abstract and Alied Analysis Volume 2012, Article ID 264103, 11 ages doi:10.1155/2012/264103 Research Article An iterative Algorithm for Hemicontractive Maings in Banach Saces Youli Yu, 1 Zhitao Wu, 2 and
More informationarxiv: v1 [quant-ph] 20 Jun 2017
A Direct Couling Coherent Quantum Observer for an Oscillatory Quantum Plant Ian R Petersen arxiv:76648v quant-h Jun 7 Abstract A direct couling coherent observer is constructed for a linear quantum lant
More informationCombining Logistic Regression with Kriging for Mapping the Risk of Occurrence of Unexploded Ordnance (UXO)
Combining Logistic Regression with Kriging for Maing the Risk of Occurrence of Unexloded Ordnance (UXO) H. Saito (), P. Goovaerts (), S. A. McKenna (2) Environmental and Water Resources Engineering, Deartment
More informationRobustness of classifiers to uniform l p and Gaussian noise Supplementary material
Robustness of classifiers to uniform l and Gaussian noise Sulementary material Jean-Yves Franceschi Ecole Normale Suérieure de Lyon LIP UMR 5668 Omar Fawzi Ecole Normale Suérieure de Lyon LIP UMR 5668
More informationRecursive Estimation of the Preisach Density function for a Smart Actuator
Recursive Estimation of the Preisach Density function for a Smart Actuator Ram V. Iyer Deartment of Mathematics and Statistics, Texas Tech University, Lubbock, TX 7949-142. ABSTRACT The Preisach oerator
More informationOn split sample and randomized confidence intervals for binomial proportions
On slit samle and randomized confidence intervals for binomial roortions Måns Thulin Deartment of Mathematics, Usala University arxiv:1402.6536v1 [stat.me] 26 Feb 2014 Abstract Slit samle methods have
More informationarxiv: v1 [physics.data-an] 26 Oct 2012
Constraints on Yield Parameters in Extended Maximum Likelihood Fits Till Moritz Karbach a, Maximilian Schlu b a TU Dortmund, Germany, moritz.karbach@cern.ch b TU Dortmund, Germany, maximilian.schlu@cern.ch
More informationAn analytical approximation method for the stabilizing solution of the Hamilton-Jacobi equation based on stable manifold theory
Proceedings of the 27 American Control Conference Marriott Marquis Hotel at Times Square New York City, USA, July -3, 27 ThA8.4 An analytical aroimation method for the stabilizing solution of the Hamilton-Jacobi
More informationNONLINEAR OPTIMIZATION WITH CONVEX CONSTRAINTS. The Goldstein-Levitin-Polyak algorithm
- (23) NLP - NONLINEAR OPTIMIZATION WITH CONVEX CONSTRAINTS The Goldstein-Levitin-Polya algorithm We consider an algorithm for solving the otimization roblem under convex constraints. Although the convexity
More information2-D Analysis for Iterative Learning Controller for Discrete-Time Systems With Variable Initial Conditions Yong FANG 1, and Tommy W. S.
-D Analysis for Iterative Learning Controller for Discrete-ime Systems With Variable Initial Conditions Yong FANG, and ommy W. S. Chow Abstract In this aer, an iterative learning controller alying to linear
More informationAnalysis of some entrance probabilities for killed birth-death processes
Analysis of some entrance robabilities for killed birth-death rocesses Master s Thesis O.J.G. van der Velde Suervisor: Dr. F.M. Sieksma July 5, 207 Mathematical Institute, Leiden University Contents Introduction
More informationEstimation of the large covariance matrix with two-step monotone missing data
Estimation of the large covariance matrix with two-ste monotone missing data Masashi Hyodo, Nobumichi Shutoh 2, Takashi Seo, and Tatjana Pavlenko 3 Deartment of Mathematical Information Science, Tokyo
More informationQuantitative estimates of propagation of chaos for stochastic systems with W 1, kernels
oname manuscrit o. will be inserted by the editor) Quantitative estimates of roagation of chaos for stochastic systems with W, kernels Pierre-Emmanuel Jabin Zhenfu Wang Received: date / Acceted: date Abstract
More informationAge of Information: Whittle Index for Scheduling Stochastic Arrivals
Age of Information: Whittle Index for Scheduling Stochastic Arrivals Yu-Pin Hsu Deartment of Communication Engineering National Taiei University yuinhsu@mail.ntu.edu.tw arxiv:80.03422v2 [math.oc] 7 Ar
More informationImproved Capacity Bounds for the Binary Energy Harvesting Channel
Imroved Caacity Bounds for the Binary Energy Harvesting Channel Kaya Tutuncuoglu 1, Omur Ozel 2, Aylin Yener 1, and Sennur Ulukus 2 1 Deartment of Electrical Engineering, The Pennsylvania State University,
More informationInformation collection on a graph
Information collection on a grah Ilya O. Ryzhov Warren Powell February 10, 2010 Abstract We derive a knowledge gradient olicy for an otimal learning roblem on a grah, in which we use sequential measurements
More informationRobust Solutions to Markov Decision Problems
Robust Solutions to Markov Decision Problems Arnab Nilim and Laurent El Ghaoui Deartment of Electrical Engineering and Comuter Sciences University of California, Berkeley, CA 94720 nilim@eecs.berkeley.edu,
More informationElementary Analysis in Q p
Elementary Analysis in Q Hannah Hutter, May Szedlák, Phili Wirth November 17, 2011 This reort follows very closely the book of Svetlana Katok 1. 1 Sequences and Series In this section we will see some
More informationt 0 Xt sup X t p c p inf t 0
SHARP MAXIMAL L -ESTIMATES FOR MARTINGALES RODRIGO BAÑUELOS AND ADAM OSȨKOWSKI ABSTRACT. Let X be a suermartingale starting from 0 which has only nonnegative jums. For each 0 < < we determine the best
More informationDistributed Rule-Based Inference in the Presence of Redundant Information
istribution Statement : roved for ublic release; distribution is unlimited. istributed Rule-ased Inference in the Presence of Redundant Information June 8, 004 William J. Farrell III Lockheed Martin dvanced
More informationAlgorithms for Air Traffic Flow Management under Stochastic Environments
Algorithms for Air Traffic Flow Management under Stochastic Environments Arnab Nilim and Laurent El Ghaoui Abstract A major ortion of the delay in the Air Traffic Management Systems (ATMS) in US arises
More informationPositivity, local smoothing and Harnack inequalities for very fast diffusion equations
Positivity, local smoothing and Harnack inequalities for very fast diffusion equations Dedicated to Luis Caffarelli for his ucoming 60 th birthday Matteo Bonforte a, b and Juan Luis Vázquez a, c Abstract
More informationMulti-Operation Multi-Machine Scheduling
Multi-Oeration Multi-Machine Scheduling Weizhen Mao he College of William and Mary, Williamsburg VA 3185, USA Abstract. In the multi-oeration scheduling that arises in industrial engineering, each job
More informationOn Isoperimetric Functions of Probability Measures Having Log-Concave Densities with Respect to the Standard Normal Law
On Isoerimetric Functions of Probability Measures Having Log-Concave Densities with Resect to the Standard Normal Law Sergey G. Bobkov Abstract Isoerimetric inequalities are discussed for one-dimensional
More informationLINEAR SYSTEMS WITH POLYNOMIAL UNCERTAINTY STRUCTURE: STABILITY MARGINS AND CONTROL
LINEAR SYSTEMS WITH POLYNOMIAL UNCERTAINTY STRUCTURE: STABILITY MARGINS AND CONTROL Mohammad Bozorg Deatment of Mechanical Engineering University of Yazd P. O. Box 89195-741 Yazd Iran Fax: +98-351-750110
More informationThe non-stochastic multi-armed bandit problem
Submitted for journal ublication. The non-stochastic multi-armed bandit roblem Peter Auer Institute for Theoretical Comuter Science Graz University of Technology A-8010 Graz (Austria) auer@igi.tu-graz.ac.at
More informationEquivalence of Wilson actions
Prog. Theor. Ex. Phys. 05, 03B0 7 ages DOI: 0.093/te/tv30 Equivalence of Wilson actions Physics Deartment, Kobe University, Kobe 657-850, Jaan E-mail: hsonoda@kobe-u.ac.j Received June 6, 05; Revised August
More informationResearch Article Controllability of Linear Discrete-Time Systems with Both Delayed States and Delayed Inputs
Abstract and Alied Analysis Volume 203 Article ID 97546 5 ages htt://dxdoiorg/055/203/97546 Research Article Controllability of Linear Discrete-Time Systems with Both Delayed States and Delayed Inuts Hong
More information4. Score normalization technical details We now discuss the technical details of the score normalization method.
SMT SCORING SYSTEM This document describes the scoring system for the Stanford Math Tournament We begin by giving an overview of the changes to scoring and a non-technical descrition of the scoring rules
More informationESTIMATION OF THE OUTPUT DEVIATION NORM FOR UNCERTAIN, DISCRETE-TIME NONLINEAR SYSTEMS IN A STATE DEPENDENT FORM
Int. J. Al. Math. Comut. Sci. 2007 Vol. 17 No. 4 505 513 DOI: 10.2478/v10006-007-0042-z ESTIMATION OF THE OUTPUT DEVIATION NORM FOR UNCERTAIN DISCRETE-TIME NONLINEAR SYSTEMS IN A STATE DEPENDENT FORM PRZEMYSŁAW
More informationInformation collection on a graph
Information collection on a grah Ilya O. Ryzhov Warren Powell October 25, 2009 Abstract We derive a knowledge gradient olicy for an otimal learning roblem on a grah, in which we use sequential measurements
More informationMATHEMATICAL MODELLING OF THE WIRELESS COMMUNICATION NETWORK
Comuter Modelling and ew Technologies, 5, Vol.9, o., 3-39 Transort and Telecommunication Institute, Lomonosov, LV-9, Riga, Latvia MATHEMATICAL MODELLIG OF THE WIRELESS COMMUICATIO ETWORK M. KOPEETSK Deartment
More informationOn the capacity of the general trapdoor channel with feedback
On the caacity of the general tradoor channel with feedback Jui Wu and Achilleas Anastasooulos Electrical Engineering and Comuter Science Deartment University of Michigan Ann Arbor, MI, 48109-1 email:
More informationAn Analysis of Reliable Classifiers through ROC Isometrics
An Analysis of Reliable Classifiers through ROC Isometrics Stijn Vanderlooy s.vanderlooy@cs.unimaas.nl Ida G. Srinkhuizen-Kuyer kuyer@cs.unimaas.nl Evgueni N. Smirnov smirnov@cs.unimaas.nl MICC-IKAT, Universiteit
More informationA Qualitative Event-based Approach to Multiple Fault Diagnosis in Continuous Systems using Structural Model Decomposition
A Qualitative Event-based Aroach to Multile Fault Diagnosis in Continuous Systems using Structural Model Decomosition Matthew J. Daigle a,,, Anibal Bregon b,, Xenofon Koutsoukos c, Gautam Biswas c, Belarmino
More informationProbability Estimates for Multi-class Classification by Pairwise Coupling
Probability Estimates for Multi-class Classification by Pairwise Couling Ting-Fan Wu Chih-Jen Lin Deartment of Comuter Science National Taiwan University Taiei 06, Taiwan Ruby C. Weng Deartment of Statistics
More informationSome results of convex programming complexity
2012c12 $ Ê Æ Æ 116ò 14Ï Dec., 2012 Oerations Research Transactions Vol.16 No.4 Some results of convex rogramming comlexity LOU Ye 1,2 GAO Yuetian 1 Abstract Recently a number of aers were written that
More informationSUPER-GEOMETRIC CONVERGENCE OF A SPECTRAL ELEMENT METHOD FOR EIGENVALUE PROBLEMS WITH JUMP COEFFICIENTS *
Journal of Comutational Mathematics Vol.8, No.,, 48 48. htt://www.global-sci.org/jcm doi:.48/jcm.9.-m6 SUPER-GEOMETRIC CONVERGENCE OF A SPECTRAL ELEMENT METHOD FOR EIGENVALUE PROBLEMS WITH JUMP COEFFICIENTS
More informationUniformly best wavenumber approximations by spatial central difference operators: An initial investigation
Uniformly best wavenumber aroximations by satial central difference oerators: An initial investigation Vitor Linders and Jan Nordström Abstract A characterisation theorem for best uniform wavenumber aroximations
More informationSTABILITY ANALYSIS TOOL FOR TUNING UNCONSTRAINED DECENTRALIZED MODEL PREDICTIVE CONTROLLERS
STABILITY ANALYSIS TOOL FOR TUNING UNCONSTRAINED DECENTRALIZED MODEL PREDICTIVE CONTROLLERS Massimo Vaccarini Sauro Longhi M. Reza Katebi D.I.I.G.A., Università Politecnica delle Marche, Ancona, Italy
More informationBrownian Motion and Random Prime Factorization
Brownian Motion and Random Prime Factorization Kendrick Tang June 4, 202 Contents Introduction 2 2 Brownian Motion 2 2. Develoing Brownian Motion.................... 2 2.. Measure Saces and Borel Sigma-Algebras.........
More informationIMPROVED BOUNDS IN THE SCALED ENFLO TYPE INEQUALITY FOR BANACH SPACES
IMPROVED BOUNDS IN THE SCALED ENFLO TYPE INEQUALITY FOR BANACH SPACES OHAD GILADI AND ASSAF NAOR Abstract. It is shown that if (, ) is a Banach sace with Rademacher tye 1 then for every n N there exists
More informationMobility-Induced Service Migration in Mobile. Micro-Clouds
arxiv:503054v [csdc] 7 Mar 205 Mobility-Induced Service Migration in Mobile Micro-Clouds Shiiang Wang, Rahul Urgaonkar, Ting He, Murtaza Zafer, Kevin Chan, and Kin K LeungTime Oerating after ossible Deartment
More informationUncorrelated Multilinear Principal Component Analysis for Unsupervised Multilinear Subspace Learning
TNN-2009-P-1186.R2 1 Uncorrelated Multilinear Princial Comonent Analysis for Unsuervised Multilinear Subsace Learning Haiing Lu, K. N. Plataniotis and A. N. Venetsanooulos The Edward S. Rogers Sr. Deartment
More informationStochastic integration II: the Itô integral
13 Stochastic integration II: the Itô integral We have seen in Lecture 6 how to integrate functions Φ : (, ) L (H, E) with resect to an H-cylindrical Brownian motion W H. In this lecture we address the
More informationElements of Asymptotic Theory. James L. Powell Department of Economics University of California, Berkeley
Elements of Asymtotic Theory James L. Powell Deartment of Economics University of California, Berkeley Objectives of Asymtotic Theory While exact results are available for, say, the distribution of the
More informationImproved Bounds on Bell Numbers and on Moments of Sums of Random Variables
Imroved Bounds on Bell Numbers and on Moments of Sums of Random Variables Daniel Berend Tamir Tassa Abstract We rovide bounds for moments of sums of sequences of indeendent random variables. Concentrating
More informationA Bound on the Error of Cross Validation Using the Approximation and Estimation Rates, with Consequences for the Training-Test Split
A Bound on the Error of Cross Validation Using the Aroximation and Estimation Rates, with Consequences for the Training-Test Slit Michael Kearns AT&T Bell Laboratories Murray Hill, NJ 7974 mkearns@research.att.com
More informationApplications to stochastic PDE
15 Alications to stochastic PE In this final lecture we resent some alications of the theory develoed in this course to stochastic artial differential equations. We concentrate on two secific examles:
More informationEvaluating Circuit Reliability Under Probabilistic Gate-Level Fault Models
Evaluating Circuit Reliability Under Probabilistic Gate-Level Fault Models Ketan N. Patel, Igor L. Markov and John P. Hayes University of Michigan, Ann Arbor 48109-2122 {knatel,imarkov,jhayes}@eecs.umich.edu
More informationVarious Proofs for the Decrease Monotonicity of the Schatten s Power Norm, Various Families of R n Norms and Some Open Problems
Int. J. Oen Problems Comt. Math., Vol. 3, No. 2, June 2010 ISSN 1998-6262; Coyright c ICSRS Publication, 2010 www.i-csrs.org Various Proofs for the Decrease Monotonicity of the Schatten s Power Norm, Various
More informationReducing Risk in Convex Order
Reducing Risk in Convex Order Junnan He a, Qihe Tang b and Huan Zhang b a Deartment of Economics Washington University in St. Louis Camus Box 208, St. Louis MO 6330-4899 b Deartment of Statistics and Actuarial
More information1 Extremum Estimators
FINC 9311-21 Financial Econometrics Handout Jialin Yu 1 Extremum Estimators Let θ 0 be a vector of k 1 unknown arameters. Extremum estimators: estimators obtained by maximizing or minimizing some objective
More informationMATH 2710: NOTES FOR ANALYSIS
MATH 270: NOTES FOR ANALYSIS The main ideas we will learn from analysis center around the idea of a limit. Limits occurs in several settings. We will start with finite limits of sequences, then cover infinite
More informationOn a class of Rellich inequalities
On a class of Rellich inequalities G. Barbatis A. Tertikas Dedicated to Professor E.B. Davies on the occasion of his 60th birthday Abstract We rove Rellich and imroved Rellich inequalities that involve
More informationJournal of Mathematical Analysis and Applications
J. Math. Anal. Al. 44 (3) 3 38 Contents lists available at SciVerse ScienceDirect Journal of Mathematical Analysis and Alications journal homeage: www.elsevier.com/locate/jmaa Maximal surface area of a
More informationPArtially observable Markov decision processes
Solving Continuous-State POMDPs via Density Projection Enlu Zhou, Member, IEEE, Michael C. Fu, Fellow, IEEE, and Steven I. Marcus, Fellow, IEEE Abstract Research on numerical solution methods for artially
More informationConvex Optimization methods for Computing Channel Capacity
Convex Otimization methods for Comuting Channel Caacity Abhishek Sinha Laboratory for Information and Decision Systems (LIDS), MIT sinhaa@mit.edu May 15, 2014 We consider a classical comutational roblem
More informationMean Square Stability Analysis of Sampled-Data Supervisory Control Systems
17th IEEE International Conference on Control Alications Part of 28 IEEE Multi-conference on Systems and Control San Antonio, Texas, USA, Setember 3-5, 28 WeA21 Mean Square Stability Analysis of Samled-Data
More information1-way quantum finite automata: strengths, weaknesses and generalizations
1-way quantum finite automata: strengths, weaknesses and generalizations arxiv:quant-h/9802062v3 30 Se 1998 Andris Ambainis UC Berkeley Abstract Rūsiņš Freivalds University of Latvia We study 1-way quantum
More informationOn Doob s Maximal Inequality for Brownian Motion
Stochastic Process. Al. Vol. 69, No., 997, (-5) Research Reort No. 337, 995, Det. Theoret. Statist. Aarhus On Doob s Maximal Inequality for Brownian Motion S. E. GRAVERSEN and G. PESKIR If B = (B t ) t
More informationPaper C Exact Volume Balance Versus Exact Mass Balance in Compositional Reservoir Simulation
Paer C Exact Volume Balance Versus Exact Mass Balance in Comositional Reservoir Simulation Submitted to Comutational Geosciences, December 2005. Exact Volume Balance Versus Exact Mass Balance in Comositional
More informationUncertainty Modeling with Interval Type-2 Fuzzy Logic Systems in Mobile Robotics
Uncertainty Modeling with Interval Tye-2 Fuzzy Logic Systems in Mobile Robotics Ondrej Linda, Student Member, IEEE, Milos Manic, Senior Member, IEEE bstract Interval Tye-2 Fuzzy Logic Systems (IT2 FLSs)
More informationA Special Case Solution to the Perspective 3-Point Problem William J. Wolfe California State University Channel Islands
A Secial Case Solution to the Persective -Point Problem William J. Wolfe California State University Channel Islands william.wolfe@csuci.edu Abstract In this aer we address a secial case of the ersective
More informationLinear diophantine equations for discrete tomography
Journal of X-Ray Science and Technology 10 001 59 66 59 IOS Press Linear diohantine euations for discrete tomograhy Yangbo Ye a,gewang b and Jiehua Zhu a a Deartment of Mathematics, The University of Iowa,
More informationShadow Computing: An Energy-Aware Fault Tolerant Computing Model
Shadow Comuting: An Energy-Aware Fault Tolerant Comuting Model Bryan Mills, Taieb Znati, Rami Melhem Deartment of Comuter Science University of Pittsburgh (bmills, znati, melhem)@cs.itt.edu Index Terms
More informationAnalysis of Multi-Hop Emergency Message Propagation in Vehicular Ad Hoc Networks
Analysis of Multi-Ho Emergency Message Proagation in Vehicular Ad Hoc Networks ABSTRACT Vehicular Ad Hoc Networks (VANETs) are attracting the attention of researchers, industry, and governments for their
More informationLocation of solutions for quasi-linear elliptic equations with general gradient dependence
Electronic Journal of Qualitative Theory of Differential Equations 217, No. 87, 1 1; htts://doi.org/1.14232/ejqtde.217.1.87 www.math.u-szeged.hu/ejqtde/ Location of solutions for quasi-linear ellitic equations
More informationCharacterizing the Behavior of a Probabilistic CMOS Switch Through Analytical Models and Its Verification Through Simulations
Characterizing the Behavior of a Probabilistic CMOS Switch Through Analytical Models and Its Verification Through Simulations PINAR KORKMAZ, BILGE E. S. AKGUL and KRISHNA V. PALEM Georgia Institute of
More informationGeneralized Coiflets: A New Family of Orthonormal Wavelets
Generalized Coiflets A New Family of Orthonormal Wavelets Dong Wei, Alan C Bovik, and Brian L Evans Laboratory for Image and Video Engineering Deartment of Electrical and Comuter Engineering The University
More informationSolutions of the Duffing and Painlevé-Gambier Equations by Generalized Sundman Transformation
Solutions of the Duffing and Painlevé-Gambier Equations by Generalized Sundman Transformation D.K.K. Adjaï a, L. H. Koudahoun a, J. Akande a, Y.J.F. Komahou b and M. D. Monsia a 1 a Deartment of Physics,
More informationON THE LEAST SIGNIFICANT p ADIC DIGITS OF CERTAIN LUCAS NUMBERS
#A13 INTEGERS 14 (014) ON THE LEAST SIGNIFICANT ADIC DIGITS OF CERTAIN LUCAS NUMBERS Tamás Lengyel Deartment of Mathematics, Occidental College, Los Angeles, California lengyel@oxy.edu Received: 6/13/13,
More informationLecture 6. 2 Recurrence/transience, harmonic functions and martingales
Lecture 6 Classification of states We have shown that all states of an irreducible countable state Markov chain must of the same tye. This gives rise to the following classification. Definition. [Classification
More informationPositive Definite Uncertain Homogeneous Matrix Polynomials: Analysis and Application
BULGARIA ACADEMY OF SCIECES CYBEREICS AD IFORMAIO ECHOLOGIES Volume 9 o 3 Sofia 009 Positive Definite Uncertain Homogeneous Matrix Polynomials: Analysis and Alication Svetoslav Savov Institute of Information
More informationElements of Asymptotic Theory. James L. Powell Department of Economics University of California, Berkeley
Elements of Asymtotic Theory James L. Powell Deartment of Economics University of California, Berkeley Objectives of Asymtotic Theory While exact results are available for, say, the distribution of the
More informationFault Tolerant Quantum Computing Robert Rogers, Thomas Sylwester, Abe Pauls
CIS 410/510, Introduction to Quantum Information Theory Due: June 8th, 2016 Sring 2016, University of Oregon Date: June 7, 2016 Fault Tolerant Quantum Comuting Robert Rogers, Thomas Sylwester, Abe Pauls
More informationDesign of NARMA L-2 Control of Nonlinear Inverted Pendulum
International Research Journal of Alied and Basic Sciences 016 Available online at www.irjabs.com ISSN 51-838X / Vol, 10 (6): 679-684 Science Exlorer Publications Design of NARMA L- Control of Nonlinear
More informationMODELING THE RELIABILITY OF C4ISR SYSTEMS HARDWARE/SOFTWARE COMPONENTS USING AN IMPROVED MARKOV MODEL
Technical Sciences and Alied Mathematics MODELING THE RELIABILITY OF CISR SYSTEMS HARDWARE/SOFTWARE COMPONENTS USING AN IMPROVED MARKOV MODEL Cezar VASILESCU Regional Deartment of Defense Resources Management
More informationTopic 7: Using identity types
Toic 7: Using identity tyes June 10, 2014 Now we would like to learn how to use identity tyes and how to do some actual mathematics with them. By now we have essentially introduced all inference rules
More informationHow to Estimate Expected Shortfall When Probabilities Are Known with Interval or Fuzzy Uncertainty
How to Estimate Exected Shortfall When Probabilities Are Known with Interval or Fuzzy Uncertainty Christian Servin Information Technology Deartment El Paso Community College El Paso, TX 7995, USA cservin@gmail.com
More informationDeveloping A Deterioration Probabilistic Model for Rail Wear
International Journal of Traffic and Transortation Engineering 2012, 1(2): 13-18 DOI: 10.5923/j.ijtte.20120102.02 Develoing A Deterioration Probabilistic Model for Rail Wear Jabbar-Ali Zakeri *, Shahrbanoo
More informationPartial Identification in Triangular Systems of Equations with Binary Dependent Variables
Partial Identification in Triangular Systems of Equations with Binary Deendent Variables Azeem M. Shaikh Deartment of Economics University of Chicago amshaikh@uchicago.edu Edward J. Vytlacil Deartment
More informationOn Wald-Type Optimal Stopping for Brownian Motion
J Al Probab Vol 34, No 1, 1997, (66-73) Prerint Ser No 1, 1994, Math Inst Aarhus On Wald-Tye Otimal Stoing for Brownian Motion S RAVRSN and PSKIR The solution is resented to all otimal stoing roblems of
More informationPETER J. GRABNER AND ARNOLD KNOPFMACHER
ARITHMETIC AND METRIC PROPERTIES OF -ADIC ENGEL SERIES EXPANSIONS PETER J. GRABNER AND ARNOLD KNOPFMACHER Abstract. We derive a characterization of rational numbers in terms of their unique -adic Engel
More informationYixi Shi. Jose Blanchet. IEOR Department Columbia University New York, NY 10027, USA. IEOR Department Columbia University New York, NY 10027, USA
Proceedings of the 2011 Winter Simulation Conference S. Jain, R. R. Creasey, J. Himmelsach, K. P. White, and M. Fu, eds. EFFICIENT RARE EVENT SIMULATION FOR HEAVY-TAILED SYSTEMS VIA CROSS ENTROPY Jose
More informationTHE 3-DOF helicopter system is a benchmark laboratory
Vol:8, No:8, 14 LQR Based PID Controller Design for 3-DOF Helicoter System Santosh Kr. Choudhary International Science Index, Electrical and Information Engineering Vol:8, No:8, 14 waset.org/publication/9999411
More informationHidden Predictors: A Factor Analysis Primer
Hidden Predictors: A Factor Analysis Primer Ryan C Sanchez Western Washington University Factor Analysis is a owerful statistical method in the modern research sychologist s toolbag When used roerly, factor
More informationPreconditioning techniques for Newton s method for the incompressible Navier Stokes equations
Preconditioning techniques for Newton s method for the incomressible Navier Stokes equations H. C. ELMAN 1, D. LOGHIN 2 and A. J. WATHEN 3 1 Deartment of Comuter Science, University of Maryland, College
More informationApproximating min-max k-clustering
Aroximating min-max k-clustering Asaf Levin July 24, 2007 Abstract We consider the roblems of set artitioning into k clusters with minimum total cost and minimum of the maximum cost of a cluster. The cost
More informationRUN-TO-RUN CONTROL AND PERFORMANCE MONITORING OF OVERLAY IN SEMICONDUCTOR MANUFACTURING. 3 Department of Chemical Engineering
Coyright 2002 IFAC 15th Triennial World Congress, Barcelona, Sain RUN-TO-RUN CONTROL AND PERFORMANCE MONITORING OF OVERLAY IN SEMICONDUCTOR MANUFACTURING C.A. Bode 1, B.S. Ko 2, and T.F. Edgar 3 1 Advanced
More informationIndirect Rotor Field Orientation Vector Control for Induction Motor Drives in the Absence of Current Sensors
Indirect Rotor Field Orientation Vector Control for Induction Motor Drives in the Absence of Current Sensors Z. S. WANG *, S. L. HO ** * College of Electrical Engineering, Zhejiang University, Hangzhou
More informationLECTURE 7 NOTES. x n. d x if. E [g(x n )] E [g(x)]
LECTURE 7 NOTES 1. Convergence of random variables. Before delving into the large samle roerties of the MLE, we review some concets from large samle theory. 1. Convergence in robability: x n x if, for
More informationAutomatic Generation and Integration of Equations of Motion for Linked Mechanical Systems
Automatic Generation and Integration of Equations of Motion for Linked Mechanical Systems D. Todd Griffith a, John L. Junkins a, and James D. Turner b a Deartment of Aerosace Engineering, Texas A&M University,
More information