NEW FRONTIERS IN APPLIED PROBABILITY

Size: px
Start display at page:

Download "NEW FRONTIERS IN APPLIED PROBABILITY"

Transcription

1 J. Appl. Prob. Spec. Vol. 48A, (2011) Applied Probability Trust 2011 NEW FRONTIERS IN APPLIED PROBABILITY A Festschrift for SØREN ASMUSSEN Edited by P. GLYNN, T. MIKOSCH and T. ROLSKI Part 4. Simulation A COMPARISON OF CROSS-ENTROPY AND VARIANCE MINIMIZATION STRATEGIES JOSHUA C. C. CHAN, Australian National Uniersity Research School of Economics, Australian National Uniersity, Canberra, ACT 0200, Australia. PETER W. GLYNN, Stanford Uniersity Department of Management Science and Engineering, Huang Engineering Center, Stanford Uniersity, 475 Via Ortega, Stanford, CA , USA. DIRK P. KROESE, Uniersity of Queensland Department of Mathematics, Uniersity of Queensland, St Lucia, Brisbane, QLD 4072, Australia. address: kroese@maths.uq.edu.au APPLIED PROBABILITY TRUST AUGUST 2011

2 A COMPARISON OF CROSS-ENTROPY AND VARIANCE MINIMIZATION STRATEGIES BY JOSHUA C. C. CHAN, PETER W. GLYNN AND DIRK P. KROESE Abstract The ariance minimization (VM) and cross-entropy (CE) methods are two ersatile adaptie importance sampling procedures that hae been successfully applied to a wide ariety of difficult rare-eent estimation problems. We compare these two methods ia arious examples where the optimal VM and CE importance densities can be obtained analytically. We find that in the cases studied both VM and CE methods prescribe the same importance sampling parameters, suggesting that the criterion of minimizing the CE distance is ery close, if not asymptotically identical, to minimizing the ariance of the associated importance sampling estimator. Keywords: Variance minimization; cross entropy; importance sampling; rare-eent simulation; likelihood ratio degeneracy 2010 Mathematics Subject Classification: Primary 65C05 Secondary 65C60 1. Introduction In this article we compare two adaptie importance sampling procedures, namely the ariance minimization (VM) and cross-entropy (CE) methods [11], [16, pp ], in the context of rare-eent simulation. Both algorithms aim to find an importance density that is optimal in a well-defined sense, though the optimality criteria are different. Under the VM method, the optimal importance density is the one whose associated estimator has minimum ariance within a gien parametric family. Although this minimum ariance criterion is obiously desirable, in practice the minimization problem inoles application of time consuming numerical techniques. Instead of directly minimizing the ariance of the estimator, the CE method seeks to locate the importance density that is closest in Kullback Leibler diergence or CE distance to the zero-ariance importance density: the conditional density gien the rare eent. The main adantage of the CE method is that the optimization problem required to obtain the optimal density often admits closed-form solutions. To compare these two related but distinct algorithms, we consider arious explicit examples where the optimal VM and CE importance densities can be obtained analytically. In all the examples considered, we find that the optimal VM and CE importance densities are asymptotically identical. Although whether this result holds in general or not is an open question, it suggests that the VM and CE criteria are ery similar, at least asymptotically. Put differently, the importance density that is the closest in CE distance to the zero-ariance importance density is also the one whose associated estimator has the minimum asymptotic ariance. The significance of this obseration is that since CE estimators are typically easier to obtain, this practical adaptie importance sampling strategy is also optimal in the sense that it gies the Applied Probability Trust

3 184 J. C. C. CHAN ET AL. minimum ariance importance sampling estimator. Furthermore, in situations where the VM or CE optimization problem does not admit closed-form solutions, the optimal parameters need to be estimated ia a multileel procedure. We analyze how the ariability in the estimates affects the performance of the associated importance sampling estimator. The rest of this article is organized as follows. In Section 2 we first introduce some background material, and then discuss the classic VM and CE methods as well as two ariants proposed recently. We consider in Section 3 the sum of independent but not necessarily identical random ariables in the exponential families where the number of parameters is sent to infinity. We show in this scenario that the optimal CE parameters coincide with those suggested by large deiation theory. It is followed by three case studies: we consider the example of the sum of exponential random ariables in Section 4, and the cases for Pareto and Weibull random ariables in Sections 5 and 6, respectiely. 2. Adaptie importance sampling ia VM and CE methods We first introduce some standard notation and efficiency measures in the context of rare-eent simulation. We write a(t) b(t) to indicate that lim t a(t)/b(t) = 1, and X i = f, i = 1,...,n, to indicate that X 1,...,X n are independent and identically distributed () according to the density or distribution f. An unbiased estimator Z(γ) for l(γ ) is said to be logarithmically efficient, weakly efficient,orasymptotically optimal if log E Z(γ) 2 lim = 2. γ log l(γ ) This condition is equialent to the requirement that lim γ E Z(γ) 2 /l(γ ) 2 ε = 0 for eery ε > 0. The estimator is said to be strongly efficient or hae bounded relatie error if sup γ 0 E Z(γ) 2 /l(γ ) 2 <. It is readily obsered that bounded relatie error implies asymptotic optimality. These notions of efficiency are standard in the literature; see, for example, [1, pp ] and [13]. We are interested in estimating the probability of the form l = P(S(X) >γ)= 1(S(x) >γ)f(x) dx, where S is some real-alued performance function, X is a ector of random ariables with probability density function (PDF) f, and γ is a sufficiently large constant such that l is small. Consider estimating l ia the importance sampling estimator l IS = 1 N N 1(S(X i )>γ) f(x i) g(x i ), where X i = g, i = 1,...,N, for some importance sampling PDF g for which g(x) = 0 implies that 1(S(x) >γ)f(x) = 0 for all x. Although the estimator l IS is consistent and unbiased for any such g, its performance depends critically on the choice of g. Hence, we wish to choose g so that the associated estimator is optimal in a well-defined sense. To this end, consider a parametric family F ={f(x; )} indexed by a parameter ector that contains the nominal (original) density f. Thus, we can write f(x) = f(x; u) for some parameter ector u. For any gien, the general term of the associated importance sampling estimator is Z() = W(X; u, ) 1(S(X) > γ), where W(x; u, ) is the likelihood ratio defined as W(x; u, ) = f(x; u)/f (x; ). Now, we wish to choose so that the associated importance

4 A comparison of CE and VM strategies 185 sampling estimator has minimum ariance within the parametric family F. The minimizer m is referred to as the optimal VM parameter ector. For any unbiased estimator l of l,wehae ar l = E l 2 l 2. Therefore, m can be written as m = argmin E Z() 2 = argmin E u Z() = argmin log E u Z(), (2.1) where the expectation operators E u and E are taken with respect to the densities f( ; u) and f( ; ), respectiely. A related approach to locating a good importance density inoles the Kullback Leibler diergence,orce distance. To motiate the method, first note that the zeroariance importance density for estimating l is simply g (x) = l 1 f(x; u) 1(S(x) >γ) the conditional density gien the rare eent. Obiously, g cannot be used directly in practice as it inoles the unknown constant l. Neertheless, this proides a practical criterion to locate a good importance density. Specifically, if we choose the density within F that is the closest to g in the CE distance, then intuitiely the associated estimator should hae reasonable performance. Let ce denote the minimizer, which we refer to as the optimal CE parameter ector. It can be shown [16, pp ] that soling the CE minimization problem is equialent to finding ce = argmax f(x; u) 1(S(x) >γ)log f(x; ) dx. (2.2) Although the optimal CE and VM parameter ectors can be obtained analytically for a few specific cases, in general the optimization problems in (2.1) and (2.2) are difficult to sole. Thus, in practice, we often need to estimate m or ce ia a multileel procedure, which we shall call multileel VM or CE (see [11] for a more thorough discussion). Recent research has shown that in certain high-dimensional cases the estimates for m and ce obtained from the multileel procedures are not accurate, and, as a consequence, the associated estimators perform poorly [8], [9], [14], [15]. A recent ariant, called the screening method, introduced in [15], aims to reduce the dimension of the likelihood ratio, and is shown to perform better than the multileel VM and CE methods in arious high-dimensional estimation problems. To motiate the method, partition the parameter ector u into two subsets: u = (u 0, u 1 ), where the occurrence of the rare eent {S(X) >γ} is substantially affected by u 1 but not by u 0. The ector u 1 is referred to as the bottleneck parameter. Now consider the parametric family F 0 ={f(x; ṽ)} indexed by ṽ, where ṽ = (u 0, 1 ) and u 0 is fixed therefore, F 0 is in fact indexed by 1. The screening method proceeds in the same way as the multileel CE and VM methods, but instead of twisting the whole parameter ector u, we only twist the bottleneck parameter u 1. Since F 0 F, the ariance of the importance sampling estimator associated with m is at least as small as the ariance of the estimator associated with ṽ m simply by definition. Paradoxically, howeer, the empirical findings in [15] suggest otherwise for the situation where the parameters are estimated ia a multileel procedure. A possible explanation is that the parameter ector obtained ia the multileel procedure, say, m,t, is not an accurate estimate for m. By reducing the dimension of the likelihood ratio ia the screening method, we can estimate ṽ m the optimal VM parameter ector within F 0 more accurately. As a result, the importance density f(x; ṽ m,t ), where ṽ m,t denotes the parameter ector obtained ia the screening method, is closer to g compared to f(x; m,t ), and, thus, the estimator associated with the former density has a smaller ariance than that of the latter. Another improed ariant proposed by Chan and Kroese [8] aims to estimate ce in one step so as to circument likelihood degeneracy of the estimation procedure. Specifically, instead of

5 186 J. C. C. CHAN ET AL. the multileel procedure in the classic CE method, they proposed estimating ce by finding ce = argmax M log f(x j ; ), j=1 where X 1,...,X M are draws from g. They demonstrated that the improed CE method does not only gie substantial improement oer the traditional approach but also works well in high-dimensional estimation problems. Generating draws from g, howeer, might not be triial in general, but with the adent of Marko chain Monte Carlo (MCMC) methods, this problem is well studied and a ariety of techniques are aailable. In Sections 4 6 we consider arious concrete examples where we can derie asymptotic expressions for m and ce, and we show that they are identical asymptotically. We then compute the asymptotic ariances of the associated estimators and inestigate how they are affected by the estimation errors introduced in the multileel VM and CE approaches. 3. Sum of independent nonidentical random ariables in the exponential families In this section we consider the rare-eent regime where the number of random ariables n approaches infinity. To set the stage, suppose that X 1,X 2,... is a sequence of independent but not necessarily identical random ariables where each X j belongs to a one-parameter exponential family parameterized by the mean; that is, the density of each X j is gien by f j (x; u j ) = e xθ(u j ) ζ(θ(u j )) h j (x). (3.1) Let µ n = n E X i and σ 2 n = n ar X i. We are interested in estimating the probabilities l n = P(S n >nb) as n, where S n = X 1 + +X n and lim n (nb µ n )/σ n =. We will show that the optimal CE parameters coincide with those suggested by large deiation theory. More specifically, the CE method suggests twisting the means of the random ariables such that their sum is equal to the threshold nb. Proposition 3.1. Let X 1,X 2,...be a sequence of independent random ariables such that X j belongs to a one-parameter exponential family parameterized by the mean with PDF gien in (3.1). Consider estimating l n = P(S n >nb)as n ia the CE method with importance density of the form n f i (x i ; i ). Suppose that lim n (nb µ n )/σ n =, where µ n = n E X i and σ 2 n = n ar X i. Then the optimal CE parameters ce,1, ce,2,...satisfy n ce,i nb as n. Proof. First note that to estimate l n, the optimal CE parameters ce,i,i= 1,...,n, are gien in [17, p. 320]: ce,i = E[X i S n >nb]. Therefore, n ce,i = E[S n S n >nb]. By the central limit theorem, (S n µ n )/σ n is asymptotically N(0, 1) distributed as n. Therefore, E[S n S n >nb] µ n + ϕ((nb µ n)/σ n ) 1 ((nb µ n )/σ n ) σ n µ n + nb µ n σ n = nb σ n as n, where ϕ( ) and ( ) are respectiely the PDF and cumulatie distribution function (CDF) of the standard normal distribution hence, the desired result.

6 A comparison of CE and VM strategies 187 In what follows, we consider a different rare-eent regime where we keep the number of random ariables n fixed and let γ go to infinity. 4. Sum of exponential random ariables Consider the estimation of l = P(X 1 + +X n >γ)ia importance sampling, where X i = Exp(1), i = 1...,n; that is, X i has PDF f(x)= e x,x 0. Note that n 1 l = e γ k=0 γ k k! = Ɣ(n, γ ) Ɣ(n), (4.1) where Ɣ(n) = (n 1)! and Ɣ(n, γ ) is the alue of the (upper) incomplete gamma function at (n, γ ). Suppose that we generate X i = Exp( 1 ), i = 1,...,n,with PDF f(x; 1 ) = 1 exp( 1 x). It follows that the general term in the importance sampling estimator is Z() = 1(X 1 + +X n > γ )W(X; 1, 1 ), where the likelihood ratio is gien by ( ( W(x; 1, 1 ) = n exp 1 1 ) n x i ). We first derie the asymptotic expressions for the optimal VM and CE parameters and show that they are the same. We note that Rubinstein and Kroese [16, p. 69] proed the special case for n = 1. Proposition 4.1. Let X i = Exp(1), i = 1,...,n. To estimate l = P(X 1 + +X n >γ)ia importance sampling, suppose that we generate X i identically from the Exp( 1 ) distribution. Then the optimal VM and CE parameters are asymptotically the same. In fact, we hae m γ/nand ce γ/n. Proof. To obtain the optimal VM parameter, we first derie an asymptotic expression for the second moment of the importance sampling estimator Z(): E 1/ Z() 2 = E 1 Z() = xi >γ ( ( n exp 1 1 ) n n x i) e x i dx = n ( 2 1 ) n P(Y 1 + +Y n >γ). Here Y i = Exp(2 1/), i = 1,...,n, for > 2 1. Therefore, we hae E 1/ Z() 2 e 2γ γ n 1 (n 1)! n e γ/ 2 1/ as γ. To obtain m, we differentiate log E 1/ Z() 2 with respect to and sole the equation (with the constraint that > 2 1 ): d d log E 1/ Z() 2 n γ = 0.

7 188 J. C. C. CHAN ET AL. It follows that m = γ + γ 2 nγ + (n + 1) 2 /4 2n + n + 1 4n + O(γ 1 ) γ n. To compute the optimal CE parameter, we first note that the exponential distribution is a member of the exponential family. Therefore, ce = E[X 1 X 1 + +X n >γ]= 1 E[Y Y>γ], n where Y = D Gamma(n, 1). Direct computation shows (see also [16, p. 78]) that E[Y Y > γ ]=Ɣ(n + 1, γ )/ Ɣ(n, γ ). Since we hae Ɣ(n + 1,γ)= nɣ(n, γ ) + γ n e γ and Ɣ(n, γ ) = γ n 1 e γ (1 + O(γ 1 )), E[Y Y>γ]=n + γ n e γ Ɣ(n, γ ) = n + γ(1 + O(γ 1 )). It follows that ce = γ/n+ 1 + O(γ 1 ) γ/nas γ. Therefore, by Proposition 4.1, the optimal VM parameter is asymptotically identical to that gien by the CE program when γ. We show in the next proposition that either m or ce in fact gies an asymptotically optimal importance sampling estimator for l. In addition, by the definition of m, no other importance sampling estimators obtained by generating X i identically from Exp( 1 ) can be strongly efficient. In what follows, we also inestigate how the estimation error in obtaining ce affects the relatie error of the associated importance sampling estimator. Proposition 4.2. Under the same assumptions as in Proposition 4.1, if we set = γ/n+ h for some constant h then E 1/ Z() 2 ( l 2 = en (n 1)! 2n n γ 1 + 2n 1 + n ) 2γ 4γ 2 (2n2 h 2 2nh + 3n 2) + O(γ 3 ) (4.2) as γ. In particular, the optimal VM/CE parameter gies an asymptotically optimal estimator. Proof. Let = γ/n+ h. By a similar computation as in Proposition 4.1 we hae E 1/ Z() 2 = = n Ɣ(n, (2 1/)γ ) (2 1/) n Ɣ(n) ( n e 2γ e γ/ (2 1/)(n 1)! γ n n 1 2 1/ γ 1 + O(γ 2 ) ). (4.3) Substituting = γ/n+h and using the expression for l in (4.1) gies (4.2). The final statement in the proposition follows by setting = γ/n m. It is worth noting that in (4.2) only the coefficient of the third order term 1/γ 3 inoles h, and that the magnitude of h does not affect the asymptotic efficiency of the importance sampling estimator. Howeer, when the dimension of the estimation problem n is large, h might hae a

8 A comparison of CE and VM strategies 189 substantial impact on the ariance of the importance sampling estimator for any finite γ. This explains why the multileel CE estimator generally works well in problems with light-tailed random ariables, but sometimes breaks down when the dimension of the problem becomes large; see, e.g. [8]. We now inestigate what happens when the importance sampling parameter is obtained ia a random procedure, such as in the multileel VM or CE method. Let us denote the random parameter thus obtained by V, which is independent of the {Z i } used in the importance sampling estimator. In the CE procedure the reference parameter V is obtained as V = Nk=1 1(S k >γ)w k (w)s k n N k=1 1(S k >γ)w k (w), (4.4) where W k (w) = W(X k ; 1, 1/w) is the kth likelihood ratio corresponding to a reference parameter w obtained in the penultimate iteration, and S k = Gamma(n, 1/w), k = 1,...,N. The parameter w is usually random as well for example when obtained ia a CE procedure. Suppose, howeer, that w is some arbitrarily fixed reference parameter. The asymptotic distribution of V as a function of w is gien in the next proposition. Proposition 4.3. Under the same assumptions as in Proposition 4.1, the CE reference parameter V gien in (4.4) is asymptotically normal as N with mean ce and ariance σγ,w 2 /N. Furthermore, we hae ( σγ,w ) 2 w n (n 1)! n (2 1/w) γ n+3 e γ/w as γ. In particular, σ 2 γ,γ/n γ 3 (n 1)! (1 1/n)2 e n 2n n. Proof. First note that V gien in (4.4) is a ratio estimator. By the delta method [1, pp ], the asymptotic distribution is normal with mean and ariance σγ,w 2 /N, with µ = E 1/w 1(S > γ )W (w)s n E 1/w 1(S > γ )W (w) = E 1 1(S > γ )S n E 1 1(S > γ ) = ce σγ,w 2 = ar(a) 2µ co(a, B) + µ2 ar(b) l 2, where A = 1(S > γ )W (w)s, B = 1(S > γ )W (w), and S is Gamma(n, 1/w) distributed. The second moment of B is gien in (4.3) with w substituted for. The expectation of A is simply E 1/w A = E 1 1(S > γ )S = l ce. The second moment of A is E 1/w A 2 = 1(s > γ )w n e (1 1/w)s s 2 1 Ɣ(n) sn 1 e s ds n(n + 1) Ɣ(n + 2,(2 1/w)γ ) = (2 1/w) n+2 Ɣ(n + 2) γ 2 E w B 2.

9 190 J. C. C. CHAN ET AL. Moreoer, E 1/w AB = E 1 1(S > γ )W (w)s γ E w B 2. It follows, after some algebra, that, for n>1, ( σγ,w ) 2 w n (n 1)! n (2 1/w) γ n+3 e γ/w. This completes the proof. Note that the asymptotic ariance of the CE reference parameter V is cubic in γ when we set w = γ/n m. Therefore, een though ce gies an asymptotically optimal estimator, when γ is sufficiently large, the estimation error in obtaining ce in the multileel CE procedure might be so substantial that it renders the resulting importance sampling estimator unreliable. 5. Sum of Pareto random ariables We now consider estimating the tail probability of the sum of heay-tailed random ariables. Specifically, we wish to estimate l = P(X 1 + +X n >γ)ia importance sampling, where X i = Pareto(1, 1), i = 1,..., n. Since the Pareto distribution is subexponential [1, pp ], we hae l n/(1 + γ)as γ. To estimate l ia importance sampling, we consider the Pareto(α, 1) family indexed by α > 0 with PDF f(x; α) = α(1 + x) (α+1),x 0. Now suppose that we generate X i = Pareto(α, 1), i = 1,...,n. The general term of the likelihood ratio is W(x; 1,α)= n (1 + x i ) 2 α(1 + x i ) (α+1) = α n and the corresponding importance sampling estimator is n (1 + x i ) (1 α), Z(α) = 1(X 1 + +X n > γ )W(X; 1,α). In the following proposition we show that the optimal VM and CE parameters are identical. In fact, we show that α m n/ log(1 + γ), which gies the minimum ariance estimator within the class of importance sampling estimators obtained by generating X i = Pareto(α, 1) for i = 1,...,n. Compare this with the suggestions in [3] and [10]. Proposition 5.1. Let X i = Pareto(1, 1), i = 1,...,n. Suppose that we wish to estimate l = P(X X n >γ)ia importance sampling by generating X i = Pareto(α, 1). Then the optimal VM and CE parameters for α are asymptotically the same. In fact, we hae α m n/ log(1 + γ). Proof. Note that the optimal CE parameter for α is gien in [4]: α ce = (1 + log(1 + γ)/n) 1 n/ log(1 + γ).to compute the optimal VM parameter, we first derie the second moment of Z(α) with respect to the Pareto(α, 1) distribution: E α Z(α) 2 = E 1 Z(α) = α n xi >γ n (1 + x i ) (1 α) (1 + x i ) 2 dx = (2α α 2 ) n P(Y 1 + +Y n >γ). Here Y i = Pareto(2 α, 1), i = 1,..., n, proided that α < 2. Hence, E α Z(α) 2 (2α α 2 ) n n(1 + γ) (2 α). (5.1)

10 A comparison of CE and VM strategies 191 By a computation similar to that in Proposition 4.1 we hae α m = (2/n)log(1 + γ)+ 2 (4/n 2 ) log 2 (1 + γ)+ 4 + O(log 2 (1 + γ)) (2/n)log(1 + γ) n log(1 + γ). Again, the optimal VM parameter is asymptotically identical to that gien by the CE program as γ. We next inestigate how the choice of the parameter α affects the growth rate of E α Z(α) 2 /l 2. As a corollary to Proposition 5.1, we show that α = α ce n/ log(1 + γ)gies an importance sampling estimator that is asymptotically optimal. We note that Asmussen and Kroese [2] proided a conditional Monte Carlo estimator that has bounded relatie error for the case of the sum of Pareto random ariables. In addition, by utilizing a technique based on Lyapunotype inequalities first introduced in [5], Blanchet and Li [6] were able to derie an importance sampling estimator that achiees bounded relatie error for general subexponential distributions. Proposition 5.2. Under the same assumptions as in Proposition 5.1, if we set α = n/ log(1 + γ)+ h for some constant h such that 0 <α<2, then E α Z(α) 2 ( l 2 en n γ h 2n h(2 h) + log(1 + γ) n 2 ) n log 2 as γ. (5.2) (1 + γ) In particular, the optimal VM/CE parameter gies an asymptotically optimal estimator. Proof. By (5.1) and the fact that l n/(1 + γ),wehae E α Z(α) 2 l 2 1 n(2α α 2 ) n (1 + γ)α. Hence, if we set α = n/ log(1 + γ) + h then (5.2) follows. As a result, for α = α ce n/ log(1 + γ),we hae E α Z(α) 2 ( l 2 en 2n n log(1 + γ) n 2 ) n log 2 as γ. (1 + γ) This completes the proof. It is of interest to note that in contrast to the light-tailed case, the estimation error h does increase the asymptotic ariance of the importance sampling estimator. Therefore, the problem of suboptimal VM and CE reference parameters is expected to be more seere in the heay-tailed case. 6. Sum of Weibull random ariables Consider the same estimation problem as in the last section, but now X i = Weib(β, 1), i = 1,...,n, for 0 <β<1; that is, X i has PDF f(x; β) = βx β 1 e xβ. We wish to estimate the tail probability l ia importance sampling by tilting the scale parameter. That is, we locate the importance density within the parametric family Weib(β, θ) with PDF f(x; β, θ) = θβx β 1 e θxβ,x 0, indexed by θ>0while keeping β fixed. It follows that the general term

11 192 J. C. C. CHAN ET AL. of the importance sampling estimator is with likelihood ratio Z(θ) = 1(X 1 + +X n > γ )W(X; 1,θ), ( W(x; 1,θ)= θ n exp (1 θ) Again, for this sum of Weibull random ariables case, the optimal VM and CE parameters coincide asymptotically. Kroese and Rubinstein [12] proed that, for the n = 2 case, the optimal VM parameter is asymptotically θ m 2/γ β. They conjectured that, for general n, θ m n/γ β, and the squared relatie error of the resulting estimator increases proportionally to γ nβ. The following propositions proe these two conjectures. Proposition 6.1. Let X i = Weib(β, 1), i = 1,..., n, with 0 < β < 1. Suppose that we wish to estimate l = P(X X n >γ)ia importance sampling by generating X i = Weib(β, θ). Then the optimal VM and CE parameters for θ are asymptotically identical. In fact, we hae θ m n/γ β. Proof. First note that the optimal CE parameter for θ is gien in [4] and [16]: θ ce = n/(n + γ β ) n/γ β. Next we compute the optimal VM parameter as follows: E θ Z(θ) 2 = E 1 Z(θ) ( = θ n exp (1 θ) xi >γ n x β i n x β i ) n = θ n (2 θ) n P(Y 1 + +Y n >γ). ). βx β 1 i e xβ i dx Here Y i = Weib(β, 2 θ), proided that θ < 2. Since the Weib(β, θ) distribution is subexponential for β<1, we hae E θ Z(θ) 2 n θ n (2 θ) n P(Y n 1 >γ)= θ n (2 θ) n e (2 θ)γβ as γ. (6.1) By a similar computation as in Proposition 4.1, it can be shown that θ m = n γ β n2 γ 2β + O(γ (β+1) ) n γ β as γ. Therefore, the optimal CE and VM parameters for θ are identical asymptotically. The choice of θ m is to be compared with the suggestion in [10] to take θ = b/γ β, where b>0is some arbitrary constant. Since the Weib(β, 1) distribution is subexponential for β<1, it follows that l ne γ β. Therefore, using the expression in (6.1), it can be shown that if we choose θ such that θγ β = c for some constant c, the associated importance sampling estimator is asymptotically optimal. In particular, the choice θ = θ ce θ m gies an asymptotically optimal importance sampling estimator, which also has the minimum asymptotic ariance within the class of importance sampling estimators with importance densities under which X i = Weib(β, θ), i = 1,...,n.

12 A comparison of CE and VM strategies 193 Proposition 6.2. Under the same assumptions as in Proposition 6.1, if we set θ = n/γ β then E θ Z(θ) 2 l 2 en 2 n n n+1 γ nβ, i.e. the optimal VM/CE parameter gies an asymptotically optimal estimator. Proof. Since the Weib(β, 1) distribution is subexponential for β<1, we hae l ne βγ. It follows from (6.1) that E θ Z(θ) 2 l 2 n 1 θ n (2 θ) n e θγβ as γ. The desired result follows by letting θ = n/γ β. 7. Concluding remarks and future research We compared the VM and CE methods through arious concrete examples and we found that in the three examples considered the optimal VM and CE parameters are asymptotically identical. It would be of considerable interest to determine under what conditions this is the case. Since CE estimators are typically easy to obtain, this would proide a practical approach to locate the importance sampling estimator with the minimum ariance within a gien parametric class. Moreoer, it is worthwhile to study further the impact of CE parameter estimation on the quality of the associated importance sampling estimator. Acknowledgements Joshua Chan and Dirk Kroese would like to acknowledge financial support from the Australian Research Council through Discoery Grants DP and DP , respectiely. References [1] Asmussen, S. and Glynn, P. W. (2007). Stochastic Simulation: Algorithms and Analysis. Springer, New York. [2] Asmussen, S. and Kroese, D. P. (2006). Improed algorithms for rare eent simulation with heay tails. Ad. Appl. Prob. 38, [3] Asmussen, S., Binswanger, K. and Højgaard, B. (2000). Rare eents simulation for heay-tailed distributions. Bernoulli 6, [4] Asmussen, S., Kroese, D. P. and Rubinstein, R. Y. (2005). Heay tails, importance sampling and cross-entropy. Stoch. Models 21, [5] Blanchet, J. and Glynn, P. (2008). Efficient rare-eent simulation for the maximum of heay-tailed random walks. Ann. Appl. Prob. 18, [6] Blanchet, J. and Li, C. (2011). Efficient rare eent simulation for heay-tailed compound sums. To appear in ACM Trans. Model. Comput. Simul. [7] Casella, G. and Berger, R. L. (2001). Statistical Inference, 2nd edn. Duxbury Press. [8] Chan, J. C. C. and Kroese, D. P. (2011). Improed cross-entropy method for estimation. Submitted. [9] Chan, J. C. C. and Kroese, D. P. (2009). Rare-eent probability estimation with conditional Monte Carlo. Ann. Operat. Res., 19pp. [10] Juneja, S. and Shahabuddin, P. (2002). Simulating heay tailed processes using delayed hazard rate twisting. ACM Trans. Model. Comput. Simul. 12, [11] Kroese, D. P. (2011). The cross-entropy method. In Wiley Encyclopedia of Operations Research and Management Science. [12] Kroese, D. P. and Rubinstein, R. Y. (2004). The transform likelihood ratio method for rare eent simulation with heay tails. Queueing Systems 46, [13] L Ecuyer, P., Blanchet, J. H., Tuffin, B. and Glynn, P. W. (2010). Asymptotic robustness of estimators in rare-eent simulation. ACM Trans. Model. Comput. Simul. 20, [14] Rubinstein, R. Y. (2009). The Gibbs cloner for combinatorial optimization, counting and sampling. Methodology Comput. Appl. Prob. 11,

13 194 J. C. C. CHAN ET AL. [15] Rubinstein, R. Y. and Glynn, P. W. (2009). How to deal with the curse of dimensionality of likelihood ratios in Monte Carlo simulation. Stoch. Models 25, [16] Rubinstein, R. Y. and Kroese, D. P. (2004). The Cross-Entropy Method. Springer, New York. [17] Rubinstein, R. Y. and Kroese, D. P. (2007). Simulation and the Monte Carlo Method, 2nd edn. John Wiley, New York. JOSHUA C. C. CHAN, Australian National Uniersity Research School of Economics, Australian National Uniersity, Canberra, ACT 0200, Australia. PETER W. GLYNN, Stanford Uniersity Department of Management Science and Engineering, Huang Engineering Center, Stanford Uniersity, 475 Via Ortega, Stanford, CA , USA. DIRK P. KROESE, Uniersity of Queensland Department of Mathematics, Uniersity of Queensland, St Lucia, Brisbane, QLD 4072, Australia. address:

Provided for non-commercial research and educational use only. Not for reproduction, distribution or commercial use.

Provided for non-commercial research and educational use only. Not for reproduction, distribution or commercial use. Proided for non-commercial research and educational use only. Not for reproduction, distribution or commercial use. This chapter was originally published in the book Handbook of Statistics. The copy attached

More information

Semi-Parametric Importance Sampling for Rare-event probability Estimation

Semi-Parametric Importance Sampling for Rare-event probability Estimation Semi-Parametric Importance Sampling for Rare-event probability Estimation Z. I. Botev and P. L Ecuyer IMACS Seminar 2011 Borovets, Bulgaria Semi-Parametric Importance Sampling for Rare-event probability

More information

PLEASE SCROLL DOWN FOR ARTICLE. Full terms and conditions of use:

PLEASE SCROLL DOWN FOR ARTICLE. Full terms and conditions of use: This article was downloaded by: [Stanford University] On: 20 July 2010 Access details: Access Details: [subscription number 917395611] Publisher Taylor & Francis Informa Ltd Registered in England and Wales

More information

Efficient rare-event simulation for sums of dependent random varia

Efficient rare-event simulation for sums of dependent random varia Efficient rare-event simulation for sums of dependent random variables Leonardo Rojas-Nandayapa joint work with José Blanchet February 13, 2012 MCQMC UNSW, Sydney, Australia Contents Introduction 1 Introduction

More information

Asymptotic Normality of an Entropy Estimator with Exponentially Decaying Bias

Asymptotic Normality of an Entropy Estimator with Exponentially Decaying Bias Asymptotic Normality of an Entropy Estimator with Exponentially Decaying Bias Zhiyi Zhang Department of Mathematics and Statistics Uniersity of North Carolina at Charlotte Charlotte, NC 28223 Abstract

More information

Improved Cross-Entropy Method for Estimation

Improved Cross-Entropy Method for Estimation Noname manuscript No. (will be inserted by the editor) Improved Cross-Entropy Method for Estimation Joshua C.C. Chan Dirk P. Kroese Received: date / Accepted: date Abstract The cross-entropy (CE) method

More information

Balanced Partitions of Vector Sequences

Balanced Partitions of Vector Sequences Balanced Partitions of Vector Sequences Imre Bárány Benjamin Doerr December 20, 2004 Abstract Let d,r N and be any norm on R d. Let B denote the unit ball with respect to this norm. We show that any sequence

More information

CHARACTERIZATIONS OF THE PARETO DISTRIBUTION BY THE INDEPENDENCE OF RECORD VALUES. Se-Kyung Chang* 1. Introduction

CHARACTERIZATIONS OF THE PARETO DISTRIBUTION BY THE INDEPENDENCE OF RECORD VALUES. Se-Kyung Chang* 1. Introduction JOURNAL OF THE CHUNGCHEONG MATHEMATICAL SOCIETY Volume 20, No., March 2007 CHARACTERIZATIONS OF THE PARETO DISTRIBUTION BY THE INDEPENDENCE OF RECORD VALUES Se-Kyung Chang* Abstract. In this paper, we

More information

EFFICIENT SIMULATION FOR LARGE DEVIATION PROBABILITIES OF SUMS OF HEAVY-TAILED INCREMENTS

EFFICIENT SIMULATION FOR LARGE DEVIATION PROBABILITIES OF SUMS OF HEAVY-TAILED INCREMENTS Proceedings of the 2006 Winter Simulation Conference L. F. Perrone, F. P. Wieland, J. Liu, B. G. Lawson, D. M. Nicol, and R. M. Fujimoto, eds. EFFICIENT SIMULATION FOR LARGE DEVIATION PROBABILITIES OF

More information

Two-sided bounds for L p -norms of combinations of products of independent random variables

Two-sided bounds for L p -norms of combinations of products of independent random variables Two-sided bounds for L p -norms of combinations of products of independent random ariables Ewa Damek (based on the joint work with Rafał Latała, Piotr Nayar and Tomasz Tkocz) Wrocław Uniersity, Uniersity

More information

Asymptotics and Simulation of Heavy-Tailed Processes

Asymptotics and Simulation of Heavy-Tailed Processes Asymptotics and Simulation of Heavy-Tailed Processes Department of Mathematics Stockholm, Sweden Workshop on Heavy-tailed Distributions and Extreme Value Theory ISI Kolkata January 14-17, 2013 Outline

More information

Online Companion to Pricing Services Subject to Congestion: Charge Per-Use Fees or Sell Subscriptions?

Online Companion to Pricing Services Subject to Congestion: Charge Per-Use Fees or Sell Subscriptions? Online Companion to Pricing Serices Subject to Congestion: Charge Per-Use Fees or Sell Subscriptions? Gérard P. Cachon Pnina Feldman Operations and Information Management, The Wharton School, Uniersity

More information

On general error distributions

On general error distributions ProbStat Forum, Volume 6, October 3, Pages 89 95 ISSN 974-335 ProbStat Forum is an e-journal. For details please isit www.probstat.org.in On general error distributions R. Vasudea, J. Vasantha Kumari Department

More information

A Tutorial on the Cross-Entropy Method

A Tutorial on the Cross-Entropy Method A Tutorial on the Cross-Entropy Method Pieter-Tjerk de Boer Electrical Engineering, Mathematics and Computer Science department University of Twente ptdeboer@cs.utwente.nl Dirk P. Kroese Department of

More information

Notes on Linear Minimum Mean Square Error Estimators

Notes on Linear Minimum Mean Square Error Estimators Notes on Linear Minimum Mean Square Error Estimators Ça gatay Candan January, 0 Abstract Some connections between linear minimum mean square error estimators, maximum output SNR filters and the least square

More information

Introduction to Rare Event Simulation

Introduction to Rare Event Simulation Introduction to Rare Event Simulation Brown University: Summer School on Rare Event Simulation Jose Blanchet Columbia University. Department of Statistics, Department of IEOR. Blanchet (Columbia) 1 / 31

More information

Computing Laboratory A GAME-BASED ABSTRACTION-REFINEMENT FRAMEWORK FOR MARKOV DECISION PROCESSES

Computing Laboratory A GAME-BASED ABSTRACTION-REFINEMENT FRAMEWORK FOR MARKOV DECISION PROCESSES Computing Laboratory A GAME-BASED ABSTRACTION-REFINEMENT FRAMEWORK FOR MARKOV DECISION PROCESSES Mark Kattenbelt Marta Kwiatkowska Gethin Norman Daid Parker CL-RR-08-06 Oxford Uniersity Computing Laboratory

More information

Tail Behavior and Limit Distribution of Maximum of Logarithmic General Error Distribution

Tail Behavior and Limit Distribution of Maximum of Logarithmic General Error Distribution Tail Behaior and Limit Distribution of Maximum of Logarithmic General Error Distribution Xin Liao, Zuoxiang Peng & Saralees Nadarajah First ersion: 8 December 0 Research Report No. 3, 0, Probability and

More information

Final Exam (Solution) Economics 501b Microeconomic Theory

Final Exam (Solution) Economics 501b Microeconomic Theory Dirk Bergemann and Johannes Hoerner Department of Economics Yale Uniersity Final Exam (Solution) Economics 5b Microeconomic Theory May This is a closed-book exam. The exam lasts for 8 minutes. Please write

More information

On the Ratio of Inverted Gamma Variates

On the Ratio of Inverted Gamma Variates AUSTRIAN JOURNAL OF STATISTICS Volume 36 2007), Number 2, 153 159 On the Ratio of Inerted Gamma Variates M. Masoom Ali 1, Manisha Pal 2, and Jungsoo Woo 3 Department of Mathematical Sciences, Ball State

More information

Hölder-type inequalities and their applications to concentration and correlation bounds

Hölder-type inequalities and their applications to concentration and correlation bounds Hölder-type inequalities and their applications to concentration and correlation bounds Christos Pelekis * Jan Ramon Yuyi Wang Noember 29, 2016 Abstract Let Y, V, be real-alued random ariables haing a

More information

Target Trajectory Estimation within a Sensor Network

Target Trajectory Estimation within a Sensor Network Target Trajectory Estimation within a Sensor Network Adrien Ickowicz IRISA/CNRS, 354, Rennes, J-Pierre Le Cadre, IRISA/CNRS,354, Rennes,France Abstract This paper deals with the estimation of the trajectory

More information

A014 Uncertainty Analysis of Velocity to Resistivity Transforms for Near Field Exploration

A014 Uncertainty Analysis of Velocity to Resistivity Transforms for Near Field Exploration A014 Uncertainty Analysis of Velocity to Resistiity Transforms for Near Field Exploration D. Werthmüller* (Uniersity of Edinburgh), A.M. Ziolkowski (Uniersity of Edinburgh) & D.A. Wright (Uniersity of

More information

Modeling Highway Traffic Volumes

Modeling Highway Traffic Volumes Modeling Highway Traffic Volumes Tomáš Šingliar1 and Miloš Hauskrecht 1 Computer Science Dept, Uniersity of Pittsburgh, Pittsburgh, PA 15260 {tomas, milos}@cs.pitt.edu Abstract. Most traffic management

More information

ABSTRACT 1 INTRODUCTION. Jose Blanchet. Henry Lam. Boston University 111 Cummington Street Boston, MA 02215, USA

ABSTRACT 1 INTRODUCTION. Jose Blanchet. Henry Lam. Boston University 111 Cummington Street Boston, MA 02215, USA Proceedings of the 2011 Winter Simulation Conference S. Jain, R. R. Creasey, J. Himmelspach, K. P. White, and M. Fu, eds. RARE EVENT SIMULATION TECHNIQUES Jose Blanchet Columbia University 500 W 120th

More information

Proceedings of the 2016 Winter Simulation Conference T. M. K. Roeder, P. I. Frazier, R. Szechtman, E. Zhou, T. Huschka, and S. E. Chick, eds.

Proceedings of the 2016 Winter Simulation Conference T. M. K. Roeder, P. I. Frazier, R. Szechtman, E. Zhou, T. Huschka, and S. E. Chick, eds. Proceedings of the 2016 Winter Simulation Conference T. M. K. Roeder, P. I. Frazier, R. Szechtman, E. Zhou, T. Huschka, and S. E. Chick, eds. AN M-ESTIMATOR FOR RARE-EVENT PROBABILITY ESTIMATION Zdravko

More information

Patterns of Non-Simple Continued Fractions

Patterns of Non-Simple Continued Fractions Patterns of Non-Simple Continued Fractions Jesse Schmieg A final report written for the Uniersity of Minnesota Undergraduate Research Opportunities Program Adisor: Professor John Greene March 01 Contents

More information

NEW FRONTIERS IN APPLIED PROBABILITY

NEW FRONTIERS IN APPLIED PROBABILITY J. Appl. Prob. Spec. Vol. 48A, 209 213 (2011) Applied Probability Trust 2011 NEW FRONTIERS IN APPLIED PROBABILITY A Festschrift for SØREN ASMUSSEN Edited by P. GLYNN, T. MIKOSCH and T. ROLSKI Part 4. Simulation

More information

STA205 Probability: Week 8 R. Wolpert

STA205 Probability: Week 8 R. Wolpert INFINITE COIN-TOSS AND THE LAWS OF LARGE NUMBERS The traditional interpretation of the probability of an event E is its asymptotic frequency: the limit as n of the fraction of n repeated, similar, and

More information

be ye transformed by the renewing of your mind Romans 12:2

be ye transformed by the renewing of your mind Romans 12:2 Lecture 12: Coordinate Free Formulas for Affine and rojectie Transformations be ye transformed by the reing of your mind Romans 12:2 1. Transformations for 3-Dimensional Computer Graphics Computer Graphics

More information

An Explicit Lower Bound of 5n o(n) for Boolean Circuits

An Explicit Lower Bound of 5n o(n) for Boolean Circuits An Eplicit Lower Bound of 5n o(n) for Boolean Circuits Kazuo Iwama, Oded Lachish, Hiroki Morizumi, and Ran Raz Graduate School of Informatics, Kyoto Uniersity, Kyoto, JAPAN {iwama, morizumi}@kuis.kyoto-u.ac.jp

More information

BAYESIAN PREMIUM AND ASYMPTOTIC APPROXIMATION FOR A CLASS OF LOSS FUNCTIONS

BAYESIAN PREMIUM AND ASYMPTOTIC APPROXIMATION FOR A CLASS OF LOSS FUNCTIONS TH USHNG HOUS ROCDNGS OF TH ROMANAN ACADMY Series A OF TH ROMANAN ACADMY Volume 6 Number 3/005 pp 000-000 AYSAN RMUM AND ASYMTOTC AROMATON FOR A CASS OF OSS FUNCTONS Roxana CUMARA Department of Mathematics

More information

Estimating Tail Probabilities of Heavy Tailed Distributions with Asymptotically Zero Relative Error

Estimating Tail Probabilities of Heavy Tailed Distributions with Asymptotically Zero Relative Error Estimating Tail Probabilities of Heavy Tailed Distributions with Asymptotically Zero Relative Error S. Juneja Tata Institute of Fundamental Research, Mumbai juneja@tifr.res.in February 28, 2007 Abstract

More information

Towards Universal Cover Decoding

Towards Universal Cover Decoding International Symposium on Information Theory and its Applications, ISITA2008 Auckland, New Zealand, 7-10, December, 2008 Towards Uniersal Coer Decoding Nathan Axig, Deanna Dreher, Katherine Morrison,

More information

Residual migration in VTI media using anisotropy continuation

Residual migration in VTI media using anisotropy continuation Stanford Exploration Project, Report SERGEY, Noember 9, 2000, pages 671?? Residual migration in VTI media using anisotropy continuation Tariq Alkhalifah Sergey Fomel 1 ABSTRACT We introduce anisotropy

More information

Trajectory Estimation for Tactical Ballistic Missiles in Terminal Phase Using On-line Input Estimator

Trajectory Estimation for Tactical Ballistic Missiles in Terminal Phase Using On-line Input Estimator Proc. Natl. Sci. Counc. ROC(A) Vol. 23, No. 5, 1999. pp. 644-653 Trajectory Estimation for Tactical Ballistic Missiles in Terminal Phase Using On-line Input Estimator SOU-CHEN LEE, YU-CHAO HUANG, AND CHENG-YU

More information

arxiv: v1 [math.pr] 25 Jun 2015

arxiv: v1 [math.pr] 25 Jun 2015 Random walks colliding before getting trapped Louigi Addario-Berry Roberto I. Olieira Yual Peres Perla Sousi ari:156.78451 math.pr 25 Jun 215 Abstract Let P be the transition matrix of a finite, irreducible

More information

Monte Carlo Methods for Computation and Optimization (048715)

Monte Carlo Methods for Computation and Optimization (048715) Technion Department of Electrical Engineering Monte Carlo Methods for Computation and Optimization (048715) Lecture Notes Prof. Nahum Shimkin Spring 2015 i PREFACE These lecture notes are intended for

More information

Reversal in time order of interactive events: Collision of inclined rods

Reversal in time order of interactive events: Collision of inclined rods Reersal in time order of interactie eents: Collision of inclined rods Published in The European Journal of Physics Eur. J. Phys. 27 819-824 http://www.iop.org/ej/abstract/0143-0807/27/4/013 Chandru Iyer

More information

Importance Sampling for Rare Events

Importance Sampling for Rare Events Importance Sampling for Rare Events Soren Asmussen Department of Mathematical Sciences Aarhus University Ny Munkegade DK-8000 Aarhus C, Denmark asmus@imf.au.dk Paul Dupuis Division of Applied Mathematics

More information

Distributions of Functions of Random Variables. 5.1 Functions of One Random Variable

Distributions of Functions of Random Variables. 5.1 Functions of One Random Variable Distributions of Functions of Random Variables 5.1 Functions of One Random Variable 5.2 Transformations of Two Random Variables 5.3 Several Random Variables 5.4 The Moment-Generating Function Technique

More information

Astrometric Errors Correlated Strongly Across Multiple SIRTF Images

Astrometric Errors Correlated Strongly Across Multiple SIRTF Images Astrometric Errors Correlated Strongly Across Multiple SIRTF Images John Fowler 28 March 23 The possibility exists that after pointing transfer has been performed for each BCD (i.e. a calibrated image

More information

ICES REPORT Model Misspecification and Plausibility

ICES REPORT Model Misspecification and Plausibility ICES REPORT 14-21 August 2014 Model Misspecification and Plausibility by Kathryn Farrell and J. Tinsley Odena The Institute for Computational Engineering and Sciences The University of Texas at Austin

More information

On the inefficiency of state-independent importance sampling in the presence of heavy tails

On the inefficiency of state-independent importance sampling in the presence of heavy tails Operations Research Letters 35 (2007) 251 260 Operations Research Letters www.elsevier.com/locate/orl On the inefficiency of state-independent importance sampling in the presence of heavy tails Achal Bassamboo

More information

APPM/MATH 4/5520 Solutions to Exam I Review Problems. f X 1,X 2. 2e x 1 x 2. = x 2

APPM/MATH 4/5520 Solutions to Exam I Review Problems. f X 1,X 2. 2e x 1 x 2. = x 2 APPM/MATH 4/5520 Solutions to Exam I Review Problems. (a) f X (x ) f X,X 2 (x,x 2 )dx 2 x 2e x x 2 dx 2 2e 2x x was below x 2, but when marginalizing out x 2, we ran it over all values from 0 to and so

More information

Hölder norms and the support theorem for diffusions

Hölder norms and the support theorem for diffusions Hölder norms and the support theorem for diffusions Gérard Ben Arous, Mihai Gradinaru Uniersité de Paris-Sud, Mathématiques, Bât. 45, 9145 Orsay Cedex, France and Michel Ledoux Laboratoire de Statistique

More information

Three examples of a Practical Exact Markov Chain Sampling

Three examples of a Practical Exact Markov Chain Sampling Three examples of a Practical Exact Markov Chain Sampling Zdravko Botev November 2007 Abstract We present three examples of exact sampling from complex multidimensional densities using Markov Chain theory

More information

Probabilistic Engineering Design

Probabilistic Engineering Design Probabilistic Engineering Design Chapter One Introduction 1 Introduction This chapter discusses the basics of probabilistic engineering design. Its tutorial-style is designed to allow the reader to quickly

More information

Estimation of Efficiency with the Stochastic Frontier Cost. Function and Heteroscedasticity: A Monte Carlo Study

Estimation of Efficiency with the Stochastic Frontier Cost. Function and Heteroscedasticity: A Monte Carlo Study Estimation of Efficiency ith the Stochastic Frontier Cost Function and Heteroscedasticity: A Monte Carlo Study By Taeyoon Kim Graduate Student Oklahoma State Uniersity Department of Agricultural Economics

More information

Part IA Probability. Definitions. Based on lectures by R. Weber Notes taken by Dexter Chua. Lent 2015

Part IA Probability. Definitions. Based on lectures by R. Weber Notes taken by Dexter Chua. Lent 2015 Part IA Probability Definitions Based on lectures by R. Weber Notes taken by Dexter Chua Lent 2015 These notes are not endorsed by the lecturers, and I have modified them (often significantly) after lectures.

More information

Lecture 6: Graphical Models

Lecture 6: Graphical Models Lecture 6: Graphical Models Kai-Wei Chang CS @ Uniersity of Virginia kw@kwchang.net Some slides are adapted from Viek Skirmar s course on Structured Prediction 1 So far We discussed sequence labeling tasks:

More information

Brief Review on Estimation Theory

Brief Review on Estimation Theory Brief Review on Estimation Theory K. Abed-Meraim ENST PARIS, Signal and Image Processing Dept. abed@tsi.enst.fr This presentation is essentially based on the course BASTA by E. Moulines Brief review on

More information

Lecture 1. 1 Overview. 2 Maximum Flow. COMPSCI 532: Design and Analysis of Algorithms August 26, 2015

Lecture 1. 1 Overview. 2 Maximum Flow. COMPSCI 532: Design and Analysis of Algorithms August 26, 2015 COMPSCI 532: Design and Analysis of Algorithms August 26, 205 Lecture Lecturer: Debmalya Panigrahi Scribe: Allen Xiao Oeriew In this lecture, we will define the maximum flow problem and discuss two algorithms

More information

Sparse Functional Regression

Sparse Functional Regression Sparse Functional Regression Junier B. Olia, Barnabás Póczos, Aarti Singh, Jeff Schneider, Timothy Verstynen Machine Learning Department Robotics Institute Psychology Department Carnegie Mellon Uniersity

More information

Estimating the Number of s-t Paths in a Graph

Estimating the Number of s-t Paths in a Graph Journal of Graph Algorithms and Applications http://jgaa.info/ vol. 11, no. 1, pp. 195 214 (2007) Estimating the Number of s-t Paths in a Graph Ben Roberts Department of Pure Mathematics and Mathematical

More information

Noise constrained least mean absolute third algorithm

Noise constrained least mean absolute third algorithm Noise constrained least mean absolute third algorithm Sihai GUAN 1 Zhi LI 1 Abstract: he learning speed of an adaptie algorithm can be improed by properly constraining the cost function of the adaptie

More information

arxiv: v1 [physics.comp-ph] 17 Jan 2014

arxiv: v1 [physics.comp-ph] 17 Jan 2014 An efficient method for soling a correlated multi-item inentory system Chang-Yong Lee and Dongu Lee The Department of Industrial & Systems Engineering, Kongu National Uniersity, Kongu 34-70 South Korea

More information

Selected Exercises on Expectations and Some Probability Inequalities

Selected Exercises on Expectations and Some Probability Inequalities Selected Exercises on Expectations and Some Probability Inequalities # If E(X 2 ) = and E X a > 0, then P( X λa) ( λ) 2 a 2 for 0 < λ

More information

EFFICIENT TAIL ESTIMATION FOR SUMS OF CORRELATED LOGNORMALS. Leonardo Rojas-Nandayapa Department of Mathematical Sciences

EFFICIENT TAIL ESTIMATION FOR SUMS OF CORRELATED LOGNORMALS. Leonardo Rojas-Nandayapa Department of Mathematical Sciences Proceedings of the 2008 Winter Simulation Conference S. J. Mason, R. R. Hill, L. Mönch, O. Rose, T. Jefferson, J. W. Fowler eds. EFFICIENT TAIL ESTIMATION FOR SUMS OF CORRELATED LOGNORMALS Jose Blanchet

More information

Polynomial-Solvability of N P-class Problems

Polynomial-Solvability of N P-class Problems Polynomial-Solability of N P-class Problems Anatoly Panyuko paniukoa@susu.ru arxi:1409.03755 [cs.cc] 24 Aug 2018 Abstract LetHamiltoniancomplement ofthegraphg = (V(G), E(G))betheminimal cardinality set

More information

Optimal Joint Detection and Estimation in Linear Models

Optimal Joint Detection and Estimation in Linear Models Optimal Joint Detection and Estimation in Linear Models Jianshu Chen, Yue Zhao, Andrea Goldsmith, and H. Vincent Poor Abstract The problem of optimal joint detection and estimation in linear models with

More information

OBSERVATIONS ON BAGGING

OBSERVATIONS ON BAGGING OBSERVATIONS ON BAGGING Andreas Buja and Werner Stuetzle Uniersity of Pennsylania and Uniersity of Washington Abstract: Bagging is a deice intended for reducing the prediction error of learning algorithms.

More information

Insights into Cross-validation

Insights into Cross-validation Noname manuscript No. (will be inserted by the editor) Insights into Cross-alidation Amit Dhurandhar Alin Dobra Receied: date / Accepted: date Abstract Cross-alidation is one of the most widely used techniques,

More information

Distribution of the Ratio of Normal and Rice Random Variables

Distribution of the Ratio of Normal and Rice Random Variables Journal of Modern Applied Statistical Methods Volume 1 Issue Article 7 11-1-013 Distribution of the Ratio of Normal and Rice Random Variables Nayereh B. Khoolenjani Uniersity of Isfahan, Isfahan, Iran,

More information

Position in the xy plane y position x position

Position in the xy plane y position x position Robust Control of an Underactuated Surface Vessel with Thruster Dynamics K. Y. Pettersen and O. Egeland Department of Engineering Cybernetics Norwegian Uniersity of Science and Technology N- Trondheim,

More information

Representational Power of Restricted Boltzmann Machines and Deep Belief Networks

Representational Power of Restricted Boltzmann Machines and Deep Belief Networks 1 Representational Power of Restricted Boltzmann Machines and Deep Belief Networks Nicolas Le Roux and Yoshua Bengio Dept. IRO, Uniersité de Montréal C.P. 6128, Montreal, Qc, H3C 3J7, Canada {lerouxni,bengioy}@iro.umontreal.ca

More information

A spectral Turán theorem

A spectral Turán theorem A spectral Turán theorem Fan Chung Abstract If all nonzero eigenalues of the (normalized) Laplacian of a graph G are close to, then G is t-turán in the sense that any subgraph of G containing no K t+ contains

More information

LECTURE 1: INTRODUCTION, VECTORS, AND DOT PRODUCTS

LECTURE 1: INTRODUCTION, VECTORS, AND DOT PRODUCTS LECTURE 1: INTRODUCTION, VECTORS, AND DOT PRODUCTS MA1111: LINEAR ALGEBRA I, MICHAELMAS 2016 1. Introduction Firstly, I want to point out that I am here to help, so if you hae a question about a lecture

More information

NOTES ON THE REGULAR E-OPTIMAL SPRING BALANCE WEIGHING DESIGNS WITH CORRE- LATED ERRORS

NOTES ON THE REGULAR E-OPTIMAL SPRING BALANCE WEIGHING DESIGNS WITH CORRE- LATED ERRORS REVSTAT Statistical Journal Volume 3, Number 2, June 205, 9 29 NOTES ON THE REGULAR E-OPTIMAL SPRING BALANCE WEIGHING DESIGNS WITH CORRE- LATED ERRORS Authors: Bronis law Ceranka Department of Mathematical

More information

4-vectors. Chapter Definition of 4-vectors

4-vectors. Chapter Definition of 4-vectors Chapter 12 4-ectors Copyright 2004 by Daid Morin, morin@physics.harard.edu We now come to a ery powerful concept in relatiity, namely that of 4-ectors. Although it is possible to derie eerything in special

More information

THE DESCRIPTIVE COMPLEXITY OF SERIES REARRANGEMENTS

THE DESCRIPTIVE COMPLEXITY OF SERIES REARRANGEMENTS THE DESCRIPTIVE COMPLEXITY OF SERIES REARRANGEMENTS MICHAEL P. COHEN Abstract. We consider the descriptie complexity of some subsets of the infinite permutation group S which arise naturally from the classical

More information

Scalar multiplication and algebraic direction of a vector

Scalar multiplication and algebraic direction of a vector Roberto s Notes on Linear Algebra Chapter 1: Geometric ectors Section 5 Scalar multiplication and algebraic direction of a ector What you need to know already: of a geometric ectors. Length and geometric

More information

Statistics 3858 : Maximum Likelihood Estimators

Statistics 3858 : Maximum Likelihood Estimators Statistics 3858 : Maximum Likelihood Estimators 1 Method of Maximum Likelihood In this method we construct the so called likelihood function, that is L(θ) = L(θ; X 1, X 2,..., X n ) = f n (X 1, X 2,...,

More information

Fluid Heuristics, Lyapunov Bounds and E cient Importance Sampling for a Heavy-tailed G/G/1 Queue

Fluid Heuristics, Lyapunov Bounds and E cient Importance Sampling for a Heavy-tailed G/G/1 Queue Fluid Heuristics, Lyapunov Bounds and E cient Importance Sampling for a Heavy-tailed G/G/1 Queue J. Blanchet, P. Glynn, and J. C. Liu. September, 2007 Abstract We develop a strongly e cient rare-event

More information

E cient Simulation and Conditional Functional Limit Theorems for Ruinous Heavy-tailed Random Walks

E cient Simulation and Conditional Functional Limit Theorems for Ruinous Heavy-tailed Random Walks E cient Simulation and Conditional Functional Limit Theorems for Ruinous Heavy-tailed Random Walks Jose Blanchet and Jingchen Liu y June 14, 21 Abstract The contribution of this paper is to introduce change

More information

A matrix Method for Interval Hermite Curve Segmentation O. Ismail, Senior Member, IEEE

A matrix Method for Interval Hermite Curve Segmentation O. Ismail, Senior Member, IEEE International Journal of Video&Image Processing Network Security IJVIPNS-IJENS Vol:15 No:03 7 A matrix Method for Interal Hermite Cure Segmentation O. Ismail, Senior Member, IEEE Abstract Since the use

More information

Two new Fuzzy Models Using Fuzzy Cognitive Maps Model and Kosko Hamming Distance

Two new Fuzzy Models Using Fuzzy Cognitive Maps Model and Kosko Hamming Distance Ultra Scientist Vol (1)B, 43- (01) Two new Fuzzy Models Using Fuzzy ognitie Maps Model and Kosko Hamming Distance K THULUKKANAM 1 and RVASUKI (Acceptance Date th April, 01) Abstract In this paper for the

More information

A New Extended Uniform Distribution

A New Extended Uniform Distribution International Journal of Statistical Distriutions and Applications 206; 2(3): 3-4 http://wwwsciencepulishinggroupcom/j/ijsda doi: 0648/jijsd20602032 ISS: 2472-3487 (Print); ISS: 2472-309 (Online) A ew

More information

An Alternative Characterization of Hidden Regular Variation in Joint Tail Modeling

An Alternative Characterization of Hidden Regular Variation in Joint Tail Modeling An Alternatie Characterization of Hidden Regular Variation in Joint Tail Modeling Grant B. Weller Daniel S. Cooley Department of Statistics, Colorado State Uniersity, Fort Collins, CO USA May 22, 212 Abstract

More information

Lecture 1: August 28

Lecture 1: August 28 36-705: Intermediate Statistics Fall 2017 Lecturer: Siva Balakrishnan Lecture 1: August 28 Our broad goal for the first few lectures is to try to understand the behaviour of sums of independent random

More information

arxiv: v1 [stat.ml] 15 Feb 2018

arxiv: v1 [stat.ml] 15 Feb 2018 1 : A New Algorithm for Streaming PCA arxi:1802.054471 [stat.ml] 15 Feb 2018 Puyudi Yang, Cho-Jui Hsieh, Jane-Ling Wang Uniersity of California, Dais pydyang, chohsieh, janelwang@ucdais.edu Abstract In

More information

A nonparametric confidence interval for At-Risk-of-Poverty-Rate: an example of application

A nonparametric confidence interval for At-Risk-of-Poverty-Rate: an example of application A nonparametric confidence interal for At-Ris-of-Poerty-Rate: an example of application Wojciech Zielińsi Department of Econometrics and Statistics Warsaw Uniersity of Life Sciences Nowoursynowsa 159,

More information

(x,y) 4. Calculus I: Differentiation

(x,y) 4. Calculus I: Differentiation 4. Calculus I: Differentiation 4. The eriatie of a function Suppose we are gien a cure with a point lying on it. If the cure is smooth at then we can fin a unique tangent to the cure at : If the tangent

More information

Statistics - Lecture One. Outline. Charlotte Wickham 1. Basic ideas about estimation

Statistics - Lecture One. Outline. Charlotte Wickham  1. Basic ideas about estimation Statistics - Lecture One Charlotte Wickham wickham@stat.berkeley.edu http://www.stat.berkeley.edu/~wickham/ Outline 1. Basic ideas about estimation 2. Method of Moments 3. Maximum Likelihood 4. Confidence

More information

Collective Risk Models with Dependence Uncertainty

Collective Risk Models with Dependence Uncertainty Collectie Risk Models with Dependence Uncertainty Haiyan Liu and Ruodu Wang ebruary 27, 2017 Abstract We bring the recently deeloped framework of dependence uncertainty into collectie risk models, one

More information

Chapter 3 sections. SKIP: 3.10 Markov Chains. SKIP: pages Chapter 3 - continued

Chapter 3 sections. SKIP: 3.10 Markov Chains. SKIP: pages Chapter 3 - continued Chapter 3 sections Chapter 3 - continued 3.1 Random Variables and Discrete Distributions 3.2 Continuous Distributions 3.3 The Cumulative Distribution Function 3.4 Bivariate Distributions 3.5 Marginal Distributions

More information

1: PROBABILITY REVIEW

1: PROBABILITY REVIEW 1: PROBABILITY REVIEW Marek Rutkowski School of Mathematics and Statistics University of Sydney Semester 2, 2016 M. Rutkowski (USydney) Slides 1: Probability Review 1 / 56 Outline We will review the following

More information

Estimation of Dynamic Regression Models

Estimation of Dynamic Regression Models University of Pavia 2007 Estimation of Dynamic Regression Models Eduardo Rossi University of Pavia Factorization of the density DGP: D t (x t χ t 1, d t ; Ψ) x t represent all the variables in the economy.

More information

Isoperimetric problems

Isoperimetric problems CHAPTER 2 Isoperimetric problems 2.1. History One of the earliest problems in geometry was the isoperimetric problem, which was considered by the ancient Greeks. The problem is to find, among all closed

More information

Sums of dependent lognormal random variables: asymptotics and simulation

Sums of dependent lognormal random variables: asymptotics and simulation Sums of dependent lognormal random variables: asymptotics and simulation Søren Asmussen Leonardo Rojas-Nandayapa Department of Theoretical Statistics, Department of Mathematical Sciences Aarhus University,

More information

Probability. Lecture Notes. Adolfo J. Rumbos

Probability. Lecture Notes. Adolfo J. Rumbos Probability Lecture Notes Adolfo J. Rumbos October 20, 204 2 Contents Introduction 5. An example from statistical inference................ 5 2 Probability Spaces 9 2. Sample Spaces and σ fields.....................

More information

n! (k 1)!(n k)! = F (X) U(0, 1). (x, y) = n(n 1) ( F (y) F (x) ) n 2

n! (k 1)!(n k)! = F (X) U(0, 1). (x, y) = n(n 1) ( F (y) F (x) ) n 2 Order statistics Ex. 4. (*. Let independent variables X,..., X n have U(0, distribution. Show that for every x (0,, we have P ( X ( < x and P ( X (n > x as n. Ex. 4.2 (**. By using induction or otherwise,

More information

Section 6: PRISMATIC BEAMS. Beam Theory

Section 6: PRISMATIC BEAMS. Beam Theory Beam Theory There are two types of beam theory aailable to craft beam element formulations from. They are Bernoulli-Euler beam theory Timoshenko beam theory One learns the details of Bernoulli-Euler beam

More information

OPTIMAL RESOLVABLE DESIGNS WITH MINIMUM PV ABERRATION

OPTIMAL RESOLVABLE DESIGNS WITH MINIMUM PV ABERRATION Statistica Sinica 0 (010), 715-73 OPTIMAL RESOLVABLE DESIGNS WITH MINIMUM PV ABERRATION J. P. Morgan Virginia Tech Abstract: Amongst resolable incomplete block designs, affine resolable designs are optimal

More information

4. A Physical Model for an Electron with Angular Momentum. An Electron in a Bohr Orbit. The Quantum Magnet Resulting from Orbital Motion.

4. A Physical Model for an Electron with Angular Momentum. An Electron in a Bohr Orbit. The Quantum Magnet Resulting from Orbital Motion. 4. A Physical Model for an Electron with Angular Momentum. An Electron in a Bohr Orbit. The Quantum Magnet Resulting from Orbital Motion. We now hae deeloped a ector model that allows the ready isualization

More information

Solving Markov Decision Processes by Simulation: a Model-Based Approach

Solving Markov Decision Processes by Simulation: a Model-Based Approach Universität Ulm Fakultät für Mathematik und Wirtschaftswissenschaften Solving Markov Decision Processes by Simulation: a Model-Based Approach Diplomarbeit in Wirtschaftsmathematik vorgelegt von Kristina

More information

Relativistic Energy Derivation

Relativistic Energy Derivation Relatiistic Energy Deriation Flamenco Chuck Keyser //4 ass Deriation (The ass Creation Equation ρ, ρ as the initial condition, C the mass creation rate, T the time, ρ a density. Let V be a second mass

More information

is a Borel subset of S Θ for each c R (Bertsekas and Shreve, 1978, Proposition 7.36) This always holds in practical applications.

is a Borel subset of S Θ for each c R (Bertsekas and Shreve, 1978, Proposition 7.36) This always holds in practical applications. Stat 811 Lecture Notes The Wald Consistency Theorem Charles J. Geyer April 9, 01 1 Analyticity Assumptions Let { f θ : θ Θ } be a family of subprobability densities 1 with respect to a measure µ on a measurable

More information

Some Statistical Inferences For Two Frequency Distributions Arising In Bioinformatics

Some Statistical Inferences For Two Frequency Distributions Arising In Bioinformatics Applied Mathematics E-Notes, 14(2014), 151-160 c ISSN 1607-2510 Available free at mirror sites of http://www.math.nthu.edu.tw/ amen/ Some Statistical Inferences For Two Frequency Distributions Arising

More information

Quasi-Monte Carlo Techniques and Rare Event Sampling

Quasi-Monte Carlo Techniques and Rare Event Sampling www.oeaw.ac.at Quasi-Monte Carlo Techniques and Rare Event Sampling J. Hartinger, D. Kortschak RICAM-Report 2005-26 www.ricam.oeaw.ac.at Quasi-Monte Carlo Techniques and Rare Event Sampling Jürgen Hartinger

More information