Nonparametric estimation of a maximum of quantiles

Size: px
Start display at page:

Download "Nonparametric estimation of a maximum of quantiles"

Transcription

1 Noparametric estimatio of a maximum of quatiles Georg C. Ess 1, Beedict Götz 1, Michael Kohler 2, Adam Krzyżak 3, ad Rolad Platz 4. 1 Fachgebiet Systemzuverlässigkeit ud Maschieakustik SzM, Techische Uiversität Darmstadt, Magdaleestr. 4, Darmstadt, Germay, ess, goetz@szm.tu-darmstadt.de 2 Fachbereich Mathematik, Techische Uiversität Darmstadt, Schlossgartestr. 7, Darmstadt, Germay, kohler@mathematik.tu-darmstadt.de 3 Departmet of Computer Sciece ad Software Egieerig, Cocordia Uiversity, 1455 De Maisoeuve Blvd. West, Motreal, Quebec, Caada H3G 1M8, krzyzak@cs.cocordia.ca 4 Frauhofer Istitute for Structural Durability ad System Reliability LBF, Bartigstr. 47, Darmstadt, Germay, rolad.platz@lbf.frauhofer.de May 16, 2014 Abstract A simulatio model of a complex system is cosidered, for which the outcome is described by m(p, X), where p is a parameter of the system, X is a radom iput of the system ad m is a real-valued fuctio. The maximum (with respect to p) of the quatiles of m(p, X) is estimated. The quatiles of m(p, X) of a give level are estimated for various values of p by a order statistic of values m(p i, X i ) where X, X 1, X 2,... are idepedet ad idetically distributed ad where p i is close to p, ad the maximal quatile is estimated by the maximum of these quatile estimates. Uder assumptios o the smoothess of the fuctio describig the depedecy of the values of the quatiles o the parameter p the rate of covergece of this estimate is aalyzed. The fiite sample size behavior of the estimate is illustrated by simulated data ad by applyig it i a simulatio model of a real mechaical system. AMS classificatio: Primary 62G05; secodary 62G30. Key words ad phrases: Coditioal quatile estimatio, maximal quatile, rate of covergece, supremum orm error. Correspodig author. Tel: ext. 3007, Fax: Ruig title: Estimatio of the maximal quatile 1

2 1 Itroductio We cosider a simulatio model of a complex system described by Y = m(p, X), where p is a parameter of the system from some compact subset P of R l, X is a R d -valued radom variable with kow distributio ad m : P R d R is a give (measurable) fuctio. Let G p (y) = Pm(p, X) y be the cumulative distributio fuctio (cdf) of Y for a give value of p. For α (0, 1) let q p,α = miy R : G p (y) α be the α-quatile of m(p, X). Usig at most fuctio evaluatios m(p i, x i ) of m for arbitrarily chose values (p 1, x 1 ),..., (p, x ) P R d we are iterested i estimatig the maximal α-quatile sup q p,α. This problem is motivated by research carried out at the Collaborative Research Cetre SFB 805 at the Techische Uiversität Darmstadt which focuses o ucertaity i load-carryig systems such as truss structures. The truss structure uder cosideratio is a typical system for load distributio i mechaical egieerig (cf., Figure 1). It is made of te odes ad 24 equal beams M 6 7 M σ 5 v,2 σ v, σ v,1 F z E, r, l 2 y z x Figure 1: Truss structure 2

3 with circular cross sectios ad fixed boudary coditios. Four equal tetrahedro structures may be assembled ito modules. The truss is supported statically determied at odes 2, 3 ad 4. A sigle force F z i z-directio acts at ode 1 ad two symmetric momets M act at odes 6 ad 7 as show i Figure 1. Due to the o-symmetric loadig, the maximum equivalet stress σ v (cf., e.g., Gross et al. (2007)) is ot equal i every beam. It is iflueced ot oly by the o-symmetric loadig via F z ad M but also by the beam s material parameter Youg s modulus E as well as geometric parameters radius r ad legth l. We assume that from the maufacturig process of the truss structure we kow for each beam parameter E, r ad l the correspodig rages [a, b]. The parameter vector p = (E, r, l) describes the parameters for all 24 equal beams withi the truss structure. The occurrig radom force F z is ormally distributed ad the momets M have trucated ormal distributio restricted to positive half-axis. A umerical fiite elemet model is built that calculates the maximum equivalet stress Y = σ v,1, Y = σ v,2 ad Y = σ v,3 i the beams 1, 2 ad 3, resp., that are the members with the highest equivalet stresses i this specific truss structure. So our parameter set is P = [a E, b E ] [a r, b r ] [a l, b l ]. The fuctio m : P R + R + describes the physical model of the truss structure ad Y = m(p, X) is the occurrig maximum equivalet stress i beam 1, 2 ad 3, resp., of the truss structure with load vector X = (F z, M). We are iterested i quatifyig the ucertaity i the radom quatity Y = m(p, X) which we characterize by determiig a iterval i which the value of Y is cotaied with high probability. Such a iterval ca be determied by computig quatiles of Y for α close to oe (leadig to the right ed poit of the iterval) ad α close to zero (leadig to the left ed poit of the iterval). However, we do ot kow the exact value of p P. Hece istead of computig the right ed poit of the iterval by a quatile of Y we compute for each p P the α-quatile q p,α of m(p, X) ad use sup q p,α for α close to oe as the right ed poit of the iterval. Similarly we ca costruct the left ed poit of the iterval by computig if q p,α for α close to zero. I the sequel we will describe how we ca defie estimates of these quatities. I order to simplify the otatio we cosider oly sup q p,α, the other value ca be estimated similarly. For p P we estimate the α-quatile of G p by the α-quatile of a empirical cumulative distributio fuctio correspodig to data poits m(p i, X i ), where p i is close to p ad X, X 1, X 2,... are idepedet ad idetically distributed radom variables. The we estimate the maximum quatile by the maximum of these quatile estimates. Uder suitable assumptios o the smoothess of the fuctio describig the depedecy of the quatiles o the parameter we aalyze the rate of covergece of this estimate. The fiite sample size behavior of the estimate is illustrated usig simulated data. I our proofs the mai trick is to relate the error of our quatile estimates to the behavior of 3

4 the order statistics of a fixed sequece of idepedet radom variables uiformly distributed o (0, 1). The estimatio problem i this paper is estimatio of a quatile fuctio i a fixed desig regressio problem. Usually this kid of problem is studied i the literature i the cotext of radom desig regressio, see, e.g., Yu, Lu ad Stader (2003), Koeker (2005), ad the literature cited therei. I the cotext of radom desig regressio the rate of covergece results for local averagig estimates of coditioal quatiles have bee derived i Bhattacharya ad Gagopadhya (1990). Results cocerig estimates based o support vector machies ca be foud i Steiwart ad Christma (2011). The problem of estimatig the maximal quatile was ot cosidered i the papers metioed above. The cocept of local averagig cosidered i this paper is also very popular i the cotext of oparametric regressio, cf., e.g., Nadaraya (1964, 1970), Watso (1964), Stoe (1977), Devroye ad Wager (1980), Györfi (1981), Devroye (1982), Devroye ad Krzyżak (1989), Devroye, Györfi, Krzyżak ad Lugosi (1994) or Beirlat ad Györfi (1998). Throughout the paper we use the followig otatio: N, R ad R + are the set of positive itegers, real umbers, ad oegative real umbers, resp. For z R we deote the smallest iteger greater tha or equal to z by z. For a set A its idicator fuctio (which takes o value oe o A ad zero outside of A) is deoted by I A. x = maxx (1),..., x (d) is the supremum orm of a vector x = (x (1),..., x (d) ) T order statistics are deoted by z 1:,..., z :, i.e., z 1:,..., z : R d. For z 1,..., z R the correspodig is a permuatio of z 1,..., z satisfyig z 1: z :. Similarly, the order statistics of real valued radom variables are defied. If Z are real valued radom variables ad δ > 0 are positive real umbers, we write Z = O P (δ ) if lim lim sup P Z > c δ = 0. c The defiitio of our estimate of the maximum quatile is give i Sectio 2, the mai result is formulated i Sectio 3 ad prove i Sectio 5. The fiite sample size behaviour of the estimate is illustrated i Sectio 4 usig simulated data. 2 Defiitio of the estimate I order to simplify the otatio we assume i the sequel for all of our theoretical cosideratios that the set of parameters is give by P = [0, 1] l. Let X, X 1, X 2,... be idepedet ad idetically distributed ad let p 1,..., p be equidistatly chose from P. I the sequel we estimate the maximum quatile sup q p,α from the data D = ((p 1, X 1 ), m(p 1, X 1 )),..., ((p, X ), m(p, X )). 4

5 For p P we estimate the cumulative distributio fuctio G p (y) = Pm(p, X) y by the empirical cumulative distributio fuctio based o all those m(p i, X i ) where the distace betwee p i ad p i supremum orm is at most h. Here h > 0 is a parameter of the estimate. I order to defie this estimate, let K = I [ 1,1] l be the aive kerel. The Ĝ p (y) = ( ) i=1 I m(p i,x i) y K p pi h ( ) j=1 K p pj h is our estimate of G p (y). Next, the quatile is estimated by the plug-i estimate q p,α = mi y R : G p (y) α ˆq p,α = mi Fially, we estimate the maximal quatile y R : Ĝ p (y) α. sup q p,α by the maximum of the estimated quatile ˆq p,α for p p 1,..., p, i.e., by ˆM = max ˆq p i=1,..., i,α. (1) 3 Mai results Our mai result is the followig boud o the error of our maximal quatile estimate. Theorem 1 Let α (0, 1), let P = [0, 1] l ad let m : P R d R be a measurable fuctio. Let X be a R d -valued radom variable ad for p P let G p be the cumulative distributio fuctio of m(p, X). Let q p,α = miy R : G p (y) α be the α-quatile of m(p, X). Assume that there exists δ > 0 such that for some c 1 > 0 for all p P the derivative of G p exists o [q p,α δ, q p,α + δ] ad is cotiuous o this iterval ad greater tha c 1. Let the estimate ˆM be defied as i (1) for some h > 0 satisfyig h 0 ( ) ad h l ( ). 5

6 The ( ) ˆM log() sup q p,α = O P + sup q p1,α q h l p2,α. p 1,p 2 P : p 1 p 2 h If we impose some smoothess coditio o the fuctio describig the depedecy of the values of the quatiles o the parameter p, we ca derive from the above theorem the followig rate of covergece result. Corollary 1 Assume that the assumptios of Theorem 1 hold ad that, i additio, q p,α is (r, C)- smooth as a fuctio of p P for some r (0, 1] ad C > 0, i.e., Set q p1,α q p2,α C p 1 p 2 r (p 1, p 2 P). ( log 2 () h = c )1/(2r+l) ad defie the estimate ˆM as i Sectio 2. The ( (log ˆM 2 )r/(2r+l) ) () sup q p,α = O P. Proof. Sice q p,α is (r, C)-smooth as a fuctio of p P we ca coclude from Theorem 1 ( ) ˆM log() sup q p,α = O P + C h r h l. The defiitio of h implies the result. Remark 1. The rate of covergece i Corollary 1 is (up to some logarithmic factor) the same as the optimal miimax rate of covergece for estimatio of a (r, C) smooth fuctio o a compact subset of R l i supremum orm derived i Stoe (1982). Remark 2. I ay applicatio of the above estimate we have to select the badwidth h of the estimate i a data-drive way. We suggest to choose the badwidth i a optimal way i view of estimatio of G p (y) by Ĝp(y) (p P) for suitable chose y R. To do this, we propose to use a versio of the well-kow splittig of the sample techique i oparametric regressio (cf., e.g., Chapter 7 i Györfi et al. (2002)). More precisely, let us assume that we have available additioal radom variables X 1,..., X such that X, X 1,..., X, X1,..., X are idepedet ad idetically distributed, ad that we observe m(p i, X i ) for these radom variables. We choose y as the α-quatile of the empirical cumulative distributio fuctio correspodig to m(p 1, X 1 ),..., m(p, X ), ad choose h by miimizig 1 I m(pi, X i) y Ĝp i (y) 2. i=1 6

7 I the ext sectio we will ivestigate the performace of this data-drive way of choosig the badwidth usig simulated data. 4 Applicatio to simulated data I this sectio we illustrate the fiite sample size behaviour of our estimate (i particular i view of the data-drive choice of the badwidth i Remark 2) with the aid of simulated data. I all simulatios we choose the sample size for the estimate of the maximal quatile as = 20, 000, where we use half of the data to choose the badwidth of the estimate from a fiite set of badwidths as described i Remark 2. I each simulatio we compute the absolute differece betwee the true value of the maximal quatile ad firstly the estimate with the data-drive choice of the badwidth ad secodly with the estimate with sample size /2 applied with that value of the badwidth which leads to the miimal error for the give set of data. The level of the quatile is chose as α = We repeat each simulatio 100 times ad report the mea values ad the stadard deviatios of the resultig 100 error values for the two estimates. I our first example we choose X as a stadard ormally distributed radom variable, P = [0, 1] ad defie m(p, X) = 0.5 si(2 π p) + X. Hece m(p, X) is ormally distributed with mea 0.5 si(2 π p) ad variace 1. I this case the α = 0.95 quatile of m(p, X) is give by q p,α 0.5 si(2 π p) We choose i this example the badwidth from the set 0.01, 0.05, 0.1, 0.15, 0.2, 0.3, 0.4. I Figure 2 we show a typical simulatio for the two quatile estimates usig the data-drive badwidth ad the optimal badwidth, respectively. The mea values (stadard deviatios) of our errors are i case of our data drive choice of the badwidth approximately give by (0.034) as compared to (0.015) i case of the optimal badwidth choice, which is ot applicable i practice sice it depeds o the ukow maximal value. Hece the mea absolute error of our estimate usig the datadrive choice of the badwidth is approximately oly 2.5-times larger tha the mea absolute error of the estimate usig the optimal choice of the badwidth, which is ot kow i practice. I our secod example we agai choose radom variable X with a stadard ormal distributio ad P = [0, 1], but this time we defie m by m(p, x) = exp(p + x), which implies that m(p, X) is logormally distributed ad its logarithm has mea p ad variace 1. Agai the α-quatiles are kow ad ca be computed with a statistics package. The badwidth is chose from the same set as i the first example. I Figure 3 we show typical simulatio for the 7

8 With data drive badwidth 0.1 With optimal badwidth Figure 2: Typical simulatio i the first model two quatile estimates usig the data-drive badwidth ad the optimal badwidth, respectively. I this example the mea values (stadard deviatios) of our errors are i case of our data drive With data drive badwidth 0.1 With optimal badwidth Figure 3: Typical simulatio i the secod model choice of the badwidth approximately give by (0.517) as compared to (0.285) i case of the optimal badwidth choice, hece the mea error of the data-drive badwidth choice is withi a factor of 2.5 of the mea value of the optimal (ad i practice ot available) badwidth choice. I our third example we choose X = (X (1), X (2), X (3) ) T where X (1), X (2) ad X (3) are ide- 8

9 pedet stadard ormally distributed radom variables, P = [ 0.5, 0.5] 3 ad m(p, x) = (p (1) + x (1) ) 2 + (p (2) + x (2) ) 2 + (p (3) + x (1) ) 2. I this case m(p, X) is o-cetral chi-square radom variable with 3 degrees of freedom ad ocetrality factor (p (1) ) 2 + (p (2) ) 2 + (p (3) ) 2. Agai the α-quatiles are kow ad ca be computed with a statistics package. The badwidth is chose from the same set as above. I this example the mea values (stadard deviatios) of our errors are i case of our data drive choice of the badwidth approximately give by (0.376) as compared to (0.144) i case of the optimal badwidth choice, hece the mea error of the data-drive badwidth choice is withi a factor of approximately 2 of the mea value of the optimal (ot available i practice) badwidth choice. Fially we apply our ewly proposed estimatio method i the applicatio described i Sectio 1. The umerical simulatio model of the truss structure i Figure 1 is obtaied by fiite elemets with the commercial software ANSYS. Each of the 24 beams is modeled with the same parameters E, r ad l. Timosheko beam theory is applied to all beams by usig BEAM189 elemets i ANSYS. The 10 coectig odes are modeled as rigid liks usig MPC184 elemets. Deflectio is costraied at ode 2 i x-, y- ad z-directio, at ode 3 i y- ad z-directio ad at ode 4 i z-directio such that the truss is supported statically determied. The beam parameters E, r ad l are assumed to be i the itervals give i Table 1. Radom Table 1: Ucertai parameters parameter p value uit Youg s modulus E [69.7, 73.7] 10 9 N/m 2 radius r [0.0049, ] m legth l [0.3995, ] m loadig is applied to ode 1 as a sigle force F z i z-directio ad two symmetric momets with magitude M that act aroud axes i the x-y-plae i a agle of +60 ad 60 from x-directio at odes 6 ad 7, respectively. The load F z is ormally distributed with mea value µ F z ad stadard deviatio σ F z. The momets M have trucated ormal distributio restricted to positive half axis, where the uderlyig ormal distributio has mea value µ M = 0 ad stadard deviatio σ M, see Table 2. To obtai the maximum equivalet stresses σ v,1, σ v,2 ad σ v,3, a static aalysis is carried out for each parameter set ad load set. Two idepedet radom load sets were geerated for each combiatio of 41 values of each parameter set distributed equidistatly i the parameter itervals. I total, determiistic calculatios were carried out. Applicatio of the ewly proposed estimate of the maximum 0.95-quatile to this data results i predicted maximum stresses of N/m 2, N/m 2 ad N/m 2 i the 9

10 Table 2: Ucertai loads load X value uit load F z µ F z = 1000, σ F z = N momets M µ M = 0, σ M = 3.33 Nm three cosidered beams, respectively. I cotrast, if we igore the ucertaity i the parameters ad just compute 0.95-quatiles usig the mea values of these parameters ad a data set of 41 3 correspodig values, we get istead N/m 2, N/m 2 ad N/m 2, hece we get values which are 3.8%, 4.9% ad 4.9% lower, respectively. 5 Proofs 5.1 Auxiliary results I this subsectio we formulate ad prove two lemmas which we will eed i the proof of Theorem 1. Our first lemma shows that if we mix coupled samples from several differet distributios, the empirical quatile of the mixed samples lies with probability oe betwee the miimal ad the maximal quatiles of the idividual samples. Here we use the fact that we do ot chage the joit distributio of our samples if we assume that the radom variables are defied by applyig the iverse of the cumulative distributio fuctio to uiformly distributed radom variables. More precisely, our settig is the followig: Let M N be the umber of samples which we mix, ad for k 1,..., M let H (k) : (0, 1) R be a mootoically icreasig fuctio (which we will choose later as the iverse fuctio of a cumulative distributio fuctio). Let N N, let u 1,..., u N (0, 1) ad deote by u 1:N,..., u N:N the correspodig order statistics. We defie M differet coupled samples by z (k) i = H (k) (u i ) (i = 1,..., N) for k 1,..., M, ad we mix them by choosig for i = 1,..., N. Let Ĝ (k) (y) = 1 N N i=1 z i z (1) i,..., z (M) i I z (k) i y ad Ĝ(y) = 1 N N i=1 I zi y be the empirical cumulative distributio fuctios correspodig to z (k) 1,..., z(k) N ad z 1,..., z N, resp., ad let ˆq (k) α = mi y R : Ĝ (k) (y) α ad ˆq α = mi y R : Ĝ(y) α 10

11 be the correspodig quatile estimates. The the followig result holds. Lemma 1 Defie ˆq (k) α ad ˆq α as above. The mi k=1,...,m ˆq(k) α ˆq α max k=1,...,m ˆq(k) α. Proof. By costructio we have ˆq (k) α = H (k) (u N α :N ) (2) for k 1,..., M. Furthermore, for k 1,..., k N 1,..., M suitably chose we have that H (k1) (u 1:N ),..., H (k N ) (u N:N ) is a permuatio of z 1,..., z N. Choose k mi, k max 1,..., M such that mi k=1,...,m ˆq(k) α = ˆq α (kmi) ad max k=1,...,m ˆq(k) α = ˆq α (kmax). The by (2) we have for ay k 1,..., M H (kmi) (u N α :N ) H (k) (u N α :N ) H (kmax) (u N α :N ). Usig this relatio ad the mootoicity of H (k) we get for ay i 1,..., N α ad for ay i N α,..., N H (ki) (u i:n ) H (ki) (u N α :N ) H (kmax) (u N α :N ) = max k=1,...,m ˆq(k) α H (ki) (u i:n ) H (ki) (u N α :N ) H (kmi) (u N α :N ) = mi k=1,...,m ˆq(k) α. Now, let z 1:N,..., z N:N be the order statistics of z 1,..., z. The the above iequalities imply Sice z N α :N = ˆq α this completes the proof. mi k=1,...,m ˆq(k) α z N α :N max k=1,...,m ˆq(k) α. Our secod lemma relates the error of the quatile estimate to the behavior of order statistics correspodig to some fixed sequece of idepedet radom variables distributed uiformly o (0, 1). 11

12 Lemma 2 Let p P be fixed ad let the estimate ˆq p,α be defied as i Sectio 2. Assume that there exists a costat c 1 > 0 such that for some δ > 0 we have that G p is cotiuously differetiable o [q p,α δ, q p,α + δ] with derivative bouded from below by c 1, for all p P satisfyig p p δ. Let N = 1 i : p p i h be the umber of parameters p i which have at most suporm distace h to p. Let U 1,..., U N be idepedet radom variables distributed uiformly o (0, 1) ad deote their order statistics by U 1:N,..., U N:N. The we have for ay 0 < ɛ < δ satisfyig δ h : P ˆq p,α q p,α > ɛ + Proof. For p P let sup q p,α q p,α : p p h G 1 p (u) = mi y R : G p (y) u (u (0, 1)) 2 P U N α :N α > c1 ɛ. be the geeralized iverse of G p. The it is well-kow that G 1 p (U 1 ) has the same distributio as m( p, X 1 ). Sice Ĝp is by defiitio (observe that K is the aive kerel) the empirical cumulative distributio fuctio of m(p i1, X i1 ),..., m(p in, X in ) for suitable chose pairwise distict i 1,..., i N 1,...,, we see that it has the same distributio as the empirical cumulative distributio fuctio ĜN of G 1 p i1 (U 1 ),..., G 1 p in (U N ). Cosequetly, ˆq p,α has the same distributio as the quatile estimate ˆq U α correspodig to ĜN which implies for ay η > 0 Let P ˆq p,α q p,α > η = P ˆq U α q p,α > η. Ĝ (k) (y) = 1 N N i=1 I G 1 (U i) y be the cumulative emprirical distributio fuctio of G 1 (U 1 ),..., G 1 (U N ) ad deote by ˆq α (k) the correspodig quatile estimate (k = 1,..., N). Applicatio of Lemma 1 yields for ay η > 0 P ˆq α U q p,α > η P mi k=1,...,n ˆq(k) α q p,α > η + P max k=1,...,n ˆq(k) α q p,α > η 2 P max ˆq α (k) q pik,α + max q pik,α q p,α > η. k=1,...,n Summarizig the above results we see that we have show P ˆq p,α q p,α > ɛ + 2 P = 2 P max ˆq (k) k=1,...,n max G 1 k=1,...,n sup : p p h k=1,...,n q p,α q p,α α q pik,α > ɛ (U N α :N ) G 1 (α) > ɛ. 12

13 By the assumptios of Lemma 2, o [q pik,α δ, q pik,α + δ] the derivative of G pik exists ad is bouded from below by c 1 > 0. Cosequetly, o [α c 1 δ, α + c 1 δ], the iverse fuctio of G pik is cotiuously differetiable with derivative bouded from above by 1/c 1. So i case that U N α :N α < c1 δ we ca coclude from the mea value theorem that we have G 1 (U N α :N ) G 1 (α) 1 U N α :N α. c 1 This implies P max G 1 k=1,...,n which completes the proof. (U N α :N ) G 1 (α) > ɛ P U N α :N α > c 1 ɛ, 5.2 Proof of Theorem 1 Sice ˆM sup q p,α it suffices to show ˆM max ad Because of i=1,..., q p i,α ˆM max i=1,..., q p i,α + max q p i,α sup q p,α i=1,..., ( ) = O log() P + sup q p1,α q 1/l p2,α h p 1,p 2 P : p 1 p 2 h ( ) max q log() p i,α sup q p,α = O + sup q p1,α q i=1,..., 1/l p2,α. (4) h p 1,p 2 P : p 1 p 2 h max q p i,α sup i=1,..., q p,α = sup q p,α max i=1,..., q p i,α = sup mi (q p,α q i=1,..., pi,α) sup q p1,α q p2,α, p 1,p 2 P : p 1 p 2 1/ 1/l (4) follows from 1/l h for sufficietly large, which is implied by h l ( ). I order to prove (3) we observe that for a i, b i R (i = 1,..., ) we have max a i max b i i=1,..., i=1,..., max a i b i, (5) i=1,..., which together with the uio boud implies for arbitrary ɛ > 0 P ˆM max q p i,α i=1,..., > ɛ + sup q p1,α q p2,α p 1,p 2 P : p 1 p 2 h P max ˆq p i,α q i=1,..., pi,α > ɛ + sup q p1,α q p2,α p 1,p 2 P : p 1 p 2 h max P i=1,..., ˆq pi,α q pi,α > ɛ + sup q p1,α q p2,α p 1,p 2 P : p 1 p 2 h (3). (6) 13

14 Let U i be defied as i Lemma 2 ad let F N be the empirical cumulative distributio fuctio correspodig to U 1,..., U N ad let F be the cumulative distributio fuctio of U 1. The P U N α :N α > c 1 ɛ N α = P F (U N α :N) F N (U N α :N) + N α > c 1 ɛ P sup F N (t) F (t) > c 1 ɛ 1. t R N I case that c 1 ɛ > 1 N we ca boud the last probability usig well-kow results from Vapik- Chervoekis theory (cf., e.g., Theorem 12.4 i Devroye, Györfi ad Lugosi (1996)) ad get P U N α :N α > c1 ɛ ( ( 8 (N + 1) exp N c 1 ɛ 1 ) ) 2 /32. N Usig this, N h l, Lemma 2 ad (6) we coclude P ˆM max q p i,α i=1,..., > ɛ + sup q p1,α q p2,α p 1,p 2 P : p 1 p 2 h ( ( c 3 ( h l + 1) exp c 3 h l c 1 ɛ 1 ) ) 2 h l, which implies ˆM max i=1,..., q p i,α = O P ( ) log() h l + sup q p1,α q p2,α. p 1,p 2 P : p 1 p 2 h This completes the proof. 6 Ackowledgemets The authors would like to thak the Germa Research Foudatio (DFG) for fudig this project withi the Collaborative Research Cetre SFB 805. Refereces [1] Bhattacharya, P. K. ad Gagopadhya, A. K. (1990). Kerel ad earest-eighbor estimatio of coditioal quatile. Aals of Statistics 18, pp [2] Beirlat, J. ad Györfi, L. (1998). O the asymptotic L 2 -error i partitioig regressio estimatio. Joural of Statistical Plaig ad Iferece, 71, pp [3] Devroye, L. (1982). Necessary ad sufficiet coditios for the almost everywhere covergece of earest eighbor regressio fuctio estimates. Zeitschrift für Wahrscheilichkeitstheorie ud verwadte Gebiete, 61, pp

15 [4] Devroye, L., Györfi, L., Krzyżak, A., ad Lugosi, G. (1994). O the strog uiversal cosistecy of earest eighbor regressio fuctio estimates. Aals of Statistics, 22, pp [5] Devroye, L., Györfi, L. ad Lugosi, G. (1996). A Probabilistic Theory of Patter Recogitio. Spriger-Verlag, New York. [6] Devroye, L. ad Krzyżak, A. (1989). A equivalece theorem for L 1 covergece of the kerel regressio estimate. Joural of Statistical Plaig ad Iferece, 23, pp [7] Devroye, L. ad Wager, T. J. (1980). Distributio-free cosistecy results i oparametric discrimiatio ad regressio fuctio estimatio. Aals of Statistics, 8, pp [8] Gross, D., Hauger, W., Schröder, J. ad Wall, W. A. (2007). Techische Mechaik Bad 2: Elastostatik. Spriger-Verlag, New York. [9] Györfi, L. (1981). Recet results o oparametric regressio estimate ad multiple classificatio. Problems of Cotrol ad Iformatio Theory, 10, pp [10] Györfi, L., Kohler, M., Krzyżak, A. ad Walk, H. (2002). A Distributio-Free Theory of Noparametric Regressio. Spriger Series i Statistics, Spriger-Verlag, New York. [11] Koeker, R. (2005). Quatile regressio. Cambridge Uiversity Press. [12] Nadaraya, E. A. (1964). O estimatig regressio. Theory of Probability ad its Applicatios, 9, pp [13] Nadaraya, E. A. (1970). Remarks o oparametric estimates for desity fuctios ad regressio curves. Theory of Probability ad its Applicatios, 15, pp [14] Steiwart, I. ad Christma, A. (2011). Estimatio coditioal quatiles with the help of the piball loss. Beroulli 17, pp [15] Stoe, C. J. (1977). Cosistet oparametric regressio. Aals of Statististics, 5, pp [16] Stoe, C. J. (1982). Optimal global rates of covergece for oparametric regressio. Aals of Statistics, 10, pp [17] Watso, G. S. (1964). Smooth regressio aalysis. Sakhya Series A, 26, pp [18] Yu, K., Lu, Z. ad Stader, J. (2003). Quatile regressio: applicatio ad curret research areas. Joural of the Royal Statistical Society, Series D 52, pp

Estimation of the essential supremum of a regression function

Estimation of the essential supremum of a regression function Estimatio of the essetial supremum of a regressio fuctio Michael ohler, Adam rzyżak 2, ad Harro Walk 3 Fachbereich Mathematik, Techische Uiversität Darmstadt, Schlossgartestr. 7, 64289 Darmstadt, Germay,

More information

Journal of Multivariate Analysis. Superefficient estimation of the marginals by exploiting knowledge on the copula

Journal of Multivariate Analysis. Superefficient estimation of the marginals by exploiting knowledge on the copula Joural of Multivariate Aalysis 102 (2011) 1315 1319 Cotets lists available at ScieceDirect Joural of Multivariate Aalysis joural homepage: www.elsevier.com/locate/jmva Superefficiet estimatio of the margials

More information

Sieve Estimators: Consistency and Rates of Convergence

Sieve Estimators: Consistency and Rates of Convergence EECS 598: Statistical Learig Theory, Witer 2014 Topic 6 Sieve Estimators: Cosistecy ad Rates of Covergece Lecturer: Clayto Scott Scribe: Julia Katz-Samuels, Brado Oselio, Pi-Yu Che Disclaimer: These otes

More information

Chapter 3. Strong convergence. 3.1 Definition of almost sure convergence

Chapter 3. Strong convergence. 3.1 Definition of almost sure convergence Chapter 3 Strog covergece As poited out i the Chapter 2, there are multiple ways to defie the otio of covergece of a sequece of radom variables. That chapter defied covergece i probability, covergece i

More information

7.1 Convergence of sequences of random variables

7.1 Convergence of sequences of random variables Chapter 7 Limit Theorems Throughout this sectio we will assume a probability space (, F, P), i which is defied a ifiite sequece of radom variables (X ) ad a radom variable X. The fact that for every ifiite

More information

7.1 Convergence of sequences of random variables

7.1 Convergence of sequences of random variables Chapter 7 Limit theorems Throughout this sectio we will assume a probability space (Ω, F, P), i which is defied a ifiite sequece of radom variables (X ) ad a radom variable X. The fact that for every ifiite

More information

Convergence of random variables. (telegram style notes) P.J.C. Spreij

Convergence of random variables. (telegram style notes) P.J.C. Spreij Covergece of radom variables (telegram style otes).j.c. Spreij this versio: September 6, 2005 Itroductio As we kow, radom variables are by defiitio measurable fuctios o some uderlyig measurable space

More information

Bull. Korean Math. Soc. 36 (1999), No. 3, pp. 451{457 THE STRONG CONSISTENCY OF NONLINEAR REGRESSION QUANTILES ESTIMATORS Seung Hoe Choi and Hae Kyung

Bull. Korean Math. Soc. 36 (1999), No. 3, pp. 451{457 THE STRONG CONSISTENCY OF NONLINEAR REGRESSION QUANTILES ESTIMATORS Seung Hoe Choi and Hae Kyung Bull. Korea Math. Soc. 36 (999), No. 3, pp. 45{457 THE STRONG CONSISTENCY OF NONLINEAR REGRESSION QUANTILES ESTIMATORS Abstract. This paper provides suciet coditios which esure the strog cosistecy of regressio

More information

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 19 11/17/2008 LAWS OF LARGE NUMBERS II THE STRONG LAW OF LARGE NUMBERS

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 19 11/17/2008 LAWS OF LARGE NUMBERS II THE STRONG LAW OF LARGE NUMBERS MASSACHUSTTS INSTITUT OF TCHNOLOGY 6.436J/5.085J Fall 2008 Lecture 9 /7/2008 LAWS OF LARG NUMBRS II Cotets. The strog law of large umbers 2. The Cheroff boud TH STRONG LAW OF LARG NUMBRS While the weak

More information

Nonparametric estimation of conditional distributions

Nonparametric estimation of conditional distributions Noparametric estimatio of coditioal distributios László Györfi 1 ad Michael Kohler 2 1 Departmet of Computer Sciece ad Iformatio Theory, udapest Uiversity of Techology ad Ecoomics, 1521 Stoczek, U.2, udapest,

More information

Random Variables, Sampling and Estimation

Random Variables, Sampling and Estimation Chapter 1 Radom Variables, Samplig ad Estimatio 1.1 Itroductio This chapter will cover the most importat basic statistical theory you eed i order to uderstad the ecoometric material that will be comig

More information

ECE 901 Lecture 12: Complexity Regularization and the Squared Loss

ECE 901 Lecture 12: Complexity Regularization and the Squared Loss ECE 90 Lecture : Complexity Regularizatio ad the Squared Loss R. Nowak 5/7/009 I the previous lectures we made use of the Cheroff/Hoeffdig bouds for our aalysis of classifier errors. Hoeffdig s iequality

More information

An Introduction to Randomized Algorithms

An Introduction to Randomized Algorithms A Itroductio to Radomized Algorithms The focus of this lecture is to study a radomized algorithm for quick sort, aalyze it usig probabilistic recurrece relatios, ad also provide more geeral tools for aalysis

More information

Kolmogorov-Smirnov type Tests for Local Gaussianity in High-Frequency Data

Kolmogorov-Smirnov type Tests for Local Gaussianity in High-Frequency Data Proceedigs 59th ISI World Statistics Cogress, 5-30 August 013, Hog Kog (Sessio STS046) p.09 Kolmogorov-Smirov type Tests for Local Gaussiaity i High-Frequecy Data George Tauche, Duke Uiversity Viktor Todorov,

More information

Optimally Sparse SVMs

Optimally Sparse SVMs A. Proof of Lemma 3. We here prove a lower boud o the umber of support vectors to achieve geeralizatio bouds of the form which we cosider. Importatly, this result holds ot oly for liear classifiers, but

More information

32 estimating the cumulative distribution function

32 estimating the cumulative distribution function 32 estimatig the cumulative distributio fuctio 4.6 types of cofidece itervals/bads Let F be a class of distributio fuctios F ad let θ be some quatity of iterest, such as the mea of F or the whole fuctio

More information

Lecture 7: Density Estimation: k-nearest Neighbor and Basis Approach

Lecture 7: Density Estimation: k-nearest Neighbor and Basis Approach STAT 425: Itroductio to Noparametric Statistics Witer 28 Lecture 7: Desity Estimatio: k-nearest Neighbor ad Basis Approach Istructor: Ye-Chi Che Referece: Sectio 8.4 of All of Noparametric Statistics.

More information

Sequences and Series of Functions

Sequences and Series of Functions Chapter 6 Sequeces ad Series of Fuctios 6.1. Covergece of a Sequece of Fuctios Poitwise Covergece. Defiitio 6.1. Let, for each N, fuctio f : A R be defied. If, for each x A, the sequece (f (x)) coverges

More information

Lecture 19: Convergence

Lecture 19: Convergence Lecture 19: Covergece Asymptotic approach I statistical aalysis or iferece, a key to the success of fidig a good procedure is beig able to fid some momets ad/or distributios of various statistics. I may

More information

5. INEQUALITIES, LIMIT THEOREMS AND GEOMETRIC PROBABILITY

5. INEQUALITIES, LIMIT THEOREMS AND GEOMETRIC PROBABILITY IA Probability Let Term 5 INEQUALITIES, LIMIT THEOREMS AND GEOMETRIC PROBABILITY 51 Iequalities Suppose that X 0 is a radom variable takig o-egative values ad that c > 0 is a costat The P X c E X, c is

More information

On Random Line Segments in the Unit Square

On Random Line Segments in the Unit Square O Radom Lie Segmets i the Uit Square Thomas A. Courtade Departmet of Electrical Egieerig Uiversity of Califoria Los Ageles, Califoria 90095 Email: tacourta@ee.ucla.edu I. INTRODUCTION Let Q = [0, 1] [0,

More information

Kernel density estimator

Kernel density estimator Jauary, 07 NONPARAMETRIC ERNEL DENSITY ESTIMATION I this lecture, we discuss kerel estimatio of probability desity fuctios PDF Noparametric desity estimatio is oe of the cetral problems i statistics I

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

Stochastic Simulation

Stochastic Simulation Stochastic Simulatio 1 Itroductio Readig Assigmet: Read Chapter 1 of text. We shall itroduce may of the key issues to be discussed i this course via a couple of model problems. Model Problem 1 (Jackso

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2016 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

1 Introduction to reducing variance in Monte Carlo simulations

1 Introduction to reducing variance in Monte Carlo simulations Copyright c 010 by Karl Sigma 1 Itroductio to reducig variace i Mote Carlo simulatios 11 Review of cofidece itervals for estimatig a mea I statistics, we estimate a ukow mea µ = E(X) of a distributio by

More information

ECE 8527: Introduction to Machine Learning and Pattern Recognition Midterm # 1. Vaishali Amin Fall, 2015

ECE 8527: Introduction to Machine Learning and Pattern Recognition Midterm # 1. Vaishali Amin Fall, 2015 ECE 8527: Itroductio to Machie Learig ad Patter Recogitio Midterm # 1 Vaishali Ami Fall, 2015 tue39624@temple.edu Problem No. 1: Cosider a two-class discrete distributio problem: ω 1 :{[0,0], [2,0], [2,2],

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

Basics of Probability Theory (for Theory of Computation courses)

Basics of Probability Theory (for Theory of Computation courses) Basics of Probability Theory (for Theory of Computatio courses) Oded Goldreich Departmet of Computer Sciece Weizma Istitute of Sciece Rehovot, Israel. oded.goldreich@weizma.ac.il November 24, 2008 Preface.

More information

Statistical Pattern Recognition

Statistical Pattern Recognition Statistical Patter Recogitio Classificatio: No-Parametric Modelig Hamid R. Rabiee Jafar Muhammadi Sprig 2014 http://ce.sharif.edu/courses/92-93/2/ce725-2/ Ageda Parametric Modelig No-Parametric Modelig

More information

It is always the case that unions, intersections, complements, and set differences are preserved by the inverse image of a function.

It is always the case that unions, intersections, complements, and set differences are preserved by the inverse image of a function. MATH 532 Measurable Fuctios Dr. Neal, WKU Throughout, let ( X, F, µ) be a measure space ad let (!, F, P ) deote the special case of a probability space. We shall ow begi to study real-valued fuctios defied

More information

A statistical method to determine sample size to estimate characteristic value of soil parameters

A statistical method to determine sample size to estimate characteristic value of soil parameters A statistical method to determie sample size to estimate characteristic value of soil parameters Y. Hojo, B. Setiawa 2 ad M. Suzuki 3 Abstract Sample size is a importat factor to be cosidered i determiig

More information

Probability and statistics: basic terms

Probability and statistics: basic terms Probability ad statistics: basic terms M. Veeraraghava August 203 A radom variable is a rule that assigs a umerical value to each possible outcome of a experimet. Outcomes of a experimet form the sample

More information

5. Likelihood Ratio Tests

5. Likelihood Ratio Tests 1 of 5 7/29/2009 3:16 PM Virtual Laboratories > 9. Hy pothesis Testig > 1 2 3 4 5 6 7 5. Likelihood Ratio Tests Prelimiaries As usual, our startig poit is a radom experimet with a uderlyig sample space,

More information

62. Power series Definition 16. (Power series) Given a sequence {c n }, the series. c n x n = c 0 + c 1 x + c 2 x 2 + c 3 x 3 +

62. Power series Definition 16. (Power series) Given a sequence {c n }, the series. c n x n = c 0 + c 1 x + c 2 x 2 + c 3 x 3 + 62. Power series Defiitio 16. (Power series) Give a sequece {c }, the series c x = c 0 + c 1 x + c 2 x 2 + c 3 x 3 + is called a power series i the variable x. The umbers c are called the coefficiets of

More information

ECONOMETRIC THEORY. MODULE XIII Lecture - 34 Asymptotic Theory and Stochastic Regressors

ECONOMETRIC THEORY. MODULE XIII Lecture - 34 Asymptotic Theory and Stochastic Regressors ECONOMETRIC THEORY MODULE XIII Lecture - 34 Asymptotic Theory ad Stochastic Regressors Dr. Shalabh Departmet of Mathematics ad Statistics Idia Istitute of Techology Kapur Asymptotic theory The asymptotic

More information

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/15.070J Fall 2013 Lecture 3 9/11/2013. Large deviations Theory. Cramér s Theorem

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/15.070J Fall 2013 Lecture 3 9/11/2013. Large deviations Theory. Cramér s Theorem MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/5.070J Fall 203 Lecture 3 9//203 Large deviatios Theory. Cramér s Theorem Cotet.. Cramér s Theorem. 2. Rate fuctio ad properties. 3. Chage of measure techique.

More information

Econ 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chi-square Distribution, Student s t distribution 1.

Econ 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chi-square Distribution, Student s t distribution 1. Eco 325/327 Notes o Sample Mea, Sample Proportio, Cetral Limit Theorem, Chi-square Distributio, Studet s t distributio 1 Sample Mea By Hiro Kasahara We cosider a radom sample from a populatio. Defiitio

More information

Singular Continuous Measures by Michael Pejic 5/14/10

Singular Continuous Measures by Michael Pejic 5/14/10 Sigular Cotiuous Measures by Michael Peic 5/4/0 Prelimiaries Give a set X, a σ-algebra o X is a collectio of subsets of X that cotais X ad ad is closed uder complemetatio ad coutable uios hece, coutable

More information

Analytic Continuation

Analytic Continuation Aalytic Cotiuatio The stadard example of this is give by Example Let h (z) = 1 + z + z 2 + z 3 +... kow to coverge oly for z < 1. I fact h (z) = 1/ (1 z) for such z. Yet H (z) = 1/ (1 z) is defied for

More information

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample. Statistical Iferece (Chapter 10) Statistical iferece = lear about a populatio based o the iformatio provided by a sample. Populatio: The set of all values of a radom variable X of iterest. Characterized

More information

Integrable Functions. { f n } is called a determining sequence for f. If f is integrable with respect to, then f d does exist as a finite real number

Integrable Functions. { f n } is called a determining sequence for f. If f is integrable with respect to, then f d does exist as a finite real number MATH 532 Itegrable Fuctios Dr. Neal, WKU We ow shall defie what it meas for a measurable fuctio to be itegrable, show that all itegral properties of simple fuctios still hold, ad the give some coditios

More information

Confidence Interval for Standard Deviation of Normal Distribution with Known Coefficients of Variation

Confidence Interval for Standard Deviation of Normal Distribution with Known Coefficients of Variation Cofidece Iterval for tadard Deviatio of Normal Distributio with Kow Coefficiets of Variatio uparat Niwitpog Departmet of Applied tatistics, Faculty of Applied ciece Kig Mogkut s Uiversity of Techology

More information

The standard deviation of the mean

The standard deviation of the mean Physics 6C Fall 20 The stadard deviatio of the mea These otes provide some clarificatio o the distictio betwee the stadard deviatio ad the stadard deviatio of the mea.. The sample mea ad variace Cosider

More information

CEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering

CEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering CEE 5 Autum 005 Ucertaity Cocepts for Geotechical Egieerig Basic Termiology Set A set is a collectio of (mutually exclusive) objects or evets. The sample space is the (collectively exhaustive) collectio

More information

Definition 4.2. (a) A sequence {x n } in a Banach space X is a basis for X if. unique scalars a n (x) such that x = n. a n (x) x n. (4.

Definition 4.2. (a) A sequence {x n } in a Banach space X is a basis for X if. unique scalars a n (x) such that x = n. a n (x) x n. (4. 4. BASES I BAACH SPACES 39 4. BASES I BAACH SPACES Sice a Baach space X is a vector space, it must possess a Hamel, or vector space, basis, i.e., a subset {x γ } γ Γ whose fiite liear spa is all of X ad

More information

1 of 7 7/16/2009 6:06 AM Virtual Laboratories > 6. Radom Samples > 1 2 3 4 5 6 7 6. Order Statistics Defiitios Suppose agai that we have a basic radom experimet, ad that X is a real-valued radom variable

More information

Let us give one more example of MLE. Example 3. The uniform distribution U[0, θ] on the interval [0, θ] has p.d.f.

Let us give one more example of MLE. Example 3. The uniform distribution U[0, θ] on the interval [0, θ] has p.d.f. Lecture 5 Let us give oe more example of MLE. Example 3. The uiform distributio U[0, ] o the iterval [0, ] has p.d.f. { 1 f(x =, 0 x, 0, otherwise The likelihood fuctio ϕ( = f(x i = 1 I(X 1,..., X [0,

More information

Basis for simulation techniques

Basis for simulation techniques Basis for simulatio techiques M. Veeraraghava, March 7, 004 Estimatio is based o a collectio of experimetal outcomes, x, x,, x, where each experimetal outcome is a value of a radom variable. x i. Defiitios

More information

Random Walks on Discrete and Continuous Circles. by Jeffrey S. Rosenthal School of Mathematics, University of Minnesota, Minneapolis, MN, U.S.A.

Random Walks on Discrete and Continuous Circles. by Jeffrey S. Rosenthal School of Mathematics, University of Minnesota, Minneapolis, MN, U.S.A. Radom Walks o Discrete ad Cotiuous Circles by Jeffrey S. Rosethal School of Mathematics, Uiversity of Miesota, Mieapolis, MN, U.S.A. 55455 (Appeared i Joural of Applied Probability 30 (1993), 780 789.)

More information

Statistics 511 Additional Materials

Statistics 511 Additional Materials Cofidece Itervals o mu Statistics 511 Additioal Materials This topic officially moves us from probability to statistics. We begi to discuss makig ifereces about the populatio. Oe way to differetiate probability

More information

Empirical Processes: Glivenko Cantelli Theorems

Empirical Processes: Glivenko Cantelli Theorems Empirical Processes: Gliveko Catelli Theorems Mouliath Baerjee Jue 6, 200 Gliveko Catelli classes of fuctios The reader is referred to Chapter.6 of Weller s Torgo otes, Chapter??? of VDVW ad Chapter 8.3

More information

Product measures, Tonelli s and Fubini s theorems For use in MAT3400/4400, autumn 2014 Nadia S. Larsen. Version of 13 October 2014.

Product measures, Tonelli s and Fubini s theorems For use in MAT3400/4400, autumn 2014 Nadia S. Larsen. Version of 13 October 2014. Product measures, Toelli s ad Fubii s theorems For use i MAT3400/4400, autum 2014 Nadia S. Larse Versio of 13 October 2014. 1. Costructio of the product measure The purpose of these otes is to preset the

More information

A RANK STATISTIC FOR NON-PARAMETRIC K-SAMPLE AND CHANGE POINT PROBLEMS

A RANK STATISTIC FOR NON-PARAMETRIC K-SAMPLE AND CHANGE POINT PROBLEMS J. Japa Statist. Soc. Vol. 41 No. 1 2011 67 73 A RANK STATISTIC FOR NON-PARAMETRIC K-SAMPLE AND CHANGE POINT PROBLEMS Yoichi Nishiyama* We cosider k-sample ad chage poit problems for idepedet data i a

More information

PRELIM PROBLEM SOLUTIONS

PRELIM PROBLEM SOLUTIONS PRELIM PROBLEM SOLUTIONS THE GRAD STUDENTS + KEN Cotets. Complex Aalysis Practice Problems 2. 2. Real Aalysis Practice Problems 2. 4 3. Algebra Practice Problems 2. 8. Complex Aalysis Practice Problems

More information

5.1 A mutual information bound based on metric entropy

5.1 A mutual information bound based on metric entropy Chapter 5 Global Fao Method I this chapter, we exted the techiques of Chapter 2.4 o Fao s method the local Fao method) to a more global costructio. I particular, we show that, rather tha costructig a local

More information

G. R. Pasha Department of Statistics Bahauddin Zakariya University Multan, Pakistan

G. R. Pasha Department of Statistics Bahauddin Zakariya University Multan, Pakistan Deviatio of the Variaces of Classical Estimators ad Negative Iteger Momet Estimator from Miimum Variace Boud with Referece to Maxwell Distributio G. R. Pasha Departmet of Statistics Bahauddi Zakariya Uiversity

More information

Discrete Mathematics for CS Spring 2008 David Wagner Note 22

Discrete Mathematics for CS Spring 2008 David Wagner Note 22 CS 70 Discrete Mathematics for CS Sprig 2008 David Wager Note 22 I.I.D. Radom Variables Estimatig the bias of a coi Questio: We wat to estimate the proportio p of Democrats i the US populatio, by takig

More information

Solution. 1 Solutions of Homework 1. Sangchul Lee. October 27, Problem 1.1

Solution. 1 Solutions of Homework 1. Sangchul Lee. October 27, Problem 1.1 Solutio Sagchul Lee October 7, 017 1 Solutios of Homework 1 Problem 1.1 Let Ω,F,P) be a probability space. Show that if {A : N} F such that A := lim A exists, the PA) = lim PA ). Proof. Usig the cotiuity

More information

Output Analysis and Run-Length Control

Output Analysis and Run-Length Control IEOR E4703: Mote Carlo Simulatio Columbia Uiversity c 2017 by Marti Haugh Output Aalysis ad Ru-Legth Cotrol I these otes we describe how the Cetral Limit Theorem ca be used to costruct approximate (1 α%

More information

Linear regression. Daniel Hsu (COMS 4771) (y i x T i β)2 2πσ. 2 2σ 2. 1 n. (x T i β y i ) 2. 1 ˆβ arg min. β R n d

Linear regression. Daniel Hsu (COMS 4771) (y i x T i β)2 2πσ. 2 2σ 2. 1 n. (x T i β y i ) 2. 1 ˆβ arg min. β R n d Liear regressio Daiel Hsu (COMS 477) Maximum likelihood estimatio Oe of the simplest liear regressio models is the followig: (X, Y ),..., (X, Y ), (X, Y ) are iid radom pairs takig values i R d R, ad Y

More information

Lecture 4: April 10, 2013

Lecture 4: April 10, 2013 TTIC/CMSC 1150 Mathematical Toolkit Sprig 01 Madhur Tulsiai Lecture 4: April 10, 01 Scribe: Haris Agelidakis 1 Chebyshev s Iequality recap I the previous lecture, we used Chebyshev s iequality to get a

More information

Efficient GMM LECTURE 12 GMM II

Efficient GMM LECTURE 12 GMM II DECEMBER 1 010 LECTURE 1 II Efficiet The estimator depeds o the choice of the weight matrix A. The efficiet estimator is the oe that has the smallest asymptotic variace amog all estimators defied by differet

More information

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING Lectures MODULE 5 STATISTICS II. Mea ad stadard error of sample data. Biomial distributio. Normal distributio 4. Samplig 5. Cofidece itervals

More information

Lecture 2. The Lovász Local Lemma

Lecture 2. The Lovász Local Lemma Staford Uiversity Sprig 208 Math 233A: No-costructive methods i combiatorics Istructor: Ja Vodrák Lecture date: Jauary 0, 208 Origial scribe: Apoorva Khare Lecture 2. The Lovász Local Lemma 2. Itroductio

More information

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/15.070J Fall 2013 Lecture 21 11/27/2013

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/15.070J Fall 2013 Lecture 21 11/27/2013 MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/15.070J Fall 2013 Lecture 21 11/27/2013 Fuctioal Law of Large Numbers. Costructio of the Wieer Measure Cotet. 1. Additioal techical results o weak covergece

More information

Infinite Sequences and Series

Infinite Sequences and Series Chapter 6 Ifiite Sequeces ad Series 6.1 Ifiite Sequeces 6.1.1 Elemetary Cocepts Simply speakig, a sequece is a ordered list of umbers writte: {a 1, a 2, a 3,...a, a +1,...} where the elemets a i represet

More information

1 Inferential Methods for Correlation and Regression Analysis

1 Inferential Methods for Correlation and Regression Analysis 1 Iferetial Methods for Correlatio ad Regressio Aalysis I the chapter o Correlatio ad Regressio Aalysis tools for describig bivariate cotiuous data were itroduced. The sample Pearso Correlatio Coefficiet

More information

On Classification Based on Totally Bounded Classes of Functions when There are Incomplete Covariates

On Classification Based on Totally Bounded Classes of Functions when There are Incomplete Covariates Joural of Statistical Theory ad Applicatios Volume, Number 4, 0, pp. 353-369 ISSN 538-7887 O Classificatio Based o Totally Bouded Classes of Fuctios whe There are Icomplete Covariates Majid Mojirsheibai

More information

Mi-Hwa Ko and Tae-Sung Kim

Mi-Hwa Ko and Tae-Sung Kim J. Korea Math. Soc. 42 2005), No. 5, pp. 949 957 ALMOST SURE CONVERGENCE FOR WEIGHTED SUMS OF NEGATIVELY ORTHANT DEPENDENT RANDOM VARIABLES Mi-Hwa Ko ad Tae-Sug Kim Abstract. For weighted sum of a sequece

More information

Comparison of Minimum Initial Capital with Investment and Non-investment Discrete Time Surplus Processes

Comparison of Minimum Initial Capital with Investment and Non-investment Discrete Time Surplus Processes The 22 d Aual Meetig i Mathematics (AMM 207) Departmet of Mathematics, Faculty of Sciece Chiag Mai Uiversity, Chiag Mai, Thailad Compariso of Miimum Iitial Capital with Ivestmet ad -ivestmet Discrete Time

More information

Disjoint Systems. Abstract

Disjoint Systems. Abstract Disjoit Systems Noga Alo ad Bey Sudaov Departmet of Mathematics Raymod ad Beverly Sacler Faculty of Exact Scieces Tel Aviv Uiversity, Tel Aviv, Israel Abstract A disjoit system of type (,,, ) is a collectio

More information

Regression with an Evaporating Logarithmic Trend

Regression with an Evaporating Logarithmic Trend Regressio with a Evaporatig Logarithmic Tred Peter C. B. Phillips Cowles Foudatio, Yale Uiversity, Uiversity of Aucklad & Uiversity of York ad Yixiao Su Departmet of Ecoomics Yale Uiversity October 5,

More information

Lecture 3: August 31

Lecture 3: August 31 36-705: Itermediate Statistics Fall 018 Lecturer: Siva Balakrisha Lecture 3: August 31 This lecture will be mostly a summary of other useful expoetial tail bouds We will ot prove ay of these i lecture,

More information

On the Asymptotic Normality of an Estimate of a Regression Functional

On the Asymptotic Normality of an Estimate of a Regression Functional Joural of Machie Learig Research 6 205) 863-877 Submitted 6/5; Published 9/5 O the Asymptotic Normality of a stimate of a Regressio Fuctioal László Györfi Departmet of Computer Sciece Iformatio Theory

More information

EFFECTIVE WLLN, SLLN, AND CLT IN STATISTICAL MODELS

EFFECTIVE WLLN, SLLN, AND CLT IN STATISTICAL MODELS EFFECTIVE WLLN, SLLN, AND CLT IN STATISTICAL MODELS Ryszard Zieliński Ist Math Polish Acad Sc POBox 21, 00-956 Warszawa 10, Polad e-mail: rziel@impagovpl ABSTRACT Weak laws of large umbers (W LLN), strog

More information

4. Partial Sums and the Central Limit Theorem

4. Partial Sums and the Central Limit Theorem 1 of 10 7/16/2009 6:05 AM Virtual Laboratories > 6. Radom Samples > 1 2 3 4 5 6 7 4. Partial Sums ad the Cetral Limit Theorem The cetral limit theorem ad the law of large umbers are the two fudametal theorems

More information

The variance of a sum of independent variables is the sum of their variances, since covariances are zero. Therefore. V (xi )= n n 2 σ2 = σ2.

The variance of a sum of independent variables is the sum of their variances, since covariances are zero. Therefore. V (xi )= n n 2 σ2 = σ2. SAMPLE STATISTICS A radom sample x 1,x,,x from a distributio f(x) is a set of idepedetly ad idetically variables with x i f(x) for all i Their joit pdf is f(x 1,x,,x )=f(x 1 )f(x ) f(x )= f(x i ) The sample

More information

Confidence interval for the two-parameter exponentiated Gumbel distribution based on record values

Confidence interval for the two-parameter exponentiated Gumbel distribution based on record values Iteratioal Joural of Applied Operatioal Research Vol. 4 No. 1 pp. 61-68 Witer 2014 Joural homepage: www.ijorlu.ir Cofidece iterval for the two-parameter expoetiated Gumbel distributio based o record values

More information

Chapter 6 Sampling Distributions

Chapter 6 Sampling Distributions Chapter 6 Samplig Distributios 1 I most experimets, we have more tha oe measuremet for ay give variable, each measuremet beig associated with oe radomly selected a member of a populatio. Hece we eed to

More information

Distribution of Random Samples & Limit theorems

Distribution of Random Samples & Limit theorems STAT/MATH 395 A - PROBABILITY II UW Witer Quarter 2017 Néhémy Lim Distributio of Radom Samples & Limit theorems 1 Distributio of i.i.d. Samples Motivatig example. Assume that the goal of a study is to

More information

REGRESSION WITH QUADRATIC LOSS

REGRESSION WITH QUADRATIC LOSS REGRESSION WITH QUADRATIC LOSS MAXIM RAGINSKY Regressio with quadratic loss is aother basic problem studied i statistical learig theory. We have a radom couple Z = X, Y ), where, as before, X is a R d

More information

1.010 Uncertainty in Engineering Fall 2008

1.010 Uncertainty in Engineering Fall 2008 MIT OpeCourseWare http://ocw.mit.edu.00 Ucertaity i Egieerig Fall 2008 For iformatio about citig these materials or our Terms of Use, visit: http://ocw.mit.edu.terms. .00 - Brief Notes # 9 Poit ad Iterval

More information

Econ 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara

Econ 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara Poit Estimator Eco 325 Notes o Poit Estimator ad Cofidece Iterval 1 By Hiro Kasahara Parameter, Estimator, ad Estimate The ormal probability desity fuctio is fully characterized by two costats: populatio

More information

Riesz-Fischer Sequences and Lower Frame Bounds

Riesz-Fischer Sequences and Lower Frame Bounds Zeitschrift für Aalysis ud ihre Aweduge Joural for Aalysis ad its Applicatios Volume 1 (00), No., 305 314 Riesz-Fischer Sequeces ad Lower Frame Bouds P. Casazza, O. Christese, S. Li ad A. Lider Abstract.

More information

Chapter 6 Infinite Series

Chapter 6 Infinite Series Chapter 6 Ifiite Series I the previous chapter we cosidered itegrals which were improper i the sese that the iterval of itegratio was ubouded. I this chapter we are goig to discuss a topic which is somewhat

More information

Rates of Convergence for Quicksort

Rates of Convergence for Quicksort Rates of Covergece for Quicksort Ralph Neiiger School of Computer Sciece McGill Uiversity 480 Uiversity Street Motreal, HA 2K6 Caada Ludger Rüschedorf Istitut für Mathematische Stochastik Uiversität Freiburg

More information

Lecture 15: Learning Theory: Concentration Inequalities

Lecture 15: Learning Theory: Concentration Inequalities STAT 425: Itroductio to Noparametric Statistics Witer 208 Lecture 5: Learig Theory: Cocetratio Iequalities Istructor: Ye-Chi Che 5. Itroductio Recall that i the lecture o classificatio, we have see that

More information

Lecture 7: Properties of Random Samples

Lecture 7: Properties of Random Samples Lecture 7: Properties of Radom Samples 1 Cotiued From Last Class Theorem 1.1. Let X 1, X,...X be a radom sample from a populatio with mea µ ad variace σ

More information

Regression with quadratic loss

Regression with quadratic loss Regressio with quadratic loss Maxim Ragisky October 13, 2015 Regressio with quadratic loss is aother basic problem studied i statistical learig theory. We have a radom couple Z = X,Y, where, as before,

More information

Stat 421-SP2012 Interval Estimation Section

Stat 421-SP2012 Interval Estimation Section Stat 41-SP01 Iterval Estimatio Sectio 11.1-11. We ow uderstad (Chapter 10) how to fid poit estimators of a ukow parameter. o However, a poit estimate does ot provide ay iformatio about the ucertaity (possible

More information

Precise Rates in Complete Moment Convergence for Negatively Associated Sequences

Precise Rates in Complete Moment Convergence for Negatively Associated Sequences Commuicatios of the Korea Statistical Society 29, Vol. 16, No. 5, 841 849 Precise Rates i Complete Momet Covergece for Negatively Associated Sequeces Dae-Hee Ryu 1,a a Departmet of Computer Sciece, ChugWoo

More information

Access to the published version may require journal subscription. Published with permission from: Elsevier.

Access to the published version may require journal subscription. Published with permission from: Elsevier. This is a author produced versio of a paper published i Statistics ad Probability Letters. This paper has bee peer-reviewed, it does ot iclude the joural pagiatio. Citatio for the published paper: Forkma,

More information

2.2. Central limit theorem.

2.2. Central limit theorem. 36.. Cetral limit theorem. The most ideal case of the CLT is that the radom variables are iid with fiite variace. Although it is a special case of the more geeral Lideberg-Feller CLT, it is most stadard

More information

STA Object Data Analysis - A List of Projects. January 18, 2018

STA Object Data Analysis - A List of Projects. January 18, 2018 STA 6557 Jauary 8, 208 Object Data Aalysis - A List of Projects. Schoeberg Mea glaucomatous shape chages of the Optic Nerve Head regio i aimal models 2. Aalysis of VW- Kedall ati-mea shapes with a applicatio

More information

Comparison Study of Series Approximation. and Convergence between Chebyshev. and Legendre Series

Comparison Study of Series Approximation. and Convergence between Chebyshev. and Legendre Series Applied Mathematical Scieces, Vol. 7, 03, o. 6, 3-337 HIKARI Ltd, www.m-hikari.com http://d.doi.org/0.988/ams.03.3430 Compariso Study of Series Approimatio ad Covergece betwee Chebyshev ad Legedre Series

More information

This is an introductory course in Analysis of Variance and Design of Experiments.

This is an introductory course in Analysis of Variance and Design of Experiments. 1 Notes for M 384E, Wedesday, Jauary 21, 2009 (Please ote: I will ot pass out hard-copy class otes i future classes. If there are writte class otes, they will be posted o the web by the ight before class

More information

Lecture 2: Monte Carlo Simulation

Lecture 2: Monte Carlo Simulation STAT/Q SCI 43: Itroductio to Resamplig ethods Sprig 27 Istructor: Ye-Chi Che Lecture 2: ote Carlo Simulatio 2 ote Carlo Itegratio Assume we wat to evaluate the followig itegratio: e x3 dx What ca we do?

More information

A Proof of Birkhoff s Ergodic Theorem

A Proof of Birkhoff s Ergodic Theorem A Proof of Birkhoff s Ergodic Theorem Joseph Hora September 2, 205 Itroductio I Fall 203, I was learig the basics of ergodic theory, ad I came across this theorem. Oe of my supervisors, Athoy Quas, showed

More information

Advanced Stochastic Processes.

Advanced Stochastic Processes. Advaced Stochastic Processes. David Gamarik LECTURE 2 Radom variables ad measurable fuctios. Strog Law of Large Numbers (SLLN). Scary stuff cotiued... Outlie of Lecture Radom variables ad measurable fuctios.

More information