Constrained linear discriminant rule for 2-groups via the Studentized classification statistic W for large dimension

Size: px
Start display at page:

Download "Constrained linear discriminant rule for 2-groups via the Studentized classification statistic W for large dimension"

Transcription

1 Costraied liear discrimiat rule for -groups via the Studetized classificatio statistic W for large dimesio Takayuki Yamada Istitute for Comprehesive Educatio Ceter of Geeral Educatio Kagoshima Uiversity --30 Korimoto Kagoshima Japa Abstract This paper is cocered with -group liear discrimiat aalysis for multivariate ormal populatios with ukow mea vectors ad ukow commo covariace matrix for the case i which the sample sizes N N ad the dimesio p are large. We give Studetized versio of the W statistic uder the high-dimesioal asymptotic framework A that N N ad p ted to ifiity together uder the coditio that p/n + N coverges to a costat i 0 ad N /N coverges to a costat i 0. Asymptotic expasio of the distributio for the coditioal probability of misclassificatiocpmc of the Studetized W is derived uder A. By usig this asymptotic expasio we give the cut-off poit such that the oe of two CPMCs is less tha the presettig value. Such the costraied discrimiat rule is studied by Aderso 973 ad McLachla 977. Mote Carlo simulatio revealed that the proposed method is more accurate tha McLachla 977 s method for the case i which p is relatively large. AMS 000 subject classificatio: primary 6H30; secodary 6H Key Words ad Phrases: Discrimiat aalysis W classificatio statistic locatio ad scale mixtured expressio CPMC High-dimesioal asymptotic results Upper cofidece bouds o CPMC. Itroductio Let x ij j =... N i i = be the jth sample observatio j =... N i from the ith populatio Π i i = with mea µ i ad commo covariace matrix Σ. We cosider the problem to allocate a observatio vector x which is accordig to either Π or Π. A commoly used rule is that W = x x S {x } x + x < c > c allocate x as Π Π which is called the liear discrimiat rule where c is a cut-off poit x x ad S are the sample mea vectors ad the pooled sample covariace matrix defied by x i = N i N i j= x ij i = S = = N = N + N. N i x ij x i x ij x i i= j= There are two types of probability of misclassificatio. Oe is the probability of allocatig x ito Π eve though it is actually belogig to Π. The other is the probability that x is classified as Π although it is actually belogig to Π. These two types of expected probabilities of misclassificatios EPMCs for W-rule are expressed as e c = P W < c x Π ad e c = P W > c x Π. address:yamada@gm.kagoshima-u.ac.jp

2 I geeral it is hard to evaluate these expected probabilities of misclassificatio EPMC explicitly but some asymptotic results icludig asymptotic expasios have bee obtaied. Aderso [] derived a asymptotic expasio for Studetized W ad applied it to idetify c such that e c = ε + O where ε 0 is a presettig level which is give by experimeter. This discrimiat rule is used to cotrol oe of EPMCs for the case i which oe type of errors is geerally regarded as more serious tha the others such as medical applicatios associated with the diagosis of diseases. Aderso [] s asymptotic expasio is obtaied uder the asymptotic framework A0: A0 : N N N /N γ 0 p is fixed. For achievig the same aim with Aderso [] McLachla [0] proposed the cut-off poit c such that where P c c < Ξ L = ε + O c c = P W < c x Π ; x x S ε 0 is a presettig level which is set by experimeter ad Ξ L is a upper boud which is set by experimeter. These ad some other asymptotic results were reviewed by Siotai [3] by McLachla [] ad by Aderso []. Geerally the precisio of asymptotic approximatios uder A0 gets worth as the dimesio p becomes large. As a alterative approach to overcome this shortcomig it has bee cosidered to derive asymptotic distributios of discrimiat fuctios i a high-dimesioal situatio where ad p ted to ifiity together. Yamada et al. [4] derived a asymptotic expasio of Studetized W uder the high-dimesioal asymptotic framework A ad the assumptio C such that A : N N N /N γ 0 p p/ γ 0 [0 ; C : 0 0 where is the squared Mahalaobis distace defied as = µ µ Σ µ µ. Usig the asymptotic expasio they proposed a cut-off poit c such that e c = ε + O AC where the symbol O AC stads for the term such that O AC coverges to a costat as uder A ad C. The usefuless such the high-dimesioal asymptotic framework is metioed i Fujikoshi et al. [6]. The aim of this paper is to obtai a cut-off poit c h which satisfies that P [ c c h < Ξ H ] = ε + OAC for the presettig values ε ad Ξ H. I order to derive this we show a asymptotic expasio of the distributio for the statistic c c h uder A ad C. Sice the distributio for c c is the same as the oe for c c with iterchagig N ad N we oly derive asymptotic expasio of the distributio for c c. This paper is orgaized as the followig: Sectio presets Studetizatio for W uder A. I Sectio 3 we derive the asymptotic distributio of c c via the Studetized statistic W uder A. Asymptotic expasio for the Studetized c c is derived by makig use of a powerful method kow as the method by the differetial operator which was used by James [8] Okamoto [] etc. I Sectio 4 we propose costraied liear discrimiat rule for CPMC for large dimesioal case. Simulatio results are writte i Sectio 5. We revealed that the proposed methods performs well for the case i which Ξ H is ot so small. I Sectio 6 cocludig remarks are writte. Some proofs ad techical results are give i Appedix.

3 Studetizatio for W uder A For x Π i it follows from Lachebruch [9] that W = x x S {x } x + x = V / Z i U i i = where V = x x S ΣS x x Z i = V / x x S x µ i U i = x x S x µ i D ad D is the squared sample Mahalaobis distace defied by D = x x S x x. The we fid that V is a positive radom variable ad U i V are joitly idepedet of Z i. Further Z i is distributed as N0. This ormality follows by cosiderig the coditioal distributio of Z i whe x x ad S are give. I this case W is called a locatio ad scale mixture of the stadard ormal distributio. It ca be expressed that E[U i ] = i E[V ] = m p m N N + Np N N + m m + where m = p. The aalytic expressios for VarU i ad for VarV which are provided by Fujikoshi [3] show that VarU i 0 ad VarV 0 uder A ad C. It implies that the limitig distributio of W uder A ad C is ormal with mea u 0i = lim A E[U i ] ad variace v 0 = lim A {E[V ]}. The atural estimate for E[U i ] E[V ] is obtaied by replacig with the followig ubiased estimator : = m D Np N N ad we write it as Ê[U i] Ê[V ]. The ubiasedess for Ê[U i] Ê[V ] also holds. We ca show that Ê[U i] Ê[V ] has cosistecy uder A ad C. From Slutsky s theorem W + Ê[U i] Ê[V ] D N0 for x Π i D where the symbol stads for the covergece i distributio. From the techical reaso istead of usig Ê[U i] Ê[V ] we use U 0i V 0 for the Studetizatio of W i this paper which is defied as follows. U 0i = Ê[U i] + i m + m N i = i m p + i m N N m + m V 0 = m m + Ê[V ] = + m m + m + + Np N N It is oted that U 0i V 0 is ot the ubiased estimator of E[U i ] E[V ] but has has the cosistecy uder A ad C. We ca also show that W i. = W + U 0i D N0 for x Πi. V0 N i 3

4 3 Asymptotic distributio for the Studetized CPMC Let c j be the cut-off poit for Wj ad let C i j deote the coditioal probability of misclassificatio of Wj misallocatig a observatio from Π j where i j. The C i j is give by C i j = c i j V0 c j U 0j = P i W < i V0 c j U 0j x Π j x x S = Φ i V0 c j U 0j + U j V V0 Φ V c V {U 0 U } i = j = = V0 Φ V c V {U 0 U } i = j =. From Lemma ad 3 i Appedix A the distributio for C is idetical to C if c = c. From that reaso we oly deal with C. Asymptotic expasio of the distributio for C ca be obtaied by the oe for C by replacig N N c with N N c. Hereafter we set U 0 = U 0 U = U ad c = c uless makig cofusio. 3. Stochastic expressio for CPMC Now we cosider to express C as the fuctio of simple variables. Let u = + / Σ / x x N N u = Σ / N x + N x N µ N µ N B = Σ / SΣ /. The u u ad B are idepedet. I additio u N p /N + /N / δ I p ad u N p 0 I p where δ = Σ / µ µ. It also holds that B is distributed as a Wishart distributio with degrees of freedom ad covariace matrix I p which is deoted as W p I p. Substitutig them we have U = u B u u + B u N δ B u N N N N NN It is also described that V = N u B u. N N Usig this expressio we ca write = Nm u B u Np. N N N N U 0 = N u B u + p N N m + N + N u V 0 = B u. m + m + N N The followig lemma which is give by Yamada et al. [4] eables to see the fuctios of the idepedet stadard ormal ad chi-squared variables for U V U 0 ad V 0. 4

5 Lemma. Let v N p a I p v N p 0 I p W W p I p ad v v ad W are idepedet. The a a Z + X a a W X a Z X v 3 v W v v W v = D X + X {Z + a X a + Z + X 4}Z 3 3 v W v {Z + a X a + Z + X 4 } + X {Z + a X a + Z + X 4 } 3 X where X i χ f i i = 3 4; Z i N0 i = 3; all the seve variables X X X 3. X 4 Z Z Z 3 are idepedet; f = p + f = p = p + f 4 = p. Similar results to Lemma was treated i Fujikoshi ad Seo [5] Fujikoshi [4] ad Hyodo ad Kubokawa [7]. Put where b = b w w w 3 z z = N N z + f + w N b = b w w w 3 w 4 z z z 3 = + f t sz 3 f + w f tz q = q w w 4 z z = s f + w q = q w w w 3 w 4 z z = + f t s f + w s = sw 4 z z = z + t = tw w 3 = + w + w 3 ; N N N + z + f 4 + w 4 f f f 4 are defied i Lemma. The we have B δ B u B Q def = u B u u B u Q u B u b /f W /f W / W 3 Z / Z / D = b /f W /f W / W 3 /f 4 W 4 Z / Z / Z 3 / q /f W /f 4 W 4 Z / Z / q /f W /f W / W 3 /f 4 W 4 Z / Z / where W i = f i /X i /f i i = This implies that B U U 0 α 4 α 3 α 0 α α V 0 = D 0 0 β 0 B Q V β 0 0 Q 0 5

6 where Lettig α = { N } = α = p N N N N N m + α N 3 = α 4 = N N NN + N β = β = N. m + m + N N N N β q c + α q α + α 3 b α 4 b Ψw w w 3 w 4 z z z 3 = Φ β q we ca express that V0 c U 0 + U C = Φ V D = Ψ / W /f W / W 3 /f 4 W 4 Z / Z / Z 3 /. 3 Hereafter we set v as the variable vector ad y as the radom variable vector which are defied by v = v v v 3 v 4 v 5 v 6 v 7 = w w w 3 w 4 z z z 3 y = Y Y Y 3 Y 4 Y 5 Y 6 Y 7 = W W W 3 W 4 Z Z Z 3 ad uless makig cofusio. Ψv = Ψw w w 3 w 4 z z z Studetizatio for CPMC uder A It ca be expressed that b = N N f N = b 0 b = 0 q = N N f N + f 4 q = f + f = q 0 N N N + f 4 = q 0. So we have Ψ0 = Φ β q 0 c + α q 0 α α 4 b 0 β q 0 = Φc. It is oted that Ψv is the smooth fuctio o R 3. We will expad Ψ / W /f W / W 3 /f 4 W 4 Z / Z / Z 3 / at /f W /f W / W 3 /f 4 W 4 Z / Z / Z 3 / = Let ψ = ψ c be the vector valued fuctio i R 7 defied by ψ = D v Ψv 0 5 6

7 where D = diag /f /f / /f 4 Ψv is defied as 4 ad the otatio 0 stads for the value at the poit that v = 0. We ca express that ψ = ϕcp where p = p c δ = c f f f f + f + f 4 f q0 β f α 0 0 f f 4 α f 4 f α β f f f α 4 α 3 q0. 6 I additio let Ψ = Ψ c be 7 7 matrix valued fuctio defied by We ca show that Ψ = D v v ΨvD 0. DΨ D = ϕcp where P = P c is the 7 7 matrix valued fuctio defied as P = P 0 + cp + c P + c 3 P 3 with P i = P i i = 0 3 beig the 7 7 symmetric matrix valued fuctio. The aalytic form for P i i = 0 3 are give i Appedix C. It ca be expressed that Ψ / W /f W / W 3 /f 4 W 4 Z / Z / Z 3 / = Φc + ψ y + y Ψ y + 3/ R 7 where R is a remaider term cosistig of a homogeeous polyomial of order 3 i the elemets of y of which the coefficiets are O as uder A ad C plus / times a homogeeous polyomial of order 4 plus a remaider term that is O uder A ad C for fixed y. By virtue of 7 with combied the use of the formula 3 we have C Φc ψ ψ / D = ψ ψ y + y Ψ y + ψ R. 8 It follows from the defiitio of y that y D N 7 0 I 7 uder A ad C which implies the followig theorem. Theorem. Uder the high-dimesioal asymptotic framework A ad the assumptio C where C Φc ϕc ρc / ρc = ψ ψ /{ϕc} = c + f f f + { + + β q 0 f f 4 f c f f 4 D N0 α f β α + q0 + f } α + α β f 3q

8 Sice ρc is ukow parameter it is eeded to estimate for Studetizatio. The atural estimate ρc ca ot be used. The reaso is that {q 0 } / which is icluded i ρc ca ot by defied for the case i which 0 < D < {/m }N/N N sice q 0 takes egative value. Istead of usig the ubiased estimator we use which is ot the ubiased estimate of. It ca be expressed that ρc = f + f A = m + Np D 0 N N + β q 0 f c f 4 α β f + q 0 f f f + f 3 f + f ατ + α 3 β where τ = f + f f + f f. f f 4 β It is sufficiet to show the positiveess for τ A to esure that ρc A > 0. We ca express that τ A = = f = f + f f + f + + f f f + f D + + β f f 4 f + f f f + [ q 0 A + + f { f + + f f } f f 4 β + f ] D f 4 β f f 4 f f + f + f f + where the last equality follows from the fact that q 0 A = + f D. 3 f β The o-egativeess for τ A follows from that f f 4 + f { f + + f f } = p p p + + p p + p + 7 > 0. Note that A p uder A ad C. From this rate cosistecy we obtai that By Theorem ad Slutsky s theorem ρc A ρc p. C Φc D N0. 4 ϕc ρc A / 8

9 3.3 Asymptotic expasio for the distributio of the proposed Studetized statistic for CPMC uder A I this sectio we derive a asymptotic expasio for the distributio for the Studetized C to improve the covergece rate i 4. Firstly we give a geeral result for the cumulative distributio fuctio of the radom variable T which has the form: T = h h h y + h h y Hy + R 5 for h R 7 ad the symmetric matrix H where R is the term cosistig of a homogeeous polyomial of order 3 i the elemets of y of which the coefficiets are O uder A ad C plus / times a homogeeous polyomial of order 4 plus a remaider term that is O uder A ad C for fixed y. Theorem. The cumulative distributio fuctio of T which is described as 5 ca be expressed as P T x = Φx s H 0 x + s H xϕx + O where Φ deotes the cumulative distributio fuctio of the stadard ormal distributio ϕ is the derivative of Φ ad H k x deotes the Hermite polyomial of degree k especially H 0 x = H x = x. Here for h h 7 = h. { s = tr H h h s = h h 3/ 3 4 k= } h 3 k + h Hh f k The proof of Theorem is give i Appedix B. Now we cosider to express the proposed Studetized statistic as the form 5. By virtue o combied with the use of the formula ad the fact that Q = {N/N N }D we have q 0 = D + f q /f W /f 4 W 4 Z / Z /. 6 f Put Ωw w 4 z z = where q = q w w 4 z z α 5 = + f f f + α 6 = f 4 α α 5 f f β α 5 c α 6 + α 7 + f + f q α 8 f + f q α 7 = α + α3 β [ α 8 = + f ] 3 f { f + + f f } f α f f 4 3f. 3f + α 5 β Without makig cofusio we express Ωv = Ωw w 4 z z. / 9

10 for v = w w 4 z z. From the expressios ad 6 we have Ω /f W /f 4 W 4 Z / Z / D = ρc A. By takig ito cosideratio that q 0 = /f + f / q it is easy to see that Ω0 = ρc. Sice Ωv is the smooth fuctio o R Taylor series expasio at /f W /f 4 W 4 Z / Z / = gives Ω /f W /f 4 W 4 Z / Z / = ρc + ω y + R where y = W W 4 Z Z ω = ω c is the vector valued fuctio i R 4 defied by ω = D Ωv 0 7 v with beig that D = diag /f /f 4 ad R is the residue term of which the property is similar to R. We ca express that where ω = {ρc } p 3/ p = p c α5 α 6 = c α 8 + α 5 α 6 q0 q f 0 Let ω be the extesio for ω defied as f f 4 q 0 N N f N q for ω = ω ω ω 3 ω 4. The we have ω = ω 0 0 ω ω 3 ω 4 0 ρc D ρc = + ρc ω A y + R 3 9 where R 3 is the residue term of which the property is similar to R. Combiig 8 ad 9 we have C Φc D = ϕc ρc p A / p y + } y {P + p p p ρc p p + p p y + R 4 where p is the extesio for p of which the defiitio is the same as ω ad R 4 is the residue term of which the property is similar to R. Summarizig the above results asymptotic expasio for the coditioal probability of misclassificatio is obtaied which is give as the followig theorem. 0

11 Theorem 3. P c V0 c U 0 Φc ϕc ρc x = Φx s H 0 x + s H x ϕx + O AC A / where s = s c = s = s c = { ρc tr P + { {ρc } 3/ 3 } ρc p p + p p 4 p 3 k + p f P p + p p k k= }. Aalytic forms for s ad s are complicated so we omit to describe them. We otice that s ad s cotai oly the term i which the power of is the eve umber. 4 Costraied liear discrimiat rule for CPMC I this sectio we give a costraied liear discrimiat rule for -groups of which oe of the two coditioal misclassificatio probabilities does ot exceed the presettig value Ξ H with the cofidece level α. Suppose that c H = ξ H ρξ H A z ε 0 where ξ H = Φ Ξ H for Ξ H 0. By virtue of 4 combied with Slutsky s theorem we have lim P c V0 c H U 0 < Ξ H = ε A uder the assumptio C. As a extesio of 0 we obtai the followig result. Theorem 4. Let c H = ξ H h h where h = ρξ H A z ε ξ H h = ρξ H A + f 4 + f α z β q 0 A ε ρξ H A b ξ H A H 0z ε + b ξ H A H z ε with beig that ξ H = Φ Ξ H for Ξ H 0. The P c V0 c H U 0 < Ξ H = ε + O AC. The proof of Theorem 4 is similar to the oe of Theorem i McLachra [0] ad so we omit to describe it.

12 5 Simulatio result Simulatio experimets were performed to cofirm the asymptotic result of Theorem 4. We also compared the accuracies with the asymptotic result of Theorem i McLachla [0] for the case i which N = N = 50 p = = 3 ε = 0.05 Ξ = Φ/ where the settigs of ad Ξ are followed to McLachla [0]. Whe we treat the distributios of W -rule without loss of geerality from ivariat property of the distributio for the orthogoal trasformatio of observatio vector we may assume that two give ormal populatios with the same covariace matrix are Π : N p /e I p Π : N p /e I p where e = To compute misclassificatio probability geerate 0 4 traiig samples. For each traiig samples we geerate 0 4 test samples i which observatio vectors are i.i.d. as N p /e I p. The value of the coditioal misclassificatio probability was calculated by sim k = umber of misclassificatio 0 4 k = i each traiig samples. We took the average of Isim < Ξ... Isim 0 4 < Ξ where I. deotes the idicator fuctio ad wrote it as the value for the actual level i row Y i Tables. The same value for McLachla [0] s approximatio was writte i row Mc. Table : Actual levels of cofideces that the coditioal error probabilities are less tha Ξ whe the omial level is ε = p = 0 p = 30 p = 50 p = 70 = Y Mc = Y Mc = 3 Y Mc From Tables we ca see that our proposed asymptotic approximatio has good accuracy whe =. The actual level of cofidece becomes small as the dimesio gets large for the case i which = 3. We ca check that McLachla [0] s result does ot work well for our settigs. Extra simulatio results which does ot writte i this paper reveals that the actual cofidece level gets small from the omial level as the dimesio becomes close to sample size for the case i which Ξ is small. 6 Cocludig remarks I this paper we derived Studetized statistic for the coditioal probability of misclassificatio for the Studetized W ad derived its asymptotic expasio for distributio up to the term of O / uder the high-dimesioal asymptotic framework A. It may be oted that the order of its error is O. Based o the derived asymptotic expasio we gave the cut-off poit for the liear discrimiat rule that the oe of two coditioal error probabilities is less tha the presettig value. Simulatio results revealed that our proposed rule is superior tha McLachla [0] s result. Ufortuately our proposed rule did ot work well for the case i which Ξ is small. The modificatio should be cosidered ad is beig a future problem. A Equality i distributios for proposed statistics I this sectio firstly we metio the equality i distributios for U 0 U ad U 0 U which is give as the followig lemma

13 Lemma. The distributio for U 0 U is the same as the oe for U 0 U with exchagig N for N. Proof. Set S N N = U 0 U ad set S N N = U 0 U. I additio put x i D = µi + Ni z i i = S D = W δ = Σ/ µ µ where z z W are idepedet; z ad z are distributed as N p 0 I p ; W is distributed as W p I p. The we have { S N N = D z z δ W z } z δ Np m N N N N N N p m N N m m + N z z δ W z + z δ N N N N { S N N = D z z δ W z } z δ Np m N N N N N N + p m N N m m + N + z z δ W z + z + δ. N N N N By iterchagig N ad N { S N N = D m + p m N N + z z δ W z z δ N N N N z z δ N N { = m p m N N m m + N W z + z + δ N N z z δ W z z δ N N N N N z N z δ D = S N N where z = z ad z = z. m m + N W z + z δ N N Np N N Np N N Next we show the equality i distributio for V 0 ad for V which is give as the followig lemma. Lemma 3. Each of the distributios for V 0 ad for V is the same as the oe with exchagig N for N. We omit to write the proof of Lemma 3 sice it is similar to Lemma. } } 3

14 B Proof of Theorem I this sectio we gave the proof of Theorem. Firstly we gave the followig two lemmas which is to be used. Lemma 4. Suppose that a R ad g. is a polyomial fuctio. Let Z ad Y are radom variables; Z is distributed as the stadard ormal distributio; Y is distributed as the chi-square distributio with f degrees of freedom. The E [ gze itaz] a = exp it E[gZ + ita] E [ gw e itaw ] f/ [ ] f W + ita = ita exp ita E g f ita /a where i = ad W = f/y/f. It is easy to prove Lemma 4 so we omit to write the proof. Proof of Theorem. From the assumptio for T give i 5 the characteristic fuctio ca be expaded as [ E[expitT ] = E T 0 + it ] h h T + O where T 0 = exp it h h h y T = y Hy exp it h h h y. From Lemma 4 we have 4 h k E[T 0 ] = it k= h h f k [ { = it3 k= k= fk / exp f k h 3 k h h 3/ uder A. It ca be expressed that 7 [ ] E[T ] = h kk E Yk h k exp it h h Y k E exp it + 7 k= l= l k h k it h h }] fk 7 k=5 e it / + O 7 h k h h Y k l= l k 7 [ ] h k h kl E Y k Y l exp it h h Y h l k exp it h h Y l E exp it exp it h k h h 7 α= α kα l h α h h Y α 4

15 for h kl = H. From Lemma agai we have [ 4 ] + it h k E[T ] = E[T 0 ] h /h h 7 kk k= { ith k / h h /f k } + h kk { + it h k/h h} k=5 4 4 ith k / h h + E[T 0 ] h kl k= l= ith k / h h ith l / h h /f k ith l / h h /f k l k = E[T 0 ] k= l=5 h kl ith k / h h ith k / h h h it l /f k h h + { tr H + it h Hh h h + O/ k=5 l=5 l k h kl it h kh l h h uder A. The desired result ow follows by formally ivertig the expasio for the characteristic fuctio. } C Aalytic forms for P 0 P P ad P 3 I this sectio we give the aalytic form for P i i = 0 3. The derivatio is straightforward ad so is omitted. 5

16 P 0 = P = q 0β f 0 f f+f3 f α f f+f3 f3 α α q0 f f+f3 f f+f3 f f α f3 α α q0 3 f f+f3 f f+f3 3/ α β q0 f3 f α β α β α β q0 f f+f3 f f+f3 0 0 f3 α 4 0 f f3 f α 4 f3 q0 f α4 0 α 0 q0 f3 α 0 0 { } α 3/ α f3 f f3 f αα4 β β + q0 4 3 f f f+f3 f f+f3 f { 3 f f+f3 { 3 4 f f+f3 f f+f3 + f f3 f+f3 f f+f3 f } } α β q0 ff3 0 0 f3 0 0 q f3 f f3 αα4 β αα4 β 3/ α 4 β q0 q0 q0 f f3 αα3 β αα3 β 3/ αα3 β α3α4 β α 3 β q0 q0 q0 q0 α β q0 α4 β q0 α β 3/ α β β 3/ q0 q0 0 q0 6

17 P = P 3 = β q0 f α f f+f3 f α f f+f3 0 0 f3 α 0 f f+f3 f f+f3 3/ α α β f α f3 α f f+f3 f f+f / f f f+f3 f f+f3 f f+f3 f f+f3 f3 α 4 f3 α 4 f f3 α4 f f+f3 f f+f3 α 3 f q 0 α 3 q f f f+f3 f f f+f3 f f+f3 f3 α 3 q f3 ff f f f3 α β α β 7

18 Refereces [] Aderso T.W A asymptotic expasio of the distributio of the Studetized classificatio statistic W. A. Statist [] Aderso T.W A Itroductio of Multivariate Statistics 3rd ed. Wiley Hoboke. [3] Fujikoshi Y Error bouds for asymptotic approximatios of the liear discrimiat fuctio whe the sample size ad dimesioality are large. J. Multivariate Aal [4] Fujikoshi Y. 00. Selectio of variables for discrimiat aalysis i a high-dimesioal case. Sakhyā [5] Fujikoshi Y. ad Seo T Asymptotic approximatios for EPMC s of the liear ad the quadratic discrimiat fuctios whe the sample sizes ad the dimesio are large. Radom Oper. Stochastic Equatios [6] Fujikoshi Y. Ulyaov V.V. ad Shimizu R. 00. Multivariate Statistics High-Dimesioal ad Large-Sample Approximatios Wiley Hoboke NJ. [7] Hyodo M. ad Kubokawa T. 04. A variable selectio criterio for liear discrimiat rule ad its optimality i high dimesioal ad large sample data. J. Multivariate Aal [8] James G.S Tests of liear hypotheses i uivariate ad multivariate aalysis whe the ratios of the populatio variaces are ukow. Biometrika [9] Lachebruch P.A O expected probability of misclassificatio i discrimiat aalysis secessary sample size ad a relatio with the multiple correlatio coefficiets. Biometrics Wiley Hoboke NJ. [0] McLachla G. J Costraied sample discrimiatio with the Studetized classificatio statistic W Comm. Statist. -Theory ad Methods [] McLachla G. J. 99. Discrimiat aalysis ad statistical patter recogitio Wiley NY. [] Okamoto M As asymptotic expasio for the distributio of the liear discrimiat fuctio. A. Math. Statist [3] Siotai M. 98. Large sample approximatios ad asymptotic expasios of classificatio statistic. Hadbook of Statistics P. R. Krishaiah ad L. N. Kaal ed. North-Hollad [4] Yamada T. Himeo T. ad Sakurai T. 07. Asymptotic cut-off poit i liear discrimiat rule to adjust the misclassificatio probability for large dimesios. Hiroshima Math. J

High-dimensional asymptotic results for EPMCs of W - and Z- rules

High-dimensional asymptotic results for EPMCs of W - and Z- rules 1 High-dimesioal asymptotic results for EPMCs of W - ad Z- rules Takayuki Yamada 1 Tetsuro Sakurai ad Yasuori Fujikoshi 3 1 Istitute for Comprehesive Educatio Ceter of Geeral Educatio Kagoshima Uiversity

More information

Asymptotic Results for the Linear Regression Model

Asymptotic Results for the Linear Regression Model Asymptotic Results for the Liear Regressio Model C. Fli November 29, 2000 1. Asymptotic Results uder Classical Assumptios The followig results apply to the liear regressio model y = Xβ + ε, where X is

More information

Chapter 3. Strong convergence. 3.1 Definition of almost sure convergence

Chapter 3. Strong convergence. 3.1 Definition of almost sure convergence Chapter 3 Strog covergece As poited out i the Chapter 2, there are multiple ways to defie the otio of covergece of a sequece of radom variables. That chapter defied covergece i probability, covergece i

More information

Econ 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chi-square Distribution, Student s t distribution 1.

Econ 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chi-square Distribution, Student s t distribution 1. Eco 325/327 Notes o Sample Mea, Sample Proportio, Cetral Limit Theorem, Chi-square Distributio, Studet s t distributio 1 Sample Mea By Hiro Kasahara We cosider a radom sample from a populatio. Defiitio

More information

A statistical method to determine sample size to estimate characteristic value of soil parameters

A statistical method to determine sample size to estimate characteristic value of soil parameters A statistical method to determie sample size to estimate characteristic value of soil parameters Y. Hojo, B. Setiawa 2 ad M. Suzuki 3 Abstract Sample size is a importat factor to be cosidered i determiig

More information

Properties and Hypothesis Testing

Properties and Hypothesis Testing Chapter 3 Properties ad Hypothesis Testig 3.1 Types of data The regressio techiques developed i previous chapters ca be applied to three differet kids of data. 1. Cross-sectioal data. 2. Time series data.

More information

ECONOMETRIC THEORY. MODULE XIII Lecture - 34 Asymptotic Theory and Stochastic Regressors

ECONOMETRIC THEORY. MODULE XIII Lecture - 34 Asymptotic Theory and Stochastic Regressors ECONOMETRIC THEORY MODULE XIII Lecture - 34 Asymptotic Theory ad Stochastic Regressors Dr. Shalabh Departmet of Mathematics ad Statistics Idia Istitute of Techology Kapur Asymptotic theory The asymptotic

More information

ECE 8527: Introduction to Machine Learning and Pattern Recognition Midterm # 1. Vaishali Amin Fall, 2015

ECE 8527: Introduction to Machine Learning and Pattern Recognition Midterm # 1. Vaishali Amin Fall, 2015 ECE 8527: Itroductio to Machie Learig ad Patter Recogitio Midterm # 1 Vaishali Ami Fall, 2015 tue39624@temple.edu Problem No. 1: Cosider a two-class discrete distributio problem: ω 1 :{[0,0], [2,0], [2,2],

More information

Convergence of random variables. (telegram style notes) P.J.C. Spreij

Convergence of random variables. (telegram style notes) P.J.C. Spreij Covergece of radom variables (telegram style otes).j.c. Spreij this versio: September 6, 2005 Itroductio As we kow, radom variables are by defiitio measurable fuctios o some uderlyig measurable space

More information

17. Joint distributions of extreme order statistics Lehmann 5.1; Ferguson 15

17. Joint distributions of extreme order statistics Lehmann 5.1; Ferguson 15 17. Joit distributios of extreme order statistics Lehma 5.1; Ferguso 15 I Example 10., we derived the asymptotic distributio of the maximum from a radom sample from a uiform distributio. We did this usig

More information

Optimally Sparse SVMs

Optimally Sparse SVMs A. Proof of Lemma 3. We here prove a lower boud o the umber of support vectors to achieve geeralizatio bouds of the form which we cosider. Importatly, this result holds ot oly for liear classifiers, but

More information

Lecture 19: Convergence

Lecture 19: Convergence Lecture 19: Covergece Asymptotic approach I statistical aalysis or iferece, a key to the success of fidig a good procedure is beig able to fid some momets ad/or distributios of various statistics. I may

More information

Regression with an Evaporating Logarithmic Trend

Regression with an Evaporating Logarithmic Trend Regressio with a Evaporatig Logarithmic Tred Peter C. B. Phillips Cowles Foudatio, Yale Uiversity, Uiversity of Aucklad & Uiversity of York ad Yixiao Su Departmet of Ecoomics Yale Uiversity October 5,

More information

The Method of Least Squares. To understand least squares fitting of data.

The Method of Least Squares. To understand least squares fitting of data. The Method of Least Squares KEY WORDS Curve fittig, least square GOAL To uderstad least squares fittig of data To uderstad the least squares solutio of icosistet systems of liear equatios 1 Motivatio Curve

More information

Strong consistency of log-likelihood-based information criterion in high-dimensional canonical correlation analysis

Strong consistency of log-likelihood-based information criterion in high-dimensional canonical correlation analysis Strog cosistecy of log-likelihood-based iformatio criterio i high-dimesioal caoical correlatio aalysis Ryoya Oda, Hirokazu Yaagihara ad Yasuori Fujikoshi Departmet of Mathematics, Graduate School of Sciece,

More information

Slide Set 13 Linear Model with Endogenous Regressors and the GMM estimator

Slide Set 13 Linear Model with Endogenous Regressors and the GMM estimator Slide Set 13 Liear Model with Edogeous Regressors ad the GMM estimator Pietro Coretto pcoretto@uisa.it Ecoometrics Master i Ecoomics ad Fiace (MEF) Uiversità degli Studi di Napoli Federico II Versio: Friday

More information

Random Variables, Sampling and Estimation

Random Variables, Sampling and Estimation Chapter 1 Radom Variables, Samplig ad Estimatio 1.1 Itroductio This chapter will cover the most importat basic statistical theory you eed i order to uderstad the ecoometric material that will be comig

More information

Rank tests and regression rank scores tests in measurement error models

Rank tests and regression rank scores tests in measurement error models Rak tests ad regressio rak scores tests i measuremet error models J. Jurečková ad A.K.Md.E. Saleh Charles Uiversity i Prague ad Carleto Uiversity i Ottawa Abstract The rak ad regressio rak score tests

More information

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n.

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n. Jauary 1, 2019 Resamplig Methods Motivatio We have so may estimators with the property θ θ d N 0, σ 2 We ca also write θ a N θ, σ 2 /, where a meas approximately distributed as Oce we have a cosistet estimator

More information

A RANK STATISTIC FOR NON-PARAMETRIC K-SAMPLE AND CHANGE POINT PROBLEMS

A RANK STATISTIC FOR NON-PARAMETRIC K-SAMPLE AND CHANGE POINT PROBLEMS J. Japa Statist. Soc. Vol. 41 No. 1 2011 67 73 A RANK STATISTIC FOR NON-PARAMETRIC K-SAMPLE AND CHANGE POINT PROBLEMS Yoichi Nishiyama* We cosider k-sample ad chage poit problems for idepedet data i a

More information

A goodness-of-fit test based on the empirical characteristic function and a comparison of tests for normality

A goodness-of-fit test based on the empirical characteristic function and a comparison of tests for normality A goodess-of-fit test based o the empirical characteristic fuctio ad a compariso of tests for ormality J. Marti va Zyl Departmet of Mathematical Statistics ad Actuarial Sciece, Uiversity of the Free State,

More information

Statistical Properties of OLS estimators

Statistical Properties of OLS estimators 1 Statistical Properties of OLS estimators Liear Model: Y i = β 0 + β 1 X i + u i OLS estimators: β 0 = Y β 1X β 1 = Best Liear Ubiased Estimator (BLUE) Liear Estimator: β 0 ad β 1 are liear fuctio of

More information

Lecture 3. Properties of Summary Statistics: Sampling Distribution

Lecture 3. Properties of Summary Statistics: Sampling Distribution Lecture 3 Properties of Summary Statistics: Samplig Distributio Mai Theme How ca we use math to justify that our umerical summaries from the sample are good summaries of the populatio? Lecture Summary

More information

Goodness-of-Fit Tests and Categorical Data Analysis (Devore Chapter Fourteen)

Goodness-of-Fit Tests and Categorical Data Analysis (Devore Chapter Fourteen) Goodess-of-Fit Tests ad Categorical Data Aalysis (Devore Chapter Fourtee) MATH-252-01: Probability ad Statistics II Sprig 2019 Cotets 1 Chi-Squared Tests with Kow Probabilities 1 1.1 Chi-Squared Testig................

More information

MA Advanced Econometrics: Properties of Least Squares Estimators

MA Advanced Econometrics: Properties of Least Squares Estimators MA Advaced Ecoometrics: Properties of Least Squares Estimators Karl Whela School of Ecoomics, UCD February 5, 20 Karl Whela UCD Least Squares Estimators February 5, 20 / 5 Part I Least Squares: Some Fiite-Sample

More information

Lecture 7: Properties of Random Samples

Lecture 7: Properties of Random Samples Lecture 7: Properties of Radom Samples 1 Cotiued From Last Class Theorem 1.1. Let X 1, X,...X be a radom sample from a populatio with mea µ ad variace σ

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2016 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

Efficient GMM LECTURE 12 GMM II

Efficient GMM LECTURE 12 GMM II DECEMBER 1 010 LECTURE 1 II Efficiet The estimator depeds o the choice of the weight matrix A. The efficiet estimator is the oe that has the smallest asymptotic variace amog all estimators defied by differet

More information

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample. Statistical Iferece (Chapter 10) Statistical iferece = lear about a populatio based o the iformatio provided by a sample. Populatio: The set of all values of a radom variable X of iterest. Characterized

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

MATHEMATICAL SCIENCES PAPER-II

MATHEMATICAL SCIENCES PAPER-II MATHEMATICAL SCIENCES PAPER-II. Let {x } ad {y } be two sequeces of real umbers. Prove or disprove each of the statemets :. If {x y } coverges, ad if {y } is coverget, the {x } is coverget.. {x + y } coverges

More information

Performance Accuracy of Linear classifiers for Two-level Multivariate Observations in High-dimensional Framework

Performance Accuracy of Linear classifiers for Two-level Multivariate Observations in High-dimensional Framework THE UNIVERSITY OF TEXAS AT SAN ANTONIO, COLLEGE OF BUSINESS Workig Paper SERIES Date February, 6 WP # MGST-53-6 Performace Accuracy of Liear classifiers for Two-level Multivariate Observatios i High-dimesioal

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

Let us give one more example of MLE. Example 3. The uniform distribution U[0, θ] on the interval [0, θ] has p.d.f.

Let us give one more example of MLE. Example 3. The uniform distribution U[0, θ] on the interval [0, θ] has p.d.f. Lecture 5 Let us give oe more example of MLE. Example 3. The uiform distributio U[0, ] o the iterval [0, ] has p.d.f. { 1 f(x =, 0 x, 0, otherwise The likelihood fuctio ϕ( = f(x i = 1 I(X 1,..., X [0,

More information

ECE 901 Lecture 12: Complexity Regularization and the Squared Loss

ECE 901 Lecture 12: Complexity Regularization and the Squared Loss ECE 90 Lecture : Complexity Regularizatio ad the Squared Loss R. Nowak 5/7/009 I the previous lectures we made use of the Cheroff/Hoeffdig bouds for our aalysis of classifier errors. Hoeffdig s iequality

More information

Statistical Inference Based on Extremum Estimators

Statistical Inference Based on Extremum Estimators T. Rotheberg Fall, 2007 Statistical Iferece Based o Extremum Estimators Itroductio Suppose 0, the true value of a p-dimesioal parameter, is kow to lie i some subset S R p : Ofte we choose to estimate 0

More information

Appendix to: Hypothesis Testing for Multiple Mean and Correlation Curves with Functional Data

Appendix to: Hypothesis Testing for Multiple Mean and Correlation Curves with Functional Data Appedix to: Hypothesis Testig for Multiple Mea ad Correlatio Curves with Fuctioal Data Ao Yua 1, Hog-Bi Fag 1, Haiou Li 1, Coli O. Wu, Mig T. Ta 1, 1 Departmet of Biostatistics, Bioiformatics ad Biomathematics,

More information

Algebra of Least Squares

Algebra of Least Squares October 19, 2018 Algebra of Least Squares Geometry of Least Squares Recall that out data is like a table [Y X] where Y collects observatios o the depedet variable Y ad X collects observatios o the k-dimesioal

More information

Access to the published version may require journal subscription. Published with permission from: Elsevier.

Access to the published version may require journal subscription. Published with permission from: Elsevier. This is a author produced versio of a paper published i Statistics ad Probability Letters. This paper has bee peer-reviewed, it does ot iclude the joural pagiatio. Citatio for the published paper: Forkma,

More information

Lecture 33: Bootstrap

Lecture 33: Bootstrap Lecture 33: ootstrap Motivatio To evaluate ad compare differet estimators, we eed cosistet estimators of variaces or asymptotic variaces of estimators. This is also importat for hypothesis testig ad cofidece

More information

Jacob Hays Amit Pillay James DeFelice 4.1, 4.2, 4.3

Jacob Hays Amit Pillay James DeFelice 4.1, 4.2, 4.3 No-Parametric Techiques Jacob Hays Amit Pillay James DeFelice 4.1, 4.2, 4.3 Parametric vs. No-Parametric Parametric Based o Fuctios (e.g Normal Distributio) Uimodal Oly oe peak Ulikely real data cofies

More information

62. Power series Definition 16. (Power series) Given a sequence {c n }, the series. c n x n = c 0 + c 1 x + c 2 x 2 + c 3 x 3 +

62. Power series Definition 16. (Power series) Given a sequence {c n }, the series. c n x n = c 0 + c 1 x + c 2 x 2 + c 3 x 3 + 62. Power series Defiitio 16. (Power series) Give a sequece {c }, the series c x = c 0 + c 1 x + c 2 x 2 + c 3 x 3 + is called a power series i the variable x. The umbers c are called the coefficiets of

More information

It should be unbiased, or approximately unbiased. Variance of the variance estimator should be small. That is, the variance estimator is stable.

It should be unbiased, or approximately unbiased. Variance of the variance estimator should be small. That is, the variance estimator is stable. Chapter 10 Variace Estimatio 10.1 Itroductio Variace estimatio is a importat practical problem i survey samplig. Variace estimates are used i two purposes. Oe is the aalytic purpose such as costructig

More information

Basis for simulation techniques

Basis for simulation techniques Basis for simulatio techiques M. Veeraraghava, March 7, 004 Estimatio is based o a collectio of experimetal outcomes, x, x,, x, where each experimetal outcome is a value of a radom variable. x i. Defiitios

More information

Direction: This test is worth 250 points. You are required to complete this test within 50 minutes.

Direction: This test is worth 250 points. You are required to complete this test within 50 minutes. Term Test October 3, 003 Name Math 56 Studet Number Directio: This test is worth 50 poits. You are required to complete this test withi 50 miutes. I order to receive full credit, aswer each problem completely

More information

MATH/STAT 352: Lecture 15

MATH/STAT 352: Lecture 15 MATH/STAT 352: Lecture 15 Sectios 5.2 ad 5.3. Large sample CI for a proportio ad small sample CI for a mea. 1 5.2: Cofidece Iterval for a Proportio Estimatig proportio of successes i a biomial experimet

More information

Since X n /n P p, we know that X n (n. Xn (n X n ) Using the asymptotic result above to obtain an approximation for fixed n, we obtain

Since X n /n P p, we know that X n (n. Xn (n X n ) Using the asymptotic result above to obtain an approximation for fixed n, we obtain Assigmet 9 Exercise 5.5 Let X biomial, p, where p 0, 1 is ukow. Obtai cofidece itervals for p i two differet ways: a Sice X / p d N0, p1 p], the variace of the limitig distributio depeds oly o p. Use the

More information

ON POINTWISE BINOMIAL APPROXIMATION

ON POINTWISE BINOMIAL APPROXIMATION Iteratioal Joural of Pure ad Applied Mathematics Volume 71 No. 1 2011, 57-66 ON POINTWISE BINOMIAL APPROXIMATION BY w-functions K. Teerapabolar 1, P. Wogkasem 2 Departmet of Mathematics Faculty of Sciece

More information

Advanced Stochastic Processes.

Advanced Stochastic Processes. Advaced Stochastic Processes. David Gamarik LECTURE 2 Radom variables ad measurable fuctios. Strog Law of Large Numbers (SLLN). Scary stuff cotiued... Outlie of Lecture Radom variables ad measurable fuctios.

More information

R. van Zyl 1, A.J. van der Merwe 2. Quintiles International, University of the Free State

R. van Zyl 1, A.J. van der Merwe 2. Quintiles International, University of the Free State Bayesia Cotrol Charts for the Two-parameter Expoetial Distributio if the Locatio Parameter Ca Take o Ay Value Betwee Mius Iity ad Plus Iity R. va Zyl, A.J. va der Merwe 2 Quitiles Iteratioal, ruaavz@gmail.com

More information

Linear Regression Demystified

Linear Regression Demystified Liear Regressio Demystified Liear regressio is a importat subject i statistics. I elemetary statistics courses, formulae related to liear regressio are ofte stated without derivatio. This ote iteds to

More information

Subject: Differential Equations & Mathematical Modeling-III

Subject: Differential Equations & Mathematical Modeling-III Power Series Solutios of Differetial Equatios about Sigular poits Subject: Differetial Equatios & Mathematical Modelig-III Lesso: Power series solutios of differetial equatios about Sigular poits Lesso

More information

Economics 241B Relation to Method of Moments and Maximum Likelihood OLSE as a Maximum Likelihood Estimator

Economics 241B Relation to Method of Moments and Maximum Likelihood OLSE as a Maximum Likelihood Estimator Ecoomics 24B Relatio to Method of Momets ad Maximum Likelihood OLSE as a Maximum Likelihood Estimator Uder Assumptio 5 we have speci ed the distributio of the error, so we ca estimate the model parameters

More information

An Introduction to Asymptotic Theory

An Introduction to Asymptotic Theory A Itroductio to Asymptotic Theory Pig Yu School of Ecoomics ad Fiace The Uiversity of Hog Kog Pig Yu (HKU) Asymptotic Theory 1 / 20 Five Weapos i Asymptotic Theory Five Weapos i Asymptotic Theory Pig Yu

More information

[412] A TEST FOR HOMOGENEITY OF THE MARGINAL DISTRIBUTIONS IN A TWO-WAY CLASSIFICATION

[412] A TEST FOR HOMOGENEITY OF THE MARGINAL DISTRIBUTIONS IN A TWO-WAY CLASSIFICATION [412] A TEST FOR HOMOGENEITY OF THE MARGINAL DISTRIBUTIONS IN A TWO-WAY CLASSIFICATION BY ALAN STUART Divisio of Research Techiques, Lodo School of Ecoomics 1. INTRODUCTION There are several circumstaces

More information

Random Matrices with Blocks of Intermediate Scale Strongly Correlated Band Matrices

Random Matrices with Blocks of Intermediate Scale Strongly Correlated Band Matrices Radom Matrices with Blocks of Itermediate Scale Strogly Correlated Bad Matrices Jiayi Tog Advisor: Dr. Todd Kemp May 30, 07 Departmet of Mathematics Uiversity of Califoria, Sa Diego Cotets Itroductio Notatio

More information

Similarity Solutions to Unsteady Pseudoplastic. Flow Near a Moving Wall

Similarity Solutions to Unsteady Pseudoplastic. Flow Near a Moving Wall Iteratioal Mathematical Forum, Vol. 9, 04, o. 3, 465-475 HIKARI Ltd, www.m-hikari.com http://dx.doi.org/0.988/imf.04.48 Similarity Solutios to Usteady Pseudoplastic Flow Near a Movig Wall W. Robi Egieerig

More information

ki, X(n) lj (n) = (ϱ (n) ij ) 1 i,j d.

ki, X(n) lj (n) = (ϱ (n) ij ) 1 i,j d. APPLICATIONES MATHEMATICAE 22,2 (1994), pp. 193 200 M. WIŚNIEWSKI (Kielce) EXTREME ORDER STATISTICS IN AN EQUALLY CORRELATED GAUSSIAN ARRAY Abstract. This paper cotais the results cocerig the wea covergece

More information

We are mainly going to be concerned with power series in x, such as. (x)} converges - that is, lims N n

We are mainly going to be concerned with power series in x, such as. (x)} converges - that is, lims N n Review of Power Series, Power Series Solutios A power series i x - a is a ifiite series of the form c (x a) =c +c (x a)+(x a) +... We also call this a power series cetered at a. Ex. (x+) is cetered at

More information

Zeros of Polynomials

Zeros of Polynomials Math 160 www.timetodare.com 4.5 4.6 Zeros of Polyomials I these sectios we will study polyomials algebraically. Most of our work will be cocered with fidig the solutios of polyomial equatios of ay degree

More information

t distribution [34] : used to test a mean against an hypothesized value (H 0 : µ = µ 0 ) or the difference

t distribution [34] : used to test a mean against an hypothesized value (H 0 : µ = µ 0 ) or the difference EXST30 Backgroud material Page From the textbook The Statistical Sleuth Mea [0]: I your text the word mea deotes a populatio mea (µ) while the work average deotes a sample average ( ). Variace [0]: The

More information

1 Introduction to reducing variance in Monte Carlo simulations

1 Introduction to reducing variance in Monte Carlo simulations Copyright c 010 by Karl Sigma 1 Itroductio to reducig variace i Mote Carlo simulatios 11 Review of cofidece itervals for estimatig a mea I statistics, we estimate a ukow mea µ = E(X) of a distributio by

More information

NCSS Statistical Software. Tolerance Intervals

NCSS Statistical Software. Tolerance Intervals Chapter 585 Itroductio This procedure calculates oe-, ad two-, sided tolerace itervals based o either a distributio-free (oparametric) method or a method based o a ormality assumptio (parametric). A two-sided

More information

Goodness-Of-Fit For The Generalized Exponential Distribution. Abstract

Goodness-Of-Fit For The Generalized Exponential Distribution. Abstract Goodess-Of-Fit For The Geeralized Expoetial Distributio By Amal S. Hassa stitute of Statistical Studies & Research Cairo Uiversity Abstract Recetly a ew distributio called geeralized expoetial or expoetiated

More information

Section 14. Simple linear regression.

Section 14. Simple linear regression. Sectio 14 Simple liear regressio. Let us look at the cigarette dataset from [1] (available to dowload from joural s website) ad []. The cigarette dataset cotais measuremets of tar, icotie, weight ad carbo

More information

Discrete Mathematics for CS Spring 2008 David Wagner Note 22

Discrete Mathematics for CS Spring 2008 David Wagner Note 22 CS 70 Discrete Mathematics for CS Sprig 2008 David Wager Note 22 I.I.D. Radom Variables Estimatig the bias of a coi Questio: We wat to estimate the proportio p of Democrats i the US populatio, by takig

More information

The Sample Variance Formula: A Detailed Study of an Old Controversy

The Sample Variance Formula: A Detailed Study of an Old Controversy The Sample Variace Formula: A Detailed Study of a Old Cotroversy Ky M. Vu PhD. AuLac Techologies Ic. c 00 Email: kymvu@aulactechologies.com Abstract The two biased ad ubiased formulae for the sample variace

More information

Chapter 9 - CD companion 1. A Generic Implementation; The Common-Merge Amplifier. 1 τ is. ω ch. τ io

Chapter 9 - CD companion 1. A Generic Implementation; The Common-Merge Amplifier. 1 τ is. ω ch. τ io Chapter 9 - CD compaio CHAPTER NINE CD-9.2 CD-9.2. Stages With Voltage ad Curret Gai A Geeric Implemetatio; The Commo-Merge Amplifier The advaced method preseted i the text for approximatig cutoff frequecies

More information

Lecture 24: Variable selection in linear models

Lecture 24: Variable selection in linear models Lecture 24: Variable selectio i liear models Cosider liear model X = Z β + ε, β R p ad Varε = σ 2 I. Like the LSE, the ridge regressio estimator does ot give 0 estimate to a compoet of β eve if that compoet

More information

Econ 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara

Econ 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara Poit Estimator Eco 325 Notes o Poit Estimator ad Cofidece Iterval 1 By Hiro Kasahara Parameter, Estimator, ad Estimate The ormal probability desity fuctio is fully characterized by two costats: populatio

More information

1.010 Uncertainty in Engineering Fall 2008

1.010 Uncertainty in Engineering Fall 2008 MIT OpeCourseWare http://ocw.mit.edu.00 Ucertaity i Egieerig Fall 2008 For iformatio about citig these materials or our Terms of Use, visit: http://ocw.mit.edu.terms. .00 - Brief Notes # 9 Poit ad Iterval

More information

V. Nollau Institute of Mathematical Stochastics, Technical University of Dresden, Germany

V. Nollau Institute of Mathematical Stochastics, Technical University of Dresden, Germany PROBABILITY AND STATISTICS Vol. III - Correlatio Aalysis - V. Nollau CORRELATION ANALYSIS V. Nollau Istitute of Mathematical Stochastics, Techical Uiversity of Dresde, Germay Keywords: Radom vector, multivariate

More information

Law of the sum of Bernoulli random variables

Law of the sum of Bernoulli random variables Law of the sum of Beroulli radom variables Nicolas Chevallier Uiversité de Haute Alsace, 4, rue des frères Lumière 68093 Mulhouse icolas.chevallier@uha.fr December 006 Abstract Let be the set of all possible

More information

Solutions to HW Assignment 1

Solutions to HW Assignment 1 Solutios to HW: 1 Course: Theory of Probability II Page: 1 of 6 Uiversity of Texas at Austi Solutios to HW Assigmet 1 Problem 1.1. Let Ω, F, {F } 0, P) be a filtered probability space ad T a stoppig time.

More information

Recurrence Relations

Recurrence Relations Recurrece Relatios Aalysis of recursive algorithms, such as: it factorial (it ) { if (==0) retur ; else retur ( * factorial(-)); } Let t be the umber of multiplicatios eeded to calculate factorial(). The

More information

4. Partial Sums and the Central Limit Theorem

4. Partial Sums and the Central Limit Theorem 1 of 10 7/16/2009 6:05 AM Virtual Laboratories > 6. Radom Samples > 1 2 3 4 5 6 7 4. Partial Sums ad the Cetral Limit Theorem The cetral limit theorem ad the law of large umbers are the two fudametal theorems

More information

Entropy Rates and Asymptotic Equipartition

Entropy Rates and Asymptotic Equipartition Chapter 29 Etropy Rates ad Asymptotic Equipartitio Sectio 29. itroduces the etropy rate the asymptotic etropy per time-step of a stochastic process ad shows that it is well-defied; ad similarly for iformatio,

More information

Geometry of LS. LECTURE 3 GEOMETRY OF LS, PROPERTIES OF σ 2, PARTITIONED REGRESSION, GOODNESS OF FIT

Geometry of LS. LECTURE 3 GEOMETRY OF LS, PROPERTIES OF σ 2, PARTITIONED REGRESSION, GOODNESS OF FIT OCTOBER 7, 2016 LECTURE 3 GEOMETRY OF LS, PROPERTIES OF σ 2, PARTITIONED REGRESSION, GOODNESS OF FIT Geometry of LS We ca thik of y ad the colums of X as members of the -dimesioal Euclidea space R Oe ca

More information

Estimation of the Mean and the ACVF

Estimation of the Mean and the ACVF Chapter 5 Estimatio of the Mea ad the ACVF A statioary process {X t } is characterized by its mea ad its autocovariace fuctio γ ), ad so by the autocorrelatio fuctio ρ ) I this chapter we preset the estimators

More information

Dirichlet s Theorem on Arithmetic Progressions

Dirichlet s Theorem on Arithmetic Progressions Dirichlet s Theorem o Arithmetic Progressios Athoy Várilly Harvard Uiversity, Cambridge, MA 0238 Itroductio Dirichlet s theorem o arithmetic progressios is a gem of umber theory. A great part of its beauty

More information

Output Analysis (2, Chapters 10 &11 Law)

Output Analysis (2, Chapters 10 &11 Law) B. Maddah ENMG 6 Simulatio Output Aalysis (, Chapters 10 &11 Law) Comparig alterative system cofiguratio Sice the output of a simulatio is radom, the comparig differet systems via simulatio should be doe

More information

Table 12.1: Contingency table. Feature b. 1 N 11 N 12 N 1b 2 N 21 N 22 N 2b. ... a N a1 N a2 N ab

Table 12.1: Contingency table. Feature b. 1 N 11 N 12 N 1b 2 N 21 N 22 N 2b. ... a N a1 N a2 N ab Sectio 12 Tests of idepedece ad homogeeity I this lecture we will cosider a situatio whe our observatios are classified by two differet features ad we would like to test if these features are idepedet

More information

Estimation for Complete Data

Estimation for Complete Data Estimatio for Complete Data complete data: there is o loss of iformatio durig study. complete idividual complete data= grouped data A complete idividual data is the oe i which the complete iformatio of

More information

A NEW CLASS OF 2-STEP RATIONAL MULTISTEP METHODS

A NEW CLASS OF 2-STEP RATIONAL MULTISTEP METHODS Jural Karya Asli Loreka Ahli Matematik Vol. No. (010) page 6-9. Jural Karya Asli Loreka Ahli Matematik A NEW CLASS OF -STEP RATIONAL MULTISTEP METHODS 1 Nazeeruddi Yaacob Teh Yua Yig Norma Alias 1 Departmet

More information

Bayesian Methods: Introduction to Multi-parameter Models

Bayesian Methods: Introduction to Multi-parameter Models Bayesia Methods: Itroductio to Multi-parameter Models Parameter: θ = ( θ, θ) Give Likelihood p(y θ) ad prior p(θ ), the posterior p proportioal to p(y θ) x p(θ ) Margial posterior ( θ, θ y) is Iterested

More information

7.1 Convergence of sequences of random variables

7.1 Convergence of sequences of random variables Chapter 7 Limit Theorems Throughout this sectio we will assume a probability space (, F, P), i which is defied a ifiite sequece of radom variables (X ) ad a radom variable X. The fact that for every ifiite

More information

Asymptotic distribution of products of sums of independent random variables

Asymptotic distribution of products of sums of independent random variables Proc. Idia Acad. Sci. Math. Sci. Vol. 3, No., May 03, pp. 83 9. c Idia Academy of Scieces Asymptotic distributio of products of sums of idepedet radom variables YANLING WANG, SUXIA YAO ad HONGXIA DU ollege

More information

A Risk Comparison of Ordinary Least Squares vs Ridge Regression

A Risk Comparison of Ordinary Least Squares vs Ridge Regression Joural of Machie Learig Research 14 (2013) 1505-1511 Submitted 5/12; Revised 3/13; Published 6/13 A Risk Compariso of Ordiary Least Squares vs Ridge Regressio Paramveer S. Dhillo Departmet of Computer

More information

Output Analysis and Run-Length Control

Output Analysis and Run-Length Control IEOR E4703: Mote Carlo Simulatio Columbia Uiversity c 2017 by Marti Haugh Output Aalysis ad Ru-Legth Cotrol I these otes we describe how the Cetral Limit Theorem ca be used to costruct approximate (1 α%

More information

A Challenging Test For Convergence Accelerators: Summation Of A Series With A Special Sign Pattern

A Challenging Test For Convergence Accelerators: Summation Of A Series With A Special Sign Pattern Applied Mathematics E-Notes, 6(006), 5-34 c ISSN 1607-510 Available free at mirror sites of http://www.math.thu.edu.tw/ ame/ A Challegig Test For Covergece Accelerators: Summatio Of A Series With A Special

More information

Direction: This test is worth 150 points. You are required to complete this test within 55 minutes.

Direction: This test is worth 150 points. You are required to complete this test within 55 minutes. Term Test 3 (Part A) November 1, 004 Name Math 6 Studet Number Directio: This test is worth 10 poits. You are required to complete this test withi miutes. I order to receive full credit, aswer each problem

More information

Lecture 10 October Minimaxity and least favorable prior sequences

Lecture 10 October Minimaxity and least favorable prior sequences STATS 300A: Theory of Statistics Fall 205 Lecture 0 October 22 Lecturer: Lester Mackey Scribe: Brya He, Rahul Makhijai Warig: These otes may cotai factual ad/or typographic errors. 0. Miimaxity ad least

More information

Probability 2 - Notes 10. Lemma. If X is a random variable and g(x) 0 for all x in the support of f X, then P(g(X) 1) E[g(X)].

Probability 2 - Notes 10. Lemma. If X is a random variable and g(x) 0 for all x in the support of f X, then P(g(X) 1) E[g(X)]. Probability 2 - Notes 0 Some Useful Iequalities. Lemma. If X is a radom variable ad g(x 0 for all x i the support of f X, the P(g(X E[g(X]. Proof. (cotiuous case P(g(X Corollaries x:g(x f X (xdx x:g(x

More information

Berry-Esseen bounds for self-normalized martingales

Berry-Esseen bounds for self-normalized martingales Berry-Essee bouds for self-ormalized martigales Xiequa Fa a, Qi-Ma Shao b a Ceter for Applied Mathematics, Tiaji Uiversity, Tiaji 30007, Chia b Departmet of Statistics, The Chiese Uiversity of Hog Kog,

More information

The log-behavior of n p(n) and n p(n)/n

The log-behavior of n p(n) and n p(n)/n Ramauja J. 44 017, 81-99 The log-behavior of p ad p/ William Y.C. Che 1 ad Ke Y. Zheg 1 Ceter for Applied Mathematics Tiaji Uiversity Tiaji 0007, P. R. Chia Ceter for Combiatorics, LPMC Nakai Uivercity

More information

STA6938-Logistic Regression Model

STA6938-Logistic Regression Model Dr. Yig Zhag STA6938-Logistic Regressio Model Topic -Simple (Uivariate) Logistic Regressio Model Outlies:. Itroductio. A Example-Does the liear regressio model always work? 3. Maximum Likelihood Curve

More information

THE KALMAN FILTER RAUL ROJAS

THE KALMAN FILTER RAUL ROJAS THE KALMAN FILTER RAUL ROJAS Abstract. This paper provides a getle itroductio to the Kalma filter, a umerical method that ca be used for sesor fusio or for calculatio of trajectories. First, we cosider

More information

Solution to Chapter 2 Analytical Exercises

Solution to Chapter 2 Analytical Exercises Nov. 25, 23, Revised Dec. 27, 23 Hayashi Ecoometrics Solutio to Chapter 2 Aalytical Exercises. For ay ε >, So, plim z =. O the other had, which meas that lim E(z =. 2. As show i the hit, Prob( z > ε =

More information

It is often useful to approximate complicated functions using simpler ones. We consider the task of approximating a function by a polynomial.

It is often useful to approximate complicated functions using simpler ones. We consider the task of approximating a function by a polynomial. Taylor Polyomials ad Taylor Series It is ofte useful to approximate complicated fuctios usig simpler oes We cosider the task of approximatig a fuctio by a polyomial If f is at least -times differetiable

More information

LECTURE 14 NOTES. A sequence of α-level tests {ϕ n (x)} is consistent if

LECTURE 14 NOTES. A sequence of α-level tests {ϕ n (x)} is consistent if LECTURE 14 NOTES 1. Asymptotic power of tests. Defiitio 1.1. A sequece of -level tests {ϕ x)} is cosistet if β θ) := E θ [ ϕ x) ] 1 as, for ay θ Θ 1. Just like cosistecy of a sequece of estimators, Defiitio

More information