Variance-Stabilizing and Confidence-Stabilizing. Transformations for the Normal Correlation Coefficient. with Known Variances

Size: px
Start display at page:

Download "Variance-Stabilizing and Confidence-Stabilizing. Transformations for the Normal Correlation Coefficient. with Known Variances"

Transcription

1 Variace-Stabilizig ad Cofidece-Stabilizig Trasformatios for the Normal Correlatio Coefficiet with Kow Variaces Bailey K. Fosdick ad Michael D. Perlma Techical Report o. 610 Departmet of Statistics Uiversity of Washigto Seattle, WA March 6, 013

2 Abstract Fosdick ad Raftery 01 revisited the classical problem of iferece for a bivariate ormal correlatio coefficiet ρ whe the variaces are kow. They cosidered several frequetist ad Bayesia estimators, the former icludig the maximum likelihood estimator MLE, but did ot obtai the stadard errors of these estimators or cofidece itervals for ρ. Here we preset a ew variace-stabilizig trasformatio y for the MLE i the kow-variace case. Adjustig y appropriately accordig to the sample size produces a cofidece-stabilizig trasformatio y that provides more accurate iterval estimates for ρ tha the MLE, as does Fisher s classical z trasformatio for the MLE i the ukow-variace case. Iterestigly, the z trasform applied to the MLE for the ukow-but-equal-variace case performs well i the kow-variace case for smaller values of ρ. Both these methods are also useful for comparig two or more correlatio coefficiets i the kow-variace case; hypothesis testig i this case is also discussed.

3 1. Itroductio Let x i, y i, i = 1,...,, be i.i.d. observatios from a bivariate ormal distributio with correlatio coefficiet ρ ad variaces σ x ad σ y. We cosider the followig three icreasigly restrictive covariace models: Model 1: σx ad σy both ukow; Model : σx/σ y aσ, a kow, σ ukow; Model 3: σx ad σy both kow. Without essetial loss of geerality we assume throughout that the meas are both 0, that a = 1 so σx = σy = σ i Model, ad that σx = σy = 1 i Model 3. These well-kow models form the basis for may textbook examples ad exercises. Models 1 ad are full expoetial families with complete sufficiet statistics W 1 s x, s y, y ad W s x + s y, y respectively, where s x = 1 x i, s y = 1 y i, y = 1 xi y i. 1 Both Models 1 ad admit explicit maximum likelihood estimators MLE for their ukow parameters: Model 1: ˆρ = r 1, ˆσ x = s x, ˆσ y = s y; Model : ˆρ = r, ˆσ = 1 + s y; where r 1 = y, r = y. s y s x + s y The geometric mea of s x ad s y i r 1 is replaced by the arithmetic mea i r, so r < r 1. Model 3 is a curved expoetial family with log likelihood fuctio l ρ = c log1 ρ + s y 1 ρ + ρy 1 ρ, 3 cf. Stuart ad Ord 1991, Ch. 18. Here W is miimal sufficiet but ot complete; ote that 1

4 W is two-dimesioal while the ukow parameter ρ is oe-dimesioal. The MLE ˆρ is r 3, the maximizig root of the cubic equatio ρ1 ρ ρs x + s y ρ y = 0. 4 I this paper we focus o Model 3, which is less ameable to exact aalysis but was recetly ecoutered by Fosdick ad Raftery 01 whe comparig fertility rate forecast errors betwee developed ad udeveloped coutries. They cosidered several variats of three frequetist estimators for ρ, as well as several Bayesia estimators. The frequetist estimators were r 1, r 3, ad the empirical estimator r 4 = y 5 s trucated to lie i [ 1, 1], where the estimated stadard deviatio estimates ad s y i r 1 ad r are replaced by the kow values 1. However, Fosdick ad Raftery did ot discuss the stadard errors of their estimators. It is well kow 1 that as, r1 ρ N 0, 1 ρ uder Model 1, 6 r ρ N 0, 1 ρ uder Model, 7 r3 ρ N 0, 1 ρ uder Model 3, ρ r4 ρ N 0, 1 + ρ uder Model 3. 9 Because Model 3 is regular sufficietly smooth, the MLE r 3 is amptotically optimal for this model, that is, has smallest amptotic variace amog all amptotically ormal estimators. Whereas the MLE r 3 is optimal for large samples sizes uder Model 3, Fosdick ad Raftery s iterest was i the case of small ad moderate sample sizes. I Table 1 the mea-squared errors 1 E.g. Lehma 1983, 19 p.441, Problems 6..0 ad 6.5.9, Stuart ad Ord 1991, Ch. 18. The result 7 follows from 6 because r r 1 = O p 1 uder Model by a stadard Taylor expasio argumet.

5 of r 1, r, r 3, ad r 4 have bee obtaied,3 by simulatio. Here r 3 performs best uless ρ is small, where r may be somewhat better, while r 4 performs poorly i most cases. These effects become more marked as the sample size icreases, which is i agreemet with the orderig of the amptotic variaces i 6 9. Table 1: Root MSE 1000; r 4 trucated to [-1,1]. largest stadard error of estimates is 0.4. Estimator ρ r r r r r r r r r r r r I additio to its large MSE the estimator r 4 sometimes falls outside the admissible rage [ 1, 1], while the Bayesia estimators are ot ameable to frequetist iterval estimatio; these will ot be discussed further here. As is the case for r 1, covergece to ormality is also slow for r ad r 3, so for small or moderate sample sizes the amptotic variaces i 6 8 do ot provide good approximatios for their stadard errors Table. Fisher s celebrated z trasformatio greatly improves the ormal approximatio to the distributio of r 1 uder Model 1 so is ivaluable for iferece about ρ i this case see.1. Iterestigly, the exact distributio of zr is ea to specify uder Model see. ad therefore uder Model 3, where it provides estimates ad tests that perform well for smaller values of ρ see.4 ad Sectio 3. The MSE is show for a uiform rage of ρ values rather tha ρ values, because it is ρ rather tha ρ that idicates the stregth of the relatioship betwee x ad y. 3 Usig the formulas for r 3 give i F&R, % of estimates did ot satisfy the cubic equatio i 4 withi ±1e 8. For these simulatios, the default optimize procedure i R was used to obtai the estimate r 3 that maximizes the likelihood. 3

6 Table : Ratio of empirical variace to amptotic variace. Estimator ρ r r r r r r r r r For Model 3, a ew variace-stabilizig trasformatio y for the MLE r 3 is preseted i.3. Ulike the z-trasform, the y-trasform must be adjusted for the sample size to stabilize the cofidece coverage of r 3. Whe this is doe the resultig cofidece-stabilized trasformatio y r 3 provides more precise cofidece itervals uder Model 3 tha itervals based o r 1 ad r for moderate ad large values of ρ. For small values, itervals based o zr are preferable. This is demostrated via simulatio i.4. The use of zr, yr 3, ad y r 3 for comparig two or more correlatio coefficiets uder Model 3 is outlied i Sectio 3. F&R also cosidered the problem of testig H 0 : ρ = 0 idepedece vs. the oe-sided alterative H 1 : ρ > 0 uder Model 3. They cosidered the tests that reject H 0 for large values of r 1, r 3, ad r 4 ad their variats, as well as several Bayes tests, ad approximated the sigificace levels of these tests by Mote Carlo simulatio. It is well kow that the r 1 test is exact for this problem. I Sectio 4 we ote that the r test is also exact ad has a iterestig although limited optimality property. Also we show that the r 4 test is locally most powerful for alteratives ρ 0 ad derive the amptotically most powerful test also exact for alteratives ρ 1. O the basis of umerical power comparisos, the r resp., r 3 test is recommeded if small resp., large alterative values of ρ are expected.. Cofidece itervals for ρ uder Model 3. 4

7 .1. Cofidece itervals based o the Model 1 MLE r 1. Let g γ deote the upper γ- quatile of the stadard ormal Gaussia distributio. From 6, for sufficietly large r 1 ± 1 r 1 g α/ 10 is a approximate 1 α cofidece iterval 4 for ρ uder the urestricted Model 1, hece for Models ad 3. Ufortuately the sample size required for the accuracy of the ormal approximatio 6 depeds o the ukow ρ, but this is remedied by Fisher s celebrated z-trasformatio for r 1 cf. Aderso 1984, 4..3, give by the idefiite itegral zρ = dρ 1 ρ = 1 log 1 + ρ ρ This is a variace-stabilizig trasformatio that satisfies [zr1 zρ] N0, 1 1 with faster covergece to ormality tha 6 see Table 3. This provides the approximate 1 α cofidece iterval for ρ give by z 1 zr 1 ± g α/, 13 valid for Model 1 hece for Models ad 3. Note that z 1 r = tahr. A exact cofidece iterval for ρ uder Model ad therefore Model 3 is readily obtaied from the classical Studet t-distributio of the sample regressio coefficiet of y i give x i e.g. Stuart ad Ord 1987, eq. 16.9: 1 r1 sx s y ρ 1 r 1 t As is well kow for 13, usig rather tha results i a coverage probability somewhat closer to the omial value 1 α. 5

8 This provides the exact 1 α cofidece iterval 5 1 r1 r 1 ± 1 t 1;α/ 15 for ρ uder Models ad 3, where t 1;γ deotes the upper γ-quatile of the t 1 distributio. Although the iterval 15 is exact, it is ot a fuctio of the miimal sufficiet statistic s x + s y, y for Model 3 so may be iefficiet; this is cofirmed i.4... Cofidece itervals based o the Model MLE r. Because r has the same amptotic ormal distributio 6 as r 1, r ± 1 r 1 g α/ 16 is a approximate 1 α cofidece iterval for ρ uder Model, hece uder Model 3. Furthermore, the z-trasformatio also applies to r i this case, yieldig the same ormal approximatio: 1 [zr zρ] N0, 1; 17 this gives aother approximate 1 α cofidece iterval for ρ uder Model, hece Model 3: z 1 zr ± g α/ It is perhaps less well kow that Fisher s z-trasformatio i fact applies exactly to r uder Model. The orthogoally-trasformed radom vectors u i 1 = x i + y i, i = 1,...,, 19 1 x i y i v i 5 By regressig x i o y i, a secod exact 1 α cofidece iterval is obtaied by iterchagig ad s y i See Footote 1. Also, the etries for zr i Table 3 show that its variace is approximated much better by 1/ 1 tha by 1/, hece its use i 17. 6

9 have the zero-mea bivariate ormal distributio with covariace matrix 1 + ρσ ρσ σ u 0 0 σv. 0 Thus u i v i, i = 1,...,, ad u u i σuχ, v vi σvχ, 1 u v = 1 + r σ u 1 + ρ F, F,. 1 r 1 ρ σ v Take logarithms to obtai the exact relatio zr zρ + 1 log F, zρ + Z, 3 where Z deotes Fisher s Z distributio with ad degrees of freedom cf. Stuart ad Ord 1987, This yields the followig exact 1 α cofidece iterval for ρ: z 1 zr ± Z ;α/, 4 valid uder Model hece Model 3. Here Z ;γ deotes the upper γ-quatile of Z, which ca be expressed i terms of the γ-quatile of F,. Ulike r 1, r is a fuctio of the miimal sufficiet statistic W s x + s y, y for Model 3 so it may be expected to produce more efficiet estimates tha r 1 i this case. This should be most oticeable whe ρ is small, sice r < r 1 ; see Tables 1 ad..3. Cofidece itervals based o the Model 3 MLE r 3. From 8, for sufficietly large 1 r3 r 3 ± g α/ r 3 is a approximate 1 α cofidece iterval for ρ uder Model 3. As for r 1, however, the ormal approximatio 8 for r 3 is iaccurate uless the sample size is large see Table. This suggests 7

10 seekig a variace-stabilizig traformatio y for r 3 uder Model 3. Startig from 8 ad applyig Mathematica we obtai yρ = 1 + ρ = 1 log dρ 6 1 ρ [ 1 ] + ρ log 1 + ρ + ρ, 1 ρ 1 + ρ ρ 1 + ρ + 1 ρ so that [yr3 yρ] N0, 1 as. 7 Thus a approximate 1 α cofidece iterval for ρ valid uder Model 3 is give by 7 y 1 yr 3 ± g α/. 8 It follows from 6 that y0 = 0, yρ = y ρ atimmetry, ad y ρ 1 so y is strictly icreasig o 1, 1. I fact y ρ z ρ by comparig 6 ad 11, so y icreases faster tha z. This is most marked i the tails, as see from Figure 1: yρ zρ for 0 ρ.5, while 1 < yρ/zρ i ρ for ρ >.5. This form of y is eeded to stabilize the variace of r 3 because its amptotic variace is smaller tha that of r 1 for larger values of ρ, cf. 6 ad 8. Table 3 shows, however, that whe the sample size is small or moderate, y is ot etirely successful at stabilizig the variace of r 3 for ρ ear 0. For example, whe = 10 ad ρ = 0, the actual variace of yr 3 is 33% greater tha the amptotic approximatio 1/ give by 7. Furthermore, i Table 5 it is see that the coverage probability of the cofidece iterval 8 based o yr 3 may deviate oticeably from the omial value 1 α for α =.05 whe ρ is small. This suggests makig a multiplicative adjustmet y ρ m ρ yρ 9 of the y-trasformatio such that m ρ < 1 for ρ ear 0 so that y ρ will icrease slower tha yρ for ρ i that regio, while m ρ 1 for larger values of ρ. 7 Like Varzr 1, Varyr 3 is better approximated by 1 tha by 1 ; see Table 3. 8

11 Table 3: Empirical variace. Estimator ρ zr zr yr y r zr zr yr y r zr zr yr y r This ca be accomplished by a ad hoc choice for m ρ of the form m ρ = 1 a /10 e b yρ c, 30 where a, b, ad c are positive costats chose as described below. Like y, y 0 = 0 ad y ρ is atimmetric ad strictly icreasig o 1, 1. It is see i Figure 1 that as desired, y ρ icreases slower tha yρ for ρ ear 0 ad y ρ yρ outside that regio. Trasformatios Trasformatios ear ρ=0 Trasformatios for ρ z y y ρ ρ ρ Figure 1: Trasformatios z, y, ad y. 9

12 Because y r 3 yr 3 = O p 1, y r 3 has the same amptotic distributio as yr 3, amely [y r 3 y ρ] N0, 1 as, 31 so y 1 y r 3 ± g α/ 3 is also a approximate 1 α cofidece iterval for ρ uder Model 3. We choose a, b, ad c to miimize the maximum differece betwee the empirical coverage probabilities of 3 ad the omial values 1 α across the rages of, α, ad ρ cosidered i Table 4. Specifically, we search for the triple a, b, c that satisfies a, b, c = argmi a,b,c max N, α L, ρ R [ 1 α Pr y ρ y r 3 ± g ] α/, where N = {10, 0, 40}, L = {.01,.05,.1}, ad R = {0,.1,.3,.5,.7,.9}. Due to the o-covexity of the optimizatio problem, first we chose the best three triples a, b, c o the grid A B C, where A = B = C = {.1,.,...,.9, 3.0}. Next, three improved triples were obtaied usig the Nelder-Mead 1965 optimizatio procedure iitialized at each of the first three chose triples. The best of the three improved triples was selected for y : a = 0.403, b = 1.091, ad c = See the Appedix for further details. With this choice of a, b, c for y, Table 4 shows that the coverage probabilities for the 1 α cofidece iterval 3 usig y r 3 are acceptably close to the omial value 1 α for α A, eve though its variace remais somewhat below the omial value 1/ whe ρ = 0 Table 3. For this reaso y might better be called a cofidece-stabilizig trasformatio, rather tha variace-stabilizig..4. Compariso of the iterval estimators for ρ. Table 5 shows the coverage probabilities for the ie 1 α cofidece itervals for ρ preseted i.1,., ad.3 whe α = 0.05, i.e., 95% cofidece. Oly five of these, marked by, attai or adequately approximate the omial 95% level: the exact itervals 15 based o r 1 ad 4 based o zr, ad the approximate 10

13 Table 4: Coverage probabilities for the 1 α-cofidece itervals based o y r 3. α ρ itervals 13 based o zr 1, 18 based o zr, ad 31 based o the modified y-trasform y r 3 but ot the approximate iterval 8 based o yr 3 itself. The average half-widths of these five iterval estimators are show i Table 6 for α =.10,.05,.01. Of these five, 18 ad 4, both based o zr, are most precise for small values of ρ while 31 based o y r 3 is most precise for itermediate ad large values of ρ. The rage of ρ values for which 31 is preferable to 18 ad 4 expads as the sample size icreases, i accordace with the smaller amptotic variace of r 3 except at ρ = 0, as see i 7 ad Comparig two or more correlatios whe the variaces are kow. Suppose that samples of sizes k, k = 1,..., q, are draw from q bivariate ormal distributios with ukow populatio correlatios ρ k ad kow variaces. Just as the z-trasformatio zr 1 is useful for combiig or comparig two or more sample correlatio coefficiets uder Models 1 ad cf. Sedecor ad Cochra 1967, 7.7, the y-trasform yr 3 or its modificatio y r 3 ca be used for these purposes uder Model 3, the kow-variace case. The z-trasform zr ca also be used, especially if small values of the correlatios are expected. i Suppose that ρ 1 = = ρ q ρ ad that it is desired to estimate this commo ρ. If the sample sizes k are large, we ca weight the y-trasforms of the Model 3 MLEs r 1 3,..., r q 3 accordig to their amptotic iverse variaces 1,..., q to obtai the followig weighted 11

14 Table 5: Coverage probabilities of 95% cofidece itervals for ρ largest stadard error The itervals marked have exact or approximate 95% coverage for all three samples sizes. Cofidece Iterval ρ r 10 r 1 ± 1 g α/ z 1 zr 1 ± g α/ r r 1 ± 1 1 t 1;α/ r r ± g α/ z 1 zr ± g α/ z 1 zr ± Z ;α/ r3 r 3 ± g 1+r α/ y 1 yr 3 ± g α/ y r 3 ± g α/ y 1 1 r 0 r 1 ± 1 g α/ z 1 zr 1 ± g α/ r 1 ± y 1 1 r 1 1 t 1;α/ r r ± g α/ z 1 zr ± g α/ z 1 zr ± Z ;α/ r3 r 3 ± g 1+r α/ y 1 yr 3 ± g α/ y r 3 ± g α/ r 40 r 1 ± 1 g α/ z 1 zr 1 ± g α/ r 1 ± y 1 1 r 1 1 t 1;α/ r r ± g α/ z 1 zr ± g α/ z 1 zr ± Z ;α/ r3 r 3 ± g 1+r α/ y 1 yr 3 ± g α/ y r 3 ± g α/

15 Table 6: Average half-width of the five 1 α-cofidece itervals marked i Table 5 largest stadard error The smallest half-width for each combiatio of, ρ ad α is i bold. α Cofidece Iterval z 1 zr 1 ± g α/ 1 r r 1 ± 1 1 t 1;α/ z 1 zr ± g α/ 1 y 1 z 1 zr ± Z ;α/ y r 3 ± g α/ 0 z 1 zr 1 ± g α/ 1 r r 1 ± 1 1 t 1;α/ z 1 zr ± g α/ 1 z 1 zr ± Z ;α/ y r 3 ± g α/ y 1 40 z 1 zr 1 ± g α/ 1 r r 1 ± 1 1 t 1;α/ z 1 zr ± g α/ 1 z 1 zr ± Z ;α/ y r 3 ± g α/ y z 1 zr 1 ± g α/ 1 r r 1 ± 1 1 t 1;α/ z 1 zr ± g α/ 1 z 1 zr ± Z ;α/ y r 3 ± g α/ y 1 0 z 1 zr 1 ± g α/ 1 r r 1 ± 1 1 t 1;α/ z 1 zr ± g α/ 1 z 1 zr ± Z ;α/ ρ

16 y 1 y r 3 ± g α/ 40 z 1 zr 1 ± g α/ r 1 ± z 1 zr ± g α/ 1 y 1 1 r 1 1 t 1;α/ z 1 zr ± Z ;α/ y r 3 ± g α/ z 1 zr 1 ± g α/ 1 r r 1 ± 1 1 t 1;α/ z 1 zr ± g α/ 1 z 1 zr ± Z ;α/ y r 3 ± g α/ y 1 0 z 1 zr 1 ± g α/ 1 r r 1 ± 1 1 t 1;α/ z 1 zr ± g α/ 1 z 1 zr ± Z ;α/ y r 3 ± g α/ y 1 40 z 1 zr 1 ± g α/ 1 r r 1 ± 1 1 t 1;α/ z 1 zr ± g α/ 1 z 1 zr ± Z ;α/ y r 3 ± g α/ y estimator for yρ: ȳ w r 3 = q k=1 k yr k 3 q k=1 k N yρ, 1 q. 33 k=1 k This provides a approximate 1 α cofidece iterval for yρ, which is the iverted to obtai 14

17 a approximate 1 α cofidece iterval for ρ: y 1 ȳ w r 3 ± g α/ q k=1 k. 34 If the sample sizes are small or moderate, however, the ȳ w r 3 should be replaced by ȳ,w r 3 = q k=1 k y kr k 3 q k=1 k N ȳ,w ρ, 1 q, 35 k=1 k where ȳ,w ρ = m,w ρ yρ [ q ] k=1 k m kρ q yρ. 36 k=1 k The 35 provides a approximate 1 α cofidece iterval for ȳ,w ρ, which is strictly icreasig i ρ hece ca be iverted to provide a approximate 1 α cofidece iterval for ρ: ȳ 1,w ȳ,w r 3 ± g α/ q k=1 k. 37 If the sample sizes are equal, i.e., 1 = = q, the 35 ad 37 simplify to ȳ r 3 = 1 q q k=1 y r k 3 N y ρ, 1, 38 q ȳ 1 ȳ r 3 ± g α/. 39 q Lastly, if it is expected that the commo ρ is small, the the fidigs i.4 suggest that yr k 3 be replaced by the z-trasforms zr k of the Model MLEs to obtai a more precise estimate of ρ. Thus 33 would be replaced by recall 17 z w r = q k=1 k 1zr k q k=1 k 1 N zρ, 1 q, 40 k=1 k 1 which provides a approximate 1 α cofidece iterval for zρ that is the iverted to obtai 15

18 a approximate 1 α cofidece iterval for ρ: z 1 z w r ± g α/ q k=1 k Of course, if the uderlyig bivariate ormal data is available from the q populatios, the these data should be combied ito a sigle sample of size q from which a more efficiet estimate of the commo ρ ca be obtaied by the methods of Sectio. However, the weighted estimates obtaied here would still be useful for testig homogeeity of the ρ k as ow described. ii The homogeeity hypothesis H 0 : ρ 1 = = ρ q is equivalet to homogeeity of the y- trasforms: yρ 1 = = yρ q. To test this agaist the geeral alterative, if the sample sizes are large we may reject H 0 for large values of the weighted chi-square statistic T y,w = q k=1 k yr k 3 ȳ w r 3, 4 distributed approximately as χ q 1 uder H 0. If the samples sizes are ot large but equal, i.e., 1 = = q, the H 0 is equivalet to homogeeity of the modified y trasforms, i.e., y ρ 1 = = y ρ q, so T y,w ca be replaced by the statistic T y = q k=1 y r k 3 ȳ r 3, 43 distributed approximately as χ q 1 uder H 0. However, if the sample sizes are ot equal the H 0 is ot equivalet to homogeeity of y 1ρ 1,..., y qρ q, so the weighted test statistic T y,w = q k=1 k y kr k 3 ȳ,w r 3 44 is ot ecessarily appropriate for testig H 0. Fially, if it is expected that ρ 1,..., ρ q are small, the we would reject H 0 for large values 16

19 of the weighted chi-square statistic T z,w = q k=1 k 1 zr k z w r, 45 also distributed approximately as χ q 1 uder H Testig ρ = 0 bivariate idepedece uder Model 3. Fosdick ad Raftery 01 also discussed the problem of testig ρ = 0 vs. ρ > 0 i a sigle bivariate ormal populatio with kow variaces. They cosidered tests that reject H 0 for large values of r 1, r 3, ad r 4 ad their variats, as well as several Bayes tests, determiig the critical of these tests by Mote Carlo simulatio. Here we add a few observatios about these tests ad some others for this testig problem. Exact tests for idepedece are well kow for Models 1 ad. Uder Model 1 the test that rejects ρ = 0 for large values of r 1 is the uiformly most powerful ubiased UMPU test for ρ = 0 vs. ρ > 0 Lehma 1986, Whe ρ = 0 it follows from 14 that 1 r 1 1 r 1 t 1, 46 from which the exact ull distributio of this test is readily obtaied. Uder Model the problem of testig ρ = 0 vs. ρ > 0 is equivalet to the problem of comparig two ormal variaces, i.e., testig σ u = σ v vs. σ u > σ v recall. The F -test that rejects ρ = 0 if 1+r 1 r u v > F,;α is UMPU level α uder Model Lehma 1986, 5.3. Therefore this test is exact for testig ρ = 0 vs. ρ > 0 uder Model 3 as well, ad should perform reasoably well there. Note that σ u + σ v = uder Model 3. Uder Model 3, it follows from 3 that the pdf of x i, y i does ot have mootoe likelihood ratio MLR ad that o uiformly most powerful UMP test exists for ρ = 0 vs. ρ > 0. I fact, for a fixed alterative ρ 1 > 0 the most powerful level α test rejects ρ = 0 iff y > ρ 1s x + s y + c α 47 17

20 where c α is chose to attai size α. Thus the locally most powerful LMP level α test for alteratives ρ 1 0 rejects H 0 iff r 4 = y > c α, whereas the amptotically most powerful AMP level α test for alteratives ρ 1 1 rejects H 0 iff y s x s y > c α, equivaletly, iff v < χ ;1 α recall 19 ad 1. Sice these two tests are differet, o UMP test exists uder Model 3. The exact test based o r has a iterestig albeit limited optimality property uder Model 3. If we set c α = 0 i 47, it follows that the test that rejects ρ = 0 if r > ρ 1 is the MP test of its size for the fixed alterative ρ 1 > 0. By 3 this size is give by αρ 1 Pr[r > ρ 1 ρ = 0] = 1 F, e zρ 1, 48 where F, deotes the cdf of the F, distributio. Values of αρ 1 are show i Table 7. Table 7: Size αρ 1 of the MP test of ρ = 0 vs ρ = ρ 1 that rejects whe r > ρ 1. ρ e-4 1.3e e-7 <1e e-5 8.3e-8 <1e-10 <1e-10 Uder Model 3 the powers of the size.05 tests for ρ = 0 vs. ρ > 0 based o r 1, r, r 3, r 4 the LMP test ad v the AMP test are compared via simulatio 8 i Table 8. The LMP AMP test is domiated i power by the other three tests for all except very small very large values of the alterative ρ 1, so is ot recommeded. The r 1 test is domiated by the r test but oly slightly, which suggests that for testig purposes ot much power is gaied from the kowledge that the variaces are equal. For sample size = 10 the r test domiates the r 3 test for ρ 1.3 while the reverse is true whe ρ 1.5. For sample sizes 0 ad 40 this crossover occurs for ρ 1.1,.3. Thus we recommed either the r or r 3 test uder Model 3, depedig o whether small values or large 8 The size.05 critical values for the tests based o r 3 ad r 4 also were obtaied by simulatio, whereas the exact critical values were used for the other three tests. 18

21 Table 8: Power whe testig ρ = 0 vs ρ > 0 for various alteratives ρ 1 at the α = 0.05 sigificace level ρ Test Rejectio Criterio r 1 > t 1;α r 1 1+r 1 r > F,;α r 3 > c α, r 4 > c α, v < χ ;1 α r 1 > t 1;α r 1 1+r 1 r > F,;α r 3 > c α, r 4 > c α, v < χ ;1 α r 1 > t 1;α r 1 1+r 1 r > F,;α r 3 > c α, r 4 > c α, v < χ ;1 α values of the alterative ρ 1 are of most iterest. Because the r test is exact, it is easier to apply tha the r 3 test, so the r test might be recommeded for Model 3 o this basis. Remark. Uder Model 3, E ρ s x + s y = for all ρ, but s x + s y is ot a acillary statistic. Noetheless it might be of iterest to cosider the coditioal test based o the coditioal distributio of y give s x + s y. Refereces Aderso, T. W A Itroductio to Multivariate Statistical Aalysis, d editio. Wiley, New York. Fosdick, B. K. ad Raftery, A. E. 01. Estimatig the correlatio i bivariate ormal data with kow variaces ad small sample sizes. The America Statisticia, 661: Lehma, E. L Theory of Poit Estimatio. Wiley, New York. 19

22 Lehma, E. L Testig Statistical Hypotheses, d editio. Wiley, New York. Nelder, J. A. ad Mead, R A simplex method for fuctio miimizatio. Computer Joural, 7: Sedecor, G. W. ad Cochra, W. G Statistical Methods. Iowa State Uiversity Press, Ames, Iowa. Stuart, A. ad Ord, J. K. 1987, Kedall s Advaced Theory of Statistics, Volumes 1 ad. Oxford Uiversity Press, New York. Appedix: Choosig a, b, ad c for y The y ρ trasformatio has the form y ρ = 1 a /10 e b yρ c yρ. The costats a, b, ad c are defied as a, b, c = argmi fa, b, c, where a,b,c [ fa, b, c = max 1 α Pr y ρ y r 3 ± g ] α/, N, α L, ρ R N = {10, 0, 40}, L = {0.01, 0.05, 0.1}, ad R = {0, 0.1, 0.3, 0.5, 0.7, 0.9}. The procedure used to search for a, b, c is outlied below; the results associated with each step are also icluded. 1. Fid the three triples ã, b, c o the grid A B C that result i the smallest values of fã, b, c, where A = B = C = {0.1, 0.,...,.9, 3.0}. ã b c fã, b, c

23 . Iitialize the Nelder-Mead optimizatio algorithm i R at each of the ã, b, c triples foud i Step 1 ad obtai three improved triples a, b, c. Iitial Values Improved Values ã b c a b c fa, b, c From the three improved triples obtaied i Step, select that which results i the smallest value of fa, b, c. a, b, c = 0.403, 1.091,

Properties and Hypothesis Testing

Properties and Hypothesis Testing Chapter 3 Properties ad Hypothesis Testig 3.1 Types of data The regressio techiques developed i previous chapters ca be applied to three differet kids of data. 1. Cross-sectioal data. 2. Time series data.

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2016 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

1 Inferential Methods for Correlation and Regression Analysis

1 Inferential Methods for Correlation and Regression Analysis 1 Iferetial Methods for Correlatio ad Regressio Aalysis I the chapter o Correlatio ad Regressio Aalysis tools for describig bivariate cotiuous data were itroduced. The sample Pearso Correlatio Coefficiet

More information

Since X n /n P p, we know that X n (n. Xn (n X n ) Using the asymptotic result above to obtain an approximation for fixed n, we obtain

Since X n /n P p, we know that X n (n. Xn (n X n ) Using the asymptotic result above to obtain an approximation for fixed n, we obtain Assigmet 9 Exercise 5.5 Let X biomial, p, where p 0, 1 is ukow. Obtai cofidece itervals for p i two differet ways: a Sice X / p d N0, p1 p], the variace of the limitig distributio depeds oly o p. Use the

More information

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n.

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n. Jauary 1, 2019 Resamplig Methods Motivatio We have so may estimators with the property θ θ d N 0, σ 2 We ca also write θ a N θ, σ 2 /, where a meas approximately distributed as Oce we have a cosistet estimator

More information

MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND.

MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND. XI-1 (1074) MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND. R. E. D. WOOLSEY AND H. S. SWANSON XI-2 (1075) STATISTICAL DECISION MAKING Advaced

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

Stat 200 -Testing Summary Page 1

Stat 200 -Testing Summary Page 1 Stat 00 -Testig Summary Page 1 Mathematicias are like Frechme; whatever you say to them, they traslate it ito their ow laguage ad forthwith it is somethig etirely differet Goethe 1 Large Sample Cofidece

More information

A statistical method to determine sample size to estimate characteristic value of soil parameters

A statistical method to determine sample size to estimate characteristic value of soil parameters A statistical method to determie sample size to estimate characteristic value of soil parameters Y. Hojo, B. Setiawa 2 ad M. Suzuki 3 Abstract Sample size is a importat factor to be cosidered i determiig

More information

6 Sample Size Calculations

6 Sample Size Calculations 6 Sample Size Calculatios Oe of the major resposibilities of a cliical trial statisticia is to aid the ivestigators i determiig the sample size required to coduct a study The most commo procedure for determiig

More information

Stat 319 Theory of Statistics (2) Exercises

Stat 319 Theory of Statistics (2) Exercises Kig Saud Uiversity College of Sciece Statistics ad Operatios Research Departmet Stat 39 Theory of Statistics () Exercises Refereces:. Itroductio to Mathematical Statistics, Sixth Editio, by R. Hogg, J.

More information

Econ 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara

Econ 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara Poit Estimator Eco 325 Notes o Poit Estimator ad Cofidece Iterval 1 By Hiro Kasahara Parameter, Estimator, ad Estimate The ormal probability desity fuctio is fully characterized by two costats: populatio

More information

Introductory statistics

Introductory statistics CM9S: Machie Learig for Bioiformatics Lecture - 03/3/06 Itroductory statistics Lecturer: Sriram Sakararama Scribe: Sriram Sakararama We will provide a overview of statistical iferece focussig o the key

More information

1 Introduction to reducing variance in Monte Carlo simulations

1 Introduction to reducing variance in Monte Carlo simulations Copyright c 010 by Karl Sigma 1 Itroductio to reducig variace i Mote Carlo simulatios 11 Review of cofidece itervals for estimatig a mea I statistics, we estimate a ukow mea µ = E(X) of a distributio by

More information

Lecture 6 Simple alternatives and the Neyman-Pearson lemma

Lecture 6 Simple alternatives and the Neyman-Pearson lemma STATS 00: Itroductio to Statistical Iferece Autum 06 Lecture 6 Simple alteratives ad the Neyma-Pearso lemma Last lecture, we discussed a umber of ways to costruct test statistics for testig a simple ull

More information

1.010 Uncertainty in Engineering Fall 2008

1.010 Uncertainty in Engineering Fall 2008 MIT OpeCourseWare http://ocw.mit.edu.00 Ucertaity i Egieerig Fall 2008 For iformatio about citig these materials or our Terms of Use, visit: http://ocw.mit.edu.terms. .00 - Brief Notes # 9 Poit ad Iterval

More information

Stat 421-SP2012 Interval Estimation Section

Stat 421-SP2012 Interval Estimation Section Stat 41-SP01 Iterval Estimatio Sectio 11.1-11. We ow uderstad (Chapter 10) how to fid poit estimators of a ukow parameter. o However, a poit estimate does ot provide ay iformatio about the ucertaity (possible

More information

Chapter 6 Principles of Data Reduction

Chapter 6 Principles of Data Reduction Chapter 6 for BST 695: Special Topics i Statistical Theory. Kui Zhag, 0 Chapter 6 Priciples of Data Reductio Sectio 6. Itroductio Goal: To summarize or reduce the data X, X,, X to get iformatio about a

More information

Chapter 13: Tests of Hypothesis Section 13.1 Introduction

Chapter 13: Tests of Hypothesis Section 13.1 Introduction Chapter 13: Tests of Hypothesis Sectio 13.1 Itroductio RECAP: Chapter 1 discussed the Likelihood Ratio Method as a geeral approach to fid good test procedures. Testig for the Normal Mea Example, discussed

More information

EECS564 Estimation, Filtering, and Detection Hwk 2 Solns. Winter p θ (z) = (2θz + 1 θ), 0 z 1

EECS564 Estimation, Filtering, and Detection Hwk 2 Solns. Winter p θ (z) = (2θz + 1 θ), 0 z 1 EECS564 Estimatio, Filterig, ad Detectio Hwk 2 Sols. Witer 25 4. Let Z be a sigle observatio havig desity fuctio where. p (z) = (2z + ), z (a) Assumig that is a oradom parameter, fid ad plot the maximum

More information

Double Stage Shrinkage Estimator of Two Parameters. Generalized Exponential Distribution

Double Stage Shrinkage Estimator of Two Parameters. Generalized Exponential Distribution Iteratioal Mathematical Forum, Vol., 3, o. 3, 3-53 HIKARI Ltd, www.m-hikari.com http://dx.doi.org/.9/imf.3.335 Double Stage Shrikage Estimator of Two Parameters Geeralized Expoetial Distributio Alaa M.

More information

Random Variables, Sampling and Estimation

Random Variables, Sampling and Estimation Chapter 1 Radom Variables, Samplig ad Estimatio 1.1 Itroductio This chapter will cover the most importat basic statistical theory you eed i order to uderstad the ecoometric material that will be comig

More information

5. Likelihood Ratio Tests

5. Likelihood Ratio Tests 1 of 5 7/29/2009 3:16 PM Virtual Laboratories > 9. Hy pothesis Testig > 1 2 3 4 5 6 7 5. Likelihood Ratio Tests Prelimiaries As usual, our startig poit is a radom experimet with a uderlyig sample space,

More information

2 1. The r.s., of size n2, from population 2 will be. 2 and 2. 2) The two populations are independent. This implies that all of the n1 n2

2 1. The r.s., of size n2, from population 2 will be. 2 and 2. 2) The two populations are independent. This implies that all of the n1 n2 Chapter 8 Comparig Two Treatmets Iferece about Two Populatio Meas We wat to compare the meas of two populatios to see whether they differ. There are two situatios to cosider, as show i the followig examples:

More information

LECTURE 14 NOTES. A sequence of α-level tests {ϕ n (x)} is consistent if

LECTURE 14 NOTES. A sequence of α-level tests {ϕ n (x)} is consistent if LECTURE 14 NOTES 1. Asymptotic power of tests. Defiitio 1.1. A sequece of -level tests {ϕ x)} is cosistet if β θ) := E θ [ ϕ x) ] 1 as, for ay θ Θ 1. Just like cosistecy of a sequece of estimators, Defiitio

More information

Goodness-Of-Fit For The Generalized Exponential Distribution. Abstract

Goodness-Of-Fit For The Generalized Exponential Distribution. Abstract Goodess-Of-Fit For The Geeralized Expoetial Distributio By Amal S. Hassa stitute of Statistical Studies & Research Cairo Uiversity Abstract Recetly a ew distributio called geeralized expoetial or expoetiated

More information

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample. Statistical Iferece (Chapter 10) Statistical iferece = lear about a populatio based o the iformatio provided by a sample. Populatio: The set of all values of a radom variable X of iterest. Characterized

More information

Some Properties of the Exact and Score Methods for Binomial Proportion and Sample Size Calculation

Some Properties of the Exact and Score Methods for Binomial Proportion and Sample Size Calculation Some Properties of the Exact ad Score Methods for Biomial Proportio ad Sample Size Calculatio K. KRISHNAMOORTHY AND JIE PENG Departmet of Mathematics, Uiversity of Louisiaa at Lafayette Lafayette, LA 70504-1010,

More information

Goodness-of-Fit Tests and Categorical Data Analysis (Devore Chapter Fourteen)

Goodness-of-Fit Tests and Categorical Data Analysis (Devore Chapter Fourteen) Goodess-of-Fit Tests ad Categorical Data Aalysis (Devore Chapter Fourtee) MATH-252-01: Probability ad Statistics II Sprig 2019 Cotets 1 Chi-Squared Tests with Kow Probabilities 1 1.1 Chi-Squared Testig................

More information

Because it tests for differences between multiple pairs of means in one test, it is called an omnibus test.

Because it tests for differences between multiple pairs of means in one test, it is called an omnibus test. Math 308 Sprig 018 Classes 19 ad 0: Aalysis of Variace (ANOVA) Page 1 of 6 Itroductio ANOVA is a statistical procedure for determiig whether three or more sample meas were draw from populatios with equal

More information

t distribution [34] : used to test a mean against an hypothesized value (H 0 : µ = µ 0 ) or the difference

t distribution [34] : used to test a mean against an hypothesized value (H 0 : µ = µ 0 ) or the difference EXST30 Backgroud material Page From the textbook The Statistical Sleuth Mea [0]: I your text the word mea deotes a populatio mea (µ) while the work average deotes a sample average ( ). Variace [0]: The

More information

Problem Set 4 Due Oct, 12

Problem Set 4 Due Oct, 12 EE226: Radom Processes i Systems Lecturer: Jea C. Walrad Problem Set 4 Due Oct, 12 Fall 06 GSI: Assae Gueye This problem set essetially reviews detectio theory ad hypothesis testig ad some basic otios

More information

Chapter 6 Sampling Distributions

Chapter 6 Sampling Distributions Chapter 6 Samplig Distributios 1 I most experimets, we have more tha oe measuremet for ay give variable, each measuremet beig associated with oe radomly selected a member of a populatio. Hece we eed to

More information

Linear regression. Daniel Hsu (COMS 4771) (y i x T i β)2 2πσ. 2 2σ 2. 1 n. (x T i β y i ) 2. 1 ˆβ arg min. β R n d

Linear regression. Daniel Hsu (COMS 4771) (y i x T i β)2 2πσ. 2 2σ 2. 1 n. (x T i β y i ) 2. 1 ˆβ arg min. β R n d Liear regressio Daiel Hsu (COMS 477) Maximum likelihood estimatio Oe of the simplest liear regressio models is the followig: (X, Y ),..., (X, Y ), (X, Y ) are iid radom pairs takig values i R d R, ad Y

More information

Sample Size Estimation in the Proportional Hazards Model for K-sample or Regression Settings Scott S. Emerson, M.D., Ph.D.

Sample Size Estimation in the Proportional Hazards Model for K-sample or Regression Settings Scott S. Emerson, M.D., Ph.D. ample ie Estimatio i the Proportioal Haards Model for K-sample or Regressio ettigs cott. Emerso, M.D., Ph.D. ample ie Formula for a Normally Distributed tatistic uppose a statistic is kow to be ormally

More information

Tests of Hypotheses Based on a Single Sample (Devore Chapter Eight)

Tests of Hypotheses Based on a Single Sample (Devore Chapter Eight) Tests of Hypotheses Based o a Sigle Sample Devore Chapter Eight MATH-252-01: Probability ad Statistics II Sprig 2018 Cotets 1 Hypothesis Tests illustrated with z-tests 1 1.1 Overview of Hypothesis Testig..........

More information

Approximations to the Distribution of the Sample Correlation Coefficient

Approximations to the Distribution of the Sample Correlation Coefficient World Academy of Sciece Egieerig ad Techology Iteratioal Joural of Mathematical ad Computatioal Scieces Vol:5 No:4 0 Approximatios to the Distributio of the Sample Correlatio Coefficiet Joh N Haddad ad

More information

Output Analysis (2, Chapters 10 &11 Law)

Output Analysis (2, Chapters 10 &11 Law) B. Maddah ENMG 6 Simulatio Output Aalysis (, Chapters 10 &11 Law) Comparig alterative system cofiguratio Sice the output of a simulatio is radom, the comparig differet systems via simulatio should be doe

More information

( θ. sup θ Θ f X (x θ) = L. sup Pr (Λ (X) < c) = α. x : Λ (x) = sup θ H 0. sup θ Θ f X (x θ) = ) < c. NH : θ 1 = θ 2 against AH : θ 1 θ 2

( θ. sup θ Θ f X (x θ) = L. sup Pr (Λ (X) < c) = α. x : Λ (x) = sup θ H 0. sup θ Θ f X (x θ) = ) < c. NH : θ 1 = θ 2 against AH : θ 1 θ 2 82 CHAPTER 4. MAXIMUM IKEIHOOD ESTIMATION Defiitio: et X be a radom sample with joit p.m/d.f. f X x θ. The geeralised likelihood ratio test g.l.r.t. of the NH : θ H 0 agaist the alterative AH : θ H 1,

More information

x = Pr ( X (n) βx ) =

x = Pr ( X (n) βx ) = Exercise 93 / page 45 The desity of a variable X i i 1 is fx α α a For α kow let say equal to α α > fx α α x α Pr X i x < x < Usig a Pivotal Quatity: x α 1 < x < α > x α 1 ad We solve i a similar way as

More information

Lecture 2: Monte Carlo Simulation

Lecture 2: Monte Carlo Simulation STAT/Q SCI 43: Itroductio to Resamplig ethods Sprig 27 Istructor: Ye-Chi Che Lecture 2: ote Carlo Simulatio 2 ote Carlo Itegratio Assume we wat to evaluate the followig itegratio: e x3 dx What ca we do?

More information

Output Analysis and Run-Length Control

Output Analysis and Run-Length Control IEOR E4703: Mote Carlo Simulatio Columbia Uiversity c 2017 by Marti Haugh Output Aalysis ad Ru-Legth Cotrol I these otes we describe how the Cetral Limit Theorem ca be used to costruct approximate (1 α%

More information

Section 14. Simple linear regression.

Section 14. Simple linear regression. Sectio 14 Simple liear regressio. Let us look at the cigarette dataset from [1] (available to dowload from joural s website) ad []. The cigarette dataset cotais measuremets of tar, icotie, weight ad carbo

More information

Confidence Interval for Standard Deviation of Normal Distribution with Known Coefficients of Variation

Confidence Interval for Standard Deviation of Normal Distribution with Known Coefficients of Variation Cofidece Iterval for tadard Deviatio of Normal Distributio with Kow Coefficiets of Variatio uparat Niwitpog Departmet of Applied tatistics, Faculty of Applied ciece Kig Mogkut s Uiversity of Techology

More information

Efficient GMM LECTURE 12 GMM II

Efficient GMM LECTURE 12 GMM II DECEMBER 1 010 LECTURE 1 II Efficiet The estimator depeds o the choice of the weight matrix A. The efficiet estimator is the oe that has the smallest asymptotic variace amog all estimators defied by differet

More information

Overview. p 2. Chapter 9. Pooled Estimate of. q = 1 p. Notation for Two Proportions. Inferences about Two Proportions. Assumptions

Overview. p 2. Chapter 9. Pooled Estimate of. q = 1 p. Notation for Two Proportions. Inferences about Two Proportions. Assumptions Chapter 9 Slide Ifereces from Two Samples 9- Overview 9- Ifereces about Two Proportios 9- Ifereces about Two Meas: Idepedet Samples 9-4 Ifereces about Matched Pairs 9-5 Comparig Variatio i Two Samples

More information

Module 1 Fundamentals in statistics

Module 1 Fundamentals in statistics Normal Distributio Repeated observatios that differ because of experimetal error ofte vary about some cetral value i a roughly symmetrical distributio i which small deviatios occur much more frequetly

More information

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY GRADUATE DIPLOMA, 016 MODULE : Statistical Iferece Time allowed: Three hours Cadidates should aswer FIVE questios. All questios carry equal marks. The umber

More information

Let us give one more example of MLE. Example 3. The uniform distribution U[0, θ] on the interval [0, θ] has p.d.f.

Let us give one more example of MLE. Example 3. The uniform distribution U[0, θ] on the interval [0, θ] has p.d.f. Lecture 5 Let us give oe more example of MLE. Example 3. The uiform distributio U[0, ] o the iterval [0, ] has p.d.f. { 1 f(x =, 0 x, 0, otherwise The likelihood fuctio ϕ( = f(x i = 1 I(X 1,..., X [0,

More information

Direction: This test is worth 250 points. You are required to complete this test within 50 minutes.

Direction: This test is worth 250 points. You are required to complete this test within 50 minutes. Term Test October 3, 003 Name Math 56 Studet Number Directio: This test is worth 50 poits. You are required to complete this test withi 50 miutes. I order to receive full credit, aswer each problem completely

More information

Power and Type II Error

Power and Type II Error Statistical Methods I (EXST 7005) Page 57 Power ad Type II Error Sice we do't actually kow the value of the true mea (or we would't be hypothesizig somethig else), we caot kow i practice the type II error

More information

Frequentist Inference

Frequentist Inference Frequetist Iferece The topics of the ext three sectios are useful applicatios of the Cetral Limit Theorem. Without kowig aythig about the uderlyig distributio of a sequece of radom variables {X i }, for

More information

Convergence of random variables. (telegram style notes) P.J.C. Spreij

Convergence of random variables. (telegram style notes) P.J.C. Spreij Covergece of radom variables (telegram style otes).j.c. Spreij this versio: September 6, 2005 Itroductio As we kow, radom variables are by defiitio measurable fuctios o some uderlyig measurable space

More information

Table 12.1: Contingency table. Feature b. 1 N 11 N 12 N 1b 2 N 21 N 22 N 2b. ... a N a1 N a2 N ab

Table 12.1: Contingency table. Feature b. 1 N 11 N 12 N 1b 2 N 21 N 22 N 2b. ... a N a1 N a2 N ab Sectio 12 Tests of idepedece ad homogeeity I this lecture we will cosider a situatio whe our observatios are classified by two differet features ad we would like to test if these features are idepedet

More information

This is an introductory course in Analysis of Variance and Design of Experiments.

This is an introductory course in Analysis of Variance and Design of Experiments. 1 Notes for M 384E, Wedesday, Jauary 21, 2009 (Please ote: I will ot pass out hard-copy class otes i future classes. If there are writte class otes, they will be posted o the web by the ight before class

More information

Chapter 13, Part A Analysis of Variance and Experimental Design

Chapter 13, Part A Analysis of Variance and Experimental Design Slides Prepared by JOHN S. LOUCKS St. Edward s Uiversity Slide 1 Chapter 13, Part A Aalysis of Variace ad Eperimetal Desig Itroductio to Aalysis of Variace Aalysis of Variace: Testig for the Equality of

More information

STATISTICAL INFERENCE

STATISTICAL INFERENCE STATISTICAL INFERENCE POPULATION AND SAMPLE Populatio = all elemets of iterest Characterized by a distributio F with some parameter θ Sample = the data X 1,..., X, selected subset of the populatio = sample

More information

Investigating the Significance of a Correlation Coefficient using Jackknife Estimates

Investigating the Significance of a Correlation Coefficient using Jackknife Estimates Iteratioal Joural of Scieces: Basic ad Applied Research (IJSBAR) ISSN 2307-4531 (Prit & Olie) http://gssrr.org/idex.php?joural=jouralofbasicadapplied ---------------------------------------------------------------------------------------------------------------------------

More information

Summary. Recap ... Last Lecture. Summary. Theorem

Summary. Recap ... Last Lecture. Summary. Theorem Last Lecture Biostatistics 602 - Statistical Iferece Lecture 23 Hyu Mi Kag April 11th, 2013 What is p-value? What is the advatage of p-value compared to hypothesis testig procedure with size α? How ca

More information

G. R. Pasha Department of Statistics Bahauddin Zakariya University Multan, Pakistan

G. R. Pasha Department of Statistics Bahauddin Zakariya University Multan, Pakistan Deviatio of the Variaces of Classical Estimators ad Negative Iteger Momet Estimator from Miimum Variace Boud with Referece to Maxwell Distributio G. R. Pasha Departmet of Statistics Bahauddi Zakariya Uiversity

More information

32 estimating the cumulative distribution function

32 estimating the cumulative distribution function 32 estimatig the cumulative distributio fuctio 4.6 types of cofidece itervals/bads Let F be a class of distributio fuctios F ad let θ be some quatity of iterest, such as the mea of F or the whole fuctio

More information

Common Large/Small Sample Tests 1/55

Common Large/Small Sample Tests 1/55 Commo Large/Small Sample Tests 1/55 Test of Hypothesis for the Mea (σ Kow) Covert sample result ( x) to a z value Hypothesis Tests for µ Cosider the test H :μ = μ H 1 :μ > μ σ Kow (Assume the populatio

More information

7-1. Chapter 4. Part I. Sampling Distributions and Confidence Intervals

7-1. Chapter 4. Part I. Sampling Distributions and Confidence Intervals 7-1 Chapter 4 Part I. Samplig Distributios ad Cofidece Itervals 1 7- Sectio 1. Samplig Distributio 7-3 Usig Statistics Statistical Iferece: Predict ad forecast values of populatio parameters... Test hypotheses

More information

Bayesian Methods: Introduction to Multi-parameter Models

Bayesian Methods: Introduction to Multi-parameter Models Bayesia Methods: Itroductio to Multi-parameter Models Parameter: θ = ( θ, θ) Give Likelihood p(y θ) ad prior p(θ ), the posterior p proportioal to p(y θ) x p(θ ) Margial posterior ( θ, θ y) is Iterested

More information

Last Lecture. Unbiased Test

Last Lecture. Unbiased Test Last Lecture Biostatistics 6 - Statistical Iferece Lecture Uiformly Most Powerful Test Hyu Mi Kag March 8th, 3 What are the typical steps for costructig a likelihood ratio test? Is LRT statistic based

More information

Econ 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chi-square Distribution, Student s t distribution 1.

Econ 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chi-square Distribution, Student s t distribution 1. Eco 325/327 Notes o Sample Mea, Sample Proportio, Cetral Limit Theorem, Chi-square Distributio, Studet s t distributio 1 Sample Mea By Hiro Kasahara We cosider a radom sample from a populatio. Defiitio

More information

A proposed discrete distribution for the statistical modeling of

A proposed discrete distribution for the statistical modeling of It. Statistical Ist.: Proc. 58th World Statistical Cogress, 0, Dubli (Sessio CPS047) p.5059 A proposed discrete distributio for the statistical modelig of Likert data Kidd, Marti Cetre for Statistical

More information

Worksheet 23 ( ) Introduction to Simple Linear Regression (continued)

Worksheet 23 ( ) Introduction to Simple Linear Regression (continued) Worksheet 3 ( 11.5-11.8) Itroductio to Simple Liear Regressio (cotiued) This worksheet is a cotiuatio of Discussio Sheet 3; please complete that discussio sheet first if you have ot already doe so. This

More information

Statistical inference: example 1. Inferential Statistics

Statistical inference: example 1. Inferential Statistics Statistical iferece: example 1 Iferetial Statistics POPULATION SAMPLE A clothig store chai regularly buys from a supplier large quatities of a certai piece of clothig. Each item ca be classified either

More information

Simulation. Two Rule For Inverting A Distribution Function

Simulation. Two Rule For Inverting A Distribution Function Simulatio Two Rule For Ivertig A Distributio Fuctio Rule 1. If F(x) = u is costat o a iterval [x 1, x 2 ), the the uiform value u is mapped oto x 2 through the iversio process. Rule 2. If there is a jump

More information

Final Examination Solutions 17/6/2010

Final Examination Solutions 17/6/2010 The Islamic Uiversity of Gaza Faculty of Commerce epartmet of Ecoomics ad Political Scieces A Itroductio to Statistics Course (ECOE 30) Sprig Semester 009-00 Fial Eamiatio Solutios 7/6/00 Name: I: Istructor:

More information

TMA4245 Statistics. Corrected 30 May and 4 June Norwegian University of Science and Technology Department of Mathematical Sciences.

TMA4245 Statistics. Corrected 30 May and 4 June Norwegian University of Science and Technology Department of Mathematical Sciences. Norwegia Uiversity of Sciece ad Techology Departmet of Mathematical Scieces Corrected 3 May ad 4 Jue Solutios TMA445 Statistics Saturday 6 May 9: 3: Problem Sow desity a The probability is.9.5 6x x dx

More information

Department of Mathematics

Department of Mathematics Departmet of Mathematics Ma 3/103 KC Border Itroductio to Probability ad Statistics Witer 2017 Lecture 19: Estimatio II Relevat textbook passages: Larse Marx [1]: Sectios 5.2 5.7 19.1 The method of momets

More information

The variance of a sum of independent variables is the sum of their variances, since covariances are zero. Therefore. V (xi )= n n 2 σ2 = σ2.

The variance of a sum of independent variables is the sum of their variances, since covariances are zero. Therefore. V (xi )= n n 2 σ2 = σ2. SAMPLE STATISTICS A radom sample x 1,x,,x from a distributio f(x) is a set of idepedetly ad idetically variables with x i f(x) for all i Their joit pdf is f(x 1,x,,x )=f(x 1 )f(x ) f(x )= f(x i ) The sample

More information

Lecture 22: Review for Exam 2. 1 Basic Model Assumptions (without Gaussian Noise)

Lecture 22: Review for Exam 2. 1 Basic Model Assumptions (without Gaussian Noise) Lecture 22: Review for Exam 2 Basic Model Assumptios (without Gaussia Noise) We model oe cotiuous respose variable Y, as a liear fuctio of p umerical predictors, plus oise: Y = β 0 + β X +... β p X p +

More information

Agreement of CI and HT. Lecture 13 - Tests of Proportions. Example - Waiting Times

Agreement of CI and HT. Lecture 13 - Tests of Proportions. Example - Waiting Times Sigificace level vs. cofidece level Agreemet of CI ad HT Lecture 13 - Tests of Proportios Sta102 / BME102 Coli Rudel October 15, 2014 Cofidece itervals ad hypothesis tests (almost) always agree, as log

More information

Hypothesis Testing. Evaluation of Performance of Learned h. Issues. Trade-off Between Bias and Variance

Hypothesis Testing. Evaluation of Performance of Learned h. Issues. Trade-off Between Bias and Variance Hypothesis Testig Empirically evaluatig accuracy of hypotheses: importat activity i ML. Three questios: Give observed accuracy over a sample set, how well does this estimate apply over additioal samples?

More information

Recall the study where we estimated the difference between mean systolic blood pressure levels of users of oral contraceptives and non-users, x - y.

Recall the study where we estimated the difference between mean systolic blood pressure levels of users of oral contraceptives and non-users, x - y. Testig Statistical Hypotheses Recall the study where we estimated the differece betwee mea systolic blood pressure levels of users of oral cotraceptives ad o-users, x - y. Such studies are sometimes viewed

More information

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4 MATH 30: Probability ad Statistics 9. Estimatio ad Testig of Parameters Estimatio ad Testig of Parameters We have bee dealig situatios i which we have full kowledge of the distributio of a radom variable.

More information

A RANK STATISTIC FOR NON-PARAMETRIC K-SAMPLE AND CHANGE POINT PROBLEMS

A RANK STATISTIC FOR NON-PARAMETRIC K-SAMPLE AND CHANGE POINT PROBLEMS J. Japa Statist. Soc. Vol. 41 No. 1 2011 67 73 A RANK STATISTIC FOR NON-PARAMETRIC K-SAMPLE AND CHANGE POINT PROBLEMS Yoichi Nishiyama* We cosider k-sample ad chage poit problems for idepedet data i a

More information

Power Comparison of Some Goodness-of-fit Tests

Power Comparison of Some Goodness-of-fit Tests Florida Iteratioal Uiversity FIU Digital Commos FIU Electroic Theses ad Dissertatios Uiversity Graduate School 7-6-2016 Power Compariso of Some Goodess-of-fit Tests Tiayi Liu tliu019@fiu.edu DOI: 10.25148/etd.FIDC000750

More information

of the matrix is =-85, so it is not positive definite. Thus, the first

of the matrix is =-85, so it is not positive definite. Thus, the first BOSTON COLLEGE Departmet of Ecoomics EC771: Ecoometrics Sprig 4 Prof. Baum, Ms. Uysal Solutio Key for Problem Set 1 1. Are the followig quadratic forms positive for all values of x? (a) y = x 1 8x 1 x

More information

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING Lectures MODULE 5 STATISTICS II. Mea ad stadard error of sample data. Biomial distributio. Normal distributio 4. Samplig 5. Cofidece itervals

More information

HYPOTHESIS TESTS FOR ONE POPULATION MEAN WORKSHEET MTH 1210, FALL 2018

HYPOTHESIS TESTS FOR ONE POPULATION MEAN WORKSHEET MTH 1210, FALL 2018 HYPOTHESIS TESTS FOR ONE POPULATION MEAN WORKSHEET MTH 1210, FALL 2018 We are resposible for 2 types of hypothesis tests that produce ifereces about the ukow populatio mea, µ, each of which has 3 possible

More information

There is no straightforward approach for choosing the warmup period l.

There is no straightforward approach for choosing the warmup period l. B. Maddah INDE 504 Discrete-Evet Simulatio Output Aalysis () Statistical Aalysis for Steady-State Parameters I a otermiatig simulatio, the iterest is i estimatig the log ru steady state measures of performace.

More information

Estimation of Gumbel Parameters under Ranked Set Sampling

Estimation of Gumbel Parameters under Ranked Set Sampling Joural of Moder Applied Statistical Methods Volume 13 Issue 2 Article 11-2014 Estimatio of Gumbel Parameters uder Raked Set Samplig Omar M. Yousef Al Balqa' Applied Uiversity, Zarqa, Jorda, abuyaza_o@yahoo.com

More information

Statistics 20: Final Exam Solutions Summer Session 2007

Statistics 20: Final Exam Solutions Summer Session 2007 1. 20 poits Testig for Diabetes. Statistics 20: Fial Exam Solutios Summer Sessio 2007 (a) 3 poits Give estimates for the sesitivity of Test I ad of Test II. Solutio: 156 patiets out of total 223 patiets

More information

MATH/STAT 352: Lecture 15

MATH/STAT 352: Lecture 15 MATH/STAT 352: Lecture 15 Sectios 5.2 ad 5.3. Large sample CI for a proportio ad small sample CI for a mea. 1 5.2: Cofidece Iterval for a Proportio Estimatig proportio of successes i a biomial experimet

More information

Lecture Notes 15 Hypothesis Testing (Chapter 10)

Lecture Notes 15 Hypothesis Testing (Chapter 10) 1 Itroductio Lecture Notes 15 Hypothesis Testig Chapter 10) Let X 1,..., X p θ x). Suppose we we wat to kow if θ = θ 0 or ot, where θ 0 is a specific value of θ. For example, if we are flippig a coi, we

More information

This exam contains 19 pages (including this cover page) and 10 questions. A Formulae sheet is provided with the exam.

This exam contains 19 pages (including this cover page) and 10 questions. A Formulae sheet is provided with the exam. Probability ad Statistics FS 07 Secod Sessio Exam 09.0.08 Time Limit: 80 Miutes Name: Studet ID: This exam cotais 9 pages (icludig this cover page) ad 0 questios. A Formulae sheet is provided with the

More information

Asymptotic distribution of the first-stage F-statistic under weak IVs

Asymptotic distribution of the first-stage F-statistic under weak IVs November 6 Eco 59A WEAK INSTRUMENTS III Testig for Weak Istrumets From the results discussed i Weak Istrumets II we kow that at least i the case of a sigle edogeous regressor there are weak-idetificatio-robust

More information

Access to the published version may require journal subscription. Published with permission from: Elsevier.

Access to the published version may require journal subscription. Published with permission from: Elsevier. This is a author produced versio of a paper published i Statistics ad Probability Letters. This paper has bee peer-reviewed, it does ot iclude the joural pagiatio. Citatio for the published paper: Forkma,

More information

Expectation and Variance of a random variable

Expectation and Variance of a random variable Chapter 11 Expectatio ad Variace of a radom variable The aim of this lecture is to defie ad itroduce mathematical Expectatio ad variace of a fuctio of discrete & cotiuous radom variables ad the distributio

More information

Simple Linear Regression

Simple Linear Regression Simple Liear Regressio 1. Model ad Parameter Estimatio (a) Suppose our data cosist of a collectio of pairs (x i, y i ), where x i is a observed value of variable X ad y i is the correspodig observatio

More information

Last Lecture. Wald Test

Last Lecture. Wald Test Last Lecture Biostatistics 602 - Statistical Iferece Lecture 22 Hyu Mi Kag April 9th, 2013 Is the exact distributio of LRT statistic typically easy to obtai? How about its asymptotic distributio? For testig

More information

ECE 8527: Introduction to Machine Learning and Pattern Recognition Midterm # 1. Vaishali Amin Fall, 2015

ECE 8527: Introduction to Machine Learning and Pattern Recognition Midterm # 1. Vaishali Amin Fall, 2015 ECE 8527: Itroductio to Machie Learig ad Patter Recogitio Midterm # 1 Vaishali Ami Fall, 2015 tue39624@temple.edu Problem No. 1: Cosider a two-class discrete distributio problem: ω 1 :{[0,0], [2,0], [2,2],

More information

4. Partial Sums and the Central Limit Theorem

4. Partial Sums and the Central Limit Theorem 1 of 10 7/16/2009 6:05 AM Virtual Laboratories > 6. Radom Samples > 1 2 3 4 5 6 7 4. Partial Sums ad the Cetral Limit Theorem The cetral limit theorem ad the law of large umbers are the two fudametal theorems

More information

6.3 Testing Series With Positive Terms

6.3 Testing Series With Positive Terms 6.3. TESTING SERIES WITH POSITIVE TERMS 307 6.3 Testig Series With Positive Terms 6.3. Review of what is kow up to ow I theory, testig a series a i for covergece amouts to fidig the i= sequece of partial

More information

1 of 7 7/16/2009 6:06 AM Virtual Laboratories > 6. Radom Samples > 1 2 3 4 5 6 7 6. Order Statistics Defiitios Suppose agai that we have a basic radom experimet, ad that X is a real-valued radom variable

More information