A nonparametric two-sample wald test of equality of variances
|
|
- Magdalen Marilynn Jacobs
- 6 years ago
- Views:
Transcription
1 Unversty of Wollongong Research Onlne Centre for Statstcal & Survey Methodology Workng Paper Seres Faculty of Engneerng and Informaton Scences 0 A nonparametrc two-sample wald test of equalty of varances Davd Allngham Unversty of Newcastle John Rayner Unversty of Wollongong Recommended Ctaton Allngham, Davd and Rayner, John, A nonparametrc two-sample wald test of equalty of varances, Centre for Statstcal and Survey Methodology, Unversty of Wollongong, Workng Paper -, 0. Research Onlne s the open access nsttutonal repostory for the Unversty of Wollongong. For further nformaton contact the UOW Lbrary: research-pubs@uow.edu.au
2 A Nonparametrc Two-Sample Wald Test of Equalty of Varances Davd Allngham* and J. C. W. Rayner Abstract We develop a test for equalty of varances gven two ndependent random samples of observatons. The test can be expected to perform well when both sample szes are at least moderate and the sample varances are asymptotcally equvalent to the maxmum lkelhood estmators of the populaton varances. The test s motvated by and s here assessed for the case when both populatons sampled are assumed to be normal. Popular choces of test would be the two sample F test f normalty can be assumed and Levene s test f ths assumpton s dubous. Another compettor s the Wald test for the dfference n the populaton varances. We gve a nonparametrc analogue of ths test and call t the R test. In an ndcatve emprcal study when both populatons are normal, we fnd that for moderate sample szes the R test s nearly as robust as Levene s test and nearly as powerful as the F test. Key Words: Bartlett s test; Levene s test; Wald tests.. Introducton: Testng Equalty of Varances for Two Independent Samples In the two-sample problem we are gven two ndependent random samples X,..., X n and X,..., X n. The locaton problem attracts most attenton. Assumng that the samples are from normal populatons, the pooled t-test s used to test equalty of means assumng equal varances and Welch s test can be used when equalty of varances s suspect but normalty s not. When normalty s n doubt the Wlcoxon test s often used. The correspondng dsperson problem s of nterest to confrm the valdty of, for example, the pooled t-test, and for ts own sake. For example, testng for reduced varablty s of nterest n confrmng natural selecton (see, for example, [, Secton 5.5]) and f some processes are n control. In exploratory data analyss t s sensble to assess f one populaton s more varable than another. If t s, the cause may be that one populaton s b-modal and the other s not; the consequences of ths n both the scenaro and the model can then be explored n depth. The study here ntroduces a new test of equalty of varances. The asymptotc null dstrbuton of the test statstc s, but, dependng on the populatons sampled from, ths may or may not be satsfactory for small to moderate sample szes. To assess ths, and other aspects of the proposed test, an ndcatve emprcal study s undertaken. We gve comparsons when both populatons sampled are normal, when both samples have the same sample sze, and for 5% level tests. Frst *Author to whom correspondence should be addressed. Centre for Computer-Asssted Research Computaton and ts Applcatons, School of Mathematcal and Physcal Scences, Unversty of Newcastle, NSW 308, Australa e-mal: Davd.Allngham@newcastle.edu.au Centre for Statstcal and Survey Methodology, School of Mathematcs and Appled Statstcs, Unversty of Wollongong, NSW 5, Australa and School of Mathematcal and Physcal Scences, Unversty of Newcastle, NSW 308, Australa e-mal: John.Rayner@newcastle.edu.au
3 we derve a fnte sample correcton to the crtcal values based on the asymptotc null dstrbuton when samplng from normal populatons. Dfferent correctons would be needed when samplng from dfferent dstrbutons. Next we show that n moderate samples the new test s nearly as powerful as the F test when normalty may be assumed, and fnally, that t s nearly as robust as Levene s test when normalty s n doubt. Moore and McCabe [, p.59] clam that the F test and other procedures for nference about varances are so lackng n robustness as to be of lttle use n practce. The new test gves a counterexample to that proposton. We acknowledge that the new test s most effectve for at least moderate sample szes. In the normal case each random sample should have at least 5 observatons, whch s what we would expect of a serous study amng at reasonable power that cannot be hoped for wth samples of sze 0 or so. See Secton. We are aware of more expansve comparatve studes such as [3 and ]. Our goal here s not to emulate these studes but to show that the new test s compettve n terms of test sze, robustness and power. In Secton the new test s ntroduced. Sectons 3, and 5 gve the results of an emprcal nvestgaton when the populatons sampled are assumed to be normal. In Secton 3 we nvestgate test sze, showng that the asymptotc crtcal values should only be used for moderate to large sample szes. For smaller sample szes a fnte sample correcton to the asymptotc 5% crtcal value s gven. Ths results actual test szes between.6% and 5.3%. In Secton we show that when normalty holds the new test s not as powerful as the Levene test for smaller sample szes, but overtakes t for moderate samples of about 5. The new test s always nferor to the optmal F test. However the R test has power that approaches that of the F test, beng at least 95% that of the F test throughout most of the parameter space n samples of at least 80. In Secton 5 we show that f we sample from t dstrbutons wth varyng degrees of freedom, the F test s hghly non-robust for small degrees of freedom, as s well-known for fat-taled dstrbutons. The new test performs far better than the F test, and although t s challenged somewhat for small degrees of freedom, ts performance s only slghtly nferor to the Levene test. That the new test gves a good compromse between power and robustness, and s vald when normalty doesn t hold, are strong reasons for preferrng the new test for sample szes that are at least moderate and normalty s dubous.. Compettor Tests for Equalty of Varance Intally we assume that we have two ndependent random samples X,..., X from normal populatons, N(, ) for = and. We wsh to test H: = n aganst the alternatve K: nusance parameters. If n S = j, wth the populaton means beng unknown ( Xj X. ) /( n ), n whch X. = n j X j / n, =,, then the S are the unbased sample varances, and the so-called F test s based on ther quotent, S / S = F, say. It s well-known, and wll be confrmed yet agan n Secton 5, that the null dstrbuton of F, Fn, n, s senstve to departures from normalty. If F n, n (x) s the cumulatve dstrbuton functon of the Fn, n 3 August 00
4 dstrbuton, and f c p s such that F n, n (c p ) = p, then the F test rejects H at the 00 % level when F < c / and when F > c ( /). Common practce when normalty s n doubt s to use Levene s test or a nonparametrc test such as Mood s test. Levene s test s based on the ANOVA F test appled to the resduals. There are dfferent versons of Levene s test usng dfferent defntons of resdual. The two most common versons use resduals based on the ~ ~ group means, X j X., and the group medans, X j X., n whch X. s the medan of the th sample. The latter s called the Brown-Forsythe test. Agan t s well-known that these tests are robust n that when the populaton varances are equal but the populatons themselves are not normal, they acheve levels close to nomnal. However ths happens at the expense of some power. As the emprcal study n ths paper s ntended to be ndcatve rather than exhaustve, we wll henceforth make comparsons only wth the Levene test, based on a test statstc we denote by L. We now construct a new test that we call the R test. For unvarate parameters, a Wald test statstc of H: = 0 aganst the alternatve K: 0 s based on ˆ, the maxmum lkelhood estmator of, usually va the test statstc ˆ ( ) /estvar( ˆ 0 ), where est var( ˆ ) s a consstent estmate of var( ˆ). Under the null hypothess ths test statstc has an asymptotc dstrbuton. As well as beng equvalent to the lkelhood rato test, the F test s also a Wald test for testng H: = / = aganst K:. Rayner [5] derved the Wald test for testng H: = = 0 aganst K: 0. The test statstc s ( S S ) S S n n = W, say. Beng a Wald test, the asymptotc dstrbuton of W s, whle ts exact dstrbuton s not mmedately obvous. However W s a - functon of F, so the two tests are equvalent. Snce the exact dstrbuton of F s avalable, the F test s the obvous test to use. In W, the varances var( S ) are estmated optmally usng the Rao-Blackwell j theorem. Ths depends very strongly on the assumpton of normalty. If normalty s n doubt then we can estmate var( S S ) usng results gven, for example, n [6]. For a random sample Y,..., Y n wth populaton and sample central moments r and m r = n j r ( Y Y ) / n, r =, 3,..., [6] gves that j E[m r ] = r + O(n ) and var(m ) = ( )/n + O(n ). Applyng [6, 0.5], may be estmated to O(n ) by m, or, equvalently, by n m /(n ) = S, where S s the unbased sample varance. It follows that, to order O(n ), var(m ) may be estmated by (m m )/n. We thus propose a robust alternatve to W, gven by 3 3 August 00
5 ( S m S n S ) m S n = R say, n whch m, are the fourth central sample moments for the th sample, =,. We call the test based on R the R test. As the sample szes ncrease, the dstrbutons of the sample varances approach normalty, the denomnator n R wll approxmate var( S S ), and R wll have asymptotc dstrbuton. Thus, f c s the pont for whch the dstrbuton has weght n the rght hand tal, then the R test rejects H at approxmately the 00 % level when R > c. We emphasse that although the motvaton for the dervaton of R s under the assumpton of samplng from normal populatons, t s a vald test statstc for testng equalty of varances no matter what the populatons sampled. If the sample varances are equal to or asymptotcally equvalent to the maxmum lkelhood estmators of the populaton varances, as s the case when samplng from normal populatons, then the R test s a Wald test for equalty of varances n the sense descrbed above. Snce t doesn t depend on any dstrbutonal assumptons about the data, t can be thought of as a nonparametrc Wald test. As such t can be expected to have good propertes n large samples. We note that all the above test statstcs are nvarant under transformatons a(x j b ), for constants a, b and b and for j =,..., n and =,. The next three sectons report an emprcal study when the dstrbutons sampled are assumed to be normal. As ths s an ndcatve study, we fx the samples szes to be equal, n = n = n, say, and the sgnfcance level to be 5% throughout. 3. Test Sze Under Normalty Under the null hypothess, the dstrbuton of F s known exactly, the dstrbuton of L s known approxmately, and, as mentoned above, the dstrbuton of R s known asymptotcally. In analysng data these dstrbutons are used to determne p-values and crtcal values. We now nvestgate ther use n determnng test sze, the probablty of rejectng the null hypothess when t s true. Two emprcal assessments of test sze wll now be undertaken. Snce the test statstcs are scale nvarant, t s suffcent under the null hypothess to take both populaton varances to be one. In the frst assessment we assume normalty. For varous values of the common sample sze n, we estmate the 5% crtcal ponts for each test by generatng 00,000 pars of random samples of sze n, calculatng the test statstcs, orderng them and hence dentfyng the 0.95th percentle. The estmated crtcal ponts of R approach the 5% crtcal pont 3.8. These estmated crtcal ponts wll subsequently be used n the power study to gve tests wth test sze of exactly 5%. To see the extent of the error caused by usng the asymptotc crtcal pont 3.8, Fgure gves the proporton of rejectons n 00,000 pars of random samples for sample szes up to 00. For n = 0 the proporton of rejectons s nearly 0% and although for n = 0 ths has dropped to nearly 7%, most users would hope for observed test szes closer to 5%. 3 August 00
6 Fgure. Proporton of rejectons of the R test usng the 5% crtcal pont 3.8 for sample szes up to 00 To mprove the applcaton of the test we plotted the estmated 5% crtcal ponts aganst n and used standard curve fttng technques to fnd that the exact crtcal ponts were well approxmated between n = 0 and 00 by c(n, 0.05) = 3.86( / n +.7/n). For larger n t s suffcent to use the asymptotc 5% value 3.86, the error beng at most 0.8%. We checked the exact probabltes of rejecton under the null hypothess when applyng the test wth crtcal value c(n, 0.05), and all were between.6% and 5.3%. For levels other than 5%, and when sample szes are unequal, further emprcal work needs to be done to fnd crtcal values. However, as ths study was ntended to be ndcatve, we leave extensve tabulaton of crtcal values for another tme.. Power Under Normalty For the F, Levene and R tests we estmate the power as the proporton of rejectons from 00,000 pars of random samples of sze n when the frst sample s from a N(0, ) populaton and the second s from a N(0, ) populaton wth >. To compare lke wth lke, we use estmated crtcal values that gve exact 5% level tests. It s apparent that for sample szes less than about 0 the Levene test s more powerful than the R test and that between approxmately 0 and 30 the R test takes over from the Levene test; thereafter the R test s always more powerful than the Levene test. Ths s shown n Fgure. 5 3 August 00
7 Fgure. Power of the 5% level L test (sold lne) and R test (dashed lne) for varous sample szes Fgure 3. Contour plots of the L test (left panel) and R test (rght panel) relatve to the F test showng regons n whch the power ratos are less than 95%, between 95% and 99.99%, and greater than 99.99%. 6 3 August 00
8 Both the Levene and R tests are always less powerful that the F test. Ths s explored n Fgure 3 that compares the Levene test to the F test n the left hand panel and the R test to the F test n the rght hand panel. What s gven s a contour plot of the regons n whch the ratos of the power of the stated test to the F test s less than 95%, between 95% and 99.99%, and greater than 99.99%. Generally, for any gven n and, t s clear that the power of the Levene test s at most that of the R test. For example t appears that for n = n = 80 approxmately the power of the R test s always at least 95% that of the F test, whereas there s a consderable regon where the power of the Levene test s less than 95% that of the F test. Fgure. Test szes for the F (dots), L (dashes) and R (sold lne) tests for t dstrbutons wth varyng degrees of freedom,, from to 0 5. Robustness Even f the R test has good power, the test s of lttle value unless t s robust n the sense that when the dstrbutons from whch we are samplng are not from the nomnated populaton (here, the normal dstrbuton) the p-values are reasonably accurate. It s thus of nterest to estmate the proporton of rejectons when the null hypothess s true and both the populatons from whch we sample are not normal. We could have looked at varable skewness through gamma dstrbutons wth varyng shape parameter, but nstead we consder varable kurtoss va t dstrbutons wth varyng degrees of freedom. If the degrees of freedom, say, are large, the dstrbuton sampled wll be close enough to normal that we could expect the proporton of rejectons to be close to the nomnal. The crtcal values used here are c(n, 0.05) gven n Secton August 00
9 In Fgure we plot the proporton of rejectons for the Levene, F and R tests when samplng from t dstrbutons for =,..., 0. We show curves for each test wth common sample szes n = 0, 5, 50 and 00. The crtcal values used are the c(n, 0.05) from Secton 3. It s apparent that the F test performs ncreasngly poorly as the degrees of freedom reduce and the tals of the dstrbuton become fatter. Interestngly, n ths scenaro, the F test s always lberal (exact test sze more than the sgnfcance level) whle the R test s almost always conservatve (exact test sze less than the sgnfcance level). In general the latter s to be preferred. The Levene test generally has exact level closer to the nomnal level than the R test except for small degrees of freedom. However the level of the R test s almost always reasonable, and whle for very small the level s not as close to the exact level as perhaps we may prefer, the same s the case for the Levene test. 5. Concluson Frst, we reflect on testng for equalty of varances when t s assumed that the populatons sampled are normal. The F test s both the lkelhood rato test and a Wald test, and s the approprate test to apply. When normalty does not hold, the F test s no longer an asymptotcally optmal test and ts well-known non-robustness means that tests such as the Levene are more approprate for small to moderate sample szes. However for sample szes of about 5 or more the R test s more powerful than the Levene and wth the small sample corrected crtcal values t holds ts nomnal sgnfcance level well. For these sample szes t can be preferred to the Levene test. Second, consder testng for equalty of varances when both samples are drawn from the same populaton. If that populaton s nomnated then the R test may be appled after determnng crtcal values or usng p-values calculated by Monte Carlo methods. When the sample varances are asymptotcally equvalent to the maxmum lkelhood estmators of the populaton varances (as, for example, s the case when samplng from normal populatons but not Posson populatons), the R test s a nonparametrc Wald test and hence wll have good power n suffcently large samples. If the populaton s not specfed the R test can be confdently appled when the sample szes are large, usng the asymptotc null dstrbuton to calculate p- values or crtcal values. References [] B.F.J. Manley, The Statstcs of Natural Selecton, Chapman and Hall, London, 987. [] D.S. Moore and G.P. McCabe, Introducton to the Practce of Statstcs (5 th ed.), W.H. Freeman, New York, 006. [3] W.J. Conover, Mark E Johnson and Myrle M. Johnson, A comparatve study of tests of homogenety of varances, wth applcatons to the outer contnental shelf bddng data, Technometrcs, vol. 3, no., pp , 98. [] Denns D. Boos and Cavell Browne, Bootstrap methods for testng homogenety of varances, Technometrcs, vol. 3, no., pp. 69-8, 989. [5] J.C.W. Rayner, The Asymptotcally Optmal Tests, Journal of the Royal Statstcal Socety: Seres D, vol. 6, no. 3, , 997. [6] A. Stuart and J.K. Ord, Kendall's Advanced Theory Of Statstcs, Vol.: Dstrbuton theory (6th ed.), Hodder Arnold, London, August 00
A nonparametric two-sample wald test of equality of variances
University of Wollongong Research Online Faculty of Informatics - Papers (Archive) Faculty of Engineering and Information Sciences 211 A nonparametric two-sample wald test of equality of variances David
More informationResearch Article A Nonparametric Two-Sample Wald Test of Equality of Variances
Advances in Decision Sciences Volume 211, Article ID 74858, 8 pages doi:1.1155/211/74858 Research Article A Nonparametric Two-Sample Wald Test of Equality of Variances David Allingham 1 andj.c.w.rayner
More informationBOOTSTRAP METHOD FOR TESTING OF EQUALITY OF SEVERAL MEANS. M. Krishna Reddy, B. Naveen Kumar and Y. Ramu
BOOTSTRAP METHOD FOR TESTING OF EQUALITY OF SEVERAL MEANS M. Krshna Reddy, B. Naveen Kumar and Y. Ramu Department of Statstcs, Osmana Unversty, Hyderabad -500 007, Inda. nanbyrozu@gmal.com, ramu0@gmal.com
More informationx = , so that calculated
Stat 4, secton Sngle Factor ANOVA notes by Tm Plachowsk n chapter 8 we conducted hypothess tests n whch we compared a sngle sample s mean or proporton to some hypotheszed value Chapter 9 expanded ths to
More informationSimulated Power of the Discrete Cramér-von Mises Goodness-of-Fit Tests
Smulated of the Cramér-von Mses Goodness-of-Ft Tests Steele, M., Chaselng, J. and 3 Hurst, C. School of Mathematcal and Physcal Scences, James Cook Unversty, Australan School of Envronmental Studes, Grffth
More informationChapter 11: Simple Linear Regression and Correlation
Chapter 11: Smple Lnear Regresson and Correlaton 11-1 Emprcal Models 11-2 Smple Lnear Regresson 11-3 Propertes of the Least Squares Estmators 11-4 Hypothess Test n Smple Lnear Regresson 11-4.1 Use of t-tests
More informationComputing MLE Bias Empirically
Computng MLE Bas Emprcally Kar Wa Lm Australan atonal Unversty January 3, 27 Abstract Ths note studes the bas arses from the MLE estmate of the rate parameter and the mean parameter of an exponental dstrbuton.
More informationLecture 4 Hypothesis Testing
Lecture 4 Hypothess Testng We may wsh to test pror hypotheses about the coeffcents we estmate. We can use the estmates to test whether the data rejects our hypothess. An example mght be that we wsh to
More informationJoint Statistical Meetings - Biopharmaceutical Section
Iteratve Ch-Square Test for Equvalence of Multple Treatment Groups Te-Hua Ng*, U.S. Food and Drug Admnstraton 1401 Rockvlle Pke, #200S, HFM-217, Rockvlle, MD 20852-1448 Key Words: Equvalence Testng; Actve
More informationDurban Watson for Testing the Lack-of-Fit of Polynomial Regression Models without Replications
Durban Watson for Testng the Lack-of-Ft of Polynomal Regresson Models wthout Replcatons Ruba A. Alyaf, Maha A. Omar, Abdullah A. Al-Shha ralyaf@ksu.edu.sa, maomar@ksu.edu.sa, aalshha@ksu.edu.sa Department
More informationLecture 6 More on Complete Randomized Block Design (RBD)
Lecture 6 More on Complete Randomzed Block Desgn (RBD) Multple test Multple test The multple comparsons or multple testng problem occurs when one consders a set of statstcal nferences smultaneously. For
More informationANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U)
Econ 413 Exam 13 H ANSWERS Settet er nndelt 9 deloppgaver, A,B,C, som alle anbefales å telle lkt for å gøre det ltt lettere å stå. Svar er gtt . Unfortunately, there s a prntng error n the hnt of
More informationEconomics 130. Lecture 4 Simple Linear Regression Continued
Economcs 130 Lecture 4 Contnued Readngs for Week 4 Text, Chapter and 3. We contnue wth addressng our second ssue + add n how we evaluate these relatonshps: Where do we get data to do ths analyss? How do
More informationStatistics II Final Exam 26/6/18
Statstcs II Fnal Exam 26/6/18 Academc Year 2017/18 Solutons Exam duraton: 2 h 30 mn 1. (3 ponts) A town hall s conductng a study to determne the amount of leftover food produced by the restaurants n the
More informationSpring Research Conference - Section on Physical & Engineering Sciences (SPES) n n -1 4 ).
Sprng Research Conference - Secton on Physcal & Engneerng Scences (SPES) ROBUSTIFYING THE F TEST AND BARTLETT S TEST FOR HOMOGENEOUS SPREADS Lews H. Shoemaker & James W. Fenwck, Department of Mathematcs,
More information/ n ) are compared. The logic is: if the two
STAT C141, Sprng 2005 Lecture 13 Two sample tests One sample tests: examples of goodness of ft tests, where we are testng whether our data supports predctons. Two sample tests: called as tests of ndependence
More informationDr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur
Analyss of Varance and Desgn of Experment-I MODULE VIII LECTURE - 34 ANALYSIS OF VARIANCE IN RANDOM-EFFECTS MODEL AND MIXED-EFFECTS EFFECTS MODEL Dr Shalabh Department of Mathematcs and Statstcs Indan
More informationANSWERS CHAPTER 9. TIO 9.2: If the values are the same, the difference is 0, therefore the null hypothesis cannot be rejected.
ANSWERS CHAPTER 9 THINK IT OVER thnk t over TIO 9.: χ 2 k = ( f e ) = 0 e Breakng the equaton down: the test statstc for the ch-squared dstrbuton s equal to the sum over all categores of the expected frequency
More information4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA
4 Analyss of Varance (ANOVA) 5 ANOVA 51 Introducton ANOVA ANOVA s a way to estmate and test the means of multple populatons We wll start wth one-way ANOVA If the populatons ncluded n the study are selected
More informationHere is the rationale: If X and y have a strong positive relationship to one another, then ( x x) will tend to be positive when ( y y)
Secton 1.5 Correlaton In the prevous sectons, we looked at regresson and the value r was a measurement of how much of the varaton n y can be attrbuted to the lnear relatonshp between y and x. In ths secton,
More informationTopic- 11 The Analysis of Variance
Topc- 11 The Analyss of Varance Expermental Desgn The samplng plan or expermental desgn determnes the way that a sample s selected. In an observatonal study, the expermenter observes data that already
More informationexperimenteel en correlationeel onderzoek
expermenteel en correlatoneel onderzoek lecture 6: one-way analyss of varance Leary. Introducton to Behavoral Research Methods. pages 246 271 (chapters 10 and 11): conceptual statstcs Moore, McCabe, and
More informationA Note on Test of Homogeneity Against Umbrella Scale Alternative Based on U-Statistics
J Stat Appl Pro No 3 93- () 93 NSP Journal of Statstcs Applcatons & Probablty --- An Internatonal Journal @ NSP Natural Scences Publshng Cor A Note on Test of Homogenety Aganst Umbrella Scale Alternatve
More informationTesting for seasonal unit roots in heterogeneous panels
Testng for seasonal unt roots n heterogeneous panels Jesus Otero * Facultad de Economía Unversdad del Rosaro, Colomba Jeremy Smth Department of Economcs Unversty of arwck Monca Gulett Aston Busness School
More informationChapter 13: Multiple Regression
Chapter 13: Multple Regresson 13.1 Developng the multple-regresson Model The general model can be descrbed as: It smplfes for two ndependent varables: The sample ft parameter b 0, b 1, and b are used to
More information1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands
Content. Inference on Regresson Parameters a. Fndng Mean, s.d and covarance amongst estmates.. Confdence Intervals and Workng Hotellng Bands 3. Cochran s Theorem 4. General Lnear Testng 5. Measures of
More information2E Pattern Recognition Solutions to Introduction to Pattern Recognition, Chapter 2: Bayesian pattern classification
E395 - Pattern Recognton Solutons to Introducton to Pattern Recognton, Chapter : Bayesan pattern classfcaton Preface Ths document s a soluton manual for selected exercses from Introducton to Pattern Recognton
More informationx i1 =1 for all i (the constant ).
Chapter 5 The Multple Regresson Model Consder an economc model where the dependent varable s a functon of K explanatory varables. The economc model has the form: y = f ( x,x,..., ) xk Approxmate ths by
More informationFirst Year Examination Department of Statistics, University of Florida
Frst Year Examnaton Department of Statstcs, Unversty of Florda May 7, 010, 8:00 am - 1:00 noon Instructons: 1. You have four hours to answer questons n ths examnaton.. You must show your work to receve
More informationStatistics for Economics & Business
Statstcs for Economcs & Busness Smple Lnear Regresson Learnng Objectves In ths chapter, you learn: How to use regresson analyss to predct the value of a dependent varable based on an ndependent varable
More informationAnswers Problem Set 2 Chem 314A Williamsen Spring 2000
Answers Problem Set Chem 314A Wllamsen Sprng 000 1) Gve me the followng crtcal values from the statstcal tables. a) z-statstc,-sded test, 99.7% confdence lmt ±3 b) t-statstc (Case I), 1-sded test, 95%
More informationDepartment of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6
Department of Quanttatve Methods & Informaton Systems Tme Seres and Ther Components QMIS 30 Chapter 6 Fall 00 Dr. Mohammad Zanal These sldes were modfed from ther orgnal source for educatonal purpose only.
More informationChapter 14 Simple Linear Regression
Chapter 4 Smple Lnear Regresson Chapter 4 - Smple Lnear Regresson Manageral decsons often are based on the relatonshp between two or more varables. Regresson analss can be used to develop an equaton showng
More informationTopic 23 - Randomized Complete Block Designs (RCBD)
Topc 3 ANOVA (III) 3-1 Topc 3 - Randomzed Complete Block Desgns (RCBD) Defn: A Randomzed Complete Block Desgn s a varant of the completely randomzed desgn (CRD) that we recently learned. In ths desgn,
More information4.1. Lecture 4: Fitting distributions: goodness of fit. Goodness of fit: the underlying principle
Lecture 4: Fttng dstrbutons: goodness of ft Goodness of ft Testng goodness of ft Testng normalty An mportant note on testng normalty! L4.1 Goodness of ft measures the extent to whch some emprcal dstrbuton
More informationBOOTSTRAP METHOD FOR TESTING THE EQUALITY OF MEANS: IN CASE OF HETEROSCEDASTICITY
SAÜ Fen Edebyat Dergs(0-II) M.EKİZ, U.EKİZ BOOTSTRAP METHOD FOR TESTING THE EQUALITY OF MEANS: IN CASE OF HETEROSCEDASTICITY Meltem EKİZ, Ufuk EKİZ Gaz Unversty, Faculty of Scence, Department of Statstcs,
More informationSTAT 3008 Applied Regression Analysis
STAT 3008 Appled Regresson Analyss Tutoral : Smple Lnear Regresson LAI Chun He Department of Statstcs, The Chnese Unversty of Hong Kong 1 Model Assumpton To quantfy the relatonshp between two factors,
More informationGeneralized Linear Methods
Generalzed Lnear Methods 1 Introducton In the Ensemble Methods the general dea s that usng a combnaton of several weak learner one could make a better learner. More formally, assume that we have a set
More informationEcon107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4)
I. Classcal Assumptons Econ7 Appled Econometrcs Topc 3: Classcal Model (Studenmund, Chapter 4) We have defned OLS and studed some algebrac propertes of OLS. In ths topc we wll study statstcal propertes
More informationDepartment of Statistics University of Toronto STA305H1S / 1004 HS Design and Analysis of Experiments Term Test - Winter Solution
Department of Statstcs Unversty of Toronto STA35HS / HS Desgn and Analyss of Experments Term Test - Wnter - Soluton February, Last Name: Frst Name: Student Number: Instructons: Tme: hours. Ads: a non-programmable
More informationChapter 5 Multilevel Models
Chapter 5 Multlevel Models 5.1 Cross-sectonal multlevel models 5.1.1 Two-level models 5.1.2 Multple level models 5.1.3 Multple level modelng n other felds 5.2 Longtudnal multlevel models 5.2.1 Two-level
More informationComparison of the Population Variance Estimators. of 2-Parameter Exponential Distribution Based on. Multiple Criteria Decision Making Method
Appled Mathematcal Scences, Vol. 7, 0, no. 47, 07-0 HIARI Ltd, www.m-hkar.com Comparson of the Populaton Varance Estmators of -Parameter Exponental Dstrbuton Based on Multple Crtera Decson Makng Method
More informationCorrelation and Regression. Correlation 9.1. Correlation. Chapter 9
Chapter 9 Correlaton and Regresson 9. Correlaton Correlaton A correlaton s a relatonshp between two varables. The data can be represented b the ordered pars (, ) where s the ndependent (or eplanator) varable,
More informationThe Ordinary Least Squares (OLS) Estimator
The Ordnary Least Squares (OLS) Estmator 1 Regresson Analyss Regresson Analyss: a statstcal technque for nvestgatng and modelng the relatonshp between varables. Applcatons: Engneerng, the physcal and chemcal
More informationANOVA. The Observations y ij
ANOVA Stands for ANalyss Of VArance But t s a test of dfferences n means The dea: The Observatons y j Treatment group = 1 = 2 = k y 11 y 21 y k,1 y 12 y 22 y k,2 y 1, n1 y 2, n2 y k, nk means: m 1 m 2
More informationF statistic = s2 1 s 2 ( F for Fisher )
Stat 4 ANOVA Analyss of Varance /6/04 Comparng Two varances: F dstrbuton Typcal Data Sets One way analyss of varance : example Notaton for one way ANOVA Comparng Two varances: F dstrbuton We saw that the
More information7.1. Single classification analysis of variance (ANOVA) Why not use multiple 2-sample 2. When to use ANOVA
Sngle classfcaton analyss of varance (ANOVA) When to use ANOVA ANOVA models and parttonng sums of squares ANOVA: hypothess testng ANOVA: assumptons A non-parametrc alternatve: Kruskal-Walls ANOVA Power
More informationNegative Binomial Regression
STATGRAPHICS Rev. 9/16/2013 Negatve Bnomal Regresson Summary... 1 Data Input... 3 Statstcal Model... 3 Analyss Summary... 4 Analyss Optons... 7 Plot of Ftted Model... 8 Observed Versus Predcted... 10 Predctons...
More informationDr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur
Analyss of Varance and Desgn of Experment-I MODULE VII LECTURE - 3 ANALYSIS OF COVARIANCE Dr Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur Any scentfc experment s performed
More informationParametric fractional imputation for missing data analysis. Jae Kwang Kim Survey Working Group Seminar March 29, 2010
Parametrc fractonal mputaton for mssng data analyss Jae Kwang Km Survey Workng Group Semnar March 29, 2010 1 Outlne Introducton Proposed method Fractonal mputaton Approxmaton Varance estmaton Multple mputaton
More informationInterval Estimation in the Classical Normal Linear Regression Model. 1. Introduction
ECONOMICS 35* -- NOTE 7 ECON 35* -- NOTE 7 Interval Estmaton n the Classcal Normal Lnear Regresson Model Ths note outlnes the basc elements of nterval estmaton n the Classcal Normal Lnear Regresson Model
More informationChapter 3 Describing Data Using Numerical Measures
Chapter 3 Student Lecture Notes 3-1 Chapter 3 Descrbng Data Usng Numercal Measures Fall 2006 Fundamentals of Busness Statstcs 1 Chapter Goals To establsh the usefulness of summary measures of data. The
More informationEstimation: Part 2. Chapter GREG estimation
Chapter 9 Estmaton: Part 2 9. GREG estmaton In Chapter 8, we have seen that the regresson estmator s an effcent estmator when there s a lnear relatonshp between y and x. In ths chapter, we generalzed the
More informationChapter 5: Hypothesis Tests, Confidence Intervals & Gauss-Markov Result
Chapter 5: Hypothess Tests, Confdence Intervals & Gauss-Markov Result 1-1 Outlne 1. The standard error of 2. Hypothess tests concernng β 1 3. Confdence ntervals for β 1 4. Regresson when X s bnary 5. Heteroskedastcty
More informationGlobal Sensitivity. Tuesday 20 th February, 2018
Global Senstvty Tuesday 2 th February, 28 ) Local Senstvty Most senstvty analyses [] are based on local estmates of senstvty, typcally by expandng the response n a Taylor seres about some specfc values
More informationECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics
ECOOMICS 35*-A Md-Term Exam -- Fall Term 000 Page of 3 pages QUEE'S UIVERSITY AT KIGSTO Department of Economcs ECOOMICS 35* - Secton A Introductory Econometrcs Fall Term 000 MID-TERM EAM ASWERS MG Abbott
More informationANOMALIES OF THE MAGNITUDE OF THE BIAS OF THE MAXIMUM LIKELIHOOD ESTIMATOR OF THE REGRESSION SLOPE
P a g e ANOMALIES OF THE MAGNITUDE OF THE BIAS OF THE MAXIMUM LIKELIHOOD ESTIMATOR OF THE REGRESSION SLOPE Darmud O Drscoll ¹, Donald E. Ramrez ² ¹ Head of Department of Mathematcs and Computer Studes
More informationSee Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition)
Count Data Models See Book Chapter 11 2 nd Edton (Chapter 10 1 st Edton) Count data consst of non-negatve nteger values Examples: number of drver route changes per week, the number of trp departure changes
More information2016 Wiley. Study Session 2: Ethical and Professional Standards Application
6 Wley Study Sesson : Ethcal and Professonal Standards Applcaton LESSON : CORRECTION ANALYSIS Readng 9: Correlaton and Regresson LOS 9a: Calculate and nterpret a sample covarance and a sample correlaton
More informationChapter 8 Indicator Variables
Chapter 8 Indcator Varables In general, e explanatory varables n any regresson analyss are assumed to be quanttatve n nature. For example, e varables lke temperature, dstance, age etc. are quanttatve n
More informationCopyright 2017 by Taylor Enterprises, Inc., All Rights Reserved. Adjusted Control Limits for U Charts. Dr. Wayne A. Taylor
Taylor Enterprses, Inc. Adjusted Control Lmts for U Charts Copyrght 207 by Taylor Enterprses, Inc., All Rghts Reserved. Adjusted Control Lmts for U Charts Dr. Wayne A. Taylor Abstract: U charts are used
More informationAn accurate test for homogeneity of odds ratios based on Cochran s Q-statistic
Kulnskaya and Dollnger BMC Medcal Research Methodology (2015) 15:49 DOI 10.1186/s12874-015-0034-x TECHNICAL ADVANCE Open Access An accurate test for homogenety of odds ratos based on Cochran s Q-statstc
More informationDefinition. Measures of Dispersion. Measures of Dispersion. Definition. The Range. Measures of Dispersion 3/24/2014
Measures of Dsperson Defenton Range Interquartle Range Varance and Standard Devaton Defnton Measures of dsperson are descrptve statstcs that descrbe how smlar a set of scores are to each other The more
More informationTest for Intraclass Correlation Coefficient under Unequal Family Sizes
Journal of Modern Appled Statstcal Methods Volume Issue Artcle 9 --03 Test for Intraclass Correlaton Coeffcent under Unequal Famly Szes Madhusudan Bhandary Columbus State Unversty, Columbus, GA, bhandary_madhusudan@colstate.edu
More informationChapter 6. Supplemental Text Material
Chapter 6. Supplemental Text Materal S6-. actor Effect Estmates are Least Squares Estmates We have gven heurstc or ntutve explanatons of how the estmates of the factor effects are obtaned n the textboo.
More informationIntroduction to Generalized Linear Models
INTRODUCTION TO STATISTICAL MODELLING TRINITY 00 Introducton to Generalzed Lnear Models I. Motvaton In ths lecture we extend the deas of lnear regresson to the more general dea of a generalzed lnear model
More informationUNIVERSITY OF TORONTO Faculty of Arts and Science. December 2005 Examinations STA437H1F/STA1005HF. Duration - 3 hours
UNIVERSITY OF TORONTO Faculty of Arts and Scence December 005 Examnatons STA47HF/STA005HF Duraton - hours AIDS ALLOWED: (to be suppled by the student) Non-programmable calculator One handwrtten 8.5'' x
More information4.3 Poisson Regression
of teratvely reweghted least squares regressons (the IRLS algorthm). We do wthout gvng further detals, but nstead focus on the practcal applcaton. > glm(survval~log(weght)+age, famly="bnomal", data=baby)
More informationBasic Business Statistics, 10/e
Chapter 13 13-1 Basc Busness Statstcs 11 th Edton Chapter 13 Smple Lnear Regresson Basc Busness Statstcs, 11e 009 Prentce-Hall, Inc. Chap 13-1 Learnng Objectves In ths chapter, you learn: How to use regresson
More informationChapter 12 Analysis of Covariance
Chapter Analyss of Covarance Any scentfc experment s performed to know somethng that s unknown about a group of treatments and to test certan hypothess about the correspondng treatment effect When varablty
More informationAppendix B: Resampling Algorithms
407 Appendx B: Resamplng Algorthms A common problem of all partcle flters s the degeneracy of weghts, whch conssts of the unbounded ncrease of the varance of the mportance weghts ω [ ] of the partcles
More informationStatistical analysis using matlab. HY 439 Presented by: George Fortetsanakis
Statstcal analyss usng matlab HY 439 Presented by: George Fortetsanaks Roadmap Probablty dstrbutons Statstcal estmaton Fttng data to probablty dstrbutons Contnuous dstrbutons Contnuous random varable X
More informationStatistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation
Statstcs for Managers Usng Mcrosoft Excel/SPSS Chapter 13 The Smple Lnear Regresson Model and Correlaton 1999 Prentce-Hall, Inc. Chap. 13-1 Chapter Topcs Types of Regresson Models Determnng the Smple Lnear
More informationChapter 11: I = 2 samples independent samples paired samples Chapter 12: I 3 samples of equal size J one-way layout two-way layout
Serk Sagtov, Chalmers and GU, February 0, 018 Chapter 1. Analyss of varance Chapter 11: I = samples ndependent samples pared samples Chapter 1: I 3 samples of equal sze one-way layout two-way layout 1
More informationNUMERICAL DIFFERENTIATION
NUMERICAL DIFFERENTIATION 1 Introducton Dfferentaton s a method to compute the rate at whch a dependent output y changes wth respect to the change n the ndependent nput x. Ths rate of change s called the
More informationAn (almost) unbiased estimator for the S-Gini index
An (almost unbased estmator for the S-Gn ndex Thomas Demuynck February 25, 2009 Abstract Ths note provdes an unbased estmator for the absolute S-Gn and an almost unbased estmator for the relatve S-Gn for
More informationComposite Hypotheses testing
Composte ypotheses testng In many hypothess testng problems there are many possble dstrbutons that can occur under each of the hypotheses. The output of the source s a set of parameters (ponts n a parameter
More informationLecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding
Recall: man dea of lnear regresson Lecture 9: Lnear regresson: centerng, hypothess testng, multple covarates, and confoundng Sandy Eckel seckel@jhsph.edu 6 May 8 Lnear regresson can be used to study an
More informationLecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding
Lecture 9: Lnear regresson: centerng, hypothess testng, multple covarates, and confoundng Sandy Eckel seckel@jhsph.edu 6 May 008 Recall: man dea of lnear regresson Lnear regresson can be used to study
More informationWhich estimator of the dispersion parameter for the Gamma family generalized linear models is to be chosen?
STATISTICS Dalarna Unversty D-level Master s Thess 007 Whch estmator of the dsperson parameter for the Gamma famly generalzed lnear models s to be chosen? Submtted by: Juan Du Regstraton Number: 8096-T084
More informationComparison of Regression Lines
STATGRAPHICS Rev. 9/13/2013 Comparson of Regresson Lnes Summary... 1 Data Input... 3 Analyss Summary... 4 Plot of Ftted Model... 6 Condtonal Sums of Squares... 6 Analyss Optons... 7 Forecasts... 8 Confdence
More informationa. (All your answers should be in the letter!
Econ 301 Blkent Unversty Taskn Econometrcs Department of Economcs Md Term Exam I November 8, 015 Name For each hypothess testng n the exam complete the followng steps: Indcate the test statstc, ts crtcal
More informationModeling and Simulation NETW 707
Modelng and Smulaton NETW 707 Lecture 5 Tests for Random Numbers Course Instructor: Dr.-Ing. Magge Mashaly magge.ezzat@guc.edu.eg C3.220 1 Propertes of Random Numbers Random Number Generators (RNGs) must
More informationPredictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore
Sesson Outlne Introducton to classfcaton problems and dscrete choce models. Introducton to Logstcs Regresson. Logstc functon and Logt functon. Maxmum Lkelhood Estmator (MLE) for estmaton of LR parameters.
More informationTHE ROYAL STATISTICAL SOCIETY 2006 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE
THE ROYAL STATISTICAL SOCIETY 6 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE PAPER I STATISTICAL THEORY The Socety provdes these solutons to assst canddates preparng for the eamnatons n future years and for
More informationThe Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction
ECONOMICS 5* -- NOTE (Summary) ECON 5* -- NOTE The Multple Classcal Lnear Regresson Model (CLRM): Specfcaton and Assumptons. Introducton CLRM stands for the Classcal Lnear Regresson Model. The CLRM s also
More informationLecture 6: Introduction to Linear Regression
Lecture 6: Introducton to Lnear Regresson An Manchakul amancha@jhsph.edu 24 Aprl 27 Lnear regresson: man dea Lnear regresson can be used to study an outcome as a lnear functon of a predctor Example: 6
More informationUCLA STAT 13 Introduction to Statistical Methods for the Life and Health Sciences. Chapter 11 Analysis of Variance - ANOVA. Instructor: Ivo Dinov,
UCLA STAT 3 ntroducton to Statstcal Methods for the Lfe and Health Scences nstructor: vo Dnov, Asst. Prof. of Statstcs and Neurology Chapter Analyss of Varance - ANOVA Teachng Assstants: Fred Phoa, Anwer
More informationCopyright 2017 by Taylor Enterprises, Inc., All Rights Reserved. Adjusted Control Limits for P Charts. Dr. Wayne A. Taylor
Taylor Enterprses, Inc. Control Lmts for P Charts Copyrght 2017 by Taylor Enterprses, Inc., All Rghts Reserved. Control Lmts for P Charts Dr. Wayne A. Taylor Abstract: P charts are used for count data
More informationChapter 9: Statistical Inference and the Relationship between Two Variables
Chapter 9: Statstcal Inference and the Relatonshp between Two Varables Key Words The Regresson Model The Sample Regresson Equaton The Pearson Correlaton Coeffcent Learnng Outcomes After studyng ths chapter,
More informationLINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity
LINEAR REGRESSION ANALYSIS MODULE IX Lecture - 30 Multcollnearty Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur 2 Remedes for multcollnearty Varous technques have
More informationNon-Mixture Cure Model for Interval Censored Data: Simulation Study ABSTRACT
Malaysan Journal of Mathematcal Scences 8(S): 37-44 (2014) Specal Issue: Internatonal Conference on Mathematcal Scences and Statstcs 2013 (ICMSS2013) MALAYSIAN JOURNAL OF MATHEMATICAL SCIENCES Journal
More informationStatistics for Business and Economics
Statstcs for Busness and Economcs Chapter 11 Smple Regresson Copyrght 010 Pearson Educaton, Inc. Publshng as Prentce Hall Ch. 11-1 11.1 Overvew of Lnear Models n An equaton can be ft to show the best lnear
More informationPsychology 282 Lecture #24 Outline Regression Diagnostics: Outliers
Psychology 282 Lecture #24 Outlne Regresson Dagnostcs: Outlers In an earler lecture we studed the statstcal assumptons underlyng the regresson model, ncludng the followng ponts: Formal statement of assumptons.
More informationMultiple Contrasts (Simulation)
Chapter 590 Multple Contrasts (Smulaton) Introducton Ths procedure uses smulaton to analyze the power and sgnfcance level of two multple-comparson procedures that perform two-sded hypothess tests of contrasts
More informationRandom Partitions of Samples
Random Parttons of Samples Klaus Th. Hess Insttut für Mathematsche Stochastk Technsche Unverstät Dresden Abstract In the present paper we construct a decomposton of a sample nto a fnte number of subsamples
More informationwhere I = (n x n) diagonal identity matrix with diagonal elements = 1 and off-diagonal elements = 0; and σ 2 e = variance of (Y X).
11.4.1 Estmaton of Multple Regresson Coeffcents In multple lnear regresson, we essentally solve n equatons for the p unnown parameters. hus n must e equal to or greater than p and n practce n should e
More informationDERIVATION OF THE PROBABILITY PLOT CORRELATION COEFFICIENT TEST STATISTICS FOR THE GENERALIZED LOGISTIC DISTRIBUTION
Internatonal Worshop ADVANCES IN STATISTICAL HYDROLOGY May 3-5, Taormna, Italy DERIVATION OF THE PROBABILITY PLOT CORRELATION COEFFICIENT TEST STATISTICS FOR THE GENERALIZED LOGISTIC DISTRIBUTION by Sooyoung
More informationA PROBABILITY-DRIVEN SEARCH ALGORITHM FOR SOLVING MULTI-OBJECTIVE OPTIMIZATION PROBLEMS
HCMC Unversty of Pedagogy Thong Nguyen Huu et al. A PROBABILITY-DRIVEN SEARCH ALGORITHM FOR SOLVING MULTI-OBJECTIVE OPTIMIZATION PROBLEMS Thong Nguyen Huu and Hao Tran Van Department of mathematcs-nformaton,
More informationBasically, if you have a dummy dependent variable you will be estimating a probability.
ECON 497: Lecture Notes 13 Page 1 of 1 Metropoltan State Unversty ECON 497: Research and Forecastng Lecture Notes 13 Dummy Dependent Varable Technques Studenmund Chapter 13 Bascally, f you have a dummy
More information