An Alternative Goodness-of-fit Test for Normality with Unknown Parameters

Size: px
Start display at page:

Download "An Alternative Goodness-of-fit Test for Normality with Unknown Parameters"

Transcription

1 Florida Iteratioal Uiversity FIU Digital Commos FIU Electroic Teses ad Dissertatios Uiversity Graduate Scool A Alterative Goodess-of-fit Test for Normality wit Ukow Parameters Weilig Si amadasi335@gmail.com DOI: 0.548/etd.FI40735 Follow tis ad additioal works at: ttp://digitalcommos.fiu.edu/etd Part of te Applied Statistics Commos, Statistical Models Commos, ad te Statistical Teory Commos Recommeded Citatio Si, Weilig, "A Alterative Goodess-of-fit Test for Normality wit Ukow Parameters" 04). FIU Electroic Teses ad Dissertatios. 63. ttp://digitalcommos.fiu.edu/etd/63 Tis work is brougt to you for free ad ope access by te Uiversity Graduate Scool at FIU Digital Commos. It as bee accepted for iclusio i FIU Electroic Teses ad Dissertatios by a autorized admiistrator of FIU Digital Commos. For more iformatio, please cotact dcc@fiu.edu.

2 FLORIDA INTERNATIONAL UNIVERSITY Miami, Florida AN ALTERNATIVE GOODNESS-OF-FIT TEST FOR NORMALITY WITH UNKNOWN PARAMETERS A tesis submitted i partial fulfillmet of te requiremets for te degree of MASTER OF SCIENCE i STATISTICS by Weilig Si 04

3 To: Iterim Dea Micael R. Heitaus College of Arts ad Scieces Tis tesis, writte by Weilig Si, ad etitled a Alterative Goodess-of-Fit Test for Normality wit Ukow Parameters, avig bee approved i respect to style ad itellectual cotet, is referred to you for judgmet. We ave read tis tesis ad recommed tat it be approved. Gauri Gai Florece George Date of Defese: November 4, 04 Te tesis of Weilig Si is approved. Zemi Ce, Major Professor Iterim Dea Micael R.Heitaus College of Arts ad Scieces Dea Laksmi N. Reddi Uiversity Graduate Scool Florida Iteratioal Uiversity, 04 ii

4 Copyrigt 04 by Weilig Si All rigts reserved. iii

5 DEDICATION I dedicate tis tesis to my parets. Te completio of tis tesis would ot be possible witout teir love, support ad ecouragemet. iv

6 ACKNOWLEDGMENTS I am deeply grateful to my major professor ad metor Dr. Zemi Ce. Wit Dr. Ce s cosistet support, guidace ad elp I fiised my master s study at Florida Iteratioal Uiversity. Dr. Ce s professio, resposibility ad carig eert a importat ifluece i my life. I beefited from Dr. Ce s wisdom ad approac to researc. All is virtues will be a valuable treasure of my wole life i te future. I would also like to tak te members of my tesis committee, Dr. Gauri Gai ad Dr. Florece George, for teir time, advice ad reviewig my tesis. I te ed, I would like to tak my parets for teir uderstadig, ecouragemet ad love trougout te years. Tey gave me te warmest family oe as ever wised for. v

7 ABSTRACT OF THE THESIS AN ALTERNATIVE GOODNESS-OF-FIT TEST FOR NORMALITY WITH UNKNOWN PARAMETERS by Weilig Si Florida Iteratioal Uiversity, 04 Miami, Florida Zemi Ce, Major Professor Goodess-of-fit tests ave bee studied by may researcers. Amog tem, a alterative statistical test for uiformity was proposed by Ce ad Ye 009). Te test was used by Xiog 00) to test ormality for te case tat bot locatio parameter ad scale parameter of te ormal distributio are kow. Te purpose of te preset tesis is to eted te result to te case tat te parameters are ukow. A table for te critical values of te test statistic is obtaied usig Mote Carlo simulatio. Te performace of te proposed test is compared wit te Sapiro-Wilk test ad te Kolmogorov-Smirov test. Mote-Carlo simulatio results sow tat proposed test performs better ta te Kolmogorov-Smirov test i may cases. Te Sapiro Wilk test is still te most powerful test altoug i some cases te test proposed i te preset researc performs better. vi

8 TABLE OF CONTENTS CHAPTER PAGE I. INTRODUCTION... Itroductio... Basic Idea... 4 II. METHODOLOGY... 7 Metodology of Tree Tests... 7 Power Study... 3 III. POWER COMPARISON... 5 Alterative Distributios... 5 Summary of Power Study... 9 IV. CONCLUSION AND DISCUSSION... 3 V. REFERENCES vii

9 LISTS OF TABLES TABLE PAGE. G Test Critical Value Table Power Compariso: V-Sape triagle =0.5) Power Compariso: V-Sape triagle =0.5) Power Compariso: V-Sape triagle =0.75) Power Compariso: Beta α=4, β=) Power Compariso: Beta α=0.5, β=0.5) Power Compariso: Beta α=, β=4) Power Compariso: Beta α=, β=) Power Compariso: Triagle =0.5) Power Compariso: Triagle =0.5) Power Compariso: Triagle =0.75) viii

10 LISTS OF FIGURES FIGURE PAGE. Power Compariso: V-Sape triagle =0.5) Power Compariso: V-Sape triagle =0.5) Power Compariso: V-Sape triagle =0.75) Power Compariso: Beta α=4, β=) Power Compariso: Beta α=0.5, β=0.5) Power Compariso: Beta α=, β=4) Power Compariso: Beta α=, β=) Power Compariso: Triagle =0.5) Power Compariso: Triagle =0.5) Power Compariso: Triagle =0.75) i

11 CHAPTER I INTRODUCTION. Itroductio Te goodess-of-fit test is a particular useful statistical model for testig weter observed data are represetative of a particular distributio. A goodess-of-fit test ca summarize te discrepacy betwee observed values ad te values epected uder ay give model. Numerous researc papers ave bee publised by scietists cocerig tese tests. Tere are may eistig test statistics icludig some commoly used goodess-of-fit tests suc as te Ci-squared test Pearso, 900), te Kolmogorov-Smirov test Kolmogorov, 933 ad Smirov,939), te Cramer-Vo Mises test Cramer,98 ad vo Mises), ad te Aderso-Darlig test Aderso ad Darlig, 95). All tese commoly used statistical tests ca be used to test ormality. Te Ci-squared test is te most importat member of te oparametric family of statistical tests because it as some attractive features icludig te fact tat it ca be applied to ay uivariate distributio ad calculated muc easier ta oter test statistics. It is used for quatitative ad bied data. For o-bied data, a istogram or frequecy table sould be costructed to put te data ito te categories before te Ci-squared test is used. However, te values of te Ci-squared test are affected by skewess ad kurtosis. Plus, it is sesitive to te sample size. Te Ci-squared test as reduced power especially for te small sample size uder 50. Te Kolmogorov-Smirov test K-S test) is also a oparametric test for te equality of cotiuous, oe-dimesioal probability distributios tat ca be used to

12 compare a sample wit a referece probability distributio, or to compare two samples. Te K-S test relies o te fact tat te value of te sample cumulative desity fuctio is asymptotically ormally distributed. Te Kolmogorov Smirov statistic quatifies a distace betwee te empirical distributio fuctio of te sample ad te cumulative distributio fuctio of te referece distributio, or betwee te empirical distributio fuctios of two samples. However, te K-S test teds to be more sesitive ear te ceter of te distributio ta it is at te tails of te distributio. Additioally, te most serious limitatio is tat te distributio must be fully specified. If locatio, scale, ad sape parameters are estimated from te data, te critical regio of te K-S test is o loger valid. Te K-S test statistic typically must be determied by simulatio. Various studies ave foud tat, eve i tis corrected form, te test is less powerful for testig ormality ta te Sapiro-Wilk test or te Aderso Darlig test. Te Aderso-Darlig test is a modificatio of te K-S test wic gives more weigt to te tails of te distributio ta te K-S test. Te K-S test is distributio free i te sese tat te critical values do ot deped o te specific distributio beig tested, wile te Aderso-Darlig test makes use of te specific distributio i calculatig critical values. Te Aderso-Darlig test as te advatage of allowig a more sesitive test ta te K-S test ad te disadvatage tat critical values must be calculated for eac distributio. Te Sapiro-Wilk test, proposed by Samuel Saford Sapiro ad Marti Wilk i 965, is used for testig ormality ad logormal distributios. It compares te observed cumulative frequecy distributio curve wit te epected cumulative frequecy curve. Te Sapiro-Wilk test is based o te ratio of te best estimator of

13 te variace to te usual corrected sum of squares estimator of te variace. Te Sapiro-Wilk test is ot as affected by ties as te Aderso-Darlig test, but is still biased by sample size. Power study of te most commoly used goodess-of-fit tests as bee coducted by may researcers icludig Sapiro, Wilk ad Ce 968), ad Aly ad Sayib 99). Some recet researc papers cocluded tat te Sapiro Wilk s test as te best power for a give sigificace, followed closely by Aderso-Darlig we comparig te Sapiro-Wilk, Kolmogorov Smirov, Lilliefors, ad Aderso- Darlig tests. Altoug it was metioed by Steele ad Caselig 009) tat oe of te eistig test statistics ca be regarded as te best test statistic, to maimize te power of te test statistic for ceckig ormality is still uder eplored ad modified by may statistics researcers. Te purpose of tis tesis is to compare tese tests. I te preset researc, te statistical tests proposed i Ce ad Ye 009) ad Xiog 00) will be adopted ad will be eteded to test ormality for te case tat te locatio parameter ad te scale parameter of te ormal distributio are bot ukow. A table for te critical values of te test statistics is provided usig Mote Carlo simulatio. Te performace of te ewly updated statistics are compared wit te Sapiro-Wilk s test ad te Kolmogorov-Smirov test.. Basic Idea Ce ad Ye 009) proposed a ew test statistic G statistic) for testig uiformity. Te test statistic ca be used to test if te uderlyig populatio distributio is a uiform distributio. O te basis of te probability itegral 3

14 trasformatio See F.N. David ad N.L. Joso, 948), te uderlyig populatio distributio ca be ay distributio. Suppose,,, are te observatios of a radom sample from a populatio distributio wit distributio fuctio F). Suppose also tat,,, are te correspodig order statistics. Te purpose is to test: H0 : F ) F0 ). H : F ) F0 ). It ca be see tat F ), F ),..., F ) are te ordered observatios of a radom ) ) ) sample from te U0, ) distributio. Te G test statistics ca be used to coduct te followig test procedure. Te test statistics ca be defied as G ), ),... ) ) ) i F 0 i) ) F 0 i) ) )). ) H 0 sould be rejected at sigificace level if G. ), ),..., ) ) G Here G is te upper critical value of te G statistic. Te value of G is calculated by te Mote Carlo simulatio. For simplicity, G,,..., ) ca also be epressed as ) ) ) G ), ),... ) ) i F 0 i) ) F 0 i) )). ) Epressio ) will be used i te Mote Carlo simulatio. Te test ca be used for testig ay ypotesized distributio. Te ormal distributio is merely a special case. 4

15 Te rage of te fuctio G,,..., ) is from 0 to ad te ) ) ) matematical epectatio ad variace of te test statistic ave bee give i Ce ad Ye 009). I Xiog s researc, te parameters icludig te epected value ad te stadard deviatio of te ormal distributio are assumed to be kow. We te parameters of te distributio are ukow, te test is o loger valid. To solve tis problem, Lilliefors idea is adopted ere to treat te case wit ukow parameters. Estimatio of te populatio mea ad populatio variace derived from te sample data is coducted before calculatig statistic. Te procedure i tis researc for simulatig te critical values of te statistic is summarized as follows: G test. Geerate a pseudo radom sample,,..., of size from te stadard ormal distributio;. Calculate te sample mea ) ad variace s ); 3. Fid te ordered values ), ),..., ) ad defie 0) 0 ad ) ; 4. Calculate F ), F ),..., F ). Here F ) is te cumulative distributio fuctio of te distributio; ) ) ) 5. Calculate te value of G,,..., ) usig Equatio ); ) ) ) 6. Repeat steps to 5 k times k=,000,000 i tis researc); 7. Sort all te values of G i ascedig order; 5

16 8. Fid te critical values wit = 0., 0.05, 0.0, 0.005, 0.00, tat is, to calculate te 90t, 95t, 99t, 99.5t ad 99.9t percetiles. Te procedure sow above uses te stadard ormal distributio. It will ot affect te simulatio result. I fact, it ca be sow tat it remais ivariat we te parameters of te ormal distributio cage. Suppose X as a ormal distributio wit te mea ad stadard deviatio. G ), ),..., ) ) 0 ) ) 0 ) )) F i F i i i t) t) ) i) = e dt) e dt) ). i Let z t. Te value of G,..., ) becomes ), ) ) i z i z ) z i) z e dt) e dt) ) z i z z z ) i) = e dt) e dt) ). i Tis is te same as i te case tat te stadard ormal distributio is picked. Te performace of te G test statistics is compared wit te Sapiro- Wilk s test ad te K-S test for testig ormality. Sice te Sapiro-Wilk s test ca 6

17 oly be used for testig ormal distributio ad logormal distributio, te ormality test for te power compariso is coducted i tis researc. Capter outlies te metod for calculatig tese tree test statistics. Te power study results of tese tree tests are aalyzed i Capter 3. Capter 4 cocludes te performace comparisos of te performace comparisos wit te G test, te Sapiro-Wilk s test ad te Kolmogorov-Smirov test. Mote Carlo simulatio was used to coduct power study for tese tree tests. Te computer programmig laguages used i tis researc are SAS/IML ad SAS/Base. 7

18 CHAPTER II METHODOLOGY Te performace of a test statistic ca be evaluated by a power study. To evaluate te performace of test statistic G,,..., ) proposed i tis tesis, ) ) ) various alterative distributios are used to fid te power of te test statistic. Te test power is compared wit te Sapiro-Wilk test ad te Kolmogorov-Smirov test i te preset researc. Te ull ypotesis assumes tat te uderlyig distributio is a ormal distributio, wile te alterative ypotesis assumes a distributio tat is ot a ormal distributio. Te alterative distributios used ere iclude te triagle distributios, V-saped triagle distributios, ad Beta distributios. G. Metodology of Tree Tests.. G test Suppose,...,, are te observatios of a radom sample from a populatio distributio wit a distributio fuctio F ). Suppose also tat ), ),..., are te correspodig order statistics. To test weter or ot te ) uderlyig distributio is a ormal distributio, te ull ad alterative ypoteses are H 0 : Te populatio distributio is a ormal distributio, H : Te populatio distributio is ot a ormal distributio. As discussed i Capter, we te test statistics G,,..., ) is used, H 0 ) ) ) sould be rejected at sigificat level if G, were ), ),..., ) ) G G is te 8

19 critical value of te G test statistic. Te value of G,,..., ) ca be calculated ) ) ) usig equatio ) for coveiece... Sapiro-Wilk Test Te Sapiro-Wilk test utilizes te ull ypotesis priciple to determie weter a sample,...,, come from a ormally distributed populatio. Te W test statistic is te ratio of te best estimator of te variace derived from te square of a liear combiatio of te order statistics) to te usual corrected sum of squares estimator of te variace Sapiro ad Wilk; 965). We is greater ta tree, te coefficiets to compute te liear combiatio of te order statistics ca be approimated by te metod of Roysto 99). Te statistic W is always greater ta zero ad less ta or equal to oe 0 W ). Small values of W lead to te rejectio of te ull ypotesis of ormality. Te distributio of W is igly skewed. Seemigly large values of W suc as 0.90) may be cosidered small will result i rejectig te ull ypotesis. Researc papers sow tat te Sapiro-Wilk test as te better performace compared wit te Aderso-Darlig test ad te Kolmogorov-Smirmov test. Te test statistic is W i i a i i ) i) ) were... 9

20 is te sample mea; is te order statistic; te costats ' a i s are give by a, a,..., a ) m T m V T V V m). Here m,..,, m m are te epected values of te order statistics of idepedet ad idetically distributed radom variables sampled from te stadard ormal distributio ad V is te covariace matri of tose order statistics. Reject H 0 if W is too small. To compute te value of test statistic W for a give complete radom sample,,...,, te procedure proposed i Sapiro &Wilk 965) is as follows: ) Order te observatios to obtai a ordered sample ) )... ). i ) i i ) Compute i) ), were is te sample mea. 3) a) If is eve, m, compute b a ) values of a ai are give i Sapiro ad Wilk 965). m i i i) i), were te b) If is odd, m te computatio is te same as te oe i 3)a) sice a 0 m. Tus b a ) ) )... a ) ) ), were te value of ), m m m te sample media, does ot eter te computatio of b. 4) Compute W b / i ) ). i 5) Compare wit te critical values from quatiles of te Sapiro-Wilk test for m 0

21 ormality table. If te calculated value of te test statistic W is smaller ta is rejected at te sigificace level...3 Kolmogorov-Smirov Test W, H 0 Te Kolmogorov-Smirov test statistic is defied as D D D sup F ) F ) ma D, D supf ) F ), supf ) F ). ), Here F is te cumulative distributio fuctio specified by te ull ypotesis. Let ) )... ) be te order statistic of ) )... ). Te te empirical distributio fuctio is i ) = F i,,..., ). for ) ) i i D ma sup i ma F 0i Similarly, i i F ) ma i 0 i ) ) ) i i i i ) i) i ) mama F i i) if ),0 F ) D i ma maf i) ),0 i If te calculated value of te test statistic is greater ta D, H 0 is rejected at te stated sigificace level. Here D is te critical value of te Kolmogorov- Smirrov test statistic.

22 Te popularity of te Kolmogorov-Smirov test relates to te fact tat te test does ot deped o te uderlyig cumulative distributio fuctio beig tested. However, its disadvatages also limit its applicatio sometimes because it oly applies to cotiuous distributios. Te test statistic becomes more sesitive ear te ceter of te distributio ta at te bot tails. All te parameters suc as locatio, scale ad sape of a distributio must be fully specified. If ot, te K-S test is o loger valid. It must be estimated by simulatio...4 Mote Carlo Simulatio Metod Mote Carlo simulatio is applied to geerate pseudo radom samples from a variety of alterative distributios to compare te power of te G test, Sapiro-Wilk test ad K-S test. Firstly, k pseudo radom samples of size are geerated from a specified distributio. I tis researc, V-sape triagle distributio, triagle distributio ad te Beta distributio are cose. For eac pseudo radom sample, te observatios,...,, are sorted ad become order statistics oe ), ),..., ). Te put te ordered statistics ito te formulas of te G test ad Sapiro-Wilk test, ad te K-S test. Te values of te test statistics ca be computed. By comparig te calculated value ad te critical values of te G test, Sapiro-Wilk test ad te K-S test, te rejectio rates ca be foud. I te preset researc, te sample sizes =5, 0, 0, 30, 40, 50 are selected to coduct Mote Carlo simulatio. Te umber of repetitios is selected to be k,000,000 to esure te accuracy of te power. Te procedure for calculatig te power is summarized as follows:

23 . Geerate a radom sample,,..., from te specified alterative distributio listed above;. Fid te ordered values ) )... ad defie ) 0) 0 ad ) ; 3. Calculate te correspodig F ), F ),..., F ),were F 0 is te 0 ) 0 ) 0 ) cumulative distributio fuctio of ormal ere ad te epected value ad stadard deviatio of te ormal distributio are calculated from te pseudo radom sample; 4. Calculate te value of G,,..., ) usig equatio ); ) ) ) 5. Compare te value of G,,..., ) wit te idicated critical value at ) ) ) sigificace level α=0.05, ad determie weter H 0 is rejected; 6. Usig te metod metioed above to calculate te value of te Sapiro-Wilk test statistic W ; 7. Compare te value of W wit W at te same sigificace level as i step 5, ad determie weter H 0 is rejected; 8. Usig te metod metioed above to calculate te value of te K-S test statistic D ; 9. Compare te value of D wit D at te same sigificace level, ad determie weter H 0 is rejected; 0. Repeat steps to 9,000,000 times;. Calculate te rejectio rates for te test. G test, te Sapiro-Wilk test ad te K-S 3

24 . Power Study Te power of a statistical test is te probability tat it correctly rejects te ull ypotesis we te ull ypotesis is false. Tat is, Power = P reject ull ypotesis/ ull ypotesis is false) wic ca be deoted as were β is te probability of committig type II error. Te power of te test statistic i tis researc ca be preseted as P H G G ) P G G H ) for G test; P H W W ) P W W H ) for Sapiro-Wilk test; P H D D ) P D D H ) for Kolmogorov-Smirov test; Te power estimate is ig meas tat te performace of te test is good. Statistical power may deped o a umber of factors. Some of tese factors may be particularly because of a specific testig situatio, but at a miimum, power always depeds o te followig tree factors: sample size, te sigificace level, ad te sesitivity of te data. Te rejectio rates of te tree test statistics are used as estimates of teir power i te tesis. Hig rejectio rates meas te power of te test statistic is ig. To evaluate te performace of te test statistic G,,..., ) proposed i tis ) ) ) researc, various alterative distributios icludig V-sape triagle distributio, Beta distributios ad triagle distributios are used to study te power of tis test statistic. 4

25 To fid te powers of te G test, Sapiro-Wilk test ad Kolmogorov-Smirov test, Mote Carlo simulatio was used to geerate pseudo radom samples from te various alterative distributios. To accomplis tis, k pseudo radom samples of size are geerated from a specified distributio. For eac pseudo radom sample, te observatio,...,, are sorted ad te sorted observatios become ), ),..., ).Te te values of te test statistics ca be computed for all tree tests. Fially, te rejectio rates or powers for teg test, Kolmogorov-Smirov test ad Sapiro-Wilk test are calculated. I te preset researc, te sample sizes = 5 to 50 are selected to coduct te Mote Carlo simulatio. I order to esure te accuracy of te power study, te umber of te repetitios is selected to be k=,000,000. 5

26 CHAPTER III POWER COMPARISON 3. Alterative Distributios 3.. V-saped Triagle Alterative Distributios Te probability desity fuctio of te V-saped triagle distributio is 0 ) f ) 0 elsewere. Here is a costat betwee 0 ad. Te followig V-saped triagle distributios are used i tis researc: Alterative Distributio Cosider = 0.5. Tis is a left-skewed V-saped triagle distributio. Te power compariso result uder tis V-saped triagle distributio is sow i Table. It ca be foud tat te G test is performs better ta Sapiro-Wilk test we sample size is 5. We sample size icreases, te power of tese tree tests also icreases. Compared wit te Kolmogorov-Smirov test, te G test outperforms te Kolmogorov-Smirov test i all cases. Te Sapiro-Wilk test performs better ta te oter two tests we te sample size becomes large. 6

27 Alterative Distributio Cosider = 0.5. Tis is a symmetric V-saped triagle distributio. Te power compariso result uder tis V-saped triagle distributio is sow i Table 3. Te result is similar to te previous case. Te G test performs better ta Sapiro- Wilk test we sample size is 5 ad is more powerful ta Kolmogorov-Smirov test for all sample sizes. We sample size icreases, te power of Sapiro-Wilk test icrease faster ta te oter two test statistics. Alterative Distributio 3 Cosider = Tis is a rigt-skewed V-saped triagle distributio. Te power compariso result uder tis V-saped triagle distributio is sow i Table 4. It sows tat te G test is performs better ta Sapiro-Wilk test also we sample size is 5. Sapiro-Wilk test performs very well we sample size icreases. Te test still outperforms te Kolmogorov-Smirov test i all cases. Sice tere is o fuctio call of V-sape triagle distributio i SAS, te followig propositio is eeded. Propositio: Suppose U is a radom variable wit uiform distributio o iterval 0, ). Te HU) as a V-saped triagle distributio wit parameter. Here Hu) is defied as G H u) u ) u 0 u u. Let X HU ). Te te cumulative distributio of X is 7

28 8. ) 0 ) 0 0 ) ) 0 ) ) ) ) ) P U P U U P u P U H P X P F X Tat is, ) F X Te te pdf of X deoted as ) f ) is:, 0 ) 0 ) elsewere f wic is te probability desity fuctio of V-sape triagle distributio wit parameter. 3.. Beta Alterative Distributios Te probability desity fuctio of te Beta distributio is 0). 0, 0 0 ) ) ) ) ) elsewere f Te followig special cases of te beta distributios are used i te power study:

29 Alterative Distributio 4 B4,) distributio. Tis is a left-skewed Beta distributio. Te power compariso result uder tis Beta alterative distributio is sow i Table 5. It ca be foud tat te G test performs better ta Sapiro-Wilk test uder small sample size case we = 5. We sample size icreases, te power of tese tree tests icreases too. Te Sapiro-Wilk test icreases more ta G test. Te Sapiro-Wilk test performs still well i most cases. However, te G test is better ta Sapiro-Wilk test for small sample size we 5. Kolmogorov-Smirov test outperforms te uder tis distributio. Alterative Distributio 5 G test B 0.5,0.5) distributio. Tis is a symmetric battub-saped Beta distributio. Te power compariso result uder tis Beta alterative distributio is sow i Table 6. It ca be foud from te table tat G test performs better ta Sapiro-Wilk test uder small sample size we =5. TeG test is still more powerful ta te Kolmogorov-Smirov test i all cases. Alterative Distributio 6 B,4) distributio. Tis is a rigt-skewed Beta distributio. Te power compariso result uder tis Beta alterative distributio is sow i Table 7. Te power compariso result sows tat G test performs better ta Sapiro-Wilk test uder small sample size we = 5. G test is more powerful ta te Kolmogorov- Smirov test we sample size is 0,0,30,40,50 ecept te case of sample size 5 9

30 Alterative Distributio 7 B,) distributio. Tis is actually a uiform distributio. Te power compariso result uder tis Beta alterative distributio is sow i Table 8. It ca be see from te table tat G test performs better ta Sapiro-Wilk test uder small sample size case icludig = 5 ad 0. Te Kolmogorov-Smirov test we sample size is 0, 0, 30, 40, 50. G test is also more powerful ta te 3..3 Triagle Alterative Distributio Te probability desity fuctio of te triagle distributio is ) f ) 0 0 elsewere. Here is a costat betwee 0 ad. Te followig triagle distributios are used i te power study. Alterative Distributio 8 Cosider =0.75. Tis is a left-skewed triagle distributio. Te power compariso result of tis alterative distributio is sow i Table 9. It ca be foud form te figure tat te G test performs better ta Sapiro-Wilk test we sample size =5 ad 0. Te Sapiro-Wilk size performs well we te sample size icreases. Kolmogorov-Smirov test performs better ta te G test i tis case. 0

31 Alterative Distributio 9 Cosider =0.5. Tis is a symmetric triagle distributio. Te power compariso result of tis alterative distributio is sow i Table 0. Te statistic performs te best i tis case. test we sample size is = 5,0,0,30. Alterative Distributio 0 G test G test is more powerful ta te Sapiro-Wilk Cosider =0.5. Tis is a rigt-skewed triagle distributio. Te power compariso result of tis alterative distributio is sow i Table. It ca be foud form te figure tat te G test performs better ta Sapiro-Wilk test we sample size =5. Te Kolmogorov-Smirov test performs better ta te G test i tis case. 3. Summary of Power Compariso From te above aalysis, we ca coclude te followig: For all te above alterative distributios, te G test statistics performs better ta te Sapiro-Wilk test for small sample size; For all te V-saped alterative distributios, icludig tree V-saped triagle distributios ad te left-skewed, battub saped, rigt-skewed Beta distributio, te G test statistics performs better ta te Sapiro-Wilk test for small sample size. For all te V-saped alterative distributios ad battub saped Beta distributios, te for all cases; G test statistics performs better ta te Kolmogorov-Smirov test

32 For te symmetric triagle alterative distributio, te G test statistics performs te best amog all tese cases coducted. It performs better ta te Sapiro- Wilk test does we sample sizes are 5,0,0,30 40; For te left-skewed ad rigt-skewed triagle alterative distributios, te G test performs better ta te Kolmogorov-Smirov test for small sample sizes; For te uiform alterative distributio, te G test statistics performs better ta te Kolmogorov-Smirov test ad sows similar power to te Kolmogorov- Smirov test we sample size icreases; For te uiform alterative distributios, te G test statistics performs better ta te Sapiro-Wilk test we sample size is less ta 0.

33 CHAPTER IV CONCLUSION AND DISCUSSION Te goodess-of-fit test is a statistical procedure to measure te discrepacy betwee observed values ad te values epected uder a specific distributio. Te goal of te goodess-of-test is to ceck weter te uderlyig probability distributio differs from a ypotesized distributio. Tere are may eistig test statistics icludig some commoly used goodess-of-fit tests suc as te Sapiro-Wilk test, Kolmogorov-Smirov test, Aderso-Darlig test ad Cramer-Vo Mises test. All tese commoly used statistical tests ca be used to test ormality. Amog tem, a alterative statistical test G test for uiformity was proposed by Ce ad Ye 009). Te test was used by Xiog 00) to test ormality for te case tat bot locatio parameter ad te scale parameter of te ormal distributio are kow. Te purpose of tis tesis is to eted te result to te case tat te parameters are ukow. Power study is coducted to compare te performace of tis proposed test wit te Sapiro-Wilk test ad te Kolmogorov-Smirov test. Te result of te Mote Carlo simulatio sows tat te G test performs better ta te Sapiro-Wilk test for small sample cases uder all te alterative distributios used i tis researc. Te G test also outperforms te Kolmogorov-Smirov test i most of cases. It ca also be foud tat te Sapiro-Wilk test performs better we te sample size icreases. Sice te computatio of te G test statistic is less complicated ta te Sapiro- Wilk test. Terefore, te G test statistics i tis tesis is wort beig recommeded to be a alterative approac for testig ormality, especially we te sample size is 3

34 small. However, we sample size icreases, its power does ot icrease as fast as te Sapiro-Wilk test does. Sice te Kolmogorov-Smirov test ad te G test ca be used to test ay ypotesized distributio, wile Sapiro-Wilk test ca be oly used for ceckig ormality ad logormality, te power compariso betwee tem of ormality test amog te tree tests is coducted i tis researc. Etedig te usage of te G test for te case tat te parameters of te distributio are ukow is useful. We ormality is tested, te mea ad te variace of te distributio are usually ukow. For testig weter or ot te uderlyig distributio of a data set belogs to a specified distributio family suc as te ormal distributio family, te epoetial distributio family ad so o, ca still be used eve if te parameters of te distributio are ukow. G test 4

35 Table Critical Value of G test Statistic G G G G G

36

37 Table Power Compariso : V-Sape triagle =0.5) G-Test SW-Test KS-Test Table 3 Power Compariso: V-Sape triagle =0.5) G-Test SW-Test KS-Test

38 Table 4 Power Compariso: V-triagle =0.75) G-Test SW-Test KS-Test Table 5 Power Compariso: Beta α= 4, β=) G-Test SW-Test KS-Test

39 Table 6 Power Compariso: Beta α= 0.5, β=0.5) G-Test SW-Test KS-Test Table 7 Power Compariso: Beta α=, β=4) G-Test SW-Test KS-Test

40 Table 8 Power Compariso: Beta α=, β=) G-Test SW-Test KS-Test Table 9 Power Compariso: Triagle =0.5) G-Test SW-Test KS-Test

41 Table 0 Power Compariso: Triagle =0.5) G-Test SW-Test KS-Test Table Power Compariso: Triagle =0.75) G-Test SW-Test KS-Test

42 Figure Power Compariso: V-Sape triagle =0.5) Figure Power Compariso: V-Sape triagle =0.5) 3

43 Figure 3 Power Compariso: V-triagle =0.75) Figure 4 Power Compariso: Beta 4, ) 33

44 Figure 5 Power Compariso: Beta 0.5, 0. 5) Figure 6 Power Compariso: Beta, 4) 34

45 Figure 7 Power Compariso: Beta, ) Figure 8 Power Compariso: Triagle =0.5) 35

46 Figure 9 Power Compariso: Triagle =0.5) Figure 0 Power Compariso: Triagle =0.75) 36

47 REFERENCES Ce, Z. ad Ye, C. 009) A alterative test for uiformity, Iteratioal Joural of Reliability, Quality ad Safety Egieerig, 6, Aderso, T. W. ad Darlig, D.A. 95) Asymptotic Teory of Certai Goodess of Fit Criteria Based o Stocastic Process, Aals of Matematical statistics, 3:93-. Kolmogorov, A.N. 933) Sulla Determiazioe Empirica di Ua Legge di Distribuzioe, Giorale dell Istituto degli attuari, 4:83-9 Steele,M,N.Smart, C.Hurst ad J. Caselig 009) Evaluatig te Statistical power of goodess-of-fit tests for ealt ad medicie survey data, 8t IMACS world cogress MODSIM 09: Iteratioal cogress o modellig ad simulatio: Iterfacig modellig ad simulatio wit matematical ad computatioal scieces. Cairs, Qld,Australia. F.N. David ad N.L. Joso 948) Te probability itegral trasformatio we parameters are estimated from te sample, Biometrika 06/948; 35Pts -):8-90. Lilliefors, W.H. 967) O te Kolmogorov-Smirov test for ormality wit mea ad variace ukow, Joural of te America Statistics Associatio, 6: Lilliefors, W.H. 969) O te Kolmogorov-Smirov test for te epoetial distributio wit ukow mea, Joural of te America Statistics Associatio, 64: Sapiro, S. S. ad Fracia, R. S. 97) Approimate Aalysis of Variace Tests for Normality, Joural of te America Statistics Associatio, 67:5-6. Sapiro, S. S. ad Wilk, M. B. 965) A Aalysis of avariace Test for Normality complete Samples), Biometrika, 5:59-6. Sapiro, S. S., Wilk, M. B. ad Ce, H. 968) A Comparative Study of Various Tests for Normality, Joural of te America Statistics Associatio, 63: Ju, X. 00). A alterative statistic for testig ormality. Upublised master s tesis, Florida Iteratioal Uiversity, Miami, Florida. 37

Power Comparison of Some Goodness-of-fit Tests

Power Comparison of Some Goodness-of-fit Tests Florida Iteratioal Uiversity FIU Digital Commos FIU Electroic Theses ad Dissertatios Uiversity Graduate School 7-6-2016 Power Compariso of Some Goodess-of-fit Tests Tiayi Liu tliu019@fiu.edu DOI: 10.25148/etd.FIDC000750

More information

Statistical Test for Multi-dimensional Uniformity

Statistical Test for Multi-dimensional Uniformity Florida Iteratioal Uiversity FIU Digital Commos FIU Electroic Theses ad Dissertatios Uiversity Graduate School -0-20 Statistical Test for Multi-dimesioal Uiformity Tieyog Hu Florida Iteratioal Uiversity,

More information

Goodness-Of-Fit For The Generalized Exponential Distribution. Abstract

Goodness-Of-Fit For The Generalized Exponential Distribution. Abstract Goodess-Of-Fit For The Geeralized Expoetial Distributio By Amal S. Hassa stitute of Statistical Studies & Research Cairo Uiversity Abstract Recetly a ew distributio called geeralized expoetial or expoetiated

More information

A goodness-of-fit test based on the empirical characteristic function and a comparison of tests for normality

A goodness-of-fit test based on the empirical characteristic function and a comparison of tests for normality A goodess-of-fit test based o the empirical characteristic fuctio ad a compariso of tests for ormality J. Marti va Zyl Departmet of Mathematical Statistics ad Actuarial Sciece, Uiversity of the Free State,

More information

Goodness-Of-Fit For The Generalized Exponential Distribution. Abstract

Goodness-Of-Fit For The Generalized Exponential Distribution. Abstract Goodess-Of-Fit For The Geeralized Expoetial Distributio By Amal S. Hassa stitute of Statistical Studies & Research Cairo Uiversity Abstract Recetly a ew distributio called geeralized expoetial or expoetiated

More information

Estimating the Population Mean using Stratified Double Ranked Set Sample

Estimating the Population Mean using Stratified Double Ranked Set Sample Estimatig te Populatio Mea usig Stratified Double Raked Set Sample Mamoud Syam * Kamarulzama Ibraim Amer Ibraim Al-Omari Qatar Uiversity Foudatio Program Departmet of Mat ad Computer P.O.Box (7) Doa State

More information

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n.

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n. Jauary 1, 2019 Resamplig Methods Motivatio We have so may estimators with the property θ θ d N 0, σ 2 We ca also write θ a N θ, σ 2 /, where a meas approximately distributed as Oce we have a cosistet estimator

More information

A proposed discrete distribution for the statistical modeling of

A proposed discrete distribution for the statistical modeling of It. Statistical Ist.: Proc. 58th World Statistical Cogress, 0, Dubli (Sessio CPS047) p.5059 A proposed discrete distributio for the statistical modelig of Likert data Kidd, Marti Cetre for Statistical

More information

The Sampling Distribution of the Maximum. Likelihood Estimators for the Parameters of. Beta-Binomial Distribution

The Sampling Distribution of the Maximum. Likelihood Estimators for the Parameters of. Beta-Binomial Distribution Iteratioal Mathematical Forum, Vol. 8, 2013, o. 26, 1263-1277 HIKARI Ltd, www.m-hikari.com http://d.doi.org/10.12988/imf.2013.3475 The Samplig Distributio of the Maimum Likelihood Estimators for the Parameters

More information

Confidence interval for the two-parameter exponentiated Gumbel distribution based on record values

Confidence interval for the two-parameter exponentiated Gumbel distribution based on record values Iteratioal Joural of Applied Operatioal Research Vol. 4 No. 1 pp. 61-68 Witer 2014 Joural homepage: www.ijorlu.ir Cofidece iterval for the two-parameter expoetiated Gumbel distributio based o record values

More information

Expectation and Variance of a random variable

Expectation and Variance of a random variable Chapter 11 Expectatio ad Variace of a radom variable The aim of this lecture is to defie ad itroduce mathematical Expectatio ad variace of a fuctio of discrete & cotiuous radom variables ad the distributio

More information

Lecture 7 Testing Nonlinear Inequality Restrictions 1

Lecture 7 Testing Nonlinear Inequality Restrictions 1 Eco 75 Lecture 7 Testig Noliear Iequality Restrictios I Lecture 6, we discussed te testig problems were te ull ypotesis is de ed by oliear equality restrictios: H : ( ) = versus H : ( ) 6= : () We sowed

More information

The performance of univariate goodness-of-fit tests for normality based on the empirical characteristic function in large samples

The performance of univariate goodness-of-fit tests for normality based on the empirical characteristic function in large samples The performace of uivariate goodess-of-fit tests for ormality based o the empirical characteristic fuctio i large samples By J. M. VAN ZYL Departmet of Mathematical Statistics ad Actuarial Sciece, Uiversity

More information

A RANK STATISTIC FOR NON-PARAMETRIC K-SAMPLE AND CHANGE POINT PROBLEMS

A RANK STATISTIC FOR NON-PARAMETRIC K-SAMPLE AND CHANGE POINT PROBLEMS J. Japa Statist. Soc. Vol. 41 No. 1 2011 67 73 A RANK STATISTIC FOR NON-PARAMETRIC K-SAMPLE AND CHANGE POINT PROBLEMS Yoichi Nishiyama* We cosider k-sample ad chage poit problems for idepedet data i a

More information

A statistical method to determine sample size to estimate characteristic value of soil parameters

A statistical method to determine sample size to estimate characteristic value of soil parameters A statistical method to determie sample size to estimate characteristic value of soil parameters Y. Hojo, B. Setiawa 2 ad M. Suzuki 3 Abstract Sample size is a importat factor to be cosidered i determiig

More information

Bootstrap Intervals of the Parameters of Lognormal Distribution Using Power Rule Model and Accelerated Life Tests

Bootstrap Intervals of the Parameters of Lognormal Distribution Using Power Rule Model and Accelerated Life Tests Joural of Moder Applied Statistical Methods Volume 5 Issue Article --5 Bootstrap Itervals of the Parameters of Logormal Distributio Usig Power Rule Model ad Accelerated Life Tests Mohammed Al-Ha Ebrahem

More information

Hypothesis Testing. Evaluation of Performance of Learned h. Issues. Trade-off Between Bias and Variance

Hypothesis Testing. Evaluation of Performance of Learned h. Issues. Trade-off Between Bias and Variance Hypothesis Testig Empirically evaluatig accuracy of hypotheses: importat activity i ML. Three questios: Give observed accuracy over a sample set, how well does this estimate apply over additioal samples?

More information

Nonparametric regression: minimax upper and lower bounds

Nonparametric regression: minimax upper and lower bounds Capter 4 Noparametric regressio: miimax upper ad lower bouds 4. Itroductio We cosider oe of te two te most classical o-parametric problems i tis example: estimatig a regressio fuctio o a subset of te real

More information

NCSS Statistical Software. Tolerance Intervals

NCSS Statistical Software. Tolerance Intervals Chapter 585 Itroductio This procedure calculates oe-, ad two-, sided tolerace itervals based o either a distributio-free (oparametric) method or a method based o a ormality assumptio (parametric). A two-sided

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2016 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

1 Inferential Methods for Correlation and Regression Analysis

1 Inferential Methods for Correlation and Regression Analysis 1 Iferetial Methods for Correlatio ad Regressio Aalysis I the chapter o Correlatio ad Regressio Aalysis tools for describig bivariate cotiuous data were itroduced. The sample Pearso Correlatio Coefficiet

More information

Properties and Hypothesis Testing

Properties and Hypothesis Testing Chapter 3 Properties ad Hypothesis Testig 3.1 Types of data The regressio techiques developed i previous chapters ca be applied to three differet kids of data. 1. Cross-sectioal data. 2. Time series data.

More information

Modified Lilliefors Test

Modified Lilliefors Test Joural of Moder Applied Statistical Methods Volume 14 Issue 1 Article 9 5-1-2015 Modified Lilliefors Test Achut Adhikari Uiversity of Norther Colorado, adhi2939@gmail.com Jay Schaffer Uiversity of Norther

More information

Lecture 2: Monte Carlo Simulation

Lecture 2: Monte Carlo Simulation STAT/Q SCI 43: Itroductio to Resamplig ethods Sprig 27 Istructor: Ye-Chi Che Lecture 2: ote Carlo Simulatio 2 ote Carlo Itegratio Assume we wat to evaluate the followig itegratio: e x3 dx What ca we do?

More information

Sample Size Determination (Two or More Samples)

Sample Size Determination (Two or More Samples) Sample Sie Determiatio (Two or More Samples) STATGRAPHICS Rev. 963 Summary... Data Iput... Aalysis Summary... 5 Power Curve... 5 Calculatios... 6 Summary This procedure determies a suitable sample sie

More information

Trimmed Mean as an Adaptive Robust Estimator of a Location Parameter for Weibull Distribution

Trimmed Mean as an Adaptive Robust Estimator of a Location Parameter for Weibull Distribution World Academy of Sciece Egieerig ad echology Iteratioal Joural of Mathematical ad Computatioal Scieces Vol: No:6 008 rimmed Mea as a Adaptive Robust Estimator of a Locatio Parameter for Weibull Distributio

More information

Double Stage Shrinkage Estimator of Two Parameters. Generalized Exponential Distribution

Double Stage Shrinkage Estimator of Two Parameters. Generalized Exponential Distribution Iteratioal Mathematical Forum, Vol., 3, o. 3, 3-53 HIKARI Ltd, www.m-hikari.com http://dx.doi.org/.9/imf.3.335 Double Stage Shrikage Estimator of Two Parameters Geeralized Expoetial Distributio Alaa M.

More information

Improved Estimation of Rare Sensitive Attribute in a Stratified Sampling Using Poisson Distribution

Improved Estimation of Rare Sensitive Attribute in a Stratified Sampling Using Poisson Distribution Ope Joural of Statistics, 06, 6, 85-95 Publised Olie February 06 i SciRes ttp://wwwscirporg/joural/ojs ttp://dxdoiorg/0436/ojs0660 Improved Estimatio of Rare Sesitive ttribute i a Stratified Samplig Usig

More information

6 Sample Size Calculations

6 Sample Size Calculations 6 Sample Size Calculatios Oe of the major resposibilities of a cliical trial statisticia is to aid the ivestigators i determiig the sample size required to coduct a study The most commo procedure for determiig

More information

Random Variables, Sampling and Estimation

Random Variables, Sampling and Estimation Chapter 1 Radom Variables, Samplig ad Estimatio 1.1 Itroductio This chapter will cover the most importat basic statistical theory you eed i order to uderstad the ecoometric material that will be comig

More information

CEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering

CEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering CEE 5 Autum 005 Ucertaity Cocepts for Geotechical Egieerig Basic Termiology Set A set is a collectio of (mutually exclusive) objects or evets. The sample space is the (collectively exhaustive) collectio

More information

Lecture 6 Simple alternatives and the Neyman-Pearson lemma

Lecture 6 Simple alternatives and the Neyman-Pearson lemma STATS 00: Itroductio to Statistical Iferece Autum 06 Lecture 6 Simple alteratives ad the Neyma-Pearso lemma Last lecture, we discussed a umber of ways to costruct test statistics for testig a simple ull

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

32 estimating the cumulative distribution function

32 estimating the cumulative distribution function 32 estimatig the cumulative distributio fuctio 4.6 types of cofidece itervals/bads Let F be a class of distributio fuctios F ad let θ be some quatity of iterest, such as the mea of F or the whole fuctio

More information

Control Charts for Mean for Non-Normally Correlated Data

Control Charts for Mean for Non-Normally Correlated Data Joural of Moder Applied Statistical Methods Volume 16 Issue 1 Article 5 5-1-017 Cotrol Charts for Mea for No-Normally Correlated Data J. R. Sigh Vikram Uiversity, Ujjai, Idia Ab Latif Dar School of Studies

More information

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING Lectures MODULE 5 STATISTICS II. Mea ad stadard error of sample data. Biomial distributio. Normal distributio 4. Samplig 5. Cofidece itervals

More information

ALLOCATING SAMPLE TO STRATA PROPORTIONAL TO AGGREGATE MEASURE OF SIZE WITH BOTH UPPER AND LOWER BOUNDS ON THE NUMBER OF UNITS IN EACH STRATUM

ALLOCATING SAMPLE TO STRATA PROPORTIONAL TO AGGREGATE MEASURE OF SIZE WITH BOTH UPPER AND LOWER BOUNDS ON THE NUMBER OF UNITS IN EACH STRATUM ALLOCATING SAPLE TO STRATA PROPORTIONAL TO AGGREGATE EASURE OF SIZE WIT BOT UPPER AND LOWER BOUNDS ON TE NUBER OF UNITS IN EAC STRATU Lawrece R. Erst ad Cristoper J. Guciardo Erst_L@bls.gov, Guciardo_C@bls.gov

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

Power and Type II Error

Power and Type II Error Statistical Methods I (EXST 7005) Page 57 Power ad Type II Error Sice we do't actually kow the value of the true mea (or we would't be hypothesizig somethig else), we caot kow i practice the type II error

More information

Estimation of Gumbel Parameters under Ranked Set Sampling

Estimation of Gumbel Parameters under Ranked Set Sampling Joural of Moder Applied Statistical Methods Volume 13 Issue 2 Article 11-2014 Estimatio of Gumbel Parameters uder Raked Set Samplig Omar M. Yousef Al Balqa' Applied Uiversity, Zarqa, Jorda, abuyaza_o@yahoo.com

More information

Overview. p 2. Chapter 9. Pooled Estimate of. q = 1 p. Notation for Two Proportions. Inferences about Two Proportions. Assumptions

Overview. p 2. Chapter 9. Pooled Estimate of. q = 1 p. Notation for Two Proportions. Inferences about Two Proportions. Assumptions Chapter 9 Slide Ifereces from Two Samples 9- Overview 9- Ifereces about Two Proportios 9- Ifereces about Two Meas: Idepedet Samples 9-4 Ifereces about Matched Pairs 9-5 Comparig Variatio i Two Samples

More information

Computation of Hahn Moments for Large Size Images

Computation of Hahn Moments for Large Size Images Joural of Computer Sciece 6 (9): 37-4, ISSN 549-3636 Sciece Publicatios Computatio of Ha Momets for Large Size Images A. Vekataramaa ad P. Aat Raj Departmet of Electroics ad Commuicatio Egieerig, Quli

More information

Access to the published version may require journal subscription. Published with permission from: Elsevier.

Access to the published version may require journal subscription. Published with permission from: Elsevier. This is a author produced versio of a paper published i Statistics ad Probability Letters. This paper has bee peer-reviewed, it does ot iclude the joural pagiatio. Citatio for the published paper: Forkma,

More information

Stat 319 Theory of Statistics (2) Exercises

Stat 319 Theory of Statistics (2) Exercises Kig Saud Uiversity College of Sciece Statistics ad Operatios Research Departmet Stat 39 Theory of Statistics () Exercises Refereces:. Itroductio to Mathematical Statistics, Sixth Editio, by R. Hogg, J.

More information

Celestin Chameni Nembua University of YaoundéII, Cameroon. Abstract

Celestin Chameni Nembua University of YaoundéII, Cameroon. Abstract A ote o te decompositio of te coefficiet of variatio squared: comparig etropy ad Dagum's metods Celesti Camei Nembua Uiversity of YaoudéII, Cameroo Abstract Te aim of tis paper is to propose a ew decompositio

More information

1036: Probability & Statistics

1036: Probability & Statistics 036: Probability & Statistics Lecture 0 Oe- ad Two-Sample Tests of Hypotheses 0- Statistical Hypotheses Decisio based o experimetal evidece whether Coffee drikig icreases the risk of cacer i humas. A perso

More information

GG313 GEOLOGICAL DATA ANALYSIS

GG313 GEOLOGICAL DATA ANALYSIS GG313 GEOLOGICAL DATA ANALYSIS 1 Testig Hypothesis GG313 GEOLOGICAL DATA ANALYSIS LECTURE NOTES PAUL WESSEL SECTION TESTING OF HYPOTHESES Much of statistics is cocered with testig hypothesis agaist data

More information

ANOTHER WEIGHTED WEIBULL DISTRIBUTION FROM AZZALINI S FAMILY

ANOTHER WEIGHTED WEIBULL DISTRIBUTION FROM AZZALINI S FAMILY ANOTHER WEIGHTED WEIBULL DISTRIBUTION FROM AZZALINI S FAMILY Sulema Nasiru, MSc. Departmet of Statistics, Faculty of Mathematical Scieces, Uiversity for Developmet Studies, Navrogo, Upper East Regio, Ghaa,

More information

A new distribution-free quantile estimator

A new distribution-free quantile estimator Biometrika (1982), 69, 3, pp. 635-40 Prited i Great Britai 635 A ew distributio-free quatile estimator BY FRANK E. HARRELL Cliical Biostatistics, Duke Uiversity Medical Ceter, Durham, North Carolia, U.S.A.

More information

HYPOTHESIS TESTS FOR ONE POPULATION MEAN WORKSHEET MTH 1210, FALL 2018

HYPOTHESIS TESTS FOR ONE POPULATION MEAN WORKSHEET MTH 1210, FALL 2018 HYPOTHESIS TESTS FOR ONE POPULATION MEAN WORKSHEET MTH 1210, FALL 2018 We are resposible for 2 types of hypothesis tests that produce ifereces about the ukow populatio mea, µ, each of which has 3 possible

More information

Approximate Confidence Interval for the Reciprocal of a Normal Mean with a Known Coefficient of Variation

Approximate Confidence Interval for the Reciprocal of a Normal Mean with a Known Coefficient of Variation Metodološki zvezki, Vol. 13, No., 016, 117-130 Approximate Cofidece Iterval for the Reciprocal of a Normal Mea with a Kow Coefficiet of Variatio Wararit Paichkitkosolkul 1 Abstract A approximate cofidece

More information

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4 MATH 30: Probability ad Statistics 9. Estimatio ad Testig of Parameters Estimatio ad Testig of Parameters We have bee dealig situatios i which we have full kowledge of the distributio of a radom variable.

More information

This is an introductory course in Analysis of Variance and Design of Experiments.

This is an introductory course in Analysis of Variance and Design of Experiments. 1 Notes for M 384E, Wedesday, Jauary 21, 2009 (Please ote: I will ot pass out hard-copy class otes i future classes. If there are writte class otes, they will be posted o the web by the ight before class

More information

Math 152. Rumbos Fall Solutions to Review Problems for Exam #2. Number of Heads Frequency

Math 152. Rumbos Fall Solutions to Review Problems for Exam #2. Number of Heads Frequency Math 152. Rumbos Fall 2009 1 Solutios to Review Problems for Exam #2 1. I the book Experimetatio ad Measuremet, by W. J. Youde ad published by the by the Natioal Sciece Teachers Associatio i 1962, the

More information

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample. Statistical Iferece (Chapter 10) Statistical iferece = lear about a populatio based o the iformatio provided by a sample. Populatio: The set of all values of a radom variable X of iterest. Characterized

More information

Chapter 6 Principles of Data Reduction

Chapter 6 Principles of Data Reduction Chapter 6 for BST 695: Special Topics i Statistical Theory. Kui Zhag, 0 Chapter 6 Priciples of Data Reductio Sectio 6. Itroductio Goal: To summarize or reduce the data X, X,, X to get iformatio about a

More information

t distribution [34] : used to test a mean against an hypothesized value (H 0 : µ = µ 0 ) or the difference

t distribution [34] : used to test a mean against an hypothesized value (H 0 : µ = µ 0 ) or the difference EXST30 Backgroud material Page From the textbook The Statistical Sleuth Mea [0]: I your text the word mea deotes a populatio mea (µ) while the work average deotes a sample average ( ). Variace [0]: The

More information

1 Review of Probability & Statistics

1 Review of Probability & Statistics 1 Review of Probability & Statistics a. I a group of 000 people, it has bee reported that there are: 61 smokers 670 over 5 960 people who imbibe (drik alcohol) 86 smokers who imbibe 90 imbibers over 5

More information

Parameter, Statistic and Random Samples

Parameter, Statistic and Random Samples Parameter, Statistic ad Radom Samples A parameter is a umber that describes the populatio. It is a fixed umber, but i practice we do ot kow its value. A statistic is a fuctio of the sample data, i.e.,

More information

Lecture 33: Bootstrap

Lecture 33: Bootstrap Lecture 33: ootstrap Motivatio To evaluate ad compare differet estimators, we eed cosistet estimators of variaces or asymptotic variaces of estimators. This is also importat for hypothesis testig ad cofidece

More information

5. Likelihood Ratio Tests

5. Likelihood Ratio Tests 1 of 5 7/29/2009 3:16 PM Virtual Laboratories > 9. Hy pothesis Testig > 1 2 3 4 5 6 7 5. Likelihood Ratio Tests Prelimiaries As usual, our startig poit is a radom experimet with a uderlyig sample space,

More information

1 of 7 7/16/2009 6:06 AM Virtual Laboratories > 6. Radom Samples > 1 2 3 4 5 6 7 6. Order Statistics Defiitios Suppose agai that we have a basic radom experimet, ad that X is a real-valued radom variable

More information

Since X n /n P p, we know that X n (n. Xn (n X n ) Using the asymptotic result above to obtain an approximation for fixed n, we obtain

Since X n /n P p, we know that X n (n. Xn (n X n ) Using the asymptotic result above to obtain an approximation for fixed n, we obtain Assigmet 9 Exercise 5.5 Let X biomial, p, where p 0, 1 is ukow. Obtai cofidece itervals for p i two differet ways: a Sice X / p d N0, p1 p], the variace of the limitig distributio depeds oly o p. Use the

More information

Chapter 12 Correlation

Chapter 12 Correlation Chapter Correlatio Correlatio is very similar to regressio with oe very importat differece. Regressio is used to explore the relatioship betwee a idepedet variable ad a depedet variable, whereas correlatio

More information

Differentiation Techniques 1: Power, Constant Multiple, Sum and Difference Rules

Differentiation Techniques 1: Power, Constant Multiple, Sum and Difference Rules Differetiatio Teciques : Power, Costat Multiple, Sum ad Differece Rules 97 Differetiatio Teciques : Power, Costat Multiple, Sum ad Differece Rules Model : Fidig te Equatio of f '() from a Grap of f ()

More information

7-1. Chapter 4. Part I. Sampling Distributions and Confidence Intervals

7-1. Chapter 4. Part I. Sampling Distributions and Confidence Intervals 7-1 Chapter 4 Part I. Samplig Distributios ad Cofidece Itervals 1 7- Sectio 1. Samplig Distributio 7-3 Usig Statistics Statistical Iferece: Predict ad forecast values of populatio parameters... Test hypotheses

More information

Investigating the Significance of a Correlation Coefficient using Jackknife Estimates

Investigating the Significance of a Correlation Coefficient using Jackknife Estimates Iteratioal Joural of Scieces: Basic ad Applied Research (IJSBAR) ISSN 2307-4531 (Prit & Olie) http://gssrr.org/idex.php?joural=jouralofbasicadapplied ---------------------------------------------------------------------------------------------------------------------------

More information

Statistics 511 Additional Materials

Statistics 511 Additional Materials Cofidece Itervals o mu Statistics 511 Additioal Materials This topic officially moves us from probability to statistics. We begi to discuss makig ifereces about the populatio. Oe way to differetiate probability

More information

A NEW METHOD FOR CONSTRUCTING APPROXIMATE CONFIDENCE INTERVALS FOR M-ESTU1ATES. Dennis D. Boos

A NEW METHOD FOR CONSTRUCTING APPROXIMATE CONFIDENCE INTERVALS FOR M-ESTU1ATES. Dennis D. Boos .- A NEW METHOD FOR CONSTRUCTING APPROXIMATE CONFIDENCE INTERVALS FOR M-ESTU1ATES by Deis D. Boos Departmet of Statistics North Carolia State Uiversity Istitute of Statistics Mimeo Series #1198 September,

More information

If, for instance, we were required to test whether the population mean μ could be equal to a certain value μ

If, for instance, we were required to test whether the population mean μ could be equal to a certain value μ STATISTICAL INFERENCE INTRODUCTION Statistical iferece is that brach of Statistics i which oe typically makes a statemet about a populatio based upo the results of a sample. I oesample testig, we essetially

More information

Chapter 2 Descriptive Statistics

Chapter 2 Descriptive Statistics Chapter 2 Descriptive Statistics Statistics Most commoly, statistics refers to umerical data. Statistics may also refer to the process of collectig, orgaizig, presetig, aalyzig ad iterpretig umerical data

More information

PROBABILITY DISTRIBUTION RELATIONSHIPS. Y.H. Abdelkader, Z.A. Al-Marzouq 1. INTRODUCTION

PROBABILITY DISTRIBUTION RELATIONSHIPS. Y.H. Abdelkader, Z.A. Al-Marzouq 1. INTRODUCTION STATISTICA, ao LXX,., 00 PROBABILITY DISTRIBUTION RELATIONSHIPS. INTRODUCTION I spite of the variety of the probability distributios, may of them are related to each other by differet kids of relatioship.

More information

Using the IML Procedure to Examine the Efficacy of a New Control Charting Technique

Using the IML Procedure to Examine the Efficacy of a New Control Charting Technique Paper 2894-2018 Usig the IML Procedure to Examie the Efficacy of a New Cotrol Chartig Techique Austi Brow, M.S., Uiversity of Norther Colorado; Bryce Whitehead, M.S., Uiversity of Norther Colorado ABSTRACT

More information

MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND.

MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND. XI-1 (1074) MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND. R. E. D. WOOLSEY AND H. S. SWANSON XI-2 (1075) STATISTICAL DECISION MAKING Advaced

More information

Confidence Interval for Standard Deviation of Normal Distribution with Known Coefficients of Variation

Confidence Interval for Standard Deviation of Normal Distribution with Known Coefficients of Variation Cofidece Iterval for tadard Deviatio of Normal Distributio with Kow Coefficiets of Variatio uparat Niwitpog Departmet of Applied tatistics, Faculty of Applied ciece Kig Mogkut s Uiversity of Techology

More information

CHAPTER 4 BIVARIATE DISTRIBUTION EXTENSION

CHAPTER 4 BIVARIATE DISTRIBUTION EXTENSION CHAPTER 4 BIVARIATE DISTRIBUTION EXTENSION 4. Itroductio Numerous bivariate discrete distributios have bee defied ad studied (see Mardia, 97 ad Kocherlakota ad Kocherlakota, 99) based o various methods

More information

Kolmogorov-Smirnov type Tests for Local Gaussianity in High-Frequency Data

Kolmogorov-Smirnov type Tests for Local Gaussianity in High-Frequency Data Proceedigs 59th ISI World Statistics Cogress, 5-30 August 013, Hog Kog (Sessio STS046) p.09 Kolmogorov-Smirov type Tests for Local Gaussiaity i High-Frequecy Data George Tauche, Duke Uiversity Viktor Todorov,

More information

Lecture 7: Properties of Random Samples

Lecture 7: Properties of Random Samples Lecture 7: Properties of Radom Samples 1 Cotiued From Last Class Theorem 1.1. Let X 1, X,...X be a radom sample from a populatio with mea µ ad variace σ

More information

University of California, Los Angeles Department of Statistics. Hypothesis testing

University of California, Los Angeles Department of Statistics. Hypothesis testing Uiversity of Califoria, Los Ageles Departmet of Statistics Statistics 100B Elemets of a hypothesis test: Hypothesis testig Istructor: Nicolas Christou 1. Null hypothesis, H 0 (claim about µ, p, σ 2, µ

More information

Read through these prior to coming to the test and follow them when you take your test.

Read through these prior to coming to the test and follow them when you take your test. Math 143 Sprig 2012 Test 2 Iformatio 1 Test 2 will be give i class o Thursday April 5. Material Covered The test is cummulative, but will emphasize the recet material (Chapters 6 8, 10 11, ad Sectios 12.1

More information

Hazard Rate Function Estimation Using Weibull Kernel

Hazard Rate Function Estimation Using Weibull Kernel Ope Joural of Statistics, 4, 4, 65-66 Publised Olie September 4 i SciRes. ttp://www.scirp.org/joural/ojs ttp://d.doi.org/.46/ojs.4.486 Hazard Rate Fuctio Estimatio Usig Weibull Kerel Raid B. Sala, Hazem

More information

Last time: Moments of the Poisson distribution from its generating function. Example: Using telescope to measure intensity of an object

Last time: Moments of the Poisson distribution from its generating function. Example: Using telescope to measure intensity of an object 6.3 Stochastic Estimatio ad Cotrol, Fall 004 Lecture 7 Last time: Momets of the Poisso distributio from its geeratig fuctio. Gs () e dg µ e ds dg µ ( s) µ ( s) µ ( s) µ e ds dg X µ ds X s dg dg + ds ds

More information

GUIDE FOR THE USE OF THE DECISION SUPPORT SYSTEM (DSS)*

GUIDE FOR THE USE OF THE DECISION SUPPORT SYSTEM (DSS)* GUIDE FOR THE USE OF THE DECISION SUPPORT SYSTEM (DSS)* *Note: I Frech SAD (Système d Aide à la Décisio) 1. Itroductio to the DSS Eightee statistical distributios are available i HYFRAN-PLUS software to

More information

Bayesian and E- Bayesian Method of Estimation of Parameter of Rayleigh Distribution- A Bayesian Approach under Linex Loss Function

Bayesian and E- Bayesian Method of Estimation of Parameter of Rayleigh Distribution- A Bayesian Approach under Linex Loss Function Iteratioal Joural of Statistics ad Systems ISSN 973-2675 Volume 12, Number 4 (217), pp. 791-796 Research Idia Publicatios http://www.ripublicatio.com Bayesia ad E- Bayesia Method of Estimatio of Parameter

More information

R. van Zyl 1, A.J. van der Merwe 2. Quintiles International, University of the Free State

R. van Zyl 1, A.J. van der Merwe 2. Quintiles International, University of the Free State Bayesia Cotrol Charts for the Two-parameter Expoetial Distributio if the Locatio Parameter Ca Take o Ay Value Betwee Mius Iity ad Plus Iity R. va Zyl, A.J. va der Merwe 2 Quitiles Iteratioal, ruaavz@gmail.com

More information

A New Hybrid in the Nonlinear Part of Adomian Decomposition Method for Initial Value Problem of Ordinary Differential Equation

A New Hybrid in the Nonlinear Part of Adomian Decomposition Method for Initial Value Problem of Ordinary Differential Equation Joural of Matematics Researc; Vol No ; ISSN - E-ISSN - Publised b Caadia Ceter of Sciece ad Educatio A New Hbrid i te Noliear Part of Adomia Decompositio Metod for Iitial Value Problem of Ordiar Differetial

More information

4 Conditional Distribution Estimation

4 Conditional Distribution Estimation 4 Coditioal Distributio Estimatio 4. Estimators Te coditioal distributio (CDF) of y i give X i = x is F (y j x) = P (y i y j X i = x) = E ( (y i y) j X i = x) : Tis is te coditioal mea of te radom variable

More information

Lecture 5: Parametric Hypothesis Testing: Comparing Means. GENOME 560, Spring 2016 Doug Fowler, GS

Lecture 5: Parametric Hypothesis Testing: Comparing Means. GENOME 560, Spring 2016 Doug Fowler, GS Lecture 5: Parametric Hypothesis Testig: Comparig Meas GENOME 560, Sprig 2016 Doug Fowler, GS (dfowler@uw.edu) 1 Review from last week What is a cofidece iterval? 2 Review from last week What is a cofidece

More information

Continuous Data that can take on any real number (time/length) based on sample data. Categorical data can only be named or categorised

Continuous Data that can take on any real number (time/length) based on sample data. Categorical data can only be named or categorised Questio 1. (Topics 1-3) A populatio cosists of all the members of a group about which you wat to draw a coclusio (Greek letters (μ, σ, Ν) are used) A sample is the portio of the populatio selected for

More information

LECTURE 2 LEAST SQUARES CROSS-VALIDATION FOR KERNEL DENSITY ESTIMATION

LECTURE 2 LEAST SQUARES CROSS-VALIDATION FOR KERNEL DENSITY ESTIMATION Jauary 3 07 LECTURE LEAST SQUARES CROSS-VALIDATION FOR ERNEL DENSITY ESTIMATION Noparametric kerel estimatio is extremely sesitive to te coice of badwidt as larger values of result i averagig over more

More information

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY GRADUATE DIPLOMA, 016 MODULE : Statistical Iferece Time allowed: Three hours Cadidates should aswer FIVE questios. All questios carry equal marks. The umber

More information

Exam II Review. CEE 3710 November 15, /16/2017. EXAM II Friday, November 17, in class. Open book and open notes.

Exam II Review. CEE 3710 November 15, /16/2017. EXAM II Friday, November 17, in class. Open book and open notes. Exam II Review CEE 3710 November 15, 017 EXAM II Friday, November 17, i class. Ope book ad ope otes. Focus o material covered i Homeworks #5 #8, Note Packets #10 19 1 Exam II Topics **Will emphasize material

More information

The Distribution of the Concentration Ratio for Samples from a Uniform Population

The Distribution of the Concentration Ratio for Samples from a Uniform Population Applied Mathematics, 05, 6, 57-70 Published Olie Jauary 05 i SciRes. http://www.scirp.or/joural/am http://dx.doi.or/0.436/am.05.6007 The Distributio of the Cocetratio Ratio for Samples from a Uiform Populatio

More information

Uniformly Consistency of the Cauchy-Transformation Kernel Density Estimation Underlying Strong Mixing

Uniformly Consistency of the Cauchy-Transformation Kernel Density Estimation Underlying Strong Mixing Appl. Mat. If. Sci. 7, No. L, 5-9 (203) 5 Applied Matematics & Iformatio Scieces A Iteratioal Joural c 203 NSP Natural Scieces Publisig Cor. Uiformly Cosistecy of te Caucy-Trasformatio Kerel Desity Estimatio

More information

Chapter 13, Part A Analysis of Variance and Experimental Design

Chapter 13, Part A Analysis of Variance and Experimental Design Slides Prepared by JOHN S. LOUCKS St. Edward s Uiversity Slide 1 Chapter 13, Part A Aalysis of Variace ad Eperimetal Desig Itroductio to Aalysis of Variace Aalysis of Variace: Testig for the Equality of

More information

Data Analysis and Statistical Methods Statistics 651

Data Analysis and Statistical Methods Statistics 651 Data Aalysis ad Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasii/teachig.html Suhasii Subba Rao Review of testig: Example The admistrator of a ursig home wats to do a time ad motio

More information

Common Large/Small Sample Tests 1/55

Common Large/Small Sample Tests 1/55 Commo Large/Small Sample Tests 1/55 Test of Hypothesis for the Mea (σ Kow) Covert sample result ( x) to a z value Hypothesis Tests for µ Cosider the test H :μ = μ H 1 :μ > μ σ Kow (Assume the populatio

More information

The Sample Variance Formula: A Detailed Study of an Old Controversy

The Sample Variance Formula: A Detailed Study of an Old Controversy The Sample Variace Formula: A Detailed Study of a Old Cotroversy Ky M. Vu PhD. AuLac Techologies Ic. c 00 Email: kymvu@aulactechologies.com Abstract The two biased ad ubiased formulae for the sample variace

More information

Tests of Hypotheses Based on a Single Sample (Devore Chapter Eight)

Tests of Hypotheses Based on a Single Sample (Devore Chapter Eight) Tests of Hypotheses Based o a Sigle Sample Devore Chapter Eight MATH-252-01: Probability ad Statistics II Sprig 2018 Cotets 1 Hypothesis Tests illustrated with z-tests 1 1.1 Overview of Hypothesis Testig..........

More information

Extreme Value Charts and Analysis of Means (ANOM) Based on the Log Logistic Distribution

Extreme Value Charts and Analysis of Means (ANOM) Based on the Log Logistic Distribution Joural of Moder Applied Statistical Methods Volume 11 Issue Article 0 11-1-01 Extreme Value Charts ad Aalysis of Meas (ANOM) Based o the Log Logistic istributio B. Sriivasa Rao R.V.R & J.C. College of

More information