The performance of univariate goodness-of-fit tests for normality based on the empirical characteristic function in large samples

Size: px
Start display at page:

Download "The performance of univariate goodness-of-fit tests for normality based on the empirical characteristic function in large samples"

Transcription

1 The performace of uivariate goodess-of-fit tests for ormality based o the empirical characteristic fuctio i large samples By J. M. VAN ZYL Departmet of Mathematical Statistics ad Actuarial Sciece, Uiversity of the Free State, Bloemfotei, South Africa wwjvz@ufs.ac.za SUMMARY A empirical power compariso is made betwee two tests based o the empirical characteristic fuctio ad some of the best performig tests for ormality. A simple ormality test based o the empirical characteristic fuctio calculated i a sigle poit is show to outperform the more complicated Epps-Pulley test ad the frequetist tests icluded i the study i large samples. Key words: Normality test; Empirical characteristic fuctio; Cumulat; Goodess-offit 1. INTRODUCTION 1

2 Several goodess-of-fit tests based o the empirical characteristic fuctio (ecf) are available. Feuerverger ad Mureika (1977) developed a test for symmetry ad this was exteded by Epps ad Pulley (1983) to test uivariate ormality. Heze ad Barighaus (1988) exteded the Epps-Pulley test to test multivariate ormality ad this is called the BHEP test. A review paper with commets of procedures based o the ecf is the paper by Meitais (016) ad also the book by Ushakov (1999). The Epps-Pulley approach is still the mai approach for testig ormality of tests based o the ecf ad is based o a itegral over the weighted squared distaces betwee the ecf ad the expected characteristic fuctio. A weakess of the test is that the asymptotic ull-distributio is ot very accurate ad otherwise itractable ad complicated i fiite samples (Taufer (016), Swaepoel ad Alisso (016)). I this work a test is proposed ad asymptotic distributioal results derived usig the work by Murota ad Takeuchi (1981). They derived a locatio ad scale ivariat test usig studetized observatios ad showed that the use of a sigle value whe calculatig the ecf is sufficiet to get good results with respect to power whe testig hypotheses. Csörgő (1986) derived a multivariate extesio of the asymptotic results derived by Murota ad Takeuchi (1981). A simulatio study is coducted to compare the power of the proposed test agaist the Epps-Pulley test ad five of the most recogized goodess-of-fit tests for ormality. The proposed test with asymptotic properties similar to those of the test of Murota ad Takeuchi (1981) performs reasoably i small sample, but excellet i large samples with respect to power. The test statistic is a simple ormal test which will perform better

3 as the sample icreases ad it was foud to domiate the much more complicated Epps- Pulley test. Murota ad Takeuchi (1981) compared their test agaist a test based o the sample kurtosis ad coducted a small simulatio study. Various overview simulatio studies were coducted to ivestigate the performace of tests for ormality. Oe of the most cited papers is the work by Yap ad Sim (011), but they did ot iclude a goodess-offit test based o the empirical characteristic fuctio. A paper which icluded a very large selectio of tests for ormality is the work by Romao et al. (013), but the test of Murota ad Takeuchi (1981) was ot icluded i this study. The tests icluded are the Jarque Bera, Shapiro-Wilk, Lilliefors, Aderso-Darlig ad D Agostio ad Pearso tests. The focus will be o uimodal symmetric distributios ad large sample sizes, that is sample sizes larger tha fifty. Murota ad Takeuchi (1981) proved that the square of the modulus of the empirical characteristic fuctio coverges weakly to a complex Gaussia process where the observatios are stadardized usig affie ivariat estimators of locatio ad scale ad they derived a expressio for the asymptotic variace. Let X,..., 1 X be a i.i.d. sample itx of size, from a distributio F. The characteristic fuctio is E( e ) = φ( t) ad it is estimated by the ecf ˆ 1 itx ( ) j φf t = e, (1) j= 1 3

4 The studetized sample is Z,..., 1 Z, where Z ( ˆ ) / ˆ j = X j µ σ, j = 1,...,, with µ = ad σ = S. Deote the ecf, based o the studetized data by ˆ X ˆ ˆ itz ( ) (1/ ) j φ t = e. The statistic proposed to test ormality is S j= 1 ν (1) = log( ˆ φ (1)/exp( 1/ ) ), () S where ( ν (1)) N(0,0.0431) asymptotically. Absolute value deotes the modulus of a complex umber if the argumet is complex. The expressio is ˆ ˆ I = φs ( t) φ0( t) w( t) dt, with φ ( t) 0 deotig the ecf of a stadard ormal, w( t) a weight fuctio which is of the same form as a ormal desity with mea zero ad variace the sample estimate of the variace. Of the may variatios usig the ecf to test goodess-of-fit tests, this expressio attracted the most iterest ad is still used Meitais (016). Epps ad Pulley (1983) used this expressio ad derived a test for ormality usig a weight fuctio which has the form of a stadard ormal desity. They gave a exact expressio for the characteristic fuctio of the ormal distributio. Heze (1990) derived a large sample approximatio for this test ad used Pearso curves to approximate the distributio. It is show i the simulatio study that the proposed test with similar properties as that of Murota ad Takeuchi (1981) ad calculated i a sigle 4

5 poit without usig a weight outperforms the Epps-Pulley test i the cases cosidered with respect to power.. MOTIVATION AND ASYMPTOTIC VARIANCE OF THE TEST STATISTIC A motivatio will be give i terms of the cumulat geeratig fuctio. The ormal distributio has the uique property that the cumulat geeratig fuctio caot be a fiite-order polyomial of degree larger tha two, ad the ormal distributio is the oly distributio for which all cumulats of order larger tha 3 are zero (Cramér (1946), Lukacs, (197)). The motivatio for the test will be show by usig the momet geeratig fuctio, but experimetatio showed that the use of the characteristic fuctio rather tha the momet geeratig fuctio gives much better results whe used to test for ormality. Cosider a radom variable X with distributio F, mea µ ad variace σ. The cumulat geeratig fuctio Κ F ( t ) of F ca be writte as r Κ F ( t) = κ rt / r!, where r= 1 κ r is the r-th cumulat. The first two cumulats are κ µ κ σ 1 = E( X ) =, = Var( X ) =. Sice Κ ( t ) is the logarithm of the momet geeratig fuctio, the momet geeratig F fuctio ca be writte as F M ( t) E( e ) e Κ t F tx ( ) = =. It follows that Κ ( t ) = tx F log( E ( e )) 5

6 = r= 1 κ t r r / r! =( 1 ) r κ t + κ t / + ( κrt / r!) = ( ) r= 3 r µ t + σ t / + ( κ t / r!). r= 3 r Let F N deote a ormal distributio with a mea µ ad variace M σ. N ( ) t deotes the momet geeratig fuctio of the ormal distributio with cumulat geeratig fuctio 1 Κ ( t N ) = µ t + σ t. The logarithm of the ratio of the momet geeratig fuctios of F ad F N is give by log( M ( t)/ M ( t)) = log(exp( Κ ( t) Κ ( t))) F N F N = 1 r [( µ t + σ t ) + κ t / r!] ( µ t + 1 σ t ) r r = 3 r = Κ ( t) + κ t / r! Κ ( t)) N r N r= 3 r = κrt / r!. (3) r= 3 If F is a ormal distributio, the sum give i (1) is zero. Replacig Κ ( t) Κ ( t) by Κ ( it) Κ ( it) it follows that F N F N log( log( φ ( t)) log( φ ( t)) ) = log( Κ ( it) Κ ( it) ) F N F N 4+ r = κ4+ rt /(4 + r)!, r= 0 6

7 which would be equal to zero whe the distributio F is a ormal distributio ad this expressio ca be used to test for ormality. Murota ad Takeuchi (1981) used the fact that the square root of the log of the modulus of the characteristic fuctio of a ormal distributio is liear i terms of t, i other words ( log( φ ( t N ) )) 1/ is a liear fuctio of t. A asymptotic variace of ν ( t) = log( ˆ φ ( t) / exp( t / ) ) ca be foud by usig the NS delta method ad the results of Murota ad Takeuchi (1981). Let ˆ φ ( t ) deote the ecf calculated i the poit t usig studetized ormally distributed observatios. They showed that the process defied by NS Z ɶ t ˆ t t, (4) ( ) = ( φns ( ) exp( )) coverges weakly to a zero mea Gaussia process ad variace 4 E( Zɶ ( t)) = 4exp( t )(cosh( t ) 1 t / ). (5) Note that ˆ φ t / NS ( t ) e = ad by applyig the delta method it follows that Var( ˆ φ ( t) ) Var( ˆ φ ( t) )( ˆ φ ( t) ), NS NS NS thus Var( ˆ φ ( t) ) Var( ˆ φ ( t) ) /( ˆ φ ( t) ). NS NS NS By applyig the delta method agai it follows that 7

8 Var ν t Var ˆ φ t e 1/ ( ( )) = (log( NS ( ) / )) (1/ ˆ φ ( t) ) Var( ˆ φ ( t) ) NS NS = Var( ˆ φ ( t) ) / 4( ˆ φ ( t) ) NS 4 NS 4 = (cosh( t ) 1 t / ) /. (6) The statistic ν ( t ) coverges weakly to a Gaussia distributio with mea zero ad 4 variace Var( ν ( t)) = (cosh( t ) 1 t / ) /, where ( ν (1)) N(0,0.0431) asymptotically. Var( ν (1)) = /, t = 1. (7) Reject ormality if ν (1) / ( / = ν (1) > z 1 α /. (8) I the followig figure the average of the log the modulus calculated i the poit, t = 1, usig stadard ormally distributed samples, for various samples sizes is show. The solid lie is where studetized observatios were used ad the dashed lie where the ecf is calculated usig the origial sample. It ca be see that there is a large bias i small samples ad the studetized ecf has less variatio. 8

9 average log of the modulus Fig. 1. Plot of the average log of the modulus of the ecf for various sample sizes calculated usig m = 5000 calculated usig samples form a stadard ormal distributio. The solid lie is where studetized observatios are used ad the dashed lie usig the origial sample. Calculated i the poit t=1, ad the expected value is I the followig histogram 5000 simulated values of ν (1) /( / = ν (1) are show, where the ν (1)' s are calculated usig simulated samples of size = 1000 from a stadard ormal distributio. 9

10 frequecy ν Fig.. Histogram of 5000 m = simulated values of v (1) / ( / ), with = Calculated usig ormally distributed samples, data stadardized usig estimated parameters. I Figure 3 the variace of ν (1) is estimated for various sample sizes, based o 1000 estimated values of ν (1) for each sample size cosidered. The estimated variaces is plotted agaist the asymptotic variace Var( v ) = /. 10

11 1.8 x asymptotic ad estimated variace sample size Fig. 3. Estimated variace of v ad asymptotic variace for various values of. Dashed lie the estimated variace. Estimated variace calculated usig 1000 ormally distributed samples, data stadardized usig estimated parameters. There are a few variatios of the parameters used whe choosig the weight fuctio for the Epps-Pulley test, but a versio suggested by Epps ad Pulley to be used as a omibus test is whe choosig the weigh fuctio a ormal desity with mea zero ad variace the sample estimate of the variace, that is t /( ˆ σ ) = πσˆ e. w( t) (1/ ) I = ˆ t ˆ t w t dt φs ( ) φ0( ) ( ) = ˆ ˆ X j X k σ X j X σ j= 1 k= 1 j= 1 exp{ ( ) / } exp{ ( ) / } + 1/ 3. (9) 11

12 Heze (1990) derived a large sample approximatio for T = I ad used Pearso curves to approximate the distributio. A simulatio study was coducted to calculate the 1 α percetiles of T, α = 0.05, based o m = simulated values of T. It was foud that eve with this large sample there is still variatio i the 4 th decimal ad the first 3 decimals where used i the simulatio study to estimate the power of the test. Table 1. Simulated percetiles to test for ormality usig the Epps-Pulley test at the 5% level. Calculated from m = simulated samples of size.95 percetile mea variace Table 1 Simulated percetiles to test for ormality usig the Epps-Pulley test at the 5% level. Calculated from m = simulated samples of size. Heze (1990) used the value 1 istead of σ ad foud for example for = 100, that the.95 th percetile is which is approximately equal to foud i this study. ˆ They calculated the asymptotic expected value of T as ad variace

13 3. SIMULATION STUDY The paper of Yap ad Sim (011) is used as a guidelie to decide which tests to iclude. The proposed test will be deoted by ECFT ad EP deotes the Epps-Pulley test i the tables. The power of the test will be compared agaist several tests for ormality: The Lilliefors test (LL), Lilliefors (1967) which is a slight modificatio of the Kolmogorov-Smirov test for where parameters are estimated. The Jarque-Bera test (JB), Jarque ad Bera (1987), where the skewess ad kurtosis is combied to form a test statistics. The Shapiro-Wilks test (SW), Shapiro ad Wilk (1965). This test makes use of properties of order statistics ad were later developed to be used for large samples too by Roysto (199). The Aderso-Darlig test (AD), Aderso ad Darlig. The D Agostio ad Pearso test (DP), D Agostio ad Pearso (1973). This statistic combies the skewess ad kurtosis to check for deviatios from ormality. Samples are geerated from a few symmetric uimodal symmetric distributios with sizes = 50,100, 50,500, 750,1000. The proportio rejectios are reported based o m=5000 repetitios. The test are coducted at the 5% level ad for the ecf, the ormal approximatio is used. Sice the sample sizes are large, o-ormality with respect to multi-modal ad skewed distributios ca easily be picked up by usig graphical methods. 13

14 The followig symmetric distributios are cosidered, uiform o the iterval [0,1], the logistic distributio with mea zero the stadard t-distributio ad the Laplace distributio with mea zero ad scale parameter oe. The stadard t -distributio with 4, 10 ad 15 degrees of freedom. Skewed distributios ad multimodal distributios would ot be ivestigated, sice i large samples such samples ca be already excluded with certaity as beig ot from a ormal distributio by lookig at the histograms. All the tests performed for the ecf test were coducted usig the ormal approximatio, but percetiles ca also easily be simulated. Simulated estimates of the Type I error for = 30,50,100, 50,500, 750,1000, are give i Table 1 based o m = 5000 simulated samples. The simulated samples are stadard ormally distributed ad studetized to calculate the Type I error. Table Simulated percetiles to test for ormality at the 5% level. Calculated from m = 5000 simulated samples of size each i the poit t=1. Type I error ECFT EP LL JB SW AD DP

15 I Table the rejectio rates, whe testig at the 5% level ad symmetric distributios, are show for various sample sizes based o samples each time. I table 1 the t-distributio where ot all momets exist is cosidered. The JB, DP ad ECFT tests performs best, ad i large samples the ECFT test performs the best. Table 3. Simulated power of ormality tests. Rejectio proportios whe testig for ormality at the 5% level. ECFT EP LL JB SW AD DP t(4) t(10) t(15)

16 Fig. 4. Plot of the three best performig tests with respect to power, testig for ormality, data t- distributed with 10 degrees of freedom. Solid lie, ecf test, dashed lie JB test ad dash-dot the DP test power - ECFT, JB, DP - t(10)

17 Table 4. Simulated power of ormality tests. Rejectio proportios whe testig for ormality at the 5% level. ECFT EP LL JB SW AD DP U(0,1) Laplace Logistic It ca be see the ECFT outperforms the other tests with respect to power i large samples, especially whe testig data from a logistic distributio. Samples were simulated from a mixture of two ormal distributios, with a proportio α from a stadard ormal ad a proportio 1 α from a ormal with variace σ. This ca also be cosidered as a cotamiated distributio. The results are show i Table 4. 17

18 The proposed test yielded good results. Table 5. Simulated power of ormality tests. Rejectio proportios whe testig for ormality at the 5% level. Mixture of two ormal distributios (cotamiated data). ( σ, α ) ECFT EP LL JB SW AD DP (.0,0.) (0.5,0.) (.0,0.5)

19 1 0.9 power - ECFT,JB,AD - mixture ormal Fig. 5. Plot of the three best performig tests with respect to power, testig for ormality, data mixture of ormal distributios with two compoets. Mixture.5N(0,1)+0.5N(0,4). Solid lie, ecf test, dashed lie JB test ad dash-dot the AD test. 4. CONCLUSIONS The proposed test performs better with respect to power i large samples tha the other tests for ormality for the distributios cosidered i the simulatio study. I small samples of say less tha = 50, it was foud that the test of D Agostio ad Pearso (1973) was ofte either the best performig or close to the best performig test. 19

20 I practice oe would ot test data from a skewed distributio for ormality i large samples. The simple ormal test approximatio will perform better, the larger the sample is. The asymptotic ormality ad variace properties, which is of a very simple form, ca be used i large samples. This test ca be recommeded as probably the test of choice i terms of power ad easy of applicatio i large samples ad shows that the empirical characteristic fuctio has the potetial to outperform the usual frequetist methods. Refereces ANDERSON, T.W., DARLING, D.W. (1954). A test of goodess of fit. J. Amer. Statist. Assoc., 49 (68), BARINGHAUS, L., HENZE, N. (1988). A cosistet test for multivariate ormality based o the empirical characteristic fuctio. Metrika 35 (1): CRAMÉR, H. (1946). Mathematical Methods of Statistics. NJ: Priceto Uiversity Press, Priceto, NJ. CSÖRGŐ, S. (1986). Testig for ormality i arbitrary dimesio. Aals of Statistics, 14 (), D AGOSTINO, R., PEAESON, R. (1973). Testig for departures from ormality. Biometrika, 60, EPPS, T.W., PULLEY, L.B. (1983). A test for ormality based o the empirical characteristic fuctio. Biometrika, 70, FEUERVERGER, A., MUREIKA, R.A. (1977). The Empirical Characteristic Fuctio ad Its Applicatios. The Aals of Statistics, 5 (1), HENZE, N. (1990), A Approximatio to the Limit Distributio of the Epps-Pulley Test Statistic for Normality. Metrika, 37,

21 JARQUE, C.M., BERA, A.K. (1987). A test for ormality of observatios ad regressio residuals. It. Stat. Rev., 55 (), LILLIEFORS, H.W. (1967). O the Kolmogorov-Smirov test for ormality with mea ad variace ukow. J. America. Statist. Assoc., 6, LUKACS, E. (197). A Survey of the Theory of Characteristic Fuctios. Advaces i Applied Probability, 4 (1), MEINTANIS, S. G. (016). A review of testig procedures based o the empirical characteristic fuctio. South Africa Statist. J., 50, MUROTA, K., TAKEUCHI, K. (1981). The studetized empirical characteristic fuctio ad its applicatio to test for the shape of distributio. Biometrika; 68, ROMÃO, X, DELGADO, R, COSTA, A. (010). A empirical power compariso of uivariate goodess-of-fit tests for ormality. J. of Statistical Computatio ad Simulatio.; 80:5, ROYSTON, J.P. (199). Approximatig the Shapiro-Wilk W-test for o-ormality. Stat. Comput.,, SHAPIRO, S.S., WILK, M.B. (1965). A aalysis of variace test for ormality (complete samples). Biometrika 5, SWANEPOEL, J.W.H., ALLISON, J. (016). Commets: A review of testig procedures based o the empirical characteristic fuctio. South Africa Statist. J., 50, TAUFER, E. (016). Commets: A review of testig procedures based o the empirical characteristic fuctio. South Africa Statist. J., 50, USHAKOV, N.G. (1999). Selected Topics i Characteristic Fuctios. VSP, Utrecht. YAP, B.W., SIM, C.H. (011). Comparisos of various types of ormality tests. J. of Statistical Computatio ad Simulatio, 81 (1),

A goodness-of-fit test based on the empirical characteristic function and a comparison of tests for normality

A goodness-of-fit test based on the empirical characteristic function and a comparison of tests for normality A goodess-of-fit test based o the empirical characteristic fuctio ad a compariso of tests for ormality J. Marti va Zyl Departmet of Mathematical Statistics ad Actuarial Sciece, Uiversity of the Free State,

More information

Goodness-Of-Fit For The Generalized Exponential Distribution. Abstract

Goodness-Of-Fit For The Generalized Exponential Distribution. Abstract Goodess-Of-Fit For The Geeralized Expoetial Distributio By Amal S. Hassa stitute of Statistical Studies & Research Cairo Uiversity Abstract Recetly a ew distributio called geeralized expoetial or expoetiated

More information

Goodness-Of-Fit For The Generalized Exponential Distribution. Abstract

Goodness-Of-Fit For The Generalized Exponential Distribution. Abstract Goodess-Of-Fit For The Geeralized Expoetial Distributio By Amal S. Hassa stitute of Statistical Studies & Research Cairo Uiversity Abstract Recetly a ew distributio called geeralized expoetial or expoetiated

More information

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n.

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n. Jauary 1, 2019 Resamplig Methods Motivatio We have so may estimators with the property θ θ d N 0, σ 2 We ca also write θ a N θ, σ 2 /, where a meas approximately distributed as Oce we have a cosistet estimator

More information

Applying least absolute deviation regression to regressiontype estimation of the index of a stable distribution using the characteristic function

Applying least absolute deviation regression to regressiontype estimation of the index of a stable distribution using the characteristic function Applyig least absolute deviatio regressio to regressiotype estimatio of the idex of a stable distributio usig the characteristic fuctio J. MARTIN VAN ZYL Departmet of Mathematical Statistics ad Actuarial

More information

Power Comparison of Some Goodness-of-fit Tests

Power Comparison of Some Goodness-of-fit Tests Florida Iteratioal Uiversity FIU Digital Commos FIU Electroic Theses ad Dissertatios Uiversity Graduate School 7-6-2016 Power Compariso of Some Goodess-of-fit Tests Tiayi Liu tliu019@fiu.edu DOI: 10.25148/etd.FIDC000750

More information

Parameter, Statistic and Random Samples

Parameter, Statistic and Random Samples Parameter, Statistic ad Radom Samples A parameter is a umber that describes the populatio. It is a fixed umber, but i practice we do ot kow its value. A statistic is a fuctio of the sample data, i.e.,

More information

Approximate Confidence Interval for the Reciprocal of a Normal Mean with a Known Coefficient of Variation

Approximate Confidence Interval for the Reciprocal of a Normal Mean with a Known Coefficient of Variation Metodološki zvezki, Vol. 13, No., 016, 117-130 Approximate Cofidece Iterval for the Reciprocal of a Normal Mea with a Kow Coefficiet of Variatio Wararit Paichkitkosolkul 1 Abstract A approximate cofidece

More information

NCSS Statistical Software. Tolerance Intervals

NCSS Statistical Software. Tolerance Intervals Chapter 585 Itroductio This procedure calculates oe-, ad two-, sided tolerace itervals based o either a distributio-free (oparametric) method or a method based o a ormality assumptio (parametric). A two-sided

More information

Access to the published version may require journal subscription. Published with permission from: Elsevier.

Access to the published version may require journal subscription. Published with permission from: Elsevier. This is a author produced versio of a paper published i Statistics ad Probability Letters. This paper has bee peer-reviewed, it does ot iclude the joural pagiatio. Citatio for the published paper: Forkma,

More information

1 of 7 7/16/2009 6:06 AM Virtual Laboratories > 6. Radom Samples > 1 2 3 4 5 6 7 6. Order Statistics Defiitios Suppose agai that we have a basic radom experimet, ad that X is a real-valued radom variable

More information

Confidence Interval for Standard Deviation of Normal Distribution with Known Coefficients of Variation

Confidence Interval for Standard Deviation of Normal Distribution with Known Coefficients of Variation Cofidece Iterval for tadard Deviatio of Normal Distributio with Kow Coefficiets of Variatio uparat Niwitpog Departmet of Applied tatistics, Faculty of Applied ciece Kig Mogkut s Uiversity of Techology

More information

Frequentist Inference

Frequentist Inference Frequetist Iferece The topics of the ext three sectios are useful applicatios of the Cetral Limit Theorem. Without kowig aythig about the uderlyig distributio of a sequece of radom variables {X i }, for

More information

32 estimating the cumulative distribution function

32 estimating the cumulative distribution function 32 estimatig the cumulative distributio fuctio 4.6 types of cofidece itervals/bads Let F be a class of distributio fuctios F ad let θ be some quatity of iterest, such as the mea of F or the whole fuctio

More information

A new distribution-free quantile estimator

A new distribution-free quantile estimator Biometrika (1982), 69, 3, pp. 635-40 Prited i Great Britai 635 A ew distributio-free quatile estimator BY FRANK E. HARRELL Cliical Biostatistics, Duke Uiversity Medical Ceter, Durham, North Carolia, U.S.A.

More information

Math 152. Rumbos Fall Solutions to Review Problems for Exam #2. Number of Heads Frequency

Math 152. Rumbos Fall Solutions to Review Problems for Exam #2. Number of Heads Frequency Math 152. Rumbos Fall 2009 1 Solutios to Review Problems for Exam #2 1. I the book Experimetatio ad Measuremet, by W. J. Youde ad published by the by the Natioal Sciece Teachers Associatio i 1962, the

More information

A RANK STATISTIC FOR NON-PARAMETRIC K-SAMPLE AND CHANGE POINT PROBLEMS

A RANK STATISTIC FOR NON-PARAMETRIC K-SAMPLE AND CHANGE POINT PROBLEMS J. Japa Statist. Soc. Vol. 41 No. 1 2011 67 73 A RANK STATISTIC FOR NON-PARAMETRIC K-SAMPLE AND CHANGE POINT PROBLEMS Yoichi Nishiyama* We cosider k-sample ad chage poit problems for idepedet data i a

More information

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4 MATH 30: Probability ad Statistics 9. Estimatio ad Testig of Parameters Estimatio ad Testig of Parameters We have bee dealig situatios i which we have full kowledge of the distributio of a radom variable.

More information

Expectation and Variance of a random variable

Expectation and Variance of a random variable Chapter 11 Expectatio ad Variace of a radom variable The aim of this lecture is to defie ad itroduce mathematical Expectatio ad variace of a fuctio of discrete & cotiuous radom variables ad the distributio

More information

Lecture 19: Convergence

Lecture 19: Convergence Lecture 19: Covergece Asymptotic approach I statistical aalysis or iferece, a key to the success of fidig a good procedure is beig able to fid some momets ad/or distributios of various statistics. I may

More information

A CONSISTENT TEST FOR EXPONENTIALITY BASED ON THE EMPIRICAL MOMENT PROCESS

A CONSISTENT TEST FOR EXPONENTIALITY BASED ON THE EMPIRICAL MOMENT PROCESS STATISTICA, ao LXIX,., 009 A CONSISTENT TEST FOR EXPONENTIALITY BASED ON THE EMPIRICAL MOMENT PROCESS Simos G. Meitais. INTRODUCTION The expoetial is oe of the most frequetly used distributios i survival

More information

arxiv: v3 [math.st] 2 Aug 2011

arxiv: v3 [math.st] 2 Aug 2011 O two simple tests for ormality with high power Mås Thuli First versio: August 3, 00. Updated: May 9, 08 arxiv:008.539v3 [math.st] Aug 0 Abstract The test statistics of two powerful tests for ormality

More information

Confidence intervals summary Conservative and approximate confidence intervals for a binomial p Examples. MATH1005 Statistics. Lecture 24. M.

Confidence intervals summary Conservative and approximate confidence intervals for a binomial p Examples. MATH1005 Statistics. Lecture 24. M. MATH1005 Statistics Lecture 24 M. Stewart School of Mathematics ad Statistics Uiversity of Sydey Outlie Cofidece itervals summary Coservative ad approximate cofidece itervals for a biomial p The aïve iterval

More information

Properties and Hypothesis Testing

Properties and Hypothesis Testing Chapter 3 Properties ad Hypothesis Testig 3.1 Types of data The regressio techiques developed i previous chapters ca be applied to three differet kids of data. 1. Cross-sectioal data. 2. Time series data.

More information

1 Inferential Methods for Correlation and Regression Analysis

1 Inferential Methods for Correlation and Regression Analysis 1 Iferetial Methods for Correlatio ad Regressio Aalysis I the chapter o Correlatio ad Regressio Aalysis tools for describig bivariate cotiuous data were itroduced. The sample Pearso Correlatio Coefficiet

More information

17. Joint distributions of extreme order statistics Lehmann 5.1; Ferguson 15

17. Joint distributions of extreme order statistics Lehmann 5.1; Ferguson 15 17. Joit distributios of extreme order statistics Lehma 5.1; Ferguso 15 I Example 10., we derived the asymptotic distributio of the maximum from a radom sample from a uiform distributio. We did this usig

More information

Since X n /n P p, we know that X n (n. Xn (n X n ) Using the asymptotic result above to obtain an approximation for fixed n, we obtain

Since X n /n P p, we know that X n (n. Xn (n X n ) Using the asymptotic result above to obtain an approximation for fixed n, we obtain Assigmet 9 Exercise 5.5 Let X biomial, p, where p 0, 1 is ukow. Obtai cofidece itervals for p i two differet ways: a Sice X / p d N0, p1 p], the variace of the limitig distributio depeds oly o p. Use the

More information

Parameter, Statistic and Random Samples

Parameter, Statistic and Random Samples Parameter, Statistic ad Radom Samples A parameter is a umber that describes the populatio. It is a fixed umber, but i practice we do ot kow its value. A statistic is a fuctio of the sample data, i.e.,

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2016 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

Probability and statistics: basic terms

Probability and statistics: basic terms Probability ad statistics: basic terms M. Veeraraghava August 203 A radom variable is a rule that assigs a umerical value to each possible outcome of a experimet. Outcomes of a experimet form the sample

More information

Lecture 2: Monte Carlo Simulation

Lecture 2: Monte Carlo Simulation STAT/Q SCI 43: Itroductio to Resamplig ethods Sprig 27 Istructor: Ye-Chi Che Lecture 2: ote Carlo Simulatio 2 ote Carlo Itegratio Assume we wat to evaluate the followig itegratio: e x3 dx What ca we do?

More information

Exam II Review. CEE 3710 November 15, /16/2017. EXAM II Friday, November 17, in class. Open book and open notes.

Exam II Review. CEE 3710 November 15, /16/2017. EXAM II Friday, November 17, in class. Open book and open notes. Exam II Review CEE 3710 November 15, 017 EXAM II Friday, November 17, i class. Ope book ad ope otes. Focus o material covered i Homeworks #5 #8, Note Packets #10 19 1 Exam II Topics **Will emphasize material

More information

ST 305: Exam 3 ( ) = P(A)P(B A) ( ) = P(A) + P(B) ( ) = 1 P( A) ( ) = P(A) P(B) σ X 2 = σ a+bx. σ ˆp. σ X +Y. σ X Y. σ X. σ Y. σ n.

ST 305: Exam 3 ( ) = P(A)P(B A) ( ) = P(A) + P(B) ( ) = 1 P( A) ( ) = P(A) P(B) σ X 2 = σ a+bx. σ ˆp. σ X +Y. σ X Y. σ X. σ Y. σ n. ST 305: Exam 3 By hadig i this completed exam, I state that I have either give or received assistace from aother perso durig the exam period. I have used o resources other tha the exam itself ad the basic

More information

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample. Statistical Iferece (Chapter 10) Statistical iferece = lear about a populatio based o the iformatio provided by a sample. Populatio: The set of all values of a radom variable X of iterest. Characterized

More information

The random version of Dvoretzky s theorem in l n

The random version of Dvoretzky s theorem in l n The radom versio of Dvoretzky s theorem i l Gideo Schechtma Abstract We show that with high probability a sectio of the l ball of dimesio k cε log c > 0 a uiversal costat) is ε close to a multiple of the

More information

Comparing Two Populations. Topic 15 - Two Sample Inference I. Comparing Two Means. Comparing Two Pop Means. Background Reading

Comparing Two Populations. Topic 15 - Two Sample Inference I. Comparing Two Means. Comparing Two Pop Means. Background Reading Topic 15 - Two Sample Iferece I STAT 511 Professor Bruce Craig Comparig Two Populatios Research ofte ivolves the compariso of two or more samples from differet populatios Graphical summaries provide visual

More information

The Sampling Distribution of the Maximum. Likelihood Estimators for the Parameters of. Beta-Binomial Distribution

The Sampling Distribution of the Maximum. Likelihood Estimators for the Parameters of. Beta-Binomial Distribution Iteratioal Mathematical Forum, Vol. 8, 2013, o. 26, 1263-1277 HIKARI Ltd, www.m-hikari.com http://d.doi.org/10.12988/imf.2013.3475 The Samplig Distributio of the Maimum Likelihood Estimators for the Parameters

More information

Department of Civil Engineering-I.I.T. Delhi CEL 899: Environmental Risk Assessment HW5 Solution

Department of Civil Engineering-I.I.T. Delhi CEL 899: Environmental Risk Assessment HW5 Solution Departmet of Civil Egieerig-I.I.T. Delhi CEL 899: Evirometal Risk Assessmet HW5 Solutio Note: Assume missig data (if ay) ad metio the same. Q. Suppose X has a ormal distributio defied as N (mea=5, variace=

More information

Direction: This test is worth 150 points. You are required to complete this test within 55 minutes.

Direction: This test is worth 150 points. You are required to complete this test within 55 minutes. Term Test 3 (Part A) November 1, 004 Name Math 6 Studet Number Directio: This test is worth 10 poits. You are required to complete this test withi miutes. I order to receive full credit, aswer each problem

More information

A Note on Box-Cox Quantile Regression Estimation of the Parameters of the Generalized Pareto Distribution

A Note on Box-Cox Quantile Regression Estimation of the Parameters of the Generalized Pareto Distribution A Note o Box-Cox Quatile Regressio Estimatio of the Parameters of the Geeralized Pareto Distributio JM va Zyl Abstract: Makig use of the quatile equatio, Box-Cox regressio ad Laplace distributed disturbaces,

More information

Let us give one more example of MLE. Example 3. The uniform distribution U[0, θ] on the interval [0, θ] has p.d.f.

Let us give one more example of MLE. Example 3. The uniform distribution U[0, θ] on the interval [0, θ] has p.d.f. Lecture 5 Let us give oe more example of MLE. Example 3. The uiform distributio U[0, ] o the iterval [0, ] has p.d.f. { 1 f(x =, 0 x, 0, otherwise The likelihood fuctio ϕ( = f(x i = 1 I(X 1,..., X [0,

More information

Estimation for Complete Data

Estimation for Complete Data Estimatio for Complete Data complete data: there is o loss of iformatio durig study. complete idividual complete data= grouped data A complete idividual data is the oe i which the complete iformatio of

More information

STATISTICAL PROPERTIES OF LEAST SQUARES ESTIMATORS. Comments:

STATISTICAL PROPERTIES OF LEAST SQUARES ESTIMATORS. Comments: Recall: STATISTICAL PROPERTIES OF LEAST SQUARES ESTIMATORS Commets:. So far we have estimates of the parameters! 0 ad!, but have o idea how good these estimates are. Assumptio: E(Y x)! 0 +! x (liear coditioal

More information

Large Sample Theory. Convergence. Central Limit Theorems Asymptotic Distribution Delta Method. Convergence in Probability Convergence in Distribution

Large Sample Theory. Convergence. Central Limit Theorems Asymptotic Distribution Delta Method. Convergence in Probability Convergence in Distribution Large Sample Theory Covergece Covergece i Probability Covergece i Distributio Cetral Limit Theorems Asymptotic Distributio Delta Method Covergece i Probability A sequece of radom scalars {z } = (z 1,z,

More information

STAT 350 Handout 19 Sampling Distribution, Central Limit Theorem (6.6)

STAT 350 Handout 19 Sampling Distribution, Central Limit Theorem (6.6) STAT 350 Hadout 9 Samplig Distributio, Cetral Limit Theorem (6.6) A radom sample is a sequece of radom variables X, X 2,, X that are idepedet ad idetically distributed. o This property is ofte abbreviated

More information

Lecture 3. Properties of Summary Statistics: Sampling Distribution

Lecture 3. Properties of Summary Statistics: Sampling Distribution Lecture 3 Properties of Summary Statistics: Samplig Distributio Mai Theme How ca we use math to justify that our umerical summaries from the sample are good summaries of the populatio? Lecture Summary

More information

Common Large/Small Sample Tests 1/55

Common Large/Small Sample Tests 1/55 Commo Large/Small Sample Tests 1/55 Test of Hypothesis for the Mea (σ Kow) Covert sample result ( x) to a z value Hypothesis Tests for µ Cosider the test H :μ = μ H 1 :μ > μ σ Kow (Assume the populatio

More information

ENGI 4421 Confidence Intervals (Two Samples) Page 12-01

ENGI 4421 Confidence Intervals (Two Samples) Page 12-01 ENGI 44 Cofidece Itervals (Two Samples) Page -0 Two Sample Cofidece Iterval for a Differece i Populatio Meas [Navidi sectios 5.4-5.7; Devore chapter 9] From the cetral limit theorem, we kow that, for sufficietly

More information

Econ 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chi-square Distribution, Student s t distribution 1.

Econ 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chi-square Distribution, Student s t distribution 1. Eco 325/327 Notes o Sample Mea, Sample Proportio, Cetral Limit Theorem, Chi-square Distributio, Studet s t distributio 1 Sample Mea By Hiro Kasahara We cosider a radom sample from a populatio. Defiitio

More information

Stat 319 Theory of Statistics (2) Exercises

Stat 319 Theory of Statistics (2) Exercises Kig Saud Uiversity College of Sciece Statistics ad Operatios Research Departmet Stat 39 Theory of Statistics () Exercises Refereces:. Itroductio to Mathematical Statistics, Sixth Editio, by R. Hogg, J.

More information

A NEW METHOD FOR CONSTRUCTING APPROXIMATE CONFIDENCE INTERVALS FOR M-ESTU1ATES. Dennis D. Boos

A NEW METHOD FOR CONSTRUCTING APPROXIMATE CONFIDENCE INTERVALS FOR M-ESTU1ATES. Dennis D. Boos .- A NEW METHOD FOR CONSTRUCTING APPROXIMATE CONFIDENCE INTERVALS FOR M-ESTU1ATES by Deis D. Boos Departmet of Statistics North Carolia State Uiversity Istitute of Statistics Mimeo Series #1198 September,

More information

Monte Carlo Integration

Monte Carlo Integration Mote Carlo Itegratio I these otes we first review basic umerical itegratio methods (usig Riema approximatio ad the trapezoidal rule) ad their limitatios for evaluatig multidimesioal itegrals. Next we itroduce

More information

MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND.

MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND. XI-1 (1074) MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND. R. E. D. WOOLSEY AND H. S. SWANSON XI-2 (1075) STATISTICAL DECISION MAKING Advaced

More information

EECS564 Estimation, Filtering, and Detection Hwk 2 Solns. Winter p θ (z) = (2θz + 1 θ), 0 z 1

EECS564 Estimation, Filtering, and Detection Hwk 2 Solns. Winter p θ (z) = (2θz + 1 θ), 0 z 1 EECS564 Estimatio, Filterig, ad Detectio Hwk 2 Sols. Witer 25 4. Let Z be a sigle observatio havig desity fuctio where. p (z) = (2z + ), z (a) Assumig that is a oradom parameter, fid ad plot the maximum

More information

Lecture 33: Bootstrap

Lecture 33: Bootstrap Lecture 33: ootstrap Motivatio To evaluate ad compare differet estimators, we eed cosistet estimators of variaces or asymptotic variaces of estimators. This is also importat for hypothesis testig ad cofidece

More information

ECONOMETRIC THEORY. MODULE XIII Lecture - 34 Asymptotic Theory and Stochastic Regressors

ECONOMETRIC THEORY. MODULE XIII Lecture - 34 Asymptotic Theory and Stochastic Regressors ECONOMETRIC THEORY MODULE XIII Lecture - 34 Asymptotic Theory ad Stochastic Regressors Dr. Shalabh Departmet of Mathematics ad Statistics Idia Istitute of Techology Kapur Asymptotic theory The asymptotic

More information

Kernel density estimator

Kernel density estimator Jauary, 07 NONPARAMETRIC ERNEL DENSITY ESTIMATION I this lecture, we discuss kerel estimatio of probability desity fuctios PDF Noparametric desity estimatio is oe of the cetral problems i statistics I

More information

Testing Statistical Hypotheses for Compare. Means with Vague Data

Testing Statistical Hypotheses for Compare. Means with Vague Data Iteratioal Mathematical Forum 5 o. 3 65-6 Testig Statistical Hypotheses for Compare Meas with Vague Data E. Baloui Jamkhaeh ad A. adi Ghara Departmet of Statistics Islamic Azad iversity Ghaemshahr Brach

More information

Because it tests for differences between multiple pairs of means in one test, it is called an omnibus test.

Because it tests for differences between multiple pairs of means in one test, it is called an omnibus test. Math 308 Sprig 018 Classes 19 ad 0: Aalysis of Variace (ANOVA) Page 1 of 6 Itroductio ANOVA is a statistical procedure for determiig whether three or more sample meas were draw from populatios with equal

More information

4.5 Multiple Imputation

4.5 Multiple Imputation 45 ultiple Imputatio Itroductio Assume a parametric model: y fy x; θ We are iterested i makig iferece about θ I Bayesia approach, we wat to make iferece about θ from fθ x, y = πθfy x, θ πθfy x, θdθ where

More information

Module 1 Fundamentals in statistics

Module 1 Fundamentals in statistics Normal Distributio Repeated observatios that differ because of experimetal error ofte vary about some cetral value i a roughly symmetrical distributio i which small deviatios occur much more frequetly

More information

Data Analysis and Statistical Methods Statistics 651

Data Analysis and Statistical Methods Statistics 651 Data Aalysis ad Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasii/teachig.html Suhasii Subba Rao Review of testig: Example The admistrator of a ursig home wats to do a time ad motio

More information

Investigating a new estimator of the serial correlation coefficient

Investigating a new estimator of the serial correlation coefficient Ivestigatig a ew estimator of the serial correlatio coefficiet Mafred Mudelsee Istitute of Mathematics ad Statistics Uiversity of Ket at Caterbury Caterbury Ket CT2 7NF Uited Kigdom 8 May 1998 1 Abstract

More information

Kolmogorov-Smirnov type Tests for Local Gaussianity in High-Frequency Data

Kolmogorov-Smirnov type Tests for Local Gaussianity in High-Frequency Data Proceedigs 59th ISI World Statistics Cogress, 5-30 August 013, Hog Kog (Sessio STS046) p.09 Kolmogorov-Smirov type Tests for Local Gaussiaity i High-Frequecy Data George Tauche, Duke Uiversity Viktor Todorov,

More information

Simulation. Two Rule For Inverting A Distribution Function

Simulation. Two Rule For Inverting A Distribution Function Simulatio Two Rule For Ivertig A Distributio Fuctio Rule 1. If F(x) = u is costat o a iterval [x 1, x 2 ), the the uiform value u is mapped oto x 2 through the iversio process. Rule 2. If there is a jump

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

Trimmed Mean as an Adaptive Robust Estimator of a Location Parameter for Weibull Distribution

Trimmed Mean as an Adaptive Robust Estimator of a Location Parameter for Weibull Distribution World Academy of Sciece Egieerig ad echology Iteratioal Joural of Mathematical ad Computatioal Scieces Vol: No:6 008 rimmed Mea as a Adaptive Robust Estimator of a Locatio Parameter for Weibull Distributio

More information

Singular Continuous Measures by Michael Pejic 5/14/10

Singular Continuous Measures by Michael Pejic 5/14/10 Sigular Cotiuous Measures by Michael Peic 5/4/0 Prelimiaries Give a set X, a σ-algebra o X is a collectio of subsets of X that cotais X ad ad is closed uder complemetatio ad coutable uios hece, coutable

More information

TMA4245 Statistics. Corrected 30 May and 4 June Norwegian University of Science and Technology Department of Mathematical Sciences.

TMA4245 Statistics. Corrected 30 May and 4 June Norwegian University of Science and Technology Department of Mathematical Sciences. Norwegia Uiversity of Sciece ad Techology Departmet of Mathematical Scieces Corrected 3 May ad 4 Jue Solutios TMA445 Statistics Saturday 6 May 9: 3: Problem Sow desity a The probability is.9.5 6x x dx

More information

Double Stage Shrinkage Estimator of Two Parameters. Generalized Exponential Distribution

Double Stage Shrinkage Estimator of Two Parameters. Generalized Exponential Distribution Iteratioal Mathematical Forum, Vol., 3, o. 3, 3-53 HIKARI Ltd, www.m-hikari.com http://dx.doi.org/.9/imf.3.335 Double Stage Shrikage Estimator of Two Parameters Geeralized Expoetial Distributio Alaa M.

More information

t distribution [34] : used to test a mean against an hypothesized value (H 0 : µ = µ 0 ) or the difference

t distribution [34] : used to test a mean against an hypothesized value (H 0 : µ = µ 0 ) or the difference EXST30 Backgroud material Page From the textbook The Statistical Sleuth Mea [0]: I your text the word mea deotes a populatio mea (µ) while the work average deotes a sample average ( ). Variace [0]: The

More information

Bootstrap Intervals of the Parameters of Lognormal Distribution Using Power Rule Model and Accelerated Life Tests

Bootstrap Intervals of the Parameters of Lognormal Distribution Using Power Rule Model and Accelerated Life Tests Joural of Moder Applied Statistical Methods Volume 5 Issue Article --5 Bootstrap Itervals of the Parameters of Logormal Distributio Usig Power Rule Model ad Accelerated Life Tests Mohammed Al-Ha Ebrahem

More information

Stat 200 -Testing Summary Page 1

Stat 200 -Testing Summary Page 1 Stat 00 -Testig Summary Page 1 Mathematicias are like Frechme; whatever you say to them, they traslate it ito their ow laguage ad forthwith it is somethig etirely differet Goethe 1 Large Sample Cofidece

More information

POWER COMPARISON OF EMPIRICAL LIKELIHOOD RATIO TESTS: SMALL SAMPLE PROPERTIES THROUGH MONTE CARLO STUDIES*

POWER COMPARISON OF EMPIRICAL LIKELIHOOD RATIO TESTS: SMALL SAMPLE PROPERTIES THROUGH MONTE CARLO STUDIES* Kobe Uiversity Ecoomic Review 50(2004) 3 POWER COMPARISON OF EMPIRICAL LIKELIHOOD RATIO TESTS: SMALL SAMPLE PROPERTIES THROUGH MONTE CARLO STUDIES* By HISASHI TANIZAKI There are various kids of oparametric

More information

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY GRADUATE DIPLOMA, 016 MODULE : Statistical Iferece Time allowed: Three hours Cadidates should aswer FIVE questios. All questios carry equal marks. The umber

More information

Department of Mathematics

Department of Mathematics Departmet of Mathematics Ma 3/103 KC Border Itroductio to Probability ad Statistics Witer 2017 Lecture 19: Estimatio II Relevat textbook passages: Larse Marx [1]: Sectios 5.2 5.7 19.1 The method of momets

More information

Bayesian Methods: Introduction to Multi-parameter Models

Bayesian Methods: Introduction to Multi-parameter Models Bayesia Methods: Itroductio to Multi-parameter Models Parameter: θ = ( θ, θ) Give Likelihood p(y θ) ad prior p(θ ), the posterior p proportioal to p(y θ) x p(θ ) Margial posterior ( θ, θ y) is Iterested

More information

7.1 Convergence of sequences of random variables

7.1 Convergence of sequences of random variables Chapter 7 Limit theorems Throughout this sectio we will assume a probability space (Ω, F, P), i which is defied a ifiite sequece of radom variables (X ) ad a radom variable X. The fact that for every ifiite

More information

Problem Set 4 Due Oct, 12

Problem Set 4 Due Oct, 12 EE226: Radom Processes i Systems Lecturer: Jea C. Walrad Problem Set 4 Due Oct, 12 Fall 06 GSI: Assae Gueye This problem set essetially reviews detectio theory ad hypothesis testig ad some basic otios

More information

GG313 GEOLOGICAL DATA ANALYSIS

GG313 GEOLOGICAL DATA ANALYSIS GG313 GEOLOGICAL DATA ANALYSIS 1 Testig Hypothesis GG313 GEOLOGICAL DATA ANALYSIS LECTURE NOTES PAUL WESSEL SECTION TESTING OF HYPOTHESES Much of statistics is cocered with testig hypothesis agaist data

More information

S Y Y = ΣY 2 n. Using the above expressions, the correlation coefficient is. r = SXX S Y Y

S Y Y = ΣY 2 n. Using the above expressions, the correlation coefficient is. r = SXX S Y Y 1 Sociology 405/805 Revised February 4, 004 Summary of Formulae for Bivariate Regressio ad Correlatio Let X be a idepedet variable ad Y a depedet variable, with observatios for each of the values of these

More information

ANOTHER WEIGHTED WEIBULL DISTRIBUTION FROM AZZALINI S FAMILY

ANOTHER WEIGHTED WEIBULL DISTRIBUTION FROM AZZALINI S FAMILY ANOTHER WEIGHTED WEIBULL DISTRIBUTION FROM AZZALINI S FAMILY Sulema Nasiru, MSc. Departmet of Statistics, Faculty of Mathematical Scieces, Uiversity for Developmet Studies, Navrogo, Upper East Regio, Ghaa,

More information

Mathematical Statistics - MS

Mathematical Statistics - MS Paper Specific Istructios. The examiatio is of hours duratio. There are a total of 60 questios carryig 00 marks. The etire paper is divided ito three sectios, A, B ad C. All sectios are compulsory. Questios

More information

Lecture 6 Chi Square Distribution (χ 2 ) and Least Squares Fitting

Lecture 6 Chi Square Distribution (χ 2 ) and Least Squares Fitting Lecture 6 Chi Square Distributio (χ ) ad Least Squares Fittig Chi Square Distributio (χ ) Suppose: We have a set of measuremets {x 1, x, x }. We kow the true value of each x i (x t1, x t, x t ). We would

More information

5. INEQUALITIES, LIMIT THEOREMS AND GEOMETRIC PROBABILITY

5. INEQUALITIES, LIMIT THEOREMS AND GEOMETRIC PROBABILITY IA Probability Let Term 5 INEQUALITIES, LIMIT THEOREMS AND GEOMETRIC PROBABILITY 51 Iequalities Suppose that X 0 is a radom variable takig o-egative values ad that c > 0 is a costat The P X c E X, c is

More information

Lecture 7: Properties of Random Samples

Lecture 7: Properties of Random Samples Lecture 7: Properties of Radom Samples 1 Cotiued From Last Class Theorem 1.1. Let X 1, X,...X be a radom sample from a populatio with mea µ ad variace σ

More information

[412] A TEST FOR HOMOGENEITY OF THE MARGINAL DISTRIBUTIONS IN A TWO-WAY CLASSIFICATION

[412] A TEST FOR HOMOGENEITY OF THE MARGINAL DISTRIBUTIONS IN A TWO-WAY CLASSIFICATION [412] A TEST FOR HOMOGENEITY OF THE MARGINAL DISTRIBUTIONS IN A TWO-WAY CLASSIFICATION BY ALAN STUART Divisio of Research Techiques, Lodo School of Ecoomics 1. INTRODUCTION There are several circumstaces

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

Statistical inference: example 1. Inferential Statistics

Statistical inference: example 1. Inferential Statistics Statistical iferece: example 1 Iferetial Statistics POPULATION SAMPLE A clothig store chai regularly buys from a supplier large quatities of a certai piece of clothig. Each item ca be classified either

More information

5. Likelihood Ratio Tests

5. Likelihood Ratio Tests 1 of 5 7/29/2009 3:16 PM Virtual Laboratories > 9. Hy pothesis Testig > 1 2 3 4 5 6 7 5. Likelihood Ratio Tests Prelimiaries As usual, our startig poit is a radom experimet with a uderlyig sample space,

More information

Investigating the Significance of a Correlation Coefficient using Jackknife Estimates

Investigating the Significance of a Correlation Coefficient using Jackknife Estimates Iteratioal Joural of Scieces: Basic ad Applied Research (IJSBAR) ISSN 2307-4531 (Prit & Olie) http://gssrr.org/idex.php?joural=jouralofbasicadapplied ---------------------------------------------------------------------------------------------------------------------------

More information

CEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering

CEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering CEE 5 Autum 005 Ucertaity Cocepts for Geotechical Egieerig Basic Termiology Set A set is a collectio of (mutually exclusive) objects or evets. The sample space is the (collectively exhaustive) collectio

More information

1 Constructing and Interpreting a Confidence Interval

1 Constructing and Interpreting a Confidence Interval Itroductory Applied Ecoometrics EEP/IAS 118 Sprig 2014 WARM UP: Match the terms i the table with the correct formula: Adrew Crae-Droesch Sectio #6 5 March 2014 ˆ Let X be a radom variable with mea µ ad

More information

LECTURE 8: ASYMPTOTICS I

LECTURE 8: ASYMPTOTICS I LECTURE 8: ASYMPTOTICS I We are iterested i the properties of estimators as. Cosider a sequece of radom variables {, X 1}. N. M. Kiefer, Corell Uiversity, Ecoomics 60 1 Defiitio: (Weak covergece) A sequece

More information

4. Partial Sums and the Central Limit Theorem

4. Partial Sums and the Central Limit Theorem 1 of 10 7/16/2009 6:05 AM Virtual Laboratories > 6. Radom Samples > 1 2 3 4 5 6 7 4. Partial Sums ad the Cetral Limit Theorem The cetral limit theorem ad the law of large umbers are the two fudametal theorems

More information

ECE 8527: Introduction to Machine Learning and Pattern Recognition Midterm # 1. Vaishali Amin Fall, 2015

ECE 8527: Introduction to Machine Learning and Pattern Recognition Midterm # 1. Vaishali Amin Fall, 2015 ECE 8527: Itroductio to Machie Learig ad Patter Recogitio Midterm # 1 Vaishali Ami Fall, 2015 tue39624@temple.edu Problem No. 1: Cosider a two-class discrete distributio problem: ω 1 :{[0,0], [2,0], [2,2],

More information

Lecture 6 Chi Square Distribution (χ 2 ) and Least Squares Fitting

Lecture 6 Chi Square Distribution (χ 2 ) and Least Squares Fitting Lecture 6 Chi Square Distributio (χ ) ad Least Squares Fittig Chi Square Distributio (χ ) Suppose: We have a set of measuremets {x 1, x, x }. We kow the true value of each x i (x t1, x t, x t ). We would

More information

Axis Aligned Ellipsoid

Axis Aligned Ellipsoid Machie Learig for Data Sciece CS 4786) Lecture 6,7 & 8: Ellipsoidal Clusterig, Gaussia Mixture Models ad Geeral Mixture Models The text i black outlies high level ideas. The text i blue provides simple

More information

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING Lectures MODULE 5 STATISTICS II. Mea ad stadard error of sample data. Biomial distributio. Normal distributio 4. Samplig 5. Cofidece itervals

More information

Summarizing Data. Major Properties of Numerical Data

Summarizing Data. Major Properties of Numerical Data Summarizig Data Daiel A. Meascé, Ph.D. Dept of Computer Sciece George Maso Uiversity Major Properties of Numerical Data Cetral Tedecy: arithmetic mea, geometric mea, media, mode. Variability: rage, iterquartile

More information