Notes on the use of the Goldfeld-Quandt test for heteroscedasticity in environment research

Size: px
Start display at page:

Download "Notes on the use of the Goldfeld-Quandt test for heteroscedasticity in environment research"

Transcription

1 Biometrical Letters Vol. 45(008), No., Notes o the use of the Goldfeld-Quadt test for heteroscedasticity i eviromet research Aa Budka, Dauta Kachlicka, Maria Kozłowska Departmet of Mathematical ad Statistical Methods, Poza Uiversity of Life Scieces, Wojska Polskiego 8, Pozań, Polad budka@up.poza.pl SUMMARY Some methods of usig the Goldfeld-Quadt test are described. The use of classical statistics (mea, stadard deviatio) or positio statistics (media, average deviatio, media absolute deviatio) is proposed. The results ca be used to test the hypothesis that the residuals from a liear regressio are homoscedastic, whe the experimetal distributio of the cosidered variable is symmetrical or asymmetrical (right-sided skew or left-sided skew). Usig Mote Carlo method, properties of these modificatios of the Goldfeld-Quadt procedure are explored. A compariso of the effectiveess of the described methods for eviromet research is preseted. Key words: Goldfeld-Quadt test, heteroscedasticity, liear regressio, positio statistics. Itroductio The pure scieces have wide applicatio to experimetatio i the field of evirometal egieerig. Statistics ad ecoometrics offer quatitative methods which are a importat tool i the aalysis of results of eviromet research. These iclude research o the cotet of the air, air quality beig a importat issue. High air quality meas that the level of pollutio of the air is low. Pollutio may come from sources which are either atural or athropogeic (caused by huma activity). Natural pollutio is produced maily by forest fires or the decompositio of livig orgaisms, while athropogeic sources iclude trasport, fuel burig, idustrial processes ad others. The quality compositio of the air is variable, ad what is worse there arise secodary pollutats, which are ofte more toxic ad harmful tha the primary oes. The course of these processes depeds o may factors, icludig the spatial distributio of cocetratios. Aother importat aspect is the asphalt

2 44 A. Budka, D. Kachlicka, M. Kozłowska effect. Whe we cover the soil surface with a impermeable layer of cocrete or asphalt, the damp from the groud caot pass to the air. The air becomes dry ad susceptible to pollutio. Therefore a fudametal criterio to cosider whe buyig a plot of buildig lad is its positio. Research ito the effect o lad prices of, amog other thigs, the distace of the plot from the earest highway ad airport requires certai coditios to be checked, which we do usig the Goldfeld- Quadt test. The object of this paper is to preset certai modificatios of the use of the Goldfeld-Quadt test for ivestigatig the presece of heteroscedasticity.. Framework We cosider theliear model y = Xβ + ε, () where y is a x vector of observatios o the depedet variable, X=[, x,, x k- ] is a x k matrix of observatios o k- idepedet variables, β is a k x vector of ukow parameters, ad ε is a x radom vector of errors, where is the vector of oes. We assume that errors are ormally distributed ad the value of the error term correspodig to a observed value of the depedet variable is statistically idepedet of the value of the error term correspodig to ay other observed value of the variable. The error is the distace betwee the observed value of the depedet variable ad its expected value. This expectatio is labelled E(y) ad E(y) = Xβ, because we assume that E(ε) = 0, where 0 is the ull vector. I order for the estimates of the parameters ot to be biased (or best liear ubiased estimates) oe more assumptio should hold: the variace should be costat. The variace matrix is labelled D(ε) ad D(ε) = I, where I is the idetity matrix. If we observe departures from these assumptios the the estimators of the parameters are biased ad cosequetly they are ot best or other drawbacks may to be. Hece cotrol of the model is a very importat part of the regressio aalysis. It is well kow that i the presece of heteroscedasticity of error variaces D(ε)=diag(,,..., ), the least squares method has two major drawbacks:

3 Notes o the use of the Goldfeld-Quadt test for heteroscedasticity 45 iefficiet parameter estimates ad biased variace estimates which make stadard hypothesis tests iappropriate. Kow tests for heteroscedasticity based o error aalysis iclude, for example, the most popular Goldfeld-Quadt test, the Breusch-Paga test ad the White test, described respectively by Goldfeld ad Quadt (965), Breusch ad Paga (979), White (980). The literature o testig for heteroscedasticity icludes may more tests (see Dufour et al., 004). The test for heteroscedasticity i regressio models based o the Goldfeld Quadt methodology defied by Carapeto ad Holt (003) also deserves attetio. Most of these tests are what Goldfeld ad Quadt call o-costructive tests, i that they ca be used to determie the presece or absece of heteroscedasticity, but reveal othig about the form of the variace structure (Buse, 984). We are iterested i testig for heteroscedasticity i situatios with kow deflator. Our ull hypothesis is H 0 : i = for i=,,,, ad the alterative hypothesis is H : ~H 0 (the symbol ~ deotes egatio). The Goldfeld-Quadt test is proposed by, for example, Goldfeld ad Quadt (965), Buse (984) or Maddala (006) for verificatio of the ull hypothesis. The test proposed by Goldfeld-Quadt (965) is carried out i four steps. I the first step we must sort the multivariate variable with respect to the choice of oe idepedet variable, for example x d, d {l: l=,,,k-}. This variable is a potetial deflator. Next, i the secod step, we ca discard oe or more cetral observatios. I the third step, we fit separate regressio aalyses to each of two remaiig sets of observatios. I the last step, we form the statistic, which has F-distributio. The statistic is the quotiet of the mea squares for error from the separate regressios. Let x d x d x d, where the secod subscript deotes the umber of observatios of the variable x d. We obtai a set of observatios of the multivariate variable i the ew orderig. Now we cosider model () i the form y y y 3 X = X X 3 ε + β ε ε 3, ()

4 46 A. Budka, D. Kachlicka, M. Kozłowska where for j=,, 3, y j ad ε j are j x vectors, X j is a j x k dimesio matrix. Goldfeld ad Quadt (965) state that the vector y icludes some cetral observatios after they are sorted with respect to the chose potetial deflator. The choice of the x dimesio vector y is the subject of cosideratio i the ext sectio. The choice of the vector y follows as a cosequece of the choice of cetral observatios of the variable x d. Goldfeld ad Quadt (965) state that the dimesio of vectors y ad y 3 is the same, but other authors, for example Thursby (98), Dufour et al. (004), take ito accout varyig dimesios of these vectors. Goldfeld ad Quadt (965) proposed usig the test for heteroscedasticity after the removal of some umber of cetral observatios. They did ot specify how may observatios should be removed. They gave the relative frequecy of cases i which the false hypothesis is rejected for samples of dimesio =30 ad =60 after omittig 0, 4, 8, or 6 cetral observatios, ad they estimated the power of the test. They obtaied the largest frequecy for =30 ad =60 after the omissio, respectively, of 8 ad 6 cetral observatios (equal to 6.7%). Buse (984) aalysed the problem for =0, 40, 80 removig 0% of cetral observatios. The same dimesio of the removed set of observatios was used by Dufour et al. (004) for =50 ad =00. Maddala (006) suggests the removal of cetral observatios to icrease the power of the test, but he does ot aswer the questio of how may observatios to remove. 3. Results We must quote here two seteces formulated by Goldfeld ad Quadt (965): (i) The power of this test will clearly deped upo the value of, the umber of omitted observatios; for every large value of the power will be small but it is ot obvious that the power icreases mootoically as teds to 0, (ii) The power of the test will clearly deped o the ature of the sample of values for the variable which is the deflator. Thus, if the variace of x d is small relative to the mea of x d the power ca be expected to be small ad coversely. We believe that the umber of omitted observatios depeds o the precisio of the measuremets carried out ad o their distributio.

5 Notes o the use of the Goldfeld-Quadt test for heteroscedasticity 47 The distributio of the idepedet variable called the deflator may be defied by a cumulative probability fuctio labelled F(x d ). If for a value x d(q), where q (0;), the cumulative probability fuctio F(x d(q) ) is equal to q, the x d(q) is called a quatile of order q. Lower quartile, media ad upper quartile are measures of locatio ad are labelled x d(/4), x d(/) ad x d(3/4) respectively. Let us recall that may authors suggest removig 0% of observatios; these are values of the variable x d withi the iterval (x d(/5) ; x d(3/5) ). Whe for a set of several measuremets the mea has bee calculated, the stadard deviatio ca be calculated for the set too. The stadard deviatio tells us how repeatable the placemet of the measure is, ad how much this cotributes to the ucertaity of the mea value. From these, the estimated stadard deviatio of the mea (the stadard error) may be calculated. The stadard deviatio of the mea has also bee called a stadard ucertaity. The stadard ucertaity is a margi whose size ca be thought of as plus or mius oe stadard error. We propose usig classical statistics or positio statistics with the aim of defiig the set of removed observatios. 3.. Method based o classical statistics Propositio. Whe the variable x d has symmetrical distributio, the mea is the cetral poit of the distributio. We assume that stadard error is the measure of deviatio from the mea. From the set of observatios of variable x d, such that x d x d x d, we propose to remove all observatios belogig to the iterval defied by the mea of the variable plus or mius the stadard error, that is ˆ ˆ x di ( µ ˆ ; µ ˆ + ), (3) where µˆ ad ˆ are estimates of the mea ad stadard deviatio of the variable x d respectively. Let us otice that, for the variable x d N(µ;), the orders of the quatile x (q ) = µ ad = µ + have the forms: d x d(q )

6 48 A. Budka, D. Kachlicka, M. Kozłowska q q = F( µ = F = F 0; ( = F( µ + 0; ) = P(x ) = ) = ( ) = d π x P( π d < µ e e µ < ) = xd xd dx x P( dx ) = d µ < ) =, Hece, we have q q = F0; (/ ). I the case whe =6 the differece of the orders is equal to /5, that is q q = F0; (0.5) = / 5. Let us otice that for icreasig sample size, we have lim ( q q) = lim (F0; ( ) ) = 0. Whe x d has the ormal distributio, the legth of the iterval (3) depeds o the size of the sample. I the case we will remove for example 0% of observatios whe =6 or 0% whe = Methods based o positio statistics Whe the variable x d has asymmetrical distributio the the media med(x d ) is the cetral poit of the distributio. The media is i the middle of the sorted data (x d x d x d ). I a similar way we ca defie the first quartile to be /4 of the way through the sorted data (x d(/4) ), ad the third quartile to be 3/4 of the way through the sorted data (x d(3/4) ). For ivestigatio of the variability of the media, a positio statistic called media absolute deviatio is used. The media absolute deviatio (MAD) is defied as the media of the absolute value of the differece betwee a variable ad its media, that is MAD(x d ) = med x d med(x d ). For ormally distributed x d N(µ;), the MAD is give by: MAD(x d )= med (x d µ)/ =0.67

7 Notes o the use of the Goldfeld-Quadt test for heteroscedasticity 49 Let us otice that µ MAD(x d )=med(x d ) MAD(x d )=x d(/4), µ+mad(x d )= =med(x d )+MAD(x d )= x d(3/4) ad F(µ+0.5 MAD(x d )) F(µ 0.5 MAD(x d ))=0.5 because x d µ F( µ / ) F( µ 0.67 / ) = P( < 0.67 / ) = F (0.67 / ) = 0.5 0; We propose takig MAD(x d ) as the upper limitatio of the legth of the iterval icludig omitted observatios of the variable x d. We may ivestigate the symmetry of distributio of the variable x d by calculatig quartiles. Whe the media is equal to half of the iter-quartile rage, that is med(x d )=½ (x d(3/4) x d(/4) ), the the distributio is symmetrical. For right-sided skew distributio we have med(x d )<½ (x d(3/4) x d(/4) ), ad for left-sided skew distributio med(x d )>½ (x d(3/4) x d(/4) ). For variable x d havig asymmetrical distributio we propose some methods for defiig the set of omitted observatios. Propositio. Adoptig the asymptotic approach for buildig the cofidece iterval for the media applyig stadard error (see Dawid, 98) we propose to defie the set of removed observatios by the iterval media plus or mius stadard error. Hece we propose removig the followig observatios : ˆ ˆ x di ( med(x d ) ; med(x d ) + ). (4) Propositio 3. Aother measure of variatio is mea absolute deviatio. The mea absolute deviatio (MeaAD) is defied as follows: MeaAD (xd ) = xdi med(x d ). (5) i The mea absolute deviatio takes accout of values from zero to half of the rage of observatios. Hece we believe that we may remove observatios belogig to the iterval media plus or mius the mea absolute deviatio divided by the square root of the umber of observatios, amely MeaAD(x d ) MeaAD(x d ) x di ( med(xd ) ;med(x d ) + ). (6) =.

8 50 A. Budka, D. Kachlicka, M. Kozłowska Propositio 4. I experimets there is o such thig as a perfect measuremet. Each measuremet cotais a degree of ucertaity due to the limits of istrumets ad the people usig them. The ucertaity idex for sample size has the form.859 MAD(x d ) /sqrt(), where sqrt() deotes the square root of. The rage of omitted values of x d ca be defied as the media plus or mius the value of the ucertaity idex. We propose removig the followig cetral observatios:.859 MAD(xd ).859 MAD(xd ) x di ( med(xd ) ;med(xd ) + ) (7) Above we have give four propositios for defiig the set of cetral values of observatios of the idepedet variable x d called a deflator. These propositios cocer the use of measures of variatio such as the stadard error, the mea absolute deviatio ad the media absolute deviatio. The rage of removed observatios is defied as a iterval whose cetral poit is the mea or media of the variable x d. We will check the usefuless of the above propositios o geerated data. 4. Mote Carlo simulatio I order to compare the effectiveess of the described modificatios of the Goldfeld-Quadt procedure we performed a Mote Carlo study usig SEPATH from Statistica 7.. We cosidered sample size =00 ad we used the sigle regressor model as applied for example i Goldfeld ad Quadt (965), Griffiths ad Surekha (986), Carapeto ad Holt (003). However we performed our study i a differet way tha these authors. I the first step, values y i ad x i for i=,,,00, were geerated from a ormal distributio N(0;), such that these values (y i ;x i ) had highly sigificat correlatio. I the secod step we sorted (y i ;x i ) with respect to x i. I the third step we omitted some cetral observatios. We used oe of six methods of defiig the set. I the ext step we calculated the Goldfeld-Quadt statistics. Nie hudred replicatios were geerated. I the last step we calculated the p-value for the Goldfeld-Quadt test ad the cumulative frequecy of cases i which the ull

9 Notes o the use of the Goldfeld-Quadt test for heteroscedasticity 5 hypothesis is ot rejected. The power of the Goldfeld-Quadt test was calculated usig Statistica 7. for first five hudred replicatios. We cocetrate ow o comparig differet methods for omittig cetral observatios. Two stadard methods are cosidered here; the first method was described by Goldfeld ad Quadt (965) ad the secod by, for example, Buse (984). The first method ivolves use of the test without removig cetral observatios, ad i the secod method twety percet of cetral observatios are omitted. These two methods are compared with our four propositios. Table displays the experimetally calculated mea p-value, cumulative frequecy of cases i which the ull hypothesis is ot rejected o sigificace level 0.05 or 0.0 or 0.00 ad mea power of the Goldfeld-Quadt test tabulated by six methods. Table. Mea p-value, cumulative frequecy of cases i which ull hypothesis is ot rejected (α=0.05, 0.0, 0.00) mea power of the Goldfield-Quadt test for Mote-Carlo experimets ad umber of removed observatios (ro) Set of omitted observatios p-value frequecy power Nro Null % cetral observatios mea ± stadard error media ± stadard error media ± MeaAD/sqrt() media ±.859 MAD/sqrt() The magitude of the differeces i the p-values of our propositios compared with the two methods from earlier papers is small (<%). We obtaied the smallest p-value for the iterval mea plus or mius stadard error. The power give i Table clearly idicates that the power of the Goldfeld-Quadt test deped upo the umber of omitted observatios but it is ot true that power icreases mootoically as this umber teds to ull. This cosideratio shows that the stadard method recommeded by may authors (0% omitted cetral observatios) gives the same result as our methods. Whe we make measuremets, we have o way of kowig how accurate the values are. The use of variatio measures such as stadard error, media absolute deviatio or stadard ucertaity make it possible to avoid takig a overoptimistic view of reality. By performig 'truthful' aalysis, we

10 5 A. Budka, D. Kachlicka, M. Kozłowska ca refie the Goldfeld-Quadt procedure. We ucover ad take advatage the major sources of the ucertaity. I doig so, we use iformatio about measuremets. 5. Research problem Regressio models i eviromet research, ad particularly the estimated stadard errors, rely upo the assumptio that the residuals are idepedetly ad idetically distributed. Two commo violatios of this assumptio are serial correlatio ad heteroscedasticity. We assume that serial correlatio is ot a sigificat issue. However heteroscedasticity, which refers to o-costat error variace, is a commo problem i eviromet regressio models. The preset aalysis is based o the results of research described i example 4.7 by Maddala (006). I this example six variables were described. The price of lad per acre is here called the depedet variable labelled y, the percetage afforested area is labelled x, the distace of the lad parcel from the airport is labelled x, the distace from the highway is labelled x 3, the area of the lad parcel is labelled x 4 ad the moth i which it was sold is labelled x 5. The depedet variable y ad two idepedet variables x ad x 3 were trasformed ad deoted respectively ly, lx ad lx 3. The idepedece variable lx 3 is called the deflator. Hece we ca write x d =lx 3. Table displays classical ad positio statistics tabulated by the six variables. The aalysis bega with a study of homoscedasticity for each idepedet variable separately ad the depedet variable (Table 3). We performed the Goldfeld-Quadt test usig six methods of defiig the set of omitted observatios, separately. The same aalyses were made for two groups of idepedet variables, i which we sorted these data with respect to x d =lx 3. We performed the Goldfeld-Quadt test for two groups to compare their results. We calculated the media absolute deviatio of the deflator MAD(x d )=MAD(lx 3 )=0.397 ad iter-quatile rage (x d(3/5) -x d(/5) )= Table 3 examies the p-value for these aalyses ad the umber of omitted observatios of the deflator lx 3 for two stadard methods ad our four propositios for defiig the set of omitted observatios. For the last two aalyses we also calculated the power of the test.

11 Notes o the use of the Goldfeld-Quadt test for heteroscedasticity 53 Table. Classical statistics ad positio statistics for the variables cosidered i example 4.7 by Maddala (006) ly x lx lx 3 x 4 x 5 Mea Stadard deviatio Stadard error Media Miimum observatio Maksimum observatio Before we cosider the ifluece of a defied of set of omitted observatios o the p-value ad power of the Goldfeld-Quadt test, it is ecessary to make a short digressio o the relatio betwee the magitude of the sets cosidered. I the example the greatest umber of observatios belogs to the secod stadard set (0% =3), but the smallest p-value ad the highest power is for the propositios mea plus or mius stadard deviatio ad media plus or mius stadard deviatio. I the respective sets we have 9 observatios i each. For the ext two propositios, based o mea absolute deviatio or media absolute deviatio, we obtaied worse results. 6. Coclusio To coclude our cosideratios we offer four methods for buildig the set of omitted observatios. O the basis of the Mote Carlo simulatio we geerally prefer first propositio. Propositio is preferred whe the deflator has symmetrical distributio, propositio two whe the distributio is asymmetrical. For the cosidered research problem the results of each applicatio of the Goldfeld-Quadt test are differet. I may cases our propositios are better tha the secod stadard method cosidered here. Methods based o measuremet of the deviatio of the deflator are better tha the removal of 0% cetral observatios without regard to their distributio. We should reiterate that the use of variatio measures such as stadard error, media absolute deviatio or stadard ucertaity makes it possible to avoid takig a overoptimistic view of reality.

12 54 A. Budka, D. Kachlicka, M. Kozłowska Table 3. The p-value for the Goldfeld-Quadt test for simple liear regressio ad two multivariate liear regressio with xd=lx3 (ro umber of removed observatios of variable lx3) Set of omitted observatios lx lx3 x4 x5 ro x; lx; lx3; x4; x5 lx; lx3 p-value p-value p-value p-value p-value power p-value power ull % cetral observatios mea ± stadard error media ± stadard error media ± MeaAD/sqrt() media ±.859 MAD/sqrt()

13 Notes o the use of the Goldfeld-Quadt test for heteroscedasticity 55 REFERENCE Breusch T., Paga A. (979): A Simple Test for Heteroscedasticity ad Radom Coefficiet Variatio. Ecoometrica 47: Buse A. (984): Tests for additive heteroskedasticity Goldfeld ad Quadt revisited. Empirical Ecoomics 9: Carapeto M., Holt W. (003): Testig for heteroscedasticity i regressio models. Joural of Applied Statistics 30(): 3-0. Dawid H.A. (98): Order Statistics. New York, Wiley. Dufour J.-M., Khalaf L., Berard J.-T., Geest I. (004): Simulatio-based fiitesample tests for heteroskedasticity ad ARCH effects. Joural of Ecoometrics : Griffiths W.E., Surekha K. (986): A Mote Carlo evaluatio of the power of some tests for heteroscedasticity. Joural of Ecoometrics 3: 9-3. Goldfeld S.M., Quadt R.E. (965): Some tests for homoscedasticity. America Statistical Associatio Joural 60: Maddala G.S. (006): Ekoometria. PWN, Warszawa Thursby G. (98): Misspecificatio, heteroscedasticity, ad the chow ad Goldfeld- Quadt tests. The Review of Ecoomics ad Statistics 64(): White H. (980): A Heteroscedasticity-Cosistet Covariace Matrix Estimator ad a Direct Test for Heteroscedasticity. Ecoometrica 48:

Properties and Hypothesis Testing

Properties and Hypothesis Testing Chapter 3 Properties ad Hypothesis Testig 3.1 Types of data The regressio techiques developed i previous chapters ca be applied to three differet kids of data. 1. Cross-sectioal data. 2. Time series data.

More information

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING Lectures MODULE 5 STATISTICS II. Mea ad stadard error of sample data. Biomial distributio. Normal distributio 4. Samplig 5. Cofidece itervals

More information

A statistical method to determine sample size to estimate characteristic value of soil parameters

A statistical method to determine sample size to estimate characteristic value of soil parameters A statistical method to determie sample size to estimate characteristic value of soil parameters Y. Hojo, B. Setiawa 2 ad M. Suzuki 3 Abstract Sample size is a importat factor to be cosidered i determiig

More information

There is no straightforward approach for choosing the warmup period l.

There is no straightforward approach for choosing the warmup period l. B. Maddah INDE 504 Discrete-Evet Simulatio Output Aalysis () Statistical Aalysis for Steady-State Parameters I a otermiatig simulatio, the iterest is i estimatig the log ru steady state measures of performace.

More information

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample. Statistical Iferece (Chapter 10) Statistical iferece = lear about a populatio based o the iformatio provided by a sample. Populatio: The set of all values of a radom variable X of iterest. Characterized

More information

11 Correlation and Regression

11 Correlation and Regression 11 Correlatio Regressio 11.1 Multivariate Data Ofte we look at data where several variables are recorded for the same idividuals or samplig uits. For example, at a coastal weather statio, we might record

More information

Output Analysis (2, Chapters 10 &11 Law)

Output Analysis (2, Chapters 10 &11 Law) B. Maddah ENMG 6 Simulatio Output Aalysis (, Chapters 10 &11 Law) Comparig alterative system cofiguratio Sice the output of a simulatio is radom, the comparig differet systems via simulatio should be doe

More information

STA Learning Objectives. Population Proportions. Module 10 Comparing Two Proportions. Upon completing this module, you should be able to:

STA Learning Objectives. Population Proportions. Module 10 Comparing Two Proportions. Upon completing this module, you should be able to: STA 2023 Module 10 Comparig Two Proportios Learig Objectives Upo completig this module, you should be able to: 1. Perform large-sample ifereces (hypothesis test ad cofidece itervals) to compare two populatio

More information

Sample Size Determination (Two or More Samples)

Sample Size Determination (Two or More Samples) Sample Sie Determiatio (Two or More Samples) STATGRAPHICS Rev. 963 Summary... Data Iput... Aalysis Summary... 5 Power Curve... 5 Calculatios... 6 Summary This procedure determies a suitable sample sie

More information

Random Variables, Sampling and Estimation

Random Variables, Sampling and Estimation Chapter 1 Radom Variables, Samplig ad Estimatio 1.1 Itroductio This chapter will cover the most importat basic statistical theory you eed i order to uderstad the ecoometric material that will be comig

More information

1 Inferential Methods for Correlation and Regression Analysis

1 Inferential Methods for Correlation and Regression Analysis 1 Iferetial Methods for Correlatio ad Regressio Aalysis I the chapter o Correlatio ad Regressio Aalysis tools for describig bivariate cotiuous data were itroduced. The sample Pearso Correlatio Coefficiet

More information

Expectation and Variance of a random variable

Expectation and Variance of a random variable Chapter 11 Expectatio ad Variace of a radom variable The aim of this lecture is to defie ad itroduce mathematical Expectatio ad variace of a fuctio of discrete & cotiuous radom variables ad the distributio

More information

Statistical Properties of OLS estimators

Statistical Properties of OLS estimators 1 Statistical Properties of OLS estimators Liear Model: Y i = β 0 + β 1 X i + u i OLS estimators: β 0 = Y β 1X β 1 = Best Liear Ubiased Estimator (BLUE) Liear Estimator: β 0 ad β 1 are liear fuctio of

More information

This is an introductory course in Analysis of Variance and Design of Experiments.

This is an introductory course in Analysis of Variance and Design of Experiments. 1 Notes for M 384E, Wedesday, Jauary 21, 2009 (Please ote: I will ot pass out hard-copy class otes i future classes. If there are writte class otes, they will be posted o the web by the ight before class

More information

The standard deviation of the mean

The standard deviation of the mean Physics 6C Fall 20 The stadard deviatio of the mea These otes provide some clarificatio o the distictio betwee the stadard deviatio ad the stadard deviatio of the mea.. The sample mea ad variace Cosider

More information

ANALYSIS OF EXPERIMENTAL ERRORS

ANALYSIS OF EXPERIMENTAL ERRORS ANALYSIS OF EXPERIMENTAL ERRORS All physical measuremets ecoutered i the verificatio of physics theories ad cocepts are subject to ucertaities that deped o the measurig istrumets used ad the coditios uder

More information

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n.

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n. Jauary 1, 2019 Resamplig Methods Motivatio We have so may estimators with the property θ θ d N 0, σ 2 We ca also write θ a N θ, σ 2 /, where a meas approximately distributed as Oce we have a cosistet estimator

More information

2 1. The r.s., of size n2, from population 2 will be. 2 and 2. 2) The two populations are independent. This implies that all of the n1 n2

2 1. The r.s., of size n2, from population 2 will be. 2 and 2. 2) The two populations are independent. This implies that all of the n1 n2 Chapter 8 Comparig Two Treatmets Iferece about Two Populatio Meas We wat to compare the meas of two populatios to see whether they differ. There are two situatios to cosider, as show i the followig examples:

More information

1 Introduction to reducing variance in Monte Carlo simulations

1 Introduction to reducing variance in Monte Carlo simulations Copyright c 010 by Karl Sigma 1 Itroductio to reducig variace i Mote Carlo simulatios 11 Review of cofidece itervals for estimatig a mea I statistics, we estimate a ukow mea µ = E(X) of a distributio by

More information

Comparing Two Populations. Topic 15 - Two Sample Inference I. Comparing Two Means. Comparing Two Pop Means. Background Reading

Comparing Two Populations. Topic 15 - Two Sample Inference I. Comparing Two Means. Comparing Two Pop Means. Background Reading Topic 15 - Two Sample Iferece I STAT 511 Professor Bruce Craig Comparig Two Populatios Research ofte ivolves the compariso of two or more samples from differet populatios Graphical summaries provide visual

More information

Overview. p 2. Chapter 9. Pooled Estimate of. q = 1 p. Notation for Two Proportions. Inferences about Two Proportions. Assumptions

Overview. p 2. Chapter 9. Pooled Estimate of. q = 1 p. Notation for Two Proportions. Inferences about Two Proportions. Assumptions Chapter 9 Slide Ifereces from Two Samples 9- Overview 9- Ifereces about Two Proportios 9- Ifereces about Two Meas: Idepedet Samples 9-4 Ifereces about Matched Pairs 9-5 Comparig Variatio i Two Samples

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2016 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

Final Review. Fall 2013 Prof. Yao Xie, H. Milton Stewart School of Industrial Systems & Engineering Georgia Tech

Final Review. Fall 2013 Prof. Yao Xie, H. Milton Stewart School of Industrial Systems & Engineering Georgia Tech Fial Review Fall 2013 Prof. Yao Xie, yao.xie@isye.gatech.edu H. Milto Stewart School of Idustrial Systems & Egieerig Georgia Tech 1 Radom samplig model radom samples populatio radom samples: x 1,..., x

More information

First Year Quantitative Comp Exam Spring, Part I - 203A. f X (x) = 0 otherwise

First Year Quantitative Comp Exam Spring, Part I - 203A. f X (x) = 0 otherwise First Year Quatitative Comp Exam Sprig, 2012 Istructio: There are three parts. Aswer every questio i every part. Questio I-1 Part I - 203A A radom variable X is distributed with the margial desity: >

More information

MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND.

MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND. XI-1 (1074) MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND. R. E. D. WOOLSEY AND H. S. SWANSON XI-2 (1075) STATISTICAL DECISION MAKING Advaced

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

Econ 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara

Econ 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara Poit Estimator Eco 325 Notes o Poit Estimator ad Cofidece Iterval 1 By Hiro Kasahara Parameter, Estimator, ad Estimate The ormal probability desity fuctio is fully characterized by two costats: populatio

More information

Statistics 511 Additional Materials

Statistics 511 Additional Materials Cofidece Itervals o mu Statistics 511 Additioal Materials This topic officially moves us from probability to statistics. We begi to discuss makig ifereces about the populatio. Oe way to differetiate probability

More information

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4 MATH 30: Probability ad Statistics 9. Estimatio ad Testig of Parameters Estimatio ad Testig of Parameters We have bee dealig situatios i which we have full kowledge of the distributio of a radom variable.

More information

1 of 7 7/16/2009 6:06 AM Virtual Laboratories > 6. Radom Samples > 1 2 3 4 5 6 7 6. Order Statistics Defiitios Suppose agai that we have a basic radom experimet, ad that X is a real-valued radom variable

More information

Department of Mathematics

Department of Mathematics Departmet of Mathematics Ma 3/103 KC Border Itroductio to Probability ad Statistics Witer 2017 Lecture 19: Estimatio II Relevat textbook passages: Larse Marx [1]: Sectios 5.2 5.7 19.1 The method of momets

More information

Chapter 22. Comparing Two Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc.

Chapter 22. Comparing Two Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc. Chapter 22 Comparig Two Proportios Copyright 2010, 2007, 2004 Pearso Educatio, Ic. Comparig Two Proportios Read the first two paragraphs of pg 504. Comparisos betwee two percetages are much more commo

More information

Statisticians use the word population to refer the total number of (potential) observations under consideration

Statisticians use the word population to refer the total number of (potential) observations under consideration 6 Samplig Distributios Statisticias use the word populatio to refer the total umber of (potetial) observatios uder cosideratio The populatio is just the set of all possible outcomes i our sample space

More information

CEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering

CEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering CEE 5 Autum 005 Ucertaity Cocepts for Geotechical Egieerig Basic Termiology Set A set is a collectio of (mutually exclusive) objects or evets. The sample space is the (collectively exhaustive) collectio

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

Chapter 6 Sampling Distributions

Chapter 6 Sampling Distributions Chapter 6 Samplig Distributios 1 I most experimets, we have more tha oe measuremet for ay give variable, each measuremet beig associated with oe radomly selected a member of a populatio. Hece we eed to

More information

t distribution [34] : used to test a mean against an hypothesized value (H 0 : µ = µ 0 ) or the difference

t distribution [34] : used to test a mean against an hypothesized value (H 0 : µ = µ 0 ) or the difference EXST30 Backgroud material Page From the textbook The Statistical Sleuth Mea [0]: I your text the word mea deotes a populatio mea (µ) while the work average deotes a sample average ( ). Variace [0]: The

More information

Final Examination Solutions 17/6/2010

Final Examination Solutions 17/6/2010 The Islamic Uiversity of Gaza Faculty of Commerce epartmet of Ecoomics ad Political Scieces A Itroductio to Statistics Course (ECOE 30) Sprig Semester 009-00 Fial Eamiatio Solutios 7/6/00 Name: I: Istructor:

More information

Economics 250 Assignment 1 Suggested Answers. 1. We have the following data set on the lengths (in minutes) of a sample of long-distance phone calls

Economics 250 Assignment 1 Suggested Answers. 1. We have the following data set on the lengths (in minutes) of a sample of long-distance phone calls Ecoomics 250 Assigmet 1 Suggested Aswers 1. We have the followig data set o the legths (i miutes) of a sample of log-distace phoe calls 1 20 10 20 13 23 3 7 18 7 4 5 15 7 29 10 18 10 10 23 4 12 8 6 (1)

More information

Open book and notes. 120 minutes. Cover page and six pages of exam. No calculators.

Open book and notes. 120 minutes. Cover page and six pages of exam. No calculators. IE 330 Seat # Ope book ad otes 120 miutes Cover page ad six pages of exam No calculators Score Fial Exam (example) Schmeiser Ope book ad otes No calculator 120 miutes 1 True or false (for each, 2 poits

More information

Discrete Mathematics for CS Spring 2008 David Wagner Note 22

Discrete Mathematics for CS Spring 2008 David Wagner Note 22 CS 70 Discrete Mathematics for CS Sprig 2008 David Wager Note 22 I.I.D. Radom Variables Estimatig the bias of a coi Questio: We wat to estimate the proportio p of Democrats i the US populatio, by takig

More information

Mathematical Statistics - MS

Mathematical Statistics - MS Paper Specific Istructios. The examiatio is of hours duratio. There are a total of 60 questios carryig 00 marks. The etire paper is divided ito three sectios, A, B ad C. All sectios are compulsory. Questios

More information

Stat 319 Theory of Statistics (2) Exercises

Stat 319 Theory of Statistics (2) Exercises Kig Saud Uiversity College of Sciece Statistics ad Operatios Research Departmet Stat 39 Theory of Statistics () Exercises Refereces:. Itroductio to Mathematical Statistics, Sixth Editio, by R. Hogg, J.

More information

7-1. Chapter 4. Part I. Sampling Distributions and Confidence Intervals

7-1. Chapter 4. Part I. Sampling Distributions and Confidence Intervals 7-1 Chapter 4 Part I. Samplig Distributios ad Cofidece Itervals 1 7- Sectio 1. Samplig Distributio 7-3 Usig Statistics Statistical Iferece: Predict ad forecast values of populatio parameters... Test hypotheses

More information

Chapter 22. Comparing Two Proportions. Copyright 2010 Pearson Education, Inc.

Chapter 22. Comparing Two Proportions. Copyright 2010 Pearson Education, Inc. Chapter 22 Comparig Two Proportios Copyright 2010 Pearso Educatio, Ic. Comparig Two Proportios Comparisos betwee two percetages are much more commo tha questios about isolated percetages. Ad they are more

More information

Lecture 5: Parametric Hypothesis Testing: Comparing Means. GENOME 560, Spring 2016 Doug Fowler, GS

Lecture 5: Parametric Hypothesis Testing: Comparing Means. GENOME 560, Spring 2016 Doug Fowler, GS Lecture 5: Parametric Hypothesis Testig: Comparig Meas GENOME 560, Sprig 2016 Doug Fowler, GS (dfowler@uw.edu) 1 Review from last week What is a cofidece iterval? 2 Review from last week What is a cofidece

More information

Topic 18: Composite Hypotheses

Topic 18: Composite Hypotheses Toc 18: November, 211 Simple hypotheses limit us to a decisio betwee oe of two possible states of ature. This limitatio does ot allow us, uder the procedures of hypothesis testig to address the basic questio:

More information

Goodness-Of-Fit For The Generalized Exponential Distribution. Abstract

Goodness-Of-Fit For The Generalized Exponential Distribution. Abstract Goodess-Of-Fit For The Geeralized Expoetial Distributio By Amal S. Hassa stitute of Statistical Studies & Research Cairo Uiversity Abstract Recetly a ew distributio called geeralized expoetial or expoetiated

More information

DS 100: Principles and Techniques of Data Science Date: April 13, Discussion #10

DS 100: Principles and Techniques of Data Science Date: April 13, Discussion #10 DS 00: Priciples ad Techiques of Data Sciece Date: April 3, 208 Name: Hypothesis Testig Discussio #0. Defie these terms below as they relate to hypothesis testig. a) Data Geeratio Model: Solutio: A set

More information

Lecture 2: Monte Carlo Simulation

Lecture 2: Monte Carlo Simulation STAT/Q SCI 43: Itroductio to Resamplig ethods Sprig 27 Istructor: Ye-Chi Che Lecture 2: ote Carlo Simulatio 2 ote Carlo Itegratio Assume we wat to evaluate the followig itegratio: e x3 dx What ca we do?

More information

Chapter 12 Correlation

Chapter 12 Correlation Chapter Correlatio Correlatio is very similar to regressio with oe very importat differece. Regressio is used to explore the relatioship betwee a idepedet variable ad a depedet variable, whereas correlatio

More information

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity LINEAR REGRESSION ANALYSIS MODULE IX Lecture - 9 Multicolliearity Dr Shalabh Departmet of Mathematics ad Statistics Idia Istitute of Techology Kapur Multicolliearity diagostics A importat questio that

More information

Error & Uncertainty. Error. More on errors. Uncertainty. Page # The error is the difference between a TRUE value, x, and a MEASURED value, x i :

Error & Uncertainty. Error. More on errors. Uncertainty. Page # The error is the difference between a TRUE value, x, and a MEASURED value, x i : Error Error & Ucertaity The error is the differece betwee a TRUE value,, ad a MEASURED value, i : E = i There is o error-free measuremet. The sigificace of a measuremet caot be judged uless the associate

More information

6 Sample Size Calculations

6 Sample Size Calculations 6 Sample Size Calculatios Oe of the major resposibilities of a cliical trial statisticia is to aid the ivestigators i determiig the sample size required to coduct a study The most commo procedure for determiig

More information

Data Analysis and Statistical Methods Statistics 651

Data Analysis and Statistical Methods Statistics 651 Data Aalysis ad Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasii/teachig.html Suhasii Subba Rao Review of testig: Example The admistrator of a ursig home wats to do a time ad motio

More information

Parameter, Statistic and Random Samples

Parameter, Statistic and Random Samples Parameter, Statistic ad Radom Samples A parameter is a umber that describes the populatio. It is a fixed umber, but i practice we do ot kow its value. A statistic is a fuctio of the sample data, i.e.,

More information

Investigating the Significance of a Correlation Coefficient using Jackknife Estimates

Investigating the Significance of a Correlation Coefficient using Jackknife Estimates Iteratioal Joural of Scieces: Basic ad Applied Research (IJSBAR) ISSN 2307-4531 (Prit & Olie) http://gssrr.org/idex.php?joural=jouralofbasicadapplied ---------------------------------------------------------------------------------------------------------------------------

More information

Approximate Confidence Interval for the Reciprocal of a Normal Mean with a Known Coefficient of Variation

Approximate Confidence Interval for the Reciprocal of a Normal Mean with a Known Coefficient of Variation Metodološki zvezki, Vol. 13, No., 016, 117-130 Approximate Cofidece Iterval for the Reciprocal of a Normal Mea with a Kow Coefficiet of Variatio Wararit Paichkitkosolkul 1 Abstract A approximate cofidece

More information

CHAPTER 2. Mean This is the usual arithmetic mean or average and is equal to the sum of the measurements divided by number of measurements.

CHAPTER 2. Mean This is the usual arithmetic mean or average and is equal to the sum of the measurements divided by number of measurements. CHAPTER 2 umerical Measures Graphical method may ot always be sufficiet for describig data. You ca use the data to calculate a set of umbers that will covey a good metal picture of the frequecy distributio.

More information

Economics 241B Relation to Method of Moments and Maximum Likelihood OLSE as a Maximum Likelihood Estimator

Economics 241B Relation to Method of Moments and Maximum Likelihood OLSE as a Maximum Likelihood Estimator Ecoomics 24B Relatio to Method of Momets ad Maximum Likelihood OLSE as a Maximum Likelihood Estimator Uder Assumptio 5 we have speci ed the distributio of the error, so we ca estimate the model parameters

More information

Soo King Lim Figure 1: Figure 2: Figure 3: Figure 4: Figure 5: Figure 6: Figure 7:

Soo King Lim Figure 1: Figure 2: Figure 3: Figure 4: Figure 5: Figure 6: Figure 7: 0 Multivariate Cotrol Chart 3 Multivariate Normal Distributio 5 Estimatio of the Mea ad Covariace Matrix 6 Hotellig s Cotrol Chart 6 Hotellig s Square 8 Average Value of k Subgroups 0 Example 3 3 Value

More information

Frequentist Inference

Frequentist Inference Frequetist Iferece The topics of the ext three sectios are useful applicatios of the Cetral Limit Theorem. Without kowig aythig about the uderlyig distributio of a sequece of radom variables {X i }, for

More information

Statistical inference: example 1. Inferential Statistics

Statistical inference: example 1. Inferential Statistics Statistical iferece: example 1 Iferetial Statistics POPULATION SAMPLE A clothig store chai regularly buys from a supplier large quatities of a certai piece of clothig. Each item ca be classified either

More information

MA131 - Analysis 1. Workbook 2 Sequences I

MA131 - Analysis 1. Workbook 2 Sequences I MA3 - Aalysis Workbook 2 Sequeces I Autum 203 Cotets 2 Sequeces I 2. Itroductio.............................. 2.2 Icreasig ad Decreasig Sequeces................ 2 2.3 Bouded Sequeces..........................

More information

Analysis of Experimental Data

Analysis of Experimental Data Aalysis of Experimetal Data 6544597.0479 ± 0.000005 g Quatitative Ucertaity Accuracy vs. Precisio Whe we make a measuremet i the laboratory, we eed to kow how good it is. We wat our measuremets to be both

More information

Econ 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chi-square Distribution, Student s t distribution 1.

Econ 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chi-square Distribution, Student s t distribution 1. Eco 325/327 Notes o Sample Mea, Sample Proportio, Cetral Limit Theorem, Chi-square Distributio, Studet s t distributio 1 Sample Mea By Hiro Kasahara We cosider a radom sample from a populatio. Defiitio

More information

If, for instance, we were required to test whether the population mean μ could be equal to a certain value μ

If, for instance, we were required to test whether the population mean μ could be equal to a certain value μ STATISTICAL INFERENCE INTRODUCTION Statistical iferece is that brach of Statistics i which oe typically makes a statemet about a populatio based upo the results of a sample. I oesample testig, we essetially

More information

Correlation Regression

Correlation Regression Correlatio Regressio While correlatio methods measure the stregth of a liear relatioship betwee two variables, we might wish to go a little further: How much does oe variable chage for a give chage i aother

More information

Chapter 13, Part A Analysis of Variance and Experimental Design

Chapter 13, Part A Analysis of Variance and Experimental Design Slides Prepared by JOHN S. LOUCKS St. Edward s Uiversity Slide 1 Chapter 13, Part A Aalysis of Variace ad Eperimetal Desig Itroductio to Aalysis of Variace Aalysis of Variace: Testig for the Equality of

More information

MA Advanced Econometrics: Properties of Least Squares Estimators

MA Advanced Econometrics: Properties of Least Squares Estimators MA Advaced Ecoometrics: Properties of Least Squares Estimators Karl Whela School of Ecoomics, UCD February 5, 20 Karl Whela UCD Least Squares Estimators February 5, 20 / 5 Part I Least Squares: Some Fiite-Sample

More information

Output Analysis and Run-Length Control

Output Analysis and Run-Length Control IEOR E4703: Mote Carlo Simulatio Columbia Uiversity c 2017 by Marti Haugh Output Aalysis ad Ru-Legth Cotrol I these otes we describe how the Cetral Limit Theorem ca be used to costruct approximate (1 α%

More information

The variance of a sum of independent variables is the sum of their variances, since covariances are zero. Therefore. V (xi )= n n 2 σ2 = σ2.

The variance of a sum of independent variables is the sum of their variances, since covariances are zero. Therefore. V (xi )= n n 2 σ2 = σ2. SAMPLE STATISTICS A radom sample x 1,x,,x from a distributio f(x) is a set of idepedetly ad idetically variables with x i f(x) for all i Their joit pdf is f(x 1,x,,x )=f(x 1 )f(x ) f(x )= f(x i ) The sample

More information

Efficient GMM LECTURE 12 GMM II

Efficient GMM LECTURE 12 GMM II DECEMBER 1 010 LECTURE 1 II Efficiet The estimator depeds o the choice of the weight matrix A. The efficiet estimator is the oe that has the smallest asymptotic variace amog all estimators defied by differet

More information

Continuous Data that can take on any real number (time/length) based on sample data. Categorical data can only be named or categorised

Continuous Data that can take on any real number (time/length) based on sample data. Categorical data can only be named or categorised Questio 1. (Topics 1-3) A populatio cosists of all the members of a group about which you wat to draw a coclusio (Greek letters (μ, σ, Ν) are used) A sample is the portio of the populatio selected for

More information

ENGI 4421 Confidence Intervals (Two Samples) Page 12-01

ENGI 4421 Confidence Intervals (Two Samples) Page 12-01 ENGI 44 Cofidece Itervals (Two Samples) Page -0 Two Sample Cofidece Iterval for a Differece i Populatio Meas [Navidi sectios 5.4-5.7; Devore chapter 9] From the cetral limit theorem, we kow that, for sufficietly

More information

Chapter If n is odd, the median is the exact middle number If n is even, the median is the average of the two middle numbers

Chapter If n is odd, the median is the exact middle number If n is even, the median is the average of the two middle numbers Chapter 4 4-1 orth Seattle Commuity College BUS10 Busiess Statistics Chapter 4 Descriptive Statistics Summary Defiitios Cetral tedecy: The extet to which the data values group aroud a cetral value. Variatio:

More information

KLMED8004 Medical statistics. Part I, autumn Estimation. We have previously learned: Population and sample. New questions

KLMED8004 Medical statistics. Part I, autumn Estimation. We have previously learned: Population and sample. New questions We have previously leared: KLMED8004 Medical statistics Part I, autum 00 How kow probability distributios (e.g. biomial distributio, ormal distributio) with kow populatio parameters (mea, variace) ca give

More information

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY GRADUATE DIPLOMA, 016 MODULE : Statistical Iferece Time allowed: Three hours Cadidates should aswer FIVE questios. All questios carry equal marks. The umber

More information

Sample Size Estimation in the Proportional Hazards Model for K-sample or Regression Settings Scott S. Emerson, M.D., Ph.D.

Sample Size Estimation in the Proportional Hazards Model for K-sample or Regression Settings Scott S. Emerson, M.D., Ph.D. ample ie Estimatio i the Proportioal Haards Model for K-sample or Regressio ettigs cott. Emerso, M.D., Ph.D. ample ie Formula for a Normally Distributed tatistic uppose a statistic is kow to be ormally

More information

Chapter 6 Principles of Data Reduction

Chapter 6 Principles of Data Reduction Chapter 6 for BST 695: Special Topics i Statistical Theory. Kui Zhag, 0 Chapter 6 Priciples of Data Reductio Sectio 6. Itroductio Goal: To summarize or reduce the data X, X,, X to get iformatio about a

More information

[ ] ( ) ( ) [ ] ( ) 1 [ ] [ ] Sums of Random Variables Y = a 1 X 1 + a 2 X 2 + +a n X n The expected value of Y is:

[ ] ( ) ( ) [ ] ( ) 1 [ ] [ ] Sums of Random Variables Y = a 1 X 1 + a 2 X 2 + +a n X n The expected value of Y is: PROBABILITY FUNCTIONS A radom variable X has a probabilit associated with each of its possible values. The probabilit is termed a discrete probabilit if X ca assume ol discrete values, or X = x, x, x 3,,

More information

Read through these prior to coming to the test and follow them when you take your test.

Read through these prior to coming to the test and follow them when you take your test. Math 143 Sprig 2012 Test 2 Iformatio 1 Test 2 will be give i class o Thursday April 5. Material Covered The test is cummulative, but will emphasize the recet material (Chapters 6 8, 10 11, ad Sectios 12.1

More information

DISTRIBUTION LAW Okunev I.V.

DISTRIBUTION LAW Okunev I.V. 1 DISTRIBUTION LAW Okuev I.V. Distributio law belogs to a umber of the most complicated theoretical laws of mathematics. But it is also a very importat practical law. Nothig ca help uderstad complicated

More information

Access to the published version may require journal subscription. Published with permission from: Elsevier.

Access to the published version may require journal subscription. Published with permission from: Elsevier. This is a author produced versio of a paper published i Statistics ad Probability Letters. This paper has bee peer-reviewed, it does ot iclude the joural pagiatio. Citatio for the published paper: Forkma,

More information

Infinite Sequences and Series

Infinite Sequences and Series Chapter 6 Ifiite Sequeces ad Series 6.1 Ifiite Sequeces 6.1.1 Elemetary Cocepts Simply speakig, a sequece is a ordered list of umbers writte: {a 1, a 2, a 3,...a, a +1,...} where the elemets a i represet

More information

A proposed discrete distribution for the statistical modeling of

A proposed discrete distribution for the statistical modeling of It. Statistical Ist.: Proc. 58th World Statistical Cogress, 0, Dubli (Sessio CPS047) p.5059 A proposed discrete distributio for the statistical modelig of Likert data Kidd, Marti Cetre for Statistical

More information

http://www.xelca.l/articles/ufo_ladigsbaa_houte.aspx imulatio Output aalysis 3/4/06 This lecture Output: A simulatio determies the value of some performace measures, e.g. productio per hour, average queue

More information

Extreme Value Charts and Analysis of Means (ANOM) Based on the Log Logistic Distribution

Extreme Value Charts and Analysis of Means (ANOM) Based on the Log Logistic Distribution Joural of Moder Applied Statistical Methods Volume 11 Issue Article 0 11-1-01 Extreme Value Charts ad Aalysis of Meas (ANOM) Based o the Log Logistic istributio B. Sriivasa Rao R.V.R & J.C. College of

More information

The target reliability and design working life

The target reliability and design working life Safety ad Security Egieerig IV 161 The target reliability ad desig workig life M. Holický Kloker Istitute, CTU i Prague, Czech Republic Abstract Desig workig life ad target reliability levels recommeded

More information

Number of fatalities X Sunday 4 Monday 6 Tuesday 2 Wednesday 0 Thursday 3 Friday 5 Saturday 8 Total 28. Day

Number of fatalities X Sunday 4 Monday 6 Tuesday 2 Wednesday 0 Thursday 3 Friday 5 Saturday 8 Total 28. Day LECTURE # 8 Mea Deviatio, Stadard Deviatio ad Variace & Coefficiet of variatio Mea Deviatio Stadard Deviatio ad Variace Coefficiet of variatio First, we will discuss it for the case of raw data, ad the

More information

ECONOMETRIC THEORY. MODULE XIII Lecture - 34 Asymptotic Theory and Stochastic Regressors

ECONOMETRIC THEORY. MODULE XIII Lecture - 34 Asymptotic Theory and Stochastic Regressors ECONOMETRIC THEORY MODULE XIII Lecture - 34 Asymptotic Theory ad Stochastic Regressors Dr. Shalabh Departmet of Mathematics ad Statistics Idia Istitute of Techology Kapur Asymptotic theory The asymptotic

More information

Understanding Samples

Understanding Samples 1 Will Moroe CS 109 Samplig ad Bootstrappig Lecture Notes #17 August 2, 2017 Based o a hadout by Chris Piech I this chapter we are goig to talk about statistics calculated o samples from a populatio. We

More information

Trimmed Mean as an Adaptive Robust Estimator of a Location Parameter for Weibull Distribution

Trimmed Mean as an Adaptive Robust Estimator of a Location Parameter for Weibull Distribution World Academy of Sciece Egieerig ad echology Iteratioal Joural of Mathematical ad Computatioal Scieces Vol: No:6 008 rimmed Mea as a Adaptive Robust Estimator of a Locatio Parameter for Weibull Distributio

More information

GG313 GEOLOGICAL DATA ANALYSIS

GG313 GEOLOGICAL DATA ANALYSIS GG313 GEOLOGICAL DATA ANALYSIS 1 Testig Hypothesis GG313 GEOLOGICAL DATA ANALYSIS LECTURE NOTES PAUL WESSEL SECTION TESTING OF HYPOTHESES Much of statistics is cocered with testig hypothesis agaist data

More information

On Random Line Segments in the Unit Square

On Random Line Segments in the Unit Square O Radom Lie Segmets i the Uit Square Thomas A. Courtade Departmet of Electrical Egieerig Uiversity of Califoria Los Ageles, Califoria 90095 Email: tacourta@ee.ucla.edu I. INTRODUCTION Let Q = [0, 1] [0,

More information

Double Stage Shrinkage Estimator of Two Parameters. Generalized Exponential Distribution

Double Stage Shrinkage Estimator of Two Parameters. Generalized Exponential Distribution Iteratioal Mathematical Forum, Vol., 3, o. 3, 3-53 HIKARI Ltd, www.m-hikari.com http://dx.doi.org/.9/imf.3.335 Double Stage Shrikage Estimator of Two Parameters Geeralized Expoetial Distributio Alaa M.

More information

Chapter 2 Descriptive Statistics

Chapter 2 Descriptive Statistics Chapter 2 Descriptive Statistics Statistics Most commoly, statistics refers to umerical data. Statistics may also refer to the process of collectig, orgaizig, presetig, aalyzig ad iterpretig umerical data

More information

Linear Regression Models

Linear Regression Models Liear Regressio Models Dr. Joh Mellor-Crummey Departmet of Computer Sciece Rice Uiversity johmc@cs.rice.edu COMP 528 Lecture 9 15 February 2005 Goals for Today Uderstad how to Use scatter diagrams to ispect

More information

ST 305: Exam 3 ( ) = P(A)P(B A) ( ) = P(A) + P(B) ( ) = 1 P( A) ( ) = P(A) P(B) σ X 2 = σ a+bx. σ ˆp. σ X +Y. σ X Y. σ X. σ Y. σ n.

ST 305: Exam 3 ( ) = P(A)P(B A) ( ) = P(A) + P(B) ( ) = 1 P( A) ( ) = P(A) P(B) σ X 2 = σ a+bx. σ ˆp. σ X +Y. σ X Y. σ X. σ Y. σ n. ST 305: Exam 3 By hadig i this completed exam, I state that I have either give or received assistace from aother perso durig the exam period. I have used o resources other tha the exam itself ad the basic

More information

[412] A TEST FOR HOMOGENEITY OF THE MARGINAL DISTRIBUTIONS IN A TWO-WAY CLASSIFICATION

[412] A TEST FOR HOMOGENEITY OF THE MARGINAL DISTRIBUTIONS IN A TWO-WAY CLASSIFICATION [412] A TEST FOR HOMOGENEITY OF THE MARGINAL DISTRIBUTIONS IN A TWO-WAY CLASSIFICATION BY ALAN STUART Divisio of Research Techiques, Lodo School of Ecoomics 1. INTRODUCTION There are several circumstaces

More information