Econometrics II Tutorial Problems No. 4

Size: px
Start display at page:

Download "Econometrics II Tutorial Problems No. 4"

Transcription

1 Ecoometrics II Tutorial Problems No. 4 Leart Hoogerheide & Agieszka Borowska Summary Gauss-Markov assumptios (for multiple liear regressio model): MLR.1 (liearity i parameters): The model is y i β 0 + β 1 x i1 + + β k x ik + u i, where β 0,..., β k are ukow parameters (costats) ad u i is a uobserved radom error term. MLR.2 (radom samplig): We have a radom sample of idepedet observatios {(x i1,..., x ik, y i ) : i 1,..., }. MLR.3 (o perfect colliearity): No exact liear relatioships betwee variables (ad oe of the idepedet variables is costat). MLR.4 (zero coditioal mea): E(u i x i1,..., x ik ) 0. MLR.5 (homoskedasticity): Var(u i x i1,..., x ik ) σ 2. Heteroskedasticity of Ukow Form: Heteroskedasticity that may deped o the explaatory variables i a ukow, arbitrary fashio. Heteroskedasticity-Robust Stadard Error: (White stadard errors) A stadard error that is (asymptotically) robust to heteroskedasticity of ukow form. Ca be obtaied as the square root of a diagoal elemet of Var( ˆβ OLS ) (X X) 1 X ˆΩX (X X) 1, where ˆΩ diag(û 2 1,..., û 2 ), the diagoal matrix with squared OLS residuals o the diagoal. Heteroskedasticity-Robust Statistic: A statistic that is (asymptotically) robust to heteroskedasticity of ukow form. E.g. t, F, LM statistics. Breusch-Paga Test: (LM test) A test for heteroskedasticity where the squared OLS residuals are regressed o exogeous variables ofte (a subset of) the explaatory variables i the model, their squares ad/or cross terms. White Test (without cross terms): A special case of Breusch-Paga Test, which ivolves regressig the squared OLS residuals o the squared explaatory variables. Weighted Least Squares (WLS) Estimator: A estimator used to adjust for a kow form of heteroskedasticity, where each squared residual is weighted by the iverse of the variace of the error. Feasible WLS (FWLS) Estimator: A estimator used to adjust for a ukow form of heteroskedasticity, where variace parameters are ukow ad therefore must first be estimated. 1

2 2 Extra Topics 2.1 Goldfeld Quadt (1965) test I a utshell Idea: If the error variaces are homoskedastic (equal across observatios), the the variace for oe part of the sample will be the same as the variace for aother part of the sample. Based o the ratio of variaces. Test for the equality of error variaces usig a F -test o the ratio of two variaces. Key assumptio: idepedet ad ormally distributed error terms. Divide the sample of ito three parts, the discard the middle observatios. Estimate the model for each of the two other sets of observatios ad compute the correspodig residual variaces. It requires that the data ca be ordered with odecreasig variace. The ordered data set is split i three groups: 1. the first group cosists of the first 1 observatios (with variace σ 2 1); 2. the secod group of the last 2 observatios (with variace σ 2 2); 3. the third group of the remaiig observatios i the middle. This last group is left out of the aalysis, to obtai a sharper cotrast betwee the variaces i the first ad secod group. The ull hypothesis is that the variace is costat for all observatios, ad the alterative is that the variace icreases. Hece, the ull ad alterative hypotheses are H 0 : σ 2 1 σ 2 2, H 1 : σ 2 1 < σ 2 2. Apply OLS to groups 1 ad 2 separately, with resultig sums of squared residuals SSR 1 ad SSR 2 respectively ad estimated variaces s 2 1 SSR1 ad 1 k s2 2 SSR2. 2 k Uder the assumptio of idepedetly ad ormally distributed error terms: SSR j σ 2 j χ 2 j k, j 1, 2, ad these two statistics are idepedet. Therefore: SSR 2 ( 2 k)σ 2 2 SSR 1 ( 1 k)σ 2 1 s 2 2 σ 2 2 s 2 1 σ 2 1 F ( 2 k, 1 k). So, uder the ull hypothesis of equal variaces, the test statistic F s2 2 s 2 F ( 2 k, 1 k). 1 The ull hypothesis is rejected i favour of the alterative if F takes large values. There exists o geerally accepted rule to choose the umber 3 of excluded middle observatios. If the variace chages oly at a sigle break-poit, the it would be optimal to select the two groups accordigly ad to take 3 0. O the other had, if early all variaces are equal ad oly a few first observatios have smaller variace ad a few last oes have larger variace, the it would be best to take 3 large. I practice oe uses rules of thumb: e.g. 3 5 if the sample size is small ad 3 3 if is large. 2

3 2.2 Correctio factor for multiplicative models Recall that we distiguish two models for heteroskedasticity i the cotext of FWLS: multiplicative heteroskedasticity model additive heteroskedasticity model Var(u i x i ) σ 2 exp (δ 0 + δ 1 x i1 + + δ k x ik ) ; Var(u i x i ) δ 0 + δ 1 x i1 + + δ k x ik. The latter has, however, a disadvatage that (estimate of) Var(u i x i ) ca be egative, so we maily focus o the former oe. Notice that i the multiplicative model we have so it is equivalet with Var(u i x i ) E(ui xi)0 E(u 2 i x i ) σ 2 exp (δ 0 + δ 1 x i1 + + δ k x ik ), u 2 i σ 2 exp (δ 0 + δ 1 x i1 + + δ k x ik ) v i, v i u 2 i E(u 2 i x i) ( mea 1 radom variable) Hece, we cosider where η i is the error term ad α 0 is a costat term log(u 2 i ) α 0 + δ 1 x i1 + + δ k x ik + η i, η i log(v i ) E(log(v i )) α 0 log(σ 2 ) + δ 0 + E(log(v i )). Hece, the coefficiet δ 0 of the costat term is ot cosistetly estimated by ˆα 0 from OLS. To obtai its cosistet estimate a correctio factor is eeded so δ 0 is the estimated by ˆδ 0 + a, where, if the errors are ormally distributed (u i x i N (0, σ 2 i )), a E[log(χ 2 1)] We will see how this works i Computer Exercise 2(i) 1. 1 Note, however, that a cosistet estimator of δ 0 is ot eeded, because exp(ˆδ 0 ) is merely a costat scalig factor that does ot affect the FWLS estimator. 3

4 3 Warm-up Exercises 3.1 W8/1 Which of the followig are cosequeces of heteroskedasticity? (i) The OLS estimators, ˆβ j, are icosistet. The homoskedasticity assumptio played o role i showig that the OLS estimator is cosistet. Ideed, eve with Var(u X) Ω σ 2 I we have for ˆβ OLS β + (X X) 1 X u: so the OLS estimator is still cosistet. ( ) ( X ) 1 ( X X ) u plim ˆβOLS β + plim plim ( 1 ( ) 1 β + plim x i x 1 i) plim x i u i β + E(X X) 1 E(X u) }{{}, E(XE(u X))0 (ii) The usual (homoskedasticity-oly) F statistic o loger has a F distributio. Now, we have so the usual expressio Var( ˆβ OLS ) (X X) 1 X ΩX (X X) 1, σ 2 (X X) 1 for the variace does ot apply aymore. The latter expressio is biased, which makes the stadard (homoskedasticity-oly) F test (ad t test) ivalid. Oe should use a heteroskedasticity-robust F (ad t) statistic, based o heteroskedasticity-robust stadard errors. (iii) The OLS estimators are o loger BLUE. As heteroskedasticity is a violatio of the Gauss-Markov assumptios, the OLS estimator is o loger BLUE: it is still liear, ubiased, but ot best i a sese that it is ot efficiet. Ituitively, the iefficiecy of the OLS estimator uder heteroskedasticity ca be cotributed to the fact that observatios with low variace are likely to covey more iformatio about the parameters tha observatios with high variace, ad so the former should be give more weight i a efficiet estimator (but all are weighted equally). 3.2 W8/2 Cosider a liear model to explai mothly beer cosumptio: E(u ic, price, educ, female) 0, Var(u ic, price, educ, female) σ 2 ic 2. beer β 0 + β 1 ic + β 2 price + β 3 educ + β 4 female + u, Write the trasformed equatio that has a homoskedastic error term. With Var(u ic, price, educ, female) σ 2 ic 2 we have h(x) ic 2, where h(x) is a fuctio of the explaatory variables that determies the heteroskedasticity (defied as Var(u x) σ 2 h(x)). Therefore, h(x) ic, ad so the trasformed equatio is obtaied by dividig the origial equatio by ic: beer ic β 1 0 ic + β ic 1 ic + β price 2 ic + β educ 3 ic + β female 4 ic 1 β 0 ic + β price 1 + β 2 ic + β educ 3 ic + β female 4 ic + u ic + u ic. Notice that β 1, which is the slope o ic i the origial model, is ow a costat i the trasformed equatio. This is simply a cosequece of the form of the heteroskedasticity ad the fuctioal forms of the explaatory variables i the origial equatio. 4

5 3.3 Small computer exercise Usig the data i the file earigs.wf1 2 ru the regressio y i β 1 d 1i + β 2 d 2i + β 3 d 3i + u i (1) where d ki, k 1, 2, 3, are dummy variables for three age groups. The test the ull hypothesis that E(u 2 i ) σ2 agaist the alterative that E(u 2 i ) γ 1 d 1i + γ 2 d 2i + γ 3 d 3i. Report p-values for both F ad R 2 tests. Recall that tests for homoskedasticity are costructed as follows: H 0 : homoskedasticity, H 1 : ot H 0, i.e. heteroskedasticity. The easiest way to perform the required test is simply to regress the squared residuals from (1) o a costat ad two of the three (to prevet colliearity) dummy variables. Notice that this gives us the same results as ruig the built-i heteroskedastisity test (Breusch-Paga-Godfrey) i EViews: The F statistic from this regressio for the hypothesis that the coefficiets of the dummy variables are zero is It is asymptotically distributed as F (k, k 1) F (2, 4263), ad the p-value is A alterative statistic is R 2, which is equal to It is asymptotically distributed as χ 2 k χ2 2, ad the p value is (Recall from the lecture that this is worse tha F test i fiite samples). The two test statistics yield idetical ifereces, amely, that the ull hypothesis should be rejected at ay covetioal sigificace level. 4 Problem o heteroskedasticity modellig Cosider the model y i βx i + ε i (without costat term ad with k 1), where x i > 0 for all observatios, E(ε i ) 0, E(ε i ε j ) 0, i j, ad E(ε 2 i ) σ2 i. Cosider the followig three estimators of β: b 1 x iy i, x2 i b 2 y i x, i b 3 1 y i. x i 2 Average aual earigs i 1988 ad 1989, i 1982 US dollars, for idividuals i three age groups. 5

6 For each estimator, derive a model for the variaces σi 2 for which this estimator is the best liear ubiased estimator of β. Recall that whe we have a model for heteroskedasticity, i.e. i Var(u i x i ) σ 2 h(x i ) the fuctio h i h(x i ) is kow, the trasformig the origial data by dividig them by h i results i a liear regressio where all Gauss-Markov assumptios are satisfied, which meas that the correspodig OLS estimator is BLUE. Cosider y i βx i + ε i, Var(u i x i ) σ 2 h i, β x i + ε ( ) i ui, Var x i σ 2, hi hi hi y i hi }{{} :y i }{{} :x i }{{} :ε i so that the correspodig OLS estimator is ˆβ OLS x i y i (x i )2 x i hi ( y i hi ) 2 hi x i x iy i h i. x 2 i h i Hece, we simply eed to fid what fuctios h i have led to the three give WLS estimators b 1 b To have ˆβ OLS b 1 we eed x iy i h i x 2 i h i x iy i, x2 i which meas that h i 1, i 1,..., (or h i C for ay other positive costat C, sice this would simply drop out i the umerator ad the deomiator), ad Var(u i x i ) σ 2. Notice that this is simply the OLS estimator for the homoskedastic case. 2. To have ˆβ OLS b 2 we eed x iy i h i x 2 i h i y i x, i which meas that h i x i, i 1,..., (or h i Cx i for ay other positive costat C), ad Var(u i x i ) σ 2 x i. Notice that this is a valid expressio for the variace due to the assumptio that x i > 0, i 1,...,. 3. To have ˆβ OLS b 3 we eed x iy i h i x 2 i h i 1 y i x i y i x i x i y i x i x i x 2 i x 2 i which meas that h i x 2 i, i 1,..., (or h i Cx 2 i for ay other positive costat C), ad Var(u i x i ) σ 2 x 2 i., 5 Computer Exercises Exercise 1 Simulate 100 data poits as follows. Let x i cosist of 100 radom drawigs from the stadard ormal distributio, let η i be a radom drawig from the distributio N (0, x 2 i ), ad let y i x i + η i (i.e. the true value is β 1). We will estimate the model y i βx i + ε i. 6

7 (i) Estimate β by OLS. Compute the homoskedasticity-oly stadard error of ˆβ OLS ad the White heteroskedasticityrobust stadard error of ˆβ OLS. (ii) Estimate β by WLS usig the kowledge that σi 2 σ2 x 2 i. Compare the estimate ad the homoskedasticityoly ad heteroskedasticity-robust stadard errors obtaied for this WLS estimator with the results for OLS i (i). We start with costructig the (correctly) trasformed series: y i : y i x i, x i : x i x i 1, ε i : ε i x i, so that ow the trasformed error terms ε i are homoskedastic. We the ru two OLS regressios o the trasformed series (oe with the homoskedasticity-oly stadard errors ad oe with the White heteroskedasticity-robust stadard errors). Not surprisigly, both give us the same results. Next, we ru two WLS regressios o the origial series, usig the correct weights, h i x 2 i (agai, oe with the homoskedasticity-oly stadard errors ad oe with the White heteroskedasticity-robust stadard errors). Notice that because ow x i ca be egative we eed to take their absolute values for weightig. As expected, the results are exactly the same as i the previous trasformed case. 7

8 (iii) Now estimate β by WLS usig the (icorrect) heteroskedasticity model σi 2 σ2. Compute the stadard x 2 i error of this estimate i three ways: by the WLS expressio correspodig to this (icorrect) model, by the White method for OLS o the (icorrectly) weighted data, ad also by derivig the correct formula for the stadard deviatio of WLS with this icorrect model for the variace. We start with costructig the (icorrectly) trasformed series: y i : y i x i, x i : x i x i x 2 i, ε i : ε i x i, so that ow the trasformed error terms ε i are heteroskedastic. To have a referece to the previous subpoit, we ru four regressios: two OLS oes ad two WLS oes, each time with oe with the homoskedasticity-oly stadard errors ad oe with the White heteroskedasticity-robust stadard errors. Now the ot-heteroskedasticity-robustified regressios (OLS ad WLS) give the same results, ad so do both (OLS ad WLS) with the White correctio. What is left is to derive the correct formula for the stadard deviatio of WLS uder the icorrect model for the variace. Recall that i the oe-variable (ad without a costat term) settig we have ˆβ W LS x iy i h i, x 2 i h i so with the weights h i 1 x 2 i ad usig y i βx i + ε i, we arrive at ˆβ W LS x3 i y i x4 i x3 i (βx i + ε i ) x4 i β + x3 i ε i. x4 i 8

9 Because ˆβ ( ) W LS is ubiased, i.e. E ˆβW LS x β, the variace of ˆβ W LS is ( ) Var ˆβW LS x [ ( ( )) ] 2 E ˆβW LS E ˆβW LS x x [ ( E β + x3 i ε 2 ] i β) x4 i x [ ( E x3 i ε 2 ] i) ( x4 i )2 x x6 i E [ ε 2 i x i] ( ) ( ) ( ) ( x4 i )2 x6 i Var [ε i x i ] ( x4 i )2 x8 i ( x4 i )2, where i ( ) we use the coditioig o x ad the fact that ε i are mutually idepedet, i ( ) the fact that E(ε i x i ) 0 ad i ( ) that Var(ε i x i ) σ 2 x 2 i x2 i. For the simulated x i we obtai x4 i ad x8 i , hece ( ) Var ˆβW LS x ( ) , so that the stadard deviatio of ˆβ W LS is This shows that the stadard error from the heteroskedasticity-robust regressios of 0.22 is still estimated with some error. (iv) Perform 1000 simulatios, where the 1000 values of x i remai the same over all simulatios but the 100 values of η i are differet drawigs from the N (0, x 2 i ) distributios ad where the values of y i x i +η i differ accordigly betwee the simulatios. Determie the sample stadard deviatios over the 1000 simulatios of the three estimators of β i (i)-(iii), that is, OLS, WLS (with correct weights), ad WLS (with icorrect weights). Figure 1 preset a EViews code used for this simulatio experimet (ad also for the previous computatios). The stadard deviatios of the obtaied series of 1000 estimates for β usig the required three methods are as follows: St.dev( ˆβ OLS ) , St.dev( ˆβ W LS,correct ) , St.dev( ˆβ W LS,icorrect ) Notice that the last value is almost idetical to the theoretical oe, obtaied i (iii). (v) Compare the three sample stadard deviatios i (iv) with the estimated stadard errors i (i) (iii), ad commet o the outcomes. Which stadard errors are reliable, ad which oes are ot? The table below summarises the required results. Clearly, WLS with the correctly specified model for the variaces gives reliable stadard errors. OLS ad WLS with the icorrect weightig greatly uderestimate the variability of the estimator for β whe the heteroskedasticity-robust stadard errors are ot used. Whe the latter are applied the stadard error for both methods improve cosiderably, but still are estimated with some error. Sigle estimatio st. errors Method Homosked. oly Heterosked. robust Simulatio st. deviatios OLS WLS correct WLS icorrect

10 Figure 1: EViews code example for Computer Exercise 1. 10

11 Exercise 2 Cosider the bak wages data bakwages.wf1 with the regressio model y i β 1 + β 2 x i + β 3 D gi + β 4 D mi + β 5 D 2i + β 6 D 3i + ε i, where y i is the logarithm of yearly wage, x i is the umber of years of educatio, D g is a geder dummy (1 for males, 0 for females), ad D m is a miority dummy (1 for miorities, 0 otherwise). Admiistratio is take as referece category ad D 2 ad D 3 are dummy variables (D 2 1 for idividuals with a custodial job ad D 2 0 otherwise, ad D 3 1 for idividuals with a maagemet positio ad D 3 0 otherwise). (i) Cosider the followig multiplicative model for the variaces: σ 2 i E[ε 2 i ] e γ1+γ2d2+γ3d3. Estimate the ie parameters (six regressio parameters ad three variace parameters) by (two-step) FWLS. Obtai the estimates of the stadard deviatios per job category ad iterpret the results. To apply (two-step) FWLS, we start by estimatig the regressio ad the model for variaces by OLS. For the latter we cosider as the explaied variable log(ˆε 2 i ), where ˆε i are the OLS residuals of from the first regressio. Keepig i mid the correctio factor for multiplicative models (assumig that ε i has a ormal distributio), we estimate the variaces as so that Pluggig i the obtaied estimates, we obtai: ˆσ 2 i exp( ˆγ 1 + ˆγ 2 D 2i + ˆγ 3 D 3i ), ˆσ 2 1 exp( ˆγ 1 ), ˆσ 2 2 exp( ˆγ 1 + ˆγ 2 ), ˆσ 2 3 exp( ˆγ 1 + ˆγ 3 ). ˆσ 2 1 exp( ) , ˆσ 2 2 exp( ) , ˆσ 2 3 exp( ) , which gives us the required stadard deviatios per job category: ˆσ 1 ˆσ , ˆσ 2 ˆσ , ˆσ 3 ˆσ As expected, the stadard deviatio is smallest for custodial jobs ad it is largest for maagemet jobs. Notice, however, that the estimates ˆγ 2 ad ˆγ 3 are ot sigificat, idicatig that the homoskedasticity of the error caot be rejected. Next, we ru WLS with weights equal to the iverse of the fitted stadard deviatio. 11

12 We ca see that the outcomes are quite close to those of OLS, so that the effect of heteroskedasticity is relatively small (which is i lie with the fact that we did ot reject the ull of homoskedastic error term). (ii) Next, adjust the model for the variaces as follows: E[ε 2 i ] γ 1 + γ 2 D 2 + γ 3 D 3 + γ 4 x i + γ 5 x 2 i, i.e. the model for the variaces is additive ad cotais also effects of the level of educatio. Estimate the eleve parameters (six regressio parameters ad five variace parameters) by (two-step) FWLS ad compare the outcomes with the results i (i). With the additive model we ow estimate the variaces as so that ˆσ 2 i ˆγ 1 + ˆγ 2 D 2i + ˆγ 3 D 3i + ˆγ 4 x i + ˆγ 5 x 2 i, ˆσ 2 1 ˆγ 1 + ˆγ 4 x i + ˆγ 5 x 2 i, x i + 7e-05x 2 i, ˆσ 2 2 ˆγ 1 + ˆγ 2 + ˆγ 4 x i + ˆγ 5 x 2 i x i + 7e-05x 2 i, ˆσ 2 3 ˆγ 1 + ˆγ 3 + ˆγ 4 x i + ˆγ 5 x 2 i x i + 7e-05x 2 i. 12

13 Notice that this time we caot obtai stadard deviatios per job category, because the estimates of stadard deviatio are idividual specific (depedig o the educatio level). However, the estimates ˆγ 2 ˆγ 5 are ot sigificat, idicatig that agai the homoskedasticity of the error caot be rejected. Below we sum up the three sets of stadard errors. Stadard errors Variable ˆβk OLS FWLS o x i FWLS with x i C EDUC GENDER MINORITY DUMJCAT DUMJCAT We ca see that chagig of the model for heteroskedasticity does ot have a big impact o the results, which are similar to those from (i). Nevertheless, the additive FWLS estimator icludig the educatio effect is somewhat more accurate tha the multiplicative, job-category-oly FWLS estimator, which is a bit more accurate tha the OLS oe. (iii) Check that the data i the data file are sorted with icreasig values of x i. Ispect the histogram of x i ad choose two subsamples to perform the Goldfeld Quadt test 3 o possible heteroskedasticity due to the variable x i. Based o the plots of x i above we choose x i 12 as the first group ad x i > 15 as the secod group, so that both groups are large eough ad so that there are some observatios dropped with 12 < x i < 15 (a few oes with x i 14). This results i 1 241, ad Ruig the origial regressio (with k 5) o both subsamples yields SSR ad SSR , so that we obtai F SSR 2 2 k SSR 1 1 k 3 Sice the Goldfeld-Quadt has ot bee i the lecture slides, it will be explaied durig the tutorial. 4 You ca use the commads smpl if educ<12, scalar ad smpl if educ>15, scalar i EViews. 13

14 (with the exact values i EViews, with the above oes it is ), which uder the ull of homosekdasticity follows the F ( 2 k, 1 k) F (225 5, 241 5) F (220, 236) distributio. The correspodig p-value is 9.76E-06 so virtually 0. Hece, at ay reasoable sigificace level we reject the ull of homoskedasticity ad coclude that there is evidece for heteroskedasticy due to the educatio level. (iv) Perform the Breusch Paga test o heteroskedasticity, usig the specified model for the variaces. We still use the additive model for the variaces from (ii), i.e. we cosider R 2 from the auxiliary regressio from (ii) ˆε 2 i γ 1 + γ 2 D 2 + γ 3 D 3 + γ 4 x i + γ 5 x 2 i + η i. With R , the obtaied value of the LM statistic is LM R , with the correspodig p-value of (we use the χ 2 4 distributio). Hece, at the stadard sigificace level of 5% we ca reject the ull of homoskedasticity. Alteratively, we ca ru the built-i test i EViews, where we eed to adjust the regressors i the test specificatio box, which leads to the same results. (v) Also perform the White test o heteroskedasticity. The results for the White test without ad with cross terms, respectively, are show below. The LM statistic for the White test without cross terms is equal to ad uder the ull it follows the χ 2 5 distributio. The correspodig p-value is For the White test with cross terms we obtai LM , which follows the χ 2 14 distributio uder the ull ad yields the p-value of Either way we ca reject the ull of homoskedasticity at the stadard sigificace level of 5%. 14

15 (vi) Commet o the similarities ad differeces betwee the test outcomes i (iii) (v). The mai similarity is that all three tests rejected the ull of homoskedasticity, hece we have strog grouds to claim that the variace of the uobserved factors chages across differet segmets of the aalysed data. A differece is the exact level of the p-value: some tests may have more power to detect heteroskedasticity for this dataset (ad reject H 0 more clearly with a lower p-value). Aother differece is that the Goldfeld-Quadt test assumes that the errors are ormally distributed, whereas the Breusch-Paga ad White tests do ot rely o this assumptio. 15

Properties and Hypothesis Testing

Properties and Hypothesis Testing Chapter 3 Properties ad Hypothesis Testig 3.1 Types of data The regressio techiques developed i previous chapters ca be applied to three differet kids of data. 1. Cross-sectioal data. 2. Time series data.

More information

Statistical Properties of OLS estimators

Statistical Properties of OLS estimators 1 Statistical Properties of OLS estimators Liear Model: Y i = β 0 + β 1 X i + u i OLS estimators: β 0 = Y β 1X β 1 = Best Liear Ubiased Estimator (BLUE) Liear Estimator: β 0 ad β 1 are liear fuctio of

More information

Efficient GMM LECTURE 12 GMM II

Efficient GMM LECTURE 12 GMM II DECEMBER 1 010 LECTURE 1 II Efficiet The estimator depeds o the choice of the weight matrix A. The efficiet estimator is the oe that has the smallest asymptotic variace amog all estimators defied by differet

More information

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n.

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n. Jauary 1, 2019 Resamplig Methods Motivatio We have so may estimators with the property θ θ d N 0, σ 2 We ca also write θ a N θ, σ 2 /, where a meas approximately distributed as Oce we have a cosistet estimator

More information

11 Correlation and Regression

11 Correlation and Regression 11 Correlatio Regressio 11.1 Multivariate Data Ofte we look at data where several variables are recorded for the same idividuals or samplig uits. For example, at a coastal weather statio, we might record

More information

3/3/2014. CDS M Phil Econometrics. Types of Relationships. Types of Relationships. Types of Relationships. Vijayamohanan Pillai N.

3/3/2014. CDS M Phil Econometrics. Types of Relationships. Types of Relationships. Types of Relationships. Vijayamohanan Pillai N. 3/3/04 CDS M Phil Old Least Squares (OLS) Vijayamohaa Pillai N CDS M Phil Vijayamoha CDS M Phil Vijayamoha Types of Relatioships Oly oe idepedet variable, Relatioship betwee ad is Liear relatioships Curviliear

More information

1 Inferential Methods for Correlation and Regression Analysis

1 Inferential Methods for Correlation and Regression Analysis 1 Iferetial Methods for Correlatio ad Regressio Aalysis I the chapter o Correlatio ad Regressio Aalysis tools for describig bivariate cotiuous data were itroduced. The sample Pearso Correlatio Coefficiet

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2016 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

Economics 326 Methods of Empirical Research in Economics. Lecture 18: The asymptotic variance of OLS and heteroskedasticity

Economics 326 Methods of Empirical Research in Economics. Lecture 18: The asymptotic variance of OLS and heteroskedasticity Ecoomics 326 Methods of Empirical Research i Ecoomics Lecture 8: The asymptotic variace of OLS ad heteroskedasticity Hiro Kasahara Uiversity of British Columbia December 24, 204 Asymptotic ormality I I

More information

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample. Statistical Iferece (Chapter 10) Statistical iferece = lear about a populatio based o the iformatio provided by a sample. Populatio: The set of all values of a radom variable X of iterest. Characterized

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

Correlation Regression

Correlation Regression Correlatio Regressio While correlatio methods measure the stregth of a liear relatioship betwee two variables, we might wish to go a little further: How much does oe variable chage for a give chage i aother

More information

Lesson 11: Simple Linear Regression

Lesson 11: Simple Linear Regression Lesso 11: Simple Liear Regressio Ka-fu WONG December 2, 2004 I previous lessos, we have covered maily about the estimatio of populatio mea (or expected value) ad its iferece. Sometimes we are iterested

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

Asymptotic Results for the Linear Regression Model

Asymptotic Results for the Linear Regression Model Asymptotic Results for the Liear Regressio Model C. Fli November 29, 2000 1. Asymptotic Results uder Classical Assumptios The followig results apply to the liear regressio model y = Xβ + ε, where X is

More information

First Year Quantitative Comp Exam Spring, Part I - 203A. f X (x) = 0 otherwise

First Year Quantitative Comp Exam Spring, Part I - 203A. f X (x) = 0 otherwise First Year Quatitative Comp Exam Sprig, 2012 Istructio: There are three parts. Aswer every questio i every part. Questio I-1 Part I - 203A A radom variable X is distributed with the margial desity: >

More information

ECON 3150/4150, Spring term Lecture 3

ECON 3150/4150, Spring term Lecture 3 Itroductio Fidig the best fit by regressio Residuals ad R-sq Regressio ad causality Summary ad ext step ECON 3150/4150, Sprig term 2014. Lecture 3 Ragar Nymoe Uiversity of Oslo 21 Jauary 2014 1 / 30 Itroductio

More information

t distribution [34] : used to test a mean against an hypothesized value (H 0 : µ = µ 0 ) or the difference

t distribution [34] : used to test a mean against an hypothesized value (H 0 : µ = µ 0 ) or the difference EXST30 Backgroud material Page From the textbook The Statistical Sleuth Mea [0]: I your text the word mea deotes a populatio mea (µ) while the work average deotes a sample average ( ). Variace [0]: The

More information

Midterm 2 ECO3151. Winter 2012

Midterm 2 ECO3151. Winter 2012 Name: Studet Number: Midterm 2 ECO3151 Witer 2012 Istructios: 1. Prit your ame ad studet umber at the top of this midterm 2. No programmable calculators 3. You ca aswer i pecil or pe 4. This midterm cosists

More information

Chapter 13: Tests of Hypothesis Section 13.1 Introduction

Chapter 13: Tests of Hypothesis Section 13.1 Introduction Chapter 13: Tests of Hypothesis Sectio 13.1 Itroductio RECAP: Chapter 1 discussed the Likelihood Ratio Method as a geeral approach to fid good test procedures. Testig for the Normal Mea Example, discussed

More information

ECONOMETRIC THEORY. MODULE XIII Lecture - 34 Asymptotic Theory and Stochastic Regressors

ECONOMETRIC THEORY. MODULE XIII Lecture - 34 Asymptotic Theory and Stochastic Regressors ECONOMETRIC THEORY MODULE XIII Lecture - 34 Asymptotic Theory ad Stochastic Regressors Dr. Shalabh Departmet of Mathematics ad Statistics Idia Istitute of Techology Kapur Asymptotic theory The asymptotic

More information

Because it tests for differences between multiple pairs of means in one test, it is called an omnibus test.

Because it tests for differences between multiple pairs of means in one test, it is called an omnibus test. Math 308 Sprig 018 Classes 19 ad 0: Aalysis of Variace (ANOVA) Page 1 of 6 Itroductio ANOVA is a statistical procedure for determiig whether three or more sample meas were draw from populatios with equal

More information

Problem Set 4 Due Oct, 12

Problem Set 4 Due Oct, 12 EE226: Radom Processes i Systems Lecturer: Jea C. Walrad Problem Set 4 Due Oct, 12 Fall 06 GSI: Assae Gueye This problem set essetially reviews detectio theory ad hypothesis testig ad some basic otios

More information

ARIMA Models. Dan Saunders. y t = φy t 1 + ɛ t

ARIMA Models. Dan Saunders. y t = φy t 1 + ɛ t ARIMA Models Da Sauders I will discuss models with a depedet variable y t, a potetially edogeous error term ɛ t, ad a exogeous error term η t, each with a subscript t deotig time. With just these three

More information

Machine Learning Brett Bernstein

Machine Learning Brett Bernstein Machie Learig Brett Berstei Week 2 Lecture: Cocept Check Exercises Starred problems are optioal. Excess Risk Decompositio 1. Let X = Y = {1, 2,..., 10}, A = {1,..., 10, 11} ad suppose the data distributio

More information

Econ 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara

Econ 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara Poit Estimator Eco 325 Notes o Poit Estimator ad Cofidece Iterval 1 By Hiro Kasahara Parameter, Estimator, ad Estimate The ormal probability desity fuctio is fully characterized by two costats: populatio

More information

(all terms are scalars).the minimization is clearer in sum notation:

(all terms are scalars).the minimization is clearer in sum notation: 7 Multiple liear regressio: with predictors) Depedet data set: y i i = 1, oe predictad, predictors x i,k i = 1,, k = 1, ' The forecast equatio is ŷ i = b + Use matrix otatio: k =1 b k x ik Y = y 1 y 1

More information

MA Advanced Econometrics: Properties of Least Squares Estimators

MA Advanced Econometrics: Properties of Least Squares Estimators MA Advaced Ecoometrics: Properties of Least Squares Estimators Karl Whela School of Ecoomics, UCD February 5, 20 Karl Whela UCD Least Squares Estimators February 5, 20 / 5 Part I Least Squares: Some Fiite-Sample

More information

Geometry of LS. LECTURE 3 GEOMETRY OF LS, PROPERTIES OF σ 2, PARTITIONED REGRESSION, GOODNESS OF FIT

Geometry of LS. LECTURE 3 GEOMETRY OF LS, PROPERTIES OF σ 2, PARTITIONED REGRESSION, GOODNESS OF FIT OCTOBER 7, 2016 LECTURE 3 GEOMETRY OF LS, PROPERTIES OF σ 2, PARTITIONED REGRESSION, GOODNESS OF FIT Geometry of LS We ca thik of y ad the colums of X as members of the -dimesioal Euclidea space R Oe ca

More information

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4 MATH 30: Probability ad Statistics 9. Estimatio ad Testig of Parameters Estimatio ad Testig of Parameters We have bee dealig situatios i which we have full kowledge of the distributio of a radom variable.

More information

Algebra of Least Squares

Algebra of Least Squares October 19, 2018 Algebra of Least Squares Geometry of Least Squares Recall that out data is like a table [Y X] where Y collects observatios o the depedet variable Y ad X collects observatios o the k-dimesioal

More information

POLS, GLS, FGLS, GMM. Outline of Linear Systems of Equations. Common Coefficients, Panel Data Model. Preliminaries

POLS, GLS, FGLS, GMM. Outline of Linear Systems of Equations. Common Coefficients, Panel Data Model. Preliminaries Outlie of Liear Systems of Equatios POLS, GLS, FGLS, GMM Commo Coefficiets, Pael Data Model Prelimiaries he liear pael data model is a static model because all explaatory variables are dated cotemporaeously

More information

The variance of a sum of independent variables is the sum of their variances, since covariances are zero. Therefore. V (xi )= n n 2 σ2 = σ2.

The variance of a sum of independent variables is the sum of their variances, since covariances are zero. Therefore. V (xi )= n n 2 σ2 = σ2. SAMPLE STATISTICS A radom sample x 1,x,,x from a distributio f(x) is a set of idepedetly ad idetically variables with x i f(x) for all i Their joit pdf is f(x 1,x,,x )=f(x 1 )f(x ) f(x )= f(x i ) The sample

More information

Goodness-of-Fit Tests and Categorical Data Analysis (Devore Chapter Fourteen)

Goodness-of-Fit Tests and Categorical Data Analysis (Devore Chapter Fourteen) Goodess-of-Fit Tests ad Categorical Data Aalysis (Devore Chapter Fourtee) MATH-252-01: Probability ad Statistics II Sprig 2019 Cotets 1 Chi-Squared Tests with Kow Probabilities 1 1.1 Chi-Squared Testig................

More information

Investigating the Significance of a Correlation Coefficient using Jackknife Estimates

Investigating the Significance of a Correlation Coefficient using Jackknife Estimates Iteratioal Joural of Scieces: Basic ad Applied Research (IJSBAR) ISSN 2307-4531 (Prit & Olie) http://gssrr.org/idex.php?joural=jouralofbasicadapplied ---------------------------------------------------------------------------------------------------------------------------

More information

Lecture 3. Properties of Summary Statistics: Sampling Distribution

Lecture 3. Properties of Summary Statistics: Sampling Distribution Lecture 3 Properties of Summary Statistics: Samplig Distributio Mai Theme How ca we use math to justify that our umerical summaries from the sample are good summaries of the populatio? Lecture Summary

More information

11 THE GMM ESTIMATION

11 THE GMM ESTIMATION Cotets THE GMM ESTIMATION 2. Cosistecy ad Asymptotic Normality..................... 3.2 Regularity Coditios ad Idetificatio..................... 4.3 The GMM Iterpretatio of the OLS Estimatio.................

More information

x iu i E(x u) 0. In order to obtain a consistent estimator of β, we find the instrumental variable z which satisfies E(z u) = 0. z iu i E(z u) = 0.

x iu i E(x u) 0. In order to obtain a consistent estimator of β, we find the instrumental variable z which satisfies E(z u) = 0. z iu i E(z u) = 0. 27 However, β MM is icosistet whe E(x u) 0, i.e., β MM = (X X) X y = β + (X X) X u = β + ( X X ) ( X u ) \ β. Note as follows: X u = x iu i E(x u) 0. I order to obtai a cosistet estimator of β, we fid

More information

Simple Linear Regression

Simple Linear Regression Chapter 2 Simple Liear Regressio 2.1 Simple liear model The simple liear regressio model shows how oe kow depedet variable is determied by a sigle explaatory variable (regressor). Is is writte as: Y i

More information

Slide Set 13 Linear Model with Endogenous Regressors and the GMM estimator

Slide Set 13 Linear Model with Endogenous Regressors and the GMM estimator Slide Set 13 Liear Model with Edogeous Regressors ad the GMM estimator Pietro Coretto pcoretto@uisa.it Ecoometrics Master i Ecoomics ad Fiace (MEF) Uiversità degli Studi di Napoli Federico II Versio: Friday

More information

Discrete Mathematics for CS Spring 2008 David Wagner Note 22

Discrete Mathematics for CS Spring 2008 David Wagner Note 22 CS 70 Discrete Mathematics for CS Sprig 2008 David Wager Note 22 I.I.D. Radom Variables Estimatig the bias of a coi Questio: We wat to estimate the proportio p of Democrats i the US populatio, by takig

More information

CEU Department of Economics Econometrics 1, Problem Set 1 - Solutions

CEU Department of Economics Econometrics 1, Problem Set 1 - Solutions CEU Departmet of Ecoomics Ecoometrics, Problem Set - Solutios Part A. Exogeeity - edogeeity The liear coditioal expectatio (CE) model has the followig form: We would like to estimate the effect of some

More information

Lecture 6 Chi Square Distribution (χ 2 ) and Least Squares Fitting

Lecture 6 Chi Square Distribution (χ 2 ) and Least Squares Fitting Lecture 6 Chi Square Distributio (χ ) ad Least Squares Fittig Chi Square Distributio (χ ) Suppose: We have a set of measuremets {x 1, x, x }. We kow the true value of each x i (x t1, x t, x t ). We would

More information

The standard deviation of the mean

The standard deviation of the mean Physics 6C Fall 20 The stadard deviatio of the mea These otes provide some clarificatio o the distictio betwee the stadard deviatio ad the stadard deviatio of the mea.. The sample mea ad variace Cosider

More information

Linear Regression Models

Linear Regression Models Liear Regressio Models Dr. Joh Mellor-Crummey Departmet of Computer Sciece Rice Uiversity johmc@cs.rice.edu COMP 528 Lecture 9 15 February 2005 Goals for Today Uderstad how to Use scatter diagrams to ispect

More information

Overview. p 2. Chapter 9. Pooled Estimate of. q = 1 p. Notation for Two Proportions. Inferences about Two Proportions. Assumptions

Overview. p 2. Chapter 9. Pooled Estimate of. q = 1 p. Notation for Two Proportions. Inferences about Two Proportions. Assumptions Chapter 9 Slide Ifereces from Two Samples 9- Overview 9- Ifereces about Two Proportios 9- Ifereces about Two Meas: Idepedet Samples 9-4 Ifereces about Matched Pairs 9-5 Comparig Variatio i Two Samples

More information

Last Lecture. Wald Test

Last Lecture. Wald Test Last Lecture Biostatistics 602 - Statistical Iferece Lecture 22 Hyu Mi Kag April 9th, 2013 Is the exact distributio of LRT statistic typically easy to obtai? How about its asymptotic distributio? For testig

More information

Lecture 2: Monte Carlo Simulation

Lecture 2: Monte Carlo Simulation STAT/Q SCI 43: Itroductio to Resamplig ethods Sprig 27 Istructor: Ye-Chi Che Lecture 2: ote Carlo Simulatio 2 ote Carlo Itegratio Assume we wat to evaluate the followig itegratio: e x3 dx What ca we do?

More information

ECE 8527: Introduction to Machine Learning and Pattern Recognition Midterm # 1. Vaishali Amin Fall, 2015

ECE 8527: Introduction to Machine Learning and Pattern Recognition Midterm # 1. Vaishali Amin Fall, 2015 ECE 8527: Itroductio to Machie Learig ad Patter Recogitio Midterm # 1 Vaishali Ami Fall, 2015 tue39624@temple.edu Problem No. 1: Cosider a two-class discrete distributio problem: ω 1 :{[0,0], [2,0], [2,2],

More information

STATISTICAL PROPERTIES OF LEAST SQUARES ESTIMATORS. Comments:

STATISTICAL PROPERTIES OF LEAST SQUARES ESTIMATORS. Comments: Recall: STATISTICAL PROPERTIES OF LEAST SQUARES ESTIMATORS Commets:. So far we have estimates of the parameters! 0 ad!, but have o idea how good these estimates are. Assumptio: E(Y x)! 0 +! x (liear coditioal

More information

MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND.

MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND. XI-1 (1074) MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND. R. E. D. WOOLSEY AND H. S. SWANSON XI-2 (1075) STATISTICAL DECISION MAKING Advaced

More information

This is an introductory course in Analysis of Variance and Design of Experiments.

This is an introductory course in Analysis of Variance and Design of Experiments. 1 Notes for M 384E, Wedesday, Jauary 21, 2009 (Please ote: I will ot pass out hard-copy class otes i future classes. If there are writte class otes, they will be posted o the web by the ight before class

More information

Lecture 6 Chi Square Distribution (χ 2 ) and Least Squares Fitting

Lecture 6 Chi Square Distribution (χ 2 ) and Least Squares Fitting Lecture 6 Chi Square Distributio (χ ) ad Least Squares Fittig Chi Square Distributio (χ ) Suppose: We have a set of measuremets {x 1, x, x }. We kow the true value of each x i (x t1, x t, x t ). We would

More information

Simple Linear Regression

Simple Linear Regression Simple Liear Regressio 1. Model ad Parameter Estimatio (a) Suppose our data cosist of a collectio of pairs (x i, y i ), where x i is a observed value of variable X ad y i is the correspodig observatio

More information

Statistics 203 Introduction to Regression and Analysis of Variance Assignment #1 Solutions January 20, 2005

Statistics 203 Introduction to Regression and Analysis of Variance Assignment #1 Solutions January 20, 2005 Statistics 203 Itroductio to Regressio ad Aalysis of Variace Assigmet #1 Solutios Jauary 20, 2005 Q. 1) (MP 2.7) (a) Let x deote the hydrocarbo percetage, ad let y deote the oxyge purity. The simple liear

More information

Lecture 19: Convergence

Lecture 19: Convergence Lecture 19: Covergece Asymptotic approach I statistical aalysis or iferece, a key to the success of fidig a good procedure is beig able to fid some momets ad/or distributios of various statistics. I may

More information

Random Variables, Sampling and Estimation

Random Variables, Sampling and Estimation Chapter 1 Radom Variables, Samplig ad Estimatio 1.1 Itroductio This chapter will cover the most importat basic statistical theory you eed i order to uderstad the ecoometric material that will be comig

More information

Frequentist Inference

Frequentist Inference Frequetist Iferece The topics of the ext three sectios are useful applicatios of the Cetral Limit Theorem. Without kowig aythig about the uderlyig distributio of a sequece of radom variables {X i }, for

More information

STA Learning Objectives. Population Proportions. Module 10 Comparing Two Proportions. Upon completing this module, you should be able to:

STA Learning Objectives. Population Proportions. Module 10 Comparing Two Proportions. Upon completing this module, you should be able to: STA 2023 Module 10 Comparig Two Proportios Learig Objectives Upo completig this module, you should be able to: 1. Perform large-sample ifereces (hypothesis test ad cofidece itervals) to compare two populatio

More information

A statistical method to determine sample size to estimate characteristic value of soil parameters

A statistical method to determine sample size to estimate characteristic value of soil parameters A statistical method to determie sample size to estimate characteristic value of soil parameters Y. Hojo, B. Setiawa 2 ad M. Suzuki 3 Abstract Sample size is a importat factor to be cosidered i determiig

More information

DS 100: Principles and Techniques of Data Science Date: April 13, Discussion #10

DS 100: Principles and Techniques of Data Science Date: April 13, Discussion #10 DS 00: Priciples ad Techiques of Data Sciece Date: April 3, 208 Name: Hypothesis Testig Discussio #0. Defie these terms below as they relate to hypothesis testig. a) Data Geeratio Model: Solutio: A set

More information

First, note that the LS residuals are orthogonal to the regressors. X Xb X y = 0 ( normal equations ; (k 1) ) So,

First, note that the LS residuals are orthogonal to the regressors. X Xb X y = 0 ( normal equations ; (k 1) ) So, 0 2. OLS Part II The OLS residuals are orthogoal to the regressors. If the model icludes a itercept, the orthogoality of the residuals ad regressors gives rise to three results, which have limited practical

More information

Math 152. Rumbos Fall Solutions to Review Problems for Exam #2. Number of Heads Frequency

Math 152. Rumbos Fall Solutions to Review Problems for Exam #2. Number of Heads Frequency Math 152. Rumbos Fall 2009 1 Solutios to Review Problems for Exam #2 1. I the book Experimetatio ad Measuremet, by W. J. Youde ad published by the by the Natioal Sciece Teachers Associatio i 1962, the

More information

Asymptotic distribution of the first-stage F-statistic under weak IVs

Asymptotic distribution of the first-stage F-statistic under weak IVs November 6 Eco 59A WEAK INSTRUMENTS III Testig for Weak Istrumets From the results discussed i Weak Istrumets II we kow that at least i the case of a sigle edogeous regressor there are weak-idetificatio-robust

More information

Chapter 22. Comparing Two Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc.

Chapter 22. Comparing Two Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc. Chapter 22 Comparig Two Proportios Copyright 2010, 2007, 2004 Pearso Educatio, Ic. Comparig Two Proportios Read the first two paragraphs of pg 504. Comparisos betwee two percetages are much more commo

More information

1 General linear Model Continued..

1 General linear Model Continued.. Geeral liear Model Cotiued.. We have We kow y = X + u X o radom u v N(0; I ) b = (X 0 X) X 0 y E( b ) = V ar( b ) = (X 0 X) We saw that b = (X 0 X) X 0 u so b is a liear fuctio of a ormally distributed

More information

Statistical inference: example 1. Inferential Statistics

Statistical inference: example 1. Inferential Statistics Statistical iferece: example 1 Iferetial Statistics POPULATION SAMPLE A clothig store chai regularly buys from a supplier large quatities of a certai piece of clothig. Each item ca be classified either

More information

If, for instance, we were required to test whether the population mean μ could be equal to a certain value μ

If, for instance, we were required to test whether the population mean μ could be equal to a certain value μ STATISTICAL INFERENCE INTRODUCTION Statistical iferece is that brach of Statistics i which oe typically makes a statemet about a populatio based upo the results of a sample. I oesample testig, we essetially

More information

7.1 Convergence of sequences of random variables

7.1 Convergence of sequences of random variables Chapter 7 Limit theorems Throughout this sectio we will assume a probability space (Ω, F, P), i which is defied a ifiite sequece of radom variables (X ) ad a radom variable X. The fact that for every ifiite

More information

Power and Type II Error

Power and Type II Error Statistical Methods I (EXST 7005) Page 57 Power ad Type II Error Sice we do't actually kow the value of the true mea (or we would't be hypothesizig somethig else), we caot kow i practice the type II error

More information

Since X n /n P p, we know that X n (n. Xn (n X n ) Using the asymptotic result above to obtain an approximation for fixed n, we obtain

Since X n /n P p, we know that X n (n. Xn (n X n ) Using the asymptotic result above to obtain an approximation for fixed n, we obtain Assigmet 9 Exercise 5.5 Let X biomial, p, where p 0, 1 is ukow. Obtai cofidece itervals for p i two differet ways: a Sice X / p d N0, p1 p], the variace of the limitig distributio depeds oly o p. Use the

More information

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY GRADUATE DIPLOMA, 016 MODULE : Statistical Iferece Time allowed: Three hours Cadidates should aswer FIVE questios. All questios carry equal marks. The umber

More information

Mathematical Notation Math Introduction to Applied Statistics

Mathematical Notation Math Introduction to Applied Statistics Mathematical Notatio Math 113 - Itroductio to Applied Statistics Name : Use Word or WordPerfect to recreate the followig documets. Each article is worth 10 poits ad ca be prited ad give to the istructor

More information

Introductory statistics

Introductory statistics CM9S: Machie Learig for Bioiformatics Lecture - 03/3/06 Itroductory statistics Lecturer: Sriram Sakararama Scribe: Sriram Sakararama We will provide a overview of statistical iferece focussig o the key

More information

Estimation for Complete Data

Estimation for Complete Data Estimatio for Complete Data complete data: there is o loss of iformatio durig study. complete idividual complete data= grouped data A complete idividual data is the oe i which the complete iformatio of

More information

Dr. Maddah ENMG 617 EM Statistics 11/26/12. Multiple Regression (2) (Chapter 15, Hines)

Dr. Maddah ENMG 617 EM Statistics 11/26/12. Multiple Regression (2) (Chapter 15, Hines) Dr Maddah NMG 617 M Statistics 11/6/1 Multiple egressio () (Chapter 15, Hies) Test for sigificace of regressio This is a test to determie whether there is a liear relatioship betwee the depedet variable

More information

Sample Size Estimation in the Proportional Hazards Model for K-sample or Regression Settings Scott S. Emerson, M.D., Ph.D.

Sample Size Estimation in the Proportional Hazards Model for K-sample or Regression Settings Scott S. Emerson, M.D., Ph.D. ample ie Estimatio i the Proportioal Haards Model for K-sample or Regressio ettigs cott. Emerso, M.D., Ph.D. ample ie Formula for a Normally Distributed tatistic uppose a statistic is kow to be ormally

More information

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING Lectures MODULE 5 STATISTICS II. Mea ad stadard error of sample data. Biomial distributio. Normal distributio 4. Samplig 5. Cofidece itervals

More information

Understanding Samples

Understanding Samples 1 Will Moroe CS 109 Samplig ad Bootstrappig Lecture Notes #17 August 2, 2017 Based o a hadout by Chris Piech I this chapter we are goig to talk about statistics calculated o samples from a populatio. We

More information

4. Partial Sums and the Central Limit Theorem

4. Partial Sums and the Central Limit Theorem 1 of 10 7/16/2009 6:05 AM Virtual Laboratories > 6. Radom Samples > 1 2 3 4 5 6 7 4. Partial Sums ad the Cetral Limit Theorem The cetral limit theorem ad the law of large umbers are the two fudametal theorems

More information

( θ. sup θ Θ f X (x θ) = L. sup Pr (Λ (X) < c) = α. x : Λ (x) = sup θ H 0. sup θ Θ f X (x θ) = ) < c. NH : θ 1 = θ 2 against AH : θ 1 θ 2

( θ. sup θ Θ f X (x θ) = L. sup Pr (Λ (X) < c) = α. x : Λ (x) = sup θ H 0. sup θ Θ f X (x θ) = ) < c. NH : θ 1 = θ 2 against AH : θ 1 θ 2 82 CHAPTER 4. MAXIMUM IKEIHOOD ESTIMATION Defiitio: et X be a radom sample with joit p.m/d.f. f X x θ. The geeralised likelihood ratio test g.l.r.t. of the NH : θ H 0 agaist the alterative AH : θ H 1,

More information

10. Comparative Tests among Spatial Regression Models. Here we revisit the example in Section 8.1 of estimating the mean of a normal random

10. Comparative Tests among Spatial Regression Models. Here we revisit the example in Section 8.1 of estimating the mean of a normal random Part III. Areal Data Aalysis 0. Comparative Tests amog Spatial Regressio Models While the otio of relative likelihood values for differet models is somewhat difficult to iterpret directly (as metioed above),

More information

Chapter 11: Asking and Answering Questions About the Difference of Two Proportions

Chapter 11: Asking and Answering Questions About the Difference of Two Proportions Chapter 11: Askig ad Aswerig Questios About the Differece of Two Proportios These otes reflect material from our text, Statistics, Learig from Data, First Editio, by Roxy Peck, published by CENGAGE Learig,

More information

Stat 421-SP2012 Interval Estimation Section

Stat 421-SP2012 Interval Estimation Section Stat 41-SP01 Iterval Estimatio Sectio 11.1-11. We ow uderstad (Chapter 10) how to fid poit estimators of a ukow parameter. o However, a poit estimate does ot provide ay iformatio about the ucertaity (possible

More information

Expectation and Variance of a random variable

Expectation and Variance of a random variable Chapter 11 Expectatio ad Variace of a radom variable The aim of this lecture is to defie ad itroduce mathematical Expectatio ad variace of a fuctio of discrete & cotiuous radom variables ad the distributio

More information

Lecture 12: September 27

Lecture 12: September 27 36-705: Itermediate Statistics Fall 207 Lecturer: Siva Balakrisha Lecture 2: September 27 Today we will discuss sufficiecy i more detail ad the begi to discuss some geeral strategies for costructig estimators.

More information

Chapter 22. Comparing Two Proportions. Copyright 2010 Pearson Education, Inc.

Chapter 22. Comparing Two Proportions. Copyright 2010 Pearson Education, Inc. Chapter 22 Comparig Two Proportios Copyright 2010 Pearso Educatio, Ic. Comparig Two Proportios Comparisos betwee two percetages are much more commo tha questios about isolated percetages. Ad they are more

More information

EECS564 Estimation, Filtering, and Detection Hwk 2 Solns. Winter p θ (z) = (2θz + 1 θ), 0 z 1

EECS564 Estimation, Filtering, and Detection Hwk 2 Solns. Winter p θ (z) = (2θz + 1 θ), 0 z 1 EECS564 Estimatio, Filterig, ad Detectio Hwk 2 Sols. Witer 25 4. Let Z be a sigle observatio havig desity fuctio where. p (z) = (2z + ), z (a) Assumig that is a oradom parameter, fid ad plot the maximum

More information

Statisticians use the word population to refer the total number of (potential) observations under consideration

Statisticians use the word population to refer the total number of (potential) observations under consideration 6 Samplig Distributios Statisticias use the word populatio to refer the total umber of (potetial) observatios uder cosideratio The populatio is just the set of all possible outcomes i our sample space

More information

ENGI 4421 Confidence Intervals (Two Samples) Page 12-01

ENGI 4421 Confidence Intervals (Two Samples) Page 12-01 ENGI 44 Cofidece Itervals (Two Samples) Page -0 Two Sample Cofidece Iterval for a Differece i Populatio Meas [Navidi sectios 5.4-5.7; Devore chapter 9] From the cetral limit theorem, we kow that, for sufficietly

More information

Section 9.2. Tests About a Population Proportion 12/17/2014. Carrying Out a Significance Test H A N T. Parameters & Hypothesis

Section 9.2. Tests About a Population Proportion 12/17/2014. Carrying Out a Significance Test H A N T. Parameters & Hypothesis Sectio 9.2 Tests About a Populatio Proportio P H A N T O M S Parameters Hypothesis Assess Coditios Name the Test Test Statistic (Calculate) Obtai P value Make a decisio State coclusio Sectio 9.2 Tests

More information

Econ 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chi-square Distribution, Student s t distribution 1.

Econ 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chi-square Distribution, Student s t distribution 1. Eco 325/327 Notes o Sample Mea, Sample Proportio, Cetral Limit Theorem, Chi-square Distributio, Studet s t distributio 1 Sample Mea By Hiro Kasahara We cosider a radom sample from a populatio. Defiitio

More information

Output Analysis (2, Chapters 10 &11 Law)

Output Analysis (2, Chapters 10 &11 Law) B. Maddah ENMG 6 Simulatio Output Aalysis (, Chapters 10 &11 Law) Comparig alterative system cofiguratio Sice the output of a simulatio is radom, the comparig differet systems via simulatio should be doe

More information

Comparing Two Populations. Topic 15 - Two Sample Inference I. Comparing Two Means. Comparing Two Pop Means. Background Reading

Comparing Two Populations. Topic 15 - Two Sample Inference I. Comparing Two Means. Comparing Two Pop Means. Background Reading Topic 15 - Two Sample Iferece I STAT 511 Professor Bruce Craig Comparig Two Populatios Research ofte ivolves the compariso of two or more samples from differet populatios Graphical summaries provide visual

More information

ECO 312 Fall 2013 Chris Sims LIKELIHOOD, POSTERIORS, DIAGNOSING NON-NORMALITY

ECO 312 Fall 2013 Chris Sims LIKELIHOOD, POSTERIORS, DIAGNOSING NON-NORMALITY ECO 312 Fall 2013 Chris Sims LIKELIHOOD, POSTERIORS, DIAGNOSING NON-NORMALITY (1) A distributio that allows asymmetry differet probabilities for egative ad positive outliers is the asymmetric double expoetial,

More information

PSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 9

PSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 9 Hypothesis testig PSYCHOLOGICAL RESEARCH (PYC 34-C Lecture 9 Statistical iferece is that brach of Statistics i which oe typically makes a statemet about a populatio based upo the results of a sample. I

More information

CEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering

CEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering CEE 5 Autum 005 Ucertaity Cocepts for Geotechical Egieerig Basic Termiology Set A set is a collectio of (mutually exclusive) objects or evets. The sample space is the (collectively exhaustive) collectio

More information

Final Examination Solutions 17/6/2010

Final Examination Solutions 17/6/2010 The Islamic Uiversity of Gaza Faculty of Commerce epartmet of Ecoomics ad Political Scieces A Itroductio to Statistics Course (ECOE 30) Sprig Semester 009-00 Fial Eamiatio Solutios 7/6/00 Name: I: Istructor:

More information

Big Picture. 5. Data, Estimates, and Models: quantifying the accuracy of estimates.

Big Picture. 5. Data, Estimates, and Models: quantifying the accuracy of estimates. 5. Data, Estimates, ad Models: quatifyig the accuracy of estimates. 5. Estimatig a Normal Mea 5.2 The Distributio of the Normal Sample Mea 5.3 Normal data, cofidece iterval for, kow 5.4 Normal data, cofidece

More information

1 of 7 7/16/2009 6:06 AM Virtual Laboratories > 6. Radom Samples > 1 2 3 4 5 6 7 6. Order Statistics Defiitios Suppose agai that we have a basic radom experimet, ad that X is a real-valued radom variable

More information