Lecture 15 Panel Data Models

Size: px
Start display at page:

Download "Lecture 15 Panel Data Models"

Transcription

1 Lecture 5 Panel Data Models Panel Data Sets A panel data set, or longtudnal data set, s one where there are repeated observatons on the same unts. The unts ma be ndvduals, households, enterprses, countres, or an set of enttes that reman stable through tme. Repeated observatons create a potentall ver large panel data sets. Wth N unts and T tme perods => Number of observatons: NT. Repeated observatons can be great for estmaton: Large sample! We use the repeated observatons to our advantage. But, repeated observatons from the same ndvdual ntroduce a problem: Dependence, snce the observatons are, ver lkel, not ndependent. Modelng the potental dependence creates dfferent models.

2 The Natonal Longtudnal Surve (NLS) of Youth s an example. The same respondents were ntervewed ever ear from 979 to 994. Snce 994 the have been ntervewed ever two ears. Panel data allows us a researcher to stud cross secton effects.e., along N, varaton across the frms- & tme seres effects.e., along T, varaton across tme. Panel Data Sets NT T T T Nt t t t N N Tme seres Cross secton A standard panel data set model stacks the s and the x s: = X + c + X s a Σ T xk matrx s a kx matrx c s Σ T x matrx, assocated wth unobservable varables. and are Σ T x matrces Panel Data Sets j j j kt T T k t t k k kt T T k t t k k T t T t w w w w w w x w w w w w X x x x x x x x x x x x x X ;...; ;...; Notaton:

3 Panel Data Sets Fnancal data COMPUSTAT provdes fnancal data b frm (N =99,000) and b quarter (T = 96:I, 96:II,..., ) CRSP dal and monthl stock and ndex returns from 96 on. Datastream provdes economc and fnancal data for countres. It also covers bonds and stock markets around the world. Intra-dal data: Olsen (exchange rates) and TAQ (stock market transacton prces). Essentall nfnte T, large N. OptonMetrcs, also known as Iv DB, s a database of hstorcal prces, mpled volatlt for lsted stocks and opton markets. Internatonal Fnancal Statstcs (IFS from the IMF) covers economc and fnancal data for almost all countres post-wwii. Balanced and Unbalanced Panels Notaton:,t, =,,N; t =,,T Mathematcal and notatonal convenence: - Balanced: NT (that s, ever unt s surveed n ever tme perod.) N - Unbalanced: T = Q: Is the fxed T assumpton ever necessar? SUR models. The NLS of Youth s unbalanced because some ndvduals have not been ntervewed n some ears. Some could not be located, some refused, and a few have ded. CRSP s also unbalanced, some frms are lsted from 96, others started to be lsted later. 3

4 Panel Data Models Wth panel data we can stud dfferent ssues: - Cross sectonal varaton (unobservable n tme seres data) vs. tme seres varaton (unobservable n cross sectonal data) - Heterogenet (observable and unobservable ndvdual heterogenet) - Herarchcal structures (sa, zp code, ct and state effects) - Dnamcs n economc behavor - Indvdual/Group effects (ndvdual effects) - Tme effects Panel Data Models: Example - SUR In Zellner s SUR formulaton (no lnear dependence on t ) we have: (A) t = x t + t (A) E[ X] = 0, (A3 ) Var[ X] = I T --groupwse heteroscedastct. E[ t jt X] = j --contemporaneous correlaton E[ t jt X] = 0 when t t (A4) Rank(X) = full rank In (A)- (A4), we have the a GR model wth heteroscedastct. OLS n each equaton s OK, but not effcent. GLS s effcent. We are not takng advantage of poolng.e., usng NT observatons! Use LR or F tests to check f poolng (aggregaton) can be done. 4

5 Panel Data Models: Example - Poolng Assumptons (A) t = x t + z γ + t - the DGP =,,..., N - we thnk of as ndvduals or groups. t =,,..., T - usuall, N >> T. (A) E[ X,z] = 0, - X and z: exogenous (A3) Var[ X,z] = I. - Heteroscedastct can be allowed. (A4) Rank(X) = full rank We thnk of X as a vector of observed characterstcs. For example, frm sze, Market-to-book, Z-score, R&D expendtures, etc. We thnk of z as a vector of unobserved characterstcs (ndvdual effects). For example, qualt of management, growth opportuntes, etc. Panel Data Models: Basc Model The DGP (A): t k X j jt j p -The ndex refers to ndvduals.e., the unt of observaton, t refers to the tme perod, and j and p are used to dfferentate between dfferent observed and unobserved explanator varables. - A tme trend t allows for a shft of the ntercept over tme, capturng tme effects technologcal change, regulatons, taxes, etc. - If the mplct assumpton of a constant rate of change s strong, the trend can be replaced b a set of dumm varables, one for each tme perod except the reference perod. -TheX j varables are usuall the varables of nterest, whle the Z p varables are responsble for unobserved heterogenet (& dependence on the s) and as such consttute a nusance component of the model. 5 s p Z p t t 5

6 Panel Data Models: Basc Model Snce the Z p varables are unobserved, there s no means of obtanng nformaton about the p Z p component of the model. Usuall, t s convenent to defne a term c known as the unobserved effect, representng the jont mpact of the Z p varables on. s c Z p p p We can rewrte the regresson model as: t k j j X jt c t t The characterzaton of the c component s crucal. Dfferent was of thnkng about z p create dfferent panel data models. 30 Panel Data Models: Basc Model t k j j X jt c t Note that f the X j s are so comprehensve that the capture all the relevant characterstcs of the ndvdual, there wll be no relevant unobserved characterstcs. => Now, c can be dropped. Pooled OLS ma be used, treatng all the observatons for all of the tme perods as a sngle sample. t In general, droppng c leads to mssng varables problem: bas! We usuall thnk of c as contemporaneousl exogenous to the condtonal error. That s, E[ t c ] = 0, t =,..., T 30 6

7 Panel Data Models: Basc Model A stronger assumpton can solve the estmaton problem. Strct exogenet can also be mposed. In ths case, Then, E[ t x, x,...,x T,c ] = 0, E[ x t t, c ] t =,..., T c t => The β j s are partal effects holdng c constant. k j Volatons of strct exogenet are not rare. If x t contans lagged dependent varables or f changes n t affect x t+ (a feedback effect). j X jt But to estmate β we stll need to sa somethng about the relaton between x t and c. Dfferent assumptons wll gve rse to dfferent models. 30 Panel Data Models: Tpes The basc DGP: t = x t + z γ + t & (A)-(A4) appl. Dependng on how we model the heterogenet n the panel, we have dfferent models. Four Popular Models: () Pooled (Constant Effect) Model z γ s a constant. z = α. Dependence on the t ma enter through the varance. That s, repeated observatons on ndvdual are lnearl ndependent. In ths case, t = x t + α + t OLS estmates α and consstentl. We estmate k+ parameters. 7

8 Panel Data Models: Tpes () Fxed Effects Model (FEM) The z s are correlated wth X Fxed Effects: E[z X ] = g(x ) = α* ; effects are correlated wth ncluded varables.e., pooled OLS wll be nconsstent. Assume z γ = α (constant; t does not var wth t). Then, t = x t + α + t the regresson lne s rased/lowered b a fxed amount for each ndvdual (the dependence created b the repeated observatons!). In econometrcs terms, ths s the source of the fxed-effects. We have a lot of parameters: k+n. We have N ndvdual effects! OLS can be used to estmates α and consstentl. Panel Data Models: Tpes (3) Random Effects Model (REM) The dfferences between ndvduals are random, drawn from a gven dstrbuton wth constant parameters. We assume the z s are uncorrelated wth the X. That s, E[z X ] = μ (f X contans a constant term, μ=0 WLOG). Add and subtract E[z γ] = μ* from (*): t = x t + E[z γ] +( z γ) - E[z γ] + t = x t + μ* + u + t We have a compound ( composed ) error.e., u + t = w t. Ths error w t ntroduces contemporaneous cross-correlatons across the group. OLS estmates μ and consstentl, but GLS wll be effcent. 8

9 Panel Data Models: Tpes (4) Random Parameters /Coeffcents Model We ntroduce heterogenet through. But, ths ma ntroduce addtonal N parameters. A soluton s to model. For example, t = x t ( + h ) + α + t h s a random vector that nduces parameter varaton, where h ~D(0, σ h). That s, we ntroduce heteroscedastct. Now, the coeffcents are dfferent for each ndvdual. It s possble to complcate the model b makng them dfferent through tme: t = ( + h ) + θ t where θ t ~D(0, σ t). Estmaton: GLS, MLE. Long hstor: Rao (965) and Chow (975) worked on these models. Compact Notaton Compact Notaton: = X + c + X s a T xk matrx s a kx matrx c s a T x matrx and are T x matrces Recall we stack the s and X s: X s a Σ T xk matrx s a kx matrx c, and are Σ T x matrces = X + c + Or = X* * +, wth X * = [X ι] - Σ T x(k+) matrx. * = [ c] - (k+)x matrx 9

10 Assumptons for Asmptotcs (Greene) Convergence of moments nvolvng cross secton X. Usuall, we assume N ncreasng, T or T assumed fxed. Fxed T asmptotcs (see Greene, p. 96) Tme seres characterstcs are not relevant (ma be nonstatonar) If T s also growng, need to treat as multvarate tme seres. Rank of matrces. X must have full column rank. (X ma not, f T < K.) Strct exogenet and dnamcs. If x t contans,t- then x t cannot be strctl exogenous. X t wll be correlated wth the unobservables n perod t-. Inconsstent OLS estmates! (To be revsted later.) Panel Data Models: (A3 ) - No Homoscedastct We can relax assumpton (A3). The new DGP model: = X* * +, wth X * = [X ι] - Σ T x(k+) matrx. * = [ c] - (k+)x matrx Now, we assume (A3 ) E[ ' X] = Σ σ I ΣT Potentall, a lot of dfferent elements n E[' X] n a panel: - Indvdual heteroscedastct. Usual groupwse heteroscedastct. - Autocorrelaton (Indvdual/group/frm) effects. Errors have arbtrar correlaton across tme for a partcular ndvdual : - Temporal correlaton (Tme) effects. Errors have arbtrar correlaton across ndvduals at a moment n tme (SUR-tpe correlaton). - Persstent common shocks: Errors have some correlaton between dfferent frms n dfferent tme perods (but, these shocks are assumed to de out over tme, and ma be gnored after L perods). 0

11 Panel Data Models: (A3 ) - Error Structures To understand the dfferent elements n Σ, consder the followng DGP for the errors, ε t s: ε t = θ f t + η t, f t ~D(0, σ f ) & η t = ϕ η t- + ς t, ς t ~D(0, σ ς ) f t : vector of random factors common to all ndvduals/groups/frms. θ : vector of factor loadngs, specfc to ndvdual. ς t : random shocks to ndvdual, uncorrelated across both and t. η t : random shocks to. Ths term generates autocorrelaton effects n. θ f t generates both contemporaneous (SUR) and tme-varng crosscorrelatons between and j. (Autorrelatons de out after L perods.) -If f t s uncorrelated across t, =>onl contemporaneous (SUR) effects. -If f t s persstent n t, =>both SUR and persstent common effects. Panel Data Models: (A3 ) - Error Structures Dfferent forms for E[ ' X]: - Indvdual heteroscedastct. E[ X] = σ => standard groupwse heteroscedastct drven b ς t. - Autocorrelaton (Indvdual) effects: E[ t s X] 0 (t s) => auto-/tme-correlaton for errors, t drven b η t. - Temporal correlaton effects: E[ t jt X] 0 ( j) => contemporar cross-correlaton for errors drven b f t. - Persstent common shocks: E[ t js X] 0 ( j) and t s < L => tme-varng cross-correlaton for errors drven b f t. Remark: Heteroscedastct ponts to GLS effcent estmaton, but, as before, for consstent nferences we can use OLS wth (adjusted for panels) Whte or NW SE s.

12 Panel Data Models: (A3 ) Clustered SE For consstent nferences, we can use OLS wth Whte or NW SE s: - Whte SE s adjust onl for heteroscedastct: S 0 = (/T) e x x. - NW SE s adjust for heteroscedastct and autocorrelaton: S T = S 0 + (/T) l w L (l) t=l+,...,t (x t-l e t-l e t x t + x t e t e t-l x t-l ) But, cross-sectonal (SUR) or spatal dependences are gnored. If present, the Whte s or NW s HAC need to be adjusted. Smple ntuton: Repeatng a dataset 0 tmes should not ncrease the precson of parameter estmates. However, the..d. assumpton wll do ths: Now, we dvde b NT, not T or N. We cannot gnore the dependence n the data. An obvous soluton s to aggregate the repeated data. That s, aggregate n groups! Panel Data Models: (A3 ) Clustered SE In general, the observatons are not dentcal, but correlated wthn a cluster.e., a group that share certan characterstc. We assume correlaton wthn a cluster, but ndependence across clusters. Smple dea: Aggregate over the clusters. The ke s how to cluster. Canoncal example: We want to stud the effect of class sze on st graders' grades, the unobservables of st graders belongng to the same classroom wll be correlated (sa, teachers qualt, recess routnes) whle wll not be correlated wth st graders n far awa classrooms. Then, we can cluster b school/teacher. In fnance, t s reasonable to expect that shocks to frms n the same ndustr are not ndependent. Then, we can cluster b ndustr.

13 Panel Data Models: (A3 ) Clustered SE We remove the dependence b assumng correlaton wthn a cluster, but ndependence across clusters. That s, we thnk of the data as g E[ x g jg '] ' g c 0 jg g t f g g ' f g g g ' g,..., G Or stackng the data b cluster: g = X g + c g + g g =,..., G. Eas extenson of the Whte SE: S G N N g g 0 g j x ' gx jg w g, jg where w g,jg = E[ g jg X g ] s the error covarance for the g th and jg th observatons. We use S 0 to calculate clustered Whte SE. Panel Data Models: (A3 ) - PCSE Drscoll and Kraa (998) provde an eas extenson to estmate robust NW SE s n panels wth cross-sectonal dependences: Average the x t e t over the clusters we suspect cause dependence. Wthn a cluster, we assume that the correlatons wthn a cluster are the same for dfferent observatons. The KxK sandwch matrx s defned as ^ X ' X T t t j h ( b) h t j ( b)' wth h ( b) N ( t ) The NW method s appled to the tme seres of cross-sectonal averages of h t (b). We average over the G clusters to get G h t (b). We use the sandwch matrx to estmate clustered NW SE, usng w(l) as usual Bartlett or QS weghts other weghts are OK. t h ( b) t 3

14 Panel Data Models: (A3 ) - PCSE The NW method s appled to the tme seres of cross-sectonal averages of h t (b). B usng cross-sectonal averages, estmated SE are consstent ndependentl of the panel s cross-sectonal dmenson N. Clearl, these clustered NW SE reduce to the usual NW SE f each cluster onl has one observaton. If we do not suspect autocorrelaton problems not rare, gven that man panel data sets have heav temporall spaced observatons-, we can rel on Whte SE (S 0 ). These clustered standard errors are called panel corrected SE (PCSE s) or clustered SE. The clustered Whte-stle SE are, sometmes, called Rogers SE. Panel Data Models: (A3 ) - PCSE These PCSE s are robust to ver general forms of cross-sectonal (and temporal) dependence. These PCSE s usng HAR estmators (sa, based on KVB SE) are also possble. (See Hansen (007). Consstenc of the PCSE s dscussed b Whte (984), Lang and Zegger (986) for panels wth fnte number of observatons per cluster, as G. Hansen (007) shows that PCSE can be used wth N g --.e., long panels--, n addton to G. 4

15 Pooled Model General DGP t = x t + c + t & (A)-(A4) appl. The pooled model assumes that unobservable characterstcs are constant, ndependent of --no heterogenet. That s, we have: t = x t + α + t Now, we have a CLM, wth k+ parameters. Stackng the varables n matrces, we have: = X + α ι + Dmensons: -, ι and are Σ T x -X s Σ T xk - s kx Pooled Model We can re-wrte the pooled equaton model as: = X* * +, X* = [X ι] - Σ T x(k+) matrx: *=[ α] - (k+)x matrx In ths context, OLS produces BLUE and consstent estmator. In ths model, we refer to pooled OLS estmaton Of course, f our assumpton regardng the unobservable varables s wrong, we are n the presence of an omtted varable, c. Then, we have potental bas and nconsstenc of pooled OLS. The magntude of these problems depends on how the true model behaves: fxed or random. 5

16 Pooled Regresson: Heterogenet Bas In the pooled model, there s no model for group/ndvdual heterogenet. Thus, pooled regresson ma result n heterogenet bas: Pooled regresson: t = β 0 +β x t +ε t j True model: Frm True model: Frm 4 j True model: Frm 3 True model: Frm x Pooled Regresson: Wthn Transformaton We can estmate b centerng the observatons around ther group/ndvdual means. That s, t t k j k j j j x ( x jt jt x Ths method s called the wthn-groups estmaton because t reles on varatons wthn ndvduals rather than between ndvduals. That s, ths estmator reflects the tme-seres or wthn-ndvdual nformaton reflected n the changes wthn ndvduals across tme. We are estmatng usng the tme-seres nformaton n the data. j t ) t 6

17 Pooled Regresson: Wthn Transformaton There s a cost n the smplct of the wthn-groups estmaton. Frst, the ntercept and an X varable that remans constant for each ndvdual (sa, gender or College degree) wll drop out of the model. The elmnaton of the ntercept ma not matter, but the loss of the unchangng explanator varables ma be frustratng. Obvousl, f we are nterested on the effect of gender on CEO compensaton, wthn transformaton wll not work. But t wll work well f we are nterested on the effect of an ndependent Board of Drectors, b lookng at the compensaton pre-/post-bod. Under the usual assumptons, pooled OLS s consstent and unbased. Pooled Regresson: Between Transformaton There s an addtonal alternatve to estmate, b expressng the model n terms of group/ndvdual means. That s, t k jk j j j x x jt j It s called the between estmator because t reles on varatons between ndvduals (sa, & j). We are estmatng usng the cross-sectonal nformaton n the data (the tme-seres ndvdual varaton s gone!). Obvousl, f we are nterested on the effect of a new ndependent BOD durng the tenure of a CEO on that CEO s compensaton, between transformaton wll not work. But t wll work well f we stud the effect of gender on CEO s compensaton. t 7

18 Pooled Regresson: Between Transformaton We lose observatons (and power!): we have onl N data ponts. Under the usual assumptons, pooled OLS usng the between transformaton s consstent and unbased. Useful Analss of Varance Notaton (Greene) The varance (total varanton) quantfes the dea that each ndvdual sa, each frm-- dffers from the overall average. We can decompose the varance nto two parts: a wthn-group/ndvdual part and a between group/ndvdual part. Decomposton of Total varaton: N T N T Σ N =Σ t=(zt z) Σ = Σ t=(zt z.) Σ =T z. z Total varaton = Wthn groups varaton + Between groups varaton Interpretaton: - Wthn group varaton: Measures varaton of ndvduals over tme. - Between group varaton: Measures varaton of the means across ndvduals. 8

19 WHO Data (Greene) Note: The varablt s drven b between groups varaton Pooled Model: Lvng wth (A3 ) We start wth the pooled model: = X* * +, wth X * = [X ι] - Σ T x(k+) matrx. * = [ α] - (k+)x matrx Now, we allow E[ j ' X ] = σ j Ω j Potentall a lot of dfferent forms for E[ j ' X ] n a panel: - Indvdual heteroscedastct. E[ X ] = σ - Indvdual/group effects: E[ t s X ] 0 (t s) - Tme (SUR or spatal) effects: E[ t jt X ] 0 ( j) - Persstent common shocks: E[ t js X ] 0 ( j) and t s < L 9

20 Pooled Model: Lvng wth (A3 ) Heteroscedastct ponts to GLS effcent estmaton, but, for consstent nferences we can use OLS wth Whte/NW SE. Before calculatng the NW SE, we cluster the data to remove the dependence caused b the wthn group correlaton of the data. Drscoll and Kraa (998): Average the x t e t over the clusters. Then, the KxK sandwch matrx s defned as ^ X ' X T t t j h ( b) h t j ( b)' wth h ( b) t N ( t) h ( b) t The NW method s appled to the tme seres of cross-sectonal averages of h t (b). If we do not suspect autocorrelaton problems, we can rel on Whte SE (S 0 ), also referred as Rogers SE. Pooled Model: PCSE - Clusterng We can cluster the SE b one varable (sa, ndustr) or b several varables (sa, ear and ndustr) -- mult-level clusterng. If these several varables are nested (sa, ndustr and state), cluster at hghest level. We assume that the correlatons wthn a cluster (a group of frms, a regon, dfferent ears for the same frm, dfferent ears for the same regon) are the same for dfferent observatons. Dfferent clusters can produce ver dfferent SE. We want to cluster n groups that produce correlated errors. Usuall, we cluster usng economc theor (clusterng b ndustr, ear, ndustr and ear). 0

21 Pooled OLS: Clustered SE Results (Greene) Ordnar least squares regresson... LHS=LWAGE Mean = Resduals Sum of squares = Standard error of e = Ft R-squared =.4 Model test F[ 8, 456] (prob) = 36.8(.0000) Panel Data Analss of LWAGE [ONE wa] Uncondtonal ANOVA (No regressors) Source Varaton Deg. Free. Mean Square Between Resdual Total Varable Coeffcent Standard Error b/st.er. P[ Z >z] EXP.04085*** EXPSQ ***.48048D OCC *** SMSA.4856*** MS.06798*** FEM *** UNION.0940*** ED.058*** Constant *** Pooled OLS: Clustered SE Results (Greene) Varable Coeffcent Standard Error b/st.er. P[ Z >z] Constant EXP EXPSQ D OCC SMSA MS FEM UNION ED Clustered SE Constant EXP EXPSQ D OCC SMSA MS FEM UNION ED Note: Clustered SE s tend to be bgger. The more correlaton allowed, the hgher the SE.

22 Pooled Model: PCSE Clusterng: Remarks Snce we allow for correlaton between observatons, clustered SE wll ncrease CIs. The hgher the clusterng level, the larger the resultng SE. Thus, dfferent clusters can produce dfferent SE. Rel when possble on economc theor/ntuton to cluster. It s not a bad dea to tr dfferent was of defnng clusters and see how the estmated SE are affected. Be conservatve, report largest SE. Practcal rules - If aggregate varables (sa, b ndustr, or zp code) are used n the model, clusterng should be done at that level. - When the data correlates n more than one wa, we have two cases: - If nested (sa, ct and state), cluster at hghest level of aggregaton - If not nested (e.g., tme and ndustr), use mult-level clusterng. Pooled Model: PCSE - Remarks The bgger the cross-sectonal correlaton, the bgger the SE. That s, NW SE s tend to be smaller than Drscoll and Kraa SE. In smulatons, t s found (as expected) that the PCSE perform better when there s cross-sectonal dependence n the data. But, when there s no dependence n the cross-secton, the standard Whte or NW SE do better. In some cases, these dfferences can be sgnfcant Testng for cross-sectonal dependence ma be a good dea, especall when results are not robust to dfferent SE. LM tests can be easl mplemented. Pesaran (004) proposes an eas test.

23 Pooled Model: PCSE - Remarks The computatonal ssues are straghtforward for balanced data. We need onl the vector of resduals, the model matrx (X), and ndcators for group (and, usuall, tme) to form the clusters. But for unbalanced there are two approaches - Create a balanced subset of the panel to estmate Ω. Advantage: Computatonall smple. - Loop over e t e jt pars to estmate covarances over avalable overlappng tme frames (loop over all pars can take a long tme). Advantage: Informaton s not thrown out. More on PCSE later. Cornwell and Rupert Data (Greene) Cornwell and Rupert Returns to Schoolng Data, 595 Indvduals, 7 Years Varables n the fle are EXP = work experence WKS = weeks worked OCC = occupaton, f blue collar, IND = f manufacturng ndustr SOUTH = f resdes n south SMSA = f resdes n a ct (SMSA) MS = f marred FEM = f female UNION = f wage set b unon contract ED = ears of educaton BLK = f ndvdual s black LWAGE = log of wage = dependent varable n regressons These data were analzed n Cornwell, C. and Rupert, P., "Effcent Estmaton wth Panel Data: An Emprcal Comparson of Instrumental Varable Estmators," Journal of Appled Econometrcs, 3, 988, pp See Baltag, page for further analss. The data were downloaded from the webste for Baltag's text. 3

24 Applcaton : Cornell and Rupert (Greene) Applcaton : Bd-Ask Spread (Hoechle) 4

25 Pooled Model wth (A3 ) - GLS We start wth the pooled model: = X* * + where X * = [X ι] - Σ T x(k+) matrx: * = [ α] - (k+)x matrx Now, we allow E[ j ' X ] = σ j Ω j We can use OLS wth PCSE s or we can do GLS. Note: Wh GLS? Effcenc. Suppose Ω j = I T Then, we onl have cross-equaton correlaton, not tme correlaton. We are back n the (aggregaton) SUR framework Pooled Model wth SUR - GLS Suppose Ω j = I T. We are n the (aggregaton) SUR framework: ˆ GLS ( X' V ( X'[ X) X' V I] X) ( X'[ I] X'[ I] X) X'[ I] For FGLS, use the pooled OLS resduals e and e j to estmate the covarance σ j. Note that ˆ T T t et et ' E' E T where E s a TxN matrx and e t =[e t e t... e Nt ]' s Nx vector. We need to nvert ˆ (NxN matrx). Note: In general, the rank(e) T. Then, rank( ˆ ) T < N => sngulart, FGLS cannot be computed. Ths s a problem of the data, not the model. 5

26 Pooled Model wth Heteroscedastct - GLS Now, suppose we have groupwse heteroscedastct. That s, E[ j ' X ] = 0 for j E[ X ] = σ We do FGLS, as usual, usng the pooled OLS resduals e to estmate the varance σ and, thus, to estmate Σ: We can test ths model wth H 0 : σ =σ =...=σ N. We can use: W = Σ (s s pooled)/var(s ) χ N where s s computed usng the pooled OLS e resduals. N Pooled Model wth Autocorrelaton - GLS Now, suppose we have ndvdual autocorrelaton. That s, E[ t js ' X ] = 0 for j E[ t t-p ' X ] 0 -for example, t t u Var[ t X ] = σ t We do FGLS, as usual, usng the pooled OLS resduals e to estmate the ρ and, thus, to estmate Σ : u ( ).. T.. T T We can test ths model wth H 0 : ρ = ρ =...= ρ N =0. We can use an LM test to test H

27 Pooled OLS wth Frst Dfferences From the general DGP: t = x t + c + t & (A)-(A4) appl. It ma stll be possble to use OLS to estmate, when we have ndvdual heterogenet. We can use OLS f we elmnate the cause of heterogenet: c We can do ths b takng frst dfferences of the DGP. That s, Δ t = t t- = (x t -x t- ) + Δ c + Δ t = Δx t + u t Note: All tme nvarant varables, ncludng c dsappear from the model (one dff ). If the model has a tme trend economc fluctuatons-, t also dsappear, t become the constant term (the other dff ). Thus, ths method s usuall called dffs n dffs (DD or DD). Pooled OLS wth Frst Dfferences Wth strct exogenet of (X,c ), the OLS regresson of Δ t on Δx t s unbased and consstent, but neffcent. Wh? The error s not longer t, but u t. The Var[u] s gven b:,, 0 0,3, 0 0,T,T Var (Toepltz form) That s, frst dfferencng produces heteroscedastct. Effcent estmaton method: GLS. It turns out that GLS s complcated. Use OLS n frst dfferences and use Newe-West SE/PCSE wth one lag. 7

28 OLS wth Frst Dffs: Treatment Applcaton Suppose there s random assgnment to treatment and control groups, lke n a tpcal medcal experment. We compare the change n outcomes across the treatment and control groups to estmate the treatment effect. (We used ths method natural experment n Lecture 8 to deal wth endogenet.) Wth two perods.e., before and after and strct exogenet: Then, t = = δ 0 + δ Treatment + (x ' x ' ) u t (Ths s a CLM. OLS s consstent and BLUE). E[ t Treatment = ] = δ 0 + δ + E[(x ' x ' ) Treatment = ] E[ t Treatment = 0] = δ 0 + E[(x ' x ' ) Treatment = 0] OLS wth Frst Dffs: Treatment Applcaton Assumng that controls are orthogonal to Treatment: δ = E[ t Treatment = ] E[ t Treatment = 0] δ s the dfference n average change n the two perods.e., before and after between the treated and control groups. Ths s the dffs n dffs (DD, DD) estmator. Tpcal problem: Exogenet (randomness) of treatment. That s, (A) E[u t Treatment ] = 0. In medcal experments, dffs n dffs estmaton s routnel used to evaluate the effectveness of a new treatment and/or medcaton. Usual H 0 : δ = 0. It can be tested wth a t-test (usng HAR/PCSEs). 8

29 OLS wth Frst Dffs: Treatment Applcaton Same result can be derved b lookng at levels DGP ( t, x t ) ncludng two dummes: One for Treatment t and one for after Treatment t (Post ): t = x t + c + γ 0 + γ Treatment + γ Post + δ Treatment x Post u t Now, t s eas to separate cross-sectonal dfferences from tme-seres dfferences. - Cross-sectonal dfference E[ t Treatment =, Post =] E[ t Treatment = 0, Post =] = γ + δ Note: Unbased f γ =0 No permanent dfference between the treatment and control groups. OLS wth Frst Dffs: Treatment Applcaton - Tme-sectonal dfference E[ t Treatment =, Post =] E[ t Treatment =, Post =0] = γ + δ Note: Unbased f γ =0 => No common trend over the pre- and posttreatment tmes. In Lecture 8 we emphaszed that we need to make sure that the treatment s the onl dfference between the two groups. That s, n the absence of treatment, the average change n t would have been the same for both groups. Ths s a ke assumpton behnd the DD estmator, usuall tested b lookng at a graph of the behavor of both groups before treatment recall example n Lecture 8 from Reddng & Sturm (008). 9

30 OLS wth Frst Dffs: Natural Experment In Fnance & Economcs, especall n Corporate Fnance, we appl the DD method when we use natural experments (a change n a law, a polc or a regulaton) to stud the effect of X t on t. (Recall Lecture 8.) We have two perods: Before and after the natural experment (the treatment). If we also have a well-defned control group, where the treatment was not admnstered.e., the natural experment never occurred, then, we can use DD estmaton. The number of groups, S, (treated & not treated) under consderaton s usuall small tpcall. N s usuall ver large. Dffs n Dffs: Natural Experment - Example : We are nterested n the effect of labor shocks on wages and emploments. Natural experment: The 980 Marel boatlfs, a temporar lftng of emgraton restrctons n Cuba. Most of the mareltos (the 980 Cuban mmgrants) settled n Mam. Two perods: Before and after the 980 Marel boatlfs. Control group: Low sklled workers n Houston, LA and Atlanta. Calculate unemploment and wages of low sklled workers n both perods. Then, regress t aganst a set of control varables (ndustr, educaton, age, etc.) and a treatment dumm: t = = δ 0 + δ Treatment + (x ' x ' ) u t H 0 : δ =0. Card (990) found no effect of massve mmgraton. 30

31 Dffs n Dffs: Natural Experment - Example : Suppose we are nterested n the effect of accountng transparenc on analsts forecast errors. We can use the Jul 30, 00 Sarbanes-Oxle (SOX) law as a natural experment. Two perods: Before and after SOX. Control group (frms under no SOX law): It ma be dffcult to fnd a good control group. Mabe, we can use analsts n Canada or n German as a control group. Calculate the dsperson of the analsts forecasts,, n both perods & regress t aganst a set of control varables (experence of analst, educaton, age, etc.) and a treatment dumm: t = = δ 0 + δ Treatment + (x ' x ' ) u t H 0 : δ =0. Dffs n Dffs: Remarks We express the DGP n terms of (ndvuals), s (groups), and t (tme): st = δ s + δ t + δ Treatment st + x st ' st Usuall, we have small S and T; but large N. Snce, n general, we have wthn group correlaton (treated ndvduals show smlar errors), the asmptotcs of the t-test are drven b S*T. Donald and Lang (004): Under the usual (generous) assumptons, t converges to a normal dstrbuton (a t ST-K ma work better). Intuton: Suppose that wthn s,t groups the errors are perfectl correlated. Then, we onl have S*T ndependent observatons! Gven the potental (tme-varng) correlatons n the errors, OLS SE can be terrble. PCSE tend to do better. 3

32 Dealng wth Attrton Attrton problem: If an unbalanced panel s a result of some selecton process related to ε t, then endogenet s present and need to be dealt wth usng some correcton methods. Otherwse, we have attrton bas. Example: In the "Qualt of Lfe for cancer patents" stud dscussed n Greene, appearance for the second ntervew was low for people wth ntal low QOL (death or depresson) or wth ntal hgh QOL (don t need the treatment). Solutons to the attrton problem Heckman selecton model (used n the stud) Prob[Present at ext covarates] = Φ(z θ) (Probt model) Addtonal varable added to dfference model = ϕ(z θ)/φ(z θ) The FDA soluton: fll wth zeros. (!) Pooled Model: ML Estmaton In the pooled model, = X +, we assume t ~N(0, Σ), where t = [ t, t,..., Nt ]' and Σ s an NxN matrx. We can wrte the log lkelhood functon as: L = log L(, Σ X) = -NT/ ln(π) T/ ln Σ - ½ Σ t t 'Σ - t The ML estmator s equal to the terated FGLS estmator. Testng s straghtforward wth lkelhood rato test. Example: H 0 : No cross correlaton across equatons: The off-dagonal elements of Σ are zero. LR = T (ln ˆ R - ln ˆ U ) = T (Σ ln s - ln ˆ ), whch follows a χ wth N(N-)/ d.f. 3

33 Man Models: FEM and REM Two man approaches to fttng models usng panel data: () Fxed effects regressons. () Random effects regressons. The ke dfference between these two approaches s how the unobservable characterstcs the ndvdual effects- are modeled. Termnolog from expermental desgn (sa, pscholog or medcne), where the emphass was on the knd of sample at hand and nferences: - FE: The ndvduals are fxed. The dfferences between them are not of nterest, onl s nterestng. No ntent on generalzng the results. - RE: The ndvduals come from a random sample drawn from a larger populaton, and the varance between them s nterestng and can be nformatve about the larger populaton. The Fxed Effects Model (FEM) The fxed effects (FE) model (A) t = x t + c + t -observaton for ndvdual at tme t. (A) E[ε t X s,c s ]=0, for all t and s. - X and c strct exogenous. The unobserved component, c, s arbtrarl correlated wth x t : E[c X ] = g(x ) = constant Cov[x t,c ] 0 Under the FE assumpton, snce we omt c, pooled OLS s based and nconsstent. We summarze ( control for ) these unobservable effects wth α, a constant. All tme nvarant characterstc of ndvdual (locaton, gender, natonalt, etc.) are swept awa under ths formulaton. Note: In a FEM, ndvduals serve as ther own controls. 33

34 Estmaton wth Fxed Effects Whatever effects the omtted varables have on the ndvdual at one tme, the wll also have the same effect at a later tme, thus, ther effects wll be constant, or fxed. For ths, we need the omtted varables to have tme-nvarant values wth tme-nvarant effects. Tpcal example, a CEO s IQ/manageral sklls. We expect ths varable to have the same effect at t= or t=0. As we wll see, FEM are estmated usng the wthn transformaton. Thus, f ndvduals do not change much (or at all) across tme, a FEM ma not work ver well. We need wthn-ndvduals varablt n the varables f we are to use ndvduals as ther own controls. Estmaton wth Fxed Effects Matrx notaton - In matrx notaton for ndvdual : = x + c + - c s a T x vector. (Each ndvdual has T observatons.) - In matrx notaton for all ndvduals.e., stackng: = X + c + - we have Σ T observatons. Now, c,, and are Σ T x vectors. Dumm varable representaton t x t ' N j c j d jt t ; d jt ( j) 34

35 Assumptons for the FE Model (FEM) The ndvdual unobservable characterstcs (effects) are correlated wth the ncluded varables: E[c X ] = g(x ) Cov[x t,c ] 0 The FE model assumes c = α (constant; t does not var wth t): = X + d α + ε, for each ndvdual. Stackng X d X 0 d 0 0 β ε α N XN dn β = [X,D] ε α = Zδ ε FEM: Estmaton The FEM s the CLM, but wth man ndependent varables: k+n. OLS s unbased, consstent, effcent, but mpractcal f N s large. The OLS estmates of β and α are gven b: b X X X D X a D X D D D Usng the Frsch-W aug h theorem b =[ X M DX ] X M D Note (Greene): LS s an estmator, not a model. Gven the formulaton wth a lot of dumm varables, ths partcular LS estmator s called Least Squares Dumm Varable (LSDV) estmator. 35

36 FEM: Estmaton M D MD 0 M D (The dumm varables are orthogonal) N 0 0 MD M I d dd d = I d d D T ( ) (/T) T N T D = D D k,l t= t,k.,k t,l.,l N T D = D, D k t=(xt,k -x.,k )(t-. ) XM X= XM X, XM X (x -x )(x -x ) XM = XM XM That s, we subtract the group mean from each ndvdual observaton. Then, the ndvdual effects dsappear. Now, OLS can easl be used to estmate the k β parameters, usng the demeaned data. We know ths method: The wthn-groups estmaton. FEM: Wthn Transformaton Removes Effects The wthn-groups method estmates the parameters usng demeaned data. That s, Y t Y Y t Y k j ( X j k j k j X jt j X j jt j X j t t ) ( t t ) t t Recall: It s called wthn-groups/ndvduals method because the model s explanng the varatons about the mean of the dependent varable n terms of the varatons about the means of the explanator varables for the group of observatons relatng to a gven ndvdual. 36

37 FEM: Wthn Transformaton Removes Effects t k j j ( x jt x j ) ( t t ) t There are costs n the smplct of the wthn-groups estmaton. Frst, all tme-nvarant varables (ncludng constant) for each ndvdual drop out of the model. Ths elmnates all betweenndvduals varablt (whch ma be contamnated b omtted varable bas) and leaves onl the wthn-subject varablt to analze. The loss of the unchangng explanator varables ma matter. For example, f we are studng CEO compensaton, the wthn group transformaton wll lose ndvdual schoolng, ears of experence, prevous job network, etc. That s, schoolng effects on CEO compensaton cannot be stud n ths context. FEM: Wthn Transformaton Removes Effects t k j j ( x jt x j ) ( t t ) t Second, the dependent varables are lkel to have much smaller varances than n the orgnal specfcaton. Now, the are measured as devatons from the ndvdual mean, rather than as absolute amounts. Ths s lkel to adversel affect the precson of the estmates of the coeffcents. It s also lkel to aggravate measurement error bas f the explanator varables are subject to measurement error. Thrd, the manpulaton nvolves the loss of N degrees of freedom (we are estmatng N means!). 37

38 FEM: LS Dumm Varable (LSDV) Estmator b s obtaned b wthn-groups least squares (group mean devatons). Then, we use the normal equatons to estmate a: D Xb + D Da=D a = (D D) - D ( Xb) Note: a=(/t)σ ( - x b)= e T t= t t - Ths s smple algebra the estmator s just OLS - Agan, LS s an estmator, not a model. Ths partcular OLS estmator s called LSDV estmator. - Note what a s when T =. Follow ths wth t -a -x t b=0 f T =. FEM: LSDV Estmator Recall the dumm varable trap: If a constant s present n the model, the number of dumm varable should be N-. The omtted ndvdual or group becomes the reference categor. However, the choce of reference categor s often arbtrar and, thus, the nterpretaton of the wll not be partcularl nterestng. Alternatvel, we can drop the ntercept and defne dumm varables for all of the ndvduals. Ths s the more common approach. The now become the ntercepts for each of the s. If E[ε t X s,c s ] 0, then LSDV cannot be used. It s nconsstent. In ths case, we need to use IVs. Or a good natural experment. 38

39 FEM: Frst-Dfference (FD) Method We can also elmnate the ndvdual FE usng the frst-dfference method. The unobserved effect s elmnated b subtractng the observaton for the prevous tme perod from the observaton for the current tme perod, for all tme perods. Y t k Yt )) t j k j j ( X jt X jt) ( t ( t t Y t j X jt t t The error term s now ( t t ). As before, dfferencng nduces a movng average autocorrelaton f t satsfes the CLM assumptons. Note: If t s subject to AR() autocorrelaton and s close to, takng frst dfferences ma approxmatel solve the problem. FEM: Estmaton FE or FD? Fxed-effects (or Wthn) Estmator Each varable s demeaned -.e., subtracted b ts average. Dumm Varable Regresson -.e., put n a dumm varable for each cross-sectonal unt, along wth other explanator varables. Ths ma cause estmaton dffcult when N s large. FD Estmator Each varable s dfferenced once over tme, so we are effectvel estmatng the relatonshp between changes of varables. 39

40 FEM: Estmaton FE or FD? Theoretcall, when N s large and T s small but greater than (for T=, FE=FD), FE s more effcent when ε t are serall uncorrelated whle FD s more effcent when ε t follows a random walk (ρ=). When T s large and N s small FD has advantage for processes wth large postve autocorrelaton. (If s near, FD solves the nonstatonar problem!) FE s more senstve to nonnormalt, heteroskedastct, and seral correlaton n ε t. On the other hand, FE s less senstve to volaton of the strct exogenet assumpton. Then, FE s preferred when the processes are weakl dependent over tme FEM: Calculaton of Var[b X] Assume strct exogenet: Cov[ε t,(x js,c j )]=0. Ever dsturbance n ever perod for each person s uncorrelated wth varables and effects for ever person and across perods. Now, we have OLS n a CLM. As.Var[b X] = N N N ( / T )plm[( / T ) XM X] = = = D whch s the usual estmator for OLS ( -a -x b) ˆ N T = t= t t N =T - N - K (Note the degrees of freedom correcton) 40

41 FEM: PCSE All prevous comments and remarks appl to the FEM. If dependence structure s suspected, we cluster the SE. We can clustered them b one varable (sa, ndustr, regon) or b several varables (sa, ear and ndustr) -- mult-level clusterng. We buld the SE accordng to the tpe of data we have: - If we do not suspect autocorrelated errors not a strange stuaton-, we can rel on clustered Whte SE s (S 0 ). - If we suspect autocorrelated errors, then the Drscoll and Kraa SE should be used. FEM: Testng for Fxed Effects Under H 0 (No FE): α = α for all. => That s, we test whether to pool or not to pool the data. Dfferent tests: F-test based on the LSDV dumm varable model: constant or zero coeffcents for D. Test follows an F( N-,NT-N-K ) dstrbuton. F-test based on FEM (the unrestrcted model) vs. pooled model (the restrcted model). Test follows an F( N-,NT-N-K ) dstrbuton. A LR can also be done usuall, assumng normalt. Test follows a χ N- dstrbuton. 4

42 FEM: Hpothess Testng Based on estmated resduals of the fxed effects model. () Estmate FEM: t = x t β + α + t => Keep resduals e FE,t () Tests as usual: Heteroscedastct Breusch and Pagan (980) Autocorrelaton: AR() Breusch and Godfre (98) LM NT e ' FE efe d ' T efe efe Applcaton: Cornwell and Rupert Data (Greene) Cornwell and Rupert Returns to Schoolng Data, 595 Indvduals, 7 Years Varables n the fle are: (Not used n regressons) EXP = work experence, EXPSQ = EXP WKS = weeks worked OCC = occupaton, f blue collar, (IND = f manufacturng ndustr) (SOUTH = f resdes n south) SMSA = f resdes n a ct (SMSA) MS = f marred FEM = f female UNION = f wage set b unon contract ED = ears of educaton (BLK = f ndvdual s black) LWAGE = log of wage = dependent varable n regressons (Y) These data were analzed n Cornwell, C. and Rupert, P., "Effcent Estmaton wth Panel Data: An Emprcal Comparson of Instrumental Varable Estmators," Journal of Appled Econometrcs, 3, 988, pp

43 Applcaton: Cornwell and Rupert (Greene) () Returns to Schoolng - Pooled OLS Results K RSS & R X onl Applcaton: Cornwell and Rupert (Greene) () Returns to Schoolng - LSDV Results N+K RSS & R X and group effects 43

44 FEM: Testng for FE (and other formulatons) Pooled FEM Calculatons: F-test 594,3566 = [( )/594]/[83.89/3566] = The Random Effects Model (REM) Recall the general DGP: t = x t + z γ + t -observaton for ndvdual at tme t. When the observed characterstcs are constant for each ndvdual, a FEM s not an effectve tool because such varables cannot be ncluded. An alternatve approach, known as a random effects (REM) model that, subject to two condtons, provdes a soluton to ths problem. Condtons: ()It s possble to treat each of the unobserved Z p varables as beng drawn randoml from a gven dstrbuton. () The Z p varables are dstrbuted ndependentl of all of the X j varables. 44

45 The Random Effects Model (REM) Condtons: () The unobserved Z p varables are drawn randoml from a gven dstrbuton. Thus, the c ma be treated as RV (thus, the name of ths approach) drawn from a gven dstrbuton. Let s call t u. Then, Y t k j k j X j X j jt jt u t t w t t w t u t We deal wth the unobserved effect b subsumng t nto a compound dsturbance term, w t. We wll assume that the Z p s drawn from a dstrbuton wth zero mean and constant varance. Then, E( w ) E( u ) E( u ) E( ) 0 t t t The Random Effects Model (REM) The zero mean assumpton E[u ] = 0-- s not crucal, an nonzero component s beng absorbed b the ntercept,. () The Z p varables are dstrbuted ndependentl of all of the X j varables. Otherwse, u and the compounded error, w t, wll not be uncorrelated wth X j. The RE estmaton wll be based and nconsstent. Note: We would have to use the FEM, even f the frst condton seems to be satsfed. If () and () are satsfed, we can use the REM, but there s a complcaton. Now, w t wll be heterosccedastc. 45

46 REM: Error Components Model REM Assumptons: t = x t + z γ + t = x t + u + t = x t + w t E[ t X ]=0 E[ t X ]= σ ε E[u X ]=0 E[u X ]= σ u E[u t X ] = E[u jt X ] =0 - u and are ndependent. E[u u j X ] = 0 ( j) -no cross-correlaton of RE. E[ t jt X ]= 0 ( j) -no cross-correlaton for the errors, t. E[ t js X ]= 0 (t s) -there s no autocorrelaton for t. w w t w t t u t ( u t u )( u t t ) u, u u t REM: Notaton (Greene) X ε u T observatons u X ε T observatons β N XN εn unn T N observatons N = Xβ+ ε+ u T observatons = Xβ+ w = In all that follows, except where explctl noted, X, X and x contan a constant term as the frst element. t To avod notatonal clutter, n those cases, x etc. wll smpl denote the counterpart wthout the constant term. Use of the smbol K for the number of varables wll thus be context specfc but wll usuall nclude the constant term. t 46

47 REM: Notaton (Greene) u u u u u u Var[ +u ] ε u u u = IT u T T = I T u = Ω Ω 0 0 Var[ ] 0 Ω 0 w X (Note these dffer onl n the dmenson T ) 0 0 ΩN Note: If E[ t jt X ] 0 ( j) or E[ t js X ] 0 (t s), we no longer have ths nce dagonal-tpe structure for Var[w X]. REM: Assumptons - Convergence of Moments XX T N XΩX T N f N f N XX T XΩ X T a weghted sum of ndvdual moment matrces N N u T a weghted sum of ndvdual moment matrces XX = f f xx Note asmptotcs are wth respect to N. Each matrx XX T s the moments for the T observatons. Should be 'well behaved' n mcro level data. The average of N such matrces should be lkewse. T or T s assumed to be fxed (and small). 47

48 REM: Pooled OLS Estmaton (Greene) Standard results for the pooled OLS estmator b n the GR model - Consstent and asmptotc normal - Unbased - Ineffcent We can use pooled OLS, but for nferences we need the true varance.e., the sandwch estmator: XX XΩX XX Var[ b X] N N N N T T T T Q Q * Q 0 as N wth our convergence assumptons REM: Sandwch Estmator for OLS (Greene) N N N XX XΩ X XX Var[ b X] N N N N T T T T X Ω X N X Ω X f, where = Ω =E[ w w X ] T T In the sprt of the Whte estmator, use X Ω X T X ˆ ˆ w w X f, T w ˆ = Hpothess tests are then based on W ald statstcs. - X b THIS IS THE 'CLUSTER' ESTIMATOR Recall: Clustered standard errors or PCSE There s a groupng, or cluster, wthn whch the error term s possbl correlated, but outsde of whch (across groups) t s not. 48

49 REM: Sandwch Estmator Mechancs (Greene) N ˆ ˆ Est.Var[ b X] XX XwwX XX wˆ = set of T OLS resduals for ndvdual. X = TxK data on exogenous varable for ndvdual. Xw ˆ = K x vector of products ( Xw ˆ )( wx ˆ ) N Xw wx KxK matrx (rank, outer product) ˆ ˆ = sum of N rank matrces. Rank K. N N ˆ ˆ ˆ We could compute ths as X w w X = X Ω X. Wh not do t that wa? REM: GLS Standard results for GLS n a GR model - Consstent - Unbased - Effcent (f functonal form for Ω correct) ˆ - - β =[ X Ω X ] [ X Ω ] N - N - X Ω X X Ω =[ ] [ ] - Ω I T T u (note, depends on onl through T ) As usual, the matrx Ω -/ =P wll be used to transform the data. 49

50 REM: GLS The matrx Ω -/ = P s used to transform the data. That s, t where As. Var [ ˆ x GLS t x ] ( X ' t T X ) u ( X *' X *) We call the transformed data: quas tme-demeaned data. As expected, GLS s just pooled OLS wth the transformed data. Note: The RE can be seen as mxture of two estmators: - when θ = 0 (σ u =0) => pooled OLS estmator - when θ = (σ ε =0 or σ u ) => LSDV estmator (u s become the FE) Then, the bgger (smaller) the varance of the unobserved effect.e., ndvdual heterogenet s bgger-, the closer t s to FE (pooled OLS). Also, when T s large, t becomes more lke FE REM: FGLS - Estmators for the Varances To transform the data, we need to estmate σ ε and σ u, consstentl. Usual steps (assume a balanced panel): () Start wth a consstent estmator of β. For example, pooled OLS, b. () Compute Σ Σ t ( t - x t b) -It estmates Σ Σ t (σ ε + σ u ) (3) Dvde b a functon of NT. For example: NT K => We have an estmator of σ, s pooled = e pooled e pooled /(NT K ) We wll use s pooled to estmate the sum: σ ε + σ u (4) Use LSDV estmaton to get a and b LSDV. Keep resduals, e FE,t. (5) Compute Σ Σ t ( t -a - x t b LSDV ) -It estmates Σ Σ t (σ ε ) (6) Dvde b NT-K-N. => We have an estmator of σ ε, s ε=σ Σ t (e FE,t ) /(NT-K-N) (7) Estmate σ u as s u= s pooled -s ε 50

51 REM: FGLS - Estmators for the Varances Feasble GLS requres (onl) consstent estmators of and. Canddates: u ( a xb ) From the robust LSDV estmator: ˆ From the pooled OLS estmator: N T t t t LSDV N TK N N u ( a xb ) T t t OLS t OLS N TK N (t a xb MEANS) From the group means regresson: / T u NK ww ˆ ˆ (Wooldrdge) Based on E[w w ] f t s, ˆ There are man others. N T T t s t st t s X u u N TK N Note: A slght chance n notaton, x does not contan the constant term. REM: Practcal Problems wth FGLS All of the precedng regularl produce negatve estmates of. Estmaton s made ver complcated n unbalanced panels. A bulletproof soluton (orgnall used n TSP, now LIMDEP and others). From the robust LSDV estmator: ˆ u ( a xb ) N T t t t LSDV N T ( a xb ) From the pooled OLS estmator: ˆ ( a xb ) ( N T t t OLS t OLS u N ˆ T a xb ) N T N T t t OLS t OLS t t t LSDV u N T 0 Bullet proof soluton: Do not correct b degrees of freedom. Then, gven that the unrestrcted RSS (LSDV) wll be lower than the restrcted (pooled OLS) RSS, σ u wll be postve! 5

52 Applcaton: Fxed Effects Estmates (Greene) Least Squares wth Group Dumm Varables... LHS=LWAGE Mean = Resduals Sum of squares = Standard error of e =.505 These varables have no wthn group varaton. FEM ED F.E. estmates are based on a generalzed nverse Varable Coeffcent Standard Error b/st.er. P[ Z >z] Mean of X EXP.346*** EXPSQ *** D OCC SMSA ** MS FEM (Fxed Parameter)... UNION.0343** ED (Fxed Parameter) REM: Computng Varance Estmators (Greene) Usng full lst of varables (FEM and ED are tme nvarant) OLS sum of squares = = / (465-9) = u Usng full lst of varables and a generalzed nverse (same as droppng FEM and ED), LSDV sum of squares = u = / ( ) = = Both estmators are postve. We stop here. If u were negatve, we would use estmators wthout DF correctons. 5

Econometric Analysis of Panel Data. William Greene Department of Economics Stern School of Business

Econometric Analysis of Panel Data. William Greene Department of Economics Stern School of Business Econometrc Analyss of Panel Data Wllam Greene Department of Economcs Stern School of Busness Econometrc Analyss of Panel Data 5. Random Effects Lnear Model The Random Effects Model The random effects model

More information

Empirical Methods for Corporate Finance. Identification

Empirical Methods for Corporate Finance. Identification mprcal Methods for Corporate Fnance Identfcaton Causalt Ultmate goal of emprcal research n fnance s to establsh a causal relatonshp between varables.g. What s the mpact of tangblt on leverage?.g. What

More information

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4)

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4) I. Classcal Assumptons Econ7 Appled Econometrcs Topc 3: Classcal Model (Studenmund, Chapter 4) We have defned OLS and studed some algebrac propertes of OLS. In ths topc we wll study statstcal propertes

More information

Econometrics of Panel Data

Econometrics of Panel Data Econometrcs of Panel Data Jakub Mućk Meetng # 8 Jakub Mućk Econometrcs of Panel Data Meetng # 8 1 / 17 Outlne 1 Heterogenety n the slope coeffcents 2 Seemngly Unrelated Regresson (SUR) 3 Swamy s random

More information

e i is a random error

e i is a random error Chapter - The Smple Lnear Regresson Model The lnear regresson equaton s: where + β + β e for,..., and are observable varables e s a random error How can an estmaton rule be constructed for the unknown

More information

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity LINEAR REGRESSION ANALYSIS MODULE IX Lecture - 30 Multcollnearty Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur 2 Remedes for multcollnearty Varous technques have

More information

Systems of Equations (SUR, GMM, and 3SLS)

Systems of Equations (SUR, GMM, and 3SLS) Lecture otes on Advanced Econometrcs Takash Yamano Fall Semester 4 Lecture 4: Sstems of Equatons (SUR, MM, and 3SLS) Seemngl Unrelated Regresson (SUR) Model Consder a set of lnear equatons: $ + ɛ $ + ɛ

More information

9. Binary Dependent Variables

9. Binary Dependent Variables 9. Bnar Dependent Varables 9. Homogeneous models Log, prob models Inference Tax preparers 9.2 Random effects models 9.3 Fxed effects models 9.4 Margnal models and GEE Appendx 9A - Lkelhood calculatons

More information

Economics 130. Lecture 4 Simple Linear Regression Continued

Economics 130. Lecture 4 Simple Linear Regression Continued Economcs 130 Lecture 4 Contnued Readngs for Week 4 Text, Chapter and 3. We contnue wth addressng our second ssue + add n how we evaluate these relatonshps: Where do we get data to do ths analyss? How do

More information

Exam. Econometrics - Exam 1

Exam. Econometrics - Exam 1 Econometrcs - Exam 1 Exam Problem 1: (15 ponts) Suppose that the classcal regresson model apples but that the true value of the constant s zero. In order to answer the followng questons assume just one

More information

Limited Dependent Variables and Panel Data. Tibor Hanappi

Limited Dependent Variables and Panel Data. Tibor Hanappi Lmted Dependent Varables and Panel Data Tbor Hanapp 30.06.2010 Lmted Dependent Varables Dscrete: Varables that can take onl a countable number of values Censored/Truncated: Data ponts n some specfc range

More information

Resource Allocation and Decision Analysis (ECON 8010) Spring 2014 Foundations of Regression Analysis

Resource Allocation and Decision Analysis (ECON 8010) Spring 2014 Foundations of Regression Analysis Resource Allocaton and Decson Analss (ECON 800) Sprng 04 Foundatons of Regresson Analss Readng: Regresson Analss (ECON 800 Coursepak, Page 3) Defntons and Concepts: Regresson Analss statstcal technques

More information

Now we relax this assumption and allow that the error variance depends on the independent variables, i.e., heteroskedasticity

Now we relax this assumption and allow that the error variance depends on the independent variables, i.e., heteroskedasticity ECON 48 / WH Hong Heteroskedastcty. Consequences of Heteroskedastcty for OLS Assumpton MLR. 5: Homoskedastcty var ( u x ) = σ Now we relax ths assumpton and allow that the error varance depends on the

More information

Lecture 6: Introduction to Linear Regression

Lecture 6: Introduction to Linear Regression Lecture 6: Introducton to Lnear Regresson An Manchakul amancha@jhsph.edu 24 Aprl 27 Lnear regresson: man dea Lnear regresson can be used to study an outcome as a lnear functon of a predctor Example: 6

More information

Chapter 14 Simple Linear Regression

Chapter 14 Simple Linear Regression Chapter 4 Smple Lnear Regresson Chapter 4 - Smple Lnear Regresson Manageral decsons often are based on the relatonshp between two or more varables. Regresson analss can be used to develop an equaton showng

More information

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands Content. Inference on Regresson Parameters a. Fndng Mean, s.d and covarance amongst estmates.. Confdence Intervals and Workng Hotellng Bands 3. Cochran s Theorem 4. General Lnear Testng 5. Measures of

More information

Statistics for Economics & Business

Statistics for Economics & Business Statstcs for Economcs & Busness Smple Lnear Regresson Learnng Objectves In ths chapter, you learn: How to use regresson analyss to predct the value of a dependent varable based on an ndependent varable

More information

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding Recall: man dea of lnear regresson Lecture 9: Lnear regresson: centerng, hypothess testng, multple covarates, and confoundng Sandy Eckel seckel@jhsph.edu 6 May 8 Lnear regresson can be used to study an

More information

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding Lecture 9: Lnear regresson: centerng, hypothess testng, multple covarates, and confoundng Sandy Eckel seckel@jhsph.edu 6 May 008 Recall: man dea of lnear regresson Lnear regresson can be used to study

More information

Chapter 5 Multilevel Models

Chapter 5 Multilevel Models Chapter 5 Multlevel Models 5.1 Cross-sectonal multlevel models 5.1.1 Two-level models 5.1.2 Multple level models 5.1.3 Multple level modelng n other felds 5.2 Longtudnal multlevel models 5.2.1 Two-level

More information

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Analyss of Varance and Desgn of Experment-I MODULE VII LECTURE - 3 ANALYSIS OF COVARIANCE Dr Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur Any scentfc experment s performed

More information

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction ECONOMICS 5* -- NOTE (Summary) ECON 5* -- NOTE The Multple Classcal Lnear Regresson Model (CLRM): Specfcaton and Assumptons. Introducton CLRM stands for the Classcal Lnear Regresson Model. The CLRM s also

More information

Lecture 3 Specification

Lecture 3 Specification Lecture 3 Specfcaton 1 OLS Estmaton - Assumptons CLM Assumptons (A1) DGP: y = X + s correctly specfed. (A) E[ X] = 0 (A3) Var[ X] = σ I T (A4) X has full column rank rank(x)=k-, where T k. Q: What happens

More information

Basically, if you have a dummy dependent variable you will be estimating a probability.

Basically, if you have a dummy dependent variable you will be estimating a probability. ECON 497: Lecture Notes 13 Page 1 of 1 Metropoltan State Unversty ECON 497: Research and Forecastng Lecture Notes 13 Dummy Dependent Varable Technques Studenmund Chapter 13 Bascally, f you have a dummy

More information

x i1 =1 for all i (the constant ).

x i1 =1 for all i (the constant ). Chapter 5 The Multple Regresson Model Consder an economc model where the dependent varable s a functon of K explanatory varables. The economc model has the form: y = f ( x,x,..., ) xk Approxmate ths by

More information

Chapter 2 - The Simple Linear Regression Model S =0. e i is a random error. S β2 β. This is a minimization problem. Solution is a calculus exercise.

Chapter 2 - The Simple Linear Regression Model S =0. e i is a random error. S β2 β. This is a minimization problem. Solution is a calculus exercise. Chapter - The Smple Lnear Regresson Model The lnear regresson equaton s: where y + = β + β e for =,..., y and are observable varables e s a random error How can an estmaton rule be constructed for the

More information

Rockefeller College University at Albany

Rockefeller College University at Albany Rockefeller College Unverst at Alban PAD 705 Handout: Maxmum Lkelhood Estmaton Orgnal b Davd A. Wse John F. Kenned School of Government, Harvard Unverst Modfcatons b R. Karl Rethemeer Up to ths pont n

More information

Econ107 Applied Econometrics Topic 9: Heteroskedasticity (Studenmund, Chapter 10)

Econ107 Applied Econometrics Topic 9: Heteroskedasticity (Studenmund, Chapter 10) I. Defnton and Problems Econ7 Appled Econometrcs Topc 9: Heteroskedastcty (Studenmund, Chapter ) We now relax another classcal assumpton. Ths s a problem that arses often wth cross sectons of ndvduals,

More information

Primer on High-Order Moment Estimators

Primer on High-Order Moment Estimators Prmer on Hgh-Order Moment Estmators Ton M. Whted July 2007 The Errors-n-Varables Model We wll start wth the classcal EIV for one msmeasured regressor. The general case s n Erckson and Whted Econometrc

More information

Chapter 13: Multiple Regression

Chapter 13: Multiple Regression Chapter 13: Multple Regresson 13.1 Developng the multple-regresson Model The general model can be descrbed as: It smplfes for two ndependent varables: The sample ft parameter b 0, b 1, and b are used to

More information

A Comparative Study for Estimation Parameters in Panel Data Model

A Comparative Study for Estimation Parameters in Panel Data Model A Comparatve Study for Estmaton Parameters n Panel Data Model Ahmed H. Youssef and Mohamed R. Abonazel hs paper examnes the panel data models when the regresson coeffcents are fxed random and mxed and

More information

Chapter 8 Indicator Variables

Chapter 8 Indicator Variables Chapter 8 Indcator Varables In general, e explanatory varables n any regresson analyss are assumed to be quanttatve n nature. For example, e varables lke temperature, dstance, age etc. are quanttatve n

More information

First Year Examination Department of Statistics, University of Florida

First Year Examination Department of Statistics, University of Florida Frst Year Examnaton Department of Statstcs, Unversty of Florda May 7, 010, 8:00 am - 1:00 noon Instructons: 1. You have four hours to answer questons n ths examnaton.. You must show your work to receve

More information

Testing for seasonal unit roots in heterogeneous panels

Testing for seasonal unit roots in heterogeneous panels Testng for seasonal unt roots n heterogeneous panels Jesus Otero * Facultad de Economía Unversdad del Rosaro, Colomba Jeremy Smth Department of Economcs Unversty of arwck Monca Gulett Aston Busness School

More information

Econ Statistical Properties of the OLS estimator. Sanjaya DeSilva

Econ Statistical Properties of the OLS estimator. Sanjaya DeSilva Econ 39 - Statstcal Propertes of the OLS estmator Sanjaya DeSlva September, 008 1 Overvew Recall that the true regresson model s Y = β 0 + β 1 X + u (1) Applyng the OLS method to a sample of data, we estmate

More information

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6 Department of Quanttatve Methods & Informaton Systems Tme Seres and Ther Components QMIS 30 Chapter 6 Fall 00 Dr. Mohammad Zanal These sldes were modfed from ther orgnal source for educatonal purpose only.

More information

Outline. Zero Conditional mean. I. Motivation. 3. Multiple Regression Analysis: Estimation. Read Wooldridge (2013), Chapter 3.

Outline. Zero Conditional mean. I. Motivation. 3. Multiple Regression Analysis: Estimation. Read Wooldridge (2013), Chapter 3. Outlne 3. Multple Regresson Analyss: Estmaton I. Motvaton II. Mechancs and Interpretaton of OLS Read Wooldrdge (013), Chapter 3. III. Expected Values of the OLS IV. Varances of the OLS V. The Gauss Markov

More information

Linear Regression Analysis: Terminology and Notation

Linear Regression Analysis: Terminology and Notation ECON 35* -- Secton : Basc Concepts of Regresson Analyss (Page ) Lnear Regresson Analyss: Termnology and Notaton Consder the generc verson of the smple (two-varable) lnear regresson model. It s represented

More information

Comparison of Regression Lines

Comparison of Regression Lines STATGRAPHICS Rev. 9/13/2013 Comparson of Regresson Lnes Summary... 1 Data Input... 3 Analyss Summary... 4 Plot of Ftted Model... 6 Condtonal Sums of Squares... 6 Analyss Optons... 7 Forecasts... 8 Confdence

More information

β0 + β1xi. You are interested in estimating the unknown parameters β

β0 + β1xi. You are interested in estimating the unknown parameters β Revsed: v3 Ordnar Least Squares (OLS): Smple Lnear Regresson (SLR) Analtcs The SLR Setup Sample Statstcs Ordnar Least Squares (OLS): FOCs and SOCs Back to OLS and Sample Statstcs Predctons (and Resduals)

More information

Composite Hypotheses testing

Composite Hypotheses testing Composte ypotheses testng In many hypothess testng problems there are many possble dstrbutons that can occur under each of the hypotheses. The output of the source s a set of parameters (ponts n a parameter

More information

See Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition)

See Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition) Count Data Models See Book Chapter 11 2 nd Edton (Chapter 10 1 st Edton) Count data consst of non-negatve nteger values Examples: number of drver route changes per week, the number of trp departure changes

More information

BIO Lab 2: TWO-LEVEL NORMAL MODELS with school children popularity data

BIO Lab 2: TWO-LEVEL NORMAL MODELS with school children popularity data Lab : TWO-LEVEL NORMAL MODELS wth school chldren popularty data Purpose: Introduce basc two-level models for normally dstrbuted responses usng STATA. In partcular, we dscuss Random ntercept models wthout

More information

Online Appendix to: Axiomatization and measurement of Quasi-hyperbolic Discounting

Online Appendix to: Axiomatization and measurement of Quasi-hyperbolic Discounting Onlne Appendx to: Axomatzaton and measurement of Quas-hyperbolc Dscountng José Lus Montel Olea Tomasz Strzaleck 1 Sample Selecton As dscussed before our ntal sample conssts of two groups of subjects. Group

More information

Introduction to Analysis of Variance (ANOVA) Part 1

Introduction to Analysis of Variance (ANOVA) Part 1 Introducton to Analss of Varance (ANOVA) Part 1 Sngle factor The logc of Analss of Varance Is the varance explaned b the model >> than the resdual varance In regresson models Varance explaned b regresson

More information

SIMPLE LINEAR REGRESSION

SIMPLE LINEAR REGRESSION Smple Lnear Regresson and Correlaton Introducton Prevousl, our attenton has been focused on one varable whch we desgnated b x. Frequentl, t s desrable to learn somethng about the relatonshp between two

More information

The Geometry of Logit and Probit

The Geometry of Logit and Probit The Geometry of Logt and Probt Ths short note s meant as a supplement to Chapters and 3 of Spatal Models of Parlamentary Votng and the notaton and reference to fgures n the text below s to those two chapters.

More information

Department of Statistics University of Toronto STA305H1S / 1004 HS Design and Analysis of Experiments Term Test - Winter Solution

Department of Statistics University of Toronto STA305H1S / 1004 HS Design and Analysis of Experiments Term Test - Winter Solution Department of Statstcs Unversty of Toronto STA35HS / HS Desgn and Analyss of Experments Term Test - Wnter - Soluton February, Last Name: Frst Name: Student Number: Instructons: Tme: hours. Ads: a non-programmable

More information

NUMERICAL DIFFERENTIATION

NUMERICAL DIFFERENTIATION NUMERICAL DIFFERENTIATION 1 Introducton Dfferentaton s a method to compute the rate at whch a dependent output y changes wth respect to the change n the ndependent nput x. Ths rate of change s called the

More information

Reduced slides. Introduction to Analysis of Variance (ANOVA) Part 1. Single factor

Reduced slides. Introduction to Analysis of Variance (ANOVA) Part 1. Single factor Reduced sldes Introducton to Analss of Varance (ANOVA) Part 1 Sngle factor 1 The logc of Analss of Varance Is the varance explaned b the model >> than the resdual varance In regresson models Varance explaned

More information

Lecture 3: Probability Distributions

Lecture 3: Probability Distributions Lecture 3: Probablty Dstrbutons Random Varables Let us begn by defnng a sample space as a set of outcomes from an experment. We denote ths by S. A random varable s a functon whch maps outcomes nto the

More information

A Monte Carlo Study for Swamy s Estimate of Random Coefficient Panel Data Model

A Monte Carlo Study for Swamy s Estimate of Random Coefficient Panel Data Model A Monte Carlo Study for Swamy s Estmate of Random Coeffcent Panel Data Model Aman Mousa, Ahmed H. Youssef and Mohamed R. Abonazel Department of Appled Statstcs and Econometrcs, Instute of Statstcal Studes

More information

since [1-( 0+ 1x1i+ 2x2 i)] [ 0+ 1x1i+ assumed to be a reasonable approximation

since [1-( 0+ 1x1i+ 2x2 i)] [ 0+ 1x1i+ assumed to be a reasonable approximation Econ 388 R. Butler 204 revsons Lecture 4 Dummy Dependent Varables I. Lnear Probablty Model: the Regresson model wth a dummy varables as the dependent varable assumpton, mplcaton regular multple regresson

More information

β0 + β1xi and want to estimate the unknown

β0 + β1xi and want to estimate the unknown SLR Models Estmaton Those OLS Estmates Estmators (e ante) v. estmates (e post) The Smple Lnear Regresson (SLR) Condtons -4 An Asde: The Populaton Regresson Functon B and B are Lnear Estmators (condtonal

More information

x yi In chapter 14, we want to perform inference (i.e. calculate confidence intervals and perform tests of significance) in this setting.

x yi In chapter 14, we want to perform inference (i.e. calculate confidence intervals and perform tests of significance) in this setting. The Practce of Statstcs, nd ed. Chapter 14 Inference for Regresson Introducton In chapter 3 we used a least-squares regresson lne (LSRL) to represent a lnear relatonshp etween two quanttatve explanator

More information

Factor models with many assets: strong factors, weak factors, and the two-pass procedure

Factor models with many assets: strong factors, weak factors, and the two-pass procedure Factor models wth many assets: strong factors, weak factors, and the two-pass procedure Stanslav Anatolyev 1 Anna Mkusheva 2 1 CERGE-EI and NES 2 MIT December 2017 Stanslav Anatolyev and Anna Mkusheva

More information

Chapter 9: Statistical Inference and the Relationship between Two Variables

Chapter 9: Statistical Inference and the Relationship between Two Variables Chapter 9: Statstcal Inference and the Relatonshp between Two Varables Key Words The Regresson Model The Sample Regresson Equaton The Pearson Correlaton Coeffcent Learnng Outcomes After studyng ths chapter,

More information

Properties of Least Squares

Properties of Least Squares Week 3 3.1 Smple Lnear Regresson Model 3. Propertes of Least Squares Estmators Y Y β 1 + β X + u weekly famly expendtures X weekly famly ncome For a gven level of x, the expected level of food expendtures

More information

Continuous vs. Discrete Goods

Continuous vs. Discrete Goods CE 651 Transportaton Economcs Charsma Choudhury Lecture 3-4 Analyss of Demand Contnuous vs. Dscrete Goods Contnuous Goods Dscrete Goods x auto 1 Indfference u curves 3 u u 1 x 1 0 1 bus Outlne Data Modelng

More information

Computation of Higher Order Moments from Two Multinomial Overdispersion Likelihood Models

Computation of Higher Order Moments from Two Multinomial Overdispersion Likelihood Models Computaton of Hgher Order Moments from Two Multnomal Overdsperson Lkelhood Models BY J. T. NEWCOMER, N. K. NEERCHAL Department of Mathematcs and Statstcs, Unversty of Maryland, Baltmore County, Baltmore,

More information

DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR. Introductory Econometrics 1 hour 30 minutes

DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR. Introductory Econometrics 1 hour 30 minutes 25/6 Canddates Only January Examnatons 26 Student Number: Desk Number:...... DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR Department Module Code Module Ttle Exam Duraton

More information

Limited Dependent Variables

Limited Dependent Variables Lmted Dependent Varables. What f the left-hand sde varable s not a contnuous thng spread from mnus nfnty to plus nfnty? That s, gven a model = f (, β, ε, where a. s bounded below at zero, such as wages

More information

Correlation and Regression. Correlation 9.1. Correlation. Chapter 9

Correlation and Regression. Correlation 9.1. Correlation. Chapter 9 Chapter 9 Correlaton and Regresson 9. Correlaton Correlaton A correlaton s a relatonshp between two varables. The data can be represented b the ordered pars (, ) where s the ndependent (or eplanator) varable,

More information

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA 4 Analyss of Varance (ANOVA) 5 ANOVA 51 Introducton ANOVA ANOVA s a way to estmate and test the means of multple populatons We wll start wth one-way ANOVA If the populatons ncluded n the study are selected

More information

x = , so that calculated

x = , so that calculated Stat 4, secton Sngle Factor ANOVA notes by Tm Plachowsk n chapter 8 we conducted hypothess tests n whch we compared a sngle sample s mean or proporton to some hypotheszed value Chapter 9 expanded ths to

More information

[ ] λ λ λ. Multicollinearity. multicollinearity Ragnar Frisch (1934) perfect exact. collinearity. multicollinearity. exact

[ ] λ λ λ. Multicollinearity. multicollinearity Ragnar Frisch (1934) perfect exact. collinearity. multicollinearity. exact Multcollnearty multcollnearty Ragnar Frsch (934 perfect exact collnearty multcollnearty K exact λ λ λ K K x+ x+ + x 0 0.. λ, λ, λk 0 0.. x perfect ntercorrelated λ λ λ x+ x+ + KxK + v 0 0.. v 3 y β + β

More information

Lecture 6 More on Complete Randomized Block Design (RBD)

Lecture 6 More on Complete Randomized Block Design (RBD) Lecture 6 More on Complete Randomzed Block Desgn (RBD) Multple test Multple test The multple comparsons or multple testng problem occurs when one consders a set of statstcal nferences smultaneously. For

More information

3.1 Expectation of Functions of Several Random Variables. )' be a k-dimensional discrete or continuous random vector, with joint PMF p (, E X E X1 E X

3.1 Expectation of Functions of Several Random Variables. )' be a k-dimensional discrete or continuous random vector, with joint PMF p (, E X E X1 E X Statstcs 1: Probablty Theory II 37 3 EPECTATION OF SEVERAL RANDOM VARIABLES As n Probablty Theory I, the nterest n most stuatons les not on the actual dstrbuton of a random vector, but rather on a number

More information

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics ECOOMICS 35*-A Md-Term Exam -- Fall Term 000 Page of 3 pages QUEE'S UIVERSITY AT KIGSTO Department of Economcs ECOOMICS 35* - Secton A Introductory Econometrcs Fall Term 000 MID-TERM EAM ASWERS MG Abbott

More information

Professor Chris Murray. Midterm Exam

Professor Chris Murray. Midterm Exam Econ 7 Econometrcs Sprng 4 Professor Chrs Murray McElhnney D cjmurray@uh.edu Mdterm Exam Wrte your answers on one sde of the blank whte paper that I have gven you.. Do not wrte your answers on ths exam.

More information

2016 Wiley. Study Session 2: Ethical and Professional Standards Application

2016 Wiley. Study Session 2: Ethical and Professional Standards Application 6 Wley Study Sesson : Ethcal and Professonal Standards Applcaton LESSON : CORRECTION ANALYSIS Readng 9: Correlaton and Regresson LOS 9a: Calculate and nterpret a sample covarance and a sample correlaton

More information

ECONOMETRICS - FINAL EXAM, 3rd YEAR (GECO & GADE)

ECONOMETRICS - FINAL EXAM, 3rd YEAR (GECO & GADE) ECONOMETRICS - FINAL EXAM, 3rd YEAR (GECO & GADE) June 7, 016 15:30 Frst famly name: Name: DNI/ID: Moble: Second famly Name: GECO/GADE: Instructor: E-mal: Queston 1 A B C Blank Queston A B C Blank Queston

More information

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation Statstcs for Managers Usng Mcrosoft Excel/SPSS Chapter 13 The Smple Lnear Regresson Model and Correlaton 1999 Prentce-Hall, Inc. Chap. 13-1 Chapter Topcs Types of Regresson Models Determnng the Smple Lnear

More information

Number of cases Number of factors Number of covariates Number of levels of factor i. Value of the dependent variable for case k

Number of cases Number of factors Number of covariates Number of levels of factor i. Value of the dependent variable for case k ANOVA Model and Matrx Computatons Notaton The followng notaton s used throughout ths chapter unless otherwse stated: N F CN Y Z j w W Number of cases Number of factors Number of covarates Number of levels

More information

A Robust Method for Calculating the Correlation Coefficient

A Robust Method for Calculating the Correlation Coefficient A Robust Method for Calculatng the Correlaton Coeffcent E.B. Nven and C. V. Deutsch Relatonshps between prmary and secondary data are frequently quantfed usng the correlaton coeffcent; however, the tradtonal

More information

UNIVERSITY OF TORONTO Faculty of Arts and Science. December 2005 Examinations STA437H1F/STA1005HF. Duration - 3 hours

UNIVERSITY OF TORONTO Faculty of Arts and Science. December 2005 Examinations STA437H1F/STA1005HF. Duration - 3 hours UNIVERSITY OF TORONTO Faculty of Arts and Scence December 005 Examnatons STA47HF/STA005HF Duraton - hours AIDS ALLOWED: (to be suppled by the student) Non-programmable calculator One handwrtten 8.5'' x

More information

January Examinations 2015

January Examinations 2015 24/5 Canddates Only January Examnatons 25 DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR STUDENT CANDIDATE NO.. Department Module Code Module Ttle Exam Duraton (n words)

More information

Chapter 7 Generalized and Weighted Least Squares Estimation. In this method, the deviation between the observed and expected values of

Chapter 7 Generalized and Weighted Least Squares Estimation. In this method, the deviation between the observed and expected values of Chapter 7 Generalzed and Weghted Least Squares Estmaton The usual lnear regresson model assumes that all the random error components are dentcally and ndependently dstrbuted wth constant varance. When

More information

Correlation and Regression

Correlation and Regression Correlaton and Regresson otes prepared by Pamela Peterson Drake Index Basc terms and concepts... Smple regresson...5 Multple Regresson...3 Regresson termnology...0 Regresson formulas... Basc terms and

More information

LINEAR REGRESSION ANALYSIS. MODULE VIII Lecture Indicator Variables

LINEAR REGRESSION ANALYSIS. MODULE VIII Lecture Indicator Variables LINEAR REGRESSION ANALYSIS MODULE VIII Lecture - 7 Indcator Varables Dr. Shalabh Department of Maematcs and Statstcs Indan Insttute of Technology Kanpur Indcator varables versus quanttatve explanatory

More information

j) = 1 (note sigma notation) ii. Continuous random variable (e.g. Normal distribution) 1. density function: f ( x) 0 and f ( x) dx = 1

j) = 1 (note sigma notation) ii. Continuous random variable (e.g. Normal distribution) 1. density function: f ( x) 0 and f ( x) dx = 1 Random varables Measure of central tendences and varablty (means and varances) Jont densty functons and ndependence Measures of assocaton (covarance and correlaton) Interestng result Condtonal dstrbutons

More information

Kernel Methods and SVMs Extension

Kernel Methods and SVMs Extension Kernel Methods and SVMs Extenson The purpose of ths document s to revew materal covered n Machne Learnng 1 Supervsed Learnng regardng support vector machnes (SVMs). Ths document also provdes a general

More information

Chapter 5. Solution of System of Linear Equations. Module No. 6. Solution of Inconsistent and Ill Conditioned Systems

Chapter 5. Solution of System of Linear Equations. Module No. 6. Solution of Inconsistent and Ill Conditioned Systems Numercal Analyss by Dr. Anta Pal Assstant Professor Department of Mathematcs Natonal Insttute of Technology Durgapur Durgapur-713209 emal: anta.bue@gmal.com 1 . Chapter 5 Soluton of System of Lnear Equatons

More information

ANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U)

ANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U) Econ 413 Exam 13 H ANSWERS Settet er nndelt 9 deloppgaver, A,B,C, som alle anbefales å telle lkt for å gøre det ltt lettere å stå. Svar er gtt . Unfortunately, there s a prntng error n the hnt of

More information

ANOVA. The Observations y ij

ANOVA. The Observations y ij ANOVA Stands for ANalyss Of VArance But t s a test of dfferences n means The dea: The Observatons y j Treatment group = 1 = 2 = k y 11 y 21 y k,1 y 12 y 22 y k,2 y 1, n1 y 2, n2 y k, nk means: m 1 m 2

More information

[The following data appear in Wooldridge Q2.3.] The table below contains the ACT score and college GPA for eight college students.

[The following data appear in Wooldridge Q2.3.] The table below contains the ACT score and college GPA for eight college students. PPOL 59-3 Problem Set Exercses n Smple Regresson Due n class /8/7 In ths problem set, you are asked to compute varous statstcs by hand to gve you a better sense of the mechancs of the Pearson correlaton

More information

Basic Business Statistics, 10/e

Basic Business Statistics, 10/e Chapter 13 13-1 Basc Busness Statstcs 11 th Edton Chapter 13 Smple Lnear Regresson Basc Busness Statstcs, 11e 009 Prentce-Hall, Inc. Chap 13-1 Learnng Objectves In ths chapter, you learn: How to use regresson

More information

Here is the rationale: If X and y have a strong positive relationship to one another, then ( x x) will tend to be positive when ( y y)

Here is the rationale: If X and y have a strong positive relationship to one another, then ( x x) will tend to be positive when ( y y) Secton 1.5 Correlaton In the prevous sectons, we looked at regresson and the value r was a measurement of how much of the varaton n y can be attrbuted to the lnear relatonshp between y and x. In ths secton,

More information

If we apply least squares to the transformed data we obtain. which yields the generalized least squares estimator of β, i.e.,

If we apply least squares to the transformed data we obtain. which yields the generalized least squares estimator of β, i.e., Econ 388 R. Butler 04 revsons lecture 6 WLS I. The Matrx Verson of Heteroskedastcty To llustrate ths n general, consder an error term wth varance-covarance matrx a n-by-n, nxn, matrx denoted as, nstead

More information

Chapter 12 Analysis of Covariance

Chapter 12 Analysis of Covariance Chapter Analyss of Covarance Any scentfc experment s performed to know somethng that s unknown about a group of treatments and to test certan hypothess about the correspondng treatment effect When varablty

More information

However, since P is a symmetric idempotent matrix, of P are either 0 or 1 [Eigen-values

However, since P is a symmetric idempotent matrix, of P are either 0 or 1 [Eigen-values Fall 007 Soluton to Mdterm Examnaton STAT 7 Dr. Goel. [0 ponts] For the general lnear model = X + ε, wth uncorrelated errors havng mean zero and varance σ, suppose that the desgn matrx X s not necessarly

More information

The Granular Origins of Aggregate Fluctuations : Supplementary Material

The Granular Origins of Aggregate Fluctuations : Supplementary Material The Granular Orgns of Aggregate Fluctuatons : Supplementary Materal Xaver Gabax October 12, 2010 Ths onlne appendx ( presents some addtonal emprcal robustness checks ( descrbes some econometrc complements

More information

Available online Journal of Chemical and Pharmaceutical Research, 2014, 6(5): Research Article

Available online   Journal of Chemical and Pharmaceutical Research, 2014, 6(5): Research Article Avalable onlne www.ocpr.com Journal of Chemcal and Pharmaceutcal Research 4 6(5):7-76 Research Artcle ISSN : 975-7384 CODEN(USA) : JCPRC5 Stud on relatonshp between nvestment n scence and technolog and

More information

Chapter 3 Describing Data Using Numerical Measures

Chapter 3 Describing Data Using Numerical Measures Chapter 3 Student Lecture Notes 3-1 Chapter 3 Descrbng Data Usng Numercal Measures Fall 2006 Fundamentals of Busness Statstcs 1 Chapter Goals To establsh the usefulness of summary measures of data. The

More information

STAT 511 FINAL EXAM NAME Spring 2001

STAT 511 FINAL EXAM NAME Spring 2001 STAT 5 FINAL EXAM NAME Sprng Instructons: Ths s a closed book exam. No notes or books are allowed. ou may use a calculator but you are not allowed to store notes or formulas n the calculator. Please wrte

More information

MD. LUTFOR RAHMAN 1 AND KALIPADA SEN 2 Abstract

MD. LUTFOR RAHMAN 1 AND KALIPADA SEN 2 Abstract ISSN 058-71 Bangladesh J. Agrl. Res. 34(3) : 395-401, September 009 PROBLEMS OF USUAL EIGHTED ANALYSIS OF VARIANCE (ANOVA) IN RANDOMIZED BLOCK DESIGN (RBD) ITH MORE THAN ONE OBSERVATIONS PER CELL HEN ERROR

More information

a. (All your answers should be in the letter!

a. (All your answers should be in the letter! Econ 301 Blkent Unversty Taskn Econometrcs Department of Economcs Md Term Exam I November 8, 015 Name For each hypothess testng n the exam complete the followng steps: Indcate the test statstc, ts crtcal

More information

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Analyss of Varance and Desgn of Exerments-I MODULE III LECTURE - 2 EXPERIMENTAL DESIGN MODELS Dr. Shalabh Deartment of Mathematcs and Statstcs Indan Insttute of Technology Kanur 2 We consder the models

More information

Lecture 2: Prelude to the big shrink

Lecture 2: Prelude to the big shrink Lecture 2: Prelude to the bg shrnk Last tme A slght detour wth vsualzaton tools (hey, t was the frst day... why not start out wth somethng pretty to look at?) Then, we consdered a smple 120a-style regresson

More information

/ n ) are compared. The logic is: if the two

/ n ) are compared. The logic is: if the two STAT C141, Sprng 2005 Lecture 13 Two sample tests One sample tests: examples of goodness of ft tests, where we are testng whether our data supports predctons. Two sample tests: called as tests of ndependence

More information