An Algorithm to Estimate the Two-Way Fixed Effect Model Paulo Somaini Frank Wolak

Size: px
Start display at page:

Download "An Algorithm to Estimate the Two-Way Fixed Effect Model Paulo Somaini Frank Wolak"

Transcription

1 An Algorthm to Estmate the Two-Way Fxed Effect Model Paulo Soman Frank Wolak May 2014 Abstract. We present an algorthm to estmate the two-way fxed effect lnear model. The algorthm reles on the Frsch-Waugh-Lovell Theorem and apples to ordnary least squares (OLS, two-stage least squares (TSLS and generalzed method of moments estmators (GMM. The coeffcents of nterest are computed usng the resduals from the projecton of all varables on the two sets of fxed-effects. Our algorthm has three mportant advantages. Frst, t manages memory and computatonal resources effcently whch speeds up the computaton of the estmates. Second, t allows the researcher to estmate multple specfcatons usng the same set structure of fxed effects at a very low computatonal cost. Thrd, the asymptotc varance of the parameters of nterest can be consstently estmated usng standard routnes on the resdualzed data. 1 Introducton Large data sets allow researchers to obtan precse estmates even after controllng for dfferent sources of heterogenety. It s often the case that approprately controllng for heterogenety requres ncludng two sets of hgh-dmensonal fxed effects. For example, consder estmatng resdental electrcty demand usng daly consumpton from a panel of 1000 households over a 5-year perod. Half of the households were exposed to real-tme prcng whle the other half faced a regular twopart tarff. It seems approprate to nclude households fxed effects that capture heterogenety n electrcty consumpton habts, and day-of-sample fxed effects that capture common unobserved tme-specfc demand shocks (e.g., due to varyng whether condtons. If each consumer s observed 1826 tmes, the data would contan 1.8 mllon observatons. Gven memory constrants, creatng a set of dummes for ether household or day effects s not practcal and may not be feasble at all. In ths paper we propose a feasble algorthm that computes the effect of the varables of nterest (e.g., exposure to tme-based prcng and prce elastctes managng memory and computatonal power effcently. The algorthm reles on the Frsch Waugh Lovell Theorem. The coeffcents of nterest are computed by OLS, TSLS or GMM usng the resduals from the projecton of all varables on the two sets of fxed-effects. Ths procedure s trval n balanced panels wth equally-weghted Authors afflatons: Department of Economcs, MIT, Cambrdge, Massachusetts, U.S.A. e-mal: psoman@mt.edu Department of Economcs, Stanford Unversty, Stanford, Calforna, U.S.A. e-mal: wolak@stanford.edu 1

2 observatons but t can be qute complcated n cases where observatons are not equally weghted or the panel s unbalanced. The algorthm n ths paper s specally suted for the latter case. Let N be the number of households/groups and T be the number of perods. The data conssts of approxmately N T observatons. Unbalanced panels may have less observatons; conversely, f a groups s observed more than once n each tme perod the number of observatons may be larger. Constructng an addtonal set of dummes wll requre to add mn (N, T 1 varables. Ths s feasble only f the memory s able to store an array wth dmensons N T (mn (N, T 1. We propose an algorthm where ths s not necessary. The frst step of the algorthm computes a set of three matrces that characterze the structure of fxed effects gven a partcular sample. The frst matrx s N by T wth typcal element equal to the sum of weghts for observatons that share a partcular par of fxed effects. The other two matrces can be computed from the frst one and are used repeatedly n the subsequent steps. The sngle most computatonally ntensve operaton n ths step s nvertng a (mn (N, T 1 by (mn (N, T 1 matrx. The second step of the algorthm obtans the resduals of the projecton of each of the varables on the two sets of fxed effects. Ths step s computatonally nexpensve as the matrx nverson requred to compute the projecton was done once and for all n the frst step. The thrd step s to estmate the desred specfcaton wth the resdualzed data usng standard statstcal routnes. By vrtue of the Frsch Waugh Lovell Theorem, the estmate of the asymptotc varance calculated wth the resdualzed data s consstent. Our algorthm has three mportant advantages relatve to exstent routnes. Frst, t manages memory and computatonal resources effcently whch speeds up the computaton of the estmates. Second, t allows the researcher to estmate multple specfcatons usng the same set of dummes at a very low computatonal cost. The most computatonal ntensve operaton, nvertng the matrx n step one, has to be performed only once no matter how many explanatory varables are ncluded n the analyss. The results of step one can be stored and used over and over as the researcher adds more varables to the analyss and runs dfferent specfcatons. Thrd, the asymptotc varance of the parameters of nterest can be consstently estmated usng standard routnes for OLS, TSLS or GMM on the resdualzed data, ncludng the estmator robust to heteroskedastcty and wthngroup correlaton. Standard routnes avalable n statstcal software do not deal wth two-way fxed effect models effcently. Stata allows the user to absorb one set of fxed effects but requres generatng the a set of dummes for the other. In SAS, PROC PANEL has a TWOWAY opton that creates one set of dummes. Both procedures may requre more memory than typcally avalable. There are some useful user-generated algorthms that avod creatng dummes; they are specfcally desgned to deal wth stuatons where the panel structure s sparse, that s, cases where N and T are both very large but the number of observatons s order of magntudes lesser than N T. For example, 2 mllon wage observatons for N = 100, 000 ndvduals n T = 20, 000 frms. In these cases, the challenge s nvertng a T T matrx. In Stata, algorthms such as reg2hdfe and a2reg avod ths nverson relyng n teratve procedures. Ther drawback s that the results cannot be used to estmate dfferent specfcatons. The algorthm proposed here s specfcally desgned to deal wth 2

3 dense panel structures where nvertng a T T matrx s costly, but possble. The nverse matrx can be stored and used repeatedly when estmatng dfferent specfcatons. The rest of ths paper s organzed as follows. The next secton descrbes the panel data model wth two-way fxed effects and the OLS, TSLS, and GMM estmators wth ther correspondng asymptotc varance. Secton 3 descrbes the algorthm. Secton 4 compares ts performance wth other avalable methods. 2 The Panel Model Followng Arellano (1987, consder the model: y t = x tβ + e + h t + u t (t {1,..., T } ; {1,..., N}, where y t s the outcome of household n perod t, x t s a K 1 vector of ncluded varables, h t s a tme fxed effect and e s a group/household fxed effect. 1 The household effect e and the tme effect h t are unobservable and potentally correlated wth x t. There s a set of nstruments z t (L 1 vector, L K that s assumed to satsfy the followng exogenety condton: E (u t z 1,..., z T, e, h 1,..., h T = 0. The u t are assumed to be ndependently dstrbuted across groups (or ndvduals but no restrctons are placed on the form of the autocovarances for a gven group. E ( u u z 1,..., z T, e, h 1,..., h T = Ω Each observaton has a weght that can be related to the nverse of the probablty of beng sampled. If a par (, t s unobserved then w t = 0 (mssng data s assumed to occur at random. Let ỹ denote the vector of outcomes y t stacked by group and then ordered chronologcally. Smlarly, denote X be the matrx of stacked vectors x t and Z be the matrx of stacked vectors z t. D s a matrx of dummes for groups and H s a matrx of dummes for tme perods. w s the vector of weghts and ([ ũ s the ] vector of unobserved u t. Ether H or D has one of ts columns removed so that rank D, H = T + N 1. It s convenent to remove a column of H f T < N and a column of D f N < T. Let y = dag( wỹ, u = dag( wũ, X = dag( w X, Z = dag( w Z, D = dag( w D and H = dag( w H. The model can be wrtten as: Y = Xβ + De + Hh + u (1 Let M denote the annhlator matrx of S = [D, H]. M = I S (S S 1 S, M = M, MM = M and MS = 0. Denote y + = My, X + = MX, Z + = MZ. By the Frsch Waugh Lovell Theorem 1 The panel can be unbalanced. There can be unobserved pars (, t or pars (, t that are observed more than once. To keep notaton smple, we wll focus only on mbalances of the frst knd but the algorthm handles both. 3

4 (Frsch and Waugh, 1933; Lovell, 1963, 2008; Gles, 1984, the GMM estmator of β for a weghtng matrx Ŵ s: ˆβGMM = (X + Z + Ŵ Z + X + 1 (X + Z + Ŵ Z + y + = β + (X + Z + Ŵ Z + X + 1 (X + Z + Ŵ Z + y + If Ŵ = (Z+ Z + 1, ths estmator s the TSLS estmator; f K = L, t s the nstrumental varables estmator; and f X = Z, t s the OLS estmator. ˆβ OLS = ( X + X + 1 ( X + y + = β + ( X + X + 1 ( X + u Under approprate regularty condtons: N ( ˆβ β N ( 0, J 1 V J 1, where J = plm N N 1 ( X + Z + Ŵ Z + X + ( [ N V = plm N N 1 X + Z + ( Ŵ Z + Ω Z + =1 ] (Ŵ Z + X + and Ω = dag( w Ω dag( w. Consstent estmates of the asymptotc varance can be obtaned applyng standard statstcal routnes to the resdualzed data (y +, X +, Z +. Consder for example the OLS estmator. Its asymptotc dstrbuton specalzes to: N ( ˆβ β N ( 0, J 1 V J 1 J = plm N N 1 ( X + X + V = plm N N 1 N =1 ( X + Ω X + ( Arellano (1987 consders three estmators of avar ˆβOLS. Let u + = y + X + estmator s robust to heteroskedastcty and wthn-group correlaton: ˆβ OLS. The frst ( avar 1 ˆβOLS = ( ( X + X + N 1 =1 X + u + u+ X + (X + X + 1 Ths estmator can be calculated clusterng standard errors at the group level. 2 (2 The second estmator wll produce consstent standard errors f x t and u t are fourth order ndependent and f dsturbances are homoskedastc: Ω = Ω for all : ( avar 2 ˆβOLS = ( ( X + X + N 1 X + ˆΩ + X+ 2 In Stata: reg plus y plus x*, vce(cluster h =1 (X + X + 1 (3 4

5 where ˆΩ + = N 1 N =1 u + u+. (4 The thrd estmator wll produce consstent standard errors under the classcal assumpton Ω = σ 2 I: avar 3 ( ˆβOLS = ˆσ 2 ( X + X + 1, (5 where ˆσ 2 u + u + = L N T K. and L s the number of observatons. Ths estmator can be calculated runnng the regresson of y + on X + and multplyng the estmate of the asymptotc varance of ˆβ L K OLS by L N T K. These estmators are obtaned applyng standard routnes to the resdualzed data (y +, X +. 3 The Algorthm In balanced panels wth constants weghts n one of the two dmensons w t = w or w t = w t the computaton of ˆβ and ts asymptotc varance s straghtforward. y + s obtaned subtractng the weghted mean of ỹ along both dmensons: where y + t = ỹ t ȳ ȳ t t ȳ = w tỹ t t w and ȳ t = t w t (ỹ t ȳ w t X + and Z + are computed followng the same procedure. Ths heurstc approach can be justfed formally. Premultplyng the orgnal model n equaton (1 frst by M D, the annhlator of D, and then by M H, the annhlator of H, results n: M H M D Y = M H M D Xβ + M H M D Hh + u (6 If the panel s balanced and weghts are constant n t or n, then M H M D = M H or M H M D = 0. In ether case, M H M D H = 0, and the transformed model does not depend on the fxed effects. If the panel s unbalanced or the group weghts vary, 0 M H M D M H and the transformaton M H M D does not elmnate the fxed effects n matrx H. The fxed effects are annhlated only f the model s premultpled by M = I S (S S 1 S where S = [D, H]. The algorthm presented n ths paper s specfcally desgned to deal wth unbalanced panels and conssts of three steps: 1. Compute (S S 1. Ths step requres nvertng a (mn (N, T 1 by (mn (N, T 1 matrx and storng t along wth two N by T matrces. These three matrx contan all the nformaton requred to construct the annhlator matrx M. 2. Obtan (y +, X +, Z +.. 5

6 3. Use standard methods to estmate ˆβ and ts asymptotc varance by OLS, TSLS or GMM. Only the frst step s computatonally ntensve as t requres nvertng a potentally large matrx. Ths step only has to be performed once for a gven panel structure. The compuatonal cost of resdualzng varables and runnng dfferent specfcatons wth them s relatvely low. Computaton of (S S 1 Accordng the the defnton of S, ( S S [ 1 D D = D H D H H H D D s a dagonal matrx wth typcal dagonal element (, equal to t w t. Smlarly, H H s a dagonal matrx wth typcal dagonal element (t, t equal to w t. D H s a N-by-T matrx wth typcal element (, t equal to w t. Usng the formula for the nverse of a parttoned matrx: [ ] A B B C [ D D = D H ] 1 ] 1 D H H, (7 H A = C = ( (D D D H ( H H 1 1 H D (8 ( (H H H D ( D D 1 1 D H (9 B = AD H ( H H 1 ( = D D 1 D HC (10 If T < N, the frst step of the algorthm calculates and stores C, B, D H and (D D 1 (C s T 1 by T 1, D H and B are N by T 1 and D D s N by N but t s dagonal, so N numbers are stored. The only non-dagonal matrx that s nverted s (9. A can be calculated from the stored matrces by A = ( D D 1 ( + D D 1 D HCH D ( D D 1. (11 To economze computng power and memory, A an N by N matrx s not constructed. If N < T, ths step calculates and stores A, B, D H and (H H 1 (A s N 1 by N 1, D H and B are N 1 by T and H H s T by T but t s dagonal, so T numbers are stored. The only non-dagonal matrx that s nverted s (8. C can be calculated by C = ( H H 1 ( + H H 1 H DAD H ( H H 1. (12 However, C s never constructed or stored. Notce that the algorthm never constructs or stores a non-dagonal square matrx of sze equal to the greatest of T and N. Ths feature of the algorthm makes t more robust to memory constrants. 6

7 The resdualzed data (y +, X +, Z + We obtan the projecton coeffcents ˆδ = AD y + BT y ˆτ = B D y + CT y and compute y + = y Dˆδ T ˆτ. Varables x + k n X+, and z + k n Z+ are obtaned n the same way. If T < N, the matrx A s replaced by the expresson n (11. D y s a N-th order vector so AD y can be calculated wthout ever creatng A, or any other N by N matrx that s not dagonal: AD y = ( D D 1 ( D y ( D D 1 ( D H C ( H D ( D D 1 ( D y Smlarly, If T > N, CT y s calculated wthout ever creatng C, or any other T by T matrx that s not dagonal. Estmaton of the parameters and ther asymptotc varance The resdualzed data (y +, X +, Z + can be used as f t was the orgnal data n any statstcal package. In Stata, for example, reg plus y plus x*, vce(cluster h returns ˆβ OLS along wth ts asymptotc varance estmate that s robust to wthn group correlaton; vregress gmm plus y (plus x*=plus z*, wmatrx(cluster h returns ˆβ GMM where the weghtng matrx equal to N 1 ( N =1 Z + u + u+ Z + 1. Ŵ s set Stata also returns a consstent estmate of the asymptotc varance of ˆβ GMM. vregress 2sls plus y (plus x=plus z returns a two-stage least square estmate of β along wth a consstent estmate of ts asymptotc varance robust to heteroskedastcty. If the asymptotc varance s estmated by (5, whch s consstent under the classcal assumpton that the error terms are ndependently and dentcally dstrbuted (Ω = σ 2 I, then the the esmaton of ˆσ should be adjusted to account for the reduced degrees of freedom. In other words, the estmate of the asymptotc varance should be multpled by L K L N T K. 4 Comparatve Performance Analyss Ths secton presents a compares the performance of the proposed algorthm wth three exstng algorthms n Stata: a2reg, reg2hdfe and felsdvreg. The comparsons are not totally far as these algorthms have been desgned to work n sparse panel models that are typcal n labor market applcatons. They are stll the best avalable optons even for dense panels. We mplemented the algorthm n mata, the matrx language n Stata. The mplementaton contans two man programs. twowayset performs the frst step of the algorthm for a par of fxed 7

8 effects and stores the requred matrces. projvar performs the second steps and returns a set of resdualzed varables. For example, f hhd s a varable that contans a household dentfer and td contans an hour-of-sample dentfed, then twowayset hhd td, root("c:/abc" performs step one and stores the matrces n the fle C:/abc. If consumpton and hourprce are the endogenous and exogenous varables, respectvely, projvar consumpton hourprce, root("c:/abc", p(plus creates the varables plus consumpton plus hourprce that contan the resdualzed verson of the orgnal varables. 3 The actual estmaton s performed usng the bult-n commands regress or vregress on the resdualzed varables. The command a2reg, proposed by Ouazad (2008, uses a Conjugate gradent method to solve the mnmum squares problem and obtan ˆβ OLS. The algorthm never computes X X; therefore, t does not report an estmate of the asymptotc varance. The command reg2hdfe, proposed by Gumaraes and Portugal (2010, teratvely projects the dependent varable on the ndependent varables and the two sets of dummes. Whle computatonally ntensve, ths approach mposes mnmum memory requrements. The command felsdvreg, proposed by Cornelssen (2008, absorbs one set of fxed effects and constructs components of the OLS normal equatons usng the dentfer of the other set of fxed effects. Thus, t solves for ˆβ OLS wthout creatng a set of dummes or relyng on an teratve process. We generate an panel of N = 10 n households and T = 10 t tme perods and randomly drop 10 percent of the observatons. The resultng panel s unbalanced. We draw household-specfc and tme-specfc fxed effect from a standard normal. We also draw K covarates x and errors υ from ndpendent standard normals. The dependent varable y s: K y t = x tk + d + h t + υ t k=1 We estmate the parameters of the model usng our procedure (twoway and the three exstng algorthms and compare ther runnng tme. Notce that the panel structure s dfferent n each draw, so we are not takng advantage of the fact that our method allows to run the matrx nverson n step one just once. Table 1 summarzes the results for dfferent values of N, T and K. Table 1 shows that our procedure outperforms algorthms desgned for sparse panels. The only excepton s the case wth N = T = 10, 000 when twoway spends too much tme nvertng a 9, 999 9, 999 matrx. reg2hdfe performs the estmaton much faster. Even ths case s useful to llustrate one of the advantages of our procedure. The ncrease n computng tme from K = 2 to K = 10 s relatvely small compared to the ncrease observed for reg2hdfe. Ths s true also n the other cases where twoway ourperforms reg2hdfe. Our method only requres to nvert the matrx once ndependently of the number of explanatory varables K. Table 2 shows that the computng tme employed n step 1, whch performs the nverson, does not depend on K. The computng 3 See the documentaton to the Stata code for addtonal detals. The algorthm was also mplemented n Matlab and SAS to take advantage of some specfc features of those programs. The Matlab mplementaton uses sparse matrces. The SAS mplementaton never loads the raw data n the computer RAM memory. 8

9 Runnng Tme (n seconds N T K twoway reg2hdfe a2reg felsdvreg * >1d * >1d >1d >1d * >1d * >1d Table 1: Ths table compares the performance of each algorthm n Stata/SE 12.1 on an Intel Core CPU 3.40GHz wth 32 GB RAM operatng under 64-bt Wndows 7. N s the number of groups, T s the number of observatons per group, K s the number of explanatory varables. N T K Step 1 Step 2 Step 3 The column labeled as twoway shows the average computng tme of our algorthm measured n seconds. Columns reg2hdfe, a2reg and felsdvreg show computng tme for each alternatve algorthm. The astersk 1000 represents 100 a volaton 2 of the matrx 0.1 sze lmt 0.2 n Stata 0.0and >1d ndcates that the procedure1000 was stopped100 after one day

10 >1d * >1d * >1d Runnng Tme by Step (n seconds N T K Step 1 Step 2 Step Table 2: Ths table breaks down the computng tme spent n each of the steps of our algorthm. N s the number of groups, T s the number of observatons per group, K s the number of explanatory varables. Step 1 performs the matrx nverson and constructs the matrces that wll be needed later on. Step 2 resdualzes the dependent and explanatory varables. Step 3 use the command regress to compute the OLS estmator usng the resdualzed data. tme n step 2, whch projects each explanatory varables on the set of fxed effects, s ncreasng n K. Fnal Remarks We present an algorthm to estmate a two-way fxed effect lnear model. Whle exstent algorthms are desgned for sparse panels, ours works best n dense, but unbalanced panels. The algorthm reles on the Frsch-Waugh-Lovell Theorem and apples to ordnary least squares (OLS, two-stage least squares (TSLS and generalzed method of moments estmators (GMM. The coeffcents of nterest are computed usng the resduals from the projecton of all varables on the two sets of fxed-effects. Our algorthm has three mportant advantages. Frst, t manages memory and computatonal resources effcently whch speeds up the computaton of the estmates. Second, t allows the researcher to estmate multple specfcatons usng the same set structure of fxed effects at a very low computatonal cost. Thrd, the asymptotc varance of the parameters of nterest can be consstently estmated usng standard routnes on the resdualzed data. 10

11 References Arellano, M. (1987: Computng Robust Standard Errors for Wthn-Groups Estmators, Oxford Bulletn of Economcs and Statstcs, 49(4, Baum, C. F., M. E. Schaffer, and S. Stllman (2007: Enhanced routnes for nstrumental varables/gmm estmaton and testng, Stata Journal, 7(4, Cornelssen, T. (2008: The Stata command felsdvreg to ft a lnear model wth two hghdmensonal fxed effects, Stata Journal, 8(2, Frsch, R., and F. V. Waugh (1933: Partal Tme Regressons as Compared wth Indvdual Trends, Econometrca, 1(4, Gaure, S. (2013: OLS wth multple hgh dmensonal category varables, Computatonal Statstcs & Data Analyss, 66, Gles, D. E. A. (1984: Instrumental varables regressons nvolvng seasonal data, Economcs Letters, 14(4, Gumaraes, P., and P. Portugal (2010: A smple feasble procedure to ft models wth hgh-dmensonal fxed effects, Stata Journal, 10(4, 628. Hayash, F. (2000: Econometrcs. Prnceton Unversty Press, Prnceton. Lovell, M. C. (1963: Seasonal Adjustment of Economc Tme Seres and Multple Regresson Analyss, Journal of the Amercan Statstcal Assocaton, 58(304, (2008: A Smple Proof of the FWL Theorem, The Journal of Economc Educaton, 39(1, Ouazad, A. (2008: A2REG: Stata module to estmate models wth two fxed effects,. Wansbeek, T., and A. Kapteyn (1989: Estmaton of the error-components model wth ncomplete panels, Journal of Econometrcs, 41(3, Wooldrdge, J. M. (2010: Econometrc Analyss of Cross Secton and Panel Data. The MIT Press, Cambrdge, Mass, second edton edton edn. 11

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4)

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4) I. Classcal Assumptons Econ7 Appled Econometrcs Topc 3: Classcal Model (Studenmund, Chapter 4) We have defned OLS and studed some algebrac propertes of OLS. In ths topc we wll study statstcal propertes

More information

An Algorithm to Estimate the Two-Way Fixed Effects Model

An Algorithm to Estimate the Two-Way Fixed Effects Model J. Econom. Meth. 205; aop Practitioner s Corner Paulo Somaini * and Frank A. Wolak An Algorithm to Estimate the Two-Way Fixed Effects Model Abstract : We present an algorithm to estimate the two-way fixed

More information

Econometrics of Panel Data

Econometrics of Panel Data Econometrcs of Panel Data Jakub Mućk Meetng # 8 Jakub Mućk Econometrcs of Panel Data Meetng # 8 1 / 17 Outlne 1 Heterogenety n the slope coeffcents 2 Seemngly Unrelated Regresson (SUR) 3 Swamy s random

More information

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction ECONOMICS 5* -- NOTE (Summary) ECON 5* -- NOTE The Multple Classcal Lnear Regresson Model (CLRM): Specfcaton and Assumptons. Introducton CLRM stands for the Classcal Lnear Regresson Model. The CLRM s also

More information

x i1 =1 for all i (the constant ).

x i1 =1 for all i (the constant ). Chapter 5 The Multple Regresson Model Consder an economc model where the dependent varable s a functon of K explanatory varables. The economc model has the form: y = f ( x,x,..., ) xk Approxmate ths by

More information

Primer on High-Order Moment Estimators

Primer on High-Order Moment Estimators Prmer on Hgh-Order Moment Estmators Ton M. Whted July 2007 The Errors-n-Varables Model We wll start wth the classcal EIV for one msmeasured regressor. The general case s n Erckson and Whted Econometrc

More information

A Comparative Study for Estimation Parameters in Panel Data Model

A Comparative Study for Estimation Parameters in Panel Data Model A Comparatve Study for Estmaton Parameters n Panel Data Model Ahmed H. Youssef and Mohamed R. Abonazel hs paper examnes the panel data models when the regresson coeffcents are fxed random and mxed and

More information

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity LINEAR REGRESSION ANALYSIS MODULE IX Lecture - 30 Multcollnearty Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur 2 Remedes for multcollnearty Varous technques have

More information

Now we relax this assumption and allow that the error variance depends on the independent variables, i.e., heteroskedasticity

Now we relax this assumption and allow that the error variance depends on the independent variables, i.e., heteroskedasticity ECON 48 / WH Hong Heteroskedastcty. Consequences of Heteroskedastcty for OLS Assumpton MLR. 5: Homoskedastcty var ( u x ) = σ Now we relax ths assumpton and allow that the error varance depends on the

More information

Linear Regression Analysis: Terminology and Notation

Linear Regression Analysis: Terminology and Notation ECON 35* -- Secton : Basc Concepts of Regresson Analyss (Page ) Lnear Regresson Analyss: Termnology and Notaton Consder the generc verson of the smple (two-varable) lnear regresson model. It s represented

More information

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics ECOOMICS 35*-A Md-Term Exam -- Fall Term 000 Page of 3 pages QUEE'S UIVERSITY AT KIGSTO Department of Economcs ECOOMICS 35* - Secton A Introductory Econometrcs Fall Term 000 MID-TERM EAM ASWERS MG Abbott

More information

Comparison of Regression Lines

Comparison of Regression Lines STATGRAPHICS Rev. 9/13/2013 Comparson of Regresson Lnes Summary... 1 Data Input... 3 Analyss Summary... 4 Plot of Ftted Model... 6 Condtonal Sums of Squares... 6 Analyss Optons... 7 Forecasts... 8 Confdence

More information

Statistics for Economics & Business

Statistics for Economics & Business Statstcs for Economcs & Busness Smple Lnear Regresson Learnng Objectves In ths chapter, you learn: How to use regresson analyss to predct the value of a dependent varable based on an ndependent varable

More information

Number of cases Number of factors Number of covariates Number of levels of factor i. Value of the dependent variable for case k

Number of cases Number of factors Number of covariates Number of levels of factor i. Value of the dependent variable for case k ANOVA Model and Matrx Computatons Notaton The followng notaton s used throughout ths chapter unless otherwse stated: N F CN Y Z j w W Number of cases Number of factors Number of covarates Number of levels

More information

Chapter 11: Simple Linear Regression and Correlation

Chapter 11: Simple Linear Regression and Correlation Chapter 11: Smple Lnear Regresson and Correlaton 11-1 Emprcal Models 11-2 Smple Lnear Regresson 11-3 Propertes of the Least Squares Estmators 11-4 Hypothess Test n Smple Lnear Regresson 11-4.1 Use of t-tests

More information

Basically, if you have a dummy dependent variable you will be estimating a probability.

Basically, if you have a dummy dependent variable you will be estimating a probability. ECON 497: Lecture Notes 13 Page 1 of 1 Metropoltan State Unversty ECON 497: Research and Forecastng Lecture Notes 13 Dummy Dependent Varable Technques Studenmund Chapter 13 Bascally, f you have a dummy

More information

ECONOMETRICS II (ECO 2401S) University of Toronto. Department of Economics. Winter 2017 Instructor: Victor Aguirregabiria

ECONOMETRICS II (ECO 2401S) University of Toronto. Department of Economics. Winter 2017 Instructor: Victor Aguirregabiria ECOOMETRICS II ECO 40S Unversty of Toronto Department of Economcs Wnter 07 Instructor: Vctor Agurregabra SOLUTIO TO FIAL EXAM Tuesday, Aprl 8, 07 From :00pm-5:00pm 3 hours ISTRUCTIOS: - Ths s a closed-book

More information

Factor models with many assets: strong factors, weak factors, and the two-pass procedure

Factor models with many assets: strong factors, weak factors, and the two-pass procedure Factor models wth many assets: strong factors, weak factors, and the two-pass procedure Stanslav Anatolyev 1 Anna Mkusheva 2 1 CERGE-EI and NES 2 MIT December 2017 Stanslav Anatolyev and Anna Mkusheva

More information

This column is a continuation of our previous column

This column is a continuation of our previous column Comparson of Goodness of Ft Statstcs for Lnear Regresson, Part II The authors contnue ther dscusson of the correlaton coeffcent n developng a calbraton for quanttatve analyss. Jerome Workman Jr. and Howard

More information

Econ107 Applied Econometrics Topic 9: Heteroskedasticity (Studenmund, Chapter 10)

Econ107 Applied Econometrics Topic 9: Heteroskedasticity (Studenmund, Chapter 10) I. Defnton and Problems Econ7 Appled Econometrcs Topc 9: Heteroskedastcty (Studenmund, Chapter ) We now relax another classcal assumpton. Ths s a problem that arses often wth cross sectons of ndvduals,

More information

Econ Statistical Properties of the OLS estimator. Sanjaya DeSilva

Econ Statistical Properties of the OLS estimator. Sanjaya DeSilva Econ 39 - Statstcal Propertes of the OLS estmator Sanjaya DeSlva September, 008 1 Overvew Recall that the true regresson model s Y = β 0 + β 1 X + u (1) Applyng the OLS method to a sample of data, we estmate

More information

A Monte Carlo Study for Swamy s Estimate of Random Coefficient Panel Data Model

A Monte Carlo Study for Swamy s Estimate of Random Coefficient Panel Data Model A Monte Carlo Study for Swamy s Estmate of Random Coeffcent Panel Data Model Aman Mousa, Ahmed H. Youssef and Mohamed R. Abonazel Department of Appled Statstcs and Econometrcs, Instute of Statstcal Studes

More information

Systems of Equations (SUR, GMM, and 3SLS)

Systems of Equations (SUR, GMM, and 3SLS) Lecture otes on Advanced Econometrcs Takash Yamano Fall Semester 4 Lecture 4: Sstems of Equatons (SUR, MM, and 3SLS) Seemngl Unrelated Regresson (SUR) Model Consder a set of lnear equatons: $ + ɛ $ + ɛ

More information

Testing for seasonal unit roots in heterogeneous panels

Testing for seasonal unit roots in heterogeneous panels Testng for seasonal unt roots n heterogeneous panels Jesus Otero * Facultad de Economía Unversdad del Rosaro, Colomba Jeremy Smth Department of Economcs Unversty of arwck Monca Gulett Aston Busness School

More information

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6 Department of Quanttatve Methods & Informaton Systems Tme Seres and Ther Components QMIS 30 Chapter 6 Fall 00 Dr. Mohammad Zanal These sldes were modfed from ther orgnal source for educatonal purpose only.

More information

APPROXIMATE PRICES OF BASKET AND ASIAN OPTIONS DUPONT OLIVIER. Premia 14

APPROXIMATE PRICES OF BASKET AND ASIAN OPTIONS DUPONT OLIVIER. Premia 14 APPROXIMAE PRICES OF BASKE AND ASIAN OPIONS DUPON OLIVIER Prema 14 Contents Introducton 1 1. Framewor 1 1.1. Baset optons 1.. Asan optons. Computng the prce 3. Lower bound 3.1. Closed formula for the prce

More information

since [1-( 0+ 1x1i+ 2x2 i)] [ 0+ 1x1i+ assumed to be a reasonable approximation

since [1-( 0+ 1x1i+ 2x2 i)] [ 0+ 1x1i+ assumed to be a reasonable approximation Econ 388 R. Butler 204 revsons Lecture 4 Dummy Dependent Varables I. Lnear Probablty Model: the Regresson model wth a dummy varables as the dependent varable assumpton, mplcaton regular multple regresson

More information

Chapter 9: Statistical Inference and the Relationship between Two Variables

Chapter 9: Statistical Inference and the Relationship between Two Variables Chapter 9: Statstcal Inference and the Relatonshp between Two Varables Key Words The Regresson Model The Sample Regresson Equaton The Pearson Correlaton Coeffcent Learnng Outcomes After studyng ths chapter,

More information

Lecture 4 Hypothesis Testing

Lecture 4 Hypothesis Testing Lecture 4 Hypothess Testng We may wsh to test pror hypotheses about the coeffcents we estmate. We can use the estmates to test whether the data rejects our hypothess. An example mght be that we wsh to

More information

Computation of Higher Order Moments from Two Multinomial Overdispersion Likelihood Models

Computation of Higher Order Moments from Two Multinomial Overdispersion Likelihood Models Computaton of Hgher Order Moments from Two Multnomal Overdsperson Lkelhood Models BY J. T. NEWCOMER, N. K. NEERCHAL Department of Mathematcs and Statstcs, Unversty of Maryland, Baltmore County, Baltmore,

More information

Chapter 13: Multiple Regression

Chapter 13: Multiple Regression Chapter 13: Multple Regresson 13.1 Developng the multple-regresson Model The general model can be descrbed as: It smplfes for two ndependent varables: The sample ft parameter b 0, b 1, and b are used to

More information

Negative Binomial Regression

Negative Binomial Regression STATGRAPHICS Rev. 9/16/2013 Negatve Bnomal Regresson Summary... 1 Data Input... 3 Statstcal Model... 3 Analyss Summary... 4 Analyss Optons... 7 Plot of Ftted Model... 8 Observed Versus Predcted... 10 Predctons...

More information

4DVAR, according to the name, is a four-dimensional variational method.

4DVAR, according to the name, is a four-dimensional variational method. 4D-Varatonal Data Assmlaton (4D-Var) 4DVAR, accordng to the name, s a four-dmensonal varatonal method. 4D-Var s actually a drect generalzaton of 3D-Var to handle observatons that are dstrbuted n tme. The

More information

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers Psychology 282 Lecture #24 Outlne Regresson Dagnostcs: Outlers In an earler lecture we studed the statstcal assumptons underlyng the regresson model, ncludng the followng ponts: Formal statement of assumptons.

More information

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA 4 Analyss of Varance (ANOVA) 5 ANOVA 51 Introducton ANOVA ANOVA s a way to estmate and test the means of multple populatons We wll start wth one-way ANOVA If the populatons ncluded n the study are selected

More information

Durban Watson for Testing the Lack-of-Fit of Polynomial Regression Models without Replications

Durban Watson for Testing the Lack-of-Fit of Polynomial Regression Models without Replications Durban Watson for Testng the Lack-of-Ft of Polynomal Regresson Models wthout Replcatons Ruba A. Alyaf, Maha A. Omar, Abdullah A. Al-Shha ralyaf@ksu.edu.sa, maomar@ksu.edu.sa, aalshha@ksu.edu.sa Department

More information

Chapter 6. Supplemental Text Material

Chapter 6. Supplemental Text Material Chapter 6. Supplemental Text Materal S6-. actor Effect Estmates are Least Squares Estmates We have gven heurstc or ntutve explanatons of how the estmates of the factor effects are obtaned n the textboo.

More information

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation Statstcs for Managers Usng Mcrosoft Excel/SPSS Chapter 13 The Smple Lnear Regresson Model and Correlaton 1999 Prentce-Hall, Inc. Chap. 13-1 Chapter Topcs Types of Regresson Models Determnng the Smple Lnear

More information

Composite Hypotheses testing

Composite Hypotheses testing Composte ypotheses testng In many hypothess testng problems there are many possble dstrbutons that can occur under each of the hypotheses. The output of the source s a set of parameters (ponts n a parameter

More information

Chapter 8 Indicator Variables

Chapter 8 Indicator Variables Chapter 8 Indcator Varables In general, e explanatory varables n any regresson analyss are assumed to be quanttatve n nature. For example, e varables lke temperature, dstance, age etc. are quanttatve n

More information

LECTURE 9 CANONICAL CORRELATION ANALYSIS

LECTURE 9 CANONICAL CORRELATION ANALYSIS LECURE 9 CANONICAL CORRELAION ANALYSIS Introducton he concept of canoncal correlaton arses when we want to quantfy the assocatons between two sets of varables. For example, suppose that the frst set of

More information

x = , so that calculated

x = , so that calculated Stat 4, secton Sngle Factor ANOVA notes by Tm Plachowsk n chapter 8 we conducted hypothess tests n whch we compared a sngle sample s mean or proporton to some hypotheszed value Chapter 9 expanded ths to

More information

Lecture 2: Prelude to the big shrink

Lecture 2: Prelude to the big shrink Lecture 2: Prelude to the bg shrnk Last tme A slght detour wth vsualzaton tools (hey, t was the frst day... why not start out wth somethng pretty to look at?) Then, we consdered a smple 120a-style regresson

More information

ANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U)

ANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U) Econ 413 Exam 13 H ANSWERS Settet er nndelt 9 deloppgaver, A,B,C, som alle anbefales å telle lkt for å gøre det ltt lettere å stå. Svar er gtt . Unfortunately, there s a prntng error n the hnt of

More information

Chapter 5. Solution of System of Linear Equations. Module No. 6. Solution of Inconsistent and Ill Conditioned Systems

Chapter 5. Solution of System of Linear Equations. Module No. 6. Solution of Inconsistent and Ill Conditioned Systems Numercal Analyss by Dr. Anta Pal Assstant Professor Department of Mathematcs Natonal Insttute of Technology Durgapur Durgapur-713209 emal: anta.bue@gmal.com 1 . Chapter 5 Soluton of System of Lnear Equatons

More information

DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR. Introductory Econometrics 1 hour 30 minutes

DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR. Introductory Econometrics 1 hour 30 minutes 25/6 Canddates Only January Examnatons 26 Student Number: Desk Number:...... DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR Department Module Code Module Ttle Exam Duraton

More information

Parametric fractional imputation for missing data analysis. Jae Kwang Kim Survey Working Group Seminar March 29, 2010

Parametric fractional imputation for missing data analysis. Jae Kwang Kim Survey Working Group Seminar March 29, 2010 Parametrc fractonal mputaton for mssng data analyss Jae Kwang Km Survey Workng Group Semnar March 29, 2010 1 Outlne Introducton Proposed method Fractonal mputaton Approxmaton Varance estmaton Multple mputaton

More information

Interval Estimation in the Classical Normal Linear Regression Model. 1. Introduction

Interval Estimation in the Classical Normal Linear Regression Model. 1. Introduction ECONOMICS 35* -- NOTE 7 ECON 35* -- NOTE 7 Interval Estmaton n the Classcal Normal Lnear Regresson Model Ths note outlnes the basc elements of nterval estmaton n the Classcal Normal Lnear Regresson Model

More information

Chapter 7 Generalized and Weighted Least Squares Estimation. In this method, the deviation between the observed and expected values of

Chapter 7 Generalized and Weighted Least Squares Estimation. In this method, the deviation between the observed and expected values of Chapter 7 Generalzed and Weghted Least Squares Estmaton The usual lnear regresson model assumes that all the random error components are dentcally and ndependently dstrbuted wth constant varance. When

More information

Maximum Likelihood Estimation of Binary Dependent Variables Models: Probit and Logit. 1. General Formulation of Binary Dependent Variables Models

Maximum Likelihood Estimation of Binary Dependent Variables Models: Probit and Logit. 1. General Formulation of Binary Dependent Variables Models ECO 452 -- OE 4: Probt and Logt Models ECO 452 -- OE 4 Maxmum Lkelhood Estmaton of Bnary Dependent Varables Models: Probt and Logt hs note demonstrates how to formulate bnary dependent varables models

More information

Chapter 3. Two-Variable Regression Model: The Problem of Estimation

Chapter 3. Two-Variable Regression Model: The Problem of Estimation Chapter 3. Two-Varable Regresson Model: The Problem of Estmaton Ordnary Least Squares Method (OLS) Recall that, PRF: Y = β 1 + β X + u Thus, snce PRF s not drectly observable, t s estmated by SRF; that

More information

The Geometry of Logit and Probit

The Geometry of Logit and Probit The Geometry of Logt and Probt Ths short note s meant as a supplement to Chapters and 3 of Spatal Models of Parlamentary Votng and the notaton and reference to fgures n the text below s to those two chapters.

More information

Chapter 15 - Multiple Regression

Chapter 15 - Multiple Regression Chapter - Multple Regresson Chapter - Multple Regresson Multple Regresson Model The equaton that descrbes how the dependent varable y s related to the ndependent varables x, x,... x p and an error term

More information

Statistics for Business and Economics

Statistics for Business and Economics Statstcs for Busness and Economcs Chapter 11 Smple Regresson Copyrght 010 Pearson Educaton, Inc. Publshng as Prentce Hall Ch. 11-1 11.1 Overvew of Lnear Models n An equaton can be ft to show the best lnear

More information

2016 Wiley. Study Session 2: Ethical and Professional Standards Application

2016 Wiley. Study Session 2: Ethical and Professional Standards Application 6 Wley Study Sesson : Ethcal and Professonal Standards Applcaton LESSON : CORRECTION ANALYSIS Readng 9: Correlaton and Regresson LOS 9a: Calculate and nterpret a sample covarance and a sample correlaton

More information

Difference Equations

Difference Equations Dfference Equatons c Jan Vrbk 1 Bascs Suppose a sequence of numbers, say a 0,a 1,a,a 3,... s defned by a certan general relatonshp between, say, three consecutve values of the sequence, e.g. a + +3a +1

More information

Chapter 2 - The Simple Linear Regression Model S =0. e i is a random error. S β2 β. This is a minimization problem. Solution is a calculus exercise.

Chapter 2 - The Simple Linear Regression Model S =0. e i is a random error. S β2 β. This is a minimization problem. Solution is a calculus exercise. Chapter - The Smple Lnear Regresson Model The lnear regresson equaton s: where y + = β + β e for =,..., y and are observable varables e s a random error How can an estmaton rule be constructed for the

More information

Lecture 3 Stat102, Spring 2007

Lecture 3 Stat102, Spring 2007 Lecture 3 Stat0, Sprng 007 Chapter 3. 3.: Introducton to regresson analyss Lnear regresson as a descrptve technque The least-squares equatons Chapter 3.3 Samplng dstrbuton of b 0, b. Contnued n net lecture

More information

Limited Dependent Variables and Panel Data. Tibor Hanappi

Limited Dependent Variables and Panel Data. Tibor Hanappi Lmted Dependent Varables and Panel Data Tbor Hanapp 30.06.2010 Lmted Dependent Varables Dscrete: Varables that can take onl a countable number of values Censored/Truncated: Data ponts n some specfc range

More information

e i is a random error

e i is a random error Chapter - The Smple Lnear Regresson Model The lnear regresson equaton s: where + β + β e for,..., and are observable varables e s a random error How can an estmaton rule be constructed for the unknown

More information

CSci 6974 and ECSE 6966 Math. Tech. for Vision, Graphics and Robotics Lecture 21, April 17, 2006 Estimating A Plane Homography

CSci 6974 and ECSE 6966 Math. Tech. for Vision, Graphics and Robotics Lecture 21, April 17, 2006 Estimating A Plane Homography CSc 6974 and ECSE 6966 Math. Tech. for Vson, Graphcs and Robotcs Lecture 21, Aprl 17, 2006 Estmatng A Plane Homography Overvew We contnue wth a dscusson of the major ssues, usng estmaton of plane projectve

More information

January Examinations 2015

January Examinations 2015 24/5 Canddates Only January Examnatons 25 DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR STUDENT CANDIDATE NO.. Department Module Code Module Ttle Exam Duraton (n words)

More information

LOW BIAS INTEGRATED PATH ESTIMATORS. James M. Calvin

LOW BIAS INTEGRATED PATH ESTIMATORS. James M. Calvin Proceedngs of the 007 Wnter Smulaton Conference S G Henderson, B Bller, M-H Hseh, J Shortle, J D Tew, and R R Barton, eds LOW BIAS INTEGRATED PATH ESTIMATORS James M Calvn Department of Computer Scence

More information

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Analyss of Varance and Desgn of Exerments-I MODULE III LECTURE - 2 EXPERIMENTAL DESIGN MODELS Dr. Shalabh Deartment of Mathematcs and Statstcs Indan Insttute of Technology Kanur 2 We consder the models

More information

SIO 224. m(r) =(ρ(r),k s (r),µ(r))

SIO 224. m(r) =(ρ(r),k s (r),µ(r)) SIO 224 1. A bref look at resoluton analyss Here s some background for the Masters and Gubbns resoluton paper. Global Earth models are usually found teratvely by assumng a startng model and fndng small

More information

Efficient nonresponse weighting adjustment using estimated response probability

Efficient nonresponse weighting adjustment using estimated response probability Effcent nonresponse weghtng adjustment usng estmated response probablty Jae Kwang Km Department of Appled Statstcs, Yonse Unversty, Seoul, 120-749, KOREA Key Words: Regresson estmator, Propensty score,

More information

The Granular Origins of Aggregate Fluctuations : Supplementary Material

The Granular Origins of Aggregate Fluctuations : Supplementary Material The Granular Orgns of Aggregate Fluctuatons : Supplementary Materal Xaver Gabax October 12, 2010 Ths onlne appendx ( presents some addtonal emprcal robustness checks ( descrbes some econometrc complements

More information

STAT 511 FINAL EXAM NAME Spring 2001

STAT 511 FINAL EXAM NAME Spring 2001 STAT 5 FINAL EXAM NAME Sprng Instructons: Ths s a closed book exam. No notes or books are allowed. ou may use a calculator but you are not allowed to store notes or formulas n the calculator. Please wrte

More information

Econometric Analysis of Panel Data. William Greene Department of Economics Stern School of Business

Econometric Analysis of Panel Data. William Greene Department of Economics Stern School of Business Econometrc Analyss of Panel Data Wllam Greene Department of Economcs Stern School of Busness Econometrc Analyss of Panel Data 5. Random Effects Lnear Model The Random Effects Model The random effects model

More information

On the testing of heterogeneity effects in dynamic unbalanced panel data models

On the testing of heterogeneity effects in dynamic unbalanced panel data models Economcs Letters 58 (1998) 157 163 On the testng of heterogenety effects n dynamc unbalanced panel data models Serg Jmenez-Martn* Unversdad Carlos III de Madrd epartment of Economcs, Av. Madrd, 16, 8903

More information

A new construction of 3-separable matrices via an improved decoding of Macula s construction

A new construction of 3-separable matrices via an improved decoding of Macula s construction Dscrete Optmzaton 5 008 700 704 Contents lsts avalable at ScenceDrect Dscrete Optmzaton journal homepage: wwwelsevercom/locate/dsopt A new constructon of 3-separable matrces va an mproved decodng of Macula

More information

STAT 3008 Applied Regression Analysis

STAT 3008 Applied Regression Analysis STAT 3008 Appled Regresson Analyss Tutoral : Smple Lnear Regresson LAI Chun He Department of Statstcs, The Chnese Unversty of Hong Kong 1 Model Assumpton To quantfy the relatonshp between two factors,

More information

Maximum Likelihood Estimation of Binary Dependent Variables Models: Probit and Logit. 1. General Formulation of Binary Dependent Variables Models

Maximum Likelihood Estimation of Binary Dependent Variables Models: Probit and Logit. 1. General Formulation of Binary Dependent Variables Models ECO 452 -- OE 4: Probt and Logt Models ECO 452 -- OE 4 Mamum Lkelhood Estmaton of Bnary Dependent Varables Models: Probt and Logt hs note demonstrates how to formulate bnary dependent varables models for

More information

A Robust Method for Calculating the Correlation Coefficient

A Robust Method for Calculating the Correlation Coefficient A Robust Method for Calculatng the Correlaton Coeffcent E.B. Nven and C. V. Deutsch Relatonshps between prmary and secondary data are frequently quantfed usng the correlaton coeffcent; however, the tradtonal

More information

3.1 Expectation of Functions of Several Random Variables. )' be a k-dimensional discrete or continuous random vector, with joint PMF p (, E X E X1 E X

3.1 Expectation of Functions of Several Random Variables. )' be a k-dimensional discrete or continuous random vector, with joint PMF p (, E X E X1 E X Statstcs 1: Probablty Theory II 37 3 EPECTATION OF SEVERAL RANDOM VARIABLES As n Probablty Theory I, the nterest n most stuatons les not on the actual dstrbuton of a random vector, but rather on a number

More information

If we apply least squares to the transformed data we obtain. which yields the generalized least squares estimator of β, i.e.,

If we apply least squares to the transformed data we obtain. which yields the generalized least squares estimator of β, i.e., Econ 388 R. Butler 04 revsons lecture 6 WLS I. The Matrx Verson of Heteroskedastcty To llustrate ths n general, consder an error term wth varance-covarance matrx a n-by-n, nxn, matrx denoted as, nstead

More information

Turbulence classification of load data by the frequency and severity of wind gusts. Oscar Moñux, DEWI GmbH Kevin Bleibler, DEWI GmbH

Turbulence classification of load data by the frequency and severity of wind gusts. Oscar Moñux, DEWI GmbH Kevin Bleibler, DEWI GmbH Turbulence classfcaton of load data by the frequency and severty of wnd gusts Introducton Oscar Moñux, DEWI GmbH Kevn Blebler, DEWI GmbH Durng the wnd turbne developng process, one of the most mportant

More information

Inner Product. Euclidean Space. Orthonormal Basis. Orthogonal

Inner Product. Euclidean Space. Orthonormal Basis. Orthogonal Inner Product Defnton 1 () A Eucldean space s a fnte-dmensonal vector space over the reals R, wth an nner product,. Defnton 2 (Inner Product) An nner product, on a real vector space X s a symmetrc, blnear,

More information

Exam. Econometrics - Exam 1

Exam. Econometrics - Exam 1 Econometrcs - Exam 1 Exam Problem 1: (15 ponts) Suppose that the classcal regresson model apples but that the true value of the constant s zero. In order to answer the followng questons assume just one

More information

9. Binary Dependent Variables

9. Binary Dependent Variables 9. Bnar Dependent Varables 9. Homogeneous models Log, prob models Inference Tax preparers 9.2 Random effects models 9.3 Fxed effects models 9.4 Margnal models and GEE Appendx 9A - Lkelhood calculatons

More information

Lab 4: Two-level Random Intercept Model

Lab 4: Two-level Random Intercept Model BIO 656 Lab4 009 Lab 4: Two-level Random Intercept Model Data: Peak expratory flow rate (pefr) measured twce, usng two dfferent nstruments, for 17 subjects. (from Chapter 1 of Multlevel and Longtudnal

More information

[The following data appear in Wooldridge Q2.3.] The table below contains the ACT score and college GPA for eight college students.

[The following data appear in Wooldridge Q2.3.] The table below contains the ACT score and college GPA for eight college students. PPOL 59-3 Problem Set Exercses n Smple Regresson Due n class /8/7 In ths problem set, you are asked to compute varous statstcs by hand to gve you a better sense of the mechancs of the Pearson correlaton

More information

F statistic = s2 1 s 2 ( F for Fisher )

F statistic = s2 1 s 2 ( F for Fisher ) Stat 4 ANOVA Analyss of Varance /6/04 Comparng Two varances: F dstrbuton Typcal Data Sets One way analyss of varance : example Notaton for one way ANOVA Comparng Two varances: F dstrbuton We saw that the

More information

See Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition)

See Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition) Count Data Models See Book Chapter 11 2 nd Edton (Chapter 10 1 st Edton) Count data consst of non-negatve nteger values Examples: number of drver route changes per week, the number of trp departure changes

More information

Limited Dependent Variables

Limited Dependent Variables Lmted Dependent Varables. What f the left-hand sde varable s not a contnuous thng spread from mnus nfnty to plus nfnty? That s, gven a model = f (, β, ε, where a. s bounded below at zero, such as wages

More information

Notes on Frequency Estimation in Data Streams

Notes on Frequency Estimation in Data Streams Notes on Frequency Estmaton n Data Streams In (one of) the data streamng model(s), the data s a sequence of arrvals a 1, a 2,..., a m of the form a j = (, v) where s the dentty of the tem and belongs to

More information

Uncertainty as the Overlap of Alternate Conditional Distributions

Uncertainty as the Overlap of Alternate Conditional Distributions Uncertanty as the Overlap of Alternate Condtonal Dstrbutons Olena Babak and Clayton V. Deutsch Centre for Computatonal Geostatstcs Department of Cvl & Envronmental Engneerng Unversty of Alberta An mportant

More information

Lecture 6: Introduction to Linear Regression

Lecture 6: Introduction to Linear Regression Lecture 6: Introducton to Lnear Regresson An Manchakul amancha@jhsph.edu 24 Aprl 27 Lnear regresson: man dea Lnear regresson can be used to study an outcome as a lnear functon of a predctor Example: 6

More information

0.1 The micro "wage process"

0.1 The micro wage process 0.1 The mcro "wage process" References For estmaton of mcro wage models: John Abowd and Davd Card (1989). "On the Covarance Structure of Earnngs and Hours Changes". Econometrca 57 (2): 411-445. Altonj,

More information

CHAPTER 8. Exercise Solutions

CHAPTER 8. Exercise Solutions CHAPTER 8 Exercse Solutons 77 Chapter 8, Exercse Solutons, Prncples of Econometrcs, 3e 78 EXERCISE 8. When = N N N ( x x) ( x x) ( x x) = = = N = = = N N N ( x ) ( ) ( ) ( x x ) x x x x x = = = = Chapter

More information

Correlation and Regression

Correlation and Regression Correlaton and Regresson otes prepared by Pamela Peterson Drake Index Basc terms and concepts... Smple regresson...5 Multple Regresson...3 Regresson termnology...0 Regresson formulas... Basc terms and

More information

Linear Approximation with Regularization and Moving Least Squares

Linear Approximation with Regularization and Moving Least Squares Lnear Approxmaton wth Regularzaton and Movng Least Squares Igor Grešovn May 007 Revson 4.6 (Revson : March 004). 5 4 3 0.5 3 3.5 4 Contents: Lnear Fttng...4. Weghted Least Squares n Functon Approxmaton...

More information

β0 + β1xi. You are interested in estimating the unknown parameters β

β0 + β1xi. You are interested in estimating the unknown parameters β Ordnary Least Squares (OLS): Smple Lnear Regresson (SLR) Analytcs The SLR Setup Sample Statstcs Ordnary Least Squares (OLS): FOCs and SOCs Back to OLS and Sample Statstcs Predctons (and Resduals) wth OLS

More information

A TEST FOR SLOPE HETEROGENEITY IN FIXED EFFECTS MODELS

A TEST FOR SLOPE HETEROGENEITY IN FIXED EFFECTS MODELS A TEST FOR SLOPE HETEROGEEITY I FIXED EFFECTS MODELS TED JUHL AD OLEKSADR LUGOVSKYY Abstract. Typcal panel data models make use of the assumpton that the regresson parameters are the same for each ndvdual

More information

Joint Statistical Meetings - Biopharmaceutical Section

Joint Statistical Meetings - Biopharmaceutical Section Iteratve Ch-Square Test for Equvalence of Multple Treatment Groups Te-Hua Ng*, U.S. Food and Drug Admnstraton 1401 Rockvlle Pke, #200S, HFM-217, Rockvlle, MD 20852-1448 Key Words: Equvalence Testng; Actve

More information

Estimation: Part 2. Chapter GREG estimation

Estimation: Part 2. Chapter GREG estimation Chapter 9 Estmaton: Part 2 9. GREG estmaton In Chapter 8, we have seen that the regresson estmator s an effcent estmator when there s a lnear relatonshp between y and x. In ths chapter, we generalzed the

More information

Andreas C. Drichoutis Agriculural University of Athens. Abstract

Andreas C. Drichoutis Agriculural University of Athens. Abstract Heteroskedastcty, the sngle crossng property and ordered response models Andreas C. Drchouts Agrculural Unversty of Athens Panagots Lazards Agrculural Unversty of Athens Rodolfo M. Nayga, Jr. Texas AMUnversty

More information

Projection estimators for autoregressive panel data models

Projection estimators for autoregressive panel data models Econometrcs Journal (2002), volume 5, pp 457 479 Projecton estmators for autoregressve panel data models STEPHEN BOND AND FRANK WINDMEIJER Nuffeld College, Unversty of Oxford, Oxford, OX1 1NF, UK CEMMAP,

More information

Lecture 12: Discrete Laplacian

Lecture 12: Discrete Laplacian Lecture 12: Dscrete Laplacan Scrbe: Tanye Lu Our goal s to come up wth a dscrete verson of Laplacan operator for trangulated surfaces, so that we can use t n practce to solve related problems We are mostly

More information

Basic Business Statistics, 10/e

Basic Business Statistics, 10/e Chapter 13 13-1 Basc Busness Statstcs 11 th Edton Chapter 13 Smple Lnear Regresson Basc Busness Statstcs, 11e 009 Prentce-Hall, Inc. Chap 13-1 Learnng Objectves In ths chapter, you learn: How to use regresson

More information