covariance between any two observations

Size: px
Start display at page:

Download "covariance between any two observations"

Transcription

1 1 Ordinary Least Squares (OLS) 1.1 Single Linear Regression Model assumptions of Classical Linear Regression Model (CLRM) (i) true relationship y i = α + βx i + ε i, i = 1,..., N where α, β = population parameters α, β = parameter estimates ε i = idiosyncratic error term (reflects randomness, unobserved factors) ε i = estimated residual (ii) E[ε i ] = 0 error is mean zero (iii) E[ε 2 i ] = σ2 equal to variance of ε given (ii) variance identical i (iv) E[ε i ε j ] = 0 i j covariance between any two observations 1

2 (v) E[x i ε i ] = 0 x is independent of the error (vi) ε i N(0, σ 2 ) estimation errors distributed normally not needed for unbiasedness, consistency, only std errors given a random sample {y i, x i } N i=1, OLS minimizes the sum of the squared residuals α, β = arg min α,β N i=1 (y i α βx i ) 2 solution implies β OLS = Cov(y i, x i ) Var(x i ) N i=1 = (y i y)(x i x) N i=1 (x i x) 2 = N i=1 y i(x i x) N i=1 x i(x i x) α OLS = y β OLS x 2

3 properties α, β are unbiased (finite sample property), consistent (asymptotic property) α, β are efficient Var( β) = σ 2 / N i=1 (x i x) 2 = σ 2 / [N Var(x)] smallest variance of any linear, unbiased estimate (Gauss-Markov Theorem), where linear estimators are those in the class β = i ω iy i where ω i is some weighting function 3

4 1.2 Multiple Regression Model assumptions (i) true relationship y i = α + K = α + x i }{{} k=1 β kx ki + ε i, i = 1,..., N 1xK β }{{} Kx1 + ε i vector vector where K = # of independent variables (also known as regressors or covariates) (ii) E[x ki ε i ] = 0 k (iii) x s are linearly independent (no perfect multicolinearity) other assumptions follow from CLRM 4

5 estimation given a random sample {y i, x i } N i=1, OLS minimizes the sum of the squared residuals α, β = arg min α,β N i=1 (y i α x i β) 2 solution implies β OLS = (x x) 1 (x y) α OLS = y β OLS x where x is an NxK matrix and y is a Nx1 vector estimators retain same properties as in CLRM 5

6 2 Maximum Likelihood Estimation (MLE) alternative estimation technique to OLS useful in nonlinear models equivalent to OLS in classical linear regression model intuition: y depends on x, θ (e.g., θ = {β, σ})... choose θ ML to maximize probability of the realized data {y i, x i } N i=1 the likelihood function gives the total probability of observing the realized data as a function of θ 6

7 general model y i = f(x i, β) + ε i, ε i iid N(0, σ 2 ) θ = {β, σ} data = {y i, x i } N i=1 implies L(θ) = Pr(y 1,..., y N x 1,..., x N, θ) = Pr(y 1 x 1, θ) Pr(y N x N, θ) = i Pr(y i x i, θ) taking logs implies ln[l(θ)] = i ln[pr(y i x i, θ)] and θml = arg max θ ln[l(θ)] 7

8 example: Classical Linear Regression Model y i = x i β + ε i ε i iid N(0, σ 2 ) what is Pr(y i x i, θ)? Pr(y i x i, θ) = Pr(ε i x i, θ) = Pr(y i x i β x i, θ) { 1 = exp 1 ( ) } 2 yi x i β 2πσ 2 2 σ }{{} NORMAL PDF this implies θml = arg max θ = arg max θ [ { N ln 1 exp 1 ( ) }] 2 yi x i β i=1 2πσ 2 2 σ N 2 ln(2π) N ln σ 1 ( N yi x i β 2 i=1 σ ) 2 which is maximized by minimizing the sum of squared residuals (thus, identical to OLS) 8

9 alternative notation Pr(y i x i, θ) = 1 ( σ φ εi ) σ where φ( ) is the std normal pdf, 1/σ is the Jacobian this implies θml = arg max θ = arg max θ N [ 1 ( ln i=1 σ φ εi σ N ln σ N ) ] ( φ εi ) i=1 σ which is algebraically equivalent to the above representation 9

10 properties consistent plim θ ML = θ asymptotically normal θml N(θ, Σ θ ) asymptotically efficient 10

11 hypothesis testing Wald test equivalent to F-test in OLS requires estimation of only the unrestricted model Likelihood ratio test intuition: imposing restrictions can never improve the value of the likelihood fn; the amount by which imposing the restriction(s) lowers the likelihood fn provides some indication of the validity of the restriction (the bigger the decrease, the more likely the restriction(s) do not hold) requires estimation of the unrestricted and restricted models test statistic LR = 2 { } ln[l( θ UR )] ln[l( θ R )] 0 where LR χ 2 q and q = # of restrictions being tested maximization typically by numerical methods as analytical derivatives are messy 11

12 3 Limited Dependent Variable (LDV) Models class of models where y depends on x, but y is not continuous common cases y {0, 1} binary model estimation: probit, logit, LPM y [a, b] censored model a, b are known a, b may vary by i if a =, b =, then continuous estimation: censored regression, tobit y {0, 1, 2,...} count model estimation: poisson, negative binomial 12

13 y {0, 1, 2,...} qualitative response (QR) model values correspond to choices with no natural ordering examples brand choice mode of transportation estimation: multinomial logit, multinomial probit, conditional logit, nested logit y {0, 1, 2,...} ordered QR model values correspond to choices with natural ordering distance between values may vary between choices examples OLF, PT, FT <HS, HS, College, >College estimation: ordered logit, ordered probit 13

14 3.1 Binary Models applicable to problems where the dependent variable is binary examples: LFP, default on a loan, belong to a FTA, etc. Linear Probability Model (LPM) y i = x i β + ε i ε i N(0, σ 2 i ) estimated by OLS problems heteroskedasticity since implies ε i = x i β if y i = 0 1 x i β if y i = 1 Var(ε i x i ) = x i β(1 x i β) predictions not bounded by 0,1 x i β / [0, 1] and therefore do not correspond to probabilities 14

15 solution: model Pr(y i = 1 x i ) using proper functional form Pr(y i = 1 x i ) = F (x i β) where F ( ) satisfies lim F (x iβ) 1 x i β lim F (x iβ) 0 x i β obvious candidates are CDFs, since these map numbers from the entire real number line to the unit interval two parametric solutions F ( ) = Φ( ) }{{} = xi β φ(u)du (probit) std normal CDF F ( ) = Λ( ) }{{} logistic CDF = exp(x iβ) 1 + exp(x i β) (logit) 15

16 interpretation of β Pr(y i = 1) x j = F (x iβ) β x j }{{ j } Marginal Effect where F (x i β) x j = φ(x i β) (probit) Λ(x i β)[1 Λ(x i β)] (logit) notes since β j is more difficult to interpret (it s not the change in the probability given a unit change in x), typically report marginal effects marginal effects are observation-specific 16

17 common reporting options (i) marginal effects evaluated at the sample mean φ(x β) β j OR Λ(x β)[1 Λ(x β)] β j (ii) the sample mean of the marginal effects 1 N 1 N N N i=1 φ(x i β) β j, OR i=1 Λ(x i β)[1 Λ(x i β)] βj (iii) marginal effects evaluated at some combination of values of interest: φ(x o β) βj OR Λ(x o β)[1 Λ(xo β)] βj where x o is a vector of values of interest (e.g., white, male, 40 years old, with a college diploma) caution needed for marginal effects of dummy vars, interaction effects 17

18 estimation by MLE ln[l(β)] = i ln[pr(y i x i, β)] where Pr(y i x i, β) = F (x i β) if y i = 1 1 F (x i β) if y i = 0 which implies ln[l(β)] = i:y i =1 ln[f (x iβ)] + i:y i =0 ln[1 F (x iβ)] = i ln[f (x iβ)] y i ln[1 F (x i β)] 1 y i 18

19 latent variable framework probit/logit model can be recast in a latent (unobserved) variable framework model yi iid = x i β + ε i, ε i N/L 1 if yi y i = > 0 0 if yi 0 y i unobserved y i observed given data {y i, x i } N i=1, estimate β via MLE again, what is Pr(y i x i, θ)? Pr(yi > 0 x i, θ) = Pr(ε i > x i β) ( ) = Pr(ε i < x i β) = F xi β σ if y i = 1 Pr(y i x i, θ) = Pr(yi 0 x i, θ) = Pr(ε i x i β) ( ) [ )] = F = 1 F if y i = 0 x iβ σ which implies identical form of L(θ) ( xi β σ note, σ is not identified since it would appear everywhere as β/σ, so it is normalized to one STATA: -probit-, -logit-, -dprobit-, -margin-, -mfx- 19

20 3.2 Censored Regression Models applicable to problems where the dependent variable is censored (potentially from above and below) at certain threshholds note, while the dependent variable is censored, the x s are always observed examples: income/wealth may be top-coded, age at first birth for nonmothers, duration of unemployment spells for currently unemployed latent variable setup yi iid = x i β + ε i, ε i N(0, σ 2 ) y i = b i if yi b i yi if yi [a i, b i ] if yi a i a i y i unobserved y i observed terminology right-censoring: if b i i left-censoring: if a i i given data {y i, x i } N i=1, estimate β via MLE 20

21 again,what is Pr(y i x i, θ)? Pr(y i b i x i, θ) = Pr(ε i b i x i β) Pr(y i x i, θ) = = 1 Pr(ε i < b i x i β) [ ( )] = 1 Φ bi x i β σ if y i = b i ( ) Pr(yi x i, θ) = 1 σ φ y i x i β σ if y i [a i, b i ] Pr(yi a i x i, θ) = Pr(ε i a i x i β) ) = Φ if y i = a i ( ai x i β σ log likelihood fn given by ln[l(θ)] = [ ( )] ai x i β ln Φ i:y i =a i σ + [ ( )] bi x i β ln 1 Φ i:y i =b i σ + [ ( )] 1 y ln i:y i [a i,b i ] σ φ i x i β σ notes interpretation of β is the impact of a x on y (the latent variable) σ is identified as long as y i [a i, b i ] for some i 21

22 special case: tobit model a i = 0, b i = i examples: labor supply, R&D expenditures by a firm implies the following setup yi iid = x i β + ε i, ε i N(0, σ 2 ) yi if yi y i = > 0 0 if yi 0 y i unobserved y i observed estimate via MLE, where log likelihood fn given by ln[l(θ)] = ( [Φ ln x )] iβ i:y i =0 σ }{{} as in probit + [ ( )] 1 y ln i:y i >0 σ φ i x i β σ }{{} as in CLRM 22

23 interpretation of β is the impact of a x on y (the latent variable) not directly comparable to OLS, which estimates the change in E[y i x i ]/ x j marginal effects (for comparison to OLS) E[y i x i ] x j ( ) xi β = β j Φ σ these are observation-specific example: Tobit Illustration with One Regressor x True values: alpha= 2, beta=2. OLS Data Points Tobit STATA: -tobit-, -cnreg- 23

24 3.3 Count Models applicable to situations where the dependent variable is a non-negative integer count of events dependent value typically takes on only a few values examples: # of children, # of patents held by a firm, # of doctor visits per year, # of cigarettes smoked per day Poisson Model setup model expected number of events conditional on x E[y i x i ] = F (x i β) where F ( ) 0 functional form of F ( ) = exp{x i β} estimation via MLE need a distribution for y i, which cannot be normal assume Poisson distribution, which depends only on the mean given by exp{x i β} 24

25 again, what is Pr(y i x i, θ)? Pr(y i x i, θ) = exp{ λ i}λ y i i y i! (Poisson pdf) where λ i =exp{x i β} can show that E[y i x i ] = Var(y i x i ) = λ i log likelihood fn given by ln[l(θ)] = i [ λ i + y i ln λ i ln(y i!)] interpretation of β as if ln(y) is dependent variable; implies %, or elasticity if ln(x) marginal effects E[y i x i ] x i = λ i β 25

26 shortcoming Poisson assumes the mean and variance are identical regression-based test (Cameron & Trivedi 1990) H o : Var(y i ) = E[y i ] H 1 : Var(y i ) = E[y i ] + αg(e[y i ]) regress z i = ( y i λ i ) 2 yi λ i 2 on either (i) a constant or (ii) λ i and no constant; t-statistic is a test of the null hypothesis 26

27 Wooldridge proposes a correction to the standard errors when there is over- or under-dispersion assumes proportionality Var(y x) = σ 2 E[y x] where σ 2 is unknown σ 2 > 1 = overdispersion; σ 2 < 1 = under-dispersion define û i = y i λ i obtain σ 2 = 1 n k 1 i û2 i / λ i multiply usual standard errors by σ = σ 2 > 0 alternative estimation: negative binomial model STATA: -poisson-, -nbreg- 27

28 3.3.2 Zero-Inflated Poisson Model (skip) applicable to count models with a mass at zero, or where the decision between zero and some positive count is different than the decision among positive counts notation observations assumed to be drawn from either of two regimes observations in regime 1, given by z i = 1, always have a count of zero outcomes for observations in regime 2, given by z i = 2, follow a Poisson process, with possible outcomes given by y i 0 28

29 MLE set-up entails Pr(y = 0 x) = Pr(z = 1 x) + Pr(y = 0 x, z = 2) Pr(z = 2) Pr(y = j x) = Pr(y = j x, z = 2) Pr(z = 2), j = 1, 2,... log likelihood fn given by ln[l(θ)] = i:y i =0 ln{f (x iγ) + [1 F (x i γ)] exp( λ i )} + i:y i >0 {ln[1 F (x iγ)] λ i + y i ln λ i ln(y i!)} where F ( ) is either the logistic of normal CDF notes as in poisson model, λ i = x i β β is not constrained to be equal to γ some elements of β or γ may be constrained to be zero (i.e., the regressors in the two parts of the model need not be the same) STATA: -zip-, -zinb- 29

30 3.3.3 Application: Talley et al. (2005) question: determinants of crew injuries in ship accidents data structure sample includes accidents investigated by US Coast Guard includes US ships anywhere, and foreign ships in US waters outcomes # deaths, # injuries, # missing small, non-negative integer count data lots of covariates; mix of discrete and continuous data over 11 years, model a bit structural, then arrives at reduced form eqn (5) count data = Poisson, negative binomial seperate model by dependent variable and type of ship (freight, tanker, tugboat) results: interpreted as %, also report marginal effects 30

31 shortcoming non-random sample selection; only accidents included not account for impact of x s on probability of an accident, only severity conditional on accident occuring 31

32 3.4 Qualitative Response Models applicable to analyses of choice by agents among many (unordered) alternatives e.g., brand choice, mode of transportation, type of mortgage (fixed-30 yr, fixed-15 yr, adjustable rate,...), type of school (public, charter, privatenonreligious, private-religious,...), occupation dependent variable typically coded as (positive) integers corresponding to specific choice; value/order of actual numbers is irrelevant 32

33 3.4.1 Multinomial Logit setup choose among J + 1 alternatives y i {0, 1,..., J} x s are observation-specific attributes, not choice-specific attributes again, what is Pr(y i = j x i, θ), j = 0, 1,..., J? Pr(y i = j x i, θ) = F (x i, θ) F ( ) should satisfy two criteria F ( ) (0, 1) j Pr(y i = j x i, θ) = 1 i functional form of F ( ) Pr(y i = j x i, θ) = known as multinomial logit exp(x i β j ) J k=0 exp(x iβ k ) 33

34 notes β s are choice-specific (subscripted by j) MLE based on above model is not identified, since β j = β j + c leaves L(θ) unchanged; implies an infinite # of solutions to identify the model, normalize β 0 = 0 identification also requires that the x s be observation-specific, not choice-specific; cannot identify choice-specific effects of choice-specific variables likelihood function ln[l(θ)] = i:y i =0 ln ( + i:y i 0 1 ) 1 + J k=1 exp(x iβ k ) ( ) d exp(x i β j ) ij ln j=1 1 + J k=1 exp(x iβ k ) J where d ij = 1 if y i = j, 0 otherwise 34

35 interpretation log odds ratio (relative to base choice) ln where P ij = Pr(y i = j) ( Pij P i0 ) = x i β j implies β j is the % change in odds ratio from unit change in x log odds ratio (relative to any other choice) ln ( Pij P ij ) = x i (β j β j ) 35

36 shortcoming: Independence of Irrelevant Alternatives (IIA) due to assumption of independent, homoskedastic errors expressed as ln ( Pij P ij ) f(p ik ), k j, j Hausman test estimate full model = β j, j = 1,..., J estimate partial model on only a subset of choices = βj, j = 1,..., J P, where J P < J if IIA holds: βj β j, j = 1,..., J P thus, if partial model and full model yield estimates that are too different then we reject IIA multinomial probit does not impose IIA, but estimation is more complex since it involves multiple integration nested logit is also a possible alternative 36

37 random utility model (RUM) model can be derived from a RUM utility of observation i from choice j given by U ij = V ij + ε ij = x i β j + ε ij where ε is an iid from a Type I extreme value dbn, where G(ε) = exp( e ε ) observations choose to maximize utility implies Pr(y i = j) = Pr(U ij U ij ) j j =. exp(v ij ) J k=0 exp(v ij) which is identical to above STATA: -mlogit-, -mprobit- 37

38 3.4.2 Conditional Logit multinomial logit does not identify effects of choice-specific x s usual model with such data is a conditional logit results analagous to many pairwise binary logit models MLE again, what is Pr(y i = j x j, θ), j = 0, 1,..., J? Pr(y i = j x j, θ) = F (x j, θ) F ( ) should satisfy two criteria F ( ) (0, 1) j Pr(y i = j x j, θ) = 1 i functional form of F ( ) Pr(y i = j x j, θ) = known as conditional logit exp(x j β) J k=0 exp(x kβ) likelihood function ln[l(θ)] = ( ) d exp(x j β) ij ln i j J k=0 exp(x kβ) 38

39 log odds ratio ln ( Pij P ij ) = (x j x j )β which still implies IIA interpretation marginal effects: own continuous attributes Pr(y i = j) x j = {Pr(y i = j) [1 Pr(y i = j)]} β marginal effects: other continuous attributes Pr(y i = j) x j = {Pr(y i = j) Pr(y i = j )} β, j j notes model allows attributes of all other choices to affect probability of each option being chosen however, two derivatives must necessarily be of the opposite sign may not be a good property (e.g., FDI is increasing in own market size, and market size of neighbors) can similarly be derived from a random utility model STATA: -clogit- 39

40 3.4.3 Nested Logit (skip) reframes the problem as a sequential, multi-stage decision e.g., a business may first decide on a state in which to build a new plant, and then choose a county within the state notation J = total number of potential choices (e.g., the number of counties in the US) L = number of subgroups (e.g., the number of states in the US) J l = number of choices within subgroup l, l = 1,..., L (e.g., the number of counties in state l) implies the following identity J = L l=1 J l x j l = attributes of choice j in subgroup l z l = attributes of subgroup l 40

41 MLE probability of a given choice given by Pr(y i = j, l) = L h=1 which is the unconditional probability exp(x j l β + z l γ) Jl k=1 exp(x k hβ + z h γ) the unconditional probability is equal to the conditional times the marginal probability Pr(y i = j, l) = Pr(y i = j l) Pr(y i = l) = exp(x j l β) Jl k=1 exp(x L k lβ) h=1 exp(z l γ) J l k=1 exp(x j lβ) Jl k=1 exp(x k hβ + z h γ) define the inclusive value as I l = ln [ Jl k=1 exp(x j lβ) ] re-arranging, we get Pr(y i = j, l) = Pr(y i = j l) Pr(y i = l) exp(x j l β) exp(z l γ + τ l I l ) = Jl L h=1 exp(z hγ + τ h I h ) k=1 exp(x k lβ) restricting τ l = 1, l = 1,..., L, yields the conditional logit model 41

42 error term is homoskedastic within each subgroup, heteroskedastic across subgroups log odds ratio any two choices ln ( ) Pr(yi = j, l) Pr(y i = j, l ) = exp(x j lβ) exp(z l γ + τ l I l ) J l k=1 exp(x k l β) exp(x j l β) exp(z l γ + τ l I l ) J l = (x j l x j l )β + (z l z l )γ k=1 exp(x k lβ) + (τ l 1)I l (τ l 1)I l which incorporates all attributes of each subgroup l, l any two choices within a subgroup ln which implies IIA any two subgroups ln ( ) Pr(yi = j, l) Pr(y i = j, l) ( ) Pr(yi = l) Pr(y i = l ) = (x j l x j l)β = (z l z l )γ + (τ l I l τ l I l ) which incorporates all attributes of each subgroup l, l 42

43 interpretation marginal effects: own continuous attributes Pr(y i = j, l) [1 Pr(y i = j l)] = x j l + τ l Pr(y i = j l) [1 Pr(y i = l)] β marginal effects: continuous attributes from j, l Pr(y i = j, l) x j l = { Pr(y i = j l) + τ l Pr(y i = j l) [1 Pr(y i = l)]} β which may or may not be of the same sign as β marginal effects: continuous attributes from j, l Pr(y i = j, l) x j l = { τ l Pr(y i = j l) Pr(y i = l)} β which is of the opposite sign of β STATA: -nlogit- 43

44 3.4.4 Application: Co & List (2004) question: analyze determinants of location choice of FDI in the US; in particular, are knowledge spillovers important data structure outcome is the state choosen by new foreign-owned plants plants entered the US between 1986 and 1993 x s are state-specific attributes in 1986 of particular interest is the x measuring knowledge spillovers, measured by patent counts model: discrete choice data, with choice-specific x s = conditional logit model results min, median, and max elasticity estimates reported elasticity given by ln(p j ) ln(x jm ) = x jmβ m (1 P j ) knowledge spillovers, unionization rates positively associated with probability of receiving a new plant 44

45 energy expenditures, pollution costs negatively associated with probability of receiving a new plant 45

46 3.5 Ordered Response Models applicable to analyses of choice by agents among many ordered alternatives e.g., labor force status (OLF, PT, FT), schooling (<HS, HS, some coll, BA/BS, MA, PhD), bond ratings (AAA,...,D) outcomes coded as discrete integers: 0, 1, 2,... relative to other models MNL, MNP fails to account for ordinal nature of data OLS treats distance between choices as identical (e.g., moving from OLF to PT is equivalent to moving from PT to FT) 46

47 latent variable framework similar to probit/logit framework model yi iid = x i β + ε i, ε i N/L 0 if yi 0 y i = 1 if y i (0, µ 1] 2 if y i (µ 1, µ 2 ]. J if yi > µ J 1 yi unobserved y i observed where µ s are cutoff parameters that are unknown and to be estimated given data {y i, x i } N i=1, estimate β via MLE 47

48 again, what is Pr(y i x i, θ)? Pr(yi 0 x i, θ) = Pr(ε i x i β) = F ( x i β) = [1 F (x i β)] if y i = 0 Pr(µ j 1 < yi µ j x i, θ) Pr(y i x i, θ) = = Pr(ε i < µ j x i β) Pr(ε i < µ j 1 x i β) = F ( µ j x i β ) F ( µ j 1 x i β ) if y i = j, j = 1,..., J 1 Pr(y i > µ J 1 x i, θ) = Pr(ε i > µ J 1 x i β) = 1 Pr(ε i < µ J 1 x i β) = 1 F (µ J 1 x i β) if y i = J where µ 0 = 0 and σ is normalized to one, and 0 < µ 1 < < µ J 1 form of F ( ) defines the ordered probit/logit models interpretation marginal effects are outcome-specific in general Pr(y = j x) x = f(xβ)β if j = 0 [ f(µj 1 xβ) f(µ j xβ) ] β if j = 1,..., J 1 f(µ J 1 xβ)β if j = J where f( ) is the corresponding pdf and µ 0 = 0 STATA: -oprobit-, -ologit- 48

49 3.5.1 Application: Kalb & Williams (2003) question: analyze the differential effects of attributes on juvenile criminal behavior by gender data structure sample of individuals born in 1958 and residing in Philadelphia at some point between ages of 10 and 18 complete criminal history retrospective data on other variables collected in 1988 dependent variable = # of arrests (0, 1, or 2+) other variables represent family background variables issue with respect to sample weighting and missing data can be ignored for our purposes model: ordered, integer data = ordered probit model 49

50 results coefficient estimates and marginal effects are reported results fairly similar across gender non-whites, those leaving school early are more likely to commit criminal acts no impact of past physical/sexual abuse some differences by gender maternal labor supply has no effect on boys, decreases probability of criminal behavior for girls father with HS diploma and fewer siblings have no effect on girls, decrease probability of criminal behavior for boys 50

ECO Econometrics III. Daniel L. Millimet. Fall Southern Methodist University. DL Millimet (SMU) ECO 6375 Fall / 156

ECO Econometrics III. Daniel L. Millimet. Fall Southern Methodist University. DL Millimet (SMU) ECO 6375 Fall / 156 ECO 6375 Econometrics III Daniel L. Millimet Southern Methodist University Fall 2018 DL Millimet (SMU) ECO 6375 Fall 2018 1 / 156 Introduction Goals... 1 Become familiar with alternative estimation techniques

More information

Modeling Binary Outcomes: Logit and Probit Models

Modeling Binary Outcomes: Logit and Probit Models Modeling Binary Outcomes: Logit and Probit Models Eric Zivot December 5, 2009 Motivating Example: Women s labor force participation y i = 1 if married woman is in labor force = 0 otherwise x i k 1 = observed

More information

Econometrics Lecture 5: Limited Dependent Variable Models: Logit and Probit

Econometrics Lecture 5: Limited Dependent Variable Models: Logit and Probit Econometrics Lecture 5: Limited Dependent Variable Models: Logit and Probit R. G. Pierse 1 Introduction In lecture 5 of last semester s course, we looked at the reasons for including dichotomous variables

More information

Ninth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis"

Ninth ARTNeT Capacity Building Workshop for Trade Research Trade Flows and Trade Policy Analysis Ninth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis" June 2013 Bangkok, Thailand Cosimo Beverelli and Rainer Lanz (World Trade Organization) 1 Selected econometric

More information

Non-linear panel data modeling

Non-linear panel data modeling Non-linear panel data modeling Laura Magazzini University of Verona laura.magazzini@univr.it http://dse.univr.it/magazzini May 2010 Laura Magazzini (@univr.it) Non-linear panel data modeling May 2010 1

More information

i (x i x) 2 1 N i x i(y i y) Var(x) = P (x 1 x) Var(x)

i (x i x) 2 1 N i x i(y i y) Var(x) = P (x 1 x) Var(x) ECO 6375 Prof Millimet Problem Set #2: Answer Key Stata problem 2 Q 3 Q (a) The sample average of the individual-specific marginal effects is 0039 for educw and -0054 for white Thus, on average, an extra

More information

Truncation and Censoring

Truncation and Censoring Truncation and Censoring Laura Magazzini laura.magazzini@univr.it Laura Magazzini (@univr.it) Truncation and Censoring 1 / 35 Truncation and censoring Truncation: sample data are drawn from a subset of

More information

Limited Dependent Variable Models II

Limited Dependent Variable Models II Limited Dependent Variable Models II Fall 2008 Environmental Econometrics (GR03) LDV Fall 2008 1 / 15 Models with Multiple Choices The binary response model was dealing with a decision problem with two

More information

Goals. PSCI6000 Maximum Likelihood Estimation Multiple Response Model 2. Recap: MNL. Recap: MNL

Goals. PSCI6000 Maximum Likelihood Estimation Multiple Response Model 2. Recap: MNL. Recap: MNL Goals PSCI6000 Maximum Likelihood Estimation Multiple Response Model 2 Tetsuya Matsubayashi University of North Texas November 9, 2010 Learn multiple responses models that do not require the assumption

More information

Binary Choice Models Probit & Logit. = 0 with Pr = 0 = 1. decision-making purchase of durable consumer products unemployment

Binary Choice Models Probit & Logit. = 0 with Pr = 0 = 1. decision-making purchase of durable consumer products unemployment BINARY CHOICE MODELS Y ( Y ) ( Y ) 1 with Pr = 1 = P = 0 with Pr = 0 = 1 P Examples: decision-making purchase of durable consumer products unemployment Estimation with OLS? Yi = Xiβ + εi Problems: nonsense

More information

Lecture 12: Application of Maximum Likelihood Estimation:Truncation, Censoring, and Corner Solutions

Lecture 12: Application of Maximum Likelihood Estimation:Truncation, Censoring, and Corner Solutions Econ 513, USC, Department of Economics Lecture 12: Application of Maximum Likelihood Estimation:Truncation, Censoring, and Corner Solutions I Introduction Here we look at a set of complications with the

More information

Week 7: Binary Outcomes (Scott Long Chapter 3 Part 2)

Week 7: Binary Outcomes (Scott Long Chapter 3 Part 2) Week 7: (Scott Long Chapter 3 Part 2) Tsun-Feng Chiang* *School of Economics, Henan University, Kaifeng, China April 29, 2014 1 / 38 ML Estimation for Probit and Logit ML Estimation for Probit and Logit

More information

ECON 594: Lecture #6

ECON 594: Lecture #6 ECON 594: Lecture #6 Thomas Lemieux Vancouver School of Economics, UBC May 2018 1 Limited dependent variables: introduction Up to now, we have been implicitly assuming that the dependent variable, y, was

More information

Econometrics Summary Algebraic and Statistical Preliminaries

Econometrics Summary Algebraic and Statistical Preliminaries Econometrics Summary Algebraic and Statistical Preliminaries Elasticity: The point elasticity of Y with respect to L is given by α = ( Y/ L)/(Y/L). The arc elasticity is given by ( Y/ L)/(Y/L), when L

More information

Data-analysis and Retrieval Ordinal Classification

Data-analysis and Retrieval Ordinal Classification Data-analysis and Retrieval Ordinal Classification Ad Feelders Universiteit Utrecht Data-analysis and Retrieval 1 / 30 Strongly disagree Ordinal Classification 1 2 3 4 5 0% (0) 10.5% (2) 21.1% (4) 42.1%

More information

Introduction to GSEM in Stata

Introduction to GSEM in Stata Introduction to GSEM in Stata Christopher F Baum ECON 8823: Applied Econometrics Boston College, Spring 2016 Christopher F Baum (BC / DIW) Introduction to GSEM in Stata Boston College, Spring 2016 1 /

More information

Comparing IRT with Other Models

Comparing IRT with Other Models Comparing IRT with Other Models Lecture #14 ICPSR Item Response Theory Workshop Lecture #14: 1of 45 Lecture Overview The final set of slides will describe a parallel between IRT and another commonly used

More information

Lecture 10: Alternatives to OLS with limited dependent variables. PEA vs APE Logit/Probit Poisson

Lecture 10: Alternatives to OLS with limited dependent variables. PEA vs APE Logit/Probit Poisson Lecture 10: Alternatives to OLS with limited dependent variables PEA vs APE Logit/Probit Poisson PEA vs APE PEA: partial effect at the average The effect of some x on y for a hypothetical case with sample

More information

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data July 2012 Bangkok, Thailand Cosimo Beverelli (World Trade Organization) 1 Content a) Classical regression model b)

More information

Goals. PSCI6000 Maximum Likelihood Estimation Multiple Response Model 1. Multinomial Dependent Variable. Random Utility Model

Goals. PSCI6000 Maximum Likelihood Estimation Multiple Response Model 1. Multinomial Dependent Variable. Random Utility Model Goals PSCI6000 Maximum Likelihood Estimation Multiple Response Model 1 Tetsuya Matsubayashi University of North Texas November 2, 2010 Random utility model Multinomial logit model Conditional logit model

More information

Lecture 11/12. Roy Model, MTE, Structural Estimation

Lecture 11/12. Roy Model, MTE, Structural Estimation Lecture 11/12. Roy Model, MTE, Structural Estimation Economics 2123 George Washington University Instructor: Prof. Ben Williams Roy model The Roy model is a model of comparative advantage: Potential earnings

More information

Maximum Likelihood and. Limited Dependent Variable Models

Maximum Likelihood and. Limited Dependent Variable Models Maximum Likelihood and Limited Dependent Variable Models Michele Pellizzari IGIER-Bocconi, IZA and frdb May 24, 2010 These notes are largely based on the textbook by Jeffrey M. Wooldridge. 2002. Econometric

More information

Discrete Choice Modeling

Discrete Choice Modeling [Part 6] 1/55 0 Introduction 1 Summary 2 Binary Choice 3 Panel Data 4 Bivariate Probit 5 Ordered Choice 6 7 Multinomial Choice 8 Nested Logit 9 Heterogeneity 10 Latent Class 11 Mixed Logit 12 Stated Preference

More information

Review of Econometrics

Review of Econometrics Review of Econometrics Zheng Tian June 5th, 2017 1 The Essence of the OLS Estimation Multiple regression model involves the models as follows Y i = β 0 + β 1 X 1i + β 2 X 2i + + β k X ki + u i, i = 1,...,

More information

Econometric Analysis of Cross Section and Panel Data

Econometric Analysis of Cross Section and Panel Data Econometric Analysis of Cross Section and Panel Data Jeffrey M. Wooldridge / The MIT Press Cambridge, Massachusetts London, England Contents Preface Acknowledgments xvii xxiii I INTRODUCTION AND BACKGROUND

More information

Binary choice. Michel Bierlaire

Binary choice. Michel Bierlaire Binary choice Michel Bierlaire Transport and Mobility Laboratory School of Architecture, Civil and Environmental Engineering Ecole Polytechnique Fédérale de Lausanne M. Bierlaire (TRANSP-OR ENAC EPFL)

More information

Econometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018

Econometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018 Econometrics I KS Module 2: Multivariate Linear Regression Alexander Ahammer Department of Economics Johannes Kepler University of Linz This version: April 16, 2018 Alexander Ahammer (JKU) Module 2: Multivariate

More information

Applied Economics. Regression with a Binary Dependent Variable. Department of Economics Universidad Carlos III de Madrid

Applied Economics. Regression with a Binary Dependent Variable. Department of Economics Universidad Carlos III de Madrid Applied Economics Regression with a Binary Dependent Variable Department of Economics Universidad Carlos III de Madrid See Stock and Watson (chapter 11) 1 / 28 Binary Dependent Variables: What is Different?

More information

Comprehensive Examination Quantitative Methods Spring, 2018

Comprehensive Examination Quantitative Methods Spring, 2018 Comprehensive Examination Quantitative Methods Spring, 2018 Instruction: This exam consists of three parts. You are required to answer all the questions in all the parts. 1 Grading policy: 1. Each part

More information

Models of Qualitative Binary Response

Models of Qualitative Binary Response Models of Qualitative Binary Response Probit and Logit Models October 6, 2015 Dependent Variable as a Binary Outcome Suppose we observe an economic choice that is a binary signal. The focus on the course

More information

Using the Delta Method to Construct Confidence Intervals for Predicted Probabilities, Rates, and Discrete Changes 1

Using the Delta Method to Construct Confidence Intervals for Predicted Probabilities, Rates, and Discrete Changes 1 Using the Delta Method to Construct Confidence Intervals for Predicted Probabilities, Rates, Discrete Changes 1 JunXuJ.ScottLong Indiana University 2005-02-03 1 General Formula The delta method is a general

More information

Economics 671: Applied Econometrics Department of Economics, Finance and Legal Studies University of Alabama

Economics 671: Applied Econometrics Department of Economics, Finance and Legal Studies University of Alabama Problem Set #1 (Random Data Generation) 1. Generate =500random numbers from both the uniform 1 ( [0 1], uniformbetween zero and one) and exponential exp ( ) (set =2and let [0 1]) distributions. Plot the

More information

Introduction to Estimation Methods for Time Series models Lecture 2

Introduction to Estimation Methods for Time Series models Lecture 2 Introduction to Estimation Methods for Time Series models Lecture 2 Fulvio Corsi SNS Pisa Fulvio Corsi Introduction to Estimation () Methods for Time Series models Lecture 2 SNS Pisa 1 / 21 Estimators:

More information

A Guide to Modern Econometric:

A Guide to Modern Econometric: A Guide to Modern Econometric: 4th edition Marno Verbeek Rotterdam School of Management, Erasmus University, Rotterdam B 379887 )WILEY A John Wiley & Sons, Ltd., Publication Contents Preface xiii 1 Introduction

More information

I. Multinomial Logit Suppose we only have individual specific covariates. Then we can model the response probability as

I. Multinomial Logit Suppose we only have individual specific covariates. Then we can model the response probability as Econ 513, USC, Fall 2005 Lecture 15 Discrete Response Models: Multinomial, Conditional and Nested Logit Models Here we focus again on models for discrete choice with more than two outcomes We assume that

More information

ECONOMETRICS HONOR S EXAM REVIEW SESSION

ECONOMETRICS HONOR S EXAM REVIEW SESSION ECONOMETRICS HONOR S EXAM REVIEW SESSION Eunice Han ehan@fas.harvard.edu March 26 th, 2013 Harvard University Information 2 Exam: April 3 rd 3-6pm @ Emerson 105 Bring a calculator and extra pens. Notes

More information

Environmental Econometrics

Environmental Econometrics Environmental Econometrics Syngjoo Choi Fall 2008 Environmental Econometrics (GR03) Fall 2008 1 / 37 Syllabus I This is an introductory econometrics course which assumes no prior knowledge on econometrics;

More information

Parameter Estimation

Parameter Estimation Parameter Estimation Consider a sample of observations on a random variable Y. his generates random variables: (y 1, y 2,, y ). A random sample is a sample (y 1, y 2,, y ) where the random variables y

More information

2. We care about proportion for categorical variable, but average for numerical one.

2. We care about proportion for categorical variable, but average for numerical one. Probit Model 1. We apply Probit model to Bank data. The dependent variable is deny, a dummy variable equaling one if a mortgage application is denied, and equaling zero if accepted. The key regressor is

More information

Applied Health Economics (for B.Sc.)

Applied Health Economics (for B.Sc.) Applied Health Economics (for B.Sc.) Helmut Farbmacher Department of Economics University of Mannheim Autumn Semester 2017 Outlook 1 Linear models (OLS, Omitted variables, 2SLS) 2 Limited and qualitative

More information

Statistical Estimation

Statistical Estimation Statistical Estimation Use data and a model. The plug-in estimators are based on the simple principle of applying the defining functional to the ECDF. Other methods of estimation: minimize residuals from

More information

Econometrics Honor s Exam Review Session. Spring 2012 Eunice Han

Econometrics Honor s Exam Review Session. Spring 2012 Eunice Han Econometrics Honor s Exam Review Session Spring 2012 Eunice Han Topics 1. OLS The Assumptions Omitted Variable Bias Conditional Mean Independence Hypothesis Testing and Confidence Intervals Homoskedasticity

More information

MLE and GMM. Li Zhao, SJTU. Spring, Li Zhao MLE and GMM 1 / 22

MLE and GMM. Li Zhao, SJTU. Spring, Li Zhao MLE and GMM 1 / 22 MLE and GMM Li Zhao, SJTU Spring, 2017 Li Zhao MLE and GMM 1 / 22 Outline 1 MLE 2 GMM 3 Binary Choice Models Li Zhao MLE and GMM 2 / 22 Maximum Likelihood Estimation - Introduction For a linear model y

More information

Max. Likelihood Estimation. Outline. Econometrics II. Ricardo Mora. Notes. Notes

Max. Likelihood Estimation. Outline. Econometrics II. Ricardo Mora. Notes. Notes Maximum Likelihood Estimation Econometrics II Department of Economics Universidad Carlos III de Madrid Máster Universitario en Desarrollo y Crecimiento Económico Outline 1 3 4 General Approaches to Parameter

More information

h=1 exp (X : J h=1 Even the direction of the e ect is not determined by jk. A simpler interpretation of j is given by the odds-ratio

h=1 exp (X : J h=1 Even the direction of the e ect is not determined by jk. A simpler interpretation of j is given by the odds-ratio Multivariate Response Models The response variable is unordered and takes more than two values. The term unordered refers to the fact that response 3 is not more favored than response 2. One choice from

More information

LECTURE 2 LINEAR REGRESSION MODEL AND OLS

LECTURE 2 LINEAR REGRESSION MODEL AND OLS SEPTEMBER 29, 2014 LECTURE 2 LINEAR REGRESSION MODEL AND OLS Definitions A common question in econometrics is to study the effect of one group of variables X i, usually called the regressors, on another

More information

Practical Econometrics. for. Finance and Economics. (Econometrics 2)

Practical Econometrics. for. Finance and Economics. (Econometrics 2) Practical Econometrics for Finance and Economics (Econometrics 2) Seppo Pynnönen and Bernd Pape Department of Mathematics and Statistics, University of Vaasa 1. Introduction 1.1 Econometrics Econometrics

More information

Lecture 14 More on structural estimation

Lecture 14 More on structural estimation Lecture 14 More on structural estimation Economics 8379 George Washington University Instructor: Prof. Ben Williams traditional MLE and GMM MLE requires a full specification of a model for the distribution

More information

Single-level Models for Binary Responses

Single-level Models for Binary Responses Single-level Models for Binary Responses Distribution of Binary Data y i response for individual i (i = 1,..., n), coded 0 or 1 Denote by r the number in the sample with y = 1 Mean and variance E(y) =

More information

Maximum Likelihood Methods

Maximum Likelihood Methods Maximum Likelihood Methods Some of the models used in econometrics specify the complete probability distribution of the outcomes of interest rather than just a regression function. Sometimes this is because

More information

Generalized Linear Models Introduction

Generalized Linear Models Introduction Generalized Linear Models Introduction Statistics 135 Autumn 2005 Copyright c 2005 by Mark E. Irwin Generalized Linear Models For many problems, standard linear regression approaches don t work. Sometimes,

More information

ECON Introductory Econometrics. Lecture 11: Binary dependent variables

ECON Introductory Econometrics. Lecture 11: Binary dependent variables ECON4150 - Introductory Econometrics Lecture 11: Binary dependent variables Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 11 Lecture Outline 2 The linear probability model Nonlinear probability

More information

Intermediate Econometrics

Intermediate Econometrics Intermediate Econometrics Heteroskedasticity Text: Wooldridge, 8 July 17, 2011 Heteroskedasticity Assumption of homoskedasticity, Var(u i x i1,..., x ik ) = E(u 2 i x i1,..., x ik ) = σ 2. That is, the

More information

Advanced Econometrics I

Advanced Econometrics I Lecture Notes Autumn 2010 Dr. Getinet Haile, University of Mannheim 1. Introduction Introduction & CLRM, Autumn Term 2010 1 What is econometrics? Econometrics = economic statistics economic theory mathematics

More information

Econometrics Master in Business and Quantitative Methods

Econometrics Master in Business and Quantitative Methods Econometrics Master in Business and Quantitative Methods Helena Veiga Universidad Carlos III de Madrid This chapter deals with truncation and censoring. Truncation occurs when the sample data are drawn

More information

Poisson Regression. Ryan Godwin. ECON University of Manitoba

Poisson Regression. Ryan Godwin. ECON University of Manitoba Poisson Regression Ryan Godwin ECON 7010 - University of Manitoba Abstract. These lecture notes introduce Maximum Likelihood Estimation (MLE) of a Poisson regression model. 1 Motivating the Poisson Regression

More information

NELS 88. Latent Response Variable Formulation Versus Probability Curve Formulation

NELS 88. Latent Response Variable Formulation Versus Probability Curve Formulation NELS 88 Table 2.3 Adjusted odds ratios of eighth-grade students in 988 performing below basic levels of reading and mathematics in 988 and dropping out of school, 988 to 990, by basic demographics Variable

More information

8. Hypothesis Testing

8. Hypothesis Testing FE661 - Statistical Methods for Financial Engineering 8. Hypothesis Testing Jitkomut Songsiri introduction Wald test likelihood-based tests significance test for linear regression 8-1 Introduction elements

More information

The Logit Model: Estimation, Testing and Interpretation

The Logit Model: Estimation, Testing and Interpretation The Logit Model: Estimation, Testing and Interpretation Herman J. Bierens October 25, 2008 1 Introduction to maximum likelihood estimation 1.1 The likelihood function Consider a random sample Y 1,...,

More information

Linear Regression Models P8111

Linear Regression Models P8111 Linear Regression Models P8111 Lecture 25 Jeff Goldsmith April 26, 2016 1 of 37 Today s Lecture Logistic regression / GLMs Model framework Interpretation Estimation 2 of 37 Linear regression Course started

More information

Binary Dependent Variables

Binary Dependent Variables Binary Dependent Variables In some cases the outcome of interest rather than one of the right hand side variables - is discrete rather than continuous Binary Dependent Variables In some cases the outcome

More information

Testing and Model Selection

Testing and Model Selection Testing and Model Selection This is another digression on general statistics: see PE App C.8.4. The EViews output for least squares, probit and logit includes some statistics relevant to testing hypotheses

More information

Chapter 11. Regression with a Binary Dependent Variable

Chapter 11. Regression with a Binary Dependent Variable Chapter 11 Regression with a Binary Dependent Variable 2 Regression with a Binary Dependent Variable (SW Chapter 11) So far the dependent variable (Y) has been continuous: district-wide average test score

More information

Linear models. Linear models are computationally convenient and remain widely used in. applied econometric research

Linear models. Linear models are computationally convenient and remain widely used in. applied econometric research Linear models Linear models are computationally convenient and remain widely used in applied econometric research Our main focus in these lectures will be on single equation linear models of the form y

More information

ST3241 Categorical Data Analysis I Generalized Linear Models. Introduction and Some Examples

ST3241 Categorical Data Analysis I Generalized Linear Models. Introduction and Some Examples ST3241 Categorical Data Analysis I Generalized Linear Models Introduction and Some Examples 1 Introduction We have discussed methods for analyzing associations in two-way and three-way tables. Now we will

More information

POLI 7050 Spring 2008 February 27, 2008 Unordered Response Models I

POLI 7050 Spring 2008 February 27, 2008 Unordered Response Models I POLI 7050 Spring 2008 February 27, 2008 Unordered Response Models I Introduction For the next couple weeks we ll be talking about unordered, polychotomous dependent variables. Examples include: Voter choice

More information

Economics 536 Lecture 21 Counts, Tobit, Sample Selection, and Truncation

Economics 536 Lecture 21 Counts, Tobit, Sample Selection, and Truncation University of Illinois Fall 2016 Department of Economics Roger Koenker Economics 536 Lecture 21 Counts, Tobit, Sample Selection, and Truncation The simplest of this general class of models is Tobin s (1958)

More information

Gibbs Sampling in Latent Variable Models #1

Gibbs Sampling in Latent Variable Models #1 Gibbs Sampling in Latent Variable Models #1 Econ 690 Purdue University Outline 1 Data augmentation 2 Probit Model Probit Application A Panel Probit Panel Probit 3 The Tobit Model Example: Female Labor

More information

Statistics, inference and ordinary least squares. Frank Venmans

Statistics, inference and ordinary least squares. Frank Venmans Statistics, inference and ordinary least squares Frank Venmans Statistics Conditional probability Consider 2 events: A: die shows 1,3 or 5 => P(A)=3/6 B: die shows 3 or 6 =>P(B)=2/6 A B : A and B occur:

More information

LOGISTIC REGRESSION Joseph M. Hilbe

LOGISTIC REGRESSION Joseph M. Hilbe LOGISTIC REGRESSION Joseph M. Hilbe Arizona State University Logistic regression is the most common method used to model binary response data. When the response is binary, it typically takes the form of

More information

MS&E 226: Small Data. Lecture 11: Maximum likelihood (v2) Ramesh Johari

MS&E 226: Small Data. Lecture 11: Maximum likelihood (v2) Ramesh Johari MS&E 226: Small Data Lecture 11: Maximum likelihood (v2) Ramesh Johari ramesh.johari@stanford.edu 1 / 18 The likelihood function 2 / 18 Estimating the parameter This lecture develops the methodology behind

More information

disc choice5.tex; April 11, ffl See: King - Unifying Political Methodology ffl See: King/Tomz/Wittenberg (1998, APSA Meeting). ffl See: Alvarez

disc choice5.tex; April 11, ffl See: King - Unifying Political Methodology ffl See: King/Tomz/Wittenberg (1998, APSA Meeting). ffl See: Alvarez disc choice5.tex; April 11, 2001 1 Lecture Notes on Discrete Choice Models Copyright, April 11, 2001 Jonathan Nagler 1 Topics 1. Review the Latent Varible Setup For Binary Choice ffl Logit ffl Likelihood

More information

Generalized Linear Models for Non-Normal Data

Generalized Linear Models for Non-Normal Data Generalized Linear Models for Non-Normal Data Today s Class: 3 parts of a generalized model Models for binary outcomes Complications for generalized multivariate or multilevel models SPLH 861: Lecture

More information

POLI 8501 Introduction to Maximum Likelihood Estimation

POLI 8501 Introduction to Maximum Likelihood Estimation POLI 8501 Introduction to Maximum Likelihood Estimation Maximum Likelihood Intuition Consider a model that looks like this: Y i N(µ, σ 2 ) So: E(Y ) = µ V ar(y ) = σ 2 Suppose you have some data on Y,

More information

Ch. 5 Hypothesis Testing

Ch. 5 Hypothesis Testing Ch. 5 Hypothesis Testing The current framework of hypothesis testing is largely due to the work of Neyman and Pearson in the late 1920s, early 30s, complementing Fisher s work on estimation. As in estimation,

More information

Lecture-19: Modeling Count Data II

Lecture-19: Modeling Count Data II Lecture-19: Modeling Count Data II 1 In Today s Class Recap of Count data models Truncated count data models Zero-inflated models Panel count data models R-implementation 2 Count Data In many a phenomena

More information

Standard Errors & Confidence Intervals. N(0, I( β) 1 ), I( β) = [ 2 l(β, φ; y) β i β β= β j

Standard Errors & Confidence Intervals. N(0, I( β) 1 ), I( β) = [ 2 l(β, φ; y) β i β β= β j Standard Errors & Confidence Intervals β β asy N(0, I( β) 1 ), where I( β) = [ 2 l(β, φ; y) ] β i β β= β j We can obtain asymptotic 100(1 α)% confidence intervals for β j using: β j ± Z 1 α/2 se( β j )

More information

Ordered Response and Multinomial Logit Estimation

Ordered Response and Multinomial Logit Estimation Ordered Response and Multinomial Logit Estimation Quantitative Microeconomics R. Mora Department of Economics Universidad Carlos III de Madrid Outline Introduction 1 Introduction 2 3 Introduction The Ordered

More information

Models for Heterogeneous Choices

Models for Heterogeneous Choices APPENDIX B Models for Heterogeneous Choices Heteroskedastic Choice Models In the empirical chapters of the printed book we are interested in testing two different types of propositions about the beliefs

More information

Econometrics Problem Set 10

Econometrics Problem Set 10 Econometrics Problem Set 0 WISE, Xiamen University Spring 207 Conceptual Questions Dependent variable: P ass Probit Logit LPM Probit Logit LPM Probit () (2) (3) (4) (5) (6) (7) Experience 0.03 0.040 0.006

More information

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A. 1. Let P be a probability measure on a collection of sets A. (a) For each n N, let H n be a set in A such that H n H n+1. Show that P (H n ) monotonically converges to P ( k=1 H k) as n. (b) For each n

More information

Lecture 3: Multiple Regression

Lecture 3: Multiple Regression Lecture 3: Multiple Regression R.G. Pierse 1 The General Linear Model Suppose that we have k explanatory variables Y i = β 1 + β X i + β 3 X 3i + + β k X ki + u i, i = 1,, n (1.1) or Y i = β j X ji + u

More information

Linear Regression. Junhui Qian. October 27, 2014

Linear Regression. Junhui Qian. October 27, 2014 Linear Regression Junhui Qian October 27, 2014 Outline The Model Estimation Ordinary Least Square Method of Moments Maximum Likelihood Estimation Properties of OLS Estimator Unbiasedness Consistency Efficiency

More information

(a) (3 points) Construct a 95% confidence interval for β 2 in Equation 1.

(a) (3 points) Construct a 95% confidence interval for β 2 in Equation 1. Problem 1 (21 points) An economist runs the regression y i = β 0 + x 1i β 1 + x 2i β 2 + x 3i β 3 + ε i (1) The results are summarized in the following table: Equation 1. Variable Coefficient Std. Error

More information

WISE International Masters

WISE International Masters WISE International Masters ECONOMETRICS Instructor: Brett Graham INSTRUCTIONS TO STUDENTS 1 The time allowed for this examination paper is 2 hours. 2 This examination paper contains 32 questions. You are

More information

ECNS 561 Multiple Regression Analysis

ECNS 561 Multiple Regression Analysis ECNS 561 Multiple Regression Analysis Model with Two Independent Variables Consider the following model Crime i = β 0 + β 1 Educ i + β 2 [what else would we like to control for?] + ε i Here, we are taking

More information

Review of Panel Data Model Types Next Steps. Panel GLMs. Department of Political Science and Government Aarhus University.

Review of Panel Data Model Types Next Steps. Panel GLMs. Department of Political Science and Government Aarhus University. Panel GLMs Department of Political Science and Government Aarhus University May 12, 2015 1 Review of Panel Data 2 Model Types 3 Review and Looking Forward 1 Review of Panel Data 2 Model Types 3 Review

More information

Parametric Modelling of Over-dispersed Count Data. Part III / MMath (Applied Statistics) 1

Parametric Modelling of Over-dispersed Count Data. Part III / MMath (Applied Statistics) 1 Parametric Modelling of Over-dispersed Count Data Part III / MMath (Applied Statistics) 1 Introduction Poisson regression is the de facto approach for handling count data What happens then when Poisson

More information

Econometrics II. Seppo Pynnönen. Spring Department of Mathematics and Statistics, University of Vaasa, Finland

Econometrics II. Seppo Pynnönen. Spring Department of Mathematics and Statistics, University of Vaasa, Finland Department of Mathematics and Statistics, University of Vaasa, Finland Spring 2018 Part III Limited Dependent Variable Models As of Jan 30, 2017 1 Background 2 Binary Dependent Variable The Linear Probability

More information

Introduction to Econometrics Final Examination Fall 2006 Answer Sheet

Introduction to Econometrics Final Examination Fall 2006 Answer Sheet Introduction to Econometrics Final Examination Fall 2006 Answer Sheet Please answer all of the questions and show your work. If you think a question is ambiguous, clearly state how you interpret it before

More information

Introduction to Machine Learning. Lecture 2

Introduction to Machine Learning. Lecture 2 Introduction to Machine Learning Lecturer: Eran Halperin Lecture 2 Fall Semester Scribe: Yishay Mansour Some of the material was not presented in class (and is marked with a side line) and is given for

More information

Duration Analysis. Joan Llull

Duration Analysis. Joan Llull Duration Analysis Joan Llull Panel Data and Duration Models Barcelona GSE joan.llull [at] movebarcelona [dot] eu Introduction Duration Analysis 2 Duration analysis Duration data: how long has an individual

More information

Discrete Choice Modeling

Discrete Choice Modeling [Part 4] 1/43 Discrete Choice Modeling 0 Introduction 1 Summary 2 Binary Choice 3 Panel Data 4 Bivariate Probit 5 Ordered Choice 6 Count Data 7 Multinomial Choice 8 Nested Logit 9 Heterogeneity 10 Latent

More information

High-Throughput Sequencing Course

High-Throughput Sequencing Course High-Throughput Sequencing Course DESeq Model for RNA-Seq Biostatistics and Bioinformatics Summer 2017 Outline Review: Standard linear regression model (e.g., to model gene expression as function of an

More information

Nonlinear Models for Health and Medical Expenditure Data

Nonlinear Models for Health and Medical Expenditure Data 11 Nonlinear Models for Health and Medical Expenditure Data Heath sector variables are seldom continuous and fully observed. For example, they can be discrete (e.g., death), censored (e.g., health care

More information

Lecture notes to Chapter 11, Regression with binary dependent variables - probit and logit regression

Lecture notes to Chapter 11, Regression with binary dependent variables - probit and logit regression Lecture notes to Chapter 11, Regression with binary dependent variables - probit and logit regression Tore Schweder October 28, 2011 Outline Examples of binary respons variables Probit and logit - examples

More information

Ma 3/103: Lecture 24 Linear Regression I: Estimation

Ma 3/103: Lecture 24 Linear Regression I: Estimation Ma 3/103: Lecture 24 Linear Regression I: Estimation March 3, 2017 KC Border Linear Regression I March 3, 2017 1 / 32 Regression analysis Regression analysis Estimate and test E(Y X) = f (X). f is the

More information

,..., θ(2),..., θ(n)

,..., θ(2),..., θ(n) Likelihoods for Multivariate Binary Data Log-Linear Model We have 2 n 1 distinct probabilities, but we wish to consider formulations that allow more parsimonious descriptions as a function of covariates.

More information

1. Hypothesis testing through analysis of deviance. 3. Model & variable selection - stepwise aproaches

1. Hypothesis testing through analysis of deviance. 3. Model & variable selection - stepwise aproaches Sta 216, Lecture 4 Last Time: Logistic regression example, existence/uniqueness of MLEs Today s Class: 1. Hypothesis testing through analysis of deviance 2. Standard errors & confidence intervals 3. Model

More information

A Course in Applied Econometrics Lecture 18: Missing Data. Jeff Wooldridge IRP Lectures, UW Madison, August Linear model with IVs: y i x i u i,

A Course in Applied Econometrics Lecture 18: Missing Data. Jeff Wooldridge IRP Lectures, UW Madison, August Linear model with IVs: y i x i u i, A Course in Applied Econometrics Lecture 18: Missing Data Jeff Wooldridge IRP Lectures, UW Madison, August 2008 1. When Can Missing Data be Ignored? 2. Inverse Probability Weighting 3. Imputation 4. Heckman-Type

More information