Truncation and Censoring
|
|
- Reginald Allison
- 5 years ago
- Views:
Transcription
1 Truncation and Censoring Laura Magazzini Laura Magazzini Truncation and Censoring 1 / 35
2 Truncation and censoring Truncation: sample data are drawn from a subset of a larger population of interest Characteristic of the distribution from which the sample data are drawn Example: studies of income based on incomes above or below the poverty line (of limited usefulness for inference about the whole population) Censoring: values of the dependent variable in a certain range are all transformed to (or reported at) a single value Defect in the sample data Example: in studies of income, people below the poverty line are reported at the poverty line Truncation and censoring introduce similar distortion into conventional statistical results Laura Magazzini (@univr.it) Truncation and Censoring 2 / 35
3 Truncation Truncation Aim: infer the caracteristics of a full population from a sample drawn from a restricted population Example: characteristics of people with income above $100,000 Let Y be a continous random variable with pdf f (y). The conditional distribution of y given y > a (a a constant) is: f (y y > a) = In case of y normally distributed: where α = a µ σ f (y y > a) = f (y) Pr(y > a) 1 σ φ ( x µ ) σ 1 Φ(α) Laura Magazzini (@univr.it) Truncation and Censoring 3 / 35
4 Truncation Moments of truncated distributions E(Y y < a) < E(Y ) E(Y y > a) > E(Y ) V (Y trunc.) < V (Y ) Laura Magazzini (@univr.it) Truncation and Censoring 4 / 35
5 Truncation Moments of the truncated normal distribution Let y N(µ, σ 2 ) and a constant E(y truncation) = µ + σλ(α) Var(y truncation) = σ 2 [1 δ(α)] α = (a µ)/σ φ(α) is the standard normal density λ(α) is called inverse Mills ratio: λ(α) = φ(α)/[1 Φ(α)] λ(α) = φ(α)/φ(α) if truncation is y > a if truncation is y < a δ(α) = λ(α)[λ(α) α], where 0 < δ(α) < 1 for any α Laura Magazzini (@univr.it) Truncation and Censoring 5 / 35
6 Truncation Example: a truncated log-normal income distribution From New York Post (1987): The typical upper affluent American... makes $142,000 per year... The people surveyed had household income of at least $100,000 Does this tell us anything about the typical American?... only 2 percent of Americans make the grade Degree of truncation in the sample: 98% The $142,000 is probably quite far from the mean in the full population Assuming lognormally distributed income in the population (log of income has a normal distribution), the information can be employed to deduce the population mean income Let x = income and y = ln x E[y y > log 100] = µ + σφ(α) 1 Φ(α) By substituting E[x] = E[e y ] = e µ+σ2 /2, we get E[x] = $22, Statistical Abstract of the US listed average household income of about $25, 000 (relatively good estimate based on little information!) Laura Magazzini (@univr.it) Truncation and Censoring 6 / 35
7 The truncated regression model y Truncation i = x i β + ɛ i, ɛ i x i N(0, σ 2 ) Unit i is observed only if yi cross a threshold: { n.a. if y y i = i a yi if yi > a E[y i yi > a] = x i β + σλ(α i), with α i = (a x i β)/σ The marginal effect in the subpopulation is: E[y i yi > a] = β + σ(dλ(α i )/dα i ) α i x i x i =... = β(1 δ(α i )) Since 0 < δ(α i ) < 1, the marginal effect in the subpopulation is less than the corresponding coefficient If the interest is in the linear relationship between y and x (population), the β can be directly interpreted Laura Magazzini (@univr.it) Truncation and Censoring 7 / 35
8 Estimation Truncation and censoring Truncation OLS of y on x leads to inconsistent estimates The model is y i yi > a = E(y i y i > a) + ɛ i = x i β + σλ(α i) + ɛ i By construction, the error term is heteroskedastic Omitted variable bias (λ i is not included in the regression) In applications, it is usually found that the OLS estimates are biased toward zero Under the normality assumption, MLE can be obtained y µ φ( σ ) f (y y > a) = 1 σ 1 Φ(α) with α = a µ σ The log-likelihood can be written as log L = N i=1 [ ( log σ 1 yi x φ σ i β )] N log i=1 [ ( a x 1 Φ σ i β )] Laura Magazzini (@univr.it) Truncation and Censoring 8 / 35
9 Example: simulated data Y = x + ɛ/2, N = 100, a = 0 Truncation Laura Magazzini (@univr.it) Truncation and Censoring 9 / 35
10 Censored data Censored data Censored regression models generally apply when the variable to be explained is partly continuous but has positive probability mass at one or more points Assume there is a variable with quantitave meaning y and we are interested in E[y x] If y and x were observed for everyone in the population: standard regression methods (ordinary or nonlinear least squares) can be applied In the case of censored data, y is not observable for part of the population Conventional regression methods fail to account for the qualitative difference between limit (censored) and nonlimit (continuous) observations Top coding / corner solution outcome Laura Magazzini (@univr.it) Truncation and Censoring 10 / 35
11 Censored data Top coding: example Data generating process Let wealth denote actual family wealth, measured in thousands of dollars Suppose that wealth follows the linear regression model E[wealth x] = x β Censored data: we observe wealth only when wealth > 200 When wealth is smaller than 200 we know that it is, but we do not know the actual value of wealth Therefore observed wealth can be written as wealth = max(wealth, 200) Laura Magazzini (@univr.it) Truncation and Censoring 11 / 35
12 Censored data Top coding: example Estimation of β We assume that wealth given x has a homoskedastic normal distribution wealth = x β + ɛ, ɛ x N(0, σ 2 ) Recorded wealth is: wealth = max(wealth, 200) = max(x β + ɛ, 200) β is estimated via maximum likelihood using a mixture of discrete and continuous distributions (details later...) Laura Magazzini (@univr.it) Truncation and Censoring 12 / 35
13 Censored data Example: seat demanded and ticket sold Laura Magazzini Truncation and Censoring 13 / 35
14 Censored data Corner solution outcomes Still labeled censored regression models Pioneer work by Tobin (1958): household purchase of durable goods Let y be an observable choice or outcome describing some economic agent, such as an individual or a firm, with the following characteristics: y takes on the value zero with positive probability but is a continuous random variable over strictly positive values Examples: amount of life insurance coverage chosen by an individual, family contributions to an individual retirement account, and firm expenditures on research and development We can imagine economic agents solving an optimization problem, and for some agents the optimal choice will be the corner solution, y = 0 The issue here is not data observability, rather individual behaviour We are interested in features of the distribution of y given x, such as E[y x] and Pr(y = 0 x) Laura Magazzini (@univr.it) Truncation and Censoring 14 / 35
15 The censored normal distribution Censored data y N(µ, σ 2 ) Observed data are censored in a = 0: { y = 0 if y 0 y = y if y > 0 The distribution is a mixture of discrete and continuous distribution If y 0: f (y) = Pr(y = 0) = Pr(y 0) = Φ( µ/σ) = 1 Φ(µ/σ) If y > 0: f (y) = φ ( ) y µ σ ( ) E[y] = 0 Pr(y = 0) + E[y y > 0] Pr(y > 0) = (µ + σλ)φ 0 µ σ with λ = φ/φ Laura Magazzini (@univr.it) Truncation and Censoring 15 / 35
16 The censored regression model Tobit model (Tobin, 1958) Censored data Let y be a continuous variable (latent variable): y i = x i β + ɛ i, where ɛ x N(0, σ 2 ) The observed data y are Why not OLS? y i = max(0, y i ) = Estimates can be obtained by MLE { 0 if y i 0 if yi > 0 y i Laura Magazzini (@univr.it) Truncation and Censoring 16 / 35
17 Estimation Truncation and censoring Censored data A positive probability is assigned to the observations y i = 0: Pr(y i = 0 x i ) = Pr(yi 0 x i ) = Pr(x i β + ɛ i 0) = Pr(ɛ i x i β) = 1 Pr(ɛ i < x i β) ( x ) = 1 Φ i β σ The likelihood can be written as: L(β, σ 2 y) = ( ( x )) 1 Φ i β σ y i =0 y i >0 ( 1 Φ = y i =0 ( x )) i β σ y i >0 1 σ φ ( yi x i β ) σ 1 2πσ 2 e 1 2 ( yi x i ) 2 β Laura Magazzini (@univr.it) Truncation and Censoring 17 / 35 σ
18 Censored data Marginal effect in the tobit model In the case of censored data, β estimated from the tobit model can be employed to study the effect of x on E[y x] In the case of corner solution outcome, the estimated β are not sufficient since E[y x] and E[y x, y > 0] depend on β in a non-linear way E[y i x i ] x i = Φ ( x ) i β β σ E[y i x i ] x i = Pr(y i > 0) E[y i x i,y i >0] x i + E[y i x i, y i > 0] Pr[y i >0] x i A change in x i has two effects: (1) It affects the conditional mean of yi in the positive part of the distribution (2) It affects the probability that the observation will fall in the positive part of the distribution Laura Magazzini (@univr.it) Truncation and Censoring 18 / 35
19 Example: simulated data Y = x + ɛ/2, N = 100 Censored data Laura Magazzini (@univr.it) Truncation and Censoring 19 / 35
20 Some issues in specification Censored data Heteroschedasticity MLE is inconsistent However the problem can be approached directly and σ i considered in the likelihood function instead of σ. Specification of a particular model for σ i provides the empirical model for estimation Misspecification of Pr(y < 0) In the tobit model, a variable that increases the probability of an observation being a non-limit observation also increases the mean of the variable - Example: loss due to fire in buildings A more general model has been devised involving a decision equation and a regression equation for nonlimit observations Non-normality MLE is inconsistent Research is ongoing both on alternative estimators and on methods for testing this type of misspecification Laura Magazzini (@univr.it) Truncation and Censoring 20 / 35
21 Sample selection Truncation and censoring Sample selection What if observation is driven by a different process? (1) Data observability Saving function (in the population): saving = β 0 + β 1 income + β 2 age + β 3 married + β 4 kids + u Survey data only includes families whose household head was 45 years of age or older (2) Individual behaviour (Boyes, Hoffman, Low, 1989; Greene, 1992) y 1 = 1 if individual i defaults on a loan/credit card, 0 otherwise y 2 = 1 if individual i is granted a loan/credit card, 0 otherwise For a given individual, y 1 is not observed unless y 2 equals 1 Laura Magazzini (@univr.it) Truncation and Censoring 21 / 35
22 Sample selection Sample selection / incidental truncation Let y and z have a bivariate distribution with correlation ρ We are interested in the distribution of y given that another variable z exceeds a particular value Intuition: if y and z are positively correlated then the truncation of z should push the distribution of y to the right The truncated joint distribution is f (y, z z > a) = f (y, z) Pr(z > a) To obtain the incidentally truncated marginal density of y, we should integrate z out of this expression Laura Magazzini (@univr.it) Truncation and Censoring 22 / 35
23 Sample selection Moment of the incidentally truncated bivariate normal distribution Let y and z have a bivariate normal distribution with means µ y and µ z, standard deviations σ y and σ z, and correlation ρ E[y z > a] = µ y + ρσ y λ(α z ) V [y z > a] = σ 2 y [1 ρ 2 δ(α z )] α z = (a µ z )/σ z λ(α z ) = φ(α z )/[1 Φ(α z )] δ(α z ) = λ(α z )[λ(α z ) α z ] If the truncation is z < a, then λ(α z ) = φ(α z )/Φ(α z ) Laura Magazzini (@univr.it) Truncation and Censoring 23 / 35
24 Sample selection Example: A model of labor supply Consider a population of women where only a subsample is engaged in market employment We are interested in identifying the determinants of the labor supply for all women A simple model of female labor supply consists of 2 equations (1) Wage equation: the difference between a person s market wage and her reservation wage, as a function of characteristics such as age, education, number of children,... plus unobservables (2) Hours equation: The desired number of labor hours supplied depends on the wage, home characteristics (e.g. presence of small children), marital status,... plus unobservable Truncation: Equation 2 describes the desired hours, but an actual figure is observed only if the individual is working, i.e. when the market wage exceeds the reservation wage The hours variable is incidentally truncated Laura Magazzini (@univr.it) Truncation and Censoring 24 / 35
25 Sample selection Example: A model of labor supply When OLS on the working sample? Assume working women are chosen randomly If the working subsample has similar endowments of characteristics (both obs. & unobs.) as the nonworking sample, OLS is an option BUT the decision to work is not random: the working and nonworking sample potentially have different characteristics When the relationship is purely trough observables, appropriate conditioning variables can be included in the relevant equation If unobservable characteristics affecting the work decision are correlated with the unobservable characteristics affecting wage, then a relationship is determined that cannot be tackle by including appropriate controls A bias is induced due to sample selection Laura Magazzini (@univr.it) Truncation and Censoring 25 / 35
26 Sample selection Regression in a model of selection (1) Equation that determines sample selection The equation of primary interest is z i = w i γ + u i y i = x i β + ɛ i where y i is observed only when zi data are not available) is greater than zero (otherwise This model is closely related to the Tobit model, although it is less restrictive: the parameters explaining the censoring are not constrained to equal those explaining the variation in the observed dependent variable. For this reason the model is also known as Tobit type two. Laura Magazzini (@univr.it) Truncation and Censoring 26 / 35
27 Sample selection Regression in a model of selection (2) If u i and ɛ i have a bivariate normal distribution with zero mean and correlation ρ, E[y i y i is observed] = E[y i zi > 0] = E[y i u i > w i γ] = x i β + E[ɛ i u i > w i γ] = x i β + ρσ ɛ λ i (α u ) where α z = w i γ/σ u and λ(α u ) = φ(α u )/Φ(α u ) So, the regression model can be written as y i z i > 0 = E[y i z i > 0] + υ i = x i β + ρσ ɛ λ i (α u ) + υ i Laura Magazzini (@univr.it) Truncation and Censoring 27 / 35
28 Sample selection Regression in a model of selection (3) E[y i z i > 0] = x i β + ρσ ɛ λ i (α u ) OLS regression using the observed data will lead to inconsistent estimates (omitted variable bias) The marginal effect of the regressors on y i in the observed sample consists of two components: Direct effect on the mean of y i (β) In addition, if the variable appears in the probability that zi is positive, then it will influence y i through its presence in λ i E[y i zi ( > 0] ρσɛ = β k + γ k x ik σ u ) δ i (α u ) Most often zi is not observed, rather we can infer its sign but not its magnitude Since there is no information on the scale of z, the disturbance variance in the selection equation cannot be estimated (we let σu 2 = 1) Laura Magazzini (@univr.it) Truncation and Censoring 28 / 35
29 Sample selection Regression in a model of selection (4) Selection mechanisms z i = w i γ + u i, where we observe z i = 1 if zi > 0 and 0 otherwise. Pr(z i = 1 w i ) = Φ(w i γ) Pr(z i = 0 w i ) = 1 Φ(w i γ) Regression model y i = x i β + ɛ i, where y i is observed only when z i is equal to one (otherwise data are not available) (u i, ɛ i ) bivariate normal[0, 0, 1, σ ɛ, ρ] Laura Magazzini (@univr.it) Truncation and Censoring 29 / 35
30 Sample selection Estimation Least squares using the observed data produces incosistent estimates of β (omitted variable) Least squares regression of y on x and λ would be a consistent estimator However, even if λ i were observed, OLS would be inefficient: υ i are heteroskedastic Maximum likelihood estimation can be applied Heckman (1979) proposed a two-step procedure Laura Magazzini (@univr.it) Truncation and Censoring 30 / 35
31 Maximum likelihood estimation Sample selection The log likelihood for observation i, log L i = l i, can be written as: If y i is not observed l i = log Φ( w i γ) If y i is observed ( ) w i l i = log Φ γ + (y i x i β)ρ/σ ɛ 1 ( yi x 1 ρ 2 2 σ ɛ i β ) log( 2πσ ɛ ) σ ɛ and ρ are not directly estimated (they have to be greater than 0) Directly estimated are log σ ɛ and atanhρ: atanhρ = 1 ( ) 1 + ρ 2 log 1 ρ Estimation would be simplified if ρ = 0 Laura Magazzini (@univr.it) Truncation and Censoring 31 / 35
32 Sample selection Two-step procedure Heckman (1979) y i z i > 0 = E[y i z i > 0] + υ i = x i β + ρσ ɛ λ i (α u ) + υ i 1 Estimate the probit equation by MLE to obtain estimates of γ. For each observation in the selected sample, compute ˆλ i (inverse Mills ratio) 2 Estimate β and β λ = ρσ ɛ by least squares regression of y on x and ˆλ Laura Magazzini (@univr.it) Truncation and Censoring 32 / 35
33 Sample selection Estimators of the variance and standard errors Second step standard errors need to be adjusted to account for the first step estimation The estimation of σ ɛ needs to be adjusted: At each observation, the true conditional variance of the disturbance would be σ 2 i = σ 2 ɛ(1 ρ 2 δ i ) A consistent estimator of σ 2 ɛ is given by: ˆσ 2 ɛ = 1 n e e + ˆ δb 2 λ To test hypothesis, an estimate of the asymptotic covariance matrix of the coefficients (including β λ ) is needed Two problems arise: (1) the disturbance terms υ i is heteroskedastic; (2) there are unknown parameters in λ i Formulas are rather cumbersome, but can be calculated using the matrix of independent variables, the sample estimates of σ 2 ɛ and ρ, and the assumed known values of λ i and δ i Laura Magazzini (@univr.it) Truncation and Censoring 33 / 35
34 Two-step procedure Discussion Truncation and censoring Sample selection Identification: exclusion restriction Although the inverse Mills ration is non linear in the single index w i γ, the function mapping this index into the inverse Mills ratio is linear for certain ranges of the index Accordingly, the inclusion of additional variables in w i in the first step can be important for identification of the second step estimates In real world, there are few cadidates for simultaneous inclusion in w i and exclusion from x i Inclusion of the inverse Mills ratio into the equation of interest is driven by the normality assumption Recent research includes specific attempts to move away from the normality assumption: y i z i > 0 = x i β + µ(w i γ) + υ i where µ(w i γ) is called selectivity correction Laura Magazzini (@univr.it) Truncation and Censoring 34 / 35
35 Sample selection Selection in qualitative response models The problem of sample selection has been modeled in other settings besides the linear regression model Binary choice model have been considered, but also count data models For example in the case of the Poisson model: y i ɛ i Poisson(λ i ) log λ i = x i β + ɛ i (y i, x i ) are only observed when z i = 1, where zi = w i γ + u i and z i = 1 if zi > 0, 0 otherwise Assume that (ɛ i, u i ) have a bivariate normal distribution with non-zero correlation Selection affects the mean (and the variance) of y i and, in the observed data, y i no longer has a Poisson distribution Laura Magazzini (@univr.it) Truncation and Censoring 35 / 35
Econometrics Master in Business and Quantitative Methods
Econometrics Master in Business and Quantitative Methods Helena Veiga Universidad Carlos III de Madrid This chapter deals with truncation and censoring. Truncation occurs when the sample data are drawn
More informationNon-linear panel data modeling
Non-linear panel data modeling Laura Magazzini University of Verona laura.magazzini@univr.it http://dse.univr.it/magazzini May 2010 Laura Magazzini (@univr.it) Non-linear panel data modeling May 2010 1
More informationGibbs Sampling in Latent Variable Models #1
Gibbs Sampling in Latent Variable Models #1 Econ 690 Purdue University Outline 1 Data augmentation 2 Probit Model Probit Application A Panel Probit Panel Probit 3 The Tobit Model Example: Female Labor
More informationAGEC 661 Note Fourteen
AGEC 661 Note Fourteen Ximing Wu 1 Selection bias 1.1 Heckman s two-step model Consider the model in Heckman (1979) Y i = X iβ + ε i, D i = I {Z iγ + η i > 0}. For a random sample from the population,
More informationTobit and Selection Models
Tobit and Selection Models Class Notes Manuel Arellano November 24, 2008 Censored Regression Illustration : Top-coding in wages Suppose Y log wages) are subject to top coding as is often the case with
More informationLecture 11/12. Roy Model, MTE, Structural Estimation
Lecture 11/12. Roy Model, MTE, Structural Estimation Economics 2123 George Washington University Instructor: Prof. Ben Williams Roy model The Roy model is a model of comparative advantage: Potential earnings
More informationApplied Econometrics (MSc.) Lecture 3 Instrumental Variables
Applied Econometrics (MSc.) Lecture 3 Instrumental Variables Estimation - Theory Department of Economics University of Gothenburg December 4, 2014 1/28 Why IV estimation? So far, in OLS, we assumed independence.
More informationRecent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data
Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data July 2012 Bangkok, Thailand Cosimo Beverelli (World Trade Organization) 1 Content a) Censoring and truncation b)
More informationMore on Roy Model of Self-Selection
V. J. Hotz Rev. May 26, 2007 More on Roy Model of Self-Selection Results drawn on Heckman and Sedlacek JPE, 1985 and Heckman and Honoré, Econometrica, 1986. Two-sector model in which: Agents are income
More informationMid-term exam Practice problems
Mid-term exam Practice problems Most problems are short answer problems. You receive points for the answer and the explanation. Full points require both, unless otherwise specified. Explaining your answer
More informationSample Selection. In a UW Econ Ph.D. Model. April 25, Kevin Kasberg and Alex Stevens ECON 5360 Dr. Aadland
Sample Selection In a UW Econ Ph.D. Model Kevin Kasberg and Alex Stevens ECON 5360 Dr. Aadland April 25, 2013 Sample Selection Bias Sample Selection Bias (Incidental Truncation): When estimates from a
More informationTreatment Effects with Normal Disturbances in sampleselection Package
Treatment Effects with Normal Disturbances in sampleselection Package Ott Toomet University of Washington December 7, 017 1 The Problem Recent decades have seen a surge in interest for evidence-based policy-making.
More informationMaximum Likelihood Estimation
Maximum Likelihood Estimation Additional Topics Raphael Cunha Program in Statistics and Methodology PRISM Department of Political Science The Ohio State University cunha.6@osu.edu March 28, 2014 Overview
More informationGibbs Sampling in Endogenous Variables Models
Gibbs Sampling in Endogenous Variables Models Econ 690 Purdue University Outline 1 Motivation 2 Identification Issues 3 Posterior Simulation #1 4 Posterior Simulation #2 Motivation In this lecture we take
More informationChapter 11. Regression with a Binary Dependent Variable
Chapter 11 Regression with a Binary Dependent Variable 2 Regression with a Binary Dependent Variable (SW Chapter 11) So far the dependent variable (Y) has been continuous: district-wide average test score
More information1 Static (one period) model
1 Static (one period) model The problem: max U(C; L; X); s.t. C = Y + w(t L) and L T: The Lagrangian: L = U(C; L; X) (C + wl M) (L T ); where M = Y + wt The FOCs: U C (C; L; X) = and U L (C; L; X) w +
More informationApplied Economics. Regression with a Binary Dependent Variable. Department of Economics Universidad Carlos III de Madrid
Applied Economics Regression with a Binary Dependent Variable Department of Economics Universidad Carlos III de Madrid See Stock and Watson (chapter 11) 1 / 28 Binary Dependent Variables: What is Different?
More informationNinth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis"
Ninth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis" June 2013 Bangkok, Thailand Cosimo Beverelli and Rainer Lanz (World Trade Organization) 1 Selected econometric
More informationIntroduction to GSEM in Stata
Introduction to GSEM in Stata Christopher F Baum ECON 8823: Applied Econometrics Boston College, Spring 2016 Christopher F Baum (BC / DIW) Introduction to GSEM in Stata Boston College, Spring 2016 1 /
More informationApplied Health Economics (for B.Sc.)
Applied Health Economics (for B.Sc.) Helmut Farbmacher Department of Economics University of Mannheim Autumn Semester 2017 Outlook 1 Linear models (OLS, Omitted variables, 2SLS) 2 Limited and qualitative
More informationA Course in Applied Econometrics Lecture 14: Control Functions and Related Methods. Jeff Wooldridge IRP Lectures, UW Madison, August 2008
A Course in Applied Econometrics Lecture 14: Control Functions and Related Methods Jeff Wooldridge IRP Lectures, UW Madison, August 2008 1. Linear-in-Parameters Models: IV versus Control Functions 2. Correlated
More informationEconomics 536 Lecture 21 Counts, Tobit, Sample Selection, and Truncation
University of Illinois Fall 2016 Department of Economics Roger Koenker Economics 536 Lecture 21 Counts, Tobit, Sample Selection, and Truncation The simplest of this general class of models is Tobin s (1958)
More information1. Basic Model of Labor Supply
Static Labor Supply. Basic Model of Labor Supply.. Basic Model In this model, the economic unit is a family. Each faimily maximizes U (L, L 2,.., L m, C, C 2,.., C n ) s.t. V + w i ( L i ) p j C j, C j
More informationFall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.
1. Let P be a probability measure on a collection of sets A. (a) For each n N, let H n be a set in A such that H n H n+1. Show that P (H n ) monotonically converges to P ( k=1 H k) as n. (b) For each n
More informationAppendix A: The time series behavior of employment growth
Unpublished appendices from The Relationship between Firm Size and Firm Growth in the U.S. Manufacturing Sector Bronwyn H. Hall Journal of Industrial Economics 35 (June 987): 583-606. Appendix A: The time
More informationLecture 12: Application of Maximum Likelihood Estimation:Truncation, Censoring, and Corner Solutions
Econ 513, USC, Department of Economics Lecture 12: Application of Maximum Likelihood Estimation:Truncation, Censoring, and Corner Solutions I Introduction Here we look at a set of complications with the
More informationDealing With Endogeneity
Dealing With Endogeneity Junhui Qian December 22, 2014 Outline Introduction Instrumental Variable Instrumental Variable Estimation Two-Stage Least Square Estimation Panel Data Endogeneity in Econometrics
More informationCENSORED DATA AND CENSORED NORMAL REGRESSION
CENSORED DATA AND CENSORED NORMAL REGRESSION Data censoring comes in many forms: binary censoring, interval censoring, and top coding are the most common. They all start with an underlying linear model
More informationEconomics 671: Applied Econometrics Department of Economics, Finance and Legal Studies University of Alabama
Problem Set #1 (Random Data Generation) 1. Generate =500random numbers from both the uniform 1 ( [0 1], uniformbetween zero and one) and exponential exp ( ) (set =2and let [0 1]) distributions. Plot the
More informationApplied Econometrics Lecture 1
Lecture 1 1 1 Università di Urbino Università di Urbino PhD Programme in Global Studies Spring 2018 Outline of this module Beyond OLS (very brief sketch) Regression and causality: sources of endogeneity
More informationEconometric Analysis of Games 1
Econometric Analysis of Games 1 HT 2017 Recap Aim: provide an introduction to incomplete models and partial identification in the context of discrete games 1. Coherence & Completeness 2. Basic Framework
More informationECON 4160, Autumn term Lecture 1
ECON 4160, Autumn term 2017. Lecture 1 a) Maximum Likelihood based inference. b) The bivariate normal model Ragnar Nymoen University of Oslo 24 August 2017 1 / 54 Principles of inference I Ordinary least
More informationModels of Qualitative Binary Response
Models of Qualitative Binary Response Probit and Logit Models October 6, 2015 Dependent Variable as a Binary Outcome Suppose we observe an economic choice that is a binary signal. The focus on the course
More informationA Monte Carlo Comparison of Various Semiparametric Type-3 Tobit Estimators
ANNALS OF ECONOMICS AND FINANCE 4, 125 136 (2003) A Monte Carlo Comparison of Various Semiparametric Type-3 Tobit Estimators Insik Min Department of Economics, Texas A&M University E-mail: i0m5376@neo.tamu.edu
More informationLimited Dependent Variables and Panel Data
and Panel Data June 24 th, 2009 Structure 1 2 Many economic questions involve the explanation of binary variables, e.g.: explaining the participation of women in the labor market explaining retirement
More information5. Let W follow a normal distribution with mean of μ and the variance of 1. Then, the pdf of W is
Practice Final Exam Last Name:, First Name:. Please write LEGIBLY. Answer all questions on this exam in the space provided (you may use the back of any page if you need more space). Show all work but do
More informationWISE International Masters
WISE International Masters ECONOMETRICS Instructor: Brett Graham INSTRUCTIONS TO STUDENTS 1 The time allowed for this examination paper is 2 hours. 2 This examination paper contains 32 questions. You are
More informationA Guide to Modern Econometric:
A Guide to Modern Econometric: 4th edition Marno Verbeek Rotterdam School of Management, Erasmus University, Rotterdam B 379887 )WILEY A John Wiley & Sons, Ltd., Publication Contents Preface xiii 1 Introduction
More informationLimited Dependent Variables and Panel Data
Limited Dependent Variables and Panel Data Logit, Probit and Friends Benjamin Bittschi Sebastian Koch Outline Binary dependent variables Logit Fixed Effects Models Probit Random Effects Models Censored
More informationEconometric Analysis of Cross Section and Panel Data
Econometric Analysis of Cross Section and Panel Data Jeffrey M. Wooldridge / The MIT Press Cambridge, Massachusetts London, England Contents Preface Acknowledgments xvii xxiii I INTRODUCTION AND BACKGROUND
More informationEconometrics Lecture 5: Limited Dependent Variable Models: Logit and Probit
Econometrics Lecture 5: Limited Dependent Variable Models: Logit and Probit R. G. Pierse 1 Introduction In lecture 5 of last semester s course, we looked at the reasons for including dichotomous variables
More information1 Motivation for Instrumental Variable (IV) Regression
ECON 370: IV & 2SLS 1 Instrumental Variables Estimation and Two Stage Least Squares Econometric Methods, ECON 370 Let s get back to the thiking in terms of cross sectional (or pooled cross sectional) data
More informationRe-estimating Euler Equations
Re-estimating Euler Equations Olga Gorbachev September 1, 2016 Abstract I estimate an extended version of the incomplete markets consumption model allowing for heterogeneity in discount factors, nonseparable
More information16. Tobit and Selection A. Colin Cameron Pravin K. Trivedi Copyright 2006
16. Tobit and Selection A. Colin Cameron Pravin K. Trivedi Copyright 2006 These slides were prepared in 1999. They cover material similar to Sections 16.2-16.3 and 16.5 of our subsequent book Microeconometrics:
More informationProgram Evaluation in the Presence of Strategic Interactions
Program Evaluation in the Presence of Strategic Interactions Daron Acemoglu, Camilo García-Jimeno, Rossa O Keeffe-O Donovan October 13, 2017 Abstract Recent improvements in program evaluation techniques
More informationWrite your identification number on each paper and cover sheet (the number stated in the upper right hand corner on your exam cover).
Formatmall skapad: 2011-12-01 Uppdaterad: 2015-03-06 / LP Department of Economics Course name: Empirical Methods in Economics 2 Course code: EC2404 Semester: Spring 2015 Type of exam: MAIN Examiner: Peter
More informationSelection on Observables: Propensity Score Matching.
Selection on Observables: Propensity Score Matching. Department of Economics and Management Irene Brunetti ireneb@ec.unipi.it 24/10/2017 I. Brunetti Labour Economics in an European Perspective 24/10/2017
More informationOrdinary Least Squares Regression
Ordinary Least Squares Regression Goals for this unit More on notation and terminology OLS scalar versus matrix derivation Some Preliminaries In this class we will be learning to analyze Cross Section
More informationEconometrics II. Seppo Pynnönen. Spring Department of Mathematics and Statistics, University of Vaasa, Finland
Department of Mathematics and Statistics, University of Vaasa, Finland Spring 2018 Part III Limited Dependent Variable Models As of Jan 30, 2017 1 Background 2 Binary Dependent Variable The Linear Probability
More informationApplied Quantitative Methods II
Applied Quantitative Methods II Lecture 4: OLS and Statistics revision Klára Kaĺıšková Klára Kaĺıšková AQM II - Lecture 4 VŠE, SS 2016/17 1 / 68 Outline 1 Econometric analysis Properties of an estimator
More informationWISE International Masters
WISE International Masters ECONOMETRICS Instructor: Brett Graham INSTRUCTIONS TO STUDENTS 1 The time allowed for this examination paper is 2 hours. 2 This examination paper contains 32 questions. You are
More informationLecture 4: Linear panel models
Lecture 4: Linear panel models Luc Behaghel PSE February 2009 Luc Behaghel (PSE) Lecture 4 February 2009 1 / 47 Introduction Panel = repeated observations of the same individuals (e.g., rms, workers, countries)
More informationApplied Microeconometrics (L5): Panel Data-Basics
Applied Microeconometrics (L5): Panel Data-Basics Nicholas Giannakopoulos University of Patras Department of Economics ngias@upatras.gr November 10, 2015 Nicholas Giannakopoulos (UPatras) MSc Applied Economics
More informationInternal vs. external validity. External validity. This section is based on Stock and Watson s Chapter 9.
Section 7 Model Assessment This section is based on Stock and Watson s Chapter 9. Internal vs. external validity Internal validity refers to whether the analysis is valid for the population and sample
More informationEstimating and Using Propensity Score in Presence of Missing Background Data. An Application to Assess the Impact of Childbearing on Wellbeing
Estimating and Using Propensity Score in Presence of Missing Background Data. An Application to Assess the Impact of Childbearing on Wellbeing Alessandra Mattei Dipartimento di Statistica G. Parenti Università
More information2) For a normal distribution, the skewness and kurtosis measures are as follows: A) 1.96 and 4 B) 1 and 2 C) 0 and 3 D) 0 and 0
Introduction to Econometrics Midterm April 26, 2011 Name Student ID MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. (5,000 credit for each correct
More informationWooldridge, Introductory Econometrics, 4th ed. Chapter 15: Instrumental variables and two stage least squares
Wooldridge, Introductory Econometrics, 4th ed. Chapter 15: Instrumental variables and two stage least squares Many economic models involve endogeneity: that is, a theoretical relationship does not fit
More informationDiscrete Choice Modeling
[Part 4] 1/43 Discrete Choice Modeling 0 Introduction 1 Summary 2 Binary Choice 3 Panel Data 4 Bivariate Probit 5 Ordered Choice 6 Count Data 7 Multinomial Choice 8 Nested Logit 9 Heterogeneity 10 Latent
More informationInference and Regression
Name Inference and Regression Final Examination, 2015 Department of IOMS This course and this examination are governed by the Stern Honor Code. Instructions Please write your name at the top of this page.
More informationCh 7: Dummy (binary, indicator) variables
Ch 7: Dummy (binary, indicator) variables :Examples Dummy variable are used to indicate the presence or absence of a characteristic. For example, define female i 1 if obs i is female 0 otherwise or male
More informationWISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, Academic Year Exam Version: A
WISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, 2015-16 Academic Year Exam Version: A INSTRUCTIONS TO STUDENTS 1 The time allowed for this examination paper is 2 hours. 2 This
More informationMax. Likelihood Estimation. Outline. Econometrics II. Ricardo Mora. Notes. Notes
Maximum Likelihood Estimation Econometrics II Department of Economics Universidad Carlos III de Madrid Máster Universitario en Desarrollo y Crecimiento Económico Outline 1 3 4 General Approaches to Parameter
More informationDEEP, University of Lausanne Lectures on Econometric Analysis of Count Data Pravin K. Trivedi May 2005
DEEP, University of Lausanne Lectures on Econometric Analysis of Count Data Pravin K. Trivedi May 2005 The lectures will survey the topic of count regression with emphasis on the role on unobserved heterogeneity.
More informationEconometrics I Lecture 7: Dummy Variables
Econometrics I Lecture 7: Dummy Variables Mohammad Vesal Graduate School of Management and Economics Sharif University of Technology 44716 Fall 1397 1 / 27 Introduction Dummy variable: d i is a dummy variable
More informationReview of Econometrics
Review of Econometrics Zheng Tian June 5th, 2017 1 The Essence of the OLS Estimation Multiple regression model involves the models as follows Y i = β 0 + β 1 X 1i + β 2 X 2i + + β k X ki + u i, i = 1,...,
More informationSimultaneous Equation Models
Simultaneous Equation Models Suppose we are given the model y 1 Y 1 1 X 1 1 u 1 where E X 1 u 1 0 but E Y 1 u 1 0 We can often think of Y 1 (and more, say Y 1 )asbeing determined as part of a system of
More informationEconometrics of Panel Data
Econometrics of Panel Data Jakub Mućk Meeting # 6 Jakub Mućk Econometrics of Panel Data Meeting # 6 1 / 36 Outline 1 The First-Difference (FD) estimator 2 Dynamic panel data models 3 The Anderson and Hsiao
More informationRegression with a Binary Dependent Variable (SW Ch. 9)
Regression with a Binary Dependent Variable (SW Ch. 9) So far the dependent variable (Y) has been continuous: district-wide average test score traffic fatality rate But we might want to understand the
More informationLink to Paper. The latest iteration can be found at:
Link to Paper Introduction The latest iteration can be found at: http://learneconometrics.com/pdf/gc2017/collin_gretl_170523.pdf BKW dignostics in GRETL: Interpretation and Performance Oklahoma State University
More informationEconometrics Problem Set 3
Econometrics Problem Set 3 Conceptual Questions 1. This question refers to the estimated regressions in table 1 computed using data for 1988 from the U.S. Current Population Survey. The data set consists
More informationBinary Choice Models Probit & Logit. = 0 with Pr = 0 = 1. decision-making purchase of durable consumer products unemployment
BINARY CHOICE MODELS Y ( Y ) ( Y ) 1 with Pr = 1 = P = 0 with Pr = 0 = 1 P Examples: decision-making purchase of durable consumer products unemployment Estimation with OLS? Yi = Xiβ + εi Problems: nonsense
More informationBinary Dependent Variable. Regression with a
Beykent University Faculty of Business and Economics Department of Economics Econometrics II Yrd.Doç.Dr. Özgür Ömer Ersin Regression with a Binary Dependent Variable (SW Chapter 11) SW Ch. 11 1/59 Regression
More informationModeling Binary Outcomes: Logit and Probit Models
Modeling Binary Outcomes: Logit and Probit Models Eric Zivot December 5, 2009 Motivating Example: Women s labor force participation y i = 1 if married woman is in labor force = 0 otherwise x i k 1 = observed
More informationProblem set - Selection and Diff-in-Diff
Problem set - Selection and Diff-in-Diff 1. You want to model the wage equation for women You consider estimating the model: ln wage = α + β 1 educ + β 2 exper + β 3 exper 2 + ɛ (1) Read the data into
More informationDepartment of Agricultural Economics. PhD Qualifier Examination. May 2009
Department of Agricultural Economics PhD Qualifier Examination May 009 Instructions: The exam consists of six questions. You must answer all questions. If you need an assumption to complete a question,
More informationExercise sheet 6 Models with endogenous explanatory variables
Exercise sheet 6 Models with endogenous explanatory variables Note: Some of the exercises include estimations and references to the data files. Use these to compare them to the results you obtained with
More informationSyllabus. By Joan Llull. Microeconometrics. IDEA PhD Program. Fall Chapter 1: Introduction and a Brief Review of Relevant Tools
Syllabus By Joan Llull Microeconometrics. IDEA PhD Program. Fall 2017 Chapter 1: Introduction and a Brief Review of Relevant Tools I. Overview II. Maximum Likelihood A. The Likelihood Principle B. The
More informationIntroduction to Linear Regression Analysis
Introduction to Linear Regression Analysis Samuel Nocito Lecture 1 March 2nd, 2018 Econometrics: What is it? Interaction of economic theory, observed data and statistical methods. The science of testing
More informationLinear Regression with 1 Regressor. Introduction to Econometrics Spring 2012 Ken Simons
Linear Regression with 1 Regressor Introduction to Econometrics Spring 2012 Ken Simons Linear Regression with 1 Regressor 1. The regression equation 2. Estimating the equation 3. Assumptions required for
More informationUltra High Dimensional Variable Selection with Endogenous Variables
1 / 39 Ultra High Dimensional Variable Selection with Endogenous Variables Yuan Liao Princeton University Joint work with Jianqing Fan Job Market Talk January, 2012 2 / 39 Outline 1 Examples of Ultra High
More informationLabor Supply and the Two-Step Estimator
Labor Supply and the Two-Step Estimator James J. Heckman University of Chicago Econ 312 This draft, April 7, 2006 In this lecture, we look at a labor supply model and discuss various approaches to identify
More informationSTOCKHOLM UNIVERSITY Department of Economics Course name: Empirical Methods Course code: EC40 Examiner: Lena Nekby Number of credits: 7,5 credits Date of exam: Friday, June 5, 009 Examination time: 3 hours
More informationLinear Models and Estimation by Least Squares
Linear Models and Estimation by Least Squares Jin-Lung Lin 1 Introduction Causal relation investigation lies in the heart of economics. Effect (Dependent variable) cause (Independent variable) Example:
More informationWomen. Sheng-Kai Chang. Abstract. In this paper a computationally practical simulation estimator is proposed for the twotiered
Simulation Estimation of Two-Tiered Dynamic Panel Tobit Models with an Application to the Labor Supply of Married Women Sheng-Kai Chang Abstract In this paper a computationally practical simulation estimator
More informationA Course on Advanced Econometrics
A Course on Advanced Econometrics Yongmiao Hong The Ernest S. Liu Professor of Economics & International Studies Cornell University Course Introduction: Modern economies are full of uncertainties and risk.
More informationStatistical Analysis of List Experiments
Statistical Analysis of List Experiments Graeme Blair Kosuke Imai Princeton University December 17, 2010 Blair and Imai (Princeton) List Experiments Political Methodology Seminar 1 / 32 Motivation Surveys
More information1. Regressions and Regression Models. 2. Model Example. EEP/IAS Introductory Applied Econometrics Fall Erin Kelley Section Handout 1
1. Regressions and Regression Models Simply put, economists use regression models to study the relationship between two variables. If Y and X are two variables, representing some population, we are interested
More informationIntermediate Econometrics
Intermediate Econometrics Heteroskedasticity Text: Wooldridge, 8 July 17, 2011 Heteroskedasticity Assumption of homoskedasticity, Var(u i x i1,..., x ik ) = E(u 2 i x i1,..., x ik ) = σ 2. That is, the
More informationECON Introductory Econometrics. Lecture 11: Binary dependent variables
ECON4150 - Introductory Econometrics Lecture 11: Binary dependent variables Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 11 Lecture Outline 2 The linear probability model Nonlinear probability
More informationIdentification and Estimation of Nonlinear Dynamic Panel Data. Models with Unobserved Covariates
Identification and Estimation of Nonlinear Dynamic Panel Data Models with Unobserved Covariates Ji-Liang Shiu and Yingyao Hu April 3, 2010 Abstract This paper considers nonparametric identification of
More informationChapter 8 Heteroskedasticity
Chapter 8 Walter R. Paczkowski Rutgers University Page 1 Chapter Contents 8.1 The Nature of 8. Detecting 8.3 -Consistent Standard Errors 8.4 Generalized Least Squares: Known Form of Variance 8.5 Generalized
More informationChapter 6 Stochastic Regressors
Chapter 6 Stochastic Regressors 6. Stochastic regressors in non-longitudinal settings 6.2 Stochastic regressors in longitudinal settings 6.3 Longitudinal data models with heterogeneity terms and sequentially
More informationUsing EViews Vox Principles of Econometrics, Third Edition
Using EViews Vox Principles of Econometrics, Third Edition WILLIAM E. GRIFFITHS University of Melbourne R. CARTER HILL Louisiana State University GUAY С LIM University of Melbourne JOHN WILEY & SONS, INC
More informationPhD/MA Econometrics Examination January 2012 PART A
PhD/MA Econometrics Examination January 2012 PART A ANSWER ANY TWO QUESTIONS IN THIS SECTION NOTE: (1) The indicator function has the properties: (2) Question 1 Let, [defined as if using the indicator
More informationWrite your identification number on each paper and cover sheet (the number stated in the upper right hand corner on your exam cover).
STOCKHOLM UNIVERSITY Department of Economics Course name: Empirical Methods in Economics 2 Course code: EC2402 Examiner: Peter Skogman Thoursie Number of credits: 7,5 credits (hp) Date of exam: Saturday,
More informationStatistical methods for Education Economics
Statistical methods for Education Economics Massimiliano Bratti http://www.economia.unimi.it/bratti Course of Education Economics Faculty of Political Sciences, University of Milan Academic Year 2007-08
More informationEconomics 241B Estimation with Instruments
Economics 241B Estimation with Instruments Measurement Error Measurement error is de ned as the error resulting from the measurement of a variable. At some level, every variable is measured with error.
More informationIdentification and Estimation Using Heteroscedasticity Without Instruments: The Binary Endogenous Regressor Case
Identification and Estimation Using Heteroscedasticity Without Instruments: The Binary Endogenous Regressor Case Arthur Lewbel Boston College Original December 2016, revised July 2017 Abstract Lewbel (2012)
More informationWooldridge, Introductory Econometrics, 3d ed. Chapter 9: More on specification and data problems
Wooldridge, Introductory Econometrics, 3d ed. Chapter 9: More on specification and data problems Functional form misspecification We may have a model that is correctly specified, in terms of including
More informationECON5115. Solution Proposal for Problem Set 4. Vibeke Øi and Stefan Flügel
ECON5115 Solution Proposal for Problem Set 4 Vibeke Øi (vo@nupi.no) and Stefan Flügel (stefanflu@gmx.de) Problem 1 (i) The standardized normal density of u has the nice property This helps to show that
More information