PERSON OR PLACE? Parametric and Semiparametric Estimates of Intrametropolitan Earnings Variation. John B. Engberg and Taeil Kim

Size: px
Start display at page:

Download "PERSON OR PLACE? Parametric and Semiparametric Estimates of Intrametropolitan Earnings Variation. John B. Engberg and Taeil Kim"

Transcription

1 Institute for Research on Poverty Discussion Paper no PERSON OR PLACE? Parametric and Semiparametric Estimates of Intrametropolitan Earnings Variation John B. Engberg and Taeil Kim The Heinz School of Public Policy and Management Carnegie Mellon University May 1996 We have benefitted from comments by Glen Cain, Guido Imbens, and from participants at the Carnegie Mellon/University of Pittsburgh Applied Microeconomics Workshop and the Institute for Research on Poverty Seminar. This research was supported by the U.S. Department of Health and Human Services through the Institute for Research on Poverty (Engberg) and a HUD Doctoral Research Grant (Kim). Points of view or opinions expressed in this paper are those of the authors and do not necessarily represent either the U.S. Department of Housing and Urban Development, the U.S. Department of Health and Human Services, or the Institute for Research on Poverty. IRP publications (discussion papers, special reports, and the newsletter Focus) are now available electronically. The IRP Web Site can be accessed at the following address:

2 Abstract Some scholars have attributed earnings differences among locations to labor market conditions ( place effects ) whereas others have focused on the skill level of residents ( person effects ). We estimate a variety of selection models in an effort to detect differences in labor market conditions while controlling for differences in skill levels. We maintain the assumption that there are no barriers to mobility within a metropolitan area for highly educated white men, which implies that intra-urban differences for this group reflect sorting by skill and earnings rather than real wage differences for equally productive workers. This prediction allows us to reject several conventional parametric selection models. We estimate a semiparametric selection model that yields strong evidence that, for less educated white men, the apparent suburban earnings premium is due to sorting rather than labor market differences.

3 PERSON OR PLACE? Parametric and Semiparametric Estimates of Intrametropolitan Earnings Variation INTRODUCTION Is the relative demand for low-skill employment higher in the suburbs than in the city? If so, and if barriers to residential mobility and commuting restrict the suburban supply of low-skill workers, then low-skill suburban residents will earn more than comparable center-city residents. Identifying comparably skilled workers is the greatest challenge in empirically verifying this hypothesis. Many dimensions of skill are not observable, making possible the alternative conclusion that observed earnings differences represent differences in person rather than in place. Urban economists have long recognized that residential location is endogenous. Housing filtering, zoning restrictions, and the positive income elasticity of lot size imply that people with high earnings are more likely to choose a suburban residence. Therefore, a suburban earnings premium for workers of apparently equal skill may in fact reflect compensation for unobserved skills and the resulting preference for a suburban residence. Consistent estimation of the impact of location on earnings requires accounting for the endogeneity of residential location. The spatial labor market disequilibrium that some scholars posit for low-skill workers is not likely to be present for high-skill workers. The costs of commuting or relocating may create an effective barrier for many low-skill workers but are not likely to be influential in the labor market for high-skilled workers. We assume that any substantial earnings difference between high skilled suburban and city residents is due to residential choice rather than spatially segmented labor markets. We frame the estimation of spatial earnings differences in the presence of residential endogeneity as a sample selection problem and examine a variety of specifications. We estimate the suburban earnings premium using a sample of employed prime-age white men from Allegheny County (Pittsburgh). We find that conventional parametric corrections for sample selection bias are not

4 2 adequate to account for the suburban earnings premium of high-skilled workers. Therefore, we turn to a semiparametric sample selection estimation method. When combined with the assumption that residential choice accounts for the entire high-skill suburban premium, we reject the hypothesis of a suburban premium for low-skilled residents. Social science research provides many theoretical and empirical examinations of intrametropolitan spatial earnings differences. The monocentric urban model implies that wages will decrease as distance between the place of employment and the central business district (CBD) increases (e.g., Mills and Hamilton, 1994). However, decentralization of production over the past several decades may weaken or reverse this prediction. Ihlanfeldt (1992) does not find negative wage gradients for several low-skill occupations. This finding corresponds to anecdotal and ethnographic evidence of suburban low-skill labor shortages. Although the wage gradient is expressed in terms of place of employment, the spatial mismatch 1 literature makes the important connection to place of residence. Recent empirical work suggests that wage rates and employment rates are lower for youth who live far from areas of employment concentration or employment growth (Ihlanfeldt and Sjoquist, 1990; Raphael, 1995; O Regan and Quigley, 1995), thus providing additional support for the existence of spatial disequilibrium. 2 Although these empirical explorations are executed with great care, they may not fully account for endogenous location. Wage-gradient estimates based on place of work will be biased if there is a systematic relationship between distance from the CBD and skill requirements. For example, firms 1 The spatial mismatch hypothesis (Kain, 1968) attributes racial differences in labor market outcomes to housing market barriers. Black workers are constrained to live in the city, preventing access to a spatially decentralized pool of low-skill jobs. See Holzer, 1991; Kain, 1992; Ihlanfeldt, 1994; and Jencks and Mayer, 1990, for recent reviews of the empirical literature. 2 Alternatively, some have hypothesized that the demographic composition of the residential neighborhood has an important impact on labor market outcomes, especially for youth. See O Regan and Quigley, 1995; Case and Katz, 1991; and Borjas, 1992, for empirical examinations and Manski, 1993, for a critique of this hypothesis.

5 3 farther from the CBD may be newer and employ technology that requires expertise that is not measured by standard surveys. The spatial-mismatch hypothesis estimates have focused on youth to minimize the problem of endogenous location, but the possibility of unmeasured family characteristics that are correlated to location and labor market outcome is a recurring concern. 3 Although the preponderance of the empirical work in spatial earnings differences has focused on minority youth, we prefer to work with a sample of prime-age white men. We choose to focus on earnings, and implicitly on wages, leaving the analysis of spatial patterns of labor force participation and employment for future study. Excluding women and youth from the sample reduces the sample selection bias from the labor force participation and employment decisions. Studying spatial earnings patterns of white workers eliminates the possibility that spatial patterns of racial discrimination in the labor market are affecting the results. Although the existence of spatial earnings differences among prime-age white men is of interest in its own right, the results provide a baseline with which to compare spatial patterns for other demographic groups. Furthermore, if workers from all demographic groups compete for the same jobs, then spatial wage differences cannot persist for some groups but not others. Even if (spatially uniform) labor market discrimination leads to racial wage premia, labor market competition will equate spatial wage premia for all demographic groups. 4 We examine the Pittsburgh area because its economic and physical characteristics are conducive to spatial earnings differences. Like many metropolitan areas, economic growth in the 3 For example, Ihlanfeldt (1988) rejects endogenous location using a parametric selection model, but McMillen (1993) finds strong evidence of sample selection in a model which assumes residential choice is entirely driven by labor market differences. 4 Strazheim, in his frequently cited theoretical work (1980) examining racial differences in wage gradients, provides a model in which white workers have a negative wage gradient, but black workers have a positive wage gradient. However, this model rests on the premise of a very low elasticity of substitution between black and white workers. Such a premise does not have any theoretical or empirical support. See Engberg (1996) for a theoretical examination that permits greater substitution.

6 4 suburbs has greatly exceeded growth in the city, especially in industries and occupations with relatively low skill requirements. Furthermore, residential migration from the city to the suburbs has been hampered by a considerable migration out of the region, which has depressed housing prices in the central city. Commuting is notoriously difficult due to three major rivers and the ubiquitous hills. These features provide ideal conditions for a spatially segmented labor market. The observed 21 percent earnings suburban premium in the expanding suburbs relative to the city is consistent with this conjecture (Table 1). Finally, unlike many metropolitan areas, prime-age white men represent a very large portion of the labor supply at all skill levels in both the central city and the suburbs (Table 1). The analysis is complicated by differences in economic growth among the suburbs. We separate the older suburbs and former steel mill towns in the southeastern portion of the county from the expanding suburbs. This permits the comparison of earnings between the center city and expanding suburbs. However, when modeling the residential location choice, we include all three locations. This introduction is followed by a brief review of our methodological strategy. We then present a sequence of models of spatial earnings differences and the corresponding estimates. We conclude with some comments on the identifying assumptions of the final estimates and on the implications of our findings. IDENTIFYING PLACE EFFECTS Place effects in an earnings equation can be identified by controlling for differences in observed and unobserved average skill among locations. Consider an experiment in which individuals are randomly assigned to a residential location. Any resulting differences in average earnings would

7 5 TABLE 1 Average Weekly Earnings of Employed White Prime-Age Men (Allegheny County, Pennsylvania) Average Percentage of All Employed Location Weekly Earnings Prime-Age Men Who Are White City of Pittsburgh Expanding Suburbs Older Suburbs and Steel Towns Source: Authors tabulation of 1980 Public-Use Microsample of the U.S. Census. Notes: Average Weekly Earnings is a geometric mean. Population: Male workers between the ages of 25 and 55 who worked at least 35 hours per week, at least 40 weeks in 1979, and earned at least $100 per week.

8 6 be due to the treatment associated with residential location. In the absence of a controlled experiment, place (treatment) effects can be identified only by modeling the process that assigns individuals to each location (Manski, 1993). Consistent estimates of wage differences due to place effects can then be obtained by accounting for the endogeneity of location in the wage formation process. We formulate a general model in which residential location is determined by differences in potential wages and amenities in each location. We examine several parametric and semiparametric estimators based on decreasingly restrictive sets of assumptions. The goal is to find a model that confirms the maintained assumption that there are no place effects for highly educated workers. Consider the panels of Figure 1. Panel A provides a stylized representation of the apparent earnings levels and returns to skill for the expanding suburbs and the central city. Separate ordinary least squares (OLS) regressions of earnings on education and other human capital measures demonstrate that earnings are higher in the suburbs and that the gap increases with education. (Regression coefficient estimates are in the appendix.) These estimates would be consistent if residential location is exogenous. However, the large difference between the city and expanding suburbs for highly educated workers leads us to reject the hypothesis of exogenous location. Endogenous location causes these equations to be biased, both in level and slope. If an estimation method that accounts for endogenous location is consistent, it should reveal lines that converge for highly educated workers. Therefore, any method that does not remove the spatial gap for highly educated workers is not properly specified and should not be used to estimate spatial differences for unskilled workers. Unfortunately, none of the parametric estimators control for all the person effects among the highly educated workers. In order to obtain consistent estimates of the place effects for less educated workers, we use a method that provides consistent estimates of the location-specific earnings

9

10 8 regression slopes (but not the intercepts) under very unrestrictive assumptions (Ahn and Powell, 1993). We proceed by imposing the assumption of no place effects for highly educated workers, which allows us to calculate estimates of the place effects for less educated workers. If the resulting estimates look like Panel B in Figure 1, this will provide evidence of spatially segmented low-skill labor markets. But, if consistent estimates resemble Panel C, this will provide evidence that there are no place effects for low-skill workers. This work draws on recent research in econometrics and in many applied fields which examines the impact of modeling assumptions on the estimation of treatment effects in self-selected populations. Mroz (1987), Newey, Powell, and Walker (1990), and Ahn and Powell (1993) examine progressively unrestrictive versions of Heckman s (1974) model of women s labor supply. Robinson (1989) examines the union wage effect. Heckman and Hotz (1989) and Heckman, Ichimura, Smith, and Todd (1995) analyze the impact of job training programs. Angrist (1995) examines the returns to schooling. Manski, Sandefur, McLanahan, and Powers (1992) analyze the impact of family structure on educational attainment. All of these applications require an understanding of a selection process in order to estimate the treatment effect of interest. Wage Differences by Education Level Row 1 of Table 2 presents the average earnings difference between the city of Pittsburgh and its expanding suburbs by education level. Clearly, the overall earnings difference of 21 percent implied by Table 1 is not accounted for by differing levels of education in the two locations. The differences 5 increase with education level, with the exception of the most highly educated group. Workers with 16 years of education (bachelor s degree) earn almost a third more if they live in the suburbs than if they live in the city. The greater differences for the more highly educated workers are 5 Given the large sample size, these differences are estimated very precisely. For each category, the t-statistic is greater than 5.

11 9 TABLE 2 Location Earnings Differences: Suburban Premium by Education Category Years of Schooling a (1) Raw Earnings Difference b (2) Control for Exper, Edu, Marr c (3) Exogenous Location d (4) Para Error, Linear Choice e (5) Para Error, Nonlinear Choice f (6) Nonparametric Selection *** (bootstrap standard errors) (8.3) (4.9) (3.2) (1.8) Sample: City Expanding Suburbs Source: Authors tabulation of 1980 Public-Use Microsample of the U.S. Census. Notes: Regression coefficients for rows 3 6 are presented in the appendix. Rows 3 6: Table entries = X( s c ) where X is the mean by education category of the exogenous variables for the entire county; c and s are estimated coefficients using city and expanding suburbs workers, respectively. a Percentage difference between residents of the expanding suburbs and the city of Pittsburgh. b Table entries are coefficients on interactions of dummies for each education group with a dummy for the expanding suburbs in a regression that controls for experience, experience squared, years of education, and marital status. c Estimated from OLS regression (equation (1)). d Estimated from OLS regression that includes sample selection term defined in equations (4) and (5). e Estimated from OLS regression that includes sample selection term defined in equations (6) and (7). f Estimated from OLS regression on differences between individuals with similar predicted location choice probabilities. See equations (11) and (12). Sample: Male workers between the ages of 25 and 55 with 8 years of schooling or more who worked at least 35 hours per week, at least 40 weeks in 1979, and earned at least $100 per week.

12 10 consistent with the hypothesized greater mobility of these workers and the positive relationship between income and preference for suburban location. It is possible, however, that differences in observed (exogenous) characteristics other than education account for the earnings differences. Table 3 indicates that the average education, average age, and proportion married are higher in the expanding suburbs than in the city. These characteristics are usually associated with higher wages, and so might explain the observed earnings differences. These patterns are consistent with the theory of person effects: individuals with observed characteristics indicative of higher skills are more likely to live in the suburbs. Row 2 of Table 2 reports the earnings differences after controlling for age, education, and marital status. As expected from the patterns of age and marital differences across locations, the estimated earnings differences are lower after controlling for these factors. However, there remains an 6 unexplained earnings difference of over 15 percent for the highest education categories. This suggests that sorting on observables is not sufficient to explain earnings differences between locations. Modeling of the location choice process is necessary to capture the dependence of location on unobservable characteristics that are correlated with earnings. LOCATION CHOICE MODEL The location choice model is a variation on the Roy model (Roy, 1951; Heckman and Honore, 1990), which has been used extensively to model earnings for self-selected populations. The Roy model is based on the idea that individuals choose the alternative that maximizes their earnings or utility. Therefore, individuals who are observed in a particular alternative are not a random sample of the underlying population, but will have unobserved characteristics that systematically compensate for 6 Given the large sample size, these differences are estimated very precisely. For each category, the t-statistic is greater than 5.

13 11 TABLE 3 Average Characteristics by Location of Full-Time Prime-Age White Male Workers (Allegheny County, Pennsylvania) Average Percentage Location Education Average Age Married Sample Size City of Pittsburgh Expanding Suburbs Older Suburbs and Steel Towns Source: Authors tabulation of 1980 Public-Use Microsample of the U.S. Census. Notes: Population: Male workers between the ages of 25 and 55 who worked at least 35 hours per week, at least 40 weeks in 1979, and earned at least $100 per week.

14 12 levels of observed characteristics that predispose them against the chosen alternative. In the simple case with one observed characteristic, OLS regression using a selected sample will give a downward-biased estimate of the actual impact of the observed characteristic on the outcome. For example, the returns to education would be underestimated using either a sample composed only of high-income individuals or only of low-income individuals. In the current application, a set of equations describes potential earnings in each of three 7 locations. An additional set of equations governs the choice of residential location. The two parts of the model are linked in that potential earnings is one of the factors that determines location choice. * Potential (or latent) earnings of person i in location j, W ij, is given by the following equation: W ij X i j ij (1) Earnings depend on observable characteristics, X i, according to the coefficients j, which may vary by location. For example, if less educated workers get paid more in the suburbs than in the city, this implies greater returns to education in the city than in the expanding suburbs. In addition to education, X includes experience, experience squared, and an indicator of marital status. The error term i ij represents unobserved characteristics that affect earnings for person i in location j. It is assumed to be independent of X in the population. As discussed above, if comparisons of potential earnings affect i location choice, ij will not be uncorrelated with X within a location. i The utility, U, received from choosing a particular residential location is determined by the * ij value of earnings that can be attained and by the value of amenities in the location: U ij W ij j X i j v ij (2) 7 See McMillen, 1993, for an earlier application of the Roy model to spatial earnings differences.

15 13 The coefficient j is included to capture the idea of the Tiebout hypothesis (1956). The Tiebout hypothesis suggests that locations differ in the bundles of public goods provided to the residents. A location with a relatively large value of is more highly valued by high-income individuals. (The coefficient is assumed to be greater than or equal to zero in all locations.) Similarly, locations differ in their attractiveness to individuals of differing education, age, and marital status. These differences 8 are captured by variation among locations in the parameter. The error term v ij captures unobserved characteristics that have an impact on location-specific utility. As with the earnings error terms, v is ij assumed to be independent of X in the population. i The individual is assumed to choose the location that provides the highest utility. This choice is indicated by a categorical variable, d, which equals the index of the chosen location. Earnings are observed for only the chosen location, W. id The endogeneity of location arises because, in general, there is a correlation between the error term of the earnings equation and the explanatory variables in the earnings equation for individuals who choose a location; that is, E( X,d=j) varies with X. For example, if a location is valued by ij i i individuals with high earnings, then less educated individuals will chose that location only if they have unusually high unobserved characteristics that contribute to earnings in that location. The concepts of place and person effects can be expressed as variation across location in particular parameters of the model. As mentioned above, place effects are captured by differences among locations in the earnings equation coefficients. Person effects can arise from several sources. j Spatial variation in the utility value of earnings ( ) indicates that some locations will attract j high-earnings individuals and other locations will attract low-earnings individuals. Spatial variation in the wage error term ( ) for each person implies that individuals choosing each location have particular ij 8 We use the standard specification of experience (i.e., age-education-6) and its square in the wage equation, but use age and its square in the location-choice equation. The presentation ignores this slight difference in the explanatory variables.

16 skills that are demanded in that location. Correlation between the earnings and location error terms (E( 14 ijv ij)=0) will also lead to correlation between the error of the wage equation and the explanatory variables in the chosen location, although in this case earnings do not cause location. Exogenous Location Choice Substituting the location-specific earnings equation (1) into the location-specific utility equation (2) provides a reduced-form expression of the location specific utility: U ij X i ( j j j ) ij j v ij (3) This representation suggests two special cases in which there are no person effects, i.e., cases in which location is exogenous with respect to earnings. If earnings do not have an impact on location choice ( =0 for all j) then the composite error term ( +v ) from equation (3) is independent of the error j ij j ij term ij in the wage equation. Similarly, if neither the utility of earnings nor the earnings equation error term vary by location (i.e., = and j ij i = for all j), then the earnings equation error will not play a role in determining location. Note that in both cases the errors in the location choice equation (v ) must also ij be assumed independent of the earnings equation error terms ( ). ij Although these are very restrictive models, they are necessary to justify OLS estimation of location-specific earnings equations. Row 3 of Table 2 presents the place effects estimated under the assumption of exogenous location choice. (The notes to Table 2 provide the formula for place effects. The appendix contains the estimated regression coefficients.) The estimated place effects are increasing with education, ranging from 6.7 percent for the least educated to almost 18 percent for the most educated. The large estimated place effects for highly educated white men provide evidence against the hypothesis of exogenous location. 9 9 Of course, it is possible that the model is correct and the estimated place effect for the high-education groups is a consistent, but very imprecise, estimate of zero. This would not alter the

17 Linear Location Choice with Parametric Error Distributions A conventional approach to controlling for selection effects in a model with multiple 15 alternatives is to estimate the choice equation with multinomial logit (Lee, 1983). The MNL estimator is based on the assumption that the composite error term of the choice equation ( +v ) is ij j ij independently and identically distributed across individuals and locations, with a type I extreme-value distribution. This permits the conditional expectation of the location-specific earnings equation error term to be written as a function of the predicted probability of living in the chosen location: E( ij X i,d j) j 2 j (X i ) j 2 ( 1 [P j (X i )]) P j (X i ) (4) where: P j (X i ) e X i [ j j j ] J e X i [ k k k ] k 1 (5) The standard normal probability distribution function and cumulative distribution function are represented by and, respectively. The earnings equation error term ( ) is assumed to be normally ij 2 distributed with variance and to be independent of v ij. There are two drawbacks to this specification. First, it assumes a functional form for the explanatory variables in the location-choice equation. Although equation (2) assumes that location choice depends on the explanatory variables through a linear index (X i j ), there is little theory to justify this specification. However, we temporarily assume that the choice equation is linear in education, age, age squared, and marital status. Second, even if linearity in the explanatory variables is correct, the conclusion that this model is not useful for estimating place effects. Given the difficulty of calculating either analytic or bootstrap standard errors for these estimates, it does not seem worthwhile to investigate further whether it is a question of bias or precision.

18 16 transformations in equations (4) and (5) from the linear index (X i j ) to obtain the conditional expectation of the location-specific earnings error term rely on specific distributional forms for the wage and location-choice error terms ij and v. Again, there is no theoretical justification for these j forms. The estimated place effects of more than 100 percent reported in Row 4 of Table 2 suggest that these restrictions, although conventional, are not appropriate. Nonlinear Location Choice with Parametric Error Distributions The assumption that the explanatory variables enter the location choice index with a particular functional form (i.e., linearity) can be relaxed by estimating the location choice equation nonparametrically. By changing X to an unknown function g (X ), we allow for arbitrary i j j i nonlinearities, including interactions, of the explanatory variables. We estimate P j(x i), j=1,2, using 10 local linear regression. Figure 2 presents the predicted probabilities from MNL (left side) and local linear (right side) regression. For example, the local linear estimates capture the nonlinearity that arises from the tendency of single middle-aged men to live in the city. Not only do the nonlinearities captured by the local linear regression portray the choice process more accurately, they also help distinguish the impact of person from that of place. In the i.i.d. parametric model above, the person effects were captured by selection correction term j(x i). Variation in this term (independent of X ) depends entirely on the assumed distribution of the error i 10 We split the sample into ten cells defined by marital status and education category. For each observation, we estimate an OLS regression of location choice on age and years of education using all observations from the same cell whose age differs by no more than six years. (For observations whose age is less than six years from the minimum or maximum age, we use a regression centered seven years from the bound.) The predicted probability for each observation is calculated from their own regression. The bandwidth of +/- six years was chosen by leave-one-out cross validation (Hardle, 1990). Alternative nonparametric methods involve adding polynomials in X to the choice equation (Mroz, 1987) or kernal estimation (Ahn and Powell, 1993). As pointed out by Heckman et al. (1995), local linear regression has the advantage of retaining prediction precision in sparsely populated portions of the distribution of explanatory variables. (Also see Fan, 1993.) See Heckman et al. (1995) for an application of local linear regression in the second step of a two-step selection model.

19

20 18 terms. The nonlinear functions of X implicit in the local linear regression act as instrumental variables that provide additional independent variation in j(x i). Another possible source of independent variation in the selection correction term would be to include variables in equation (2) which affect location choice but are not correlated to the error term in the earnings equation. Unfortunately, variables that are likely to affect location choice, such as number of children, spouse s characteristics, and nonlabor income, are likely to be related to earnings. Therefore, we prefer to use nonlinearities for identification. Replacing the linear MNL probability estimate in equation (4) with the local linear choice probability estimates retains the functional form of the error distributions while relaxing the linearity assumption on the explanatory variables: E( ij X i, d j) j 2 j (X i ) j 2 ( 1 [P j (X i )]) P j (X i ) (6) where: P j (X i ) e X i j j g j (X i ) J e X i k k g k (X i ) k 1 (7) Row 5 of Table 2 presents the estimated place effects from this specification. Although they are considerably more reasonable than the previous row, the estimate of a 26 percent wage difference for the most educated group casts doubt on the assumed error distributions. Nonparametric Selection Unfortunately, removing the parametric distributional assumptions of the error terms does not yield tractable estimators of the form described above. The assumptions about the shape of the error distributions allow the expected value of the earnings error term to be expressed as a single known

21 19 function of the estimated location choice probability. However, we can remove these restrictive assumptions at the cost of using an estimator that expresses the expectation of the earnings error term as an unknown function of the location choice probability: E( ij X i,d j) K[P j (X i )] (8) A sufficient condition for substituting a function of the scaler P j(x i) for a function of the high * dimensional vector X is that the reduced-form choice equation (i.e., choosing the maximum U, as i defined by equation (3), over all 3 locations) can be expressed as a dichotomous choice between the chosen location and all other locations. If the dichotomous choice equation can be represented by a function of X i and an additive error term that is independent of X i, then no further restrictions are necessary regarding the functional form for the explanatory variables or the distribution of the error term. 11 We do not attempt to estimate the unknown function K(P ), but instead treat it as a nuisance j parameter. Following a method developed by Ahn and Powell (1993), the slope coefficients of the earnings equation can be estimated by OLS after differencing to eliminate K(P ). The differences are j taken pair-wise between individual a and individual b in location j who share similar values of the location-choice probability but differ in their values of X: 12 ij W aj W bj (X a X b ) j K[P j (X a )] K[P j (X b )] aj bj (9) 11 This is known as index sufficiency and, in the statistical literature, is closely related to the use of propensity scores. Heckman et al., 1995, contains an illuminating presentation of index sufficiency. Use of propensity scores is due to Rosenbaum and Rubin, Angrist, 1995, ties these literatures together and presents alternative conditions that permit this substitution. 12 Careful examination of the right side of Figure 1 indicates which pairs of values of X have the same predicted probability. For estimation purposes, all possible N*(N-1) pairs were weighted by a normal kernal evaluated at their probability difference. See Ahn and Powell (1993). Bandwidth for the kernal weight is selected by leave-one-out cross validation. Only pairs that are close in probability are left out and the chosen bandwidth minimizes the unweighted mean squared error of this subsample.

22 20 such that X a X b and P j (X a ) P j (X b ) (10) The new error term ( - ) is uncorrelated with X -X by construction. This provides consistent aj bj aj bj estimates of the returns to education, experience, and marital status in each of the locations, without placing any restrictions on the correlations or shapes of the wage and location-choice error terms. The cost of such weak assumptions is that we do not estimate an intercept for each equation it disappears in the differencing process. Without further assumptions, this method does not provide estimates of place effects. Therefore, we impose the assumption that we have been using as a measuring stick for all the previous estimates. We assume that there are no place effects for individuals in the highest education category. We assume that the observed earnings difference of 24.7 percent (Row 1, Table 2) is entirely due to person effects. Setting predicted wages equal between the city and expanding suburbs for this group allows us to estimate place effects for the remaining individuals in the sample. Row 6 of Table 2 provides these estimates. The estimated place effects for men with only a high school degree and with less than a high school degree are -5.4 percent and -5.8 percent, respectively. For men with some college, the place effect is -5.6 percent and for men with a bachelor s degree, the estimated effect is -2.9 percent. A 95 percent confidence interval of place effects for men with a high school degree ranges from percent to 4.0 percent. We conclude that white men with little education cannot substantially increase their earnings by moving from the central city to the expanding suburbs. Most, if not all, of the spatial variation in earnings for white men is due to sorting by earnings and characteristics that determine earnings. Our point estimates suggest that wages are actually lower in the suburbs than in the city, as

23 implied by a wage gradient in a monocentric city. However, the precision of our estimates does not allow us to reject the hypothesis that wages are the same in the city and the suburbs. 21 CONCLUSION With few exceptions, previous work on intrametropolitan earnings variation has attributed observed differences in earnings to place effects rather than to person effects. Our work is premised on the assumption that for highly educated white men, geographic labor market barriers are sufficiently weak that all place effects will be quickly arbitraged away. Based on this assumption, we reject several variations of conventional parametric selection models that attempt to control for sorting when estimating location-specific earnings equations. Our conclusion that place effects are minimal for white men of all education levels is derived from combining our fundamental assumption regarding labor markets for highly educated white men with estimates from a selection model that places no restrictions on the functional form of the location choice equation or the distributions of the earnings and locationchoice error terms. Although we have removed some functional form assumptions that we do not believe are imposed by theory, we wish to be explicit about the remaining assumptions that are identifying the model. An earnings equation functional form assumption is necessary for identification. The arbitrary nonlinearities of the nonparametric location choice estimator act as instrumental variables that are only useful when they are excluded from the wage equation. (See Ahn and Powell, 1993, for a further discussion.) Fortunately, the familiar log linear form for the wage equation has decades of theoretical and empirical support. Indeed, since Mincer s (1974) justification of a linear relationship between log wages and education, experience, and experience squared, this relationship has been the mainstay of labor econometrics. Our assumption that marriage is exogenous, allowing its inclusion in the wage

24 equation, is based on the work by Korenman and Neumark (1991). We exclude other frequently included variables such as industry, occupation, and number of children, because they are likely to affected by earnings and/or residential location, making them endogenous. 22 Although we have removed the functional form assumptions from the index and error distribution of the location choice equation, several assumptions remain. First, we assume that earnings enter the location utility (equation (2)) linearly with a fixed (i.e., nonrandom) coefficient. This j implies that an increase in earnings will have the same impact on location choice for any individual. We also assume that the distribution of the location error term (v ) does not depend on the explanatory ij variables. These assumptions permit the conditional expectation of the earnings error term to be expressed as an arbitrary function of the choice probability only, rather than as an arbitrary function of all the explanatory variables. The maintained assumption that there are no place effects for high-skilled workers is necessary for identification after assumptions regarding the distributional forms of the error terms are eliminated. Without parametric distributions, we cannot calculate the conditional expectation of the earnings error term and must treat it as a nuisance parameter. After differencing to eliminate the conditional expectation of the earnings error term, we cannot estimate the level of the earnings equations without additional assumptions. Our assumption that earnings are the same for high-skilled suburban and city residents identifies the relative levels of the two earnings equations. The determination of whether place effects exist for low-skilled workers has important public policy ramifications. Current microeconomic policy is based on the assumption of both person and place effects. Education and job training subsidies are aimed at remediating skill deficiencies, while economic development tax credits and empowerment zones are targeted at restoring job growth to inner cities. The proper mix of labor market policies depends on the relative strength of the person and place effects. Our conclusion implies that subsidizing job-creating investments in the central city will not

25 23 substantially raise earnings among inner-city residents. It should be noted, however, that there may be other benefits from such policies which our analysis of earnings does not address. Although we have focused on white men, our conclusions have implications for central-city black workers who are the traditional subject of spatial mismatch research. If low-educated black and white workers compete for the same jobs, our findings that wages are arbitraged between the expanding suburbs and central city imply that spatial mismatch is not an explanation for the observed black/white earnings gap. In order to explain the black/white earnings gap, we must turn to other factors, such as racial discrimination in the labor market and in the processes that determine observed and unobserved skill levels.

26 24

27 25 APPENDIX Regressions for Table 2 Regression for Row 3: Exogenous Location Dependent Variable: Log Weekly Earnings City Expanding Suburbs Coefficient Standard Error Coefficient Standard Error Experience Exp / Education Married Constant S R N 1,767 4,811 Multinomial Logit Used to Create Selection Correction Term for Row 4 City is the Omitted Category Expanding Suburbs Steel Towns and Old Suburbs Coefficient Standard Error z Coefficient Standard Error z Age Age / Education Married Constant Log Likelihood = Number of Observations: 8,732

28 Regression for Row 4: Linear Location Choice and Parametric Error Distributions Dependent Variable: Log Weekly Earnings 26 City Expanding Suburbs Coefficient Standard Error Coefficient Standard Error Experience Exp / Education Married Constant S R N 1,767 4,811 Regression for Row 5: Nonlinear Location Choice and Parametric Error Distributions Dependent Variable: Log Weekly Earnings City Expanding Suburbs Coefficient Standard Error Coefficient Standard Error Experience Exp / Education Married Constant S R N 1,767 4,811

29 27 Regression for Row 6: Nonparametric Selection Dependent Variable: Log Weekly Earnings City Expanding Suburbs Coefficient Standard Error Coefficient Standard Error Experience Exp / Education Married N 1,767 4,811 Note: Standard errors for regression with nonparametric selection correction are calculated by bootstrapping.

30 28

31 29 References Ahn, H., and J. Powell Semiparametric Estimation of Censored Selection Models with a Nonparametric Selection Mechanism. Journal of Econometrics 58: Angrist, J Conditioning on the Probability of Selection to Control Selection Bias. Mimeo, MIT. Borjas, G Ethnicity, Neighborhoods, and Human Capital Externalities. National Bureau of Economic Research Working Paper no Case, A., and L. Katz The Company You Keep: The Effects of Family and Neighborhood on Disadvantaged Youths. National Bureau of Economic Research Working Paper no Engberg, J A Spatial Labor Market Equilibrium with Housing Market Discrimination. Mimeo, Carnegie Mellon University. Fan, J Local Linear Regression Smoothers and Their Minimax Efficiencies. Annals of Statistics 21: Hardle, W Applied Nonparametric Regression. Cambridge: Cambridge University Press. Heckman, J Shadow Prices, Market Wages, and Labor Supply. Econometrica 42: Heckman, J., and B. Honore The Empirical Content of the Roy Model. Econometrica 58 (September): Heckman, J., and V. J. Hotz Choosing among Alternative Nonexperimental Methods for Estimating the Impact of Social Programs: The Case of Manpower Training. Journal of the American Statistical Association 84: Heckman, J., H. Ichimura, J. Smith, and P. Todd Non-Parametric Characterization of Selection Bias Using Experimental Data: A Study of Adult Males in JTPA. Mimeo, University of Chicago.

32 30 Holzer, H The Spatial Mismatch Hypothesis: What Has the Evidence Shown? Urban Studies 28: Ihlanfeldt, K Intra-Metropolitan Variation in Earnings and Labor Market Discrimination: An Econometric Analysis of the Atlanta Labor Market. Southern Economic Journal 50: Ihlanfeldt, K Intraurban Wage Gradients: Evidence by Race, Gender, Occupational Class, and Sector. Journal of Urban Economics 32: Ihlanfeldt, K The Spatial Mismatch between Jobs and Residential Locations within Urban Areas. Cityscape 1: Ihlanfeldt, K., and D. Sjoquist Job Accessibility and Racial Differences in Youth Employment Rates. American Economic Review 80: Jencks, C., and S. Mayer Residential Segregation, Job Proximity, and Black Job Opportunities. In Inner-City Poverty in the United States, edited by L. Lynn, Jr., and M. McGeary. Washington, D.C.: National Academy Press. Kain, J Housing Segregation, Negro Employment, and Metropolitan Decentralization. Quarterly Journal of Economics 2: Kain, J The Spatial Mismatch Hypothesis: Three Decades Later. Housing Policy Debate 3. Korenman, S., and D. Neumark Does Marriage Really Make Men More Productive? Journal of Human Resources 26: Lee, L-F Generalized Econometric Models with Selectivity. Econometrica 51: Manski, C Identification Problems in the Social Sciences. Sociological Methodology 23: Manski, C., G. Sandefur, S. McLanahan, and D. Powers Alternative Estimates of the Effect of Family Structure during Adolescence on High School Graduation. Journal of the American Statistical Association 87:

33 31 McMillen, D Can Blacks Earn More in the Suburbs? Racial Differences in Intra-metropolitan Earnings Variation. Journal of Urban Economics 33: Mills, E., and B. Hamilton Urban Economics, 5th ed. New York: Harper Collins. Mincer, J Schooling, Experience, and Earnings. New York: National Bureau of Economic Research. Mroz, T The Sensitivity of an Empirical Model of Married Women s Hours of Work to Economic and Statistical Assumptions. Econometrica 55: Newey, W., J. Powell, and J. Walker Semiparametric Estimation of Selection Models: Some Empirical Results. American Economic Review Papers and Proceedings 80: O Regan, K., and J. Quigley Spatial Contributions to Employment Outcomes: The Case of New Jersey Teenagers. Mimeo, Yale University. Raphael, S The Spatial Mismatch Hypothesis of Black Youth Unemployment: Evidence from the San Francisco Bay Area. Mimeo, University of California at Berkeley. Robinson, C The Joint Determination of Union Status and Union Wage Effects: Some Tests of Alternative Models. Journal of Political Economy 97: Rosenbaum, P. R., and D. B. Rubin The Central Role of the Propensity Score in Observational Studies for Causal Effects. Biometrika 70: Roy, A. D Some Thoughts on the Distribution of Earnings. Oxford Economic Papers 3: Strazheim, M Discrimination and the Spatial Characteristics of the Urban Labor Market for Black Workers. Journal of Urban Economics 7: Tiebout, C A Pure Theory of Local Expenditures. Journal of Political Economy 64:

Abstract Teenage Employment and the Spatial Isolation of Minority and Poverty Households Using micro data from the US Census, this paper tests the imp

Abstract Teenage Employment and the Spatial Isolation of Minority and Poverty Households Using micro data from the US Census, this paper tests the imp Teenage Employment and the Spatial Isolation of Minority and Poverty Households by Katherine M. O'Regan Yale School of Management and John M. Quigley University of California Berkeley I II III IV V Introduction

More information

A Note on Commutes and the Spatial Mismatch Hypothesis

A Note on Commutes and the Spatial Mismatch Hypothesis Upjohn Institute Working Papers Upjohn Research home page 2000 A Note on Commutes and the Spatial Mismatch Hypothesis Kelly DeRango W.E. Upjohn Institute Upjohn Institute Working Paper No. 00-59 Citation

More information

Urban Studies June 1998 v35 n7 p1187(19) Page 1

Urban Studies June 1998 v35 n7 p1187(19) Page 1 Urban Studies June 1998 v35 n7 p1187(19) Page 1 by Katherine M. O Regan and John M. Quigley This paper synthesises a series of empirical analyses investigating the role of urban space in affecting minority

More information

More on Roy Model of Self-Selection

More on Roy Model of Self-Selection V. J. Hotz Rev. May 26, 2007 More on Roy Model of Self-Selection Results drawn on Heckman and Sedlacek JPE, 1985 and Heckman and Honoré, Econometrica, 1986. Two-sector model in which: Agents are income

More information

Does city structure cause unemployment?

Does city structure cause unemployment? The World Bank Urban Research Symposium, December 15-17, 2003 Does city structure cause unemployment? The case study of Cape Town Presented by Harris Selod (INRA and CREST, France) Co-authored with Sandrine

More information

Explaining Racial/Ethnic Gaps in Spatial Mismatch: The Primacy of Racial Segregation

Explaining Racial/Ethnic Gaps in Spatial Mismatch: The Primacy of Racial Segregation Institute for Research on Poverty Discussion Paper no. 1379-10 Explaining Racial/Ethnic Gaps in Spatial Mismatch: The Primacy of Racial Segregation Michael A. Stoll School of Public Affairs University

More information

Urban Revival in America

Urban Revival in America Urban Revival in America Victor Couture 1 Jessie Handbury 2 1 University of California, Berkeley 2 University of Pennsylvania and NBER May 2016 1 / 23 Objectives 1. Document the recent revival of America

More information

A Note on Adapting Propensity Score Matching and Selection Models to Choice Based Samples

A Note on Adapting Propensity Score Matching and Selection Models to Choice Based Samples DISCUSSION PAPER SERIES IZA DP No. 4304 A Note on Adapting Propensity Score Matching and Selection Models to Choice Based Samples James J. Heckman Petra E. Todd July 2009 Forschungsinstitut zur Zukunft

More information

Long Commutes or Neighborhood Perceptions: Why Do Employers Avoid Applicants from High-Poverty Neighborhoods?

Long Commutes or Neighborhood Perceptions: Why Do Employers Avoid Applicants from High-Poverty Neighborhoods? Long Commutes or Neighborhood Perceptions: Why Do Employers Avoid Applicants from High-Poverty Neighborhoods? David C. Phillips Assistant Professor of Economics Hope College Abstract Why do employers discriminate

More information

Econometrics (60 points) as the multivariate regression of Y on X 1 and X 2? [6 points]

Econometrics (60 points) as the multivariate regression of Y on X 1 and X 2? [6 points] Econometrics (60 points) Question 7: Short Answers (30 points) Answer parts 1-6 with a brief explanation. 1. Suppose the model of interest is Y i = 0 + 1 X 1i + 2 X 2i + u i, where E(u X)=0 and E(u 2 X)=

More information

Explaining Racial/Ethnic Gaps In Spatial Mismatch: The Primacy of Racial Segregation

Explaining Racial/Ethnic Gaps In Spatial Mismatch: The Primacy of Racial Segregation National Poverty Center Working Paper Series #10-02 February 2010 Explaining Racial/Ethnic Gaps In Spatial Mismatch: The Primacy of Racial Segregation Michael A. Stoll, UCLA and Kenya L. Covington, California

More information

Wooldridge, Introductory Econometrics, 3d ed. Chapter 16: Simultaneous equations models. An obvious reason for the endogeneity of explanatory

Wooldridge, Introductory Econometrics, 3d ed. Chapter 16: Simultaneous equations models. An obvious reason for the endogeneity of explanatory Wooldridge, Introductory Econometrics, 3d ed. Chapter 16: Simultaneous equations models An obvious reason for the endogeneity of explanatory variables in a regression model is simultaneity: that is, one

More information

Applied Microeconometrics (L5): Panel Data-Basics

Applied Microeconometrics (L5): Panel Data-Basics Applied Microeconometrics (L5): Panel Data-Basics Nicholas Giannakopoulos University of Patras Department of Economics ngias@upatras.gr November 10, 2015 Nicholas Giannakopoulos (UPatras) MSc Applied Economics

More information

NBER WORKING PAPER SERIES A NOTE ON ADAPTING PROPENSITY SCORE MATCHING AND SELECTION MODELS TO CHOICE BASED SAMPLES. James J. Heckman Petra E.

NBER WORKING PAPER SERIES A NOTE ON ADAPTING PROPENSITY SCORE MATCHING AND SELECTION MODELS TO CHOICE BASED SAMPLES. James J. Heckman Petra E. NBER WORKING PAPER SERIES A NOTE ON ADAPTING PROPENSITY SCORE MATCHING AND SELECTION MODELS TO CHOICE BASED SAMPLES James J. Heckman Petra E. Todd Working Paper 15179 http://www.nber.org/papers/w15179

More information

A Meta-Analysis of the Urban Wage Premium

A Meta-Analysis of the Urban Wage Premium A Meta-Analysis of the Urban Wage Premium Ayoung Kim Dept. of Agricultural Economics, Purdue University kim1426@purdue.edu November 21, 2014 SHaPE seminar 2014 November 21, 2014 1 / 16 Urban Wage Premium

More information

PROGRAM ON HOUSING AND URBAN POLICY

PROGRAM ON HOUSING AND URBAN POLICY Institute of Business and Economic Research Fisher Center for Real Estate and Urban Economics PROGRAM ON HOUSING AND URBAN POLICY WORKING PAPER SERIES WORKING PAPER NO. W96-002 SPATIAL EFFECTS UPON UEMPLOYMENT

More information

Working Paper UCTC No. 290

Working Paper UCTC No. 290 Teenage Employment and the Spatial Isolation of Minority and Poverty Households Katherine M. O Regan John M. Quigley Working Paper UCTC No. 290 The Umversity of California Transportation Center University

More information

PhD/MA Econometrics Examination January 2012 PART A

PhD/MA Econometrics Examination January 2012 PART A PhD/MA Econometrics Examination January 2012 PART A ANSWER ANY TWO QUESTIONS IN THIS SECTION NOTE: (1) The indicator function has the properties: (2) Question 1 Let, [defined as if using the indicator

More information

Syllabus. By Joan Llull. Microeconometrics. IDEA PhD Program. Fall Chapter 1: Introduction and a Brief Review of Relevant Tools

Syllabus. By Joan Llull. Microeconometrics. IDEA PhD Program. Fall Chapter 1: Introduction and a Brief Review of Relevant Tools Syllabus By Joan Llull Microeconometrics. IDEA PhD Program. Fall 2017 Chapter 1: Introduction and a Brief Review of Relevant Tools I. Overview II. Maximum Likelihood A. The Likelihood Principle B. The

More information

GROWING APART: THE CHANGING FIRM-SIZE WAGE PREMIUM AND ITS INEQUALITY CONSEQUENCES ONLINE APPENDIX

GROWING APART: THE CHANGING FIRM-SIZE WAGE PREMIUM AND ITS INEQUALITY CONSEQUENCES ONLINE APPENDIX GROWING APART: THE CHANGING FIRM-SIZE WAGE PREMIUM AND ITS INEQUALITY CONSEQUENCES ONLINE APPENDIX The following document is the online appendix for the paper, Growing Apart: The Changing Firm-Size Wage

More information

Econ 673: Microeconometrics Chapter 12: Estimating Treatment Effects. The Problem

Econ 673: Microeconometrics Chapter 12: Estimating Treatment Effects. The Problem Econ 673: Microeconometrics Chapter 12: Estimating Treatment Effects The Problem Analysts are frequently interested in measuring the impact of a treatment on individual behavior; e.g., the impact of job

More information

217 2013 8 1002-2031 2013 08-0095 - 06 F293. 3 B 6500-7500 / 39% 26% 1. 5 2 3 1 10% 60% 2007 1977 70973065 /71273154 2013-01 - 11 2013-03 - 14 95 2013 8 5 Oates 70% 54 4 residential sorting 6 7-8 9-11

More information

Course Description. Course Requirements

Course Description. Course Requirements University of Pennsylvania Spring 2007 Econ 721: Advanced Microeconometrics Petra Todd Course Description Lecture: 9:00-10:20 Tuesdays and Thursdays Office Hours: 10am-12 Fridays or by appointment. To

More information

ECON 482 / WH Hong Binary or Dummy Variables 1. Qualitative Information

ECON 482 / WH Hong Binary or Dummy Variables 1. Qualitative Information 1. Qualitative Information Qualitative Information Up to now, we assume that all the variables has quantitative meaning. But often in empirical work, we must incorporate qualitative factor into regression

More information

Johns Hopkins University Fall APPLIED ECONOMICS Regional Economics

Johns Hopkins University Fall APPLIED ECONOMICS Regional Economics Johns Hopkins University Fall 2017 Applied Economics Sally Kwak APPLIED ECONOMICS 440.666 Regional Economics In this course, we will develop a coherent framework of theories and models in the field of

More information

Econometrics in a nutshell: Variation and Identification Linear Regression Model in STATA. Research Methods. Carlos Noton.

Econometrics in a nutshell: Variation and Identification Linear Regression Model in STATA. Research Methods. Carlos Noton. 1/17 Research Methods Carlos Noton Term 2-2012 Outline 2/17 1 Econometrics in a nutshell: Variation and Identification 2 Main Assumptions 3/17 Dependent variable or outcome Y is the result of two forces:

More information

Making sense of Econometrics: Basics

Making sense of Econometrics: Basics Making sense of Econometrics: Basics Lecture 4: Qualitative influences and Heteroskedasticity Egypt Scholars Economic Society November 1, 2014 Assignment & feedback enter classroom at http://b.socrative.com/login/student/

More information

CRP 608 Winter 10 Class presentation February 04, Senior Research Associate Kirwan Institute for the Study of Race and Ethnicity

CRP 608 Winter 10 Class presentation February 04, Senior Research Associate Kirwan Institute for the Study of Race and Ethnicity CRP 608 Winter 10 Class presentation February 04, 2010 SAMIR GAMBHIR SAMIR GAMBHIR Senior Research Associate Kirwan Institute for the Study of Race and Ethnicity Background Kirwan Institute Our work Using

More information

ECON Interactions and Dummies

ECON Interactions and Dummies ECON 351 - Interactions and Dummies Maggie Jones 1 / 25 Readings Chapter 6: Section on Models with Interaction Terms Chapter 7: Full Chapter 2 / 25 Interaction Terms with Continuous Variables In some regressions

More information

MOBILITY AND THE RETURN TO EDUCATION: TESTING A ROY MODEL WITH MULTIPLE MARKETS. By Gordon B. Dahl 1

MOBILITY AND THE RETURN TO EDUCATION: TESTING A ROY MODEL WITH MULTIPLE MARKETS. By Gordon B. Dahl 1 Econometrica, Vol. 70, No. 6 (November, 2002), 2367 2420 MOBILITY AND THE RETURN TO EDUCATION: TESTING A ROY MODEL WITH MULTIPLE MARKETS By Gordon B. Dahl 1 Self-selected migration presents one potential

More information

1. Basic Model of Labor Supply

1. Basic Model of Labor Supply Static Labor Supply. Basic Model of Labor Supply.. Basic Model In this model, the economic unit is a family. Each faimily maximizes U (L, L 2,.., L m, C, C 2,.., C n ) s.t. V + w i ( L i ) p j C j, C j

More information

Ch 7: Dummy (binary, indicator) variables

Ch 7: Dummy (binary, indicator) variables Ch 7: Dummy (binary, indicator) variables :Examples Dummy variable are used to indicate the presence or absence of a characteristic. For example, define female i 1 if obs i is female 0 otherwise or male

More information

Using Matching, Instrumental Variables and Control Functions to Estimate Economic Choice Models

Using Matching, Instrumental Variables and Control Functions to Estimate Economic Choice Models Using Matching, Instrumental Variables and Control Functions to Estimate Economic Choice Models James J. Heckman and Salvador Navarro The University of Chicago Review of Economics and Statistics 86(1)

More information

Econometrics I. Professor William Greene Stern School of Business Department of Economics 1-1/40. Part 1: Introduction

Econometrics I. Professor William Greene Stern School of Business Department of Economics 1-1/40. Part 1: Introduction Econometrics I Professor William Greene Stern School of Business Department of Economics 1-1/40 http://people.stern.nyu.edu/wgreene/econometrics/econometrics.htm 1-2/40 Overview: This is an intermediate

More information

Job Sprawl, Spatial Mismatch, and Black Employment Disadvantage

Job Sprawl, Spatial Mismatch, and Black Employment Disadvantage Institute for Research on Poverty Discussion Paper no. 1304-05 Job Sprawl, Spatial Mismatch, and Black Employment Disadvantage Michael A. Stoll Department of Public Policy UCLA School of Public Affairs

More information

Chapter 1 Introduction. What are longitudinal and panel data? Benefits and drawbacks of longitudinal data Longitudinal data models Historical notes

Chapter 1 Introduction. What are longitudinal and panel data? Benefits and drawbacks of longitudinal data Longitudinal data models Historical notes Chapter 1 Introduction What are longitudinal and panel data? Benefits and drawbacks of longitudinal data Longitudinal data models Historical notes 1.1 What are longitudinal and panel data? With regression

More information

Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies

Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies Kosuke Imai Department of Politics Princeton University November 13, 2013 So far, we have essentially assumed

More information

Sources of Inequality: Additive Decomposition of the Gini Coefficient.

Sources of Inequality: Additive Decomposition of the Gini Coefficient. Sources of Inequality: Additive Decomposition of the Gini Coefficient. Carlos Hurtado Econometrics Seminar Department of Economics University of Illinois at Urbana-Champaign hrtdmrt2@illinois.edu Feb 24th,

More information

A nonparametric test for path dependence in discrete panel data

A nonparametric test for path dependence in discrete panel data A nonparametric test for path dependence in discrete panel data Maximilian Kasy Department of Economics, University of California - Los Angeles, 8283 Bunche Hall, Mail Stop: 147703, Los Angeles, CA 90095,

More information

2) For a normal distribution, the skewness and kurtosis measures are as follows: A) 1.96 and 4 B) 1 and 2 C) 0 and 3 D) 0 and 0

2) For a normal distribution, the skewness and kurtosis measures are as follows: A) 1.96 and 4 B) 1 and 2 C) 0 and 3 D) 0 and 0 Introduction to Econometrics Midterm April 26, 2011 Name Student ID MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. (5,000 credit for each correct

More information

Flexible Estimation of Treatment Effect Parameters

Flexible Estimation of Treatment Effect Parameters Flexible Estimation of Treatment Effect Parameters Thomas MaCurdy a and Xiaohong Chen b and Han Hong c Introduction Many empirical studies of program evaluations are complicated by the presence of both

More information

Comparative Advantage and Schooling

Comparative Advantage and Schooling Comparative Advantage and Schooling Pedro Carneiro University College London, Institute for Fiscal Studies and IZA Sokbae Lee University College London and Institute for Fiscal Studies June 7, 2004 Abstract

More information

Chapter 9. Dummy (Binary) Variables. 9.1 Introduction The multiple regression model (9.1.1) Assumption MR1 is

Chapter 9. Dummy (Binary) Variables. 9.1 Introduction The multiple regression model (9.1.1) Assumption MR1 is Chapter 9 Dummy (Binary) Variables 9.1 Introduction The multiple regression model y = β+β x +β x + +β x + e (9.1.1) t 1 2 t2 3 t3 K tk t Assumption MR1 is 1. yt =β 1+β 2xt2 + L+β KxtK + et, t = 1, K, T

More information

Wasteful Commuting: A Re-examination

Wasteful Commuting: A Re-examination JOURNAL OF URBAN ECONOMICS 29,2-13 (1991) Wasteful Commuting: A Re-examination MAUREEN L. CROPPER Department of Economics, University of Maryland, College Park, Maryland 20742, and Resources for the Future,

More information

Econometrics I Lecture 7: Dummy Variables

Econometrics I Lecture 7: Dummy Variables Econometrics I Lecture 7: Dummy Variables Mohammad Vesal Graduate School of Management and Economics Sharif University of Technology 44716 Fall 1397 1 / 27 Introduction Dummy variable: d i is a dummy variable

More information

Population Profiles

Population Profiles U N D E R S T A N D I N G A N D E X P L O R I N G D E M O G R A P H I C C H A N G E MAPPING AMERICA S FUTURES, BRIEF 6 2000 2010 Population Profiles Atlanta, Las Vegas, Washington, DC, and Youngstown Allison

More information

Do the Causes of Poverty Vary by Neighborhood Type?

Do the Causes of Poverty Vary by Neighborhood Type? Do the Causes of Poverty Vary by Neighborhood Type? Suburbs and the 2010 Census Conference Uday Kandula 1 and Brian Mikelbank 2 1 Ph.D. Candidate, Maxine Levin College of Urban Affairs Cleveland State

More information

Microeconometrics. C. Hsiao (2014), Analysis of Panel Data, 3rd edition. Cambridge, University Press.

Microeconometrics. C. Hsiao (2014), Analysis of Panel Data, 3rd edition. Cambridge, University Press. Cheng Hsiao Microeconometrics Required Text: C. Hsiao (2014), Analysis of Panel Data, 3rd edition. Cambridge, University Press. A.C. Cameron and P.K. Trivedi (2005), Microeconometrics, Cambridge University

More information

New Developments in Econometrics Lecture 11: Difference-in-Differences Estimation

New Developments in Econometrics Lecture 11: Difference-in-Differences Estimation New Developments in Econometrics Lecture 11: Difference-in-Differences Estimation Jeff Wooldridge Cemmap Lectures, UCL, June 2009 1. The Basic Methodology 2. How Should We View Uncertainty in DD Settings?

More information

Quantitative Economics for the Evaluation of the European Policy

Quantitative Economics for the Evaluation of the European Policy Quantitative Economics for the Evaluation of the European Policy Dipartimento di Economia e Management Irene Brunetti Davide Fiaschi Angela Parenti 1 25th of September, 2017 1 ireneb@ec.unipi.it, davide.fiaschi@unipi.it,

More information

Lecture 9. Matthew Osborne

Lecture 9. Matthew Osborne Lecture 9 Matthew Osborne 22 September 2006 Potential Outcome Model Try to replicate experimental data. Social Experiment: controlled experiment. Caveat: usually very expensive. Natural Experiment: observe

More information

WISE International Masters

WISE International Masters WISE International Masters ECONOMETRICS Instructor: Brett Graham INSTRUCTIONS TO STUDENTS 1 The time allowed for this examination paper is 2 hours. 2 This examination paper contains 32 questions. You are

More information

Rising Wage Inequality and the Effectiveness of Tuition Subsidy Policies:

Rising Wage Inequality and the Effectiveness of Tuition Subsidy Policies: Rising Wage Inequality and the Effectiveness of Tuition Subsidy Policies: Explorations with a Dynamic General Equilibrium Model of Labor Earnings based on Heckman, Lochner and Taber, Review of Economic

More information

The impact of residential density on vehicle usage and fuel consumption*

The impact of residential density on vehicle usage and fuel consumption* The impact of residential density on vehicle usage and fuel consumption* Jinwon Kim and David Brownstone Dept. of Economics 3151 SSPA University of California Irvine, CA 92697-5100 Email: dbrownst@uci.edu

More information

Empirical approaches in public economics

Empirical approaches in public economics Empirical approaches in public economics ECON4624 Empirical Public Economics Fall 2016 Gaute Torsvik Outline for today The canonical problem Basic concepts of causal inference Randomized experiments Non-experimental

More information

Can Public Transport Infrastructure Relieve Spatial Mismatch?

Can Public Transport Infrastructure Relieve Spatial Mismatch? Can Public Transport Infrastructure Relieve Spatial Mismatch? Evidence from Recent Light Rail Extensions Kilian Heilmann University of California San Diego April 20, 2015 Motivation Paradox: Even though

More information

During the latter half of the twentieth. Modest Progress:

During the latter half of the twentieth. Modest Progress: Center on Urban & Metropolitan Policy Modest Progress: The Narrowing Spatial Mismatch Between Blacks and Jobs in the 1990s Steven Raphael, University of California, Berkeley Michael A. Stoll, University

More information

STOCKHOLM UNIVERSITY Department of Economics Course name: Empirical Methods Course code: EC40 Examiner: Per Pettersson-Lidbom Number of creds: 7,5 creds Date of exam: Thursday, January 15, 009 Examination

More information

A Course in Applied Econometrics Lecture 14: Control Functions and Related Methods. Jeff Wooldridge IRP Lectures, UW Madison, August 2008

A Course in Applied Econometrics Lecture 14: Control Functions and Related Methods. Jeff Wooldridge IRP Lectures, UW Madison, August 2008 A Course in Applied Econometrics Lecture 14: Control Functions and Related Methods Jeff Wooldridge IRP Lectures, UW Madison, August 2008 1. Linear-in-Parameters Models: IV versus Control Functions 2. Correlated

More information

R E SEARCH HIGHLIGHTS

R E SEARCH HIGHLIGHTS Canada Research Chair in Urban Change and Adaptation R E SEARCH HIGHLIGHTS Research Highlight No.8 November 2006 THE IMPACT OF ECONOMIC RESTRUCTURING ON INNER CITY WINNIPEG Introduction This research highlight

More information

The relationship between treatment parameters within a latent variable framework

The relationship between treatment parameters within a latent variable framework Economics Letters 66 (2000) 33 39 www.elsevier.com/ locate/ econbase The relationship between treatment parameters within a latent variable framework James J. Heckman *,1, Edward J. Vytlacil 2 Department

More information

Links between socio-economic and ethnic segregation at different spatial scales: a comparison between The Netherlands and Belgium

Links between socio-economic and ethnic segregation at different spatial scales: a comparison between The Netherlands and Belgium Links between socio-economic and ethnic segregation at different spatial scales: a comparison between The Netherlands and Belgium Bart Sleutjes₁ & Rafael Costa₂ ₁ Netherlands Interdisciplinary Demographic

More information

Introduction to Linear Regression Analysis

Introduction to Linear Regression Analysis Introduction to Linear Regression Analysis Samuel Nocito Lecture 1 March 2nd, 2018 Econometrics: What is it? Interaction of economic theory, observed data and statistical methods. The science of testing

More information

Environmental Analysis, Chapter 4 Consequences, and Mitigation

Environmental Analysis, Chapter 4 Consequences, and Mitigation Environmental Analysis, Chapter 4 4.17 Environmental Justice This section summarizes the potential impacts described in Chapter 3, Transportation Impacts and Mitigation, and other sections of Chapter 4,

More information

STOCKHOLM UNIVERSITY Department of Economics Course name: Empirical Methods Course code: EC40 Examiner: Lena Nekby Number of credits: 7,5 credits Date of exam: Friday, June 5, 009 Examination time: 3 hours

More information

1Department of Demography and Organization Studies, University of Texas at San Antonio, One UTSA Circle, San Antonio, TX

1Department of Demography and Organization Studies, University of Texas at San Antonio, One UTSA Circle, San Antonio, TX Well, it depends on where you're born: A practical application of geographically weighted regression to the study of infant mortality in the U.S. P. Johnelle Sparks and Corey S. Sparks 1 Introduction Infant

More information

Neighborhood social characteristics and chronic disease outcomes: does the geographic scale of neighborhood matter? Malia Jones

Neighborhood social characteristics and chronic disease outcomes: does the geographic scale of neighborhood matter? Malia Jones Neighborhood social characteristics and chronic disease outcomes: does the geographic scale of neighborhood matter? Malia Jones Prepared for consideration for PAA 2013 Short Abstract Empirical research

More information

Chapter 9: The Regression Model with Qualitative Information: Binary Variables (Dummies)

Chapter 9: The Regression Model with Qualitative Information: Binary Variables (Dummies) Chapter 9: The Regression Model with Qualitative Information: Binary Variables (Dummies) Statistics and Introduction to Econometrics M. Angeles Carnero Departamento de Fundamentos del Análisis Económico

More information

Has the Family Planning Policy Improved the Quality of the Chinese New. Generation? Yingyao Hu University of Texas at Austin

Has the Family Planning Policy Improved the Quality of the Chinese New. Generation? Yingyao Hu University of Texas at Austin Very preliminary and incomplete Has the Family Planning Policy Improved the Quality of the Chinese New Generation? Yingyao Hu University of Texas at Austin Zhong Zhao Institute for the Study of Labor (IZA)

More information

The Impact of Residential Density on Vehicle Usage and Fuel Consumption: Evidence from National Samples

The Impact of Residential Density on Vehicle Usage and Fuel Consumption: Evidence from National Samples The Impact of Residential Density on Vehicle Usage and Fuel Consumption: Evidence from National Samples Jinwon Kim Department of Transport, Technical University of Denmark and David Brownstone 1 Department

More information

Applied Econometrics Lecture 1

Applied Econometrics Lecture 1 Lecture 1 1 1 Università di Urbino Università di Urbino PhD Programme in Global Studies Spring 2018 Outline of this module Beyond OLS (very brief sketch) Regression and causality: sources of endogeneity

More information

Spatial mismatch. Laurent Gobillon INED, PSE-INRA and CREST. Harris SELOD The World Bank, PSE-INRA and CREST 1

Spatial mismatch. Laurent Gobillon INED, PSE-INRA and CREST. Harris SELOD The World Bank, PSE-INRA and CREST 1 Spatial mismatch Laurent Gobillon INED, PSE-INRA and CREST Harris SELOD The World Bank, PSE-INRA and CREST 1 Keywords Spatial mismatch, urban unemployment, job decentralization, residential segregation,

More information

Additional Material for Estimating the Technology of Cognitive and Noncognitive Skill Formation (Cuttings from the Web Appendix)

Additional Material for Estimating the Technology of Cognitive and Noncognitive Skill Formation (Cuttings from the Web Appendix) Additional Material for Estimating the Technology of Cognitive and Noncognitive Skill Formation (Cuttings from the Web Appendix Flavio Cunha The University of Pennsylvania James Heckman The University

More information

More on Specification and Data Issues

More on Specification and Data Issues More on Specification and Data Issues Ping Yu School of Economics and Finance The University of Hong Kong Ping Yu (HKU) Specification and Data Issues 1 / 35 Functional Form Misspecification Functional

More information

Subject: Note on spatial issues in Urban South Africa From: Alain Bertaud Date: Oct 7, A. Spatial issues

Subject: Note on spatial issues in Urban South Africa From: Alain Bertaud Date: Oct 7, A. Spatial issues Page 1 of 6 Subject: Note on spatial issues in Urban South Africa From: Alain Bertaud Date: Oct 7, 2009 A. Spatial issues 1. Spatial issues and the South African economy Spatial concentration of economic

More information

Cultural Data in Planning and Economic Development. Chris Dwyer, RMC Research Sponsor: Rockefeller Foundation

Cultural Data in Planning and Economic Development. Chris Dwyer, RMC Research Sponsor: Rockefeller Foundation Cultural Data in Planning and Economic Development Chris Dwyer, RMC Research Sponsor: Rockefeller Foundation A Decade of Attempts to Quantify Arts and Culture Economic impact studies Community indicators

More information

Does a Rising Tide Lift All Metropolitan Boats? Assessing Poverty Dynamics by Metropolitan Size and County Type *

Does a Rising Tide Lift All Metropolitan Boats? Assessing Poverty Dynamics by Metropolitan Size and County Type * Does a Rising Tide Lift All Metropolitan Boats? Assessing Poverty Dynamics by Metropolitan Size and County Type * by Mark D. Partridge Canada Research Chair in the New Rural Economy Professor of Agricultural

More information

A Note on Methods. ARC Project DP The Demand for Higher Density Housing in Sydney and Melbourne Working Paper 3. City Futures Research Centre

A Note on Methods. ARC Project DP The Demand for Higher Density Housing in Sydney and Melbourne Working Paper 3. City Futures Research Centre A Note on Methods ARC Project DP0773388 The Demand for Higher Density Housing in Sydney and Melbourne Working Paper 3 City Futures Research Centre February 2009 A NOTE ON METHODS Dr Raymond Bunker, Mr

More information

Causal Inference with General Treatment Regimes: Generalizing the Propensity Score

Causal Inference with General Treatment Regimes: Generalizing the Propensity Score Causal Inference with General Treatment Regimes: Generalizing the Propensity Score David van Dyk Department of Statistics, University of California, Irvine vandyk@stat.harvard.edu Joint work with Kosuke

More information

The Spatial Mismatch Hypothesis: A Review of Recent Studies and Their Implications for Welfare Reform

The Spatial Mismatch Hypothesis: A Review of Recent Studies and Their Implications for Welfare Reform Housing Policy Debate Volume 9, Issue 4 849 Fannie Mae Foundation 1998. All Rights Reserved. The Spatial Mismatch Hypothesis: A Review of Recent Studies and Their Implications for Welfare Reform Keith

More information

Project Report for STAT571 Statistical Methods Instructor: Dr. Ramon V. Leon. Wage Data Analysis. Yuanlei Zhang

Project Report for STAT571 Statistical Methods Instructor: Dr. Ramon V. Leon. Wage Data Analysis. Yuanlei Zhang Project Report for STAT7 Statistical Methods Instructor: Dr. Ramon V. Leon Wage Data Analysis Yuanlei Zhang 77--7 November, Part : Introduction Data Set The data set contains a random sample of observations

More information

Job Training Partnership Act (JTPA)

Job Training Partnership Act (JTPA) Causal inference Part I.b: randomized experiments, matching and regression (this lecture starts with other slides on randomized experiments) Frank Venmans Example of a randomized experiment: Job Training

More information

Income Distribution Dynamics with Endogenous Fertility. By Michael Kremer and Daniel Chen

Income Distribution Dynamics with Endogenous Fertility. By Michael Kremer and Daniel Chen Income Distribution Dynamics with Endogenous Fertility By Michael Kremer and Daniel Chen I. Introduction II. III. IV. Theory Empirical Evidence A More General Utility Function V. Conclusions Introduction

More information

Part VII. Accounting for the Endogeneity of Schooling. Endogeneity of schooling Mean growth rate of earnings Mean growth rate Selection bias Summary

Part VII. Accounting for the Endogeneity of Schooling. Endogeneity of schooling Mean growth rate of earnings Mean growth rate Selection bias Summary Part VII Accounting for the Endogeneity of Schooling 327 / 785 Much of the CPS-Census literature on the returns to schooling ignores the choice of schooling and its consequences for estimating the rate

More information

Modern Urban and Regional Economics

Modern Urban and Regional Economics Modern Urban and Regional Economics SECOND EDITION Philip McCann OXFORD UNIVERSITY PRESS Contents List of figures List of tables Introduction xii xiv xvii Part I Urban and Regional Economic Models and

More information

14.461: Technological Change, Lecture 4 Technology and the Labor Market

14.461: Technological Change, Lecture 4 Technology and the Labor Market 14.461: Technological Change, Lecture 4 Technology and the Labor Market Daron Acemoglu MIT September 20, 2016. Daron Acemoglu (MIT) Technology and the Labor Market September 20, 2016. 1 / 51 Technology

More information

Volume Author/Editor: Gregory K. Ingram, John F. Kain, and J. Royce Ginn. Volume URL:

Volume Author/Editor: Gregory K. Ingram, John F. Kain, and J. Royce Ginn. Volume URL: This PDF is a selection from an out-of-print volume from the National Bureau of Economic Research Volume Title: The Detroit Prototype of the NBER Urban Simulation Model Volume Author/Editor: Gregory K.

More information

Limited Dependent Variables and Panel Data

Limited Dependent Variables and Panel Data and Panel Data June 24 th, 2009 Structure 1 2 Many economic questions involve the explanation of binary variables, e.g.: explaining the participation of women in the labor market explaining retirement

More information

One Economist s Perspective on Some Important Estimation Issues

One Economist s Perspective on Some Important Estimation Issues One Economist s Perspective on Some Important Estimation Issues Jere R. Behrman W.R. Kenan Jr. Professor of Economics & Sociology University of Pennsylvania SRCD Seattle Preconference on Interventions

More information

Secondary Towns and Poverty Reduction: Refocusing the Urbanization Agenda

Secondary Towns and Poverty Reduction: Refocusing the Urbanization Agenda Secondary Towns and Poverty Reduction: Refocusing the Urbanization Agenda Luc Christiaensen and Ravi Kanbur World Bank Cornell Conference Washington, DC 18 19May, 2016 losure Authorized Public Disclosure

More information

Analysis of travel-to-work patterns and the identification and classification of REDZs

Analysis of travel-to-work patterns and the identification and classification of REDZs Analysis of travel-to-work patterns and the identification and classification of REDZs Dr David Meredith, Teagasc, Spatial Analysis Unit, Rural Economy Development Programme, Ashtown, Dublin 15. david.meredith@teagasc.ie

More information

Final Exam - Solutions

Final Exam - Solutions Ecn 102 - Analysis of Economic Data University of California - Davis March 19, 2010 Instructor: John Parman Final Exam - Solutions You have until 5:30pm to complete this exam. Please remember to put your

More information

Estimating Marginal and Average Returns to Education

Estimating Marginal and Average Returns to Education Estimating Marginal and Average Returns to Education Pedro Carneiro, James Heckman and Edward Vytlacil Econ 345 This draft, February 11, 2007 1 / 167 Abstract This paper estimates marginal and average

More information

Econometrics -- Final Exam (Sample)

Econometrics -- Final Exam (Sample) Econometrics -- Final Exam (Sample) 1) The sample regression line estimated by OLS A) has an intercept that is equal to zero. B) is the same as the population regression line. C) cannot have negative and

More information

A User s Guide to the Federal Statistical Research Data Centers

A User s Guide to the Federal Statistical Research Data Centers A User s Guide to the Federal Statistical Research Data Centers Mark Roberts Professor of Economics and Director PSU FSRDC September 2016 M. Roberts () RDC User s Guide September 2016 1 / 14 Outline Introduction

More information

Normalized Equation and Decomposition Analysis: Computation and Inference

Normalized Equation and Decomposition Analysis: Computation and Inference DISCUSSION PAPER SERIES IZA DP No. 1822 Normalized Equation and Decomposition Analysis: Computation and Inference Myeong-Su Yun October 2005 Forschungsinstitut zur Zukunft der Arbeit Institute for the

More information

A Measure of Robustness to Misspecification

A Measure of Robustness to Misspecification A Measure of Robustness to Misspecification Susan Athey Guido W. Imbens December 2014 Graduate School of Business, Stanford University, and NBER. Electronic correspondence: athey@stanford.edu. Graduate

More information

CRP 272 Introduction To Regression Analysis

CRP 272 Introduction To Regression Analysis CRP 272 Introduction To Regression Analysis 30 Relationships Among Two Variables: Interpretations One variable is used to explain another variable X Variable Independent Variable Explaining Variable Exogenous

More information

Identification and Estimation Using Heteroscedasticity Without Instruments: The Binary Endogenous Regressor Case

Identification and Estimation Using Heteroscedasticity Without Instruments: The Binary Endogenous Regressor Case Identification and Estimation Using Heteroscedasticity Without Instruments: The Binary Endogenous Regressor Case Arthur Lewbel Boston College December 2016 Abstract Lewbel (2012) provides an estimator

More information

Course Description. Course Requirements

Course Description. Course Requirements University of Pennsylvania Fall, 2015 Econ 721: Econometrics III Advanced Petra Todd Course Description Lecture: 10:30-11:50 Mondays and Wednesdays Office Hours: 10-11am Tuesdays or by appointment. To

More information