Question 1 carries a weight of 25%; Question 2 carries 20%; Question 3 carries 20%; Question 4 carries 35%.

UNIVERSITY OF EAST ANGLIA School of Economics Main Series PGT Examination 017-18 ECONOMETRIC METHODS ECO-7000A Time allowed: hours Answer ALL FOUR Questions. Question 1 carries a weight of 5%; Question carries 0%; Question 3 carries 0%; Question 4 carries 35%. Marks awarded for individual parts are shown in square brackets. A formula sheet, t-tables, F-tables, and chi-squared tables are attached to the examination paper. Notes are not permitted in this examination. Do not turn over until you are told to do so by the Invigilator. ECO-7000A Module Contact: Dr Susan Long, ECO Copyright of the University of East Anglia Version

Page THIS PAGE IS DELIBERATELY LEFT BLANK

Page 3 QUESTION 1 [5 Marks] ALL WORKING MUST BE SHOWN IN YOUR ANSWER TO THIS QUESTION The following table contains data on weekly income (X) and weekly expenditure on restaurant meals (Y) for a sample of six households. Both variables are measured in pounds. Household X Y A 400 30 B 500 0 C 600 60 D 800 40 E 900 90 F 1000 140 (a) Obtain ordinary least squares estimates of β1 and β in the model: Yi = β1+ βxi + ui i = 1, L, 6 [10 marks] (b) (c) (d) Place a precise economic interpretation on each of the two parameter estimates, ˆ β 1 and ˆ β. In particular, does it make any economic sense that your estimate of the intercept is negative? [6 marks] Find the residuals. Which of the six households has the highest positive residual associated with it? What conclusion can you draw about this household? [3 marks] Test the null hypothesis that β = 0 against the alternative that β > 0. What is the economic interpretation of your result (i.e. what term would an economist use to describe restaurant meals)? [6 marks] TURN OVER

Page 4 QUESTION [0 MARKS] Data on 900 properties sold in four postcode areas of Norwich during 017 are collected. The variables are: price: sqft: beds: nr1: nr: nr3: nr4: Price of property in thousands of pounds Internal area of property in square feet Number of bedrooms in property 1 if property is in NR1 (Central Norwich); 0 otherwise 1 if property is in NR (West Central Norwich); 0 otherwise 1 if property is in NR3 (North Central Norwich); 0 otherwise 1 if property is in NR4 (South-West Norwich); 0 otherwise Two models are estimated, with the following results:. * MODEL 1. regress price sqft beds Source SS df MS Number of obs = 900 -------------+---------------------------------- F(, 897) = 147.04 Model 846367.865 43183.933 Prob > F = 0.0000 Residual 581654.38 897 878.0985 R-squared = 0.469 -------------+---------------------------------- Adj R-squared = 0.45 Total 3480.4 899 3813.15044 Root MSE = 53.648 ------------------------------------------------------------------------------ price Coef. Std. Err. t P> t [95% Conf. Interval] -------------+---------------------------------------------------------------- sqft.096701.0189933 4.88 0.000.0553935.199467 beds -7.034638 5.876785-1.0 0.3-18.56849 4.4991 _cons 91.577 10.31896 8.87 0.000 71.3508 111.893 ------------------------------------------------------------------------------. * MODEL. regress price sqft beds nr nr3 nr4 Source SS df MS Number of obs = 900 -------------+---------------------------------- F(5, 894) = 97.34 Model 108393.53 5 41678.707 Prob > F = 0.0000 Residual 1968.71 894 48.80616 R-squared = 0.355 -------------+---------------------------------- Adj R-squared = 0.3489 Total 3480.4 899 3813.15044 Root MSE = 49.88 ------------------------------------------------------------------------------ price Coef. Std. Err. t P> t [95% Conf. Interval] -------------+---------------------------------------------------------------- sqft.1081395.0177348 6.10 0.000.073338.149463 beds -11.55759 5.481609 -.11 0.035 -.31591 -.79965 nr 6.951 4.733318 1.46 0.144 -.364489 16.1493 nr3-10.97375 4.770331 -.30 0.0-0.3361-1.611394 nr4 4.0009 4.633934 9.06 0.000 3.90564 51.09495 _cons 73.14067 10.01765 7.30 0.000 53.47983 9.80151 ------------------------------------------------------------------------------ (a) Interpret the estimate of the intercept parameter in Model 1. Do the same for Model. [5 marks]

Page 5 (b) Explain why only three of the four postcode dummies have been included in Model. What would happen if you attempted to include all four? [5marks] (c) (d) Using an F-test, test Model 1 as a restricted version of Model. Interpret the result of the test. [5 marks] Interpret the coefficients on the three included postcode dummies. Which area has the highest prices, and which lowest? Why is the ceteris paribus concept important here? [5 marks] TURN OVER

Page 6 QUESTION 3 [0 marks] We have data on 53 countries in 017. Let p_locali be the price of a Big Mac (the McDonald s hamburger) in country i in local currency in 017. Let ei be the exchange rate for country i against the US dollar in 017 (that is, ei is the number of units of local currency that can be exchanged for one US dollar in 017). (a) Data on three of the 53 countries is shown in the following table. Country Currency p_local e Indonesia Rupiah 3469.6 1359 Norway Kroner 48.1 8.13 Japan Yen 38. 113.7 Compute the price of a Big Mac in each of the three countries in US dollars. On this basis, which of the three currencies appears under-valued in 017, and which appears over-valued? [7 marks] The following regression model is estimated using data from all 53 countries in 017 (p_usa is the price of a Big Mac in the USA in 017): _ log p locali = β + 1 βlog e + i ui ; i = 1, L,53 (1) p _ usa Following the regression, two tests are performed. The results are as follows:. regress log_p_ratio log_e Source SS df MS Number of obs = 53 -------------+---------------------------------- F(1, 51) = 4817.75 Model 37.61177 1 37.61177 Prob > F = 0.0000 Residual 3.468153 51.068003004 R-squared = 0.9895 -------------+---------------------------------- Adj R-squared = 0.9893 Total 331.08933 5 6.367105 Root MSE =.6077 ------------------------------------------------------------------------------ log_p_ratio Coef. Std. Err. t P> t [95% Conf. Interval] -------------+---------------------------------------------------------------- log_e.949016.0136753 69.41 0.000.917473.9766559 _cons -.401569.0475768-5.05 0.000 -.3356713 -.144645 ------------------------------------------------------------------------------. test (_b[_cons]=0) (_b[log_e]=1) ( 1) _cons = 0 ( ) log_e = 1 F(, 51) = 56.4 Prob > F = 0.0000. test (_b[log_e]=1) ( 1) log_e = 1 F( 1, 51) = 13.80 Prob > F = 0.0005

Page 7 (b) (c) Consider the two tests performed following the regression above. The first test is a test of the Law of One Price (LOP). Explain the concept of the LOP. Is it rejected by the 017 Big Mac data? Which theory is being tested by the second test? Is it rejected? [7 marks] Explain the Balassa-Samuelson effect, and explain why you might expect it to hold. How would you extend the regression model (1) in order to carry out a test of the Balassa-Samuelson effect? [6 marks] TURN OVER

QUESTION 4 [35 Marks] Page 8 A sample of 1,000 high school graduates in the US is drawn and the following data is collected on each: Y: = 1 if the student goes to college, = 0 otherwise Grades: Average grade in Maths, English and Social Studies on a 13 point grading scale, with 13 for the highest grade and 1 for the lowest grade Income: Gross family income in thousands of dollars Famsize: Number of family members Parcoll: = 1 if the most educated parent graduated from college, = 0 otherwise Gender: = 1 if male, = 0 if female White: = 1 if the student is white, = 0 if non-white Two logit models are estimated, with Y as the dependent variable. The results are shown in the following table. The numbers in brackets are the asymptotic standard errors. Model 1 Model Constant -.60 (0.41) -.40 (0.4) Grades 0.60 (0.07) 0.65 (0.06) Income 0.01 (0.004) 0.01 (0.005) Famsize -0.97 (0.73) -0.85 (0.74) Parcoll 0.53 (0.5) 0.40 (0.18) Gender -0.15 (0.19) White 0.0 (0.15) LogL -84. -836.1 (a) (b) (c) (d) A colleague been asked to estimate the probability of a high school graduate going to college. They would like to estimate a linear probability model but are unsure whether this model is suitable. What would you advise? [4 marks] Using Model 1, interpret the results and test for the individual significance of the four explanatory variables. [10 marks] Using Model, predict the probability of a white female from a family with four members, whose average grade is 9, whose family income is $60,000 and whose most educated family member is a college graduate, going to college. [6 marks] Conduct a likelihood ratio (LR) test of the joint significance of the Gender and White dummy variables. [3 marks]

(e) (f) Page 9 Explain how you would extend Model to investigate whether the effect of grades on the probability of going to college differs between males and females. [4 marks] Suppose now that the high school graduate has three options; their choices are: 1. Do not go to college. A two-year college course 3. A four-year college course Which model could be used to estimate the probability of the high school graduate choosing each of these options? [4 marks] (g) Do you think that the assumption of Independence of Irrelevant Alternatives (IIA) might be violated in this situation? Explain your answer. [4 marks] END OF PAPER

The simple regression model Consider the model: Y = β + β X + u i,...,n. i 1 i i = 1 Page 10 Econometric Methods Formula Sheet The ordinary least squares estimators of β and β1 are: ( X i X )Y i ˆ i β = ( X i X ) i ˆ β = Y ˆ 1 β The fitted values of Y are given by: ˆ Ŷi = β 1 + β X i The residuals are: û i i ˆ = Y Ŷ i X The standard error of the regression is given by: = ûi ˆ σ n The estimated standard errors of $ β and $ β 1 are given by: se( ˆ β ) = ˆ σ se( ˆ β ) = ˆ σ 1 ( X i 1 X ) 1 X + n ( X i X ) Testing joint restrictions in the multiple regression model Let n be the sample size, let r be the number of restrictions under test, let k be the number of parameters in the unrestricted model, let R U be the R in the unrestricted model and let R R be the R in the restricted model. Under the null hypothesis that the r restrictions are true, the F-statistic (R U R R ) / r F = ( 1 R U ) /(n k ) has an Fr,n-k distribution, that is, an F distribution with r, n-k degrees of freedom. The Logit Model exp( xi ' β ) P(Yi = 1) = 1+ exp( x ' β ) i

Page 11 Table 1: Critical values of the t-distribution df α = 0.10 α = 0.05 α = 0.05 α = 0.01 α = 0.005 1 3.08 6.31 1.71 31.8 63.66 1.89.9 4.30 6.97 9.93 3 1.64.35 3.18 4.54 5.84 4 1.53.13.78 3.75 4.60 5 1.48.0.57 3.37 4.03 6 1.44 1.94.45 3.14 3.71 7 1.4 1.90.37 3.00 3.50 8 1.40 1.86.31.90 3.36 9 1.38 1.83.6.8 3.5 10 1.37 1.81.3.76 3.17 11 1.36 1.80.0.7 3.11 1 1.36 1.78.18.68 3.06 13 1.35 1.77.16.65 3.01 14 1.35 1.76.15.6.98 15 1.34 1.75.13.60.95 16 1.34 1.75.1.58.9 17 1.33 1.74.11.57.90 18 1.33 1.73.10.55.88 19 1.33 1.73.09.54.86 0 1.33 1.73.09.53.85 1 1.3 1.7.08.5.83 1.3 1.7.07.51.8 3 1.3 1.71.07.50.81 4 1.3 1.71.06.49.80 5 1.3 1.71.06.49.79 6 1.3 1.70.06.48.78 7 1.31 1.70.05.47.77 8 1.31 1.70.05.47.76 9 1.31 1.70.04.46.76 30 1.31 1.70.04.46.75 40 1.30 1.68.0.4.70 50 1.30 1.68.01.40.68 60 1.30 1.67.00.39.66 70 1.9 1.67 1.99.38.65 80 1.9 1.66 1.99.37.64 90 1.9 1.66 1.99.37.63 100 1.9 1.66 1.98.36.63 15 1.9 1.66 1.98.36.6 150 1.9 1.65 1.98.35.61 00 1.9 1.65 1.97.35.60 1.8 1.64 1.96.33.58

Page 1 Table : Critical values of the F- distribution (α=0.05) df1=1 3 4 5 6 7 8 10 15 df=1 161.4 199.5 15.7 4.6 30. 34.0 37.0 38.9 41.9 45.9 18.51 19.00 19.16 19.5 19.30 19.33 19.4 19.37 19.40 19.43 3 10.13 9.55 9.8 9.1 9.01 8.94 8.89 8.85 8.79 8.70 4 7.71 6.94 6.59 6.39 6.6 6.16 6.09 6.04 5.96 5.86 5 6.61 5.79 5.41 5.19 5.05 4.95 4.88 4.8 4.74 4.6 6 5.99 5.14 4.76 4.53 4.39 4.8 4.1 4.15 4.06 3.94 7 5.59 4.74 4.35 4.1 3.97 3.87 3.79 3.73 3.64 3.51 8 5.3 4.46 4.07 3.84 3.69 3.58 3.50 3.44 3.35 3. 9 5.1 4.6 3.86 3.63 3.48 3.37 3.9 3.3 3.14 3.01 10 4.96 4.10 3.71 3.48 3.33 3. 3.14 3.07.98.85 11 4.84 3.98 3.59 3.36 3.0 3.09 3.01.95.85.7 1 4.75 3.89 3.49 3.6 3.11 3.00.91.85.75.6 13 4.67 3.81 3.41 3.18 3.03.9.83.77.67.53 14 4.60 3.74 3.34 3.11.96.85.76.70.60.46 15 4.54 3.68 3.9 3.06.90.79.71.64.54.40 16 4.49 3.63 3.4 3.01.85.74.66.59.49.35 17 4.45 3.59 3.0.96.81.70.61.55.45.31 18 4.41 3.55 3.16.93.77.66.58.51.41.7 19 4.38 3.5 3.13.90.74.63.54.48.38.3 0 4.35 3.49 3.10.87.71.60.51.45.35.0 1 4.3 3.47 3.07.84.68.57.49.4.3.18 4.30 3.44 3.05.8.66.55.46.40.30.15 3 4.8 3.4 3.03.80.64.53.44.37.7.13 4 4.6 3.40 3.01.78.6.51.4.36.5.11 5 4.4 3.39.99.76.60.49.40.34.4.09 6 4.3 3.37.98.74.59.47.39.3..07 7 4.1 3.35.96.73.57.46.37.31.0.06 8 4.0 3.34.95.71.56.45.36.9.19.04 9 4.18 3.33.93.70.55.43.35.8.18.03 30 4.17 3.3.9.69.53.4.33.7.16.01 40 4.08 3.3.84.61.45.34.5.18.08 1.9 50 4.03 3.18.79.56.40.9.0.13.03 1.87 60 4.00 3.15.76.53.37.5.17.10 1.99 1.84 70 3.98 3.13.74.50.35.3.14.07 1.97 1.81 80 3.96 3.11.7.49.33.1.13.06 1.95 1.79 90 3.95 3.10.71.47.3.0.11.04 1.94 1.78 100 3.94 3.09.70.46.31.19.10.03 1.93 1.77 15 3.9 3.07.68.44.9.17.09.01 1.91 1.75 150 3.90 3.06.66.43.7.16.08.00 1.89 1.73 00 3.89 3.04.65.4.6.14.06 1.98 1.88 1.7 3.84 3.00.60.37.1.10.01 1.94 1.83 1.67

Page 13 Table 4: Critical values of the χ -distribution df α = 0.10 α = 0.05 α = 0.05 α = 0.01 α = 0.005 1.71 3.84 5.0 6.64 7.88 4.61 5.99 7.38 9.1 10.60 3 6.5 7.8 9.35 11.34 1.84 4 7.78 9.49 11.14 13.8 14.86 5 9.4 11.07 1.83 15.09 16.75 6 10.64 1.59 14.45 16.81 18.55 7 1.0 14.07 16.01 18.48 0.8 8 13.36 15.51 17.53 0.09 1.95 9 14.68 16.9 19.0 1.67 3.59 10 15.99 18.31 0.48 3.1 5.19 11 17.8 19.68 1.9 4.7 6.76 1 18.55 1.03 3.34 6. 8.30 13 19.81.36 4.74 7.69 9.8 14 1.06 3.68 6.1 9.14 31.3 15.31 5.00 7.49 30.58 3.80 END OF MATERIALS