Properties of estimator Functional Form. Econometrics. Lecture 8. Nathaniel Higgins JHU. Nathaniel Higgins Lecture 8
|
|
- Gary Hood
- 5 years ago
- Views:
Transcription
1 Econometrics Lecture 8 Nathaniel Higgins JHU
2 Homework Next class: GDP, population, temperature, energy, and mortality data together by Nov. 9 (next class) If you have questions / need help, let Rob or I know! Rob will use part of his time this week to specifically help iron out any difficulties with collecting and curating this data Put time asap into getting all of this data on hand If you have time to collect all the data, do it!
3 Hypotheses Wrap-up We have completed material on basic hypothesis testing You should be able to: 1 Test single hypotheses involving a single coefficient 2 Test single hypotheses involving a combination of coefficients 3 Test multiple hypotheses jointly Single hypotheses can be evaluated against: 1 One-sided alternative 2 Two-sided alternative Can evaluate all our hypotheses using test statistics or p-values Can evaluate our t-tests using a confidence interval as well
4 Next topic Properties of the OLS estimator Material maps over to Wooldridge Ch 5 This finishes up our work with basic OLS Next set of topics will be working on functional form how to write models that deal with different shapes in the data (more fun)
5 Properties of the OLS estimator Everything we ve done up to this point has used the OLS estimator Why? Because the OLS estimator has nice properties under a reasonable set of assumptions Recall the assumptions we have made: MRL.1-MLR.6
6 Gauss-Markov assumptions MLR.1 y = β 0 + β 1 x β k x k + u 2 MLR.2 Our data (x,y) represent a random sample from the relevant population 3 MLR.3 x i z x j 4 MLR.4 E(u x) = 0
7 Gauss-Markov assumptions MLR.5 Var(u x) = σ 2 6 MLR.6 u is independent of x s and is distributed normally with mean of zero and variance of σ 2
8 Properties of the OLS estimator If we make assumptions MLR.1 - MLR.4, we get: OLS is unbiased (see section 3.3) know this! If we add assumption MLR.5, we get: OLS is BLUE (see section 3.5) know this! If we add assumption MLR.6, not only do we know that OLS is best in the sense that it has lower variance than all the other linear estimators, we can actually compute the variance
9 Properties of the OLS estimator If we make assumptions MLR.1 - MLR.4, we get: OLS is unbiased (see section 3.3) know this! If we add assumption MLR.5, we get: OLS is BLUE (see section 3.5) know this! If we add assumption MLR.6, not only do we know that OLS is best in the sense that it has lower variance than all the other linear estimators, we can actually compute the variance If we assume only MLR.1 - MLR.4, we get that OLS is consistent (new jargon)
10 Properties of the OLS estimator Consistency Consistency is a large-sample property What does this mean, practically? It means you appeal to large sample properties whenever you can. Typically, large sample properties can be appealed to with dsets that most of you would consider small Consistency means: as the sample size gets bigger and bigger (formally: approaches infinity ), the distribution of the OLS estimator β j collapses around the true value β j That s really all I expect you to know: as you get more and more data, the OLS estimator gets better and better (in the sense that our estimates become tighter around the true value)
11 Properties of the OLS estimator Normality So if we assume MLR.1 - MLR.4, we get that OLS is asymptotically awesome But recall: we needed MLR.5 and MLR.6 to perform inference Why? Because we needed to know the exact distribution of β j to form our t- and F-statistics Problem: in small samples, we may find that u doesn t look very normal To the rescue: the central limit theorem (appendix C) If we assume MLR.5 (homoskedasticity) but NOT MLR.6 (normality), we get: As n approaches infinity, ( β j β j )/se( β j ) is distributed normally
12 Properties of the OLS estimator Normality Hold the phone! I thought that ( β j β j )/se( β j ) was distributed at a t!
13 Properties of the OLS estimator Normality Hold the phone! I thought that ( β j β j )/se( β j ) was distributed at a t! Fear not. It is distributed as a t. But as sample sizes get really big, the t distribution starts to look exactly like N(0,1) (the standard normal distribution).
14 Properties of the OLS estimator Normality Hold the phone! I thought that ( β j β j )/se( β j ) was distributed at a t! Fear not. It is distributed as a t. But as sample sizes get really big, the t distribution starts to look exactly like N(0,1) (the standard normal distribution). So... the central limit theorem helps rescue us, because we can essentially use the same testing procedures that we have always used even if u is not normally distributed, as long as the sample is big enough
15 Properties of the OLS estimator Normality Hold the phone! I thought that ( β j β j )/se( β j ) was distributed at a t! Fear not. It is distributed as a t. But as sample sizes get really big, the t distribution starts to look exactly like N(0,1) (the standard normal distribution). So... the central limit theorem helps rescue us, because we can essentially use the same testing procedures that we have always used even if u is not normally distributed, as long as the sample is big enough What s big enough?
16 Properties of the OLS estimator Normality What s big enough? Depends. If u is not exactly normal, but it s pretty close, n = 30 is plenty. If u is really really not normal, then we need more n.
17 Properties of the OLS estimator Normality What s big enough? Depends. If u is not exactly normal, but it s pretty close, n = 30 is plenty. If u is really really not normal, then we need more n. How to tell? Graph the distribution of û! If it looks kinda symmetrical and nice, you re in good shape with a relatively small sample. If it looks way ugly, buckle up and get a bigger sample.
18 Properties of the OLS estimator Efficiency With small samples, we knew that OLS was BLUE (using MLR.1 - MLR.5) Saying that OLS is BLUE means: OLS is the Best Linear Unbiased Estimator Bestness refers to the fact that the OLS estimator has minimum variance among all the other linear unbiased estimators A synonym for best in this context is efficient. When we say OLS is efficient, we mean that it has minimum variance (ask me why) With large samples, we can say that OLS is asymptotically efficient
19 Plan Discuss functional form more directly than we have so far logarithmic transformations polynomial forms units
20 Functional form Transformation of variables Everything we have learned so far has been about linear models y = β 0 + β 1 x + u We say that the model above is linear There is a small set of nonlinear functional forms that you can work with... because you can transform them into linear functional forms
21 Functional form Transformation of variables dep var ind var coeff level-level y x y due to x log-log log(y) log(x) % in y due to 1% in x
22 Functional form Transformation of variables dep var ind var coeff level-level y x y due to x log-log log(y) log(x) % in y due to 1% in x level-log y log(x) 100 y due to 1% x log-level log(y) x (1/100) (% y) due to x
23 Functional form Transformation of variables dep var ind var coeff level-level y x y due to x log-log log(y) log(x) % in y due to 1% in x level-log y log(x) 100 y due to 1% x log-level log(y) x (1/100) (% y) due to x dep var ind var coeff level-level y x y = β x log-log log(y) log(x) % y = β% x
24 Functional form Transformation of variables dep var ind var coeff level-level y x y due to x log-log log(y) log(x) % in y due to 1% in x level-log y log(x) 100 y due to 1% x log-level log(y) x (1/100) (% y) due to x dep var ind var coeff level-level y x y = β x log-log log(y) log(x) % y = β% x level-log y log(x) y = (1/100) β% x log-level log(y) x % y = 100 β x
25 Functional form Transformation of variables Interpretation of the coefficient values when using a logarithmic transformation: ŷ = x When x increases by one unit, y increases by 5 units
26 Functional form Transformation of variables Interpretation of the coefficient values when using a logarithmic transformation: ŷ = x When x increases by one unit, y increases by 5 units ŷ = log(x) When x increases by one pct., y increases by 0.05
27 Functional form Transformation of variables Interpretation of the coefficient values when using a logarithmic transformation: ŷ = x When x increases by one unit, y increases by 5 units ŷ = log(x) When x increases by one pct., y increases by 0.05 log(y) = x When x increases by one unit, y increases by 500 pct.
28 Functional form Transformation of variables Interpretation of the coefficient values when using a logarithmic transformation: log(y) = log(x) When x increases by one pct., y increases by 5 pct.
29 Functional form Transformation of variables Interpretation of the coefficient values when using a logarithmic transformation: log(y) = log(x) When x increases by one pct., y increases by 5 pct. A logarithmic transformation should be done for the right reasons
30 Functional form Transformation of variables Interpretation of the coefficient values when using a logarithmic transformation: log(y) = log(x) When x increases by one pct., y increases by 5 pct. A logarithmic transformation should be done for the right reasons Even though the interpretation of the coefficients in a log-transformed model are sometimes convenient, you should never transform data because of the interpretation of coefficients OK, who s going to ask it?
31 Functional form Transformation of variables Interpretation of the coefficient values when using a logarithmic transformation: log(y) = log(x) When x increases by one pct., y increases by 5 pct. A logarithmic transformation should be done for the right reasons Even though the interpretation of the coefficients in a log-transformed model are sometimes convenient, you should never transform data because of the interpretation of coefficients OK, who s going to ask it? Q: so why do you do the transformation?
32 Functional form Transformation of variables Interpretation of the coefficient values when using a logarithmic transformation: log(y) = log(x) When x increases by one pct., y increases by 5 pct. A logarithmic transformation should be done for the right reasons Even though the interpretation of the coefficients in a log-transformed model are sometimes convenient, you should never transform data because of the interpretation of coefficients OK, who s going to ask it? Q: so why do you do the transformation? Because it fits the data better
33 Functional form Transformation of variables Because it fits the data better? Because theory or practice (previous experience) suggest that the relationship is best described by logarithmic transforms Example: economic growth is often thought to be best described by a proportional relationship (Horowitz transforms GDP using logarithms)
34 Functional form Polynomials The use of polynomials gives us a lot of flexibility You can approximate any crazy functional form with polynomials ( Typically, we use only quadratic (squared-terms) or cubic (cubed-terms) terms to approximate higher-order terms are rare We do this because we believe the relationship between y and x is not constant Example: environmental kuznets curve (we talked about last time) Using logarithms to describe a nonlinear relationship implies a very specific nonlinear function using polynomials is more flexible
35 Functional form Polynomials Polynomial transforms and their interpretation We usually interpret coefficients β as the marginal impact of changes in x on y In this relationship y = β 0 + β 1 x + u β is the increase in y due to a one unit increase in x
36 Functional form Polynomials Polynomial transforms and their interpretation We usually interpret coefficients β as the marginal impact of changes in x on y In this relationship y = β 0 + β 1 x + u β is the increase in y due to a one unit increase in x Suppose we specify a quadratic relationship y = β 0 + β 1 x + β 2 x 2 + u If x increases by one unit, by how much does y increase?
37 Functional form Polynomials y = β 0 + β 1 x + β 2 x 2 + u If x increases by a little bit, by how much does y increase?
38 Functional form Polynomials y = β 0 + β 1 x + β 2 x 2 + u If x increases by a little bit, by how much does y increase? Calculus! dy dx = β 1 + 2β 2 x
39 Functional form Polynomials y = β 0 + β 1 x + β 2 x 2 + u If x increases by a little bit, by how much does y increase? Calculus! dy dx = β 1 + 2β 2 x When x increases by a little bit, y increases by β 1 + 2β 2 x The coefficients no longer have much interpretation by themselves rather, we interpret a function of the coefficients (above) to describe the relationship between y and x (see Horowitz for an example of this)
40 Functional form Polynomials y = β 0 + β 1 x + β 2 x 2 + u If x increases by one unit, by how much does y increase?
41 Functional form Polynomials y = β 0 + β 1 x + β 2 x 2 + u If x increases by one unit, by how much does y increase? No calculus! y = β 0 + β 1 (x + 1) + β 2 (x + 1) 2 + u Subtract the above from and what do we get? = β 0 + β 1 x + β 1 + β 2 x 2 + 2β 2 x + β 2 + u y = β 0 + β 1 x + β 2 x 2 + u β 1 + 2β 2 x + β 2
42 Functional form Units That s enough about specific functional form for now Using logarithms and polynomials, you will be able to understand 99% of the functional forms used in applied econometrics (Other transformations like sin, cos, etc. are used, but very rarely and for specific reasons, such as describing cyclic behavior, e.g.) For exam-type purposes, make sure you understand how to work with coefficients to get to an interpretation of the relationship between y and x (see these slides, and page 46 of Wooldridge) There is one more thing we should touch on briefly: units
43 Functional form Units We are interested in the relationship between GDP and energy reserves (oil, e.g.) We specify the following model gdp = β 0 + β 1 oil + u
44 Functional form Units We are interested in the relationship between GDP and energy reserves (oil, e.g.) We specify the following model gdp = β 0 + β 1 oil + u Nobody balks at this. Seems reasonable.
45 Functional form Units We are interested in the relationship between GDP and energy reserves (oil, e.g.) We specify the following model gdp = β 0 + β 1 oil + u Nobody balks at this. Seems reasonable. But how is oil measured? In what units?
46 Functional form Units We are interested in the relationship between GDP and energy reserves (oil, e.g.) We specify the following model gdp = β 0 + β 1 oil + u Nobody balks at this. Seems reasonable. But how is oil measured? In what units? We could legitimately measure oil in barrels, in thousands of barrels, or in BTUs Any way we do it, we are still representing the fundamental relationship between gdp and oil
47 Functional form Units Suppose we measure oil in BTUs and run our regression. We get: ĝdp = β 0 + β 1 BTU
48 Functional form Units Suppose we measure oil in BTUs and run our regression. We get: ĝdp = β 0 + β 1 BTU What increase in GDP do we expect (based on our model) due to an increase in known oil reserves, measured in barrels?
49 Functional form Units Suppose we measure oil in BTUs and run our regression. We get: ĝdp = β 0 + β 1 BTU What increase in GDP do we expect (based on our model) due to an increase in known oil reserves, measured in barrels? That is, we want to know what α 1 is from the regression: ĝdp = α 0 + α 1 barrels
50 Functional form Units Suppose we measure oil in BTUs and run our regression. We get: ĝdp = β 0 + β 1 BTU What increase in GDP do we expect (based on our model) due to an increase in known oil reserves, measured in barrels? That is, we want to know what α 1 is from the regression: ĝdp = α 0 + α 1 barrels We need to know the conversion between BTUs and barrels: 5.6 million BTU/barrel
51 Functional form Units Then we can say α 1 = 5.6million β 1
52 Functional form Dummies Use of dummy variables on the RHS Wait. What are dummy variables? Simple:
53 Functional form Dummies Use of dummy variables on the RHS Wait. What are dummy variables? Simple: dummy variables are binary (0/1) variables that indicate something (That s why they are also known as indicator variables)
54 Functional form Dummies Use of dummy variables on the RHS Wait. What are dummy variables? Simple: dummy variables are binary (0/1) variables that indicate something (That s why they are also known as indicator variables) You probably already know what they are
55 Dummy variables A dummy variable is an indicator variable the variable indicates that some particular thing is true about an observation Easiest way to start thinking about it is with a toy example (next slide)
56 Dummy variables A dummy variable is an indicator variable the variable indicates that some particular thing is true about an observation Easiest way to start thinking about it is with a toy example (next slide) Keep in mind as we go through the examples: Big idea: means Remember when I told you that econometrics was nothing but fancy averaging?
57 Toy example We have a sample of data that includes wage data for men and women We suspect that men and women might earn different wages We can run a model to test this presumption wage = β 0 + β 1 male + u
58 Toy example We have a sample of data that includes wage data for men and women We suspect that men and women might earn different wages We can run a model to test this presumption wage = β 0 + β 1 male + u male is a dummy variable What is the interpretation of β 0 in the model above?
59 Toy example Approach to interpretation: zero-out male wage = β 0 + β u What does this leave us with?
60 Toy example Approach to interpretation: zero-out male wage = β 0 + β u What does this leave us with? Women.
61 Toy example Approach to interpretation: zero-out male wage = β 0 + β u What does this leave us with? Women. Another way to say the same thing:
62 Toy example Approach to interpretation: zero-out male wage = β 0 + β u What does this leave us with? Women. Another way to say the same thing: What does the model look like for women?
63 Toy example Approach to interpretation: zero-out male wage = β 0 + β u What does this leave us with? Women. Another way to say the same thing: What does the model look like for women? wage = β 0 + u
64 Toy example Approach to interpretation: zero-out male wage = β 0 + β u What does this leave us with? Women. Another way to say the same thing: What does the model look like for women? wage = β 0 + u This is the wage equation conditional on being a woman (male = 0)
65 Toy example Approach to interpretation: zero-out male wage = β 0 + β u What does this leave us with? Women. Another way to say the same thing: What does the model look like for women? wage = β 0 + u This is the wage equation conditional on being a woman (male = 0) The expected wage of women is given by β 0 (the model without the u term)
66 Toy example What does the model look like for men?
67 Toy example What does the model look like for men? wage = β 0 + β 1 + u It s the wage equation, conditional on being a man (male = 1)
68 Toy example What does the model look like for men? wage = β 0 + β 1 + u It s the wage equation, conditional on being a man (male = 1) What is the expected wage of men?
69 Toy example What does the model look like for men? wage = β 0 + β 1 + u It s the wage equation, conditional on being a man (male = 1) What is the expected wage of men? wage = β 0 + β 1
70 Dummy variables Dummy variables are just like other variables, except they only take on two values: 0 and 1 Using dummy variables effectively creates a bunch of category-specific intercepts (in our first example the two categories were men and women)
71 Dummy variables Dummy variables are just like other variables, except they only take on two values: 0 and 1 Using dummy variables effectively creates a bunch of category-specific intercepts (in our first example the two categories were men and women) Exercise (for studying): Make a model of wages Individuals are categorized two ways: by sex, and by whether or not they have a college degree Build a model and determine the predicted wage (in terms of the coefficients of the model) for every type of person in the sample
72 Dummy variables You might build a model like this wage = β 0 + β 1 male + β 2 college + u
73 Dummy variables You might build a model like this wage = β 0 + β 1 male + β 2 college + u What is the expected wage of: 1 A woman who did not go to college? 2 A woman who did? 3 A man who did not go to college? 4 A man who did?
74 Dummy variables You might build a model like this wage = β 0 + β 1 male + β 2 college + u What is the expected wage of: 1 A woman who did not go to college? β 0 2 A woman who did? 3 A man who did not go to college? 4 A man who did?
75 Dummy variables You might build a model like this wage = β 0 + β 1 male + β 2 college + u What is the expected wage of: 1 A woman who did not go to college? β 0 2 A woman who did? β 0 + β 2 3 A man who did not go to college? 4 A man who did?
76 Dummy variables You might build a model like this wage = β 0 + β 1 male + β 2 college + u What is the expected wage of: 1 A woman who did not go to college? β 0 2 A woman who did? β 0 + β 2 3 A man who did not go to college? β 0 + β 1 4 A man who did?
77 Dummy variables You might build a model like this wage = β 0 + β 1 male + β 2 college + u What is the expected wage of: 1 A woman who did not go to college? β 0 2 A woman who did? β 0 + β 2 3 A man who did not go to college? β 0 + β 1 4 A man who did? β 0 + β 1 + β 2
78 Dummy variables This is pretty easy to see in a table wage = β 0 + β 1 male + β 2 college + u women men no college college
79 Dummy variables This is pretty easy to see in a table wage = β 0 + β 1 male + β 2 college + u no college women β 0 β 0 men β 0 β 0 college
80 Dummy variables This is pretty easy to see in a table wage = β 0 + β 1 male + β 2 college + u no college college women β 0 β 0 men β 0 + β 1 β 0 + β 1
81 Dummy variables This is pretty easy to see in a table wage = β 0 + β 1 male + β 2 college + u no college college women β 0 β 0 + β 2 men β 0 + β 1 β 0 + β 1 + β 2
82 Dummy variables This is pretty easy to see in a table wage = β 0 + β 1 male + β 2 college + u no college college women β 0 β 0 + β 2 men β 0 + β 1 β 0 + β 1 + β 2 There is one other way to build a model like this (entirely out of dummy variables)
83 Dummy variables We can build a model like this without a regular intercept So instead of everybody sharing an intercept, each group can have their own
84 Dummy variables We can build a model like this without a regular intercept So instead of everybody sharing an intercept, each group can have their own wage = β 1 female + β 2 male + β 3 college + u
85 Dummy variables We can build a model like this without a regular intercept So instead of everybody sharing an intercept, each group can have their own wage = β 1 female + β 2 male + β 3 college + u Notice what is not in the model
86 Dummy variables We can build a model like this without a regular intercept So instead of everybody sharing an intercept, each group can have their own wage = β 1 female + β 2 male + β 3 college + u Notice what is not in the model wage = β 1 female + β 2 male + β 3 college + β 4 nocollege + u
87 Dummy variables We can build a model like this without a regular intercept So instead of everybody sharing an intercept, each group can have their own wage = β 1 female + β 2 male + β 3 college + u Notice what is not in the model wage = β 1 female + β 2 male + β 3 college + β 4 nocollege + u The second model cannot work!
88 Dummy variables We can build a model like this without a regular intercept So instead of everybody sharing an intercept, each group can have their own wage = β 1 female + β 2 male + β 3 college + u Notice what is not in the model wage = β 1 female + β 2 male + β 3 college + β 4 nocollege + u The second model cannot work! Why?
89 Dummy variable facts To understand why it won t work, it s easiest to reveal the same fact in a simpler setting
90 Dummy variable facts To understand why it won t work, it s easiest to reveal the same fact in a simpler setting We can t run the following model for the same reason
91 Dummy variable facts To understand why it won t work, it s easiest to reveal the same fact in a simpler setting We can t run the following model for the same reason wage = β 0 + β 1 female + β 2 male + u
92 Dummy variable facts To understand why it won t work, it s easiest to reveal the same fact in a simpler setting We can t run the following model for the same reason wage = β 0 + β 1 female + β 2 male + u We can t run this model because then the sum of the constant that everybody shares (the data that multiplies β 0 ) is exactly equal to the sum of the data female and male
93 Dummy variable facts We can t run this model because then the sum of the constant that everybody shares (the data that multiplies β 0 ) is exactly equal to the sum of the data female and male
94 Dummy variable facts We can t run this model because then the sum of the constant that everybody shares (the data that multiplies β 0 ) is exactly equal to the sum of the data female and male Let s re-write the equation with constant = 1 explicitly in the model so you can see what I mean
95 Dummy variable facts We can t run this model because then the sum of the constant that everybody shares (the data that multiplies β 0 ) is exactly equal to the sum of the data female and male Let s re-write the equation with constant = 1 explicitly in the model so you can see what I mean wage = β 0 + β 1 female + β 2 male + u = β 0 constant + β 1 female + β 2 male + u
96 Dummy variable facts We can t run this model because then the sum of the constant that everybody shares (the data that multiplies β 0 ) is exactly equal to the sum of the data female and male Let s re-write the equation with constant = 1 explicitly in the model so you can see what I mean wage = β 0 + β 1 female + β 2 male + u = β 0 constant + β 1 female + β 2 male + u We can t run this regression because i.e. female + male = 1 constant = female + male
97 Dummy variable facts The data looks like this female male
98 Dummy variable facts The data looks like this female male female + male
99 Dummy variable facts The data looks like this female male female + male
100 Dummy variable facts The data looks like this female male female + male constant
101 Dummy variable facts The data looks like this female male female + male constant
102 Dummy variable facts The data looks like this female male female + male constant We can t include data on income in thousands of dollars (income10k) and income in dollars (income) in the same regression because income10k = income 10, 000 No linear combinations in the data!
103 Dummy variable facts The data looks like this female male female + male constant We can t include data on income in thousands of dollars (income10k) and income in dollars (income) in the same regression because income10k = income 10, 000 No linear combinations in the data! male + female = constant
104 Dummy variable facts The data looks like this female male female + male constant We can t include data on income in thousands of dollars (income10k) and income in dollars (income) in the same regression because income10k = income 10, 000 No linear combinations in the data! male + female = constant = bad
105 Dummy variables So back to the original problem wage = β 1 female + β 2 male + β 3 college + β 4 nocollege + u Why can t this work? Let s look at the data to see if we can detect the same type of problem
106 Dummy variables So back to the original problem wage = β 1 female + β 2 male + β 3 college + β 4 nocollege + u Why can t this work? Let s look at the data to see if we can detect the same type of problem female male nocollege college Does anybody see it? Exercise: find why we can t do this (find the linear combination in the data)
107 Dummy variables When we look at the data female male nocollege college we can see that female + male = nocollege + college This violates the rule that we cannot have linear combinations in the data Intuitively, you don t add a column of data (a variable) when that column doesn t add any new information If we already have female, male, and nocollege, then we know everything about each observation without ever looking at the college variable
108 Dummy variables We can t estimate wage = β 1 female + β 2 male + β 3 college + β 4 nocollege + u because female + male = college + nocollege. Can we run this model? wage = β 0 + β 1 female + β 2 male + β 3 college + u
109 Dummy variables We can t estimate wage = β 1 female + β 2 male + β 3 college + β 4 nocollege + u because female + male = college + nocollege. Can we run this model? wage = β 0 + β 1 female + β 2 male + β 3 college + u Nope.
110 Dummy variables We can t estimate wage = β 1 female + β 2 male + β 3 college + β 4 nocollege + u because female + male = college + nocollege. Can we run this model? wage = β 0 + β 1 female + β 2 male + β 3 college + u Nope. female + male = constant (this hasn t changed)
111 Dummy variables We can t estimate wage = β 1 female + β 2 male + β 3 college + β 4 nocollege + u because female + male = college + nocollege. Can we run this model? wage = β 0 + β 1 female + β 2 male + β 3 college + u Nope. female + male = constant (this hasn t changed) Takeaway: No linear combinations
112 Dummy variables If we want to run a model that differentiates women and men, and college graduates and non-college-graduates, what can we do?
113 Dummy variables If we want to run a model that differentiates women and men, and college graduates and non-college-graduates, what can we do? Two ways to go:
114 Dummy variables If we want to run a model that differentiates women and men, and college graduates and non-college-graduates, what can we do? Two ways to go: 1 Can include indicators of all but one type of individual in each category and include a constant (the usual thing that multiplies β 0 ) wage = β 0 + β 1 male + β 2 college + u
115 Dummy variables If we want to run a model that differentiates women and men, and college graduates and non-college-graduates, what can we do? Two ways to go: 1 Can include indicators of all but one type of individual in each category and include a constant (the usual thing that multiplies β 0 ) wage = β 0 + β 1 male + β 2 college + u 2 Can include indicators of all but one type of individual in each category, plus for one category we can include all types of individuals (the gender category in the example below) wage = α 1 female + α 2 male + α 3 college + u
116 Dummy variables First way wage = β 0 + β 1 male + β 2 college + u constant male college
117 Dummy variables First way wage = β 0 + β 1 male + β 2 college + u constant male college sum
118 Dummy variables First way wage = β 0 + β 1 male + β 2 college + u constant male college sum
119 Dummy variables First way wage = β 0 + β 1 male + β 2 college + u constant male college sum
120 Dummy variables First way wage = β 0 + β 1 male + β 2 college + u constant male college sum
121 Dummy variables First way wage = β 0 + β 1 male + β 2 college + u constant male college sum
122 Dummy variables First way Test: passed wage = β 0 + β 1 male + β 2 college + u constant male college sum
123 Dummy variables Second way wage = α 1 female + α 2 male + α 3 college + u female male college
124 Dummy variables Second way wage = α 1 female + α 2 male + α 3 college + u female male college sum
125 Dummy variables Second way wage = α 1 female + α 2 male + α 3 college + u female male college sum
126 Dummy variables Second way wage = α 1 female + α 2 male + α 3 college + u female male college sum
127 Dummy variables Second way wage = α 1 female + α 2 male + α 3 college + u female male college sum
128 Dummy variables Second way wage = α 1 female + α 2 male + α 3 college + u female male college sum
129 Dummy variables Second way wage = α 1 female + α 2 male + α 3 college + u female male college sum Since the data does not sum to a constant, we can estimate the model
130 Dummy variables Takeaway: we can build a model out of dummy variables We can build the model any way we want, as long as the dummy variables don t induce a linear combination in the data
131 Dummy variables OK. We can estimate the model a couple of ways. Interpretation of the variables is slightly different depending on which formulation you use Choose the way that meets your needs They are totally equivalent: choose which formulation gives you the interpretation that you like best Interpretation can be confusing it takes practice
132 Interpretation Interpret α 1 and β 1 wage = β 0 + β 1 male + β 2 college + u wage = α 1 male + α 2 female + α 3 college + v
133 Interpretation Interpret α 1 and β 1 wage = β 0 + β 1 male + β 2 college + u wage = α 1 male + α 2 female + α 3 college + v β 1 is the effect of being male on wage, regardless of education
134 Interpretation Interpret α 1 and β 1 wage = β 0 + β 1 male + β 2 college + u wage = α 1 male + α 2 female + α 3 college + v β 1 is the effect of being male on wage, regardless of education in a manner of speaking, so is α 1
135 Interpretation Interpret α 1 and β 1 wage = β 0 + β 1 male + β 2 college + u wage = α 1 male + α 2 female + α 3 college + v β 1 is the effect of being male on wage, regardless of education in a manner of speaking, so is α 1 So what s the difference?
136 Interpretation Interpret α 1 and β 1 wage = β 0 + β 1 male + β 2 college + u wage = α 1 male + α 2 female + α 3 college + v β 1 is the effect of being male on wage, regardless of education in a manner of speaking, so is α 1 So what s the difference? β 1 is the difference between the expected wage of males and females with no college degree
137 Interpretation Interpret α 1 and β 1 wage = β 0 + β 1 male + β 2 college + u wage = α 1 male + α 2 female + α 3 college + v β 1 is the effect of being male on wage, regardless of education in a manner of speaking, so is α 1 So what s the difference? β 1 is the difference between the expected wage of males and females with no college degree : (β 0 + β 1 ) β 0
138 Interpretation Interpret α 1 and β 1 wage = β 0 + β 1 male + β 2 college + u wage = α 1 male + α 2 female + α 3 college + v β 1 is the effect of being male on wage, regardless of education in a manner of speaking, so is α 1 So what s the difference? β 1 is the difference between the expected wage of males and females with no college degree : (β 0 + β 1 ) β 0 (it s also the difference between the expected wage of males and females with a college degree)
139 Interpretation Interpret α 1 and β 1 wage = β 0 + β 1 male + β 2 college + u wage = α 1 male + α 2 female + α 3 college + v β 1 is the effect of being male on wage, regardless of education in a manner of speaking, so is α 1 So what s the difference? β 1 is the difference between the expected wage of males and females with no college degree : (β 0 + β 1 ) β 0 (it s also the difference between the expected wage of males and females with a college degree) α 1 by itself is the expected wage of a male with no college degree (so α 1 = β 0 + β 1 )
140 Interpretation Compare the two models: wage = β 0 + β 1 male + β 2 college + u (1) wage = α 1 male + α 2 female + α 3 college + v (2) The difference bet. a non-graduate male and a non-graduate female in (1): (β 0 + β 1 ) β 0
141 Interpretation Compare the two models: wage = β 0 + β 1 male + β 2 college + u (1) wage = α 1 male + α 2 female + α 3 college + v (2) The difference bet. a non-graduate male and a non-graduate female in (1): (β 0 + β 1 ) β 0 = β 1
142 Interpretation Compare the two models: wage = β 0 + β 1 male + β 2 college + u (1) wage = α 1 male + α 2 female + α 3 college + v (2) The difference bet. a non-graduate male and a non-graduate female in (1): (β 0 + β 1 ) β 0 = β 1 The difference bet. a graduate male and a graduate female in (1): (β 0 + β 1 + β 2 ) (β 0 + β 2 )
143 Interpretation Compare the two models: wage = β 0 + β 1 male + β 2 college + u (1) wage = α 1 male + α 2 female + α 3 college + v (2) The difference bet. a non-graduate male and a non-graduate female in (1): (β 0 + β 1 ) β 0 = β 1 The difference bet. a graduate male and a graduate female in (1): (β 0 + β 1 + β 2 ) (β 0 + β 2 ) = β 1
144 Interpretation Compare the two models: wage = β 0 + β 1 male + β 2 college + u (1) wage = α 1 male + α 2 female + α 3 college + v (2) The difference bet. a non-graduate male and a non-graduate female in (1): (β 0 + β 1 ) β 0 = β 1 The difference bet. a graduate male and a graduate female in (1): (β 0 + β 1 + β 2 ) (β 0 + β 2 ) = β 1 The difference bet. a non-graduate male and a non-graduate female in (2): α 1 α 2
145 Interpretation Compare the two models: wage = β 0 + β 1 male + β 2 college + u (1) wage = α 1 male + α 2 female + α 3 college + v (2) The difference bet. a non-graduate male and a non-graduate female in (1): (β 0 + β 1 ) β 0 = β 1 The difference bet. a graduate male and a graduate female in (1): (β 0 + β 1 + β 2 ) (β 0 + β 2 ) = β 1 The difference bet. a non-graduate male and a non-graduate female in (2): α 1 α 2 The difference bet. a graduate male and a graduate female in (2): (α 1 + α 3 ) (α 2 + α 3 )
146 Interpretation Compare the two models: wage = β 0 + β 1 male + β 2 college + u (1) wage = α 1 male + α 2 female + α 3 college + v (2) The difference bet. a non-graduate male and a non-graduate female in (1): (β 0 + β 1 ) β 0 = β 1 The difference bet. a graduate male and a graduate female in (1): (β 0 + β 1 + β 2 ) (β 0 + β 2 ) = β 1 The difference bet. a non-graduate male and a non-graduate female in (2): α 1 α 2 The difference bet. a graduate male and a graduate female in (2): (α 1 + α 3 ) (α 2 + α 3 ) = α 1 α 2
147 Interpretation Takeaway: when we do not include an intercept in the model (what you are used to thinking of as β 0 ) the interpretation of the coefficients changes When working with a standard model (i.e. a model with the regular old β 0 constant), the omitted category becomes a benchmark (baseline) to which others are compared Omitting the constant enables us to use a single constant to estimate the expected wage (or whatever y happens to be) of a particular group of individuals. Sometimes this is desirable.
148 Dummy variables Consider a new model
149 Dummy variables Consider a new model We are still looking at wages, but now we have a sample of male and female professionals. Specifically doctors, lawyers, and professors
150 Dummy variables Consider a new model We are still looking at wages, but now we have a sample of male and female professionals. Specifically doctors, lawyers, and professors We want to build a wage model using one of the formulations we have covered
151 Professional wages The two formulations differ only by whether or not they include a constant
152 Professional wages The two formulations differ only by whether or not they include a constant wage = β 0 + β 1 male + β 2 doctor + β 3 lawyer + u wage = α 1 male + α 2 female + α 3 doctor + α 4 lawyer + v
153 Professional wages The two formulations differ only by whether or not they include a constant wage = β 0 + β 1 male + β 2 doctor + β 3 lawyer + u wage = α 1 male + α 2 female + α 3 doctor + α 4 lawyer + v If we want to know whether or not being a doctor influences wages we can conduct a t-test of the null hypothesis that β 2 = 0 or α 3 = 0 H 0 : β 2 = 0 H A : β 2 0
154 Professional wages If we want to know whether or not profession matters (in general) then we could conduct an f-test
155 Professional wages If we want to know whether or not profession matters (in general) then we could conduct an f-test H 0 : β 2 = 0, β 3 = 0 H A : at least one of the above is not true
156 Professional wages If either β 2 0 or β 3 0 then profession matters Note that in both formulations of the model, the omitted category is professors. So when we test hypotheses about β 2 or α 3, we are really testing hypotheses about how different doctors and professors are.
157 Professional wages If either β 2 0 or β 3 0 then profession matters Note that in both formulations of the model, the omitted category is professors. So when we test hypotheses about β 2 or α 3, we are really testing hypotheses about how different doctors and professors are. Takeaway 1: We can use categorical variables (dummy variables) to test individual or compound hypotheses Takeaway 2: The omitted category matters. It influences the coefficients themselves, and we must take this into account when interpreting the coefficients. Takeaway 3: We can use categorical variables and continuous variables in the same regression (even though we didn t look at that specifically in the notes above)
158 Dummy variables Interactions Now THIS is different from the formulations we have already looked at
159 Dummy variables Interactions Now THIS is different from the formulations we have already looked at This is where we start using dummy variables to manipulate functional form
160 Dummy variables Interactions Now THIS is different from the formulations we have already looked at This is where we start using dummy variables to manipulate functional form So far we have looked at categorical variables separately This makes sense in some circumstances but not in others
161 Example Suppose the data look like this
162 Example Our current model works OK the effect of profession appears to be the same for men and women
163 Example Even easier to see this way the gap between men and women is constant over professions
164 Example What if the data looks like this instead?
165 Example Let s put some lines in to make it easier to see
166 Example If the effect of being a male is different depending on an individual s profession, then our current model is inappropriate. Why? wage = β 0 + β 1 male + β 2 doctor + β 3 lawyer + u
167 Example If the effect of being a male is different depending on an individual s profession, then our current model is inappropriate. wage = β 0 + β 1 male + β 2 doctor + β 3 lawyer + u Why? Our current model includes the effect of being male and the effect of being a lawyer But the effect of being a lawyer is the effect of being a lawyer, holding all else constant and the same is true for the effect of gender β 3 is the effect of being a lawyer, holding gender constant What s the problem with this?
168 Example If the effect of being a male is different depending on an individual s profession, then our current model is inappropriate. wage = β 0 + β 1 male + β 2 doctor + β 3 lawyer + u Why? Our current model includes the effect of being male and the effect of being a lawyer But the effect of being a lawyer is the effect of being a lawyer, holding all else constant and the same is true for the effect of gender β 3 is the effect of being a lawyer, holding gender constant What s the problem with this? The effect of being a lawyer IS NOT constant over gender (and the effect of being male is not constant over profession)
169 Interaction terms What s the solution? Interaction terms
170 Interaction terms What s the solution? Interaction terms Whenever an effect differs over a category, interact the categories wage = β 0 +β 1 male+β 2 doctor+β 3 lawyer+β 4 male lawyer+u
171 Interaction terms We can interact any number of categorical variables we want We want to interact any time we think there is reason to believe that the interaction is meaningful meaningful could be motivated by theory or by differences in the data
172 Interaction terms We can interact any number of categorical variables we want We want to interact any time we think there is reason to believe that the interaction is meaningful meaningful could be motivated by theory or by differences in the data A fully interacted model would look like this wage = β 0 +β 1 D M +β 2 D D +β 3 D L +β 4 D M D D +β 5 D M D L +u Or like this (where subscripts indicate male doctor, etc.) wage = α 0 +α 1 D MD +α 2 D FD +α 3 D ML +α 4 D FL +α 5 D MP +v
173 Brain exercise Are the intercepts the same in the following two models? w = β 0 + β 1 D M + β 2 D D + β 3 D L + β 4 D M D D + β 5 D M D L + u w = α 0 + α 1 D MD + α 2 D FD + α 3 D ML + α 4 D FL + α 5 D MP + v Hint: make a table
174 Brain exercise Are the intercepts the same in the following two models? w = β 0 + β 1 D M + β 2 D D + β 3 D L + β 4 D M D D + β 5 D M D L + u w = α 0 + α 1 D MD + α 2 D FD + α 3 D ML + α 4 D FL + α 5 D MP + v Hint: make a table female Doc Law Prof male
175 Answer Here is mine for the first model (the β model): Doc Law Prof female (β 0 + β 2 ) (β 0 + β 3 ) (β 0 ) male (β 0 + β 1 + β 2 + β 4 ) (β 0 + β 1 + β 3 + β 5 ) (β 0 + β 1 )
176 Answer Here is mine for the first model (the β model): Doc Law Prof female (β 0 + β 2 ) (β 0 + β 3 ) (β 0 ) male (β 0 + β 1 + β 2 + β 4 ) (β 0 + β 1 + β 3 + β 5 ) (β 0 + β 1 ) and the α model: Doc Law Prof female (α 0 + α 2 ) (α 0 + β 4 ) (α 0 ) male (α 0 + α 1 ) (α 0 + α 3 ) (α 0 + α 5 )
177 Which formulation to use? Sometimes we want to control for categories by introducing category-specific intercepts Sometimes we want to control for categories by introducing interaction terms Sometimes we actually want to interpret the coefficient values on the categorical variables Choose whichever formulation fits your needs Choose categories to omit for the same reason
178 Real example Card, David, and Aalan B. Krueger (1994). Minimum Wages and Employment: A Case Study of the Fast-Food Industry in New Jersey and Pennsylvania, The American Economic Review, 84(4): Economic theory: if the minimum wage goes up unemployment will go up (Some of the people who are making less than the new min wage will lose their jobs)
179 Real example Card, David, and Aalan B. Krueger (1994). Minimum Wages and Employment: A Case Study of the Fast-Food Industry in New Jersey and Pennsylvania, The American Economic Review, 84(4): Economic theory: if the minimum wage goes up unemployment will go up (Some of the people who are making less than the new min wage will lose their jobs) What happened:
180 Real example Card, David, and Aalan B. Krueger (1994). Minimum Wages and Employment: A Case Study of the Fast-Food Industry in New Jersey and Pennsylvania, The American Economic Review, 84(4): Economic theory: if the minimum wage goes up unemployment will go up (Some of the people who are making less than the new min wage will lose their jobs) What happened: April 1992 New Jersey raised its min wage from $4.25 to $5.05
181 Real example Card, David, and Aalan B. Krueger (1994). Minimum Wages and Employment: A Case Study of the Fast-Food Industry in New Jersey and Pennsylvania, The American Economic Review, 84(4): Economic theory: if the minimum wage goes up unemployment will go up (Some of the people who are making less than the new min wage will lose their jobs) What happened: April 1992 New Jersey raised its min wage from $4.25 to $5.05 Card and Krueger decided to take advantage of this to find out whether employment decreased when the minimum wage increased How to do this?
182 Real example What did they do?
183 Real example What did they do? CK knew the minimum wage was coming, so they sent out a survey to each major fast-food restaurant they could find in the phone book before the wage hike: KFC, Roy Rogers, Wendy s, Burger King (410 total)
184 Real example What did they do? CK knew the minimum wage was coming, so they sent out a survey to each major fast-food restaurant they could find in the phone book before the wage hike: KFC, Roy Rogers, Wendy s, Burger King (410 total) CK then waited until after the wage hike and sent out another survey to the same restaurants
185 Real example Problem:
186 Real example Problem: What if other things are changing around the same time? Maybe other things are causing employment, wages, prices, etc. to change over the period straddling April 1992 Solution?
187 Real example Problem: What if other things are changing around the same time? Maybe other things are causing employment, wages, prices, etc. to change over the period straddling April 1992 Solution? A second dimension of comparison:
188 Real example Problem: What if other things are changing around the same time? Maybe other things are causing employment, wages, prices, etc. to change over the period straddling April 1992 Solution? A second dimension of comparison: Spatial
189 Real example
190 Real example Dimension 1: before and after treatment Dimension 2: in and out of treatment group
191 Real example Dimension 1: before and after treatment Dimension 2: in and out of treatment group Translation of dimension 1: before and after April 1992 Translation of dimension 2: in New Jersey vs. in Pennsylvania
192 Real example Dimension 1: before and after treatment Dimension 2: in and out of treatment group Translation of dimension 1: before and after April 1992 Translation of dimension 2: in New Jersey vs. in Pennsylvania What does the regression look like that does this comparison for us? avgemployment = β 0 + β 1 NJ + β 2 D + β 3 D NJ + u
193 Real example Dimension 1: before and after treatment Dimension 2: in and out of treatment group Translation of dimension 1: before and after April 1992 Translation of dimension 2: in New Jersey vs. in Pennsylvania What does the regression look like that does this comparison for us? avgemployment = β 0 + β 1 NJ + β 2 D + β 3 D NJ + u What is the effect of the minimum wage increase?
194 Real example Dimension 1: before and after treatment Dimension 2: in and out of treatment group Translation of dimension 1: before and after April 1992 Translation of dimension 2: in New Jersey vs. in Pennsylvania What does the regression look like that does this comparison for us? avgemployment = β 0 + β 1 NJ + β 2 D + β 3 D NJ + u What is the effect of the minimum wage increase? β 3 = 2.75 average employment of full-time equivalent employees went up as a result
195 Review Dummy variables: indicator variables (syn) Use to represent categorical information Can use to discretize continuous variables (to better represent actual relationships Can interact Interact multiple dummy variables to create cross-category indicators Interact dummy variables with continuous variables to create category-specific slopes Use t-test to test for effects of categories Use F-test to test for effects of a whole categorization
Ch 7: Dummy (binary, indicator) variables
Ch 7: Dummy (binary, indicator) variables :Examples Dummy variable are used to indicate the presence or absence of a characteristic. For example, define female i 1 if obs i is female 0 otherwise or male
More informationMultiple Regression Analysis
Multiple Regression Analysis y = 0 + 1 x 1 + x +... k x k + u 6. Heteroskedasticity What is Heteroskedasticity?! Recall the assumption of homoskedasticity implied that conditional on the explanatory variables,
More informationLectures 5 & 6: Hypothesis Testing
Lectures 5 & 6: Hypothesis Testing in which you learn to apply the concept of statistical significance to OLS estimates, learn the concept of t values, how to use them in regression work and come across
More informationModule 03 Lecture 14 Inferential Statistics ANOVA and TOI
Introduction of Data Analytics Prof. Nandan Sudarsanam and Prof. B Ravindran Department of Management Studies and Department of Computer Science and Engineering Indian Institute of Technology, Madras Module
More informationInference in Regression Model
Inference in Regression Model Christopher Taber Department of Economics University of Wisconsin-Madison March 25, 2009 Outline 1 Final Step of Classical Linear Regression Model 2 Confidence Intervals 3
More informationMaking sense of Econometrics: Basics
Making sense of Econometrics: Basics Lecture 4: Qualitative influences and Heteroskedasticity Egypt Scholars Economic Society November 1, 2014 Assignment & feedback enter classroom at http://b.socrative.com/login/student/
More informationStatistical Inference with Regression Analysis
Introductory Applied Econometrics EEP/IAS 118 Spring 2015 Steven Buck Lecture #13 Statistical Inference with Regression Analysis Next we turn to calculating confidence intervals and hypothesis testing
More informationMath101, Sections 2 and 3, Spring 2008 Review Sheet for Exam #2:
Math101, Sections 2 and 3, Spring 2008 Review Sheet for Exam #2: 03 17 08 3 All about lines 3.1 The Rectangular Coordinate System Know how to plot points in the rectangular coordinate system. Know the
More informationRegression with a Single Regressor: Hypothesis Tests and Confidence Intervals
Regression with a Single Regressor: Hypothesis Tests and Confidence Intervals (SW Chapter 5) Outline. The standard error of ˆ. Hypothesis tests concerning β 3. Confidence intervals for β 4. Regression
More informationSimple Regression Model. January 24, 2011
Simple Regression Model January 24, 2011 Outline Descriptive Analysis Causal Estimation Forecasting Regression Model We are actually going to derive the linear regression model in 3 very different ways
More information5. Let W follow a normal distribution with mean of μ and the variance of 1. Then, the pdf of W is
Practice Final Exam Last Name:, First Name:. Please write LEGIBLY. Answer all questions on this exam in the space provided (you may use the back of any page if you need more space). Show all work but do
More informationMultiple Regression Analysis
Multiple Regression Analysis y = β 0 + β 1 x 1 + β 2 x 2 +... β k x k + u 2. Inference 0 Assumptions of the Classical Linear Model (CLM)! So far, we know: 1. The mean and variance of the OLS estimators
More informationAlgebra & Trig Review
Algebra & Trig Review 1 Algebra & Trig Review This review was originally written for my Calculus I class, but it should be accessible to anyone needing a review in some basic algebra and trig topics. The
More informationPOLI 8501 Introduction to Maximum Likelihood Estimation
POLI 8501 Introduction to Maximum Likelihood Estimation Maximum Likelihood Intuition Consider a model that looks like this: Y i N(µ, σ 2 ) So: E(Y ) = µ V ar(y ) = σ 2 Suppose you have some data on Y,
More informationBasic econometrics. Tutorial 3. Dipl.Kfm. Johannes Metzler
Basic econometrics Tutorial 3 Dipl.Kfm. Introduction Some of you were asking about material to revise/prepare econometrics fundamentals. First of all, be aware that I will not be too technical, only as
More informationLinear Regression with 1 Regressor. Introduction to Econometrics Spring 2012 Ken Simons
Linear Regression with 1 Regressor Introduction to Econometrics Spring 2012 Ken Simons Linear Regression with 1 Regressor 1. The regression equation 2. Estimating the equation 3. Assumptions required for
More information#29: Logarithm review May 16, 2009
#29: Logarithm review May 16, 2009 This week we re going to spend some time reviewing. I say re- view since you ve probably seen them before in theory, but if my experience is any guide, it s quite likely
More informationNotes 11: OLS Theorems ECO 231W - Undergraduate Econometrics
Notes 11: OLS Theorems ECO 231W - Undergraduate Econometrics Prof. Carolina Caetano For a while we talked about the regression method. Then we talked about the linear model. There were many details, but
More informationChapter 26: Comparing Counts (Chi Square)
Chapter 6: Comparing Counts (Chi Square) We ve seen that you can turn a qualitative variable into a quantitative one (by counting the number of successes and failures), but that s a compromise it forces
More informationwhere Female = 0 for males, = 1 for females Age is measured in years (22, 23, ) GPA is measured in units on a four-point scale (0, 1.22, 3.45, etc.
Notes on regression analysis 1. Basics in regression analysis key concepts (actual implementation is more complicated) A. Collect data B. Plot data on graph, draw a line through the middle of the scatter
More informationBusiness Statistics. Lecture 9: Simple Regression
Business Statistics Lecture 9: Simple Regression 1 On to Model Building! Up to now, class was about descriptive and inferential statistics Numerical and graphical summaries of data Confidence intervals
More informationIntroduction to Algebra: The First Week
Introduction to Algebra: The First Week Background: According to the thermostat on the wall, the temperature in the classroom right now is 72 degrees Fahrenheit. I want to write to my friend in Europe,
More informationAnnouncements. J. Parman (UC-Davis) Analysis of Economic Data, Winter 2011 February 8, / 45
Announcements Solutions to Problem Set 3 are posted Problem Set 4 is posted, It will be graded and is due a week from Friday You already know everything you need to work on Problem Set 4 Professor Miller
More informationECNS 561 Multiple Regression Analysis
ECNS 561 Multiple Regression Analysis Model with Two Independent Variables Consider the following model Crime i = β 0 + β 1 Educ i + β 2 [what else would we like to control for?] + ε i Here, we are taking
More informationLecture 4: Constructing the Integers, Rationals and Reals
Math/CS 20: Intro. to Math Professor: Padraic Bartlett Lecture 4: Constructing the Integers, Rationals and Reals Week 5 UCSB 204 The Integers Normally, using the natural numbers, you can easily define
More informationEconomics 113. Simple Regression Assumptions. Simple Regression Derivation. Changing Units of Measurement. Nonlinear effects
Economics 113 Simple Regression Models Simple Regression Assumptions Simple Regression Derivation Changing Units of Measurement Nonlinear effects OLS and unbiased estimates Variance of the OLS estimates
More informationDescriptive Statistics (And a little bit on rounding and significant digits)
Descriptive Statistics (And a little bit on rounding and significant digits) Now that we know what our data look like, we d like to be able to describe it numerically. In other words, how can we represent
More informationBinary Logistic Regression
The coefficients of the multiple regression model are estimated using sample data with k independent variables Estimated (or predicted) value of Y Estimated intercept Estimated slope coefficients Ŷ = b
More information2) For a normal distribution, the skewness and kurtosis measures are as follows: A) 1.96 and 4 B) 1 and 2 C) 0 and 3 D) 0 and 0
Introduction to Econometrics Midterm April 26, 2011 Name Student ID MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. (5,000 credit for each correct
More informationEC212: Introduction to Econometrics Review Materials (Wooldridge, Appendix)
1 EC212: Introduction to Econometrics Review Materials (Wooldridge, Appendix) Taisuke Otsu London School of Economics Summer 2018 A.1. Summation operator (Wooldridge, App. A.1) 2 3 Summation operator For
More informationMath 5a Reading Assignments for Sections
Math 5a Reading Assignments for Sections 4.1 4.5 Due Dates for Reading Assignments Note: There will be a very short online reading quiz (WebWork) on each reading assignment due one hour before class on
More informationBiostatistics and Design of Experiments Prof. Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras
Biostatistics and Design of Experiments Prof. Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras Lecture - 39 Regression Analysis Hello and welcome to the course on Biostatistics
More informationOrdinary Differential Equations Prof. A. K. Nandakumaran Department of Mathematics Indian Institute of Science Bangalore
Ordinary Differential Equations Prof. A. K. Nandakumaran Department of Mathematics Indian Institute of Science Bangalore Module - 3 Lecture - 10 First Order Linear Equations (Refer Slide Time: 00:33) Welcome
More informationCHAPTER 7: TECHNIQUES OF INTEGRATION
CHAPTER 7: TECHNIQUES OF INTEGRATION DAVID GLICKENSTEIN. Introduction This semester we will be looking deep into the recesses of calculus. Some of the main topics will be: Integration: we will learn how
More informationMITOCW watch?v=pqkyqu11eta
MITOCW watch?v=pqkyqu11eta The following content is provided under a Creative Commons license. Your support will help MIT OpenCourseWare continue to offer high quality educational resources for free. To
More informationEconometrics Problem Set 3
Econometrics Problem Set 3 Conceptual Questions 1. This question refers to the estimated regressions in table 1 computed using data for 1988 from the U.S. Current Population Survey. The data set consists
More informationFNCE 926 Empirical Methods in CF
FNCE 926 Empirical Methods in CF Lecture 2 Linear Regression II Professor Todd Gormley Today's Agenda n Quick review n Finish discussion of linear regression q Hypothesis testing n n Standard errors Robustness,
More informationQuadratic Equations Part I
Quadratic Equations Part I Before proceeding with this section we should note that the topic of solving quadratic equations will be covered in two sections. This is done for the benefit of those viewing
More informationRegression #8: Loose Ends
Regression #8: Loose Ends Econ 671 Purdue University Justin L. Tobias (Purdue) Regression #8 1 / 30 In this lecture we investigate a variety of topics that you are probably familiar with, but need to touch
More informationLECTURE 15: SIMPLE LINEAR REGRESSION I
David Youngberg BSAD 20 Montgomery College LECTURE 5: SIMPLE LINEAR REGRESSION I I. From Correlation to Regression a. Recall last class when we discussed two basic types of correlation (positive and negative).
More informationHypothesis Testing. We normally talk about two types of hypothesis: the null hypothesis and the research or alternative hypothesis.
Hypothesis Testing Today, we are going to begin talking about the idea of hypothesis testing how we can use statistics to show that our causal models are valid or invalid. We normally talk about two types
More informationInteractions among Continuous Predictors
Interactions among Continuous Predictors Today s Class: Simple main effects within two-way interactions Conquering TEST/ESTIMATE/LINCOM statements Regions of significance Three-way interactions (and beyond
More informationEconometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018
Econometrics I KS Module 2: Multivariate Linear Regression Alexander Ahammer Department of Economics Johannes Kepler University of Linz This version: April 16, 2018 Alexander Ahammer (JKU) Module 2: Multivariate
More informationSociology 593 Exam 2 Answer Key March 28, 2002
Sociology 59 Exam Answer Key March 8, 00 I. True-False. (0 points) Indicate whether the following statements are true or false. If false, briefly explain why.. A variable is called CATHOLIC. This probably
More informationSTA441: Spring Multiple Regression. This slide show is a free open source document. See the last slide for copyright information.
STA441: Spring 2018 Multiple Regression This slide show is a free open source document. See the last slide for copyright information. 1 Least Squares Plane 2 Statistical MODEL There are p-1 explanatory
More informationMid-term exam Practice problems
Mid-term exam Practice problems Most problems are short answer problems. You receive points for the answer and the explanation. Full points require both, unless otherwise specified. Explaining your answer
More informationCh. 3 Equations and Inequalities
Ch. 3 Equations and Inequalities 3.1 Solving Linear Equations Graphically There are 2 methods presented in this section for solving linear equations graphically. Normally I would not cover solving linear
More informationLecture 5: Omitted Variables, Dummy Variables and Multicollinearity
Lecture 5: Omitted Variables, Dummy Variables and Multicollinearity R.G. Pierse 1 Omitted Variables Suppose that the true model is Y i β 1 + β X i + β 3 X 3i + u i, i 1,, n (1.1) where β 3 0 but that the
More information[Disclaimer: This is not a complete list of everything you need to know, just some of the topics that gave people difficulty.]
Math 43 Review Notes [Disclaimer: This is not a complete list of everything you need to know, just some of the topics that gave people difficulty Dot Product If v (v, v, v 3 and w (w, w, w 3, then the
More informationWe're in interested in Pr{three sixes when throwing a single dice 8 times}. => Y has a binomial distribution, or in official notation, Y ~ BIN(n,p).
Sampling distributions and estimation. 1) A brief review of distributions: We're in interested in Pr{three sixes when throwing a single dice 8 times}. => Y has a binomial distribution, or in official notation,
More informationThe Simple Linear Regression Model
The Simple Linear Regression Model Lesson 3 Ryan Safner 1 1 Department of Economics Hood College ECON 480 - Econometrics Fall 2017 Ryan Safner (Hood College) ECON 480 - Lesson 3 Fall 2017 1 / 77 Bivariate
More information4. Nonlinear regression functions
4. Nonlinear regression functions Up to now: Population regression function was assumed to be linear The slope(s) of the population regression function is (are) constant The effect on Y of a unit-change
More informationChapter 1 Review of Equations and Inequalities
Chapter 1 Review of Equations and Inequalities Part I Review of Basic Equations Recall that an equation is an expression with an equal sign in the middle. Also recall that, if a question asks you to solve
More informationOrdinary Least Squares Regression Explained: Vartanian
Ordinary Least Squares Regression Explained: Vartanian When to Use Ordinary Least Squares Regression Analysis A. Variable types. When you have an interval/ratio scale dependent variable.. When your independent
More information( )( b + c) = ab + ac, but it can also be ( )( a) = ba + ca. Let s use the distributive property on a couple of
Factoring Review for Algebra II The saddest thing about not doing well in Algebra II is that almost any math teacher can tell you going into it what s going to trip you up. One of the first things they
More informationApplied Quantitative Methods II
Applied Quantitative Methods II Lecture 4: OLS and Statistics revision Klára Kaĺıšková Klára Kaĺıšková AQM II - Lecture 4 VŠE, SS 2016/17 1 / 68 Outline 1 Econometric analysis Properties of an estimator
More informationIntroduction. So, why did I even bother to write this?
Introduction This review was originally written for my Calculus I class, but it should be accessible to anyone needing a review in some basic algebra and trig topics. The review contains the occasional
More informationSolution to Proof Questions from September 1st
Solution to Proof Questions from September 1st Olena Bormashenko September 4, 2011 What is a proof? A proof is an airtight logical argument that proves a certain statement in general. In a sense, it s
More informationPOL 681 Lecture Notes: Statistical Interactions
POL 681 Lecture Notes: Statistical Interactions 1 Preliminaries To this point, the linear models we have considered have all been interpreted in terms of additive relationships. That is, the relationship
More information1 The Multiple Regression Model: Freeing Up the Classical Assumptions
1 The Multiple Regression Model: Freeing Up the Classical Assumptions Some or all of classical assumptions were crucial for many of the derivations of the previous chapters. Derivation of the OLS estimator
More informationLecture 4: Multivariate Regression, Part 2
Lecture 4: Multivariate Regression, Part 2 Gauss-Markov Assumptions 1) Linear in Parameters: Y X X X i 0 1 1 2 2 k k 2) Random Sampling: we have a random sample from the population that follows the above
More informationMITOCW MITRES18_006F10_26_0602_300k-mp4
MITOCW MITRES18_006F10_26_0602_300k-mp4 FEMALE VOICE: The following content is provided under a Creative Commons license. Your support will help MIT OpenCourseWare continue to offer high quality educational
More informationApplied Regression Analysis. Section 4: Diagnostics and Transformations
Applied Regression Analysis Section 4: Diagnostics and Transformations 1 Regression Model Assumptions Y i = β 0 + β 1 X i + ɛ Recall the key assumptions of our linear regression model: (i) The mean of
More informationSometimes the domains X and Z will be the same, so this might be written:
II. MULTIVARIATE CALCULUS The first lecture covered functions where a single input goes in, and a single output comes out. Most economic applications aren t so simple. In most cases, a number of variables
More informationFinding Limits Graphically and Numerically
Finding Limits Graphically and Numerically 1. Welcome to finding limits graphically and numerically. My name is Tuesday Johnson and I m a lecturer at the University of Texas El Paso. 2. With each lecture
More informationPattern Recognition Prof. P. S. Sastry Department of Electronics and Communication Engineering Indian Institute of Science, Bangalore
Pattern Recognition Prof. P. S. Sastry Department of Electronics and Communication Engineering Indian Institute of Science, Bangalore Lecture - 27 Multilayer Feedforward Neural networks with Sigmoidal
More informationGov 2000: 9. Regression with Two Independent Variables
Gov 2000: 9. Regression with Two Independent Variables Matthew Blackwell Harvard University mblackwell@gov.harvard.edu Where are we? Where are we going? Last week: we learned about how to calculate a simple
More informationHierarchical Generalized Linear Models. ERSH 8990 REMS Seminar on HLM Last Lecture!
Hierarchical Generalized Linear Models ERSH 8990 REMS Seminar on HLM Last Lecture! Hierarchical Generalized Linear Models Introduction to generalized models Models for binary outcomes Interpreting parameter
More informationLECTURE 10. Introduction to Econometrics. Multicollinearity & Heteroskedasticity
LECTURE 10 Introduction to Econometrics Multicollinearity & Heteroskedasticity November 22, 2016 1 / 23 ON PREVIOUS LECTURES We discussed the specification of a regression equation Specification consists
More informationPanel Data. March 2, () Applied Economoetrics: Topic 6 March 2, / 43
Panel Data March 2, 212 () Applied Economoetrics: Topic March 2, 212 1 / 43 Overview Many economic applications involve panel data. Panel data has both cross-sectional and time series aspects. Regression
More informationMATH 22 FUNCTIONS: ORDER OF GROWTH. Lecture O: 10/21/2003. The old order changeth, yielding place to new. Tennyson, Idylls of the King
MATH 22 Lecture O: 10/21/2003 FUNCTIONS: ORDER OF GROWTH The old order changeth, yielding place to new. Tennyson, Idylls of the King Men are but children of a larger growth. Dryden, All for Love, Act 4,
More informationEconometrics Multiple Regression Analysis with Qualitative Information: Binary (or Dummy) Variables
Econometrics Multiple Regression Analysis with Qualitative Information: Binary (or Dummy) Variables João Valle e Azevedo Faculdade de Economia Universidade Nova de Lisboa Spring Semester João Valle e Azevedo
More informationPROFESSOR: WELCOME BACK TO THE LAST LECTURE OF THE SEMESTER. PLANNING TO DO TODAY WAS FINISH THE BOOK. FINISH SECTION 6.5
1 MATH 16A LECTURE. DECEMBER 9, 2008. PROFESSOR: WELCOME BACK TO THE LAST LECTURE OF THE SEMESTER. I HOPE YOU ALL WILL MISS IT AS MUCH AS I DO. SO WHAT I WAS PLANNING TO DO TODAY WAS FINISH THE BOOK. FINISH
More informationInteractions and Factorial ANOVA
Interactions and Factorial ANOVA STA442/2101 F 2017 See last slide for copyright information 1 Interactions Interaction between explanatory variables means It depends. Relationship between one explanatory
More informationProblem Set #6: OLS. Economics 835: Econometrics. Fall 2012
Problem Set #6: OLS Economics 835: Econometrics Fall 202 A preliminary result Suppose we have a random sample of size n on the scalar random variables (x, y) with finite means, variances, and covariance.
More informationMITOCW watch?v=vjzv6wjttnc
MITOCW watch?v=vjzv6wjttnc PROFESSOR: We just saw some random variables come up in the bigger number game. And we're going to be talking now about random variables, just formally what they are and their
More informationLECTURE 2: SIMPLE REGRESSION I
LECTURE 2: SIMPLE REGRESSION I 2 Introducing Simple Regression Introducing Simple Regression 3 simple regression = regression with 2 variables y dependent variable explained variable response variable
More informationEconometrics Review questions for exam
Econometrics Review questions for exam Nathaniel Higgins nhiggins@jhu.edu, 1. Suppose you have a model: y = β 0 x 1 + u You propose the model above and then estimate the model using OLS to obtain: ŷ =
More informationEconometrics I Lecture 7: Dummy Variables
Econometrics I Lecture 7: Dummy Variables Mohammad Vesal Graduate School of Management and Economics Sharif University of Technology 44716 Fall 1397 1 / 27 Introduction Dummy variable: d i is a dummy variable
More informationHypothesis testing. Data to decisions
Hypothesis testing Data to decisions The idea Null hypothesis: H 0 : the DGP/population has property P Under the null, a sample statistic has a known distribution If, under that that distribution, the
More informationECON 497 Midterm Spring
ECON 497 Midterm Spring 2009 1 ECON 497: Economic Research and Forecasting Name: Spring 2009 Bellas Midterm You have three hours and twenty minutes to complete this exam. Answer all questions and explain
More informationInteractions and Factorial ANOVA
Interactions and Factorial ANOVA STA442/2101 F 2018 See last slide for copyright information 1 Interactions Interaction between explanatory variables means It depends. Relationship between one explanatory
More informationAn overview of applied econometrics
An overview of applied econometrics Jo Thori Lind September 4, 2011 1 Introduction This note is intended as a brief overview of what is necessary to read and understand journal articles with empirical
More informationP (E) = P (A 1 )P (A 2 )... P (A n ).
Lecture 9: Conditional probability II: breaking complex events into smaller events, methods to solve probability problems, Bayes rule, law of total probability, Bayes theorem Discrete Structures II (Summer
More informationThe regression model with one fixed regressor cont d
The regression model with one fixed regressor cont d 3150/4150 Lecture 4 Ragnar Nymoen 27 January 2012 The model with transformed variables Regression with transformed variables I References HGL Ch 2.8
More informationWISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, Academic Year Exam Version: A
WISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, 2016-17 Academic Year Exam Version: A INSTRUCTIONS TO STUDENTS 1 The time allowed for this examination paper is 2 hours. 2 This
More informationGraphical Diagnosis. Paul E. Johnson 1 2. (Mostly QQ and Leverage Plots) 1 / Department of Political Science
(Mostly QQ and Leverage Plots) 1 / 63 Graphical Diagnosis Paul E. Johnson 1 2 1 Department of Political Science 2 Center for Research Methods and Data Analysis, University of Kansas. (Mostly QQ and Leverage
More informationChapter 2. Mathematical Reasoning. 2.1 Mathematical Models
Contents Mathematical Reasoning 3.1 Mathematical Models........................... 3. Mathematical Proof............................ 4..1 Structure of Proofs........................ 4.. Direct Method..........................
More informationRegression, part II. I. What does it all mean? A) Notice that so far all we ve done is math.
Regression, part II I. What does it all mean? A) Notice that so far all we ve done is math. 1) One can calculate the Least Squares Regression Line for anything, regardless of any assumptions. 2) But, if
More informationIntermediate Econometrics
Intermediate Econometrics Heteroskedasticity Text: Wooldridge, 8 July 17, 2011 Heteroskedasticity Assumption of homoskedasticity, Var(u i x i1,..., x ik ) = E(u 2 i x i1,..., x ik ) = σ 2. That is, the
More informationAstronomy 102 Math Review
Astronomy 102 Math Review 2003-August-06 Prof. Robert Knop r.knop@vanderbilt.edu) For Astronomy 102, you will not need to do any math beyond the high-school alegbra that is part of the admissions requirements
More informationUniversidad Carlos III de Madrid Econometría Nonlinear Regression Functions Problem Set 8
Universidad Carlos III de Madrid Econometría Nonlinear Regression Functions Problem Set 8 1. The sales of a company amount to 196 millions of dollars in 2009 and increased up to 198 millions in 2010. (a)
More informationChapter 2: simple regression model
Chapter 2: simple regression model Goal: understand how to estimate and more importantly interpret the simple regression Reading: chapter 2 of the textbook Advice: this chapter is foundation of econometrics.
More informationINTRODUCTION TO ANALYSIS OF VARIANCE
CHAPTER 22 INTRODUCTION TO ANALYSIS OF VARIANCE Chapter 18 on inferences about population means illustrated two hypothesis testing situations: for one population mean and for the difference between two
More informationECON Interactions and Dummies
ECON 351 - Interactions and Dummies Maggie Jones 1 / 25 Readings Chapter 6: Section on Models with Interaction Terms Chapter 7: Full Chapter 2 / 25 Interaction Terms with Continuous Variables In some regressions
More informationCommunication Engineering Prof. Surendra Prasad Department of Electrical Engineering Indian Institute of Technology, Delhi
Communication Engineering Prof. Surendra Prasad Department of Electrical Engineering Indian Institute of Technology, Delhi Lecture - 41 Pulse Code Modulation (PCM) So, if you remember we have been talking
More informationIntroduction to Econometrics
Introduction to Econometrics Lecture 3 : Regression: CEF and Simple OLS Zhaopeng Qu Business School,Nanjing University Oct 9th, 2017 Zhaopeng Qu (Nanjing University) Introduction to Econometrics Oct 9th,
More informationEconometrics. Week 8. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague
Econometrics Week 8 Institute of Economic Studies Faculty of Social Sciences Charles University in Prague Fall 2012 1 / 25 Recommended Reading For the today Instrumental Variables Estimation and Two Stage
More informationManipulating Equations
Manipulating Equations Now that you know how to set up an equation, the next thing you need to do is solve for the value that the question asks for. Above all, the most important thing to remember when
More informationEconometric Modelling Prof. Rudra P. Pradhan Department of Management Indian Institute of Technology, Kharagpur
Econometric Modelling Prof. Rudra P. Pradhan Department of Management Indian Institute of Technology, Kharagpur Module No. # 01 Lecture No. # 28 LOGIT and PROBIT Model Good afternoon, this is doctor Pradhan
More information