Inference and Regression


Name

Inference and Regression
Final Examination, 2015
Department of IOMS

This course and this examination are governed by the Stern Honor Code.

Instructions

Please write your name at the top of this page. Please answer all questions on this question book. Do not turn in a blue book. Please do not separate the pages of this exam booklet. Where a computation is required to answer a question, please show your work. (I cannot give partial credit for an incorrect numerical answer unless the work provided shows a partially correct computation.)

Grading: There are 10 questions in this exam. There are 185 points in total. The point value of each part is shown in brackets at the start of that part. Total: 185

[40] Part I. Labor Market

The regressions below are based on data from the National Longitudinal Survey, which has been carried out on a yearly basis by the Bureau of Labor Statistics of the Department of Labor. The dependent variable in this regression model is the log of the monthly wage of the sample of individuals. The variables in the equations are

EXP = Labor market experience in years
EXPSQ = EXP^2
WKS = Number of weeks worked this year
SOUTH = 1 if the individual lives in the Southern part of the U.S.
FEM = 1 if the person is female, 0 if male

Three regressions are computed below: (1) using the full sample of 4165 observations, (2) using the 3392 observations for which MARRIED = 1, and (3) using the 773 observations for which the head of the household is not married.

1. Using an F test, test the hypothesis that the five slope coefficients (not the constant) in the first regression are equal to zero.
Answer: Use the model test statistic reported with the first regression, F[5, 4159], together with its p-value, Prob F > F*.

2. Using an F test, test the hypothesis that the same model applies to both married people and not married people.
Answer: This is a Chow test with 6 restrictions. With $e'e$ denoting a residual sum of squares,
$F = \dfrac{[e'e_{pooled} - (e'e_{married} + e'e_{not\,married})]/6}{(e'e_{married} + e'e_{not\,married})/(4165 - 12)}$.

3. Show in detail how the R-bar squared in the first regression is computed.
Answer: $\bar{R}^2 = 1 - \dfrac{n-1}{n-K}(1 - R^2)$. Plug in the values from the first regression: K = 6, n = 4165, and the reported R-squared.

4. The first regression uses the whole sample; this is called the pooled regression. Using the pooled regression results, test the hypothesis that the coefficient on WKS equals 0.0.
Answer: t = 2.65, which exceeds the 5% critical value of 1.96. The hypothesis is rejected.

5. Test the hypothesis that the coefficients on FEM in the MARRIED and Not Married equations are the same. (Hint: The two subsamples are independent.)
Answer: Because the subsamples are independent, the variance of the difference is the sum of the two squared standard errors:
$t = \dfrac{b_{FEM,married} - b_{FEM,not\,married}}{\sqrt{se_{married}^2 + se_{not\,married}^2}}$.
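The three formulas used in questions 2, 3 and 5 can be sketched in a few lines of Python. This is only an illustration: the function names are made up here, and every number passed in below is a hypothetical placeholder, not a value from the regression output (the reported sums of squares, R-squared and standard errors were not recovered in this transcription). Only n = 4165 and K = 6 come from the exam.

```python
import math

def adjusted_r_squared(r_squared, n, k):
    """R-bar squared = 1 - (n-1)/(n-K) * (1 - R^2)."""
    return 1.0 - (n - 1) / (n - k) * (1.0 - r_squared)

def chow_f(ssr_pooled, ssr_group1, ssr_group2, n, k):
    """Chow F statistic: k restrictions, n - 2k denominator degrees of freedom."""
    ssr_unrestricted = ssr_group1 + ssr_group2
    numerator = (ssr_pooled - ssr_unrestricted) / k
    denominator = ssr_unrestricted / (n - 2 * k)
    return numerator / denominator

def difference_t(b1, se1, b2, se2):
    """t ratio for b1 - b2 when the two estimates come from independent samples."""
    return (b1 - b2) / math.sqrt(se1 ** 2 + se2 ** 2)

# Hypothetical inputs, for illustration only.
example_rbar2 = adjusted_r_squared(0.40, n=4165, k=6)
example_chow = chow_f(600.0, 450.0, 120.0, n=4165, k=6)
example_t = difference_t(-0.20, 0.05, -0.40, 0.04)

print(example_rbar2, example_chow, example_t)
```

To use it on the exam, replace the placeholder arguments with the R-squared, residual sums of squares, FEM coefficients and standard errors printed in the three regressions.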

ALL (pooled sample)
Ordinary least squares regression
LHS = LWAGE    Mean =
No. of observations = 4165    DegFreedom    Mean square
Regression Sum of Squares =
Residual Sum of Squares =
Total Sum of Squares =
Standard error of e =    Root MSE
Fit R-squared =    R-bar squared
Model test F[5, 4159] =    Prob F > F*

LWAGE      Coefficient   Standard Error   z   Prob |z|>Z*   Confidence Interval
Constant   ***
EXP        .03952***
EXPSQ      ***  .5590D
WKS        .00333***
SOUTH      ***
FEM        ***
***, **, * ==> Significance at 1%, 5%, 10% level.

MARRIED
Ordinary least squares regression
LHS = LWAGE    Mean =    Standard deviation =
No. of observations = 3392    DegFreedom    Mean square
Regression Sum of Squares =
Residual Sum of Squares =
Total Sum of Squares =
Standard error of e =    Root MSE
Fit R-squared =    R-bar squared
Model test F[5, 3386] =    Prob F > F*

LWAGE      Coefficient   Standard Error   z   Prob |z|>Z*   Confidence Interval
Constant   ***
EXP        .03531***
EXPSQ      ***  .6361D
WKS        .00249*
SOUTH      ***
FEM        **

Not Married
Ordinary least squares regression
LHS = LWAGE    Mean =    Standard deviation =
No. of observations = 773    DegFreedom    Mean square
Regression Sum of Squares =
Residual Sum of Squares =
Total Sum of Squares =
Standard error of e =    Root MSE
Fit R-squared =    R-bar squared
Model test F[5, 767] =    Prob F > F*

LWAGE      Coefficient   Standard Error   z   Prob |z|>Z*   Confidence Interval
Constant   ***
EXP        .04981***
EXPSQ      ***
WKS        .00534**
SOUTH      ***
FEM        ***

[40] Part II. Moral Hazard

This part of our study deals with a phenomenon called moral hazard. The theory of moral hazard holds that people act differently when they have insurance. In the health care world, this means that if people have health insurance, they use the health care system more. The model below is called a Poisson regression. We studied this in class. The dependent variable in the model is DOCVIS = the number of visits to the doctor taken by the person in the survey year. (This variable ranges from 0 to about 15, with a handful of outliers that range from 15 to 80. These are individuals who are chronically sick, or perhaps require a weekly treatment.) The insurance variable is PUBLIC. For people who have the insurance, PUBLIC = 1; for those who do not have the insurance, PUBLIC = 0.

The Poisson regression model states that

$\text{Prob}(DocVis_i = y_i) = \dfrac{\exp(-\lambda_i)\,\lambda_i^{y_i}}{y_i!}, \quad y_i = 0, 1, \ldots, \quad i = 1, \ldots, N.$

In this model, $\lambda_i$ is the mean of the random variable ($\lambda_i$ is the regression function). To make the model into a regression, we form

Expected value = $E[y_i \mid x_i] = \lambda_i = \exp(\beta' x_i)$

Maximum likelihood estimates of three models based on the survey data are as follows:
Model 1 contains my full theory (Theory A) about doctor visits.
Model 2 contains only a constant term (Theory Z).
Model 3 contains only the constant term and AGE (Theory B).

Model 1
Poisson Regression
Dependent variable DOCVIS
Log likelihood function
Restricted log likelihood
Estimation based on N = 3377, K = 6

DOCVIS     Coefficient   Standard Error   z   Prob |z|>Z*   Confidence Interval
Constant   .49959***
AGE        .02059***
EDUC       ***
PUBLIC     .34568***
INCOME     ***
Interaction FEMALE*INCOME
Intrct     ***
***, **, * ==> Significance at 1%, 5%, 10% level.

Model 2
Poisson Regression
Dependent variable DOCVIS
Log likelihood function

DOCVIS     Coefficient   Standard Error   z   Prob |z|>Z*   Confidence Interval
Constant   ***

Model 3
Poisson Regression
Dependent variable DOCVIS
Log likelihood function

DOCVIS     Coefficient   Standard Error   z   Prob |z|>Z*   Confidence Interval
Constant   .32994***
AGE        .02266***

1. Do the regression results provide significant evidence of moral hazard? Explain.
Answer: Yes. The coefficient on PUBLIC is large, positive and significant.

2. Form the log likelihood function (logL) for estimation of the parameters β.
Answer: The log likelihood is the sum of the logs of the probabilities:
$\log L = \sum_i \{-\lambda_i + y_i \log \lambda_i - \log y_i!\}$, where $\lambda_i = \exp(\beta' x_i)$.

3. Obtain the first order (necessary) conditions for maximizing logL with respect to β.
Answer: Since $\partial \lambda_i / \partial \beta = \lambda_i x_i$,
$\dfrac{\partial \log L}{\partial \beta} = \sum_i \left(-1 + \dfrac{y_i}{\lambda_i}\right) \lambda_i x_i = \sum_i (y_i - \lambda_i)\, x_i = 0.$
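The log likelihood in question 2 and the first order conditions in question 3 can be checked numerically. The sketch below is an illustration under stated assumptions: the function names are invented here, and the four-observation data set is a toy example, not the survey data. With only a constant and one dummy variable, the Poisson maximum likelihood estimates have a closed form (the fitted mean in each group equals that group's sample mean), so the score can be evaluated at the known maximizer.

```python
import math

def poisson_loglik(beta, X, y):
    """Sum over i of -lambda_i + y_i*log(lambda_i) - log(y_i!), lambda_i = exp(beta'x_i)."""
    total = 0.0
    for x_i, y_i in zip(X, y):
        lam = math.exp(sum(b * x for b, x in zip(beta, x_i)))
        total += -lam + y_i * math.log(lam) - math.lgamma(y_i + 1)
    return total

def poisson_score(beta, X, y):
    """First derivatives of logL: sum over i of (y_i - lambda_i) * x_i."""
    grad = [0.0] * len(beta)
    for x_i, y_i in zip(X, y):
        lam = math.exp(sum(b * x for b, x in zip(beta, x_i)))
        for j, x_ij in enumerate(x_i):
            grad[j] += (y_i - lam) * x_ij
    return grad

# Toy data (hypothetical): a constant and one 0/1 dummy.
X = [[1, 0], [1, 1], [1, 0], [1, 1]]
y = [1, 3, 2, 4]

# Closed-form MLE for this design: group means are 1.5 (dummy=0) and 3.5 (dummy=1).
beta_hat = [math.log(1.5), math.log(3.5 / 1.5)]

score_at_mle = poisson_score(beta_hat, X, y)
loglik_at_mle = poisson_loglik(beta_hat, X, y)
loglik_elsewhere = poisson_loglik([0.0, 0.0], X, y)
print(score_at_mle, loglik_at_mle, loglik_elsewhere)
```

The score is zero (up to rounding) at the maximizer, and the log likelihood there is higher than at any other trial value of β.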

4. We can use the first two sets of results to test the hypothesis that the variables in the model are collectively significant. (This is like, but not the same as, the F test for the linear model.) Use the likelihood ratio to test the hypothesis that all 5 slope coefficients are zero. Now, use a likelihood ratio test to test Theory B (which removes all the variables except for AGE from the model) against Theory A.
Answer: For the first test, LR = 2(logL for Model 1 - logL for Model 2), which is chi squared with 5 degrees of freedom (5% critical value 11.07). For the second test, LR = 2(logL for Model 1 - logL for Model 3). Theory B drops four variables (EDUC, PUBLIC, INCOME and the interaction), so this statistic is chi squared with 4 degrees of freedom (5% critical value 9.49).

5. The way the model is constructed,
$\beta' x = \alpha + \beta_1 AGE + \beta_2 EDUC + \beta_3 PUBLIC + \beta_4 INCOME + \beta_5 FEMALE \times INCOME$.
Notice, then, that for women, that is, when FEMALE = 1,
$\beta' x = \alpha + \beta_1 AGE + \beta_2 EDUC + \beta_3 PUBLIC + (\beta_4 + \beta_5) INCOME$,
while for men, that is, when FEMALE = 0,
$\beta' x = \alpha + \beta_1 AGE + \beta_2 EDUC + \beta_3 PUBLIC + \beta_4 INCOME$.
We are interested in how the expected value differs for men and women. Consider someone who is 35 years old, has 12 years of education, has public insurance (PUBLIC = 1) and INCOME = 0.5. Compute the expected values for men and for women, and comment on the difference that you find. (This is called the partial effect of gender.)
Answer: Using the Model 1 estimates $a, b_1, \ldots, b_5$,
$E[DocVis \mid female] - E[DocVis \mid male] = \exp(a + b_1 \cdot 35 + b_2 \cdot 12 + b_3 \cdot 1 + b_4 \cdot 0.5 + b_5 \cdot 1 \cdot 0.5) - \exp(a + b_1 \cdot 35 + b_2 \cdot 12 + b_3 \cdot 1 + b_4 \cdot 0.5 + b_5 \cdot 0 \cdot 0.5)$.
Compute the two terms to get the partial effect.

6. How does the expected number of doctor visits respond to years of education?
(a) Obtain the (mathematical) derivative of $E[DocVis_i] = \lambda_i$ with respect to EDUC. Tip: $d(e^t)/dt = e^t$. Use the chain rule as well.
Answer: $\partial \lambda_i / \partial EDUC = \lambda_i \beta_2$.
(b) Compute the value for the person in question 5: AGE = 35, EDUC = 12 years of education, INCOME = 0.5 and public insurance (PUBLIC = 1).
Answer: Plug these values into the expression from part (a).
(c) Compute $\lambda_i$ using these values but now with EDUC = 13 instead of 12. What do you find?
Answer: Compute $\lambda_i$ with EDUC = 13, then with EDUC = 12, and take the difference. You should get essentially the same answer as in part (b).
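Questions 4 to 6 come down to a handful of exponentials, which the following sketch walks through. Everything numeric in it is an assumption made for illustration: the coefficient values in `coef` are hypothetical placeholders (several of the Model 1 estimates were not recovered in this transcription), the log likelihood values passed to `lr_statistic` are invented, and the person in question 6 is taken to be male because the question does not say. Only the person's characteristics (AGE = 35, EDUC = 12, PUBLIC = 1, INCOME = 0.5) come from the exam.

```python
import math

def lr_statistic(loglik_unrestricted, loglik_restricted):
    """Likelihood ratio statistic: twice the difference in log likelihoods."""
    return 2.0 * (loglik_unrestricted - loglik_restricted)

def expected_visits(coef, age, educ, public, income, female):
    """lambda = exp(beta'x) for the Model 1 specification."""
    index = (coef["const"] + coef["age"] * age + coef["educ"] * educ
             + coef["public"] * public + coef["income"] * income
             + coef["fem_income"] * female * income)
    return math.exp(index)

# Hypothetical placeholder coefficients, NOT the estimates from the exam output.
coef = {"const": 0.5, "age": 0.02, "educ": -0.03,
        "public": 0.35, "income": -0.4, "fem_income": 0.3}

# Question 4 (invented log likelihoods): compare with the chi squared critical value.
example_lr = lr_statistic(-100.0, -110.0)

# Question 5: partial effect of gender for the person described in the exam.
lam_female = expected_visits(coef, 35, 12, 1, 0.5, female=1)
lam_male = expected_visits(coef, 35, 12, 1, 0.5, female=0)
gender_effect = lam_female - lam_male

# Question 6: derivative lambda * beta_EDUC versus the one-year difference.
derivative = lam_male * coef["educ"]
finite_difference = expected_visits(coef, 35, 13, 1, 0.5, female=0) - lam_male

print(example_lr, gender_effect, derivative, finite_difference)
```

The derivative and the one-year difference agree closely because a one-unit change in EDUC is small relative to the curvature of the exponential.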

[15] Part III. Regression Basics

Forbes Magazine reported a survey in which citizens of 150+ countries reported how happy they were with their lives, using some kind of survey scale. In a linear regression of this variable (obtained from the Forbes website) on the (disability adjusted) life expectancy in the country, Minitab reported the following regression results. Answer T (true) or F (false) to each of the following. Explain your answer in one short sentence.

1. The reported statistics provide evidence of a significant regression (i.e., people who live longer are happier).
Answer: True. F is huge.

2. ( )% of the residuals are between -7 and +7.
Answer: False. 90% are within +/- one standard deviation, -14 to ( ).

3. Increasing life expectancy by one year causes a significant increase of about 1 happiness unit.
Answer: False. The regression does not establish causation.

4. The correlation between the variables HAPPY and DALE is ( ).
Answer: True. It is the square root of .455, and it is positive because the slope is positive.

5. The regression slope estimator would be regarded as statistically significant.
Answer: True. The t ratio is very large.

[15] Part IV. Analyzing Descriptive Statistics

It is often found that on average women tend to give lower answers to the health satisfaction question in a survey such as the one we are analyzing in this exam.

1. The histogram below shows the relative frequencies (proportions) of the answers for men and women. The results in the histogram do agree with the suggested comparison of men and women. Explain.
Answer: It looks like men have taller bars for the high values and lower bars for the low values.

2. The following statistics were gathered for the sample of men and women in a sample of 2039 observations. Test the hypothesis that the means for men and women are equal.

Descriptive Statistics for 1 variables
Variable   Mean   Std.Dev.   Minimum   Maximum   Cases
Subsample is FEMALE = 0 (Men)
HLTHSAT
Subsample is FEMALE = 1 (Women)
HLTHSAT

Answer: Use the standard test for the difference of two means from independent samples:
$z = \dfrac{\bar{x}_1 - \bar{x}_2}{\sqrt{s_1^2/n_1 + s_2^2/955}}$.
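A minimal sketch of the two-sample test in question 2 follows. The function name is invented here, and the means and standard deviations are hypothetical placeholders because the descriptive statistics were not recovered. The group size 955 appears in the exam's answer and 2039 is the stated sample size; treating 955 as the women's count and 2039 - 955 = 1084 as the men's count is an assumption.

```python
import math

def two_sample_z(mean1, sd1, n1, mean2, sd2, n2):
    """z statistic for equality of two means from independent samples."""
    standard_error = math.sqrt(sd1 ** 2 / n1 + sd2 ** 2 / n2)
    return (mean1 - mean2) / standard_error

# Hypothetical means and standard deviations; group sizes as assumed above.
z_health = two_sample_z(7.0, 2.2, 1084, 6.7, 2.3, 955)

# Reject equality of the means at the 5% level when |z| exceeds 1.96.
reject_equal_means = abs(z_health) > 1.96
print(z_health, reject_equal_means)
```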

[15] Part V. Multiple Regression

My model for the auction prices of Monet paintings was

ln Price = Constant + $\beta_1$ ln Surface Area + $\beta_2$ ln Aspect Ratio + $\beta_3$ ln Height + $\beta_4$ ln Width + $\beta_5$ Signed.

(Surface Area = Height × Width, Aspect Ratio = Height/Width.) It looks like Minitab didn't like my model as much as I did. Explain in detail why Minitab insisted on dropping the two variables from my equation.

Answer: This is perfect multicollinearity. ln Aspect Ratio = ln Height - ln Width, which is an exact linear combination of variables already in the equation. The same holds for surface area: ln Surface Area = ln Height + ln Width.
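The exact linear dependence behind the answer can be verified directly. In this sketch the painting dimensions are hypothetical numbers chosen only to illustrate the identities; they are not from the Monet data set.

```python
import math

# Hypothetical (height, width) pairs, in arbitrary units.
paintings = [(25.6, 31.9), (36.4, 28.6), (20.0, 39.5), (28.7, 36.2)]

gaps_area = []
gaps_aspect = []
for height, width in paintings:
    ln_h, ln_w = math.log(height), math.log(width)
    ln_area = math.log(height * width)
    ln_aspect = math.log(height / width)
    # Each regressor minus the linear combination that reproduces it.
    gaps_area.append(abs(ln_area - (ln_h + ln_w)))
    gaps_aspect.append(abs(ln_aspect - (ln_h - ln_w)))

max_gap_area = max(gaps_area)
max_gap_aspect = max(gaps_aspect)
print(max_gap_area, max_gap_aspect)
```

Both gaps are zero up to floating point rounding, so two of the four size variables carry no information beyond the other two, and least squares cannot separate their coefficients.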

[15] Part VI. Statistical Theory

Suppose the density of x is

$f(x) = \dfrac{1}{x\sqrt{2\pi}} \exp\left[-\dfrac{(\ln x)^2}{2}\right], \quad 0 < x < +\infty.$

What is the density of $z = x^2$?

Answer: $x = z^{1/2}$, so $dx = \frac{1}{2} z^{-1/2}\, dz$. Then

$f(z) = \dfrac{1}{z^{1/2}\sqrt{2\pi}} \exp\left[-\dfrac{(\ln z^{1/2})^2}{2}\right] \cdot \dfrac{1}{2} z^{-1/2} = \dfrac{1}{2z\sqrt{2\pi}} \exp\left[-\dfrac{(\ln z)^2}{4(2)}\right].$

This is also a lognormal density, now with $\sigma = 2$.

[15] Part VII. Very Basic Statistics

The histogram above describes the 2039 observations on the variable income used in Part IV.

1. Provide a guess of the sample mean, and explain how you obtained it.
Answer: About .5, the middle of the distribution.

2. Provide a guess of the sample median and explain how you obtained it.
Answer: About .45. It is less than the mean because the data are skewed to the right.

3. Provide a guess of the sample standard deviation and explain precisely how you obtained it.
Answer: About .16. The range, 0 to 1, should be about 6 standard deviations.

4. Are these data skewed to the left or to the right, or not at all?
Answer: To the right.
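The change of variables in Part VI can be checked numerically by comparing the transformed density with a lognormal density whose log has standard deviation 2. This is a small sketch with invented function names; the evaluation points are arbitrary.

```python
import math

def lognormal_pdf(x, mu=0.0, sigma=1.0):
    """Density of x when ln(x) is normal with mean mu and standard deviation sigma."""
    exponent = -((math.log(x) - mu) ** 2) / (2.0 * sigma ** 2)
    return math.exp(exponent) / (x * sigma * math.sqrt(2.0 * math.pi))

def density_of_square(z):
    """Density of z = x^2: f_x(sqrt(z)) times the Jacobian |dx/dz| = 1/(2*sqrt(z))."""
    jacobian = 0.5 / math.sqrt(z)
    return lognormal_pdf(math.sqrt(z)) * jacobian

# Largest discrepancy between the two expressions over a few arbitrary points.
check_points = [0.1, 1.0, 2.5, 10.0]
max_discrepancy = max(abs(density_of_square(z) - lognormal_pdf(z, 0.0, 2.0))
                      for z in check_points)
print(max_discrepancy)
```

The agreement reflects the fact that ln z = 2 ln x, so doubling a standard normal gives a normal with standard deviation 2.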

[15] Part VIII. Bivariate Outcomes

An important variable in the analysis of German health outcomes is whether the individual takes up the public health insurance. The table below shows the takeup rates for men and women.

Cross Tabulation
          PUBLIC
FEMALE    NO_INS   INS   Total
MALE
FEMALE
Total

1. What proportion of women take the insurance? What proportion of men take the insurance?
Answer: 881/

2. The chi squared value for the test of independence of gender and insurance takeup is ( ). (See class notes 10, pages ( ).) Should I conclude that insurance takeup and gender are independent based on these results? Explain in detail.
Answer: They are not independent. With one degree of freedom, the critical chi squared would be 3.84, which is less than the reported value.

3. The following are the results of a logistic regression of PUBLIC in which the only variable that explains whether the individual takes public insurance is whether the applicant is female or not. Are these results consistent with the chi squared test above? Explain.

Binary Logit Model for Binary Choice
Dependent variable PUBLIC
Log likelihood function

PUBLIC     Coefficient   Standard Error   z   Prob |z|>Z*   Confidence Interval
Constant   ***
FEMALE     .60891***
***, **, * ==> Significance at 1%, 5%, 10% level.

Answer: It is consistent, but the logit result says more: the positive, significant coefficient on FEMALE shows that women take up public insurance more than men. The chi squared test only suggests dependence, without a direction.
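The link between the cross tabulation, the chi squared test and the logit coefficient can be sketched as follows. The cell counts are hypothetical placeholders, since the table was not recovered in this transcription, and the function names are invented. The one fact relied on is standard: in a logit model with a constant and a single 0/1 regressor, the maximum likelihood estimate of the slope equals the log odds ratio of the 2×2 table.

```python
import math

def chi_squared_2x2(table):
    """Pearson chi squared statistic for independence in a two-way table."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    statistic = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            statistic += (observed - expected) ** 2 / expected
    return statistic

def log_odds_ratio(table):
    """log of (odds of INS for women) / (odds of INS for men)."""
    (male_no, male_ins), (female_no, female_ins) = table
    return math.log((female_ins / female_no) / (male_ins / male_no))

# Hypothetical counts: rows are MALE, FEMALE; columns are NO_INS, INS.
takeup = [[200, 800],
          [100, 900]]

chi2_takeup = chi_squared_2x2(takeup)
logit_slope = log_odds_ratio(takeup)  # what the FEMALE coefficient would be

# One degree of freedom: reject independence at 5% when the statistic exceeds 3.84.
reject_independence = chi2_takeup > 3.84
print(chi2_takeup, logit_slope, reject_independence)
```

The chi squared statistic is the same whichever group has the higher takeup rate, while the sign of the log odds ratio shows the direction, which is the point of the answer to question 3.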

[15] Part IX. Function of a Random Estimator

1. For the income data used earlier, I defined HHINCOME = household income = 100 × income. The sample variance of HHINCOME is ( ). This estimates the variance, $\sigma^2$. The variance of an estimator of a variance is approximately $2\sigma^4/N$. N is 2039, so the variance estimator for $s^2$, the estimator, is ( ). The precision of a random variable is defined as $\phi = 1/\sigma$. Estimate the precision of the income variable.
Answer: Precision = $1/\sigma$ = 1/sqr( ).

2. The standard error for the estimator of $\sigma^2$ in question 1 above is sqr(72.89) = ( ). How would you compute the standard error for the precision, $\phi = (\sigma^2)^{-1/2}$? What is the value? Show in detail how you obtain the result.
Answer: Use the delta method. The variance of $s^2$ is 72.89. The derivative is $d\phi/d\sigma^2 = -\frac{1}{2}(\sigma^2)^{-3/2} = -1/(2\sigma^3)$, so the squared derivative is $(1/4)/\sigma^6$. The variance of $\phi$ is therefore approximately $[(1/4)/\sigma^6] \times 72.89$, and the standard error is the square root of that value.
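The delta method steps can be traced in code. The only numbers taken from the exam are N = 2039 and the value 72.89 for the estimated variance of the variance estimator. The sample variance itself was not recovered in this transcription, so the sketch back-solves it from 2σ⁴/N = 72.89; that back-solved value is an assumption that rests on 72.89 being an unrounded application of the formula, and the results are approximate for that reason.

```python
import math

N = 2039          # sample size, from the exam
var_s2 = 72.89    # estimated variance of s^2, from the exam

# Assumption: recover s^2 by inverting var_s2 = 2 * (s^2)^2 / N.
sigma2 = math.sqrt(var_s2 * N / 2.0)

# Precision phi = (sigma^2)^(-1/2).
precision = sigma2 ** -0.5

# Delta method: Var(phi) ~ (d phi / d sigma^2)^2 * Var(s^2).
derivative = -0.5 * sigma2 ** -1.5
var_precision = derivative ** 2 * var_s2
se_precision = math.sqrt(var_precision)

se_s2 = math.sqrt(var_s2)
print(sigma2, precision, se_s2, se_precision)
```

As a cross-check, substituting Var(s²) = 2σ⁴/N into the delta method formula simplifies the standard error to φ/√(2N), which gives the same number.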


More information

Chapter Fifteen. Frequency Distribution, Cross-Tabulation, and Hypothesis Testing

Chapter Fifteen. Frequency Distribution, Cross-Tabulation, and Hypothesis Testing Chapter Fifteen Frequency Distribution, Cross-Tabulation, and Hypothesis Testing Copyright 2010 Pearson Education, Inc. publishing as Prentice Hall 15-1 Internet Usage Data Table 15.1 Respondent Sex Familiarity

More information

9. Linear Regression and Correlation

9. Linear Regression and Correlation 9. Linear Regression and Correlation Data: y a quantitative response variable x a quantitative explanatory variable (Chap. 8: Recall that both variables were categorical) For example, y = annual income,

More information

Ron Heck, Fall Week 8: Introducing Generalized Linear Models: Logistic Regression 1 (Replaces prior revision dated October 20, 2011)

Ron Heck, Fall Week 8: Introducing Generalized Linear Models: Logistic Regression 1 (Replaces prior revision dated October 20, 2011) Ron Heck, Fall 2011 1 EDEP 768E: Seminar in Multilevel Modeling rev. January 3, 2012 (see footnote) Week 8: Introducing Generalized Linear Models: Logistic Regression 1 (Replaces prior revision dated October

More information

Regression #8: Loose Ends

Regression #8: Loose Ends Regression #8: Loose Ends Econ 671 Purdue University Justin L. Tobias (Purdue) Regression #8 1 / 30 In this lecture we investigate a variety of topics that you are probably familiar with, but need to touch

More information

CHAPTER 5 FUNCTIONAL FORMS OF REGRESSION MODELS

CHAPTER 5 FUNCTIONAL FORMS OF REGRESSION MODELS CHAPTER 5 FUNCTIONAL FORMS OF REGRESSION MODELS QUESTIONS 5.1. (a) In a log-log model the dependent and all explanatory variables are in the logarithmic form. (b) In the log-lin model the dependent variable

More information

DEEP, University of Lausanne Lectures on Econometric Analysis of Count Data Pravin K. Trivedi May 2005

DEEP, University of Lausanne Lectures on Econometric Analysis of Count Data Pravin K. Trivedi May 2005 DEEP, University of Lausanne Lectures on Econometric Analysis of Count Data Pravin K. Trivedi May 2005 The lectures will survey the topic of count regression with emphasis on the role on unobserved heterogeneity.

More information

ORF 245 Fundamentals of Engineering Statistics. Final Exam

ORF 245 Fundamentals of Engineering Statistics. Final Exam Princeton University Department of Operations Research and Financial Engineering ORF 45 Fundamentals of Engineering Statistics Final Exam May 15, 009 1:30pm-4:30pm PLEASE DO NOT TURN THIS PAGE AND START

More information

Spatial Discrete Choice Models

Spatial Discrete Choice Models Spatial Discrete Choice Models Professor William Greene Stern School of Business, New York University SPATIAL ECONOMETRICS ADVANCED INSTITUTE University of Rome May 23, 2011 Spatial Correlation Spatially

More information

ECON 497: Lecture Notes 10 Page 1 of 1

ECON 497: Lecture Notes 10 Page 1 of 1 ECON 497: Lecture Notes 10 Page 1 of 1 Metropolitan State University ECON 497: Research and Forecasting Lecture Notes 10 Heteroskedasticity Studenmund Chapter 10 We'll start with a quote from Studenmund:

More information

Introduction to Linear Regression Analysis

Introduction to Linear Regression Analysis Introduction to Linear Regression Analysis Samuel Nocito Lecture 1 March 2nd, 2018 Econometrics: What is it? Interaction of economic theory, observed data and statistical methods. The science of testing

More information

Answer Key: Problem Set 5

Answer Key: Problem Set 5 : Problem Set 5. Let nopc be a dummy variable equal to one if the student does not own a PC, and zero otherwise. i. If nopc is used instead of PC in the model of: colgpa = β + δ PC + β hsgpa + β ACT +

More information

ECON 497 Midterm Spring

ECON 497 Midterm Spring ECON 497 Midterm Spring 2009 1 ECON 497: Economic Research and Forecasting Name: Spring 2009 Bellas Midterm You have three hours and twenty minutes to complete this exam. Answer all questions and explain

More information

MGEC11H3Y L01 Introduction to Regression Analysis Term Test Friday July 5, PM Instructor: Victor Yu

MGEC11H3Y L01 Introduction to Regression Analysis Term Test Friday July 5, PM Instructor: Victor Yu Last Name (Print): Solution First Name (Print): Student Number: MGECHY L Introduction to Regression Analysis Term Test Friday July, PM Instructor: Victor Yu Aids allowed: Time allowed: Calculator and one

More information

Tribhuvan University Institute of Science and Technology 2065

Tribhuvan University Institute of Science and Technology 2065 1CSc. Stat. 108-2065 Tribhuvan University Institute of Science and Technology 2065 Bachelor Level/First Year/ First Semester/ Science Full Marks: 60 Computer Science and Information Technology (Stat. 108)

More information

Econometrics Problem Set 4

Econometrics Problem Set 4 Econometrics Problem Set 4 WISE, Xiamen University Spring 2016-17 Conceptual Questions 1. This question refers to the estimated regressions in shown in Table 1 computed using data for 1988 from the CPS.

More information

y response variable x 1, x 2,, x k -- a set of explanatory variables

y response variable x 1, x 2,, x k -- a set of explanatory variables 11. Multiple Regression and Correlation y response variable x 1, x 2,, x k -- a set of explanatory variables In this chapter, all variables are assumed to be quantitative. Chapters 12-14 show how to incorporate

More information

DSST Principles of Statistics

DSST Principles of Statistics DSST Principles of Statistics Time 10 Minutes 98 Questions Each incomplete statement is followed by four suggested completions. Select the one that is best in each case. 1. Which of the following variables

More information

Making sense of Econometrics: Basics

Making sense of Econometrics: Basics Making sense of Econometrics: Basics Lecture 4: Qualitative influences and Heteroskedasticity Egypt Scholars Economic Society November 1, 2014 Assignment & feedback enter classroom at http://b.socrative.com/login/student/

More information

Introducing Generalized Linear Models: Logistic Regression

Introducing Generalized Linear Models: Logistic Regression Ron Heck, Summer 2012 Seminars 1 Multilevel Regression Models and Their Applications Seminar Introducing Generalized Linear Models: Logistic Regression The generalized linear model (GLM) represents and

More information

8 Nominal and Ordinal Logistic Regression

8 Nominal and Ordinal Logistic Regression 8 Nominal and Ordinal Logistic Regression 8.1 Introduction If the response variable is categorical, with more then two categories, then there are two options for generalized linear models. One relies on

More information

Direction: This test is worth 250 points and each problem worth points. DO ANY SIX

Direction: This test is worth 250 points and each problem worth points. DO ANY SIX Term Test 3 December 5, 2003 Name Math 52 Student Number Direction: This test is worth 250 points and each problem worth 4 points DO ANY SIX PROBLEMS You are required to complete this test within 50 minutes

More information

You may use your calculator and a single page of notes. The room is crowded. Please be careful to look only at your own exam.

You may use your calculator and a single page of notes. The room is crowded. Please be careful to look only at your own exam. LAST NAME (Please Print): KEY FIRST NAME (Please Print): HONOR PLEDGE (Please Sign): Statistics 111 Midterm 1 This is a closed book exam. You may use your calculator and a single page of notes. The room

More information

Name: Biostatistics 1 st year Comprehensive Examination: Applied in-class exam. June 8 th, 2016: 9am to 1pm

Name: Biostatistics 1 st year Comprehensive Examination: Applied in-class exam. June 8 th, 2016: 9am to 1pm Name: Biostatistics 1 st year Comprehensive Examination: Applied in-class exam June 8 th, 2016: 9am to 1pm Instructions: 1. This is exam is to be completed independently. Do not discuss your work with

More information

WISE International Masters

WISE International Masters WISE International Masters ECONOMETRICS Instructor: Brett Graham INSTRUCTIONS TO STUDENTS 1 The time allowed for this examination paper is 2 hours. 2 This examination paper contains 32 questions. You are

More information

Practice Questions for Exam 1

Practice Questions for Exam 1 Practice Questions for Exam 1 1. A used car lot evaluates their cars on a number of features as they arrive in the lot in order to determine their worth. Among the features looked at are miles per gallon

More information

Binary Dependent Variables

Binary Dependent Variables Binary Dependent Variables In some cases the outcome of interest rather than one of the right hand side variables - is discrete rather than continuous Binary Dependent Variables In some cases the outcome

More information

This exam contains 13 pages (including this cover page) and 10 questions. A Formulae sheet is provided with the exam.

This exam contains 13 pages (including this cover page) and 10 questions. A Formulae sheet is provided with the exam. Probability and Statistics FS 2017 Session Exam 22.08.2017 Time Limit: 180 Minutes Name: Student ID: This exam contains 13 pages (including this cover page) and 10 questions. A Formulae sheet is provided

More information

Descriptive Statistics Class Practice [133 marks]

Descriptive Statistics Class Practice [133 marks] Descriptive Statistics Class Practice [133 marks] The weekly wages (in dollars) of 80 employees are displayed in the cumulative frequency curve below. 1a. (i) (ii) Write down the median weekly wage. Find

More information

This is a multiple choice and short answer practice exam. It does not count towards your grade. You may use the tables in your book.

This is a multiple choice and short answer practice exam. It does not count towards your grade. You may use the tables in your book. NAME (Please Print): HONOR PLEDGE (Please Sign): statistics 101 Practice Final Key This is a multiple choice and short answer practice exam. It does not count towards your grade. You may use the tables

More information

IUT of Saint-Etienne Sales and Marketing department Mr. Ferraris Prom /04/2017

IUT of Saint-Etienne Sales and Marketing department Mr. Ferraris Prom /04/2017 IUT of Saint-Etienne Sales and Marketing department Mr. Ferraris Prom 2016-2018 14/04/2017 MATHEMATICS 2 nd semester, Test 1 length : 2 hours coefficient 1/2 Graphic calculator is allowed. Any personal

More information

Statistics 100 Exam 2 March 8, 2017

Statistics 100 Exam 2 March 8, 2017 STAT 100 EXAM 2 Spring 2017 (This page is worth 1 point. Graded on writing your name and net id clearly and circling section.) PRINT NAME (Last name) (First name) net ID CIRCLE SECTION please! L1 (MWF

More information

Incentives and Nutrition for Rotten Kids: Intrahousehold Food Allocation in the Philippines

Incentives and Nutrition for Rotten Kids: Intrahousehold Food Allocation in the Philippines Incentives and Nutrition for Rotten Kids: Intrahousehold Food Allocation in the Philippines Pierre Dubois and Ethan Ligon presented by Rachel Heath November 3, 2006 Introduction Outline Introduction Modification

More information

Quiz 1. Name: Instructions: Closed book, notes, and no electronic devices.

Quiz 1. Name: Instructions: Closed book, notes, and no electronic devices. Quiz 1. Name: Instructions: Closed book, notes, and no electronic devices. 1. What is the difference between a deterministic model and a probabilistic model? (Two or three sentences only). 2. What is the

More information

CHAPTER 7. + ˆ δ. (1 nopc) + ˆ β1. =.157, so the new intercept is = The coefficient on nopc is.157.

CHAPTER 7. + ˆ δ. (1 nopc) + ˆ β1. =.157, so the new intercept is = The coefficient on nopc is.157. CHAPTER 7 SOLUTIONS TO PROBLEMS 7. (i) The coefficient on male is 87.75, so a man is estimated to sleep almost one and one-half hours more per week than a comparable woman. Further, t male = 87.75/34.33

More information

Math 1040 Final Exam Form A Introduction to Statistics Fall Semester 2010

Math 1040 Final Exam Form A Introduction to Statistics Fall Semester 2010 Math 1040 Final Exam Form A Introduction to Statistics Fall Semester 2010 Instructor Name Time Limit: 120 minutes Any calculator is okay. Necessary tables and formulas are attached to the back of the exam.

More information

Problem #1 #2 #3 #4 #5 #6 Total Points /6 /8 /14 /10 /8 /10 /56

Problem #1 #2 #3 #4 #5 #6 Total Points /6 /8 /14 /10 /8 /10 /56 STAT 391 - Spring Quarter 2017 - Midterm 1 - April 27, 2017 Name: Student ID Number: Problem #1 #2 #3 #4 #5 #6 Total Points /6 /8 /14 /10 /8 /10 /56 Directions. Read directions carefully and show all your

More information

Hypothesis testing. Data to decisions

Hypothesis testing. Data to decisions Hypothesis testing Data to decisions The idea Null hypothesis: H 0 : the DGP/population has property P Under the null, a sample statistic has a known distribution If, under that that distribution, the

More information

Lecture 1: Description of Data. Readings: Sections 1.2,

Lecture 1: Description of Data. Readings: Sections 1.2, Lecture 1: Description of Data Readings: Sections 1.,.1-.3 1 Variable Example 1 a. Write two complete and grammatically correct sentences, explaining your primary reason for taking this course and then

More information

Regression with Qualitative Information. Part VI. Regression with Qualitative Information

Regression with Qualitative Information. Part VI. Regression with Qualitative Information Part VI Regression with Qualitative Information As of Oct 17, 2017 1 Regression with Qualitative Information Single Dummy Independent Variable Multiple Categories Ordinal Information Interaction Involving

More information

The 2010 Medici Summer School in Management Studies. William Greene Department of Economics Stern School of Business

The 2010 Medici Summer School in Management Studies. William Greene Department of Economics Stern School of Business The 2010 Medici Summer School in Management Studies William Greene Department of Economics Stern School of Business Econometric Models When There Are Unusual Events Part 5: Binary Outcomes Agenda General

More information

Correlation and regression

Correlation and regression 1 Correlation and regression Yongjua Laosiritaworn Introductory on Field Epidemiology 6 July 2015, Thailand Data 2 Illustrative data (Doll, 1955) 3 Scatter plot 4 Doll, 1955 5 6 Correlation coefficient,

More information

Econometric Analysis of Panel Data Assignment 4 Parameter Heterogeneity in Linear Models: RPM and HLM

Econometric Analysis of Panel Data Assignment 4 Parameter Heterogeneity in Linear Models: RPM and HLM Department of Economics Econometric Analysis of Panel Data Assignment 4 Parameter Heterogeneity in Linear Models: RPM and HLM The estimation parts of this assignment will be based on the Baltagi and Griffin

More information

MATH 10 INTRODUCTORY STATISTICS

MATH 10 INTRODUCTORY STATISTICS MATH 10 INTRODUCTORY STATISTICS Ramesh Yapalparvi It is Time for Homework! ( ω `) First homework + data will be posted on the website, under the homework tab. And also sent out via email. 30% weekly homework.

More information