In Class Review Exercises Vartanian: SW 540

Similar documents
Review of Multiple Regression

y response variable x 1, x 2,, x k -- a set of explanatory variables

General Linear Model (Chapter 4)

Area1 Scaled Score (NAPLEX) .535 ** **.000 N. Sig. (2-tailed)

SPSS Output. ANOVA a b Residual Coefficients a Standardized Coefficients

Can you tell the relationship between students SAT scores and their college grades?

EDF 7405 Advanced Quantitative Methods in Educational Research. Data are available on IQ of the child and seven potential predictors.

STAT 3900/4950 MIDTERM TWO Name: Spring, 2015 (print: first last ) Covered topics: Two-way ANOVA, ANCOVA, SLR, MLR and correlation analysis

Regression: Main Ideas Setting: Quantitative outcome with a quantitative explanatory variable. Example, cont.

Two-Way ANOVA. Chapter 15

Analysis of Variance. Source DF Squares Square F Value Pr > F. Model <.0001 Error Corrected Total

1 Independent Practice: Hypothesis tests for one parameter:

36-309/749 Experimental Design for Behavioral and Social Sciences. Sep. 22, 2015 Lecture 4: Linear Regression

Ordinary Least Squares Regression Explained: Vartanian

Lecture 1 Linear Regression with One Predictor Variable.p2

Lecture (chapter 13): Association between variables measured at the interval-ratio level

Difference in two or more average scores in different groups

Sociology 593 Exam 2 Answer Key March 28, 2002

N J SS W /df W N - 1

Correlation. A statistics method to measure the relationship between two variables. Three characteristics

Simple Linear Regression: One Qualitative IV

5. Let W follow a normal distribution with mean of μ and the variance of 1. Then, the pdf of W is

Lecture 11: Simple Linear Regression

Fixed and Random Effects Models: Vartanian, SW 683

REVIEW 8/2/2017 陈芳华东师大英语系

Simple Linear Regression

Chapter 1 Linear Regression with One Predictor

Failure Time of System due to the Hot Electron Effect

Simple Linear Regression: One Qualitative IV

Sociology 593 Exam 2 March 28, 2002

3 Variables: Cyberloafing Conscientiousness Age

9 Correlation and Regression

Final Exam. Question 1 (20 points) 2 (25 points) 3 (30 points) 4 (25 points) 5 (10 points) 6 (40 points) Total (150 points) Bonus question (10)

STAT 3A03 Applied Regression With SAS Fall 2017

ESP 178 Applied Research Methods. 2/23: Quantitative Analysis

Ron Heck, Fall Week 8: Introducing Generalized Linear Models: Logistic Regression 1 (Replaces prior revision dated October 20, 2011)

Chapter 1: Linear Regression with One Predictor Variable also known as: Simple Linear Regression Bivariate Linear Regression

Simple Linear Regression: One Quantitative IV

Table 1: Fish Biomass data set on 26 streams

At this point, if you ve done everything correctly, you should have data that looks something like:

One-Way ANOVA. Some examples of when ANOVA would be appropriate include:

1 A Review of Correlation and Regression

Lecture 11 Multiple Linear Regression

Correlation and Simple Linear Regression

Introducing Generalized Linear Models: Logistic Regression

Ron Heck, Fall Week 3: Notes Building a Two-Level Model

using the beginning of all regression models

Chapter 9 - Correlation and Regression

Midterm 2 - Solutions

4/22/2010. Test 3 Review ANOVA

Ordinary Least Squares Regression Explained: Vartanian

An Analysis of College Algebra Exam Scores December 14, James D Jones Math Section 01

Multiple Regression and Model Building (cont d) + GIS Lecture 21 3 May 2006 R. Ryznar

EXST Regression Techniques Page 1. We can also test the hypothesis H :" œ 0 versus H :"

Lecture 4: Multivariate Regression, Part 2

Binary Logistic Regression

Project Report for STAT571 Statistical Methods Instructor: Dr. Ramon V. Leon. Wage Data Analysis. Yuanlei Zhang

Unit 6 - Introduction to linear regression

9. Linear Regression and Correlation

SPSS LAB FILE 1

Lecture 24: Partial correlation, multiple regression, and correlation

x3,..., Multiple Regression β q α, β 1, β 2, β 3,..., β q in the model can all be estimated by least square estimators

Multiple Regression and Model Building Lecture 20 1 May 2006 R. Ryznar

Self-Assessment Weeks 8: Multiple Regression with Qualitative Predictors; Multiple Comparisons

STA441: Spring Multiple Regression. This slide show is a free open source document. See the last slide for copyright information.

Correlation and the Analysis of Variance Approach to Simple Linear Regression

MATH ASSIGNMENT 2: SOLUTIONS

Answer Key: Problem Set 6

STA 303 H1S / 1002 HS Winter 2011 Test March 7, ab 1cde 2abcde 2fghij 3

Lecture 13 Extra Sums of Squares

Booklet of Code and Output for STAC32 Final Exam

One-Way ANOVA Source Table J - 1 SS B / J - 1 MS B /MS W. Pairwise Post-Hoc Comparisons of Means

Ecn Analysis of Economic Data University of California - Davis February 23, 2010 Instructor: John Parman. Midterm 2. Name: ID Number: Section:

Unit 6 - Simple linear regression

Practical Biostatistics

Sociology Research Statistics I Final Exam Answer Key December 15, 1993

Lecture 12 Inference in MLR

Statistics 5100 Spring 2018 Exam 1

Final Exam - Solutions

Immigration attitudes (opposes immigration or supports it) it may seriously misestimate the magnitude of the effects of IVs

Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami

STA 302 H1F / 1001 HF Fall 2007 Test 1 October 24, 2007

Draft Proof - Do not copy, post, or distribute. Chapter Learning Objectives REGRESSION AND CORRELATION THE SCATTER DIAGRAM

Simple Linear Regression Using Ordinary Least Squares

Multiple Regression: Inference

Multiple Regression. More Hypothesis Testing. More Hypothesis Testing The big question: What we really want to know: What we actually know: We know:

Problem Set 10: Panel Data

Example. Multiple Regression. Review of ANOVA & Simple Regression /749 Experimental Design for Behavioral and Social Sciences

Class Notes: Week 8. Probit versus Logit Link Functions and Count Data

Section 9c. Propensity scores. Controlling for bias & confounding in observational studies

Models for Clustered Data

ANALYSES OF NCGS DATA FOR ALCOHOL STATUS CATEGORIES 1 22:46 Sunday, March 2, 2003

1 Correlation and Inference from Regression

Lecture 3: Multiple Regression. Prof. Sharyn O Halloran Sustainable Development U9611 Econometrics II

Lab 07 Introduction to Econometrics

Models for Clustered Data

" M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2

(Where does Ch. 7 on comparing 2 means or 2 proportions fit into this?)

STA 101 Final Review

Overview Scatter Plot Example

Transcription:

In Class Review Exercises Vartanian: SW 540 1. Given the following output from an OLS model looking at income, what is the slope and intercept for those who are black and those who are not black? b SE intercept 5 8 Black 9 3 Age 7 2.8 2. We want to use residuals to determine the relationship of mental health problems and income, controlling for age. What is the partial r coefficient using residuals? 3. How do we determine significance in an ANOVA model? What factors do we compare? 4. Use the two types of analyses we ve learned to examine the following rankings. Math Reading 1 3 2 4 4 2 3 1 5. Given the following nominal scale variables, what is the direction of the relationship between the variables, what is the chi-square value, and do we have a statistically significant relationship between the variables? Treatment Control Total Depressed 30 50 80 Not Depressed 40 80 120 Total 70 130 200 6. Given the following results, what is your prediction for income for the mean individual? The DV is family income. The IVs are years of education of the head (I/R), parents expect the child to get a college degree (dummy excluded are those parents with lower expectations), and family members hit each other (dummy excluded are those that do not hit each other). C:\WP60_1\LECT1.PHD\Final\Review Exercises in Class Final.doc 1

Regression Model 1 Model Summary Adjusted Std. Error of R R Square R Square the Estimate.415 a.172.171 52455.76070 a. Predictors: (Constant), family members hit each other, education of head, PCGs educ expectat of child: coll degr ANOVA b Model Squares df Square F Sig. 1 Regression 1.21E+12 3 4.024E+11 146.231.000 a Residual 5.81E+12 2112 2751606831 Total 7.02E+12 2115 a. Predictors: (Constant), family members hit each other, education of head, PCGs educ expectat of child: coll degr b. Dependent Variable: total family income Model 1 (Constant) education of head PCGs educ expectat of child: coll degr family members hit each other Coefficients a Unstandardized Coefficients a. Dependent Variable: total family income Standardized Coefficients B Std. Error Beta t Sig. -56529.0 5753.702-9.825.000 7444.029 456.878.339 16.293.000 17003.996 2569.757.138 6.617.000-9701.196 3199.498 -.061-3.032.002 Descriptive Statistics total family income education of head family members hit each other PCGs educ expectat of child: coll degr Valid N (listwise) N Minimum Maximum Std. Deviation 3557.00 1602720 46694.32 54528.34097 3536 1.00 17.00 12.6092 2.59473 2172.00 1.00.1533.36037 3484.00 1.00.6705.47010 2116 #7. A. Indicate the meaning of the standardized coefficient estimates in problem #6. B. If education of the head increased by 5.18946 grades, what is your prediction for the change in income? Give this in standard deviation units and in change in actual income. C:\WP60_1\LECT1.PHD\Final\Review Exercises in Class Final.doc 2

#8. You are examining a number of childhood predictor variables on the log of income as an adult. You use a number of race variables (with white exclude), gender (with female excluded), whether either of the person s parents were born outside of the U.S. (dummy), 8 th grade grades, standardized test score (I/R), and socioeconomic status in 8 th grade (made into an interval level variable). A. Why do we use log dependent variables? B. Which of the variables has the greatest effect on the log of income? C. Interpret the coefficient estimates for each of the variables. Coefficients a Model 1 (Constant) ASIAN BLACK AMIND HISPANIC MALE either parent born outside US GRADES IN 8TH GRADE SOCIO-ECONOMIC STATUS COMPOSITE STANDSCO a. Dependent Variable: LOGINC Unstandardized Coefficients Standardized Coefficients B Std. Error Beta t Sig. 10.141.044 229.135.000 -.142.041 -.044-3.443.001 -.308.032 -.104-9.549.000-2.81E-02.046 -.007 -.616.538 -.123.031 -.049-3.932.000 4.267E-02.017.027 2.540.011 5.520E-02.030.026 1.863.063 8.692E-02.014.080 6.261.000 5.553E-02.013.054 4.422.000 3.728E-03.006.008.599.549 D. You run a second model with regular income as the DV. Which of these two models is the better model? The results are given below. C:\WP60_1\LECT1.PHD\Final\Review Exercises in Class Final.doc 3

Model 1 (Constant) ASIAN BLACK AMIND HISPANIC MALE either parent born outside US GRADES IN 8TH GRADE SOCIO-ECONOMIC STATUS COMPOSITE STANDSCO a. Dependent Variable: INCOME99 Coefficients a Unstandardized Coefficients Standardized Coefficients B Std. Error Beta t Sig. 32166.320 1815.877 17.714.000-4124.963 1678.301 -.031-2.458.014-10128.4 1306.117 -.083-7.755.000-3043.701 1859.385 -.017-1.637.102-4547.463 1274.456 -.044-3.568.000 403.747 690.629.006.585.559 3290.146 1211.321.037 2.716.007 2916.091 570.041.064 5.116.000 3513.504 515.016.082 6.822.000 28.516 255.561.002.112.911 9. A. Are either of the following models statistically significant? B. What is the meaning of a model being statistically significant? C. What is the adjusted or corrected R 2 value in each of the models? D. Is there much difference between the R 2 value and the corrected R 2 value? Why? E. In Model 1, what is the predicted level of income (not log income) for someone who has 0 sibling, is female, has a 3.5 GPA (grades) and 0 SES. Model 1: Using Log income as the DV. IVs are # of siblings, gender, SES (I/R), 8 th grade grades. Model: MODEL1 Dependent Variable: loginc Source DF Squares Model 4 96.19388 Error 8802 5399.45745 Corrected Total 8806 5495.65133 Variable Label DF Estimate Error t Value Pr > t Intercept Intercept 1 10.13611 0.04054 250.05 <.0001 sibs 1-0.00299 0.00559-0.54 0.5926 female 1-0.04476 0.01684-2.66 0.0079 BYSES SOCIO-ECONOMIC STATUS 1 0.07569 0.01182 6.41 <.0001 COMPOSITE BYGRADS GRADES IN 8TH GRADE 1 0.09279 0.01224 7.58 <.0001 Model 2, Using Income as the DV Dependent Variable: INCOME99 Source DF Squares Square F Value Pr > F C:\WP60_1\LECT1.PHD\Final\Review Exercises in Class Final.doc 4

Model 4 1.614915E11 40372874276 38.39 <.0001 Error 8802 9.256029E12 1051582510 Corrected Total 8806 9.417521E12 Root MSE 32428 R-Square 0.0171 Dependent 41281 Adj R-Sq 0.0167 Coeff Var 78.55480 Variable Label DF Estimate Error t Value Pr > t Intercept Intercept 1 32138 1678.37157 19.15 <.0001 sibs 1 155.83464 231.61966 0.67 0.5011 female 1 45.09213 697.09642 0.06 0.9484 BYSES SOCIO-ECONOMIC STATUS 1 4113.85645 489.18432 8.41 <.0001 COMPOSITE BYGRADS GRADES IN 8TH GRADE 1 2958.13174 506.90452 5.84 <.0001 C:\WP60_1\LECT1.PHD\Final\Review Exercises in Class Final.doc 5

10. You want to determine if area of residence helps in predicting grade point average. You are looking at 3 different areas: Urban, suburban, and rural. You get the results below, including a post hoc Scheffe test. Oneway ANOVA GRADES IN 8TH GRADE Squares df Square F Sig. Between Groups 7.996 2 3.998 7.570.001 Within Groups 5012.466 9491.528 Total 5020.462 9493 Post Hoc Tests Dependent Variable: GRADES IN 8TH GRADE Scheffe Multiple Comparisons (I) Area of residence Urban Suburban Rural (J) Area of residence Suburban Rural Urban Rural Urban Suburban *. The mean difference is significant at the.05 level. A. Is there a significant difference among groups? B. Which groups are different? C. How different are the individual groups? Difference 95% Confidence Interval (I-J) Std. Error Sig. Lower Bound Upper Bound.0048.01904.969 -.0419.0514.0648*.02011.006.0156.1140 -.0048.01904.969 -.0514.0419.0600*.01725.002.0178.1023 -.0648*.02011.006 -.1140 -.0156 -.0600*.01725.002 -.1023 -.0178 11. You have the following 3 groups and want to determine if there is a statistically significant difference in the groups. Is there? Married Never Married Divorce/Separate 6 7 9 5 8 10 6 9 8 7 8 9 Sum 24 32 36 C:\WP60_1\LECT1.PHD\Final\Review Exercises in Class Final.doc 6

12. Given the following data, how would we use residuals to determine partial r s and partial b s? data a; input kids nbpov income; cards; 7 20.00 30000.00 7 15.00 30000.00 8 15.00 29999.00 1 14.00 25000.00 2 14.00 25000.00 3 13.00 22000.00 4 11.00 35200.00 5 10.00 34000.00 3 11.00 35000.00 3 10.00 33800.00 2 11.00 35000.00 1 10.00 34000.00 0 8.00 40000.00 0 9.00 41200.00 0 8.00 41000.00 4 9.00 42000.00 4 8.00 41000.00 5 9.00 42000.00 ; proc reg; model income=kids; output out=b r=resid1 p=pred1; run; proc reg;model nbpov=kids; output out=c r=resid2 p=pred2; run; proc reg;model kids=nbpov; output out=e r=resid3 p=pred3; run; proc reg;model income=nbpov; output out=f r=resid4 p=pred4; run; proc reg;model kids=income; output out=e r=resid5 p=pred5; run; proc reg;model nbpov=income; output out=f r=resid6 p=pred6; run; C:\WP60_1\LECT1.PHD\Final\Review Exercises in Class Final.doc 7

The SAS System 06:54 Wednesday, December 10, 2003 56 The REG Procedure Model: MODEL1 Dependent Variable: income Source DF Squares Square F Value Pr > F Model 1 29805484 29805484 0.74 0.4010 Error 16 640542984 40033937 Corrected Total 17 670348468 Root MSE 6327.23767 R-Square 0.0445 Dependent 34233 Adj R-Sq -0.0153 Coeff Var 18.48271 Parameter Estimates Variable DF Estimate Error t Value Pr > t Intercept 1 35991 2524.95100 14.25 <.0001 kids 1-536.34584 621.59997-0.86 0.4010 The SAS System 06:54 Wednesday, December 10, 2003 57 The REG Procedure Model: MODEL1 Dependent Variable: nbpov Source DF Squares Square F Value Pr > F Model 1 55.82845 55.82845 7.54 0.0143 Error 16 118.44933 7.40308 Corrected Total 17 174.27778 Root MSE 2.72086 R-Square 0.3203 Dependent 11.38889 Adj R-Sq 0.2779 Coeff Var 23.89048 Parameter Estimates Variable DF Estimate Error t Value Pr > t Intercept 1 8.98284 1.08579 8.27 <.0001 kids 1 0.73405 0.26730 2.75 0.0143 C:\WP60_1\LECT1.PHD\Final\Review Exercises in Class Final.doc 8

The SAS System 06:54 Wednesday, December 10, 2003 58 The REG Procedure Model: MODEL1 Dependent Variable: kids Source DF Squares Square F Value Pr > F Model 1 33.19096 33.19096 7.54 0.0143 Error 16 70.42015 4.40126 Corrected Total 17 103.61111 Root MSE 2.09792 R-Square 0.3203 Dependent 3.27778 Adj R-Sq 0.2779 Coeff Var 64.00427 Parameter Estimates Variable DF Estimate Error t Value Pr > t Intercept 1-1.69238 1.87621-0.90 0.3804 nbpov 1 0.43640 0.15892 2.75 0.0143 The SAS System 06:54 Wednesday, December 10, 2003 59 The REG Procedure Model: MODEL1 Dependent Variable: income Source DF Squares Square F Value Pr > F Model 1 369715492 369715492 19.68 0.0004 Error 16 300632976 18789561 Corrected Total 17 670348468 Root MSE 4334.69272 R-Square 0.5515 Dependent 34233 Adj R-Sq 0.5235 Coeff Var 12.66222 Parameter Estimates Variable DF Estimate Error t Value Pr > t Intercept 1 50821 3876.60354 13.11 <.0001 nbpov 1-1456.50781 328.35022-4.44 0.0004 C:\WP60_1\LECT1.PHD\Final\Review Exercises in Class Final.doc 9

The SAS System 06:54 Wednesday, December 10, 2003 60 The REG Procedure Model: MODEL1 Dependent Variable: kids Source DF Squares Square F Value Pr > F Model 1 4.60683 4.60683 0.74 0.4010 Error 16 99.00428 6.18777 Corrected Total 17 103.61111 Root MSE 2.48752 R-Square 0.0445 Dependent 3.27778 Adj R-Sq -0.0153 Coeff Var 75.89051 Parameter Estimates Variable DF Estimate Error t Value Pr > t Intercept 1 6.11569 3.34086 1.83 0.0859 income 1-0.00008290 0.00009608-0.86 0.4010 The SAS System 06:54 Wednesday, December 10, 2003 61 The REG Procedure Model: MODEL1 Dependent Variable: nbpov Source DF Squares Square F Value Pr > F Model 1 96.11896 96.11896 19.68 0.0004 Error 16 78.15882 4.88493 Corrected Total 17 174.27778 Root MSE 2.21019 R-Square 0.5515 Dependent 11.38889 Adj R-Sq 0.5235 Coeff Var 19.40652 Parameter Estimates Variable DF Estimate Error t Value Pr > t Intercept 1 24.35180 2.96839 8.20 <.0001 income 1-0.00037866 0.00008536-4.44 0.0004 C:\WP60_1\LECT1.PHD\Final\Review Exercises in Class Final.doc 10

13. You have standardized all of your continuous variables by dividing by their respective standard deviations. Interpret the following output. DV: Income IV: Kids IV: nb poverty rate IV: White b SE Kids 1.2.5 NBPOV -.5.25 White 1.6.8 C:\WP60_1\LECT1.PHD\Final\Review Exercises in Class Final.doc 11

#1 Answer. Yp=5 + 9 Black + 7 Age Black: Yp=5+9+7Age Yp=14+7Age White: Yp=5 + 7Age Answer for #2. MHP=X1 Income=Y Age=X2 Y=a+bx2+e1 X1=a+bx2+e2 Correlate e1 and e2. Answer for #3. We compare the within to the between. If we find a relatively large between and a relatively small within, group membership helps us in predicting the outcome. #4 Answer. Correlations Kendall's tau_b Spearman's rho MATH READING MATH READING Correlation Coefficient Sig. (2-tailed) N Correlation Coefficient Sig. (2-tailed) N Correlation Coefficient Sig. (2-tailed) N Correlation Coefficient Sig. (2-tailed) N MATH READING 1.000 -.333..497 4 4 -.333 1.000.497. 4 4 1.000 -.600..400 4 4 -.600 1.000.400. 4 4 Neither Kendall or Spearman show a significant relationship. C:\WP60_1\LECT1.PHD\Final\Review Exercises in Class Final.doc 12

#5 Answer. Treatment Control Total Depressed 30 (28) 50 (52) 80 Not Depressed 40 (42) 80 (78) 120 Total 70 130 200 Positive relationship between the control group and not depressed. Chi-square value is: Obs Expect Difference Difference Squared chi-squared value 30 28 2 4 4/28=.14 50 52 2 4 4/52=.08 40 42 2 4 4/42=.10 80 78 2 4 4/78=.05 chi-square value =.14+.08+.10+.05 =.37 at 1 DF. CV at.05 level is 3.84. Therefore, you will fail to reject H 0. #6 Answer. To come up with the predicted value for the mean individual, you would use the Yp equation with the b values from the coefficient estimates, and use the mean values for the variables for the X values. Yp = -56529+7444*12.6+17004*.67+(-9701)*.36 = 45165.7 Answer for #7. A. For a one standard deviation increase in education level of the head, we predict that income will increase by.339 standard deviation units. B. If education level increased by roughly 2 standard deviation units, we would predict that income would increase by.678 standard deviation units. Because the standard deviation for income is 54528, a 2 standard deviation increase in education level would lead to a.678*54528 = 36969.98 increase in income. Answers for #8. A. We use log dependent variables to decrease the effects of outliers and to help interpret the effects of independent variables on scaled dependent variables. B. It appears that 8 th grade grades has the larger effect on log income relative to the other I/R variables. Standardized test scores appear to have no effect on income as an adult. C. You would need to make the transformation of these coefficient estimates using the exponential function in order to get the percentage change for a one unit increase in the IV or the percentage difference between the include and the excluded group. C:\WP60_1\LECT1.PHD\Final\Review Exercises in Class Final.doc 13

D. You would need to know the R2 values of the two models to know which is a better fitting model. Answers for #9. First Equation: R 2 = 96.19/5496.65 = 0.0175 Adjusted R 2 = 0.0175 - [4/(8807-4-1) * (1-.0175)] =.017054 There are at least 2 ways to determine the F value. One way is by using the R 2 value. F 4,8802 =[(.0175)/4] / [(1-.0175)/8802] = 39.20 Or use the mean square model/mean square error = (96.19/4) / (5399.46/8802) = 39.20 The critical value for a.05 F test at 4 and 8802 DFs is 2.37. For a.01 test, the critical value is 3.32. Our F value exceeds these critical values and therefore we will reject the null hypothesis. Second Equation R 2 = 1.614915/94.175231 = 0.0171 (roughly) (we could have added lots of zeros onto the end of these values but as long as we know that denominator is one decimal place larger than the numerator, we can determine their relative values.) Adjusted R 2 = 0.0171 - [4/(8807-4-1) * (1-.0171)] =.017054 =.01665 There are at least 2 ways to determine the F value. One way is by using the R 2 value. F 4,8802 =[(.0171)/4] / [(1-.0171)/8802] = 38.39 Or use the mean square model/mean square error = (1.614915/4) / (92.56029/8802) = 38.39 (Again, we know that the denominator has an extra decimal place or is one decimal place farther to the right -- relative to the numerator.) The critical value for a.05 F test at 4 and 8802 DFs is 2.37. For a.01 test, the critical value is 3.32. Our F value exceeds these critical values and therefore we will reject the null hypothesis. The meaning of a model being statistically significant: In all likelihood, the set of independent variables helps us in explaining the variance of the dependent variable. The R2 and corrected R2 are very similar because we are not using many independent variables in the model and because our sample size is very large. E. Yp= 10.13611 + (-.00299)*0+(-.04476)*1+(.07569)*0+(.09279)*3.5 = 10.416115 Take the exponential of this: e 10.416115 = $33393.44827 C:\WP60_1\LECT1.PHD\Final\Review Exercises in Class Final.doc 14

Answers for #10. A. The F test indicates that there is an overall difference among the groups, statistically significant at the.001 level. B. There is no difference between urban and suburban group. There is a difference between the rural and the urban groups, and the suburban and rural groups. C. The GPA of urban groups is.0648 points higher than the rural group. The GPA for the suburban group is.0600 high relative to the rural group. Answers for #11. Determine the within sums of squares: Married group: mean =6. (6-6) 2 +(5-6) 2 +(6-6) 2 + (7-6) 2 = 2 NM Group: =8. (7-8) 2 +(8-8) 2 + (9-8) 2 + (8-8) 2 = 2 DS Group: =9. (9-9) 2 + (10-9) 2 +(8-9) 2 + (9-9) 2 = 2 The within sums of square = 6. Divide this value by n-k, or 12-3 =9. The mean of the within sums of squares = 6/9 =.667. To determine the between sums of squares we first need to determine the overall mean value: (24+32+36)/12 = 92/12 = 7.67. 4*(6-7.67) 2 + 4*(8-7.67) 2 + 4*(9-7.67) 2 = 4*(-1.67) 2 + 4*(.33) 2 + 4*(1.33) 2 = 4*2.78 + 4*.109 + 4* 1.77 = 18.64 To get the mean between SS, divide by k-1, or 2: 18.64/2 = 9.32. F 2,9 = 9.32/.667 =13.97 The critical F values are 4.26 at the.05 level and 8.02 at the.01 level. We will reject the null hypothesis at both levels. Sort of an answer for #12. We would use the intercept values and b coefficients from the various bivariate regression models to determine residuals for particular partial correlations and b coefficient estimates that were of interest to us. For example, if we wanted to partial out the effect of kids, we would use the first two regression models, come up with residuals for each of those two models, then either run bivariate correlations between the two residuals or run a regression with the two residuals. You will choose the residual that results from the equation using the dependent variable as the dependent variable in a regression model. You will use the residual that results from the use of the independent variable as independent variable in the regression. 13. For a 1 SD increase in kids, income is predicted to increase by 1.2 SD units. For a 1 SD unit increase in NB poverty, income is predicted to decrease by.5 SD units. Whites have incomes that are 1.6 SD units higher than non-whites. C:\WP60_1\LECT1.PHD\Final\Review Exercises in Class Final.doc 15