In Class Review Exercises Vartanian: SW 540

Size: px
Start display at page:

Download "In Class Review Exercises Vartanian: SW 540"

Transcription

1 In Class Review Exercises Vartanian: SW Given the following output from an OLS model looking at income, what is the slope and intercept for those who are black and those who are not black? b SE intercept 5 8 Black 9 3 Age We want to use residuals to determine the relationship of mental health problems and income, controlling for age. What is the partial r coefficient using residuals? 3. How do we determine significance in an ANOVA model? What factors do we compare? 4. Use the two types of analyses we ve learned to examine the following rankings. Math Reading Given the following nominal scale variables, what is the direction of the relationship between the variables, what is the chi-square value, and do we have a statistically significant relationship between the variables? Treatment Control Total Depressed Not Depressed Total Given the following results, what is your prediction for income for the mean individual? The DV is family income. The IVs are years of education of the head (I/R), parents expect the child to get a college degree (dummy excluded are those parents with lower expectations), and family members hit each other (dummy excluded are those that do not hit each other). C:\WP60_1\LECT1.PHD\Final\Review Exercises in Class Final.doc 1

2 Regression Model 1 Model Summary Adjusted Std. Error of R R Square R Square the Estimate.415 a a. Predictors: (Constant), family members hit each other, education of head, PCGs educ expectat of child: coll degr ANOVA b Model Squares df Square F Sig. 1 Regression 1.21E E a Residual 5.81E Total 7.02E a. Predictors: (Constant), family members hit each other, education of head, PCGs educ expectat of child: coll degr b. Dependent Variable: total family income Model 1 (Constant) education of head PCGs educ expectat of child: coll degr family members hit each other Coefficients a Unstandardized Coefficients a. Dependent Variable: total family income Standardized Coefficients B Std. Error Beta t Sig Descriptive Statistics total family income education of head family members hit each other PCGs educ expectat of child: coll degr Valid N (listwise) N Minimum Maximum Std. Deviation #7. A. Indicate the meaning of the standardized coefficient estimates in problem #6. B. If education of the head increased by grades, what is your prediction for the change in income? Give this in standard deviation units and in change in actual income. C:\WP60_1\LECT1.PHD\Final\Review Exercises in Class Final.doc 2

3 #8. You are examining a number of childhood predictor variables on the log of income as an adult. You use a number of race variables (with white exclude), gender (with female excluded), whether either of the person s parents were born outside of the U.S. (dummy), 8 th grade grades, standardized test score (I/R), and socioeconomic status in 8 th grade (made into an interval level variable). A. Why do we use log dependent variables? B. Which of the variables has the greatest effect on the log of income? C. Interpret the coefficient estimates for each of the variables. Coefficients a Model 1 (Constant) ASIAN BLACK AMIND HISPANIC MALE either parent born outside US GRADES IN 8TH GRADE SOCIO-ECONOMIC STATUS COMPOSITE STANDSCO a. Dependent Variable: LOGINC Unstandardized Coefficients Standardized Coefficients B Std. Error Beta t Sig E E E E E E D. You run a second model with regular income as the DV. Which of these two models is the better model? The results are given below. C:\WP60_1\LECT1.PHD\Final\Review Exercises in Class Final.doc 3

4 Model 1 (Constant) ASIAN BLACK AMIND HISPANIC MALE either parent born outside US GRADES IN 8TH GRADE SOCIO-ECONOMIC STATUS COMPOSITE STANDSCO a. Dependent Variable: INCOME99 Coefficients a Unstandardized Coefficients Standardized Coefficients B Std. Error Beta t Sig A. Are either of the following models statistically significant? B. What is the meaning of a model being statistically significant? C. What is the adjusted or corrected R 2 value in each of the models? D. Is there much difference between the R 2 value and the corrected R 2 value? Why? E. In Model 1, what is the predicted level of income (not log income) for someone who has 0 sibling, is female, has a 3.5 GPA (grades) and 0 SES. Model 1: Using Log income as the DV. IVs are # of siblings, gender, SES (I/R), 8 th grade grades. Model: MODEL1 Dependent Variable: loginc Source DF Squares Model Error Corrected Total Variable Label DF Estimate Error t Value Pr > t Intercept Intercept <.0001 sibs female BYSES SOCIO-ECONOMIC STATUS <.0001 COMPOSITE BYGRADS GRADES IN 8TH GRADE <.0001 Model 2, Using Income as the DV Dependent Variable: INCOME99 Source DF Squares Square F Value Pr > F C:\WP60_1\LECT1.PHD\Final\Review Exercises in Class Final.doc 4

5 Model E <.0001 Error E Corrected Total E12 Root MSE R-Square Dependent Adj R-Sq Coeff Var Variable Label DF Estimate Error t Value Pr > t Intercept Intercept <.0001 sibs female BYSES SOCIO-ECONOMIC STATUS <.0001 COMPOSITE BYGRADS GRADES IN 8TH GRADE <.0001 C:\WP60_1\LECT1.PHD\Final\Review Exercises in Class Final.doc 5

6 10. You want to determine if area of residence helps in predicting grade point average. You are looking at 3 different areas: Urban, suburban, and rural. You get the results below, including a post hoc Scheffe test. Oneway ANOVA GRADES IN 8TH GRADE Squares df Square F Sig. Between Groups Within Groups Total Post Hoc Tests Dependent Variable: GRADES IN 8TH GRADE Scheffe Multiple Comparisons (I) Area of residence Urban Suburban Rural (J) Area of residence Suburban Rural Urban Rural Urban Suburban *. The mean difference is significant at the.05 level. A. Is there a significant difference among groups? B. Which groups are different? C. How different are the individual groups? Difference 95% Confidence Interval (I-J) Std. Error Sig. Lower Bound Upper Bound * * * * You have the following 3 groups and want to determine if there is a statistically significant difference in the groups. Is there? Married Never Married Divorce/Separate Sum C:\WP60_1\LECT1.PHD\Final\Review Exercises in Class Final.doc 6

7 12. Given the following data, how would we use residuals to determine partial r s and partial b s? data a; input kids nbpov income; cards; ; proc reg; model income=kids; output out=b r=resid1 p=pred1; run; proc reg;model nbpov=kids; output out=c r=resid2 p=pred2; run; proc reg;model kids=nbpov; output out=e r=resid3 p=pred3; run; proc reg;model income=nbpov; output out=f r=resid4 p=pred4; run; proc reg;model kids=income; output out=e r=resid5 p=pred5; run; proc reg;model nbpov=income; output out=f r=resid6 p=pred6; run; C:\WP60_1\LECT1.PHD\Final\Review Exercises in Class Final.doc 7

8 The SAS System 06:54 Wednesday, December 10, The REG Procedure Model: MODEL1 Dependent Variable: income Source DF Squares Square F Value Pr > F Model Error Corrected Total Root MSE R-Square Dependent Adj R-Sq Coeff Var Parameter Estimates Variable DF Estimate Error t Value Pr > t Intercept <.0001 kids The SAS System 06:54 Wednesday, December 10, The REG Procedure Model: MODEL1 Dependent Variable: nbpov Source DF Squares Square F Value Pr > F Model Error Corrected Total Root MSE R-Square Dependent Adj R-Sq Coeff Var Parameter Estimates Variable DF Estimate Error t Value Pr > t Intercept <.0001 kids C:\WP60_1\LECT1.PHD\Final\Review Exercises in Class Final.doc 8

9 The SAS System 06:54 Wednesday, December 10, The REG Procedure Model: MODEL1 Dependent Variable: kids Source DF Squares Square F Value Pr > F Model Error Corrected Total Root MSE R-Square Dependent Adj R-Sq Coeff Var Parameter Estimates Variable DF Estimate Error t Value Pr > t Intercept nbpov The SAS System 06:54 Wednesday, December 10, The REG Procedure Model: MODEL1 Dependent Variable: income Source DF Squares Square F Value Pr > F Model Error Corrected Total Root MSE R-Square Dependent Adj R-Sq Coeff Var Parameter Estimates Variable DF Estimate Error t Value Pr > t Intercept <.0001 nbpov C:\WP60_1\LECT1.PHD\Final\Review Exercises in Class Final.doc 9

10 The SAS System 06:54 Wednesday, December 10, The REG Procedure Model: MODEL1 Dependent Variable: kids Source DF Squares Square F Value Pr > F Model Error Corrected Total Root MSE R-Square Dependent Adj R-Sq Coeff Var Parameter Estimates Variable DF Estimate Error t Value Pr > t Intercept income The SAS System 06:54 Wednesday, December 10, The REG Procedure Model: MODEL1 Dependent Variable: nbpov Source DF Squares Square F Value Pr > F Model Error Corrected Total Root MSE R-Square Dependent Adj R-Sq Coeff Var Parameter Estimates Variable DF Estimate Error t Value Pr > t Intercept <.0001 income C:\WP60_1\LECT1.PHD\Final\Review Exercises in Class Final.doc 10

11 13. You have standardized all of your continuous variables by dividing by their respective standard deviations. Interpret the following output. DV: Income IV: Kids IV: nb poverty rate IV: White b SE Kids NBPOV White C:\WP60_1\LECT1.PHD\Final\Review Exercises in Class Final.doc 11

12 #1 Answer. Yp=5 + 9 Black + 7 Age Black: Yp=5+9+7Age Yp=14+7Age White: Yp=5 + 7Age Answer for #2. MHP=X1 Income=Y Age=X2 Y=a+bx2+e1 X1=a+bx2+e2 Correlate e1 and e2. Answer for #3. We compare the within to the between. If we find a relatively large between and a relatively small within, group membership helps us in predicting the outcome. #4 Answer. Correlations Kendall's tau_b Spearman's rho MATH READING MATH READING Correlation Coefficient Sig. (2-tailed) N Correlation Coefficient Sig. (2-tailed) N Correlation Coefficient Sig. (2-tailed) N Correlation Coefficient Sig. (2-tailed) N MATH READING Neither Kendall or Spearman show a significant relationship. C:\WP60_1\LECT1.PHD\Final\Review Exercises in Class Final.doc 12

13 #5 Answer. Treatment Control Total Depressed 30 (28) 50 (52) 80 Not Depressed 40 (42) 80 (78) 120 Total Positive relationship between the control group and not depressed. Chi-square value is: Obs Expect Difference Difference Squared chi-squared value /28= /52= /42= /78=.05 chi-square value = =.37 at 1 DF. CV at.05 level is Therefore, you will fail to reject H 0. #6 Answer. To come up with the predicted value for the mean individual, you would use the Yp equation with the b values from the coefficient estimates, and use the mean values for the variables for the X values. Yp = * *.67+(-9701)*.36 = Answer for #7. A. For a one standard deviation increase in education level of the head, we predict that income will increase by.339 standard deviation units. B. If education level increased by roughly 2 standard deviation units, we would predict that income would increase by.678 standard deviation units. Because the standard deviation for income is 54528, a 2 standard deviation increase in education level would lead to a.678*54528 = increase in income. Answers for #8. A. We use log dependent variables to decrease the effects of outliers and to help interpret the effects of independent variables on scaled dependent variables. B. It appears that 8 th grade grades has the larger effect on log income relative to the other I/R variables. Standardized test scores appear to have no effect on income as an adult. C. You would need to make the transformation of these coefficient estimates using the exponential function in order to get the percentage change for a one unit increase in the IV or the percentage difference between the include and the excluded group. C:\WP60_1\LECT1.PHD\Final\Review Exercises in Class Final.doc 13

14 D. You would need to know the R2 values of the two models to know which is a better fitting model. Answers for #9. First Equation: R 2 = 96.19/ = Adjusted R 2 = [4/( ) * ( )] = There are at least 2 ways to determine the F value. One way is by using the R 2 value. F 4,8802 =[(.0175)/4] / [( )/8802] = Or use the mean square model/mean square error = (96.19/4) / ( /8802) = The critical value for a.05 F test at 4 and 8802 DFs is For a.01 test, the critical value is Our F value exceeds these critical values and therefore we will reject the null hypothesis. Second Equation R 2 = / = (roughly) (we could have added lots of zeros onto the end of these values but as long as we know that denominator is one decimal place larger than the numerator, we can determine their relative values.) Adjusted R 2 = [4/( ) * ( )] = = There are at least 2 ways to determine the F value. One way is by using the R 2 value. F 4,8802 =[(.0171)/4] / [( )/8802] = Or use the mean square model/mean square error = ( /4) / ( /8802) = (Again, we know that the denominator has an extra decimal place or is one decimal place farther to the right -- relative to the numerator.) The critical value for a.05 F test at 4 and 8802 DFs is For a.01 test, the critical value is Our F value exceeds these critical values and therefore we will reject the null hypothesis. The meaning of a model being statistically significant: In all likelihood, the set of independent variables helps us in explaining the variance of the dependent variable. The R2 and corrected R2 are very similar because we are not using many independent variables in the model and because our sample size is very large. E. Yp= ( )*0+( )*1+(.07569)*0+(.09279)*3.5 = Take the exponential of this: e = $ C:\WP60_1\LECT1.PHD\Final\Review Exercises in Class Final.doc 14

15 Answers for #10. A. The F test indicates that there is an overall difference among the groups, statistically significant at the.001 level. B. There is no difference between urban and suburban group. There is a difference between the rural and the urban groups, and the suburban and rural groups. C. The GPA of urban groups is.0648 points higher than the rural group. The GPA for the suburban group is.0600 high relative to the rural group. Answers for #11. Determine the within sums of squares: Married group: mean =6. (6-6) 2 +(5-6) 2 +(6-6) 2 + (7-6) 2 = 2 NM Group: =8. (7-8) 2 +(8-8) 2 + (9-8) 2 + (8-8) 2 = 2 DS Group: =9. (9-9) 2 + (10-9) 2 +(8-9) 2 + (9-9) 2 = 2 The within sums of square = 6. Divide this value by n-k, or 12-3 =9. The mean of the within sums of squares = 6/9 =.667. To determine the between sums of squares we first need to determine the overall mean value: ( )/12 = 92/12 = *(6-7.67) 2 + 4*(8-7.67) 2 + 4*(9-7.67) 2 = 4*(-1.67) 2 + 4*(.33) 2 + 4*(1.33) 2 = 4* * * 1.77 = To get the mean between SS, divide by k-1, or 2: 18.64/2 = F 2,9 = 9.32/.667 =13.97 The critical F values are 4.26 at the.05 level and 8.02 at the.01 level. We will reject the null hypothesis at both levels. Sort of an answer for #12. We would use the intercept values and b coefficients from the various bivariate regression models to determine residuals for particular partial correlations and b coefficient estimates that were of interest to us. For example, if we wanted to partial out the effect of kids, we would use the first two regression models, come up with residuals for each of those two models, then either run bivariate correlations between the two residuals or run a regression with the two residuals. You will choose the residual that results from the equation using the dependent variable as the dependent variable in a regression model. You will use the residual that results from the use of the independent variable as independent variable in the regression. 13. For a 1 SD increase in kids, income is predicted to increase by 1.2 SD units. For a 1 SD unit increase in NB poverty, income is predicted to decrease by.5 SD units. Whites have incomes that are 1.6 SD units higher than non-whites. C:\WP60_1\LECT1.PHD\Final\Review Exercises in Class Final.doc 15

Review of Multiple Regression

Review of Multiple Regression Ronald H. Heck 1 Let s begin with a little review of multiple regression this week. Linear models [e.g., correlation, t-tests, analysis of variance (ANOVA), multiple regression, path analysis, multivariate

More information

y response variable x 1, x 2,, x k -- a set of explanatory variables

y response variable x 1, x 2,, x k -- a set of explanatory variables 11. Multiple Regression and Correlation y response variable x 1, x 2,, x k -- a set of explanatory variables In this chapter, all variables are assumed to be quantitative. Chapters 12-14 show how to incorporate

More information

General Linear Model (Chapter 4)

General Linear Model (Chapter 4) General Linear Model (Chapter 4) Outcome variable is considered continuous Simple linear regression Scatterplots OLS is BLUE under basic assumptions MSE estimates residual variance testing regression coefficients

More information

Area1 Scaled Score (NAPLEX) .535 ** **.000 N. Sig. (2-tailed)

Area1 Scaled Score (NAPLEX) .535 ** **.000 N. Sig. (2-tailed) Institutional Assessment Report Texas Southern University College of Pharmacy and Health Sciences "An Analysis of 2013 NAPLEX, P4-Comp. Exams and P3 courses The following analysis illustrates relationships

More information

SPSS Output. ANOVA a b Residual Coefficients a Standardized Coefficients

SPSS Output. ANOVA a b Residual Coefficients a Standardized Coefficients SPSS Output Homework 1-1e ANOVA a Sum of Squares df Mean Square F Sig. 1 Regression 351.056 1 351.056 11.295.002 b Residual 932.412 30 31.080 Total 1283.469 31 a. Dependent Variable: Sexual Harassment

More information

Can you tell the relationship between students SAT scores and their college grades?

Can you tell the relationship between students SAT scores and their college grades? Correlation One Challenge Can you tell the relationship between students SAT scores and their college grades? A: The higher SAT scores are, the better GPA may be. B: The higher SAT scores are, the lower

More information

EDF 7405 Advanced Quantitative Methods in Educational Research. Data are available on IQ of the child and seven potential predictors.

EDF 7405 Advanced Quantitative Methods in Educational Research. Data are available on IQ of the child and seven potential predictors. EDF 7405 Advanced Quantitative Methods in Educational Research Data are available on IQ of the child and seven potential predictors. Four are medical variables available at the birth of the child: Birthweight

More information

STAT 3900/4950 MIDTERM TWO Name: Spring, 2015 (print: first last ) Covered topics: Two-way ANOVA, ANCOVA, SLR, MLR and correlation analysis

STAT 3900/4950 MIDTERM TWO Name: Spring, 2015 (print: first last ) Covered topics: Two-way ANOVA, ANCOVA, SLR, MLR and correlation analysis STAT 3900/4950 MIDTERM TWO Name: Spring, 205 (print: first last ) Covered topics: Two-way ANOVA, ANCOVA, SLR, MLR and correlation analysis Instructions: You may use your books, notes, and SPSS/SAS. NO

More information

Regression: Main Ideas Setting: Quantitative outcome with a quantitative explanatory variable. Example, cont.

Regression: Main Ideas Setting: Quantitative outcome with a quantitative explanatory variable. Example, cont. TCELL 9/4/205 36-309/749 Experimental Design for Behavioral and Social Sciences Simple Regression Example Male black wheatear birds carry stones to the nest as a form of sexual display. Soler et al. wanted

More information

Two-Way ANOVA. Chapter 15

Two-Way ANOVA. Chapter 15 Two-Way ANOVA Chapter 15 Interaction Defined An interaction is present when the effects of one IV depend upon a second IV Interaction effect : The effect of each IV across the levels of the other IV When

More information

Analysis of Variance. Source DF Squares Square F Value Pr > F. Model <.0001 Error Corrected Total

Analysis of Variance. Source DF Squares Square F Value Pr > F. Model <.0001 Error Corrected Total Math 221: Linear Regression and Prediction Intervals S. K. Hyde Chapter 23 (Moore, 5th Ed.) (Neter, Kutner, Nachsheim, and Wasserman) The Toluca Company manufactures refrigeration equipment as well as

More information

1 Independent Practice: Hypothesis tests for one parameter:

1 Independent Practice: Hypothesis tests for one parameter: 1 Independent Practice: Hypothesis tests for one parameter: Data from the Indian DHS survey from 2006 includes a measure of autonomy of the women surveyed (a scale from 0-10, 10 being the most autonomous)

More information

36-309/749 Experimental Design for Behavioral and Social Sciences. Sep. 22, 2015 Lecture 4: Linear Regression

36-309/749 Experimental Design for Behavioral and Social Sciences. Sep. 22, 2015 Lecture 4: Linear Regression 36-309/749 Experimental Design for Behavioral and Social Sciences Sep. 22, 2015 Lecture 4: Linear Regression TCELL Simple Regression Example Male black wheatear birds carry stones to the nest as a form

More information

Ordinary Least Squares Regression Explained: Vartanian

Ordinary Least Squares Regression Explained: Vartanian Ordinary Least Squares Regression Eplained: Vartanian When to Use Ordinary Least Squares Regression Analysis A. Variable types. When you have an interval/ratio scale dependent variable.. When your independent

More information

Lecture 1 Linear Regression with One Predictor Variable.p2

Lecture 1 Linear Regression with One Predictor Variable.p2 Lecture Linear Regression with One Predictor Variablep - Basics - Meaning of regression parameters p - β - the slope of the regression line -it indicates the change in mean of the probability distn of

More information

Lecture (chapter 13): Association between variables measured at the interval-ratio level

Lecture (chapter 13): Association between variables measured at the interval-ratio level Lecture (chapter 13): Association between variables measured at the interval-ratio level Ernesto F. L. Amaral April 9 11, 2018 Advanced Methods of Social Research (SOCI 420) Source: Healey, Joseph F. 2015.

More information

Difference in two or more average scores in different groups

Difference in two or more average scores in different groups ANOVAs Analysis of Variance (ANOVA) Difference in two or more average scores in different groups Each participant tested once Same outcome tested in each group Simplest is one-way ANOVA (one variable as

More information

Sociology 593 Exam 2 Answer Key March 28, 2002

Sociology 593 Exam 2 Answer Key March 28, 2002 Sociology 59 Exam Answer Key March 8, 00 I. True-False. (0 points) Indicate whether the following statements are true or false. If false, briefly explain why.. A variable is called CATHOLIC. This probably

More information

N J SS W /df W N - 1

N J SS W /df W N - 1 One-Way ANOVA Source Table ANOVA MODEL: ij = µ* + α j + ε ij H 0 : µ = µ =... = µ j or H 0 : Σα j = 0 Source Sum of Squares df Mean Squares F J Between Groups nj( j * ) J - SS B /(J ) MS B /MS W = ( N

More information

Correlation. A statistics method to measure the relationship between two variables. Three characteristics

Correlation. A statistics method to measure the relationship between two variables. Three characteristics Correlation Correlation A statistics method to measure the relationship between two variables Three characteristics Direction of the relationship Form of the relationship Strength/Consistency Direction

More information

Simple Linear Regression: One Qualitative IV

Simple Linear Regression: One Qualitative IV Simple Linear Regression: One Qualitative IV 1. Purpose As noted before regression is used both to explain and predict variation in DVs, and adding to the equation categorical variables extends regression

More information

5. Let W follow a normal distribution with mean of μ and the variance of 1. Then, the pdf of W is

5. Let W follow a normal distribution with mean of μ and the variance of 1. Then, the pdf of W is Practice Final Exam Last Name:, First Name:. Please write LEGIBLY. Answer all questions on this exam in the space provided (you may use the back of any page if you need more space). Show all work but do

More information

Lecture 11: Simple Linear Regression

Lecture 11: Simple Linear Regression Lecture 11: Simple Linear Regression Readings: Sections 3.1-3.3, 11.1-11.3 Apr 17, 2009 In linear regression, we examine the association between two quantitative variables. Number of beers that you drink

More information

Fixed and Random Effects Models: Vartanian, SW 683

Fixed and Random Effects Models: Vartanian, SW 683 : Vartanian, SW 683 Fixed and random effects models See: http://teaching.sociology.ul.ie/dcw/confront/node45.html When you have repeated observations per individual this is a problem and an advantage:

More information

REVIEW 8/2/2017 陈芳华东师大英语系

REVIEW 8/2/2017 陈芳华东师大英语系 REVIEW Hypothesis testing starts with a null hypothesis and a null distribution. We compare what we have to the null distribution, if the result is too extreme to belong to the null distribution (p

More information

Simple Linear Regression

Simple Linear Regression Simple Linear Regression 1 Correlation indicates the magnitude and direction of the linear relationship between two variables. Linear Regression: variable Y (criterion) is predicted by variable X (predictor)

More information

Chapter 1 Linear Regression with One Predictor

Chapter 1 Linear Regression with One Predictor STAT 525 FALL 2018 Chapter 1 Linear Regression with One Predictor Professor Min Zhang Goals of Regression Analysis Serve three purposes Describes an association between X and Y In some applications, the

More information

Failure Time of System due to the Hot Electron Effect

Failure Time of System due to the Hot Electron Effect of System due to the Hot Electron Effect 1 * exresist; 2 option ls=120 ps=75 nocenter nodate; 3 title of System due to the Hot Electron Effect ; 4 * TIME = failure time (hours) of a system due to drift

More information

Simple Linear Regression: One Qualitative IV

Simple Linear Regression: One Qualitative IV Simple Linear Regression: One Qualitative IV Simple linear regression with one qualitative IV variable is essentially identical to linear regression with quantitative variables. The primary difference

More information

Sociology 593 Exam 2 March 28, 2002

Sociology 593 Exam 2 March 28, 2002 Sociology 59 Exam March 8, 00 I. True-False. (0 points) Indicate whether the following statements are true or false. If false, briefly explain why.. A variable is called CATHOLIC. This probably means that

More information

3 Variables: Cyberloafing Conscientiousness Age

3 Variables: Cyberloafing Conscientiousness Age title 'Cyberloafing, Mike Sage'; run; PROC CORR data=sage; var Cyberloafing Conscientiousness Age; run; quit; The CORR Procedure 3 Variables: Cyberloafing Conscientiousness Age Simple Statistics Variable

More information

9 Correlation and Regression

9 Correlation and Regression 9 Correlation and Regression SW, Chapter 12. Suppose we select n = 10 persons from the population of college seniors who plan to take the MCAT exam. Each takes the test, is coached, and then retakes the

More information

Final Exam. Question 1 (20 points) 2 (25 points) 3 (30 points) 4 (25 points) 5 (10 points) 6 (40 points) Total (150 points) Bonus question (10)

Final Exam. Question 1 (20 points) 2 (25 points) 3 (30 points) 4 (25 points) 5 (10 points) 6 (40 points) Total (150 points) Bonus question (10) Name Economics 170 Spring 2004 Honor pledge: I have neither given nor received aid on this exam including the preparation of my one page formula list and the preparation of the Stata assignment for the

More information

STAT 3A03 Applied Regression With SAS Fall 2017

STAT 3A03 Applied Regression With SAS Fall 2017 STAT 3A03 Applied Regression With SAS Fall 2017 Assignment 2 Solution Set Q. 1 I will add subscripts relating to the question part to the parameters and their estimates as well as the errors and residuals.

More information

ESP 178 Applied Research Methods. 2/23: Quantitative Analysis

ESP 178 Applied Research Methods. 2/23: Quantitative Analysis ESP 178 Applied Research Methods 2/23: Quantitative Analysis Data Preparation Data coding create codebook that defines each variable, its response scale, how it was coded Data entry for mail surveys and

More information

Ron Heck, Fall Week 8: Introducing Generalized Linear Models: Logistic Regression 1 (Replaces prior revision dated October 20, 2011)

Ron Heck, Fall Week 8: Introducing Generalized Linear Models: Logistic Regression 1 (Replaces prior revision dated October 20, 2011) Ron Heck, Fall 2011 1 EDEP 768E: Seminar in Multilevel Modeling rev. January 3, 2012 (see footnote) Week 8: Introducing Generalized Linear Models: Logistic Regression 1 (Replaces prior revision dated October

More information

Chapter 1: Linear Regression with One Predictor Variable also known as: Simple Linear Regression Bivariate Linear Regression

Chapter 1: Linear Regression with One Predictor Variable also known as: Simple Linear Regression Bivariate Linear Regression BSTT523: Kutner et al., Chapter 1 1 Chapter 1: Linear Regression with One Predictor Variable also known as: Simple Linear Regression Bivariate Linear Regression Introduction: Functional relation between

More information

Simple Linear Regression: One Quantitative IV

Simple Linear Regression: One Quantitative IV Simple Linear Regression: One Quantitative IV Linear regression is frequently used to explain variation observed in a dependent variable (DV) with theoretically linked independent variables (IV). For example,

More information

Table 1: Fish Biomass data set on 26 streams

Table 1: Fish Biomass data set on 26 streams Math 221: Multiple Regression S. K. Hyde Chapter 27 (Moore, 5th Ed.) The following data set contains observations on the fish biomass of 26 streams. The potential regressors from which we wish to explain

More information

At this point, if you ve done everything correctly, you should have data that looks something like:

At this point, if you ve done everything correctly, you should have data that looks something like: This homework is due on July 19 th. Economics 375: Introduction to Econometrics Homework #4 1. One tool to aid in understanding econometrics is the Monte Carlo experiment. A Monte Carlo experiment allows

More information

One-Way ANOVA. Some examples of when ANOVA would be appropriate include:

One-Way ANOVA. Some examples of when ANOVA would be appropriate include: One-Way ANOVA 1. Purpose Analysis of variance (ANOVA) is used when one wishes to determine whether two or more groups (e.g., classes A, B, and C) differ on some outcome of interest (e.g., an achievement

More information

1 A Review of Correlation and Regression

1 A Review of Correlation and Regression 1 A Review of Correlation and Regression SW, Chapter 12 Suppose we select n = 10 persons from the population of college seniors who plan to take the MCAT exam. Each takes the test, is coached, and then

More information

Lecture 11 Multiple Linear Regression

Lecture 11 Multiple Linear Regression Lecture 11 Multiple Linear Regression STAT 512 Spring 2011 Background Reading KNNL: 6.1-6.5 11-1 Topic Overview Review: Multiple Linear Regression (MLR) Computer Science Case Study 11-2 Multiple Regression

More information

Correlation and Simple Linear Regression

Correlation and Simple Linear Regression Correlation and Simple Linear Regression Sasivimol Rattanasiri, Ph.D Section for Clinical Epidemiology and Biostatistics Ramathibodi Hospital, Mahidol University E-mail: sasivimol.rat@mahidol.ac.th 1 Outline

More information

Introducing Generalized Linear Models: Logistic Regression

Introducing Generalized Linear Models: Logistic Regression Ron Heck, Summer 2012 Seminars 1 Multilevel Regression Models and Their Applications Seminar Introducing Generalized Linear Models: Logistic Regression The generalized linear model (GLM) represents and

More information

Ron Heck, Fall Week 3: Notes Building a Two-Level Model

Ron Heck, Fall Week 3: Notes Building a Two-Level Model Ron Heck, Fall 2011 1 EDEP 768E: Seminar on Multilevel Modeling rev. 9/6/2011@11:27pm Week 3: Notes Building a Two-Level Model We will build a model to explain student math achievement using student-level

More information

using the beginning of all regression models

using the beginning of all regression models Estimating using the beginning of all regression models 3 examples Note about shorthand Cavendish's 29 measurements of the earth's density Heights (inches) of 14 11 year-old males from Alberta study Half-life

More information

Chapter 9 - Correlation and Regression

Chapter 9 - Correlation and Regression Chapter 9 - Correlation and Regression 9. Scatter diagram of percentage of LBW infants (Y) and high-risk fertility rate (X ) in Vermont Health Planning Districts. 9.3 Correlation between percentage of

More information

Midterm 2 - Solutions

Midterm 2 - Solutions Ecn 102 - Analysis of Economic Data University of California - Davis February 23, 2010 Instructor: John Parman Midterm 2 - Solutions You have until 10:20am to complete this exam. Please remember to put

More information

4/22/2010. Test 3 Review ANOVA

4/22/2010. Test 3 Review ANOVA Test 3 Review ANOVA 1 School recruiter wants to examine if there are difference between students at different class ranks in their reported intensity of school spirit. What is the factor? How many levels

More information

Ordinary Least Squares Regression Explained: Vartanian

Ordinary Least Squares Regression Explained: Vartanian Ordinary Least Squares Regression Explained: Vartanian When to Use Ordinary Least Squares Regression Analysis A. Variable types. When you have an interval/ratio scale dependent variable.. When your independent

More information

An Analysis of College Algebra Exam Scores December 14, James D Jones Math Section 01

An Analysis of College Algebra Exam Scores December 14, James D Jones Math Section 01 An Analysis of College Algebra Exam s December, 000 James D Jones Math - Section 0 An Analysis of College Algebra Exam s Introduction Students often complain about a test being too difficult. Are there

More information

Multiple Regression and Model Building (cont d) + GIS Lecture 21 3 May 2006 R. Ryznar

Multiple Regression and Model Building (cont d) + GIS Lecture 21 3 May 2006 R. Ryznar Multiple Regression and Model Building (cont d) + GIS 11.220 Lecture 21 3 May 2006 R. Ryznar Model Summary b 1-[(SSE/n-k+1)/(SST/n-1)] Model 1 Adjusted Std. Error of R R Square R Square the Estimate.991

More information

EXST Regression Techniques Page 1. We can also test the hypothesis H :" œ 0 versus H :"

EXST Regression Techniques Page 1. We can also test the hypothesis H : œ 0 versus H : EXST704 - Regression Techniques Page 1 Using F tests instead of t-tests We can also test the hypothesis H :" œ 0 versus H :" Á 0 with an F test.! " " " F œ MSRegression MSError This test is mathematically

More information

Lecture 4: Multivariate Regression, Part 2

Lecture 4: Multivariate Regression, Part 2 Lecture 4: Multivariate Regression, Part 2 Gauss-Markov Assumptions 1) Linear in Parameters: Y X X X i 0 1 1 2 2 k k 2) Random Sampling: we have a random sample from the population that follows the above

More information

Binary Logistic Regression

Binary Logistic Regression The coefficients of the multiple regression model are estimated using sample data with k independent variables Estimated (or predicted) value of Y Estimated intercept Estimated slope coefficients Ŷ = b

More information

Project Report for STAT571 Statistical Methods Instructor: Dr. Ramon V. Leon. Wage Data Analysis. Yuanlei Zhang

Project Report for STAT571 Statistical Methods Instructor: Dr. Ramon V. Leon. Wage Data Analysis. Yuanlei Zhang Project Report for STAT7 Statistical Methods Instructor: Dr. Ramon V. Leon Wage Data Analysis Yuanlei Zhang 77--7 November, Part : Introduction Data Set The data set contains a random sample of observations

More information

Unit 6 - Introduction to linear regression

Unit 6 - Introduction to linear regression Unit 6 - Introduction to linear regression Suggested reading: OpenIntro Statistics, Chapter 7 Suggested exercises: Part 1 - Relationship between two numerical variables: 7.7, 7.9, 7.11, 7.13, 7.15, 7.25,

More information

9. Linear Regression and Correlation

9. Linear Regression and Correlation 9. Linear Regression and Correlation Data: y a quantitative response variable x a quantitative explanatory variable (Chap. 8: Recall that both variables were categorical) For example, y = annual income,

More information

SPSS LAB FILE 1

SPSS LAB FILE  1 SPSS LAB FILE www.mcdtu.wordpress.com 1 www.mcdtu.wordpress.com 2 www.mcdtu.wordpress.com 3 OBJECTIVE 1: Transporation of Data Set to SPSS Editor INPUTS: Files: group1.xlsx, group1.txt PROCEDURE FOLLOWED:

More information

Lecture 24: Partial correlation, multiple regression, and correlation

Lecture 24: Partial correlation, multiple regression, and correlation Lecture 24: Partial correlation, multiple regression, and correlation Ernesto F. L. Amaral November 21, 2017 Advanced Methods of Social Research (SOCI 420) Source: Healey, Joseph F. 2015. Statistics: A

More information

x3,..., Multiple Regression β q α, β 1, β 2, β 3,..., β q in the model can all be estimated by least square estimators

x3,..., Multiple Regression β q α, β 1, β 2, β 3,..., β q in the model can all be estimated by least square estimators Multiple Regression Relating a response (dependent, input) y to a set of explanatory (independent, output, predictor) variables x, x 2, x 3,, x q. A technique for modeling the relationship between variables.

More information

Multiple Regression and Model Building Lecture 20 1 May 2006 R. Ryznar

Multiple Regression and Model Building Lecture 20 1 May 2006 R. Ryznar Multiple Regression and Model Building 11.220 Lecture 20 1 May 2006 R. Ryznar Building Models: Making Sure the Assumptions Hold 1. There is a linear relationship between the explanatory (independent) variable(s)

More information

Self-Assessment Weeks 8: Multiple Regression with Qualitative Predictors; Multiple Comparisons

Self-Assessment Weeks 8: Multiple Regression with Qualitative Predictors; Multiple Comparisons Self-Assessment Weeks 8: Multiple Regression with Qualitative Predictors; Multiple Comparisons 1. Suppose we wish to assess the impact of five treatments while blocking for study participant race (Black,

More information

STA441: Spring Multiple Regression. This slide show is a free open source document. See the last slide for copyright information.

STA441: Spring Multiple Regression. This slide show is a free open source document. See the last slide for copyright information. STA441: Spring 2018 Multiple Regression This slide show is a free open source document. See the last slide for copyright information. 1 Least Squares Plane 2 Statistical MODEL There are p-1 explanatory

More information

Correlation and the Analysis of Variance Approach to Simple Linear Regression

Correlation and the Analysis of Variance Approach to Simple Linear Regression Correlation and the Analysis of Variance Approach to Simple Linear Regression Biometry 755 Spring 2009 Correlation and the Analysis of Variance Approach to Simple Linear Regression p. 1/35 Correlation

More information

MATH ASSIGNMENT 2: SOLUTIONS

MATH ASSIGNMENT 2: SOLUTIONS MATH 204 - ASSIGNMENT 2: SOLUTIONS (a) Fitting the simple linear regression model to each of the variables in turn yields the following results: we look at t-tests for the individual coefficients, and

More information

Answer Key: Problem Set 6

Answer Key: Problem Set 6 : Problem Set 6 1. Consider a linear model to explain monthly beer consumption: beer = + inc + price + educ + female + u 0 1 3 4 E ( u inc, price, educ, female ) = 0 ( u inc price educ female) σ inc var,,,

More information

STA 303 H1S / 1002 HS Winter 2011 Test March 7, ab 1cde 2abcde 2fghij 3

STA 303 H1S / 1002 HS Winter 2011 Test March 7, ab 1cde 2abcde 2fghij 3 STA 303 H1S / 1002 HS Winter 2011 Test March 7, 2011 LAST NAME: FIRST NAME: STUDENT NUMBER: ENROLLED IN: (circle one) STA 303 STA 1002 INSTRUCTIONS: Time: 90 minutes Aids allowed: calculator. Some formulae

More information

Lecture 13 Extra Sums of Squares

Lecture 13 Extra Sums of Squares Lecture 13 Extra Sums of Squares STAT 512 Spring 2011 Background Reading KNNL: 7.1-7.4 13-1 Topic Overview Extra Sums of Squares (Defined) Using and Interpreting R 2 and Partial-R 2 Getting ESS and Partial-R

More information

Booklet of Code and Output for STAC32 Final Exam

Booklet of Code and Output for STAC32 Final Exam Booklet of Code and Output for STAC32 Final Exam December 7, 2017 Figure captions are below the Figures they refer to. LowCalorie LowFat LowCarbo Control 8 2 3 2 9 4 5 2 6 3 4-1 7 5 2 0 3 1 3 3 Figure

More information

One-Way ANOVA Source Table J - 1 SS B / J - 1 MS B /MS W. Pairwise Post-Hoc Comparisons of Means

One-Way ANOVA Source Table J - 1 SS B / J - 1 MS B /MS W. Pairwise Post-Hoc Comparisons of Means One-Way ANOVA Source Table ANOVA MODEL: ij = µ* + α j + ε ij H 0 : µ 1 = µ =... = µ j or H 0 : Σα j = 0 Source Sum of Squares df Mean Squares F Between Groups n j ( j - * ) J - 1 SS B / J - 1 MS B /MS

More information

Ecn Analysis of Economic Data University of California - Davis February 23, 2010 Instructor: John Parman. Midterm 2. Name: ID Number: Section:

Ecn Analysis of Economic Data University of California - Davis February 23, 2010 Instructor: John Parman. Midterm 2. Name: ID Number: Section: Ecn 102 - Analysis of Economic Data University of California - Davis February 23, 2010 Instructor: John Parman Midterm 2 You have until 10:20am to complete this exam. Please remember to put your name,

More information

Unit 6 - Simple linear regression

Unit 6 - Simple linear regression Sta 101: Data Analysis and Statistical Inference Dr. Çetinkaya-Rundel Unit 6 - Simple linear regression LO 1. Define the explanatory variable as the independent variable (predictor), and the response variable

More information

Practical Biostatistics

Practical Biostatistics Practical Biostatistics Clinical Epidemiology, Biostatistics and Bioinformatics AMC Multivariable regression Day 5 Recap Describing association: Correlation Parametric technique: Pearson (PMCC) Non-parametric:

More information

Sociology Research Statistics I Final Exam Answer Key December 15, 1993

Sociology Research Statistics I Final Exam Answer Key December 15, 1993 Sociology 592 - Research Statistics I Final Exam Answer Key December 15, 1993 Where appropriate, show your work - partial credit may be given. (On the other hand, don't waste a lot of time on excess verbiage.)

More information

Lecture 12 Inference in MLR

Lecture 12 Inference in MLR Lecture 12 Inference in MLR STAT 512 Spring 2011 Background Reading KNNL: 6.6-6.7 12-1 Topic Overview Review MLR Model Inference about Regression Parameters Estimation of Mean Response Prediction 12-2

More information

Statistics 5100 Spring 2018 Exam 1

Statistics 5100 Spring 2018 Exam 1 Statistics 5100 Spring 2018 Exam 1 Directions: You have 60 minutes to complete the exam. Be sure to answer every question, and do not spend too much time on any part of any question. Be concise with all

More information

Final Exam - Solutions

Final Exam - Solutions Ecn 102 - Analysis of Economic Data University of California - Davis March 19, 2010 Instructor: John Parman Final Exam - Solutions You have until 5:30pm to complete this exam. Please remember to put your

More information

Immigration attitudes (opposes immigration or supports it) it may seriously misestimate the magnitude of the effects of IVs

Immigration attitudes (opposes immigration or supports it) it may seriously misestimate the magnitude of the effects of IVs Logistic Regression, Part I: Problems with the Linear Probability Model (LPM) Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised February 22, 2015 This handout steals

More information

Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami

Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami Parametric Assumptions The observations must be independent. Dependent variable should be continuous

More information

STA 302 H1F / 1001 HF Fall 2007 Test 1 October 24, 2007

STA 302 H1F / 1001 HF Fall 2007 Test 1 October 24, 2007 STA 302 H1F / 1001 HF Fall 2007 Test 1 October 24, 2007 LAST NAME: SOLUTIONS FIRST NAME: STUDENT NUMBER: ENROLLED IN: (circle one) STA 302 STA 1001 INSTRUCTIONS: Time: 90 minutes Aids allowed: calculator.

More information

Draft Proof - Do not copy, post, or distribute. Chapter Learning Objectives REGRESSION AND CORRELATION THE SCATTER DIAGRAM

Draft Proof - Do not copy, post, or distribute. Chapter Learning Objectives REGRESSION AND CORRELATION THE SCATTER DIAGRAM 1 REGRESSION AND CORRELATION As we learned in Chapter 9 ( Bivariate Tables ), the differential access to the Internet is real and persistent. Celeste Campos-Castillo s (015) research confirmed the impact

More information

Simple Linear Regression Using Ordinary Least Squares

Simple Linear Regression Using Ordinary Least Squares Simple Linear Regression Using Ordinary Least Squares Purpose: To approximate a linear relationship with a line. Reason: We want to be able to predict Y using X. Definition: The Least Squares Regression

More information

Multiple Regression: Inference

Multiple Regression: Inference Multiple Regression: Inference The t-test: is ˆ j big and precise enough? We test the null hypothesis: H 0 : β j =0; i.e. test that x j has no effect on y once the other explanatory variables are controlled

More information

Multiple Regression. More Hypothesis Testing. More Hypothesis Testing The big question: What we really want to know: What we actually know: We know:

Multiple Regression. More Hypothesis Testing. More Hypothesis Testing The big question: What we really want to know: What we actually know: We know: Multiple Regression Ψ320 Ainsworth More Hypothesis Testing What we really want to know: Is the relationship in the population we have selected between X & Y strong enough that we can use the relationship

More information

Problem Set 10: Panel Data

Problem Set 10: Panel Data Problem Set 10: Panel Data 1. Read in the data set, e11panel1.dta from the course website. This contains data on a sample or 1252 men and women who were asked about their hourly wage in two years, 2005

More information

Example. Multiple Regression. Review of ANOVA & Simple Regression /749 Experimental Design for Behavioral and Social Sciences

Example. Multiple Regression. Review of ANOVA & Simple Regression /749 Experimental Design for Behavioral and Social Sciences 36-309/749 Experimental Design for Behavioral and Social Sciences Sep. 29, 2015 Lecture 5: Multiple Regression Review of ANOVA & Simple Regression Both Quantitative outcome Independent, Gaussian errors

More information

Class Notes: Week 8. Probit versus Logit Link Functions and Count Data

Class Notes: Week 8. Probit versus Logit Link Functions and Count Data Ronald Heck Class Notes: Week 8 1 Class Notes: Week 8 Probit versus Logit Link Functions and Count Data This week we ll take up a couple of issues. The first is working with a probit link function. While

More information

Section 9c. Propensity scores. Controlling for bias & confounding in observational studies

Section 9c. Propensity scores. Controlling for bias & confounding in observational studies Section 9c Propensity scores Controlling for bias & confounding in observational studies 1 Logistic regression and propensity scores Consider comparing an outcome in two treatment groups: A vs B. In a

More information

Models for Clustered Data

Models for Clustered Data Models for Clustered Data Edps/Psych/Soc 589 Carolyn J Anderson Department of Educational Psychology c Board of Trustees, University of Illinois Spring 2019 Outline Notation NELS88 data Fixed Effects ANOVA

More information

ANALYSES OF NCGS DATA FOR ALCOHOL STATUS CATEGORIES 1 22:46 Sunday, March 2, 2003

ANALYSES OF NCGS DATA FOR ALCOHOL STATUS CATEGORIES 1 22:46 Sunday, March 2, 2003 ANALYSES OF NCGS DATA FOR ALCOHOL STATUS CATEGORIES 1 22:46 Sunday, March 2, 2003 The MEANS Procedure DRINKING STATUS=1 Analysis Variable : TRIGL N Mean Std Dev Minimum Maximum 164 151.6219512 95.3801744

More information

1 Correlation and Inference from Regression

1 Correlation and Inference from Regression 1 Correlation and Inference from Regression Reading: Kennedy (1998) A Guide to Econometrics, Chapters 4 and 6 Maddala, G.S. (1992) Introduction to Econometrics p. 170-177 Moore and McCabe, chapter 12 is

More information

Lecture 3: Multiple Regression. Prof. Sharyn O Halloran Sustainable Development U9611 Econometrics II

Lecture 3: Multiple Regression. Prof. Sharyn O Halloran Sustainable Development U9611 Econometrics II Lecture 3: Multiple Regression Prof. Sharyn O Halloran Sustainable Development Econometrics II Outline Basics of Multiple Regression Dummy Variables Interactive terms Curvilinear models Review Strategies

More information

Lab 07 Introduction to Econometrics

Lab 07 Introduction to Econometrics Lab 07 Introduction to Econometrics Learning outcomes for this lab: Introduce the different typologies of data and the econometric models that can be used Understand the rationale behind econometrics Understand

More information

Models for Clustered Data

Models for Clustered Data Models for Clustered Data Edps/Psych/Stat 587 Carolyn J Anderson Department of Educational Psychology c Board of Trustees, University of Illinois Fall 2017 Outline Notation NELS88 data Fixed Effects ANOVA

More information

" M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2

 M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2 Notation and Equations for Final Exam Symbol Definition X The variable we measure in a scientific study n The size of the sample N The size of the population M The mean of the sample µ The mean of the

More information

(Where does Ch. 7 on comparing 2 means or 2 proportions fit into this?)

(Where does Ch. 7 on comparing 2 means or 2 proportions fit into this?) 12. Comparing Groups: Analysis of Variance (ANOVA) Methods Response y Explanatory x var s Method Categorical Categorical Contingency tables (Ch. 8) (chi-squared, etc.) Quantitative Quantitative Regression

More information

STA 101 Final Review

STA 101 Final Review STA 101 Final Review Statistics 101 Thomas Leininger June 24, 2013 Announcements All work (besides projects) should be returned to you and should be entered on Sakai. Office Hour: 2 3pm today (Old Chem

More information

Overview Scatter Plot Example

Overview Scatter Plot Example Overview Topic 22 - Linear Regression and Correlation STAT 5 Professor Bruce Craig Consider one population but two variables For each sampling unit observe X and Y Assume linear relationship between variables

More information