In Class Review Exercises Vartanian: SW 540
|
|
- Homer Pierce
- 6 years ago
- Views:
Transcription
1 In Class Review Exercises Vartanian: SW Given the following output from an OLS model looking at income, what is the slope and intercept for those who are black and those who are not black? b SE intercept 5 8 Black 9 3 Age We want to use residuals to determine the relationship of mental health problems and income, controlling for age. What is the partial r coefficient using residuals? 3. How do we determine significance in an ANOVA model? What factors do we compare? 4. Use the two types of analyses we ve learned to examine the following rankings. Math Reading Given the following nominal scale variables, what is the direction of the relationship between the variables, what is the chi-square value, and do we have a statistically significant relationship between the variables? Treatment Control Total Depressed Not Depressed Total Given the following results, what is your prediction for income for the mean individual? The DV is family income. The IVs are years of education of the head (I/R), parents expect the child to get a college degree (dummy excluded are those parents with lower expectations), and family members hit each other (dummy excluded are those that do not hit each other). C:\WP60_1\LECT1.PHD\Final\Review Exercises in Class Final.doc 1
2 Regression Model 1 Model Summary Adjusted Std. Error of R R Square R Square the Estimate.415 a a. Predictors: (Constant), family members hit each other, education of head, PCGs educ expectat of child: coll degr ANOVA b Model Squares df Square F Sig. 1 Regression 1.21E E a Residual 5.81E Total 7.02E a. Predictors: (Constant), family members hit each other, education of head, PCGs educ expectat of child: coll degr b. Dependent Variable: total family income Model 1 (Constant) education of head PCGs educ expectat of child: coll degr family members hit each other Coefficients a Unstandardized Coefficients a. Dependent Variable: total family income Standardized Coefficients B Std. Error Beta t Sig Descriptive Statistics total family income education of head family members hit each other PCGs educ expectat of child: coll degr Valid N (listwise) N Minimum Maximum Std. Deviation #7. A. Indicate the meaning of the standardized coefficient estimates in problem #6. B. If education of the head increased by grades, what is your prediction for the change in income? Give this in standard deviation units and in change in actual income. C:\WP60_1\LECT1.PHD\Final\Review Exercises in Class Final.doc 2
3 #8. You are examining a number of childhood predictor variables on the log of income as an adult. You use a number of race variables (with white exclude), gender (with female excluded), whether either of the person s parents were born outside of the U.S. (dummy), 8 th grade grades, standardized test score (I/R), and socioeconomic status in 8 th grade (made into an interval level variable). A. Why do we use log dependent variables? B. Which of the variables has the greatest effect on the log of income? C. Interpret the coefficient estimates for each of the variables. Coefficients a Model 1 (Constant) ASIAN BLACK AMIND HISPANIC MALE either parent born outside US GRADES IN 8TH GRADE SOCIO-ECONOMIC STATUS COMPOSITE STANDSCO a. Dependent Variable: LOGINC Unstandardized Coefficients Standardized Coefficients B Std. Error Beta t Sig E E E E E E D. You run a second model with regular income as the DV. Which of these two models is the better model? The results are given below. C:\WP60_1\LECT1.PHD\Final\Review Exercises in Class Final.doc 3
4 Model 1 (Constant) ASIAN BLACK AMIND HISPANIC MALE either parent born outside US GRADES IN 8TH GRADE SOCIO-ECONOMIC STATUS COMPOSITE STANDSCO a. Dependent Variable: INCOME99 Coefficients a Unstandardized Coefficients Standardized Coefficients B Std. Error Beta t Sig A. Are either of the following models statistically significant? B. What is the meaning of a model being statistically significant? C. What is the adjusted or corrected R 2 value in each of the models? D. Is there much difference between the R 2 value and the corrected R 2 value? Why? E. In Model 1, what is the predicted level of income (not log income) for someone who has 0 sibling, is female, has a 3.5 GPA (grades) and 0 SES. Model 1: Using Log income as the DV. IVs are # of siblings, gender, SES (I/R), 8 th grade grades. Model: MODEL1 Dependent Variable: loginc Source DF Squares Model Error Corrected Total Variable Label DF Estimate Error t Value Pr > t Intercept Intercept <.0001 sibs female BYSES SOCIO-ECONOMIC STATUS <.0001 COMPOSITE BYGRADS GRADES IN 8TH GRADE <.0001 Model 2, Using Income as the DV Dependent Variable: INCOME99 Source DF Squares Square F Value Pr > F C:\WP60_1\LECT1.PHD\Final\Review Exercises in Class Final.doc 4
5 Model E <.0001 Error E Corrected Total E12 Root MSE R-Square Dependent Adj R-Sq Coeff Var Variable Label DF Estimate Error t Value Pr > t Intercept Intercept <.0001 sibs female BYSES SOCIO-ECONOMIC STATUS <.0001 COMPOSITE BYGRADS GRADES IN 8TH GRADE <.0001 C:\WP60_1\LECT1.PHD\Final\Review Exercises in Class Final.doc 5
6 10. You want to determine if area of residence helps in predicting grade point average. You are looking at 3 different areas: Urban, suburban, and rural. You get the results below, including a post hoc Scheffe test. Oneway ANOVA GRADES IN 8TH GRADE Squares df Square F Sig. Between Groups Within Groups Total Post Hoc Tests Dependent Variable: GRADES IN 8TH GRADE Scheffe Multiple Comparisons (I) Area of residence Urban Suburban Rural (J) Area of residence Suburban Rural Urban Rural Urban Suburban *. The mean difference is significant at the.05 level. A. Is there a significant difference among groups? B. Which groups are different? C. How different are the individual groups? Difference 95% Confidence Interval (I-J) Std. Error Sig. Lower Bound Upper Bound * * * * You have the following 3 groups and want to determine if there is a statistically significant difference in the groups. Is there? Married Never Married Divorce/Separate Sum C:\WP60_1\LECT1.PHD\Final\Review Exercises in Class Final.doc 6
7 12. Given the following data, how would we use residuals to determine partial r s and partial b s? data a; input kids nbpov income; cards; ; proc reg; model income=kids; output out=b r=resid1 p=pred1; run; proc reg;model nbpov=kids; output out=c r=resid2 p=pred2; run; proc reg;model kids=nbpov; output out=e r=resid3 p=pred3; run; proc reg;model income=nbpov; output out=f r=resid4 p=pred4; run; proc reg;model kids=income; output out=e r=resid5 p=pred5; run; proc reg;model nbpov=income; output out=f r=resid6 p=pred6; run; C:\WP60_1\LECT1.PHD\Final\Review Exercises in Class Final.doc 7
8 The SAS System 06:54 Wednesday, December 10, The REG Procedure Model: MODEL1 Dependent Variable: income Source DF Squares Square F Value Pr > F Model Error Corrected Total Root MSE R-Square Dependent Adj R-Sq Coeff Var Parameter Estimates Variable DF Estimate Error t Value Pr > t Intercept <.0001 kids The SAS System 06:54 Wednesday, December 10, The REG Procedure Model: MODEL1 Dependent Variable: nbpov Source DF Squares Square F Value Pr > F Model Error Corrected Total Root MSE R-Square Dependent Adj R-Sq Coeff Var Parameter Estimates Variable DF Estimate Error t Value Pr > t Intercept <.0001 kids C:\WP60_1\LECT1.PHD\Final\Review Exercises in Class Final.doc 8
9 The SAS System 06:54 Wednesday, December 10, The REG Procedure Model: MODEL1 Dependent Variable: kids Source DF Squares Square F Value Pr > F Model Error Corrected Total Root MSE R-Square Dependent Adj R-Sq Coeff Var Parameter Estimates Variable DF Estimate Error t Value Pr > t Intercept nbpov The SAS System 06:54 Wednesday, December 10, The REG Procedure Model: MODEL1 Dependent Variable: income Source DF Squares Square F Value Pr > F Model Error Corrected Total Root MSE R-Square Dependent Adj R-Sq Coeff Var Parameter Estimates Variable DF Estimate Error t Value Pr > t Intercept <.0001 nbpov C:\WP60_1\LECT1.PHD\Final\Review Exercises in Class Final.doc 9
10 The SAS System 06:54 Wednesday, December 10, The REG Procedure Model: MODEL1 Dependent Variable: kids Source DF Squares Square F Value Pr > F Model Error Corrected Total Root MSE R-Square Dependent Adj R-Sq Coeff Var Parameter Estimates Variable DF Estimate Error t Value Pr > t Intercept income The SAS System 06:54 Wednesday, December 10, The REG Procedure Model: MODEL1 Dependent Variable: nbpov Source DF Squares Square F Value Pr > F Model Error Corrected Total Root MSE R-Square Dependent Adj R-Sq Coeff Var Parameter Estimates Variable DF Estimate Error t Value Pr > t Intercept <.0001 income C:\WP60_1\LECT1.PHD\Final\Review Exercises in Class Final.doc 10
11 13. You have standardized all of your continuous variables by dividing by their respective standard deviations. Interpret the following output. DV: Income IV: Kids IV: nb poverty rate IV: White b SE Kids NBPOV White C:\WP60_1\LECT1.PHD\Final\Review Exercises in Class Final.doc 11
12 #1 Answer. Yp=5 + 9 Black + 7 Age Black: Yp=5+9+7Age Yp=14+7Age White: Yp=5 + 7Age Answer for #2. MHP=X1 Income=Y Age=X2 Y=a+bx2+e1 X1=a+bx2+e2 Correlate e1 and e2. Answer for #3. We compare the within to the between. If we find a relatively large between and a relatively small within, group membership helps us in predicting the outcome. #4 Answer. Correlations Kendall's tau_b Spearman's rho MATH READING MATH READING Correlation Coefficient Sig. (2-tailed) N Correlation Coefficient Sig. (2-tailed) N Correlation Coefficient Sig. (2-tailed) N Correlation Coefficient Sig. (2-tailed) N MATH READING Neither Kendall or Spearman show a significant relationship. C:\WP60_1\LECT1.PHD\Final\Review Exercises in Class Final.doc 12
13 #5 Answer. Treatment Control Total Depressed 30 (28) 50 (52) 80 Not Depressed 40 (42) 80 (78) 120 Total Positive relationship between the control group and not depressed. Chi-square value is: Obs Expect Difference Difference Squared chi-squared value /28= /52= /42= /78=.05 chi-square value = =.37 at 1 DF. CV at.05 level is Therefore, you will fail to reject H 0. #6 Answer. To come up with the predicted value for the mean individual, you would use the Yp equation with the b values from the coefficient estimates, and use the mean values for the variables for the X values. Yp = * *.67+(-9701)*.36 = Answer for #7. A. For a one standard deviation increase in education level of the head, we predict that income will increase by.339 standard deviation units. B. If education level increased by roughly 2 standard deviation units, we would predict that income would increase by.678 standard deviation units. Because the standard deviation for income is 54528, a 2 standard deviation increase in education level would lead to a.678*54528 = increase in income. Answers for #8. A. We use log dependent variables to decrease the effects of outliers and to help interpret the effects of independent variables on scaled dependent variables. B. It appears that 8 th grade grades has the larger effect on log income relative to the other I/R variables. Standardized test scores appear to have no effect on income as an adult. C. You would need to make the transformation of these coefficient estimates using the exponential function in order to get the percentage change for a one unit increase in the IV or the percentage difference between the include and the excluded group. C:\WP60_1\LECT1.PHD\Final\Review Exercises in Class Final.doc 13
14 D. You would need to know the R2 values of the two models to know which is a better fitting model. Answers for #9. First Equation: R 2 = 96.19/ = Adjusted R 2 = [4/( ) * ( )] = There are at least 2 ways to determine the F value. One way is by using the R 2 value. F 4,8802 =[(.0175)/4] / [( )/8802] = Or use the mean square model/mean square error = (96.19/4) / ( /8802) = The critical value for a.05 F test at 4 and 8802 DFs is For a.01 test, the critical value is Our F value exceeds these critical values and therefore we will reject the null hypothesis. Second Equation R 2 = / = (roughly) (we could have added lots of zeros onto the end of these values but as long as we know that denominator is one decimal place larger than the numerator, we can determine their relative values.) Adjusted R 2 = [4/( ) * ( )] = = There are at least 2 ways to determine the F value. One way is by using the R 2 value. F 4,8802 =[(.0171)/4] / [( )/8802] = Or use the mean square model/mean square error = ( /4) / ( /8802) = (Again, we know that the denominator has an extra decimal place or is one decimal place farther to the right -- relative to the numerator.) The critical value for a.05 F test at 4 and 8802 DFs is For a.01 test, the critical value is Our F value exceeds these critical values and therefore we will reject the null hypothesis. The meaning of a model being statistically significant: In all likelihood, the set of independent variables helps us in explaining the variance of the dependent variable. The R2 and corrected R2 are very similar because we are not using many independent variables in the model and because our sample size is very large. E. Yp= ( )*0+( )*1+(.07569)*0+(.09279)*3.5 = Take the exponential of this: e = $ C:\WP60_1\LECT1.PHD\Final\Review Exercises in Class Final.doc 14
15 Answers for #10. A. The F test indicates that there is an overall difference among the groups, statistically significant at the.001 level. B. There is no difference between urban and suburban group. There is a difference between the rural and the urban groups, and the suburban and rural groups. C. The GPA of urban groups is.0648 points higher than the rural group. The GPA for the suburban group is.0600 high relative to the rural group. Answers for #11. Determine the within sums of squares: Married group: mean =6. (6-6) 2 +(5-6) 2 +(6-6) 2 + (7-6) 2 = 2 NM Group: =8. (7-8) 2 +(8-8) 2 + (9-8) 2 + (8-8) 2 = 2 DS Group: =9. (9-9) 2 + (10-9) 2 +(8-9) 2 + (9-9) 2 = 2 The within sums of square = 6. Divide this value by n-k, or 12-3 =9. The mean of the within sums of squares = 6/9 =.667. To determine the between sums of squares we first need to determine the overall mean value: ( )/12 = 92/12 = *(6-7.67) 2 + 4*(8-7.67) 2 + 4*(9-7.67) 2 = 4*(-1.67) 2 + 4*(.33) 2 + 4*(1.33) 2 = 4* * * 1.77 = To get the mean between SS, divide by k-1, or 2: 18.64/2 = F 2,9 = 9.32/.667 =13.97 The critical F values are 4.26 at the.05 level and 8.02 at the.01 level. We will reject the null hypothesis at both levels. Sort of an answer for #12. We would use the intercept values and b coefficients from the various bivariate regression models to determine residuals for particular partial correlations and b coefficient estimates that were of interest to us. For example, if we wanted to partial out the effect of kids, we would use the first two regression models, come up with residuals for each of those two models, then either run bivariate correlations between the two residuals or run a regression with the two residuals. You will choose the residual that results from the equation using the dependent variable as the dependent variable in a regression model. You will use the residual that results from the use of the independent variable as independent variable in the regression. 13. For a 1 SD increase in kids, income is predicted to increase by 1.2 SD units. For a 1 SD unit increase in NB poverty, income is predicted to decrease by.5 SD units. Whites have incomes that are 1.6 SD units higher than non-whites. C:\WP60_1\LECT1.PHD\Final\Review Exercises in Class Final.doc 15
Review of Multiple Regression
Ronald H. Heck 1 Let s begin with a little review of multiple regression this week. Linear models [e.g., correlation, t-tests, analysis of variance (ANOVA), multiple regression, path analysis, multivariate
More informationy response variable x 1, x 2,, x k -- a set of explanatory variables
11. Multiple Regression and Correlation y response variable x 1, x 2,, x k -- a set of explanatory variables In this chapter, all variables are assumed to be quantitative. Chapters 12-14 show how to incorporate
More informationGeneral Linear Model (Chapter 4)
General Linear Model (Chapter 4) Outcome variable is considered continuous Simple linear regression Scatterplots OLS is BLUE under basic assumptions MSE estimates residual variance testing regression coefficients
More informationArea1 Scaled Score (NAPLEX) .535 ** **.000 N. Sig. (2-tailed)
Institutional Assessment Report Texas Southern University College of Pharmacy and Health Sciences "An Analysis of 2013 NAPLEX, P4-Comp. Exams and P3 courses The following analysis illustrates relationships
More informationSPSS Output. ANOVA a b Residual Coefficients a Standardized Coefficients
SPSS Output Homework 1-1e ANOVA a Sum of Squares df Mean Square F Sig. 1 Regression 351.056 1 351.056 11.295.002 b Residual 932.412 30 31.080 Total 1283.469 31 a. Dependent Variable: Sexual Harassment
More informationCan you tell the relationship between students SAT scores and their college grades?
Correlation One Challenge Can you tell the relationship between students SAT scores and their college grades? A: The higher SAT scores are, the better GPA may be. B: The higher SAT scores are, the lower
More informationEDF 7405 Advanced Quantitative Methods in Educational Research. Data are available on IQ of the child and seven potential predictors.
EDF 7405 Advanced Quantitative Methods in Educational Research Data are available on IQ of the child and seven potential predictors. Four are medical variables available at the birth of the child: Birthweight
More informationSTAT 3900/4950 MIDTERM TWO Name: Spring, 2015 (print: first last ) Covered topics: Two-way ANOVA, ANCOVA, SLR, MLR and correlation analysis
STAT 3900/4950 MIDTERM TWO Name: Spring, 205 (print: first last ) Covered topics: Two-way ANOVA, ANCOVA, SLR, MLR and correlation analysis Instructions: You may use your books, notes, and SPSS/SAS. NO
More informationRegression: Main Ideas Setting: Quantitative outcome with a quantitative explanatory variable. Example, cont.
TCELL 9/4/205 36-309/749 Experimental Design for Behavioral and Social Sciences Simple Regression Example Male black wheatear birds carry stones to the nest as a form of sexual display. Soler et al. wanted
More informationTwo-Way ANOVA. Chapter 15
Two-Way ANOVA Chapter 15 Interaction Defined An interaction is present when the effects of one IV depend upon a second IV Interaction effect : The effect of each IV across the levels of the other IV When
More informationAnalysis of Variance. Source DF Squares Square F Value Pr > F. Model <.0001 Error Corrected Total
Math 221: Linear Regression and Prediction Intervals S. K. Hyde Chapter 23 (Moore, 5th Ed.) (Neter, Kutner, Nachsheim, and Wasserman) The Toluca Company manufactures refrigeration equipment as well as
More information1 Independent Practice: Hypothesis tests for one parameter:
1 Independent Practice: Hypothesis tests for one parameter: Data from the Indian DHS survey from 2006 includes a measure of autonomy of the women surveyed (a scale from 0-10, 10 being the most autonomous)
More information36-309/749 Experimental Design for Behavioral and Social Sciences. Sep. 22, 2015 Lecture 4: Linear Regression
36-309/749 Experimental Design for Behavioral and Social Sciences Sep. 22, 2015 Lecture 4: Linear Regression TCELL Simple Regression Example Male black wheatear birds carry stones to the nest as a form
More informationOrdinary Least Squares Regression Explained: Vartanian
Ordinary Least Squares Regression Eplained: Vartanian When to Use Ordinary Least Squares Regression Analysis A. Variable types. When you have an interval/ratio scale dependent variable.. When your independent
More informationLecture 1 Linear Regression with One Predictor Variable.p2
Lecture Linear Regression with One Predictor Variablep - Basics - Meaning of regression parameters p - β - the slope of the regression line -it indicates the change in mean of the probability distn of
More informationLecture (chapter 13): Association between variables measured at the interval-ratio level
Lecture (chapter 13): Association between variables measured at the interval-ratio level Ernesto F. L. Amaral April 9 11, 2018 Advanced Methods of Social Research (SOCI 420) Source: Healey, Joseph F. 2015.
More informationDifference in two or more average scores in different groups
ANOVAs Analysis of Variance (ANOVA) Difference in two or more average scores in different groups Each participant tested once Same outcome tested in each group Simplest is one-way ANOVA (one variable as
More informationSociology 593 Exam 2 Answer Key March 28, 2002
Sociology 59 Exam Answer Key March 8, 00 I. True-False. (0 points) Indicate whether the following statements are true or false. If false, briefly explain why.. A variable is called CATHOLIC. This probably
More informationN J SS W /df W N - 1
One-Way ANOVA Source Table ANOVA MODEL: ij = µ* + α j + ε ij H 0 : µ = µ =... = µ j or H 0 : Σα j = 0 Source Sum of Squares df Mean Squares F J Between Groups nj( j * ) J - SS B /(J ) MS B /MS W = ( N
More informationCorrelation. A statistics method to measure the relationship between two variables. Three characteristics
Correlation Correlation A statistics method to measure the relationship between two variables Three characteristics Direction of the relationship Form of the relationship Strength/Consistency Direction
More informationSimple Linear Regression: One Qualitative IV
Simple Linear Regression: One Qualitative IV 1. Purpose As noted before regression is used both to explain and predict variation in DVs, and adding to the equation categorical variables extends regression
More information5. Let W follow a normal distribution with mean of μ and the variance of 1. Then, the pdf of W is
Practice Final Exam Last Name:, First Name:. Please write LEGIBLY. Answer all questions on this exam in the space provided (you may use the back of any page if you need more space). Show all work but do
More informationLecture 11: Simple Linear Regression
Lecture 11: Simple Linear Regression Readings: Sections 3.1-3.3, 11.1-11.3 Apr 17, 2009 In linear regression, we examine the association between two quantitative variables. Number of beers that you drink
More informationFixed and Random Effects Models: Vartanian, SW 683
: Vartanian, SW 683 Fixed and random effects models See: http://teaching.sociology.ul.ie/dcw/confront/node45.html When you have repeated observations per individual this is a problem and an advantage:
More informationREVIEW 8/2/2017 陈芳华东师大英语系
REVIEW Hypothesis testing starts with a null hypothesis and a null distribution. We compare what we have to the null distribution, if the result is too extreme to belong to the null distribution (p
More informationSimple Linear Regression
Simple Linear Regression 1 Correlation indicates the magnitude and direction of the linear relationship between two variables. Linear Regression: variable Y (criterion) is predicted by variable X (predictor)
More informationChapter 1 Linear Regression with One Predictor
STAT 525 FALL 2018 Chapter 1 Linear Regression with One Predictor Professor Min Zhang Goals of Regression Analysis Serve three purposes Describes an association between X and Y In some applications, the
More informationFailure Time of System due to the Hot Electron Effect
of System due to the Hot Electron Effect 1 * exresist; 2 option ls=120 ps=75 nocenter nodate; 3 title of System due to the Hot Electron Effect ; 4 * TIME = failure time (hours) of a system due to drift
More informationSimple Linear Regression: One Qualitative IV
Simple Linear Regression: One Qualitative IV Simple linear regression with one qualitative IV variable is essentially identical to linear regression with quantitative variables. The primary difference
More informationSociology 593 Exam 2 March 28, 2002
Sociology 59 Exam March 8, 00 I. True-False. (0 points) Indicate whether the following statements are true or false. If false, briefly explain why.. A variable is called CATHOLIC. This probably means that
More information3 Variables: Cyberloafing Conscientiousness Age
title 'Cyberloafing, Mike Sage'; run; PROC CORR data=sage; var Cyberloafing Conscientiousness Age; run; quit; The CORR Procedure 3 Variables: Cyberloafing Conscientiousness Age Simple Statistics Variable
More information9 Correlation and Regression
9 Correlation and Regression SW, Chapter 12. Suppose we select n = 10 persons from the population of college seniors who plan to take the MCAT exam. Each takes the test, is coached, and then retakes the
More informationFinal Exam. Question 1 (20 points) 2 (25 points) 3 (30 points) 4 (25 points) 5 (10 points) 6 (40 points) Total (150 points) Bonus question (10)
Name Economics 170 Spring 2004 Honor pledge: I have neither given nor received aid on this exam including the preparation of my one page formula list and the preparation of the Stata assignment for the
More informationSTAT 3A03 Applied Regression With SAS Fall 2017
STAT 3A03 Applied Regression With SAS Fall 2017 Assignment 2 Solution Set Q. 1 I will add subscripts relating to the question part to the parameters and their estimates as well as the errors and residuals.
More informationESP 178 Applied Research Methods. 2/23: Quantitative Analysis
ESP 178 Applied Research Methods 2/23: Quantitative Analysis Data Preparation Data coding create codebook that defines each variable, its response scale, how it was coded Data entry for mail surveys and
More informationRon Heck, Fall Week 8: Introducing Generalized Linear Models: Logistic Regression 1 (Replaces prior revision dated October 20, 2011)
Ron Heck, Fall 2011 1 EDEP 768E: Seminar in Multilevel Modeling rev. January 3, 2012 (see footnote) Week 8: Introducing Generalized Linear Models: Logistic Regression 1 (Replaces prior revision dated October
More informationChapter 1: Linear Regression with One Predictor Variable also known as: Simple Linear Regression Bivariate Linear Regression
BSTT523: Kutner et al., Chapter 1 1 Chapter 1: Linear Regression with One Predictor Variable also known as: Simple Linear Regression Bivariate Linear Regression Introduction: Functional relation between
More informationSimple Linear Regression: One Quantitative IV
Simple Linear Regression: One Quantitative IV Linear regression is frequently used to explain variation observed in a dependent variable (DV) with theoretically linked independent variables (IV). For example,
More informationTable 1: Fish Biomass data set on 26 streams
Math 221: Multiple Regression S. K. Hyde Chapter 27 (Moore, 5th Ed.) The following data set contains observations on the fish biomass of 26 streams. The potential regressors from which we wish to explain
More informationAt this point, if you ve done everything correctly, you should have data that looks something like:
This homework is due on July 19 th. Economics 375: Introduction to Econometrics Homework #4 1. One tool to aid in understanding econometrics is the Monte Carlo experiment. A Monte Carlo experiment allows
More informationOne-Way ANOVA. Some examples of when ANOVA would be appropriate include:
One-Way ANOVA 1. Purpose Analysis of variance (ANOVA) is used when one wishes to determine whether two or more groups (e.g., classes A, B, and C) differ on some outcome of interest (e.g., an achievement
More information1 A Review of Correlation and Regression
1 A Review of Correlation and Regression SW, Chapter 12 Suppose we select n = 10 persons from the population of college seniors who plan to take the MCAT exam. Each takes the test, is coached, and then
More informationLecture 11 Multiple Linear Regression
Lecture 11 Multiple Linear Regression STAT 512 Spring 2011 Background Reading KNNL: 6.1-6.5 11-1 Topic Overview Review: Multiple Linear Regression (MLR) Computer Science Case Study 11-2 Multiple Regression
More informationCorrelation and Simple Linear Regression
Correlation and Simple Linear Regression Sasivimol Rattanasiri, Ph.D Section for Clinical Epidemiology and Biostatistics Ramathibodi Hospital, Mahidol University E-mail: sasivimol.rat@mahidol.ac.th 1 Outline
More informationIntroducing Generalized Linear Models: Logistic Regression
Ron Heck, Summer 2012 Seminars 1 Multilevel Regression Models and Their Applications Seminar Introducing Generalized Linear Models: Logistic Regression The generalized linear model (GLM) represents and
More informationRon Heck, Fall Week 3: Notes Building a Two-Level Model
Ron Heck, Fall 2011 1 EDEP 768E: Seminar on Multilevel Modeling rev. 9/6/2011@11:27pm Week 3: Notes Building a Two-Level Model We will build a model to explain student math achievement using student-level
More informationusing the beginning of all regression models
Estimating using the beginning of all regression models 3 examples Note about shorthand Cavendish's 29 measurements of the earth's density Heights (inches) of 14 11 year-old males from Alberta study Half-life
More informationChapter 9 - Correlation and Regression
Chapter 9 - Correlation and Regression 9. Scatter diagram of percentage of LBW infants (Y) and high-risk fertility rate (X ) in Vermont Health Planning Districts. 9.3 Correlation between percentage of
More informationMidterm 2 - Solutions
Ecn 102 - Analysis of Economic Data University of California - Davis February 23, 2010 Instructor: John Parman Midterm 2 - Solutions You have until 10:20am to complete this exam. Please remember to put
More information4/22/2010. Test 3 Review ANOVA
Test 3 Review ANOVA 1 School recruiter wants to examine if there are difference between students at different class ranks in their reported intensity of school spirit. What is the factor? How many levels
More informationOrdinary Least Squares Regression Explained: Vartanian
Ordinary Least Squares Regression Explained: Vartanian When to Use Ordinary Least Squares Regression Analysis A. Variable types. When you have an interval/ratio scale dependent variable.. When your independent
More informationAn Analysis of College Algebra Exam Scores December 14, James D Jones Math Section 01
An Analysis of College Algebra Exam s December, 000 James D Jones Math - Section 0 An Analysis of College Algebra Exam s Introduction Students often complain about a test being too difficult. Are there
More informationMultiple Regression and Model Building (cont d) + GIS Lecture 21 3 May 2006 R. Ryznar
Multiple Regression and Model Building (cont d) + GIS 11.220 Lecture 21 3 May 2006 R. Ryznar Model Summary b 1-[(SSE/n-k+1)/(SST/n-1)] Model 1 Adjusted Std. Error of R R Square R Square the Estimate.991
More informationEXST Regression Techniques Page 1. We can also test the hypothesis H :" œ 0 versus H :"
EXST704 - Regression Techniques Page 1 Using F tests instead of t-tests We can also test the hypothesis H :" œ 0 versus H :" Á 0 with an F test.! " " " F œ MSRegression MSError This test is mathematically
More informationLecture 4: Multivariate Regression, Part 2
Lecture 4: Multivariate Regression, Part 2 Gauss-Markov Assumptions 1) Linear in Parameters: Y X X X i 0 1 1 2 2 k k 2) Random Sampling: we have a random sample from the population that follows the above
More informationBinary Logistic Regression
The coefficients of the multiple regression model are estimated using sample data with k independent variables Estimated (or predicted) value of Y Estimated intercept Estimated slope coefficients Ŷ = b
More informationProject Report for STAT571 Statistical Methods Instructor: Dr. Ramon V. Leon. Wage Data Analysis. Yuanlei Zhang
Project Report for STAT7 Statistical Methods Instructor: Dr. Ramon V. Leon Wage Data Analysis Yuanlei Zhang 77--7 November, Part : Introduction Data Set The data set contains a random sample of observations
More informationUnit 6 - Introduction to linear regression
Unit 6 - Introduction to linear regression Suggested reading: OpenIntro Statistics, Chapter 7 Suggested exercises: Part 1 - Relationship between two numerical variables: 7.7, 7.9, 7.11, 7.13, 7.15, 7.25,
More information9. Linear Regression and Correlation
9. Linear Regression and Correlation Data: y a quantitative response variable x a quantitative explanatory variable (Chap. 8: Recall that both variables were categorical) For example, y = annual income,
More informationSPSS LAB FILE 1
SPSS LAB FILE www.mcdtu.wordpress.com 1 www.mcdtu.wordpress.com 2 www.mcdtu.wordpress.com 3 OBJECTIVE 1: Transporation of Data Set to SPSS Editor INPUTS: Files: group1.xlsx, group1.txt PROCEDURE FOLLOWED:
More informationLecture 24: Partial correlation, multiple regression, and correlation
Lecture 24: Partial correlation, multiple regression, and correlation Ernesto F. L. Amaral November 21, 2017 Advanced Methods of Social Research (SOCI 420) Source: Healey, Joseph F. 2015. Statistics: A
More informationx3,..., Multiple Regression β q α, β 1, β 2, β 3,..., β q in the model can all be estimated by least square estimators
Multiple Regression Relating a response (dependent, input) y to a set of explanatory (independent, output, predictor) variables x, x 2, x 3,, x q. A technique for modeling the relationship between variables.
More informationMultiple Regression and Model Building Lecture 20 1 May 2006 R. Ryznar
Multiple Regression and Model Building 11.220 Lecture 20 1 May 2006 R. Ryznar Building Models: Making Sure the Assumptions Hold 1. There is a linear relationship between the explanatory (independent) variable(s)
More informationSelf-Assessment Weeks 8: Multiple Regression with Qualitative Predictors; Multiple Comparisons
Self-Assessment Weeks 8: Multiple Regression with Qualitative Predictors; Multiple Comparisons 1. Suppose we wish to assess the impact of five treatments while blocking for study participant race (Black,
More informationSTA441: Spring Multiple Regression. This slide show is a free open source document. See the last slide for copyright information.
STA441: Spring 2018 Multiple Regression This slide show is a free open source document. See the last slide for copyright information. 1 Least Squares Plane 2 Statistical MODEL There are p-1 explanatory
More informationCorrelation and the Analysis of Variance Approach to Simple Linear Regression
Correlation and the Analysis of Variance Approach to Simple Linear Regression Biometry 755 Spring 2009 Correlation and the Analysis of Variance Approach to Simple Linear Regression p. 1/35 Correlation
More informationMATH ASSIGNMENT 2: SOLUTIONS
MATH 204 - ASSIGNMENT 2: SOLUTIONS (a) Fitting the simple linear regression model to each of the variables in turn yields the following results: we look at t-tests for the individual coefficients, and
More informationAnswer Key: Problem Set 6
: Problem Set 6 1. Consider a linear model to explain monthly beer consumption: beer = + inc + price + educ + female + u 0 1 3 4 E ( u inc, price, educ, female ) = 0 ( u inc price educ female) σ inc var,,,
More informationSTA 303 H1S / 1002 HS Winter 2011 Test March 7, ab 1cde 2abcde 2fghij 3
STA 303 H1S / 1002 HS Winter 2011 Test March 7, 2011 LAST NAME: FIRST NAME: STUDENT NUMBER: ENROLLED IN: (circle one) STA 303 STA 1002 INSTRUCTIONS: Time: 90 minutes Aids allowed: calculator. Some formulae
More informationLecture 13 Extra Sums of Squares
Lecture 13 Extra Sums of Squares STAT 512 Spring 2011 Background Reading KNNL: 7.1-7.4 13-1 Topic Overview Extra Sums of Squares (Defined) Using and Interpreting R 2 and Partial-R 2 Getting ESS and Partial-R
More informationBooklet of Code and Output for STAC32 Final Exam
Booklet of Code and Output for STAC32 Final Exam December 7, 2017 Figure captions are below the Figures they refer to. LowCalorie LowFat LowCarbo Control 8 2 3 2 9 4 5 2 6 3 4-1 7 5 2 0 3 1 3 3 Figure
More informationOne-Way ANOVA Source Table J - 1 SS B / J - 1 MS B /MS W. Pairwise Post-Hoc Comparisons of Means
One-Way ANOVA Source Table ANOVA MODEL: ij = µ* + α j + ε ij H 0 : µ 1 = µ =... = µ j or H 0 : Σα j = 0 Source Sum of Squares df Mean Squares F Between Groups n j ( j - * ) J - 1 SS B / J - 1 MS B /MS
More informationEcn Analysis of Economic Data University of California - Davis February 23, 2010 Instructor: John Parman. Midterm 2. Name: ID Number: Section:
Ecn 102 - Analysis of Economic Data University of California - Davis February 23, 2010 Instructor: John Parman Midterm 2 You have until 10:20am to complete this exam. Please remember to put your name,
More informationUnit 6 - Simple linear regression
Sta 101: Data Analysis and Statistical Inference Dr. Çetinkaya-Rundel Unit 6 - Simple linear regression LO 1. Define the explanatory variable as the independent variable (predictor), and the response variable
More informationPractical Biostatistics
Practical Biostatistics Clinical Epidemiology, Biostatistics and Bioinformatics AMC Multivariable regression Day 5 Recap Describing association: Correlation Parametric technique: Pearson (PMCC) Non-parametric:
More informationSociology Research Statistics I Final Exam Answer Key December 15, 1993
Sociology 592 - Research Statistics I Final Exam Answer Key December 15, 1993 Where appropriate, show your work - partial credit may be given. (On the other hand, don't waste a lot of time on excess verbiage.)
More informationLecture 12 Inference in MLR
Lecture 12 Inference in MLR STAT 512 Spring 2011 Background Reading KNNL: 6.6-6.7 12-1 Topic Overview Review MLR Model Inference about Regression Parameters Estimation of Mean Response Prediction 12-2
More informationStatistics 5100 Spring 2018 Exam 1
Statistics 5100 Spring 2018 Exam 1 Directions: You have 60 minutes to complete the exam. Be sure to answer every question, and do not spend too much time on any part of any question. Be concise with all
More informationFinal Exam - Solutions
Ecn 102 - Analysis of Economic Data University of California - Davis March 19, 2010 Instructor: John Parman Final Exam - Solutions You have until 5:30pm to complete this exam. Please remember to put your
More informationImmigration attitudes (opposes immigration or supports it) it may seriously misestimate the magnitude of the effects of IVs
Logistic Regression, Part I: Problems with the Linear Probability Model (LPM) Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised February 22, 2015 This handout steals
More informationParametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami
Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami Parametric Assumptions The observations must be independent. Dependent variable should be continuous
More informationSTA 302 H1F / 1001 HF Fall 2007 Test 1 October 24, 2007
STA 302 H1F / 1001 HF Fall 2007 Test 1 October 24, 2007 LAST NAME: SOLUTIONS FIRST NAME: STUDENT NUMBER: ENROLLED IN: (circle one) STA 302 STA 1001 INSTRUCTIONS: Time: 90 minutes Aids allowed: calculator.
More informationDraft Proof - Do not copy, post, or distribute. Chapter Learning Objectives REGRESSION AND CORRELATION THE SCATTER DIAGRAM
1 REGRESSION AND CORRELATION As we learned in Chapter 9 ( Bivariate Tables ), the differential access to the Internet is real and persistent. Celeste Campos-Castillo s (015) research confirmed the impact
More informationSimple Linear Regression Using Ordinary Least Squares
Simple Linear Regression Using Ordinary Least Squares Purpose: To approximate a linear relationship with a line. Reason: We want to be able to predict Y using X. Definition: The Least Squares Regression
More informationMultiple Regression: Inference
Multiple Regression: Inference The t-test: is ˆ j big and precise enough? We test the null hypothesis: H 0 : β j =0; i.e. test that x j has no effect on y once the other explanatory variables are controlled
More informationMultiple Regression. More Hypothesis Testing. More Hypothesis Testing The big question: What we really want to know: What we actually know: We know:
Multiple Regression Ψ320 Ainsworth More Hypothesis Testing What we really want to know: Is the relationship in the population we have selected between X & Y strong enough that we can use the relationship
More informationProblem Set 10: Panel Data
Problem Set 10: Panel Data 1. Read in the data set, e11panel1.dta from the course website. This contains data on a sample or 1252 men and women who were asked about their hourly wage in two years, 2005
More informationExample. Multiple Regression. Review of ANOVA & Simple Regression /749 Experimental Design for Behavioral and Social Sciences
36-309/749 Experimental Design for Behavioral and Social Sciences Sep. 29, 2015 Lecture 5: Multiple Regression Review of ANOVA & Simple Regression Both Quantitative outcome Independent, Gaussian errors
More informationClass Notes: Week 8. Probit versus Logit Link Functions and Count Data
Ronald Heck Class Notes: Week 8 1 Class Notes: Week 8 Probit versus Logit Link Functions and Count Data This week we ll take up a couple of issues. The first is working with a probit link function. While
More informationSection 9c. Propensity scores. Controlling for bias & confounding in observational studies
Section 9c Propensity scores Controlling for bias & confounding in observational studies 1 Logistic regression and propensity scores Consider comparing an outcome in two treatment groups: A vs B. In a
More informationModels for Clustered Data
Models for Clustered Data Edps/Psych/Soc 589 Carolyn J Anderson Department of Educational Psychology c Board of Trustees, University of Illinois Spring 2019 Outline Notation NELS88 data Fixed Effects ANOVA
More informationANALYSES OF NCGS DATA FOR ALCOHOL STATUS CATEGORIES 1 22:46 Sunday, March 2, 2003
ANALYSES OF NCGS DATA FOR ALCOHOL STATUS CATEGORIES 1 22:46 Sunday, March 2, 2003 The MEANS Procedure DRINKING STATUS=1 Analysis Variable : TRIGL N Mean Std Dev Minimum Maximum 164 151.6219512 95.3801744
More information1 Correlation and Inference from Regression
1 Correlation and Inference from Regression Reading: Kennedy (1998) A Guide to Econometrics, Chapters 4 and 6 Maddala, G.S. (1992) Introduction to Econometrics p. 170-177 Moore and McCabe, chapter 12 is
More informationLecture 3: Multiple Regression. Prof. Sharyn O Halloran Sustainable Development U9611 Econometrics II
Lecture 3: Multiple Regression Prof. Sharyn O Halloran Sustainable Development Econometrics II Outline Basics of Multiple Regression Dummy Variables Interactive terms Curvilinear models Review Strategies
More informationLab 07 Introduction to Econometrics
Lab 07 Introduction to Econometrics Learning outcomes for this lab: Introduce the different typologies of data and the econometric models that can be used Understand the rationale behind econometrics Understand
More informationModels for Clustered Data
Models for Clustered Data Edps/Psych/Stat 587 Carolyn J Anderson Department of Educational Psychology c Board of Trustees, University of Illinois Fall 2017 Outline Notation NELS88 data Fixed Effects ANOVA
More information" M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2
Notation and Equations for Final Exam Symbol Definition X The variable we measure in a scientific study n The size of the sample N The size of the population M The mean of the sample µ The mean of the
More information(Where does Ch. 7 on comparing 2 means or 2 proportions fit into this?)
12. Comparing Groups: Analysis of Variance (ANOVA) Methods Response y Explanatory x var s Method Categorical Categorical Contingency tables (Ch. 8) (chi-squared, etc.) Quantitative Quantitative Regression
More informationSTA 101 Final Review
STA 101 Final Review Statistics 101 Thomas Leininger June 24, 2013 Announcements All work (besides projects) should be returned to you and should be entered on Sakai. Office Hour: 2 3pm today (Old Chem
More informationOverview Scatter Plot Example
Overview Topic 22 - Linear Regression and Correlation STAT 5 Professor Bruce Craig Consider one population but two variables For each sampling unit observe X and Y Assume linear relationship between variables
More information