Sociology 301. Bivariate Regression II: Testing Slope and Coefficient of Determination. Bivariate Regression. Calculating Expected Values

Similar documents
Sociology 301. Bivariate Regression. Clarification. Regression. Liying Luo Last exam (Exam #4) is on May 17, in class.

Sociology 470. Bivariate Regression. Extra Points. Regression. Liying Luo Job talk on Thursday 11/3 at Pond 302

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands

Lecture 6: Introduction to Linear Regression

Chapter 11: Simple Linear Regression and Correlation

Statistics for Business and Economics

Statistics for Economics & Business

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding

Soc 3811 Basic Social Statistics Third Midterm Exam Spring 2010

Correlation and Regression. Correlation 9.1. Correlation. Chapter 9

18. SIMPLE LINEAR REGRESSION III

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6

28. SIMPLE LINEAR REGRESSION III

Statistics MINITAB - Lab 2

Basic Business Statistics, 10/e

Y = β 0 + β 1 X 1 + β 2 X β k X k + ε

Lecture Notes for STATISTICAL METHODS FOR BUSINESS II BMGT 212. Chapters 14, 15 & 16. Professor Ahmadi, Ph.D. Department of Management

STATISTICS QUESTIONS. Step by Step Solutions.

Linear Regression Analysis: Terminology and Notation

2016 Wiley. Study Session 2: Ethical and Professional Standards Application

Midterm Examination. Regression and Forecasting Models

Chapter 15 - Multiple Regression

Chapter 13: Multiple Regression

/ n ) are compared. The logic is: if the two

Statistics II Final Exam 26/6/18

Linear Correlation. Many research issues are pursued with nonexperimental studies that seek to establish relationships among 2 or more variables

STAT 3008 Applied Regression Analysis

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics

Here is the rationale: If X and y have a strong positive relationship to one another, then ( x x) will tend to be positive when ( y y)

PubH 7405: REGRESSION ANALYSIS. SLR: INFERENCES, Part II

Chapter 9: Statistical Inference and the Relationship between Two Variables

Cathy Walker March 5, 2010

[The following data appear in Wooldridge Q2.3.] The table below contains the ACT score and college GPA for eight college students.

x yi In chapter 14, we want to perform inference (i.e. calculate confidence intervals and perform tests of significance) in this setting.

SIMPLE LINEAR REGRESSION

Lecture 3 Stat102, Spring 2007

Introduction to Regression

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4)

Properties of Least Squares

STAT 3340 Assignment 1 solutions. 1. Find the equation of the line which passes through the points (1,1) and (4,5).

Chapter 14 Simple Linear Regression

a. (All your answers should be in the letter!

Comparison of Regression Lines

A dummy variable equal to 1 if the nearby school is in regular session and 0 otherwise;

Chapter 8 Indicator Variables

17 - LINEAR REGRESSION II

Chapter 2 - The Simple Linear Regression Model S =0. e i is a random error. S β2 β. This is a minimization problem. Solution is a calculus exercise.

Dummy variables in multiple variable regression model

Statistics for Managers Using Microsoft Excel/SPSS Chapter 14 Multiple Regression Models

where I = (n x n) diagonal identity matrix with diagonal elements = 1 and off-diagonal elements = 0; and σ 2 e = variance of (Y X).

Linear regression. Regression Models. Chapter 11 Student Lecture Notes Regression Analysis is the

Biostatistics 360 F&t Tests and Intervals in Regression 1

Learning Objectives for Chapter 11

Scatter Plot x

x = , so that calculated

Activity #13: Simple Linear Regression. actgpa.sav; beer.sav;

x i1 =1 for all i (the constant ).

Statistics Chapter 4

NANYANG TECHNOLOGICAL UNIVERSITY SEMESTER I EXAMINATION MTH352/MH3510 Regression Analysis

Chapter 8 Multivariate Regression Analysis

Introduction to Analysis of Variance (ANOVA) Part 1

Economics 130. Lecture 4 Simple Linear Regression Continued

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity

PBAF 528 Week Theory Is the variable s place in the equation certain and theoretically sound? Most important! 2. T-test

Reduced slides. Introduction to Analysis of Variance (ANOVA) Part 1. Single factor

January Examinations 2015

STAT 511 FINAL EXAM NAME Spring 2001

Answers Problem Set 2 Chem 314A Williamsen Spring 2000

Chapter 3 Describing Data Using Numerical Measures

ANSWERS CHAPTER 9. TIO 9.2: If the values are the same, the difference is 0, therefore the null hypothesis cannot be rejected.

First Year Examination Department of Statistics, University of Florida

BIO Lab 2: TWO-LEVEL NORMAL MODELS with school children popularity data

e i is a random error

REGRESSION ANALYSIS II- MULTICOLLINEARITY

Correlation and Regression

Resource Allocation and Decision Analysis (ECON 8010) Spring 2014 Foundations of Regression Analysis

III. Econometric Methodology Regression Analysis

Lecture 16 Statistical Analysis in Biomaterials Research (Part II)

Chapter 5 Multilevel Models

This column is a continuation of our previous column

Tests of Single Linear Coefficient Restrictions: t-tests and F-tests. 1. Basic Rules. 2. Testing Single Linear Coefficient Restrictions

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Chapter 14 Simple Linear Regression Page 1. Introduction to regression analysis 14-2

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction

Module Contact: Dr Susan Long, ECO Copyright of the University of East Anglia Version 1

Kernel Methods and SVMs Extension

Statistical Evaluation of WATFLOOD

Interval Estimation in the Classical Normal Linear Regression Model. 1. Introduction

Marginal Effects in Probit Models: Interpretation and Testing. 1. Interpreting Probit Coefficients

ANOVA. The Observations y ij

Topic 7: Analysis of Variance

A Robust Method for Calculating the Correlation Coefficient

Regulation No. 117 (Tyres rolling noise and wet grip adhesion) Proposal for amendments to ECE/TRANS/WP.29/GRB/2010/3

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA

Professor Chris Murray. Midterm Exam

Chapter 3. Two-Variable Regression Model: The Problem of Estimation

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity

Transcription:

Socology 30 Bvarate Regresson II: Testng Slope and Coeffcent of Determnaton Lyng Luo 05.03 Bvarate Regresson F.ed regresson model for sample ntercept slope Learnng objec;ves Understand the basc dea of es;ma;ng regresson coeffcent Be able to nterpret and test regresson coeffcent Be able to compute and nterpret coeffcent of determna;on Calculatng Expected Values Ŷ 4.76 + 0.X Games Won Runs Scored Usng the regresson equaton we can estmate the average value of Y for a gven value of X predct an ndvdual s value of Y for a gven value of X

Usng the regresson equaton we can estmate the average value of Y for a gven value of X predct an ndvdual s value of Y for a gven value of X estmate the average value of Y for a gven value of X predct an ndvdual s value of Y for a gven value of X Calculatng Expected Values Interpretng the Regresson Equaton Interpretng the Regresson Equaton The equaton Ŷ 4.76 + 0.X means (n Englsh) that The equaton Ŷ 4.76 + 0.X 4.76 0.X means (n Englsh) that Expected Number of Wns + 0.Runs Expected Number of Wns + 0.Runs Expected Number of Wns 0.Runs How many wns would we predct a team to wn f they How many wns would we predct a team to wn f they scored scored 80 80 runs? runs? scored Expected 80 runs? Number of Wns + 0.(80) 83.35 Expected Number of Wns + 0.(80) 83.35 Expected Number of Wns + 0.(80) 83.35 Expected Number of Wns 0.(80) 83.35 What s the average number of wns among teams that What s the average number of wns among teams that What s the average number of wns among teams that score 750 runs? score 750 runs? Expected Number of Wns + 0.(750) 77.74 Expected Expected Number Number of of Wns Wns + 0.(750) 0.(750) 77.74 77.74 Socology 38 ~ 3/3/205 Socology 38 ~ 3/3/205 7 7 Worksheet 04.28 A fnancal analyst would lke to know the rela;onshp between the mutual fund fees (X, n %) and ts annual yeld (Y, n %). She regressed annual yeld on fees and found n her sample a4 and b-0... Wrte down the f.ed regresson model. 2. Interpret the es;mated ntercept and slope. 3. What s the expected return for a mutual fund chargng % fee? 5% fee? Testng Bvarate Regresson Coeffcent F.ed regresson model for sample ntercept slope Tes;ng bvarate regresson coeffcent: a ntercept: We don t usually care b slope: Is the predctor (X) related to the response (Y)?

Testng Bvarate Regresson Coeffcent Typcal null hypothess about popula;on slope β: the predctor varable X has no lnear rela;onshp wth the response varable Y. Gven the es;mated slope b wth sample data, how lkely the popula;on slope β equals to 0? Most common: two-taled test: H 0: β0 vs H : β 0 (Far) less common: one-taled test: H 0: β 0 vs H : β<0 or H 0: β 0 vs H : β>0 Testng Bvarate Regresson Coeffcent Example : An economst s nterested n the rela;onshp between annual salary (Y) and heght n nches (X). He regressed annual salary on heght and found the es;mated ntercept a30,000 and slope b350. State the null and research hypothess about the slope. Testng Bvarate Regresson Coeffcent Example 2: A crmnologst s nterested n the rela;onshp between number of homcde (Y) and medan household ncome (X) n neghborhoods. She regressed the number of homcde on medan household ncome. She found the es;mated ntercept a0. and slope b-0.5. State the null and research hypothess about the slope.

Testng Bvarate Regresson Coeffcent The Central Lmt Theorem b a sample sta;s;cs s normally dstrbuted wth meanβ and standard error (under some crcumstances of course). It means that we can use Z-sta;s;cs to test whether β s dfferent from the hypotheszed value (usually 0). However, we usually don t know so we use t-sta;s;cs nstead. and have to rely on sample nforma;on, For ths class, (standard error of b) wll be provded. Testng Bvarate Regresson Coeffcent Example A cvl engneer s nterested n whether anxety level (X, a numerc measure rangng anywhere from 0 to 20 wth 20 beng extremely anxous) s assocated wth drvng speed on the hghway (Y, n mles per hour). He regressed speed on anxety level on speed and found the es;mated ntercept a75, the es;mated slope b2., and the es;mated standard error for the slope S b.2.. Wrte down the f.ed regresson model. 2. Interpret the es;mated ntercept and slope. 3. State the null and research hypothess about the slope. 4. Decde the alpha level and cr;cal value(s). 5. Compute the test sta;s;c. 6. Compare the test sta;s;c to the cr;cal value to make a decson. 7. State a techncal decson and a substan;ve concluson. Testng Bvarate Regresson Coeffcent Example 2 A socologst s nterested n nter-genera;onal moblty defned as the rela;onshp between father educa;on and chld s educa;on. He sampled n,485 ndvduals, and below s the SP output.. Wrte down the f.ed regresson model. 2. Interpret the es;mated ntercept and slope. 3. State the null and research hypothess about the slope. 4. Decde the alpha level and cr;cal value(s). 5. Compute the test sta;s;c. 6. Compare the test sta;s;c to the cr;cal value to make a decson. 7. State a techncal decson and a substan;ve concluson.

Worksheet 05.03 A cvl engneer s nterested n whether anxety level (X, a numerc measure rangng anywhere from 0 to 20 wth 20 beng extremely anxous) s assocated wth drvng speed on the hghway (Y, n mles per hour). He regressed speed on anxety level and found the es;mated ntercept a75, and the ntercept b2., and the es;mated standard error for the slope Sb.2.. Wrte down the f.ed regresson model. 2. Interpret the es;mated ntercept and slope. 3. State the null and research hypothess about the slope. 4. Decde the alpha level and cr;cal value(s). 5. Compute the test sta;s;c. 6. Compare the test sta;s;c to the cr;cal value to make a decson. 7. State a techncal decson and a substan;ve concluson. Coeffcent of Determnaton How well does X do n es;ma;ng/predc;ng Y? A strong assoca;on/rela;onshp allows good es;ma;on/predc;on. A weak assoca;on/rela;onshp means bad es;ma;on/predc;on. To measure ths, we examne how much of the vara;on n Y can be a.rbuted to X and how much s random error. That s, we can par;;on the varance n Y nto the part a.rbutable to X and the part a.rbutable to error. Coeffcent of Determnaton Par;;on the varance/deva;on n Y nto the part a.rbutable to X and the part a.rbutable to error. Y Y ( Ŷ Y) + ( Y Ŷ ) Porton of the devaton from the mean that s attrbutable to X Porton of the devaton from the mean that s attrbutable to random error Devatons from the mean can be expressed as the sum of () devatons of the predcted value from the mean and (2) ndvdual devatons from the predcted value

Coeffcent of Determnaton If we square each sde and then sum across cases we get: n Total Sum of Squares 2 2 n n ( Y Y) ( Ŷ Y) + ( Y Ŷ ) 2 Regresson Sum of Squares REGREION + ERROR Error Sum of Squares How Well Does X Predct Y? If we square each n sde and n then 2sum n across cases we get: 2 ( ) 2 n Y Y Ŷ Y 2+ Y Ŷ When X s not related to Y, 2 n n then ( ) ( ) ( ) 2 Y Y Ŷ Y + Y Ŷ knowng X does not help es;mate or predct Y Total Regresson Error our best Sum Total guess of Squares about Sum s Regresson of, so Squares REGREION Sum 0 and of Error Squares ERROR Sum of Squares Sum of Squares Sum of Squares REGREION + ERROR If there s no assocaton between Y and X, then knowng X does not help predct Y In ths case, our best guess about Y-hat s Y-bar; thus REGREION equals zero and ERROR Coeffcent of Determnaton How Well Does X Predct Y? The coeffcent of determnaton (R 2 ) ndcates the proporton of the total varaton n Y that s determned by ts The lnear coeffcent How relatonshp Well of determnaton wth Does X (RX 2 ) Predct ndcates the Y? proporton 2 of the total varaton ERROR n Y that s determned by REGREION ts R lnear relatonshp wth X The coeffcent of determnaton (R 2 ERROR 2 ) ndcates the REGREION proporton R of the total varaton n Y that s determned by ts lnear relatonshp wth X If R 2 0, then REGREION 0, whch suggests that there s no 2 ERROR REGREION assocaton R between Y and X If If R 2 R 2, then REGREION, whch suggests that there s no 0, then error varaton REGREION 0, whch suggests that there s no and that we can perfectly predct Y assocaton between Y and X based on X If R 2, then REGREION, whch suggests that there s no error varaton and that we can perfectly predct Y based on X How Well Does X Predct Y? Coeffcent of Determnaton R 2.0 R 2.0 How Well Does X Predct Y? R 2.0 R 2.0 R 2 0.25 R 2 0.25 R 2 0.25 R 2 0.25 Socology 38 ~ 3/3/205 9

Coeffcent of Determnaton 2 For the baseball example: R 28.63 2948.967 REGREION 0.383 Games Won Runs Scored Coeffcent of Determnaton An R 2 of 0.383 means that 38.3% of the varaton n Y (Wns) s explaned by X (Runs Scored) Games Won Runs Scored Coeffcent of Determnaton 2 ERROR R REGREION Example : Calculate and nterpret REGREION 67,23 ERROR 2,86,928? R 2 for regressng church a.endance on age: Example : Calculate and nterpret REGREION,5,622 ERROR? 0,502,532 R 2 for regressng sexfreq on age: