1) Answer the following questions as true (T) or false (F) by circling the appropriate letter.
|
|
- Christina Mosley
- 5 years ago
- Views:
Transcription
1 1) Answer the following questions as true (T) or false (F) by circling the appropriate letter. T F T F T F a) Variance estimates should always be positive, but covariance estimates can be either positive or negative. b) The influence diagnostics called DFFITS and DFBETAS are calculated on deleted observations, while the calculation of Cook s D does not involve a deleted observation. c) The value of R 2 will always increase and the value of the MSE will always decrease when additional variables are added to the model. T F d) The value of R 2 for a linear regression ranges from 0.0 to 1.0. T F e) The confidence interval of a simple linear regression line will be narrowest (i.e. have the smallest variance) at the intercept. T F f) Logarithmic transformation of a Yi variable in regression does not affect homogeneity of variance. T F g) As a result of the assumptions for regression, we expect the regression coefficients, bi, to be independent of each other. T F h) For regression the TYPE I and TYPE II sums of squares are the same. T F i) The sequential sums of squares for the SAS model statement MODEL Y = X1 X2 X3; would be represented in extra sum of squares notation as: SSX1 X2 X3; SSX2 X1 X3; SSX3 X1 X2 T F j) The general linear hypothesis test is used to test for a difference between two models where one model has a subset of variables contained in the other model. 2) Examine the list of diagnostic statistics below. For the questions that follow, choose a diagnostic from this list (SAS names used). a) STUDENT b) RSTUDENT c) COOK's D d) DFFITS e) DFBETAS f) h ii or HATDIAG g) R 2 h) VIF i) Standardized b i j) Sequential b i k) Partial R 2 Type III Description of the statistic : Which of the statistics named in the list above have the property listed below? Choose one statistic and enter the corresponding letter in the blank space to the right. a) Which could be obtained by doing regression on standardized variables (e.g. where Yi and Xij have a mean zero and variance of 1)? Letter from the list above b) Which could be used to examine the observations for unusual values of Xi? 1
2 c) Which could be used to determine which observations strongly influence the fit of the data (e.g. judging by the effect on the predicted Yi values)? d) Which would be the best statistic to use as a diagnostic for multicolinearity? e) Which would be the best statistic to use to diagnose outliers? f) Which would be used to examine changes in the regression coefficients as additional variables are added one by one? 3) Which ONE of the following IS an intrinsically linear regression model? a) a model linearized by taking the logarithm of the dependent variable Yi b) a model linearized by taking the logarithm of the independent variable Xi c) a model linearized by taking the inverse of the independent variable Xi d) a polynomial regression with successively higher powers of Xi 4) Some statistics are diagnostics for observations and some for variables. Which of the statistics below is a diagnostic for a variable? a) STUDENT b) RSTUDENT c) HATDIAG d) VIF 5) Which of the statistics below could be produced by "standardizing" the Yi variable and the Xi variables before doing the regression? a) RSTUDENT b) COOK's D c) HATDIAG d) Standardized b i 6) Which of the diagnostic tools below would NOT be used to examine the observations for potential outliers of Yi values? a) STUDENT b) RSTUDENT c) HATDIAG d) a residual plot 7) Given the model, Yi b0 bx 1 1i bx 2 2i bx 3 3i bx 4 4 i ei, fitted in SAS as MODEL Y = X1 X2 X3 X4;, which statement below best describes the Extra SS for X4? a) The TYPE I SS for X4 will be LARGER than the TYPE II or TYPE III for X4. b) The TYPE I SS for X4 will be SMALLER than the TYPE II or TYPE III for X4. 2
3 c) The TYPE I SS for X4 will be the SAME AS the TYPE II or TYPE III for X4. d) Sometimes the TYPE I SS for X4 will be larger and sometimes the TYPE II or TYPE III will be larger. 8) Use of the word partial in association to a statistic in multiple regression implies which of the following (for example partial standardized regression coefficients )? a) The statistics are given for only part of the reduced model b) The statistic is fully adjusted for all other variables c) The statistics is only partly adjusted for some other variables d) The result is an intermediate result 9) Which of the following is not fully adjusted for all variables in the multiple regression? a) TYPE I sum of squares b) TYPE II sum of squares c) regression coefficients (bi) d) squared partial correlations Type II 10) Which of the following will alter homogeneity of variance in a simple linear regression? a) taking a logarithm of the dependent variable (Yi) b) taking a logarithm of the dependent variable (Xi) c) fitting a polynomial with power terms of X d) adding additional independent variables 11) Given two models where the full model is Yi 0 1X1 i 2X 2i 3X3i 4X 4i e and the reduced i model is Yi 0 1X1 i 2X 2i e the test of difference between the models corresponds to which of the i hypotheses tests below? a) H0: b1 = b2 = 0. b) H0: b3 = b4 = 0. c) H0: β1 = β2 = 0. d) H0: β3 = β4 = 0. 3
4 12) Which of the following is a characteristic or likely consequence of strong multicollinearity? Circle all that apply. a) poor estimates of the regression coefficients (βi). b) increased variance estimates for the regression coefficients (bi). c) correlations among combinations of the independent variables (Xi). 13) Which of the following statistics is not used to evaluate multicollinearity. a) correlations among independent variables. b) the hat value (hii). c) the variance inflation factor. d) the sequential parameter estimates. 14) If a 3 factor regression is first fitted in SAS as Y = X1 X2 X3 and then refitted as Y = X1 X2 which of the following is true of the TYPE I sum of squares for X1? a) It will be larger. b) It will be smaller c) It will not change d) It can be larger, smaller or the same 15) Given 4 independent variables available for a multiple regression; if the residuals of the model Y = X3 X1 X4 are plotted on the residuals of the model X2 = X3 X1 X4 what is the graphic called? a) The ordinary residual plot of a 3 factor model (e.g. X1, X3, X4) b) The partial residual plot of the variable X2 c) The scatter plot of Y on the predicted value of Y (i.e. Ŷ ) d) This is a plot of the hat values for X4 16) Given the results below, what is SS(x3, x1 x2)? 4
5 ID Model SSReg SSE SSTotal Model 1 Y = X Model 2 Y = X Model 3 Y = X Model 4 Y = X1 X Model 5 Y = X1 X Model 6 Y = X2 X Model 7 Y = X1 X2 X The remaining questions refer to the SAS output pages given separately. These pages contain output for a multiple regression fitted with PROC REG. Questions requesting a P value may not always have a meaningful P value associated. If no P value is associated with the answer, state what statistic or graphic was used to answer the question. The SAS program is an analysis of attendance at Saturday Australian Football League (AFL) games at the Melbourne Cricket Ground (MCG). The variables used are described in the program comments. The computer output provided contains the SAS program and a full model analysis with selected diagnostics. The program is provided to help you to determine what was done. 17a) Do you believe there is multicolinearity in this multiple regression? Circle one: Yes / No P value or statistic used =. 17b) Disregarding of whether it was statistically significant or not (for this question only), do higher forecast temperatures tend to increase or decrease game attendance? Circle one: Increase / Decrease Statistic used =. 17c) Is there evidence that the size of the membership of the club (Members) has a statistically significant impact on the attendance? Circle one: Yes / No P value or statistic used =. 17d) What is the raw sum of squares of the variable Other (i.e. )? 2 X other 5
6 Requested Value =. 17e) What is the covariance between coefficients of the variables Temp and Top50? Requested Value =. 17f) Does the assumption of normality appear to have been met for this analysis? Circle one: Yes / No P value or statistic used =. 17g) One hypotheses of the investigators was that about 80% of the club members would attend a game and that half of them would bring a guest. The expected coefficient for the slope (Attendence / Member) would then be = 1.2? Does the observed value differ significantly from this hypothesized value (e.g. result in rejecting H : Members ). Circle one: Yes / No P value or statistic used =. 17h) Was the assumption of homogeneity of variance met for this analysis? Circle one: Yes / No P value or statistic used =. 17i) Using statistics provided to evaluate the importance or impact of variables in the regression other than the parameter estimates and the t test of those estimates, which of the variables appears to be the SECOND best variable in the model? Variable name Statistic used =. 17j) What is the standard error of the intercept?. 17k) It was thought by the investigators that the conditions that existed for observation n on 24/7/93 were nearly ideal for game attendance? What is the confidence interval for all games that might be played under the same conditions, that is, have the same values for the four independent variables. P( true mean ) = l) What fraction of the (corrected) total variation is accounted for by the model? 6
7 Requested Value =. The investigators were concerned about the individual observation #6 (ObsID f ). They calculated a Bonferroni adjusted t value of 3.52 for =0.05 and 41 observations and a mean value of h ii (p/n = 0.122). Answer the next 4 questions below about this observation. 17m) First place a confidence interval on this single observation. P( true mean ) = n) Does this observation appear to be an outlier given the Bonferroni calculation above? Circle one: Yes / No Statistic(s) used =. 17o) Does this observation appear to be influential? Circle one: Yes / No Statistic(s) used =. 17p) Does this observation appear to have an unusual combination of independent variables? Circle one: Yes / No Statistic(s) used =. 17q) Which observation appears most likely to be an outlier judging from the residual plot? Give an observation number of ObsID value? Indicate if you believe it is an outlier (yes of no). Circle one: Yes / No Observation number =. 7
8 dm'log;clear;output;clear'; options ps=512 ls=111 nocenter nodate nonumber FORMCHAR=" =-/\<>*"; ODS HTML style=minimal body='afl Crowd Attendance.HTML'; ODS listing; ods graphics off; TITLE1 'Exam 1 Example - AFL Crowd Attendance at the MCG'; *****************************************************************; *** AFL Crowd Attendance at the MCG ***; *** Data set was assembled by Rowan Todd and Mark McNaughton, *** *** two stat students in Dr Margaret Mackisack's QUT class. ***; *********************************************************************; *** They collected data involving Saturday Australian Football ***; *** League (AFL) matches at the Melbourne Cricket Ground (MCG). ***; *** They looked only at matches during the normal home and away ***; *** season (i.e. not including finals). They used statistics from ***; *** all such games in 1993 and 1994 (nineteen relevant matches in ***; *** 1993 and twenty-two in 1994). The response variable measured ***; *** was attendance at the MCG, and after consideration, they came ***; *** up with the following covariates: ***; 8
9 *********************************************************************; *** Variable Description ***; *** MCG Attendance at the MCG in 1000's. ***; *** Temp Temperature. The forecast max temp on the day of the***; *** match, in degrees C, from The Weekend Australian. ***; *** Other Attendance at other matches in 1000's. The sum of ***; *** the attendances at other AFL matches in Melbourne ***; *** and Geelong on the same day as the match. ***; *** Members Membership. The sum of the memberships of the two ***; *** clubs whose teams were playing the match in 1000's. ***; *** Top50 Number of players from the top fifty. The number of ***; *** top 50 players from the AFL playing in the match. ***; *** Date Date of the match in the format dd/mm/yy. ***; *** Home Abbreviation for home team. ***; *** Away Abbreviation for away team. ***; **********************************************************************; DATA Attendance; INPUT MCG 1-7 Other 9-16 Temp Members Top Date $ Home $ Away $ 53-58; obs + 1; 9
10 ObsID = byte(obs+96); * lower case *; if obs gt 26 then ObsID = byte(obs+64-26); * upper case *; * ; CARDS; RUN; Data was here ; *proc print data=attendance; run; PROC REG DATA=Attendance lineprinter; id ObsID; TITLE2 'PROC REG'; MODEL MCG = Other Temp Members Top50 / CLB ss1 ss2 XPX I influence vif collin scorr1 pcorr2 covb R stb; OUTPUT out=next1 p=predicted r=residual lclm=lclmean Uclm=Uclmean lcl=lclindiv ucl=uclindiv; Test1:Test Members = 1.2; RUN; proc print data=next1 noobs; TITLE3 'OUTPUT STATEMENT results'; var obs ObsID MCG Other Temp Members Top50 predicted lclmean Uclmean lclindiv uclindiv; run; 10
11 proc univariate data=next1 normal plot; var residual; ods exclude basicmeasures extremeobs quantiles testsforlocation; TITLE3 'Univariate analysis of residuals'; run; proc plot data=next1; plot residual*predicted=obsid / vref=0; TITLE3 'Residual plot'; options ps=56 ls=111; run; options ps=512 ls=111; 11
12 Exam 1 Example - AFL Crowd Attendance at the MCG PROC REG The REG Procedure Model: MODEL1 Dependent Variable: MCG Number of Observations Read 41 Number of Observations Used 41 Model Crossproducts X'X X'Y Y'Y Variable Intercept Other Temp Members Top50 MCG Intercept Other Temp Members Top MCG X'X Inverse, Parameter Estimates, and SSE Variable Intercept Other Temp Members Top50 MCG Intercept Other E Temp Members E Top MCG Analysis of Variance Sum of Mean Source DF Squares Square F Value Pr > F Model Error Corrected Total Root MSE R-Square Dependent Mean Adj R-Sq Coeff Var
13 Parameter Estimates Parameter Standard Standardized Variable DF Estimate Error t Value Pr > t Type I SS Type II SS Estimate Intercept Other Temp Members Top Parameter Estimates Squared Squared Semi-partial Partial Variance Variable DF Corr Type I Corr Type II Inflation 95% Confidence Limits Intercept Other Temp Members Top Covariance of Estimates Variable Intercept Other Temp Members Top50 Intercept Other Temp Members Top Collinearity Diagnostics Condition Proportion of Variation- Number Eigenvalue Index Intercept Other Temp Members Top Test Test1 Results for Dependent Variable MCG Source DF Mean Square F Value Pr > F Numerator Denominator
14 Exam 1 Example - AFL Crowd Attendance at the MCG PROC REG The REG Procedure Model: MODEL1 Dependent Variable: MCG OutputStatistics Obs Dependent Predicted Std Error Std Error Student Cook's Obs ID Variable Value Mean Predict Residual Residual Residual D RStudent 1 a * b * c d e * f ***** g h ** i j ** k * l m n ** o p * q r **** s t ** u * v *** w ** x ** y ** z A * B C *** D E
15 32 F G * H ** I *** J * K L M N * O * OutputStatistics Obs Hat Diag Cov DFBETAS-- Obs ID H Ratio DFFITS Intercept Other Temp Members Top50 1 a b c d e f g h i j k l m n o p q r s t u v w x y z A
16 28 B C D E F G H I J K L M N O Sum of Residuals 0 Sum of Squared Residuals Predicted Residual SS (PRESS) Exam 1 Example - AFL Crowd Attendance at the MCG PROC REG OUTPUT STATEMENT results Obs obs ID MCG Other Temp Members Top50 predicted lclmean Uclmean lclindiv uclindiv 1 a b c d e f g h i j k l m n o p
17 17 q r s t u v w x y z A B C D E F G H I J K L M N O Exam 1 Example - AFL Crowd Attendance at the MCG PROC REG Univariate analysis of residuals The UNIVARIATE Procedure Variable: residual (Residual) Moments N 41 Sum Weights 41 Mean 0 Sum Observations 0 Std Deviation Variance Skewness Kurtosis Uncorrected SS Corrected SS Coeff Variation. Std Error Mean
18 Tests for Normality Test --Statistic p Value Shapiro-Wilk W Pr < W Kolmogorov-Smirnov D Pr > D > Cramer-von Mises W-Sq Pr > W-Sq > Anderson-Darling A-Sq Pr > A-Sq > Stem Leaf Boxplot Normal Probability Plot * * +* * * ** * ***** *** *-----* **** *** ****** * +** * Multiply Stem.Leaf by 10**
19 Exam 1 Example - AFL Crowd Attendance at the MCG PROC REG Residual plot Plot of residual*predicted. Symbol is value of ObsID r C I n w 20 + H x 10 + O b N G D i d R F z q e L K M s c E s i B l d u g o a u l k p J e A a h t y j v f Predicted Value of MCG NOTE: 1 obs hidden. 19
Chapter 8 (More on Assumptions for the Simple Linear Regression)
EXST3201 Chapter 8b Geaghan Fall 2005: Page 1 Chapter 8 (More on Assumptions for the Simple Linear Regression) Your textbook considers the following assumptions: Linearity This is not something I usually
More informationEXST7015: Estimating tree weights from other morphometric variables Raw data print
Simple Linear Regression SAS example Page 1 1 ********************************************; 2 *** Data from Freund & Wilson (1993) ***; 3 *** TABLE 8.24 : ESTIMATING TREE WEIGHTS ***; 4 ********************************************;
More informationCOMPREHENSIVE WRITTEN EXAMINATION, PAPER III FRIDAY AUGUST 26, 2005, 9:00 A.M. 1:00 P.M. STATISTICS 174 QUESTION
COMPREHENSIVE WRITTEN EXAMINATION, PAPER III FRIDAY AUGUST 26, 2005, 9:00 A.M. 1:00 P.M. STATISTICS 174 QUESTION Answer all parts. Closed book, calculators allowed. It is important to show all working,
More informationThis is a Randomized Block Design (RBD) with a single factor treatment arrangement (2 levels) which are fixed.
EXST3201 Chapter 13c Geaghan Fall 2005: Page 1 Linear Models Y ij = µ + βi + τ j + βτij + εijk This is a Randomized Block Design (RBD) with a single factor treatment arrangement (2 levels) which are fixed.
More informationunadjusted model for baseline cholesterol 22:31 Monday, April 19,
unadjusted model for baseline cholesterol 22:31 Monday, April 19, 2004 1 Class Level Information Class Levels Values TRETGRP 3 3 4 5 SEX 2 0 1 Number of observations 916 unadjusted model for baseline cholesterol
More informationOdor attraction CRD Page 1
Odor attraction CRD Page 1 dm'log;clear;output;clear'; options ps=512 ls=99 nocenter nodate nonumber nolabel FORMCHAR=" ---- + ---+= -/\*"; ODS LISTING; *** Table 23.2 ********************************************;
More informationdm'log;clear;output;clear'; options ps=512 ls=99 nocenter nodate nonumber nolabel FORMCHAR=" = -/\<>*"; ODS LISTING;
dm'log;clear;output;clear'; options ps=512 ls=99 nocenter nodate nonumber nolabel FORMCHAR=" ---- + ---+= -/\*"; ODS LISTING; *** Table 23.2 ********************************************; *** Moore, David
More informationLecture notes on Regression & SAS example demonstration
Regression & Correlation (p. 215) When two variables are measured on a single experimental unit, the resulting data are called bivariate data. You can describe each variable individually, and you can also
More information5.3 Three-Stage Nested Design Example
5.3 Three-Stage Nested Design Example A researcher designs an experiment to study the of a metal alloy. A three-stage nested design was conducted that included Two alloy chemistry compositions. Three ovens
More informationThe program for the following sections follows.
Homework 6 nswer sheet Page 31 The program for the following sections follows. dm'log;clear;output;clear'; *************************************************************; *** EXST734 Homework Example 1
More informationChapter 11 : State SAT scores for 1982 Data Listing
EXST3201 Chapter 12a Geaghan Fall 2005: Page 1 Chapter 12 : Variable selection An example: State SAT scores In 1982 there was concern for scores of the Scholastic Aptitude Test (SAT) scores that varied
More informationa. YOU MAY USE ONE 8.5 X11 TWO-SIDED CHEAT SHEET AND YOUR TEXTBOOK (OR COPY THEREOF).
STAT3503 Test 2 NOTE: a. YOU MAY USE ONE 8.5 X11 TWO-SIDED CHEAT SHEET AND YOUR TEXTBOOK (OR COPY THEREOF). b. YOU MAY USE ANY ELECTRONIC CALCULATOR. c. FOR FULL MARKS YOU MUST SHOW THE FORMULA YOU USE
More informationEXST Regression Techniques Page 1. We can also test the hypothesis H :" œ 0 versus H :"
EXST704 - Regression Techniques Page 1 Using F tests instead of t-tests We can also test the hypothesis H :" œ 0 versus H :" Á 0 with an F test.! " " " F œ MSRegression MSError This test is mathematically
More informationHandout 1: Predicting GPA from SAT
Handout 1: Predicting GPA from SAT appsrv01.srv.cquest.utoronto.ca> appsrv01.srv.cquest.utoronto.ca> ls Desktop grades.data grades.sas oldstuff sasuser.800 appsrv01.srv.cquest.utoronto.ca> cat grades.data
More informationSAS Procedures Inference about the Line ffl model statement in proc reg has many options ffl To construct confidence intervals use alpha=, clm, cli, c
Inference About the Slope ffl As with all estimates, ^fi1 subject to sampling var ffl Because Y jx _ Normal, the estimate ^fi1 _ Normal A linear combination of indep Normals is Normal Simple Linear Regression
More informationSTATISTICS 479 Exam II (100 points)
Name STATISTICS 79 Exam II (1 points) 1. A SAS data set was created using the following input statement: Answer parts(a) to (e) below. input State $ City $ Pop199 Income Housing Electric; (a) () Give the
More informationTopic 18: Model Selection and Diagnostics
Topic 18: Model Selection and Diagnostics Variable Selection We want to choose a best model that is a subset of the available explanatory variables Two separate problems 1. How many explanatory variables
More informationBE640 Intermediate Biostatistics 2. Regression and Correlation. Simple Linear Regression Software: SAS. Emergency Calls to the New York Auto Club
BE640 Intermediate Biostatistics 2. Regression and Correlation Simple Linear Regression Software: SAS Emergency Calls to the New York Auto Club Source: Chatterjee, S; Handcock MS and Simonoff JS A Casebook
More informationSTA 302 H1F / 1001 HF Fall 2007 Test 1 October 24, 2007
STA 302 H1F / 1001 HF Fall 2007 Test 1 October 24, 2007 LAST NAME: SOLUTIONS FIRST NAME: STUDENT NUMBER: ENROLLED IN: (circle one) STA 302 STA 1001 INSTRUCTIONS: Time: 90 minutes Aids allowed: calculator.
More informationMulticollinearity Exercise
Multicollinearity Exercise Use the attached SAS output to answer the questions. [OPTIONAL: Copy the SAS program below into the SAS editor window and run it.] You do not need to submit any output, so there
More information3 Variables: Cyberloafing Conscientiousness Age
title 'Cyberloafing, Mike Sage'; run; PROC CORR data=sage; var Cyberloafing Conscientiousness Age; run; quit; The CORR Procedure 3 Variables: Cyberloafing Conscientiousness Age Simple Statistics Variable
More informationDescriptions of post-hoc tests
Experimental Statistics II Page 81 Descriptions of post-hoc tests Post-hoc or Post-ANOVA tests! Once you have found out some treatment(s) are different, how do you determine which one(s) are different?
More informationLINEAR REGRESSION. Copyright 2013, SAS Institute Inc. All rights reserved.
LINEAR REGRESSION LINEAR REGRESSION REGRESSION AND OTHER MODELS Type of Response Type of Predictors Categorical Continuous Continuous and Categorical Continuous Analysis of Variance (ANOVA) Ordinary Least
More informationLecture 11: Simple Linear Regression
Lecture 11: Simple Linear Regression Readings: Sections 3.1-3.3, 11.1-11.3 Apr 17, 2009 In linear regression, we examine the association between two quantitative variables. Number of beers that you drink
More informationAnswer Keys to Homework#10
Answer Keys to Homework#10 Problem 1 Use either restricted or unrestricted mixed models. Problem 2 (a) First, the respective means for the 8 level combinations are listed in the following table A B C Mean
More informationLecture 3. Experiments with a Single Factor: ANOVA Montgomery 3-1 through 3-3
Lecture 3. Experiments with a Single Factor: ANOVA Montgomery 3-1 through 3-3 Page 1 Tensile Strength Experiment Investigate the tensile strength of a new synthetic fiber. The factor is the weight percent
More informationOutline. Review regression diagnostics Remedial measures Weighted regression Ridge regression Robust regression Bootstrapping
Topic 19: Remedies Outline Review regression diagnostics Remedial measures Weighted regression Ridge regression Robust regression Bootstrapping Regression Diagnostics Summary Check normality of the residuals
More information7.3 Ridge Analysis of the Response Surface
7.3 Ridge Analysis of the Response Surface When analyzing a fitted response surface, the researcher may find that the stationary point is outside of the experimental design region, but the researcher wants
More informationAssignment 9 Answer Keys
Assignment 9 Answer Keys Problem 1 (a) First, the respective means for the 8 level combinations are listed in the following table A B C Mean 26.00 + 34.67 + 39.67 + + 49.33 + 42.33 + + 37.67 + + 54.67
More informationDetecting and Assessing Data Outliers and Leverage Points
Chapter 9 Detecting and Assessing Data Outliers and Leverage Points Section 9.1 Background Background Because OLS estimators arise due to the minimization of the sum of squared errors, large residuals
More informationBiological Applications of ANOVA - Examples and Readings
BIO 575 Biological Applications of ANOVA - Winter Quarter 2010 Page 1 ANOVA Pac Biological Applications of ANOVA - Examples and Readings One-factor Model I (Fixed Effects) This is the same example for
More informationMultiple Linear Regression
Multiple Linear Regression University of California, San Diego Instructor: Ery Arias-Castro http://math.ucsd.edu/~eariasca/teaching.html 1 / 42 Passenger car mileage Consider the carmpg dataset taken from
More informationStatistics for exp. medical researchers Regression and Correlation
Faculty of Health Sciences Regression analysis Statistics for exp. medical researchers Regression and Correlation Lene Theil Skovgaard Sept. 28, 2015 Linear regression, Estimation and Testing Confidence
More informationLecture 3. Experiments with a Single Factor: ANOVA Montgomery 3.1 through 3.3
Lecture 3. Experiments with a Single Factor: ANOVA Montgomery 3.1 through 3.3 Fall, 2013 Page 1 Tensile Strength Experiment Investigate the tensile strength of a new synthetic fiber. The factor is the
More informationLecture 3: Inference in SLR
Lecture 3: Inference in SLR STAT 51 Spring 011 Background Reading KNNL:.1.6 3-1 Topic Overview This topic will cover: Review of hypothesis testing Inference about 1 Inference about 0 Confidence Intervals
More informationStatistical Modelling in Stata 5: Linear Models
Statistical Modelling in Stata 5: Linear Models Mark Lunt Arthritis Research UK Epidemiology Unit University of Manchester 07/11/2017 Structure This Week What is a linear model? How good is my model? Does
More informationIES 612/STA 4-573/STA Winter 2008 Week 1--IES 612-STA STA doc
IES 612/STA 4-573/STA 4-576 Winter 2008 Week 1--IES 612-STA 4-573-STA 4-576.doc Review Notes: [OL] = Ott & Longnecker Statistical Methods and Data Analysis, 5 th edition. [Handouts based on notes prepared
More informationSTAT 4385 Topic 06: Model Diagnostics
STAT 4385 Topic 06: Xiaogang Su, Ph.D. Department of Mathematical Science University of Texas at El Paso xsu@utep.edu Spring, 2016 1/ 40 Outline Several Types of Residuals Raw, Standardized, Studentized
More informationBooklet of Code and Output for STAC32 Final Exam
Booklet of Code and Output for STAC32 Final Exam December 8, 2014 List of Figures in this document by page: List of Figures 1 Popcorn data............................. 2 2 MDs by city, with normal quantile
More informationSTOR 455 STATISTICAL METHODS I
STOR 455 STATISTICAL METHODS I Jan Hannig Mul9variate Regression Y=X β + ε X is a regression matrix, β is a vector of parameters and ε are independent N(0,σ) Es9mated parameters b=(x X) - 1 X Y Predicted
More informationRegression Model Building
Regression Model Building Setting: Possibly a large set of predictor variables (including interactions). Goal: Fit a parsimonious model that explains variation in Y with a small set of predictors Automated
More informationOverview Scatter Plot Example
Overview Topic 22 - Linear Regression and Correlation STAT 5 Professor Bruce Craig Consider one population but two variables For each sampling unit observe X and Y Assume linear relationship between variables
More informationStat 500 Midterm 2 12 November 2009 page 0 of 11
Stat 500 Midterm 2 12 November 2009 page 0 of 11 Please put your name on the back of your answer book. Do NOT put it on the front. Thanks. Do not start until I tell you to. The exam is closed book, closed
More informationChapter 2 Inferences in Simple Linear Regression
STAT 525 SPRING 2018 Chapter 2 Inferences in Simple Linear Regression Professor Min Zhang Testing for Linear Relationship Term β 1 X i defines linear relationship Will then test H 0 : β 1 = 0 Test requires
More informationPubH 7405: REGRESSION ANALYSIS SLR: DIAGNOSTICS & REMEDIES
PubH 7405: REGRESSION ANALYSIS SLR: DIAGNOSTICS & REMEDIES Normal Error RegressionModel : Y = β 0 + β ε N(0,σ 2 1 x ) + ε The Model has several parts: Normal Distribution, Linear Mean, Constant Variance,
More informationssh tap sas913, sas
B. Kedem, STAT 430 SAS Examples SAS8 ===================== ssh xyz@glue.umd.edu, tap sas913, sas https://www.statlab.umd.edu/sasdoc/sashtml/onldoc.htm Multiple Regression ====================== 0. Show
More informationANALYSIS OF VARIANCE OF BALANCED DAIRY SCIENCE DATA USING SAS
ANALYSIS OF VARIANCE OF BALANCED DAIRY SCIENCE DATA USING SAS Ravinder Malhotra and Vipul Sharma National Dairy Research Institute, Karnal-132001 The most common use of statistics in dairy science is testing
More informationAMS 315/576 Lecture Notes. Chapter 11. Simple Linear Regression
AMS 315/576 Lecture Notes Chapter 11. Simple Linear Regression 11.1 Motivation A restaurant opening on a reservations-only basis would like to use the number of advance reservations x to predict the number
More informationStatistics 5100 Spring 2018 Exam 1
Statistics 5100 Spring 2018 Exam 1 Directions: You have 60 minutes to complete the exam. Be sure to answer every question, and do not spend too much time on any part of any question. Be concise with all
More informationSTATISTICS 110/201 PRACTICE FINAL EXAM
STATISTICS 110/201 PRACTICE FINAL EXAM Questions 1 to 5: There is a downloadable Stata package that produces sequential sums of squares for regression. In other words, the SS is built up as each variable
More informationLINEAR REGRESSION ANALYSIS. MODULE XVI Lecture Exercises
LINEAR REGRESSION ANALYSIS MODULE XVI Lecture - 44 Exercises Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Exercise 1 The following data has been obtained on
More informationHomework 2: Simple Linear Regression
STAT 4385 Applied Regression Analysis Homework : Simple Linear Regression (Simple Linear Regression) Thirty (n = 30) College graduates who have recently entered the job market. For each student, the CGPA
More informationContents. 1 Review of Residuals. 2 Detecting Outliers. 3 Influential Observations. 4 Multicollinearity and its Effects
Contents 1 Review of Residuals 2 Detecting Outliers 3 Influential Observations 4 Multicollinearity and its Effects W. Zhou (Colorado State University) STAT 540 July 6th, 2015 1 / 32 Model Diagnostics:
More informationStatistics for Managers using Microsoft Excel 6 th Edition
Statistics for Managers using Microsoft Excel 6 th Edition Chapter 13 Simple Linear Regression 13-1 Learning Objectives In this chapter, you learn: How to use regression analysis to predict the value of
More informationCorrelation and Regression
Correlation and Regression October 25, 2017 STAT 151 Class 9 Slide 1 Outline of Topics 1 Associations 2 Scatter plot 3 Correlation 4 Regression 5 Testing and estimation 6 Goodness-of-fit STAT 151 Class
More informationEXST 7015 Fall 2014 Lab 08: Polynomial Regression
EXST 7015 Fall 2014 Lab 08: Polynomial Regression OBJECTIVES Polynomial regression is a statistical modeling technique to fit the curvilinear data that either shows a maximum or a minimum in the curve,
More informationChapter 8 Quantitative and Qualitative Predictors
STAT 525 FALL 2017 Chapter 8 Quantitative and Qualitative Predictors Professor Dabao Zhang Polynomial Regression Multiple regression using X 2 i, X3 i, etc as additional predictors Generates quadratic,
More informationReview: Second Half of Course Stat 704: Data Analysis I, Fall 2014
Review: Second Half of Course Stat 704: Data Analysis I, Fall 2014 Tim Hanson, Ph.D. University of South Carolina T. Hanson (USC) Stat 704: Data Analysis I, Fall 2014 1 / 13 Chapter 8: Polynomials & Interactions
More informationBasic Business Statistics 6 th Edition
Basic Business Statistics 6 th Edition Chapter 12 Simple Linear Regression Learning Objectives In this chapter, you learn: How to use regression analysis to predict the value of a dependent variable based
More informationChapter 1 Linear Regression with One Predictor
STAT 525 FALL 2018 Chapter 1 Linear Regression with One Predictor Professor Min Zhang Goals of Regression Analysis Serve three purposes Describes an association between X and Y In some applications, the
More informationComparison of a Population Means
Analysis of Variance Interested in comparing Several treatments Several levels of one treatment Comparison of a Population Means Could do numerous two-sample t-tests but... ANOVA provides method of joint
More informationSPECIAL TOPICS IN REGRESSION ANALYSIS
1 SPECIAL TOPICS IN REGRESSION ANALYSIS Representing Nominal Scales in Regression Analysis There are several ways in which a set of G qualitative distinctions on some variable of interest can be represented
More informationCorrelation Analysis
Simple Regression Correlation Analysis Correlation analysis is used to measure strength of the association (linear relationship) between two variables Correlation is only concerned with strength of the
More informationFailure Time of System due to the Hot Electron Effect
of System due to the Hot Electron Effect 1 * exresist; 2 option ls=120 ps=75 nocenter nodate; 3 title of System due to the Hot Electron Effect ; 4 * TIME = failure time (hours) of a system due to drift
More informationSTATISTICS 174: APPLIED STATISTICS FINAL EXAM DECEMBER 10, 2002
Time allowed: 3 HOURS. STATISTICS 174: APPLIED STATISTICS FINAL EXAM DECEMBER 10, 2002 This is an open book exam: all course notes and the text are allowed, and you are expected to use your own calculator.
More informationSingle Factor Experiments
Single Factor Experiments Bruce A Craig Department of Statistics Purdue University STAT 514 Topic 4 1 Analysis of Variance Suppose you are interested in comparing either a different treatments a levels
More informationSTAT 3A03 Applied Regression With SAS Fall 2017
STAT 3A03 Applied Regression With SAS Fall 2017 Assignment 2 Solution Set Q. 1 I will add subscripts relating to the question part to the parameters and their estimates as well as the errors and residuals.
More informationT-test: means of Spock's judge versus all other judges 1 12:10 Wednesday, January 5, judge1 N Mean Std Dev Std Err Minimum Maximum
T-test: means of Spock's judge versus all other judges 1 The TTEST Procedure Variable: pcwomen judge1 N Mean Std Dev Std Err Minimum Maximum OTHER 37 29.4919 7.4308 1.2216 16.5000 48.9000 SPOCKS 9 14.6222
More information9 Correlation and Regression
9 Correlation and Regression SW, Chapter 12. Suppose we select n = 10 persons from the population of college seniors who plan to take the MCAT exam. Each takes the test, is coached, and then retakes the
More informationStatistical Techniques II EXST7015 Simple Linear Regression
Statistical Techniques II EXST7015 Simple Linear Regression 03a_SLR 1 Y - the dependent variable 35 30 25 The objective Given points plotted on two coordinates, Y and X, find the best line to fit the data.
More informationChapter 6 Multiple Regression
STAT 525 FALL 2018 Chapter 6 Multiple Regression Professor Min Zhang The Data and Model Still have single response variable Y Now have multiple explanatory variables Examples: Blood Pressure vs Age, Weight,
More informationholding all other predictors constant
Multiple Regression Numeric Response variable (y) p Numeric predictor variables (p < n) Model: Y = b 0 + b 1 x 1 + + b p x p + e Partial Regression Coefficients: b i effect (on the mean response) of increasing
More informationSTAT 3A03 Applied Regression Analysis With SAS Fall 2017
STAT 3A03 Applied Regression Analysis With SAS Fall 2017 Assignment 5 Solution Set Q. 1 a The code that I used and the output is as follows PROC GLM DataS3A3.Wool plotsnone; Class Amp Len Load; Model CyclesAmp
More informationCh 2: Simple Linear Regression
Ch 2: Simple Linear Regression 1. Simple Linear Regression Model A simple regression model with a single regressor x is y = β 0 + β 1 x + ɛ, where we assume that the error ɛ is independent random component
More informationMath 3330: Solution to midterm Exam
Math 3330: Solution to midterm Exam Question 1: (14 marks) Suppose the regression model is y i = β 0 + β 1 x i + ε i, i = 1,, n, where ε i are iid Normal distribution N(0, σ 2 ). a. (2 marks) Compute the
More informationInferences for Regression
Inferences for Regression An Example: Body Fat and Waist Size Looking at the relationship between % body fat and waist size (in inches). Here is a scatterplot of our data set: Remembering Regression In
More informationLecture 1: Linear Models and Applications
Lecture 1: Linear Models and Applications Claudia Czado TU München c (Claudia Czado, TU Munich) ZFS/IMS Göttingen 2004 0 Overview Introduction to linear models Exploratory data analysis (EDA) Estimation
More informationTopic 2. Chapter 3: Diagnostics and Remedial Measures
Topic Overview This topic will cover Regression Diagnostics Remedial Measures Statistics 512: Applied Linear Models Some other Miscellaneous Topics Topic 2 Chapter 3: Diagnostics and Remedial Measures
More informationAny of 27 linear and nonlinear models may be fit. The output parallels that of the Simple Regression procedure.
STATGRAPHICS Rev. 9/13/213 Calibration Models Summary... 1 Data Input... 3 Analysis Summary... 5 Analysis Options... 7 Plot of Fitted Model... 9 Predicted Values... 1 Confidence Intervals... 11 Observed
More information3rd Quartile. 1st Quartile) Minimum
EXST7034 - Regression Techniques Page 1 Regression diagnostics dependent variable Y3 There are a number of graphic representations which will help with problem detection and which can be used to obtain
More informationLecture 1 Linear Regression with One Predictor Variable.p2
Lecture Linear Regression with One Predictor Variablep - Basics - Meaning of regression parameters p - β - the slope of the regression line -it indicates the change in mean of the probability distn of
More informationAnalysis of variance and regression. April 17, Contents Comparison of several groups One-way ANOVA. Two-way ANOVA Interaction Model checking
Analysis of variance and regression Contents Comparison of several groups One-way ANOVA April 7, 008 Two-way ANOVA Interaction Model checking ANOVA, April 008 Comparison of or more groups Julie Lyng Forman,
More informationMultiple Linear Regression
Chapter 3 Multiple Linear Regression 3.1 Introduction Multiple linear regression is in some ways a relatively straightforward extension of simple linear regression that allows for more than one independent
More information1 A Review of Correlation and Regression
1 A Review of Correlation and Regression SW, Chapter 12 Suppose we select n = 10 persons from the population of college seniors who plan to take the MCAT exam. Each takes the test, is coached, and then
More informationChapter 16. Simple Linear Regression and dcorrelation
Chapter 16 Simple Linear Regression and dcorrelation 16.1 Regression Analysis Our problem objective is to analyze the relationship between interval variables; regression analysis is the first tool we will
More informationSTAT420 Midterm Exam. University of Illinois Urbana-Champaign October 19 (Friday), :00 4:15p. SOLUTIONS (Yellow)
STAT40 Midterm Exam University of Illinois Urbana-Champaign October 19 (Friday), 018 3:00 4:15p SOLUTIONS (Yellow) Question 1 (15 points) (10 points) 3 (50 points) extra ( points) Total (77 points) Points
More informationAnalysis of Variance. Source DF Squares Square F Value Pr > F. Model <.0001 Error Corrected Total
Math 221: Linear Regression and Prediction Intervals S. K. Hyde Chapter 23 (Moore, 5th Ed.) (Neter, Kutner, Nachsheim, and Wasserman) The Toluca Company manufactures refrigeration equipment as well as
More informationWeek 7.1--IES 612-STA STA doc
Week 7.1--IES 612-STA 4-573-STA 4-576.doc IES 612/STA 4-576 Winter 2009 ANOVA MODELS model adequacy aka RESIDUAL ANALYSIS Numeric data samples from t populations obtained Assume Y ij ~ independent N(μ
More informationK. Model Diagnostics. residuals ˆɛ ij = Y ij ˆµ i N = Y ij Ȳ i semi-studentized residuals ω ij = ˆɛ ij. studentized deleted residuals ɛ ij =
K. Model Diagnostics We ve already seen how to check model assumptions prior to fitting a one-way ANOVA. Diagnostics carried out after model fitting by using residuals are more informative for assessing
More informationSTAT 212 Business Statistics II 1
STAT 1 Business Statistics II 1 KING FAHD UNIVERSITY OF PETROLEUM & MINERALS DEPARTMENT OF MATHEMATICAL SCIENCES DHAHRAN, SAUDI ARABIA STAT 1: BUSINESS STATISTICS II Semester 091 Final Exam Thursday Feb
More informationGeneral Linear Model (Chapter 4)
General Linear Model (Chapter 4) Outcome variable is considered continuous Simple linear regression Scatterplots OLS is BLUE under basic assumptions MSE estimates residual variance testing regression coefficients
More informationLecture 12 Inference in MLR
Lecture 12 Inference in MLR STAT 512 Spring 2011 Background Reading KNNL: 6.6-6.7 12-1 Topic Overview Review MLR Model Inference about Regression Parameters Estimation of Mean Response Prediction 12-2
More informationAnalysis of variance. April 16, Contents Comparison of several groups
Contents Comparison of several groups Analysis of variance April 16, 2009 One-way ANOVA Two-way ANOVA Interaction Model checking Acknowledgement for use of presentation Julie Lyng Forman, Dept. of Biostatistics
More informationStat 5102 Final Exam May 14, 2015
Stat 5102 Final Exam May 14, 2015 Name Student ID The exam is closed book and closed notes. You may use three 8 1 11 2 sheets of paper with formulas, etc. You may also use the handouts on brand name distributions
More informationAnalysis of variance. April 16, 2009
Analysis of variance April 16, 2009 Contents Comparison of several groups One-way ANOVA Two-way ANOVA Interaction Model checking Acknowledgement for use of presentation Julie Lyng Forman, Dept. of Biostatistics
More information7 The Analysis of Response Surfaces
7 The Analysis of Response Surfaces Goal: the researcher is seeking the experimental conditions which are most desirable, that is, determine optimum design variable levels. Once the researcher has determined
More informationLectures on Simple Linear Regression Stat 431, Summer 2012
Lectures on Simple Linear Regression Stat 43, Summer 0 Hyunseung Kang July 6-8, 0 Last Updated: July 8, 0 :59PM Introduction Previously, we have been investigating various properties of the population
More informationFirst Year Examination Department of Statistics, University of Florida
First Year Examination Department of Statistics, University of Florida August 19, 010, 8:00 am - 1:00 noon Instructions: 1. You have four hours to answer questions in this examination.. You must show your
More informationSections 7.1, 7.2, 7.4, & 7.6
Sections 7.1, 7.2, 7.4, & 7.6 Adapted from Timothy Hanson Department of Statistics, University of South Carolina Stat 704: Data Analysis I 1 / 25 Chapter 7 example: Body fat n = 20 healthy females 25 34
More informationTopic 13. Analysis of Covariance (ANCOVA) [ST&D chapter 17] 13.1 Introduction Review of regression concepts
Topic 13. Analysis of Covariance (ANCOVA) [ST&D chapter 17] 13.1 Introduction The analysis of covariance (ANCOVA) is a technique that is occasionally useful for improving the precision of an experiment.
More information