Regression With a Categorical Independent Variable

Size: px
Start display at page:

Download "Regression With a Categorical Independent Variable"

Transcription

1 Regression With a Independent Variable Lecture 10 November 5, 2008 ERSH 8320 Lecture #10-11/5/2008 Slide 1 of 54

2 Today s Lecture Today s Lecture Chapter 11: Regression with a single categorical independent variable. Coding procedures for analysis. Dummy coding. Relationship between categorical independent variable regression and other statistical terms. Lecture #10-11/5/2008 Slide 2 of 54

3 Regression with Continuous Regression Basics Not A Good Idea Research Design Analysis Specs Example Example Analysis Linear regression regresses a continuous-valued dependent variable, Y, onto a set of continuous-valued independent variables X. The regression line gives the estimate of the mean of Y conditional on the values of X, or E(Y X). But what happens when some or all independent variables are categorical in nature? Is the point of the regression to determine E(Y X), across the levels of Y? Can t we just put the categorical variables into SPSS and push the Continue" button? Lecture #10-11/5/2008 Slide 3 of 54

4 Example Data Set Regression Basics Not A Good Idea Research Design Analysis Specs Example Example Analysis Neter (1996, p. 676). The Kenton Food Company wished to test four different package designs for a new breakfast cereal. Twenty stores, with approximately equal sales volumes, were selected as the experimental units. Each store was randomly assigned one of the package designs, with each package design assigned to five stores. The stores were chosen to be comparable in location and sales volume. Other relevant conditions that could affect sales, such as price, amount and location of shelf space, and special promotional efforts, were kept the same for all of the stores in the experiment. Lecture #10-11/5/2008 Slide 4 of 54

5 A Regular Regression? W Regression Basics Not A Good Idea Research Design Analysis Specs Example Example Analysis Number of Cases Sold Number of Cases Sold = * package R Square = 0.64 W W W W W W W W W W W W W W W W W W W Package Type What is wrong with this picture? Lecture #10-11/5/2008 Slide 5 of 54

6 Regression Basics Not A Good Idea Research Design Analysis Specs Example Example Analysis variables commonly occur in research settings. Another term sometimes used to describe for categorial variables is that of qualitative variables. A strict definition of a qualitative or categorical variable is that of a variable that has a finite number of levels. Continuous (or quantitative) variables, alternatively, have infinitely many levels. Often this is assumed more than practiced. Quantitative variables often have countably many levels. Level of precision of an instrument can limit the number of levels of a quantitative variable. Lecture #10-11/5/2008 Slide 6 of 54

7 Research Design Regression Basics Not A Good Idea Research Design Analysis Specs Example Example Analysis variables can occur in many different research designs: Experimental research. Quasi-experimental research. Nonexperimental/Observational research. Such variables can be used with regression for: Prediction. Explanation. Lecture #10-11/5/2008 Slide 7 of 54

8 Analysis Specifics Regression Basics Not A Good Idea Research Design Analysis Specs Example Example Analysis Because of nature of categorical variables, emphasis of regression is not on linear trends but on differences between means (of Y ) at each level of the category. Not all categorical variables are ordered (like cereal box type, gender,etc...). When considering differences in the mean of the dependent variable, the type of analysis being conducted by a regression is commonly called an ANalysis Of VAriance (ANOVA). Combinations of categorical and continuous variables in the same regression is called ANalysis Of CoVAriance (ANCOVA - Chapters 14 and 15). Lecture #10-11/5/2008 Slide 8 of 54

9 Example Variable: Two Categories Regression Basics Not A Good Idea Research Design Analysis Specs Example Example Analysis From Pedhazur (1997; p. 343): Assume that the data reported [below] were obtained in an experiment in which E represents an experimental group and C represents a control group. E C Y Ȳ (Y Ȳ ) 2 = y Lecture #10-11/5/2008 Slide 9 of 54

10 Old School Statistics: The t-test Regression Basics Not A Good Idea Research Design Analysis Specs Example Example Analysis As you may recall from an earlier course on statistics, an easy way to determine if the means of the two conditions differ significantly is to use a t-test (with n 1 + n 2 2) degrees of freedom. H 0 µ 1 = µ 2 H A µ 1 µ 2 t = Σy 2 1 +Σy2 2 n 1 +n 2 2 Ȳ 1 Ȳ 2 ( ) 1 n n 2 Lecture #10-11/5/2008 Slide 10 of 54

11 Old School Statistics: The t-test t = ( ) = 4 3 = 2.31 Regression Basics Not A Good Idea Research Design Analysis Specs Example Example Analysis From Excel ( =tdist(2.31,8,2) ), p = If we used a Type-I error rate of 0.05, we would reject the null hypothesis, and conclude the means of the two groups were significantly different. But what if we had more than two groups?. This type of problem can be solved equivalently from within the context of the General Linear Model. Lecture #10-11/5/2008 Slide 11 of 54

12 When using categorical variables in regression, levels of the categories must be recoded from their original value to ensure the regression model truly estimates the mean differences at levels of the categories. Several types of coding strategies are common: Dummy coding. Effect coding. Each type will produce the same fit of the model (via R 2 ). The estimated regression parameters are different across coding types, thereby representing the true difference in approaches employed by each type of coding. The choice of method of coding does not differ as a function of the type of research or analysis or purpose (explanation or prediction) of the analysis. Lecture #10-11/5/2008 Slide 12 of 54

13 Definition: a code is a set of symbols to which meanings can be assigned (Pedhazur, 1997; p. 342). The assignment of symbols follows a rule (or set of rules) determined by the categories of the variable used. Typically symbols represent the respective levels of a categorical variable. All entities within the same symbol are considered alike (or homogeneous) within that category level. levels must be predetermined prior to analysis. Some variables are obviously categorical - gender. Some variables are not so obviously categorial - political affiliation. Lecture #10-11/5/2008 Slide 13 of 54

14 The most straight-forward method of coding categorical variables is dummy coding. Example: Dummy Coded Example 1 Example 2 Example 3 In dummy coding, one creates a set of variables that represent the membership of an observation to a given category level. If an observation is a member of a specific category level, they are given a value of 1 in that category level s variable. If an observation is not a member of a specific category, they are given a value of 0 in that category level s variable. Lecture #10-11/5/2008 Slide 14 of 54

15 For each observation, a no more that a single 1 will appear in the set of columns for that variable. Example: Dummy Coded Example 1 Example 2 Example 3 The columns represent the predictor variables in a regression analysis, where the dependent variable is modeled as a function of these columns. Because of linear dependence with an intercept, one category-level column is often excluded from the analysis. Because all observations at a given category level have the same value across the set of predictors, the predicted value of the dependent variable, Y, will be identical for all observations within a category. The set of category columns (and a vector for an intercept) are now used as input into a regression model. Lecture #10-11/5/2008 Slide 15 of 54

16 Dummy Coded Regression Example Example: Dummy Coded Example 1 Example 2 Example 3 Y X 1 X 2 X 3 Group E E E E E C C C C C Mean SS yx2 = 10 yx3 = 10 Lecture #10-11/5/2008 Slide 16 of 54

17 Dummy Coded Regression The General Linear Model states that the estimated regression parameters are given by: b = (X X) 1 X y Example: Dummy Coded Example 1 Example 2 Example 3 From the previous slide, you can see what our entries for X could be, but... Notice that X 1 = X 2 + X 3. This linear dependency means that: (X X) is a singular matrix - no inverse exists. Any combination of two of the columns would rid us of the linear dependency. Lecture #10-11/5/2008 Slide 17 of 54

18 Dummy Coded Regression - X 2 and X 3 For our first example analysis, consider the regression of Y on X 2 and X 3. Y = b 2 X 2 + b 3 X 3 + e Example: Dummy Coded Example 1 Example 2 Example 3 b 2 = 17 b 3 = 13 y2 = 100 SS res = X X = 60 SS reg = = 40 R 2 = = 0.4 Lecture #10-11/5/2008 Slide 18 of 54

19 Dummy Coded Regression - X 2 and X 3 Example: Dummy Coded Example 1 Example 2 Example 3 b 2 = 17 is the mean for the E category. b 3 = 13 is the mean for the C category. Without an intercept, the model is fairly easy to interpret. For more advanced models, an intercept will prove to be helpful in interpretation. Lecture #10-11/5/2008 Slide 19 of 54

20 Dummy Coded Regression - X 1 and X 2 For our second example analysis, consider the regression of Y on X 1 and X 2. Y = a + b 2 X 2 + e Example: Dummy Coded Example 1 Example 2 Example 3 a = 13 b 2 = 4 y2 = 100 SS res = X X = 60 SS reg = = 40 R 2 = = 0.4 Lecture #10-11/5/2008 Slide 20 of 54

21 Dummy Coded Regression - X 2 and X 3 a = 13 is the mean for the C category. Example: Dummy Coded Example 1 Example 2 Example 3 b 2 = 4 is the mean difference between the E category and the C category. The C category is called reference category. For members of the C category: Y = a + b 2 X 2 = (0) = 13 For members of the E category: Y = a + b 2 X 2 = (1) = 17 With the intercept, the model parameters are now different from the first example. The fit of the model, however, is the same. Lecture #10-11/5/2008 Slide 21 of 54

22 Dummy Coded Regression - X 1 and X 3 For our third example analysis, consider the regression of Y on X 1 and X 3. Y = a + b 3 X 3 + e Example: Dummy Coded Example 1 Example 2 Example 3 a = 17 b 3 = 4 y2 = 100 SS res = X X = 60 SS reg = = 40 R 2 = = 0.4 Lecture #10-11/5/2008 Slide 22 of 54

23 Dummy Coded Regression - X 1 and X 3 a = 17 is the mean for the E category. Example: Dummy Coded Example 1 Example 2 Example 3 b 3 = 4 is the mean difference between the C category and the E category. The E category is called reference category. For members of the E category: Y = a + b 3 X 3 = 17 4(0) = 17 For members of the E category: Y = a + b 3 X 3 = 17 4(1) = 13 With the intercept, the model parameters are now different from the first example. The fit of the model, however, is the same. Lecture #10-11/5/2008 Slide 23 of 54

24 Hypothesis Test of the Regression Coefficien Because each model had the same value for R 2 and the same number of degrees of freedom for the regression (1), all hypothesis tests of the model parameters will result in the same value of the test statistic. Example: Dummy Coded Example 1 Example 2 Example 3 F = R 2 /k (1 R 2 )/(N k 1) = 0.4/1 (1 0.4)/(10 1 1) = 5.33 From Excel ( =fdist(5.33,1,8) ), p = If we used a Type-I error rate of 0.05, we would reject the null hypothesis, and conclude the regression coefficient for each analysis would be significantly different from zero. Lecture #10-11/5/2008 Slide 24 of 54

25 Hypothesis Test of the Regression Coefficien Example: Dummy Coded Example 1 Example 2 Example 3 Recall from the t-test of the mean difference, t = For the test of the coefficient, notice that F = t 2. Also notice that the p-values for each hypothesis test were the same, p = The test of the regression coefficient is equivalent to running a t-test when using a single categorical variable with two categories. Lecture #10-11/5/2008 Slide 25 of 54

26 Generalizing the concept of dummy coding, we revisit our first example data set, the cereal experiment data. Recall that there were four different types of cereal boxes. Breakfast Cereal Example A dummy coding scheme would involve creation of four new column vectors, each representing observations from each box type. Just as was the case with two categories, a linear dependency is created if we wanted to use all four variables. Therefore, we must choose which category to remove from the analysis. Lecture #10-11/5/2008 Slide 26 of 54

27 One-Way Analysis of Variance Just as was the case for the example with two categories, a multiple category regression model with a single categorical independent variable has a direct link to a statistical test you may be familiar with. Breakfast Cereal Example The regression model tests for mean differences across all pairings of category levels simultaneously. Testing for a difference between multiple groups equates to a one-way ANOVA model (for a model with a single categorical independent variable). Lecture #10-11/5/2008 Slide 27 of 54

28 Y X 1 X 2 X 3 X 4 X 5 Type

29 Breakfast Cereal Example Breakfast Cereal Example To make things interesting, let s drop X 5 from our analysis. Y = a + b 2 X 2 + b 3 X 3 + b 4 X 4 + e Because X 5 (representing box type four) was omitted from our model, the estimated intercept parameter now represents the mean for group X 5. All other parameters represent the difference between their respective category level and category level four with respect to the dependent variable. a = 27.2 b 2 = 12.6 b 3 = 13.8 b 4 = 7.8 Lecture #10-11/5/2008 Slide 28 of 54

30 Breakfast Cereal Example Therefore: Ȳ A = Y A = a + b 2 (1) + b 3 (0) + b 4 (0) = = 14.6 Breakfast Cereal Example Ȳ B = Y B = a + b 2 (0) + b 3 (1) + b 4 (0) = = 13.4 Ȳ C = Y C = a + b 2 (0) + b 3 (0) + b 4 (1) = = 19.4 Ȳ D = Y D = a + b 2 (0) + b 3 (0) + b 4 (0) = 27.2 Lecture #10-11/5/2008 Slide 29 of 54

31 Hypothesis Test Breakfast Cereal Example To test that all means are equal to each other (H 0 : µ 1 = µ 2 =... = µ k ) against the hypothesis that at least one mean differs (H 1 : At least one µ µ ), called an omnibus test, the same hypothesis test from before can be used: F = R 2 /k (1 R 2 )/(N k 1) = 0.4/1 (1 0.4)/(10 1 1) = 5.33 y 2 = SS res = SS reg = = R 2 = 854.6/ = Lecture #10-11/5/2008 Slide 30 of 54

32 Hypothesis Tests F = R 2 /k (1 R 2 )/(N k 1) = 0.844/3 ( )/(20 3 1) = Breakfast Cereal Example From Excel ( =fdist(28.77,3,16) ), p = If we used a Type-I error rate of 0.05, we would reject the null hypothesis, and conclude that at least one regression coefficient for this analysis would be significantly different from zero. Having a regression coefficient of zero means having zero difference between two means (reference and specific category being compared). Having all regression coefficients of zero means absolutely no difference between any of the means. Lecture #10-11/5/2008 Slide 31 of 54

33 Effect coding is the less straight-forward method of coding categorical variables when compared with dummy coding. In effect coding, one (again) creates a set of columns that represent the membership of an observation to a given category level. Like dummy coding, the total number of columns for a categorical variable are equal to one less than the total number of category levels. Example: Effect Coded Example 1 Fixed Effects Linear Model Lecture #10-11/5/2008 Slide 32 of 54

34 If an observation is a member of a specific category level, they are given a value of 1 in that category level s column. If an observation is not a member of a specific category and is not a member of the omitted category, they are given a value of 0 in that category level s column. If an observation is a member of the omitted category, they are given a value of -1 in every category level s column. Example: Effect Coded Example 1 Fixed Effects Linear Model Lecture #10-11/5/2008 Slide 33 of 54

35 Example: Effect Coded Example 1 Fixed Effects Linear Model For each observation, a no more that a single 1 will appear in the set of columns for that variable. The columns represent the predictor variables in a regression analysis, where the dependent variable is modeled as a function of these columns. Because all observations at a given category level have the same value across the set of predictors, the predicted value of the dependent variable, Y, will be identical for all observations within a category. The set of category columns (and a column for an intercept) are now used as input into a regression model. Lecture #10-11/5/2008 Slide 34 of 54

36 Effect Coded Regression Example Example: Effect Coded Example 1 Fixed Effects Linear Model Y X 1 X 2 Group E E E E E C C C C C Mean Lecture #10-11/5/2008 Slide 35 of 54

37 Effect Coded Regression The General Linear Model states that the estimated regression parameters are given by: b = (X X) 1 X y Example: Effect Coded Example 1 Fixed Effects Linear Model Lecture #10-11/5/2008 Slide 36 of 54

38 Effect Coded Regression - X 1 and X 2 For our second example analysis, consider the regression of Y on X 1 and X 2. a = 15 b 2 = 2 y2 = 100 Y = a + b 2 X 2 + e Example: Effect Coded Example 1 Fixed Effects Linear Model SS res = X X = 60 SS reg = = 40 R 2 = = 0.4 Lecture #10-11/5/2008 Slide 37 of 54

39 Effect Coded Regression - X 1 and X 2 Example: Effect Coded Example 1 Fixed Effects Linear Model a = 15 is the overall mean of the dependent variable across all categories. b 2 = 2 is the called the effect of the experimental group. This effect represents the difference between the experimental group mean and the overall mean. For members of the E category: Y = a + b 2 X 2 = (1) = 17 For members of the C category: Y = a + b 2 X 2 = ( 1) = 13 The fit of the model is the same as was found in the dummy coding from the previous class. Lecture #10-11/5/2008 Slide 38 of 54

40 The Fixed Effects Linear Model Effect coding is built to estimate the fixed linear effects model. Example: Effect Coded Example 1 Fixed Effects Linear Model Y ij = µ + β j + ǫ ij Y ij is the value of the dependent variable of individual i in group/treatment/category j. µ is the population (grand) mean. β j is the effect of group/treatment/category j. ǫ ij is the error associated with the score of individual i in group/treatment/category j. Lecture #10-11/5/2008 Slide 39 of 54

41 The Fixed Effects Linear Model The fixed effects linear model states that a predicted score for an observation is a composite of the grand mean and the treatment effect of the group to which the observation belongs. Example: Effect Coded Example 1 Fixed Effects Linear Model Y ij = µ + β j + ǫ ij For all category levels (total represented by G), the model has the following constraint: G β g = 0 g=1 Lecture #10-11/5/2008 Slide 40 of 54

42 The Fixed Effects Linear Model This constraint means that the effect for the omitted category level (o) is equal to: β o = g o β g = β 1 β 2... From the example, the effect for the control group is equal to: Example: Effect Coded Example 1 Fixed Effects Linear Model Just to verify: β C = β E = 2 β E + β C = 2 + ( 2) = 0 Lecture #10-11/5/2008 Slide 41 of 54

43 Hypothesis Test of the Regression Coefficien Because each model had the same value for R 2 and the same number of degrees of freedom for the regression (1), all hypothesis tests of the model parameters will result in the same value of the test statistic. F = R 2 /k (1 R 2 )/(N k 1) = 0.4/1 (1 0.4)/(10 1 1) = 5.33 Example: Effect Coded Example 1 Fixed Effects Linear Model From Excel ( =fdist(5.33,1,8) ), p = If we used a Type-I error rate of 0.05, we would reject the null hypothesis, and conclude the regression coefficient for each analysis would be significantly different from zero. Lecture #10-11/5/2008 Slide 42 of 54

44 Breakfast Cereal Example Can You Guess? Predicted Values Generalizing the concept of effect coding, we revisit the cereal experiment data. Recall that there were four different types of cereal boxes. A effect coding scheme would involve creation of three new columns, each representing observations from each box type. The choice of omitted category level is arbitrary. Any level can be omitted and you will get the same results...this is due to the equivalence of linear models under effect coding. Lecture #10-11/5/2008 Slide 43 of 54

45 One-Way Analysis of Variance Just as was the case for the example with two categories, a multiple category regression model with a single categorical independent variable has a direct link to a statistical test you may be familiar with. The regression model tests for mean differences across all pairings of category levels simultaneously. Testing for a difference between multiple groups equates to a one-way ANOVA model (for a model with a single categorical independent variable). Breakfast Cereal Example Can You Guess? Predicted Values Lecture #10-11/5/2008 Slide 44 of 54

46 Y X 1 X 2 X 3 X 4 Type Mean

47 Breakfast Cereal Example Group means: Group Mean We will omit the final category from our analysis. Y ij = µ + β j + ǫ ij Breakfast Cereal Example Can You Guess? Predicted Values Lecture #10-11/5/2008 Slide 45 of 54

48 It s the Guess the Parameter Game Breakfast Cereal Example Can You Guess? Predicted Values Group means: Group Mean Grand mean µ = β 1 = β 2 = β 3 = β 4 = Lecture #10-11/5/2008 Slide 46 of 54

49 It s the Guess the Parameter Game Breakfast Cereal Example Can You Guess? Predicted Values Group means: Group Mean Grand mean µ = (the grand mean). β 1 = = β 2 = = β 3 = = 0.75 β 4 = -(-4.05) - (-5.25) = = 8.55 Lecture #10-11/5/2008 Slide 47 of 54

50 Breakfast Cereal Example Therefore: Breakfast Cereal Example Can You Guess? Predicted Values Ȳ A = Y A = µ + β 1 = = 14.6 Ȳ B = Y B = µ + β 1 = = 13.4 Ȳ C = Y C = µ + β 1 = = 19.4 Ȳ D = Y D = µ + β 1 = = 27.2 R 2 = Lecture #10-11/5/2008 Slide 48 of 54

51 Hypothesis Test To test that all means are equal to each other (H 0 : µ 1 = µ 2 =... = µ k ) against the hypothesis that at least one mean differs (H 1 : At least one µ µ ), called an omnibus test, the same hypothesis test from before can be used: y 2 = SS res = SS reg = = R 2 = / = Breakfast Cereal Example Can You Guess? Predicted Values Lecture #10-11/5/2008 Slide 49 of 54

52 Hypothesis Tests F = R 2 /k (1 R 2 )/(N k 1) = 0.788/3 ( )/(20 3 1) = Breakfast Cereal Example Can You Guess? Predicted Values From Excel ( =fdist(19.803,3,16) ), p = If we used a Type-I error rate of 0.05, we would reject the null hypothesis, and conclude that at least one regression coefficient for this analysis would be significantly different from zero. Having a regression coefficient of zero means having zero difference between the mean of one category and the grand mean. Having all regression coefficients of zero means absolutely no difference between any of the means (all means are equal to the grand mean). Lecture #10-11/5/2008 Slide 50 of 54

53 For a categorical independent variable, a statistically significant R 2 means a rejection of the null hypothesis: H 0 : µ 1 = µ 2 =... = µ g Note that rejection simply means that at least one of the above = signs is truly a. To determine which means are not equal, one of the multiple comparison procedures must be applied. Concerns Lecture #10-11/5/2008 Slide 51 of 54

54 Comparison Concerns The topic of multiple comparisons brings up a wealth of concerns, both from philosophical and statistical points of view. Concerns Most concerns are centered around the potential for an exponential number of post-hoc comparisons, for g groups: ( ) g 2 The phrase capitalization on chance is frequently used to describe many concerns. Even with these concerns, most people still use multiple comparisons for information regarding their analysis. Like most other statistical techniques, know the limitations of a technique is often as important as knowing the results of a technique. Lecture #10-11/5/2008 Slide 52 of 54

55 Final Thought Regression with categorical variables can be accomplished by coding schemes. Differing ways of coding (or inclusion of certain coded column vectors) may change the interpretation of the model parameters, but will not change the overall fit of the model. Final Thought Next Class Lecture #10-11/5/2008 Slide 53 of 54

56 Next Time Lab tonight: Regression with a categorical IV. Homework: Due Wednesday, 11/12, at the start of class. Next week: Chapter 12 - more than one categorical IV. Final Thought Next Class Lecture #10-11/5/2008 Slide 54 of 54

Regression With a Categorical Independent Variable

Regression With a Categorical Independent Variable Regression ith a Independent Variable ERSH 8320 Slide 1 of 34 Today s Lecture Regression with a single categorical independent variable. Today s Lecture Coding procedures for analysis. Dummy coding. Relationship

More information

Regression With a Categorical Independent Variable

Regression With a Categorical Independent Variable Regression With a Categorical Independent Variable Lecture 15 March 17, 2005 Applied Regression Analysis Lecture #15-3/17/2005 Slide 1 of 29 Today s Lecture» Today s Lecture» Midterm Note» Example Regression

More information

Regression With a Categorical Independent Variable: Mean Comparisons

Regression With a Categorical Independent Variable: Mean Comparisons Regression With a Categorical Independent Variable: Mean Lecture 16 March 29, 2005 Applied Regression Analysis Lecture #16-3/29/2005 Slide 1 of 43 Today s Lecture comparisons among means. Today s Lecture

More information

Categorical Predictor Variables

Categorical Predictor Variables Categorical Predictor Variables We often wish to use categorical (or qualitative) variables as covariates in a regression model. For binary variables (taking on only 2 values, e.g. sex), it is relatively

More information

Applied Regression Analysis

Applied Regression Analysis Applied Regression Analysis Lecture 2 January 27, 2005 Lecture #2-1/27/2005 Slide 1 of 46 Today s Lecture Simple linear regression. Partitioning the sum of squares. Tests of significance.. Regression diagnostics

More information

Chapter 8: Regression Models with Qualitative Predictors

Chapter 8: Regression Models with Qualitative Predictors Chapter 8: Regression Models with Qualitative Predictors Some predictors may be binary (e.g., male/female) or otherwise categorical (e.g., small/medium/large). These typically enter the regression model

More information

Variance Partitioning

Variance Partitioning Chapter 9 October 22, 2008 ERSH 8320 Lecture #8-10/22/2008 Slide 1 of 33 Today s Lecture Test review and discussion. Today s Lecture Chapter 9: Muddying the waters of regression. What not to do when considering

More information

Variance Estimates and the F Ratio. ERSH 8310 Lecture 3 September 2, 2009

Variance Estimates and the F Ratio. ERSH 8310 Lecture 3 September 2, 2009 Variance Estimates and the F Ratio ERSH 8310 Lecture 3 September 2, 2009 Today s Class Completing the analysis (the ANOVA table) Evaluating the F ratio Errors in hypothesis testing A complete numerical

More information

Variance Partitioning

Variance Partitioning Lecture 12 March 8, 2005 Applied Regression Analysis Lecture #12-3/8/2005 Slide 1 of 33 Today s Lecture Muddying the waters of regression. What not to do when considering the relative importance of variables

More information

Profile Analysis Multivariate Regression

Profile Analysis Multivariate Regression Lecture 8 October 12, 2005 Analysis Lecture #8-10/12/2005 Slide 1 of 68 Today s Lecture Profile analysis Today s Lecture Schedule : regression review multiple regression is due Thursday, October 27th,

More information

Topic 17 - Single Factor Analysis of Variance. Outline. One-way ANOVA. The Data / Notation. One way ANOVA Cell means model Factor effects model

Topic 17 - Single Factor Analysis of Variance. Outline. One-way ANOVA. The Data / Notation. One way ANOVA Cell means model Factor effects model Topic 17 - Single Factor Analysis of Variance - Fall 2013 One way ANOVA Cell means model Factor effects model Outline Topic 17 2 One-way ANOVA Response variable Y is continuous Explanatory variable is

More information

Chapter 4: Regression Models

Chapter 4: Regression Models Sales volume of company 1 Textbook: pp. 129-164 Chapter 4: Regression Models Money spent on advertising 2 Learning Objectives After completing this chapter, students will be able to: Identify variables,

More information

Chapter 4. Regression Models. Learning Objectives

Chapter 4. Regression Models. Learning Objectives Chapter 4 Regression Models To accompany Quantitative Analysis for Management, Eleventh Edition, by Render, Stair, and Hanna Power Point slides created by Brian Peterson Learning Objectives After completing

More information

Multilevel Models in Matrix Form. Lecture 7 July 27, 2011 Advanced Multivariate Statistical Methods ICPSR Summer Session #2

Multilevel Models in Matrix Form. Lecture 7 July 27, 2011 Advanced Multivariate Statistical Methods ICPSR Summer Session #2 Multilevel Models in Matrix Form Lecture 7 July 27, 2011 Advanced Multivariate Statistical Methods ICPSR Summer Session #2 Today s Lecture Linear models from a matrix perspective An example of how to do

More information

1. (Rao example 11.15) A study measures oxygen demand (y) (on a log scale) and five explanatory variables (see below). Data are available as

1. (Rao example 11.15) A study measures oxygen demand (y) (on a log scale) and five explanatory variables (see below). Data are available as ST 51, Summer, Dr. Jason A. Osborne Homework assignment # - Solutions 1. (Rao example 11.15) A study measures oxygen demand (y) (on a log scale) and five explanatory variables (see below). Data are available

More information

22s:152 Applied Linear Regression. Take random samples from each of m populations.

22s:152 Applied Linear Regression. Take random samples from each of m populations. 22s:152 Applied Linear Regression Chapter 8: ANOVA NOTE: We will meet in the lab on Monday October 10. One-way ANOVA Focuses on testing for differences among group means. Take random samples from each

More information

22s:152 Applied Linear Regression. There are a couple commonly used models for a one-way ANOVA with m groups. Chapter 8: ANOVA

22s:152 Applied Linear Regression. There are a couple commonly used models for a one-way ANOVA with m groups. Chapter 8: ANOVA 22s:152 Applied Linear Regression Chapter 8: ANOVA NOTE: We will meet in the lab on Monday October 10. One-way ANOVA Focuses on testing for differences among group means. Take random samples from each

More information

Difference in two or more average scores in different groups

Difference in two or more average scores in different groups ANOVAs Analysis of Variance (ANOVA) Difference in two or more average scores in different groups Each participant tested once Same outcome tested in each group Simplest is one-way ANOVA (one variable as

More information

BNAD 276 Lecture 10 Simple Linear Regression Model

BNAD 276 Lecture 10 Simple Linear Regression Model 1 / 27 BNAD 276 Lecture 10 Simple Linear Regression Model Phuong Ho May 30, 2017 2 / 27 Outline 1 Introduction 2 3 / 27 Outline 1 Introduction 2 4 / 27 Simple Linear Regression Model Managerial decisions

More information

ST430 Exam 2 Solutions

ST430 Exam 2 Solutions ST430 Exam 2 Solutions Date: November 9, 2015 Name: Guideline: You may use one-page (front and back of a standard A4 paper) of notes. No laptop or textbook are permitted but you may use a calculator. Giving

More information

ANALYTICAL COMPARISONS AMONG TREATMENT MEANS (CHAPTER 4)

ANALYTICAL COMPARISONS AMONG TREATMENT MEANS (CHAPTER 4) ANALYTICAL COMPARISONS AMONG TREATMENT MEANS (CHAPTER 4) ERSH 8310 Fall 2007 September 11, 2007 Today s Class The need for analytic comparisons. Planned comparisons. Comparisons among treatment means.

More information

General Principles Within-Cases Factors Only Within and Between. Within Cases ANOVA. Part One

General Principles Within-Cases Factors Only Within and Between. Within Cases ANOVA. Part One Within Cases ANOVA Part One 1 / 25 Within Cases A case contributes a DV value for every value of a categorical IV It is natural to expect data from the same case to be correlated - NOT independent For

More information

Analytical Comparisons Among Treatment Means (Chapter 4) Analysis of Trend (Chapter 5) ERSH 8310 Fall 2009

Analytical Comparisons Among Treatment Means (Chapter 4) Analysis of Trend (Chapter 5) ERSH 8310 Fall 2009 Analytical Comparisons Among Treatment Means (Chapter 4) Analysis of Trend (Chapter 5) ERSH 8310 Fall 009 September 9, 009 Today s Class Chapter 4 Analytic comparisons The need for analytic comparisons

More information

The One-Way Repeated-Measures ANOVA. (For Within-Subjects Designs)

The One-Way Repeated-Measures ANOVA. (For Within-Subjects Designs) The One-Way Repeated-Measures ANOVA (For Within-Subjects Designs) Logic of the Repeated-Measures ANOVA The repeated-measures ANOVA extends the analysis of variance to research situations using repeated-measures

More information

Review. One-way ANOVA, I. What s coming up. Multiple comparisons

Review. One-way ANOVA, I. What s coming up. Multiple comparisons Review One-way ANOVA, I 9.07 /15/00 Earlier in this class, we talked about twosample z- and t-tests for the difference between two conditions of an independent variable Does a trial drug work better than

More information

Interactions among Continuous Predictors

Interactions among Continuous Predictors Interactions among Continuous Predictors Today s Class: Simple main effects within two-way interactions Conquering TEST/ESTIMATE/LINCOM statements Regions of significance Three-way interactions (and beyond

More information

(ii) Scan your answer sheets INTO ONE FILE only, and submit it in the drop-box.

(ii) Scan your answer sheets INTO ONE FILE only, and submit it in the drop-box. FINAL EXAM ** Two different ways to submit your answer sheet (i) Use MS-Word and place it in a drop-box. (ii) Scan your answer sheets INTO ONE FILE only, and submit it in the drop-box. Deadline: December

More information

Multiple t Tests. Introduction to Analysis of Variance. Experiments with More than 2 Conditions

Multiple t Tests. Introduction to Analysis of Variance. Experiments with More than 2 Conditions Introduction to Analysis of Variance 1 Experiments with More than 2 Conditions Often the research that psychologists perform has more conditions than just the control and experimental conditions You might

More information

Linear Regression. In this lecture we will study a particular type of regression model: the linear regression model

Linear Regression. In this lecture we will study a particular type of regression model: the linear regression model 1 Linear Regression 2 Linear Regression In this lecture we will study a particular type of regression model: the linear regression model We will first consider the case of the model with one predictor

More information

Regression Models. Chapter 4. Introduction. Introduction. Introduction

Regression Models. Chapter 4. Introduction. Introduction. Introduction Chapter 4 Regression Models Quantitative Analysis for Management, Tenth Edition, by Render, Stair, and Hanna 008 Prentice-Hall, Inc. Introduction Regression analysis is a very valuable tool for a manager

More information

Topic 28: Unequal Replication in Two-Way ANOVA

Topic 28: Unequal Replication in Two-Way ANOVA Topic 28: Unequal Replication in Two-Way ANOVA Outline Two-way ANOVA with unequal numbers of observations in the cells Data and model Regression approach Parameter estimates Previous analyses with constant

More information

Review of the General Linear Model

Review of the General Linear Model Review of the General Linear Model EPSY 905: Multivariate Analysis Online Lecture #2 Learning Objectives Types of distributions: Ø Conditional distributions The General Linear Model Ø Regression Ø Analysis

More information

Module 03 Lecture 14 Inferential Statistics ANOVA and TOI

Module 03 Lecture 14 Inferential Statistics ANOVA and TOI Introduction of Data Analytics Prof. Nandan Sudarsanam and Prof. B Ravindran Department of Management Studies and Department of Computer Science and Engineering Indian Institute of Technology, Madras Module

More information

STAT 705 Chapter 16: One-way ANOVA

STAT 705 Chapter 16: One-way ANOVA STAT 705 Chapter 16: One-way ANOVA Timothy Hanson Department of Statistics, University of South Carolina Stat 705: Data Analysis II 1 / 21 What is ANOVA? Analysis of variance (ANOVA) models are regression

More information

WELCOME! Lecture 13 Thommy Perlinger

WELCOME! Lecture 13 Thommy Perlinger Quantitative Methods II WELCOME! Lecture 13 Thommy Perlinger Parametrical tests (tests for the mean) Nature and number of variables One-way vs. two-way ANOVA One-way ANOVA Y X 1 1 One dependent variable

More information

STAT 135 Lab 11 Tests for Categorical Data (Fisher s Exact test, χ 2 tests for Homogeneity and Independence) and Linear Regression

STAT 135 Lab 11 Tests for Categorical Data (Fisher s Exact test, χ 2 tests for Homogeneity and Independence) and Linear Regression STAT 135 Lab 11 Tests for Categorical Data (Fisher s Exact test, χ 2 tests for Homogeneity and Independence) and Linear Regression Rebecca Barter April 20, 2015 Fisher s Exact Test Fisher s Exact Test

More information

Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model

Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 1: August 22, 2012

More information

Regression Analysis with Categorical Variables

Regression Analysis with Categorical Variables International Journal of Statistics and Systems ISSN 973-2675 Volume, Number 2 (26), pp. 35-43 Research India Publications http://www.ripublication.com Regression Analysis with Categorical Variables M.

More information

FAQ: Linear and Multiple Regression Analysis: Coefficients

FAQ: Linear and Multiple Regression Analysis: Coefficients Question 1: How do I calculate a least squares regression line? Answer 1: Regression analysis is a statistical tool that utilizes the relation between two or more quantitative variables so that one variable

More information

Introduction To Logistic Regression

Introduction To Logistic Regression Introduction To Lecture 22 April 28, 2005 Applied Regression Analysis Lecture #22-4/28/2005 Slide 1 of 28 Today s Lecture Logistic regression. Today s Lecture Lecture #22-4/28/2005 Slide 2 of 28 Background

More information

Chapter 3 Multiple Regression Complete Example

Chapter 3 Multiple Regression Complete Example Department of Quantitative Methods & Information Systems ECON 504 Chapter 3 Multiple Regression Complete Example Spring 2013 Dr. Mohammad Zainal Review Goals After completing this lecture, you should be

More information

BIOL Biometry LAB 6 - SINGLE FACTOR ANOVA and MULTIPLE COMPARISON PROCEDURES

BIOL Biometry LAB 6 - SINGLE FACTOR ANOVA and MULTIPLE COMPARISON PROCEDURES BIOL 458 - Biometry LAB 6 - SINGLE FACTOR ANOVA and MULTIPLE COMPARISON PROCEDURES PART 1: INTRODUCTION TO ANOVA Purpose of ANOVA Analysis of Variance (ANOVA) is an extremely useful statistical method

More information

Comparing Several Means: ANOVA

Comparing Several Means: ANOVA Comparing Several Means: ANOVA Understand the basic principles of ANOVA Why it is done? What it tells us? Theory of one way independent ANOVA Following up an ANOVA: Planned contrasts/comparisons Choosing

More information

Regression Analysis. BUS 735: Business Decision Making and Research

Regression Analysis. BUS 735: Business Decision Making and Research Regression Analysis BUS 735: Business Decision Making and Research 1 Goals and Agenda Goals of this section Specific goals Learn how to detect relationships between ordinal and categorical variables. Learn

More information

" M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2

 M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2 Notation and Equations for Final Exam Symbol Definition X The variable we measure in a scientific study n The size of the sample N The size of the population M The mean of the sample µ The mean of the

More information

Lecture 5: Clustering, Linear Regression

Lecture 5: Clustering, Linear Regression Lecture 5: Clustering, Linear Regression Reading: Chapter 10, Sections 3.1-3.2 STATS 202: Data mining and analysis October 4, 2017 1 / 22 .0.0 5 5 1.0 7 5 X2 X2 7 1.5 1.0 0.5 3 1 2 Hierarchical clustering

More information

Introduction to Matrix Algebra and the Multivariate Normal Distribution

Introduction to Matrix Algebra and the Multivariate Normal Distribution Introduction to Matrix Algebra and the Multivariate Normal Distribution Introduction to Structural Equation Modeling Lecture #2 January 18, 2012 ERSH 8750: Lecture 2 Motivation for Learning the Multivariate

More information

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore What is Multiple Linear Regression Several independent variables may influence the change in response variable we are trying to study. When several independent variables are included in the equation, the

More information

Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model

Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model EPSY 905: Multivariate Analysis Lecture 1 20 January 2016 EPSY 905: Lecture 1 -

More information

STA441: Spring Multiple Regression. This slide show is a free open source document. See the last slide for copyright information.

STA441: Spring Multiple Regression. This slide show is a free open source document. See the last slide for copyright information. STA441: Spring 2018 Multiple Regression This slide show is a free open source document. See the last slide for copyright information. 1 Least Squares Plane 2 Statistical MODEL There are p-1 explanatory

More information

STATISTICS FOR ECONOMISTS: A BEGINNING. John E. Floyd University of Toronto

STATISTICS FOR ECONOMISTS: A BEGINNING. John E. Floyd University of Toronto STATISTICS FOR ECONOMISTS: A BEGINNING John E. Floyd University of Toronto July 2, 2010 PREFACE The pages that follow contain the material presented in my introductory quantitative methods in economics

More information

Simple, Marginal, and Interaction Effects in General Linear Models: Part 1

Simple, Marginal, and Interaction Effects in General Linear Models: Part 1 Simple, Marginal, and Interaction Effects in General Linear Models: Part 1 PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 2: August 24, 2012 PSYC 943: Lecture 2 Today s Class Centering and

More information

psyc3010 lecture 2 factorial between-ps ANOVA I: omnibus tests

psyc3010 lecture 2 factorial between-ps ANOVA I: omnibus tests psyc3010 lecture 2 factorial between-ps ANOVA I: omnibus tests last lecture: introduction to factorial designs next lecture: factorial between-ps ANOVA II: (effect sizes and follow-up tests) 1 general

More information

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages:

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages: Glossary The ISI glossary of statistical terms provides definitions in a number of different languages: http://isi.cbs.nl/glossary/index.htm Adjusted r 2 Adjusted R squared measures the proportion of the

More information

Lecture 6: Linear Regression

Lecture 6: Linear Regression Lecture 6: Linear Regression Reading: Sections 3.1-3 STATS 202: Data mining and analysis Jonathan Taylor, 10/5 Slide credits: Sergio Bacallado 1 / 30 Simple linear regression Model: y i = β 0 + β 1 x i

More information

Lecture 5: Clustering, Linear Regression

Lecture 5: Clustering, Linear Regression Lecture 5: Clustering, Linear Regression Reading: Chapter 10, Sections 3.1-2 STATS 202: Data mining and analysis Sergio Bacallado September 19, 2018 1 / 23 Announcements Starting next week, Julia Fukuyama

More information

One-Way Analysis of Variance. With regression, we related two quantitative, typically continuous variables.

One-Way Analysis of Variance. With regression, we related two quantitative, typically continuous variables. One-Way Analysis of Variance With regression, we related two quantitative, typically continuous variables. Often we wish to relate a quantitative response variable with a qualitative (or simply discrete)

More information

Advanced Regression Topics: Violation of Assumptions

Advanced Regression Topics: Violation of Assumptions Advanced Regression Topics: Violation of Assumptions Lecture 7 February 15, 2005 Applied Regression Analysis Lecture #7-2/15/2005 Slide 1 of 36 Today s Lecture Today s Lecture rapping Up Revisiting residuals.

More information

Multiple Linear Regression

Multiple Linear Regression Multiple Linear Regression ST 430/514 Recall: a regression model describes how a dependent variable (or response) Y is affected, on average, by one or more independent variables (or factors, or covariates).

More information

Lecture 5: Clustering, Linear Regression

Lecture 5: Clustering, Linear Regression Lecture 5: Clustering, Linear Regression Reading: Chapter 10, Sections 3.1-3.2 STATS 202: Data mining and analysis October 4, 2017 1 / 22 Hierarchical clustering Most algorithms for hierarchical clustering

More information

Lecture 9: Linear Regression

Lecture 9: Linear Regression Lecture 9: Linear Regression Goals Develop basic concepts of linear regression from a probabilistic framework Estimating parameters and hypothesis testing with linear models Linear regression in R Regression

More information

Ron Heck, Fall Week 3: Notes Building a Two-Level Model

Ron Heck, Fall Week 3: Notes Building a Two-Level Model Ron Heck, Fall 2011 1 EDEP 768E: Seminar on Multilevel Modeling rev. 9/6/2011@11:27pm Week 3: Notes Building a Two-Level Model We will build a model to explain student math achievement using student-level

More information

Analysis of Variance

Analysis of Variance Statistical Techniques II EXST7015 Analysis of Variance 15a_ANOVA_Introduction 1 Design The simplest model for Analysis of Variance (ANOVA) is the CRD, the Completely Randomized Design This model is also

More information

Chapter 1 Statistical Inference

Chapter 1 Statistical Inference Chapter 1 Statistical Inference causal inference To infer causality, you need a randomized experiment (or a huge observational study and lots of outside information). inference to populations Generalizations

More information

Deciphering Math Notation. Billy Skorupski Associate Professor, School of Education

Deciphering Math Notation. Billy Skorupski Associate Professor, School of Education Deciphering Math Notation Billy Skorupski Associate Professor, School of Education Agenda General overview of data, variables Greek and Roman characters in math and statistics Parameters vs. Statistics

More information

Statistical Distribution Assumptions of General Linear Models

Statistical Distribution Assumptions of General Linear Models Statistical Distribution Assumptions of General Linear Models Applied Multilevel Models for Cross Sectional Data Lecture 4 ICPSR Summer Workshop University of Colorado Boulder Lecture 4: Statistical Distributions

More information

Simple Linear Regression

Simple Linear Regression 9-1 l Chapter 9 l Simple Linear Regression 9.1 Simple Linear Regression 9.2 Scatter Diagram 9.3 Graphical Method for Determining Regression 9.4 Least Square Method 9.5 Correlation Coefficient and Coefficient

More information

Regression Analysis. BUS 735: Business Decision Making and Research. Learn how to detect relationships between ordinal and categorical variables.

Regression Analysis. BUS 735: Business Decision Making and Research. Learn how to detect relationships between ordinal and categorical variables. Regression Analysis BUS 735: Business Decision Making and Research 1 Goals of this section Specific goals Learn how to detect relationships between ordinal and categorical variables. Learn how to estimate

More information

Exam Applied Statistical Regression. Good Luck!

Exam Applied Statistical Regression. Good Luck! Dr. M. Dettling Summer 2011 Exam Applied Statistical Regression Approved: Tables: Note: Any written material, calculator (without communication facility). Attached. All tests have to be done at the 5%-level.

More information

1 Correlation and Inference from Regression

1 Correlation and Inference from Regression 1 Correlation and Inference from Regression Reading: Kennedy (1998) A Guide to Econometrics, Chapters 4 and 6 Maddala, G.S. (1992) Introduction to Econometrics p. 170-177 Moore and McCabe, chapter 12 is

More information

SIMPLE REGRESSION ANALYSIS. Business Statistics

SIMPLE REGRESSION ANALYSIS. Business Statistics SIMPLE REGRESSION ANALYSIS Business Statistics CONTENTS Ordinary least squares (recap for some) Statistical formulation of the regression model Assessing the regression model Testing the regression coefficients

More information

OHSU OGI Class ECE-580-DOE :Design of Experiments Steve Brainerd

OHSU OGI Class ECE-580-DOE :Design of Experiments Steve Brainerd Why We Use Analysis of Variance to Compare Group Means and How it Works The question of how to compare the population means of more than two groups is an important one to researchers. Let us suppose that

More information

Dr. Junchao Xia Center of Biophysics and Computational Biology. Fall /1/2016 1/46

Dr. Junchao Xia Center of Biophysics and Computational Biology. Fall /1/2016 1/46 BIO5312 Biostatistics Lecture 10:Regression and Correlation Methods Dr. Junchao Xia Center of Biophysics and Computational Biology Fall 2016 11/1/2016 1/46 Outline In this lecture, we will discuss topics

More information

Using SPSS for One Way Analysis of Variance

Using SPSS for One Way Analysis of Variance Using SPSS for One Way Analysis of Variance This tutorial will show you how to use SPSS version 12 to perform a one-way, between- subjects analysis of variance and related post-hoc tests. This tutorial

More information

Introduction to the Analysis of Variance (ANOVA)

Introduction to the Analysis of Variance (ANOVA) Introduction to the Analysis of Variance (ANOVA) The Analysis of Variance (ANOVA) The analysis of variance (ANOVA) is a statistical technique for testing for differences between the means of multiple (more

More information

Topic 1. Definitions

Topic 1. Definitions S Topic. Definitions. Scalar A scalar is a number. 2. Vector A vector is a column of numbers. 3. Linear combination A scalar times a vector plus a scalar times a vector, plus a scalar times a vector...

More information

Multivariate Regression (Chapter 10)

Multivariate Regression (Chapter 10) Multivariate Regression (Chapter 10) This week we ll cover multivariate regression and maybe a bit of canonical correlation. Today we ll mostly review univariate multivariate regression. With multivariate

More information

Daniel Boduszek University of Huddersfield

Daniel Boduszek University of Huddersfield Daniel Boduszek University of Huddersfield d.boduszek@hud.ac.uk Introduction to moderator effects Hierarchical Regression analysis with continuous moderator Hierarchical Regression analysis with categorical

More information

Multiple linear regression S6

Multiple linear regression S6 Basic medical statistics for clinical and experimental research Multiple linear regression S6 Katarzyna Jóźwiak k.jozwiak@nki.nl November 15, 2017 1/42 Introduction Two main motivations for doing multiple

More information

Simple, Marginal, and Interaction Effects in General Linear Models

Simple, Marginal, and Interaction Effects in General Linear Models Simple, Marginal, and Interaction Effects in General Linear Models PRE 905: Multivariate Analysis Lecture 3 Today s Class Centering and Coding Predictors Interpreting Parameters in the Model for the Means

More information

Introduction to Regression

Introduction to Regression Regression Introduction to Regression If two variables covary, we should be able to predict the value of one variable from another. Correlation only tells us how much two variables covary. In regression,

More information

Neuendorf MANOVA /MANCOVA. Model: X1 (Factor A) X2 (Factor B) X1 x X2 (Interaction) Y4. Like ANOVA/ANCOVA:

Neuendorf MANOVA /MANCOVA. Model: X1 (Factor A) X2 (Factor B) X1 x X2 (Interaction) Y4. Like ANOVA/ANCOVA: 1 Neuendorf MANOVA /MANCOVA Model: X1 (Factor A) X2 (Factor B) X1 x X2 (Interaction) Y1 Y2 Y3 Y4 Like ANOVA/ANCOVA: 1. Assumes equal variance (equal covariance matrices) across cells (groups defined by

More information

Interactions. Interactions. Lectures 1 & 2. Linear Relationships. y = a + bx. Slope. Intercept

Interactions. Interactions. Lectures 1 & 2. Linear Relationships. y = a + bx. Slope. Intercept Interactions Lectures 1 & Regression Sometimes two variables appear related: > smoking and lung cancers > height and weight > years of education and income > engine size and gas mileage > GMAT scores and

More information

Topic 20: Single Factor Analysis of Variance

Topic 20: Single Factor Analysis of Variance Topic 20: Single Factor Analysis of Variance Outline Single factor Analysis of Variance One set of treatments Cell means model Factor effects model Link to linear regression using indicator explanatory

More information

Chapter 3 ANALYSIS OF RESPONSE PROFILES

Chapter 3 ANALYSIS OF RESPONSE PROFILES Chapter 3 ANALYSIS OF RESPONSE PROFILES 78 31 Introduction In this chapter we present a method for analysing longitudinal data that imposes minimal structure or restrictions on the mean responses over

More information

An Introduction to Mplus and Path Analysis

An Introduction to Mplus and Path Analysis An Introduction to Mplus and Path Analysis PSYC 943: Fundamentals of Multivariate Modeling Lecture 10: October 30, 2013 PSYC 943: Lecture 10 Today s Lecture Path analysis starting with multivariate regression

More information

Vectors and Matrices Statistics with Vectors and Matrices

Vectors and Matrices Statistics with Vectors and Matrices Vectors and Matrices Statistics with Vectors and Matrices Lecture 3 September 7, 005 Analysis Lecture #3-9/7/005 Slide 1 of 55 Today s Lecture Vectors and Matrices (Supplement A - augmented with SAS proc

More information

14 Multiple Linear Regression

14 Multiple Linear Regression B.Sc./Cert./M.Sc. Qualif. - Statistics: Theory and Practice 14 Multiple Linear Regression 14.1 The multiple linear regression model In simple linear regression, the response variable y is expressed in

More information

Statistical Techniques II EXST7015 Simple Linear Regression

Statistical Techniques II EXST7015 Simple Linear Regression Statistical Techniques II EXST7015 Simple Linear Regression 03a_SLR 1 Y - the dependent variable 35 30 25 The objective Given points plotted on two coordinates, Y and X, find the best line to fit the data.

More information

One-way ANOVA. Experimental Design. One-way ANOVA

One-way ANOVA. Experimental Design. One-way ANOVA Method to compare more than two samples simultaneously without inflating Type I Error rate (α) Simplicity Few assumptions Adequate for highly complex hypothesis testing 09/30/12 1 Outline of this class

More information

Module 2. General Linear Model

Module 2. General Linear Model D.G. Bonett (9/018) Module General Linear Model The relation between one response variable (y) and q 1 predictor variables (x 1, x,, x q ) for one randomly selected person can be represented by the following

More information

Linear Modelling in Stata Session 6: Further Topics in Linear Modelling

Linear Modelling in Stata Session 6: Further Topics in Linear Modelling Linear Modelling in Stata Session 6: Further Topics in Linear Modelling Mark Lunt Arthritis Research UK Epidemiology Unit University of Manchester 14/11/2017 This Week Categorical Variables Categorical

More information

ECON 497 Midterm Spring

ECON 497 Midterm Spring ECON 497 Midterm Spring 2009 1 ECON 497: Economic Research and Forecasting Name: Spring 2009 Bellas Midterm You have three hours and twenty minutes to complete this exam. Answer all questions and explain

More information

Statistical methods for comparing multiple groups. Lecture 7: ANOVA. ANOVA: Definition. ANOVA: Concepts

Statistical methods for comparing multiple groups. Lecture 7: ANOVA. ANOVA: Definition. ANOVA: Concepts Statistical methods for comparing multiple groups Lecture 7: ANOVA Sandy Eckel seckel@jhsph.edu 30 April 2008 Continuous data: comparing multiple means Analysis of variance Binary data: comparing multiple

More information

Statistical Foundations:

Statistical Foundations: Statistical Foundations: t distributions, t-tests tests Psychology 790 Lecture #12 10/03/2006 Today sclass The t-distribution t ib ti in its full glory. Why we use it for nearly everything. Confidence

More information

ANCOVA. Lecture 9 Andrew Ainsworth

ANCOVA. Lecture 9 Andrew Ainsworth ANCOVA Lecture 9 Andrew Ainsworth What is ANCOVA? Analysis of covariance an extension of ANOVA in which main effects and interactions are assessed on DV scores after the DV has been adjusted for by the

More information

Analysis of Variance (ANOVA)

Analysis of Variance (ANOVA) Analysis of Variance (ANOVA) Two types of ANOVA tests: Independent measures and Repeated measures Comparing 2 means: X 1 = 20 t - test X 2 = 30 How can we Compare 3 means?: X 1 = 20 X 2 = 30 X 3 = 35 ANOVA

More information

WISE International Masters

WISE International Masters WISE International Masters ECONOMETRICS Instructor: Brett Graham INSTRUCTIONS TO STUDENTS 1 The time allowed for this examination paper is 2 hours. 2 This examination paper contains 32 questions. You are

More information

Final Exam - Solutions

Final Exam - Solutions Ecn 102 - Analysis of Economic Data University of California - Davis March 19, 2010 Instructor: John Parman Final Exam - Solutions You have until 5:30pm to complete this exam. Please remember to put your

More information

Simple Linear Regression for the Climate Data

Simple Linear Regression for the Climate Data Prediction Prediction Interval Temperature 0.2 0.0 0.2 0.4 0.6 0.8 320 340 360 380 CO 2 Simple Linear Regression for the Climate Data What do we do with the data? y i = Temperature of i th Year x i =CO

More information