Analysis of variance and regression. November 22, 2007

Size: px
Start display at page:

Download "Analysis of variance and regression. November 22, 2007"

Transcription

1 Analysis of variance and regression November 22, 2007

2 Parametrisations: Choice of parameters Comparison of models Test for linearity Linear splines

3 Lene Theil Skovgaard, Dept. of Biostatistics, Institute of Public Health, University of Copenhagen

4 Parametrisations, November Parameter: unknown quantity that we want to estimate (provide a good guess) the decrease in blood pressure following treatment A or the difference in decrease for treatment A and placebo the increase in insulin growth factor (IGF-1) with age Parametrisation: choice of which parameters are to enter the model description Re-parametrisation: shift to a new set of parameters

5 Parametrisations, November Most well known choice of parametrisation: Change of scale/units Do we measure height in cm or m? Take the relation of lung capacity versus height: fev1= α + β height If we change from measuring height in cm to m, we also change the regression coefficient (the parameter) from β to β = 100β Change of origin/intercept choice of another reference group in ANOVA subtracting e.g. 170cm from all height measurements Re-parametrisations do not change the model as such! same fitted values same confidence- and prediction limits but a possibility for interpretations of specific interest

6 Parametrisations, November What makes us choose a specific parametrisation? Ease - the program has some default parametrisations Estimation of specific quantities: - the potency of a drug, ED 50 or ED 90 Test of specific hypotheses - difference between treatment and placebo - difference in height for boys and girls at the age of 14

7 Parametrisations, November In the more advanced situations (beyond linearity) non-linear regression, logistic regression, correlated observations: Knowledge of distributional assumptions: - Some parameter estimates may be more normally distributed than others (and we like to be able to construct symmetric confidence intervals, using the standard error) In linear models the estimates have exact normal distributions (provided the model assumptions are met, of course...)

8 Parametrisations, November Example: A group consisting of 45 patients with Reumatoid Arthritis are randomised to one out of 6 possible treatments (treat): Placebo Aspirin One of 4 doses (dose) of an active anti-inflammatory drug which we shall denote X. Outcome: An index (Index) summing up the effectiveness of the treatment (decrease in various symptoms)

9 Parametrisations, November Outcome: Index-values: Reference: Woolson, R.F. & Clarke, W.R.: Statistical methods for the analysis of biomedical data. 2ed., Wiley, (Exercise 10.4 page 409)

10 Parametrisations, November How do we represent these data in SAS? Obs group type dose index 1 placebo placebo placebo placebo placebo placebo placebo placebo placebo placebo placebo placebo placebo placebo x20 active x20 active x20 active x25 active x25 active x25 active x25 active x25 active

11 Parametrisations, November Summary statistics The MEANS Procedure Analysis Variable : index N group Obs N Mean Std Dev Minimum Maximum aspirin placebo x x x x

12 Parametrisations, November We start by looking at the 4 X-groups only: Below, the outcome Index is plotted against Dose group.

13 Parametrisations, November Comparison of 4 dose groups: One-way ANOVA Model written as a multiple regression: Y = β 0 + β 1 x 1 + β 2 x 2 + β 3 x 3 + ǫ where the x s are socalled dummy variables: x 1 is 1 if subject i belongs to the first group, and 0 otherwise x 2 is 1 if subject i belongs to the second group, and 0 otherwise x 3 is 1 if subject i belongs to the third group, and 0 otherwise With this parametrisation, β 0 will correspond to the level for the last group (the reference group, here group 4); β 1 will be the difference in level between group 1 and group 4 β 2 will be the difference in level between group 2 and group 4 and so on...

14 Parametrisations, November Traditional One-way ANOVA in SAS: proc glm data=drug; where type= active ; class group; model index=group / solution; run; which yields the output: The GLM Procedure Class Level Information Class Levels Values group 4 x10 x15 x20 x25 Number of Observations Used 25

15 Parametrisations, November The GLM Procedure Dependent Variable: index Sum of Source DF Squares Mean Square F Value Pr > F Model <.0001 Error Corrected Total R-Square Coeff Var Root MSE index Mean Source DF Type I SS Mean Square F Value Pr > F group <.0001 Source DF Type III SS Mean Square F Value Pr > F group <.0001

16 Parametrisations, November Standard Parameter Estimate Error t Value Pr > t Intercept B <.0001 group x B <.0001 group x B <.0001 group x B <.0001 group x B... NOTE: The X X matrix has been found to be singular, and a generalized inverse was used to solve the normal equations. Terms whose estimates are followed by the letter B are not uniquely estimable. The B to the right of the estimates is explained in the NOTE It simply means: By renaming the group levels/names, we may get a different parametrisation!

17 Parametrisations, November We here disregard the problem of variance heterogeneity: proc glm data=drug; where type= active ; class group; model index=group / noint solution; means group / hovtest=levene; run; from which we get Levene s Test for Homogeneity of index Variance ANOVA of Squared Deviations from Group Means Sum of Mean Source DF Squares Square F Value Pr > F group Error A clear indication that the variance increases with dose. Logarithms?

18 Parametrisations, November Same model, now parametrised with one level for each group: proc glm data=drug; where type= active ; class group; model index=group / noint solution; run; now yielding instead: Dependent Variable: index Sum of Source DF Squares Mean Square F Value Pr > F Model <.0001 Error Uncorrected Total

19 Parametrisations, November R-Square Coeff Var Root MSE index Mean Source DF Type I SS Mean Square F Value Pr > F group <.0001 Source DF Type III SS Mean Square F Value Pr > F group <.0001 Standard Parameter Estimate Error t Value Pr > t group x <.0001 group x <.0001 group x <.0001 group x <.0001 The tests now refer to the hypothesis of a zero level (which is not interesting)

20 Parametrisations, November Parametrisations in One-way ANOVA One level (µ 4 ) for the reference group (the last, numerically or alphabetically), supplemented with differences from this reference group to each of the remaining groups (β 1, β 2, β 3 ) Y gi = µ 4 + β g + ε gi, good for testing of identity and certain pairwise comparisons β i = µ i µ 4 One level for each group Y gi = µ g + ε gi good for estimation, not suited for testing!!

21 Parametrisations, November Estimate statements in GLM If we want to compare dose 10 with dose 15: proc glm data=drug; where type= active ; class group; model index=group / noint solution; estimate dose 15 vs. dose 10 group ; run; from which we get Standard Parameter Estimate Error t Value Pr > t dose 15 vs. dose <.0001

22 Parametrisations, November We return to the scatter plot, now with a linear regression line Can we use a simple model, saying that the dose effect is linear?

23 Parametrisations, November We look at the simple linear regression model: Y gi = α + βx gi + ε gi, ε gi N(0, σ 2 ) where Y gi denotes Index, X gi denotes Dose, α is the intercept (with the vertical axis), β is the slope of the line and ε gi is the distance from observation to line, measured vertically (the residual), for individual i in drug group g Program statements: proc reg data=drug; where type= active ; model index=dose / clb; run;

24 Parametrisations, November The output becomes: The REG Procedure Dependent Variable: index Number of Observations Used 25 Analysis of Variance Sum of Mean Source DF Squares Square F Value Pr > F Model <.0001 Error Corrected Total Root MSE R-Square Dependent Mean Adj R-Sq Coeff Var

25 Parametrisations, November Parameter Estimates Parameter Standard Variable DF Estimate Error t Value Pr > t Intercept <.0001 dose <.0001 Parameter Estimates Variable DF 95% Confidence Limits Intercept dose Is this a reasonable description? Can we test the linearity? Is this model almost as good as a quadratic model the ANOVA model

26 Parametrisations, November Quadratic fit: Parameter Estimates Parameter Standard Variable DF Estimate Error t Value Pr > t Intercept dose <.0001 dose

27 Parametrisations, November Model reduction - F test for comparison of explained variation We have to compare two models: Model 1 The ANOVA model with 4 separate groups Model 2 The linear regression model with actual dose as covariate Note: The models have to be nested, i.e. one is derived from the other, typically be demanding some parameters to be equal to each other or equal to zero (e.g. remove effects ). Indeed, the linear regression model is a special case of the ANOVA model, in which all contrast (µ 2 µ 1, µ 3 µ 2, µ 4 µ 3 ) are equal.

28 Parametrisations, November Sums of squares Model Sum Sq(Model): i (ŷ i ȳ) 2 Explained variation: How much do the predicted values vary? (the bigger the better) Residuals Sum Sq(Residual): i (y i ŷ i ) 2 Unexplained variation (residual variation): How large are the deviations from the model? (the smaller the better)

29 Parametrisations, November Look at changes in the explained variation Sum Sq(Model). How much less is explained by the simpler model? Sum Sq = Sum Sq(Model 1 ) Sum Sq(Model 2 ) > 0, always (since more parameters can always explain more of the variation).

30 Parametrisations, November Mean Sq = Sum Sq/ df F = Mean Sq Mean Sq(Residual) How large can this get, just by chance/coincidence? F F(df 2 df 1, df 1 ) where df 1 and df 2 denote the degrees of freedom for the Mean Sq(Residual) for the two models.

31 Parametrisations, November Direct calculation of F-test statistic: Sum Sq(Model) for ANOVA model: , with 3 df Mean Sq(Residual) for ANOVA model: , with 21 df (df 1 = 21) Sum Sq(Model) for linear model: , with 1 df > (( )/2)/ [1] F(2, 21) corresponds to a P-value of 0.06 If we have many groups, the test is not very powerful

32 Parametrisations, November We may trick SAS into performing our comparison directly by including dose simultaneously as a linear effect and as a class effect: proc glm data=drug; where type= active ; class group; model index=group dose / solution; run; In this way, the term group represents the variation of dose group means around a straight line

33 Parametrisations, November The output becomes: Dependent Variable: index Sum of Source DF Squares Mean Square F Value Pr > F Model <.0001 Error Corrected Total Source DF Type I SS Mean Square F Value Pr > F dose <.0001 group

34 Parametrisations, November Source DF Type III SS Mean Square F Value Pr > F dose group Standard Parameter Estimate Error t Value Pr > t Intercept B dose B <.0001 group x B group x B group x B... group x B... Note: The effect group now has only 2 degrees of freedom, since we are not testing equality, but linearity!

35 Parametrisations, November Contrast statement in GLM for test of linearity (a bit tricky) proc glm data=drug; where type= active ; class group; model index=group / solution; contrast dev linearity group , group ; run; gives an extra line of output: Contrast DF Contrast SS Mean Square F Value Pr > F dev linearity

36 Parametrisations, November Explanation for this contrast: The linear regression model is a special case of the ANOVA model, in which all contrast (µ 2 µ 1, µ 3 µ 2, µ 4 µ 3 ) are equal. µ 2 µ 1 = µ 3 µ 2 (µ 3 µ 2 ) (µ 2 µ 1 ) = µ 1 2µ 2 + µ 3 = 0 µ 3 µ 2 = µ 4 µ 3 (µ 4 µ 3 ) (µ 3 µ 2 ) = µ 2 2µ 3 + µ 4 = 0

37 Parametrisations, November Example with comparison of regression lines Analysis of covariance, ANCOVA (just a name used for a model with one Class variable and one quantitative variable.) Relate weight to age and gender, in the well known Juul-data, tanner=1 The GLM Procedure Dependent Variable: weight Sum of Source DF Squares Mean Square F Value Pr > F Model <.0001 Error Corrected Total

38 Parametrisations, November R-Square Coeff Var Root MSE weight Mean Source DF Type I SS Mean Square F Value Pr > F sex <.0001 age <.0001 age*sex Source DF Type III SS Mean Square F Value Pr > F sex age <.0001 age*sex

39 Parametrisations, November Standard Parameter Estimate Error t Value Pr > t Intercept B sex female B sex male B... age B <.0001 age*sex female B age*sex male B... NOTE: The X X matrix has been found to be singular, and a generalized inverse was used to solve the normal equations. Terms whose estimates are followed by the letter B are not uniquely estimable. Interpretation: Boys increase their weight more rapidly than girls: approx. 692g more per year, CI=(216, 1167)

40 Parametrisations, November Weight related to age: Again: Problems with variance homogeneity...

41 Parametrisations, November Estimate the weight difference at the age of 14: proc glm data=juul; where tanner=1; class sex; model weight=sex age age*sex / solution clparm; estimate difference at 14 sex -1 1 age*sex ; run; with the output: Standard Parameter Estimate Error t Value Pr > t difference at Parameter 95% Confidence Limits difference at

42 Parametrisations, November Simultaneous test of both sex-effects proc glm data=juul; where tanner=1; class sex; model weight=sex age age*sex / solution clparm; contrast sex and sex*age sex -1 1, age*sex -1 1; run; which yields: Contrast DF Contrast SS Mean Square F Value Pr > F sex and sex*age

43 Parametrisations, November SAS-computation of both lines simultaneously Subgroup analysis Keep the interaction term sex*age Leave out the marginal effect age this will merge the marginal effect into the interaction term Leave out the intercept (use option noint in textsfmodel) this will merge the intercept into the sex effect proc glm data=juul; where tanner=1; class sex; model weight=sex age*sex / noint solution clparm; run;

44 Parametrisations, November Output: The GLM Procedure Dependent Variable: weight Sum of Source DF Squares Mean Square F Value Pr > F Model <.0001 Error Uncorrected Total R-Square Coeff Var Root MSE weight Mean Source DF Type I SS Mean Square F Value Pr > F sex <.0001 age*sex <.0001

45 Parametrisations, November Source DF Type III SS Mean Square F Value Pr > F sex <.0001 age*sex <.0001 Standard Parameter Estimate Error t Value Pr > t sex female <.0001 sex male age*sex female <.0001 age*sex male <.0001 Parameter 95% Confidence Limits sex female sex male age*sex female age*sex male

46 Parametrisations, November Same model, 2 different parametrisations: proc glm data=juul; where tanner=1; class sex; model weight=sex age age*sex / solution; run; An intercept for the reference group (sex= male ) A difference from sex= female to sex= male An effect of age (slope) for the reference group A difference in slopes from sex= female to sex= male proc glm data=juul; where tanner=1; class sex; model weight=sex age*sex / noint solution; run; An intercept for each group (sex) An effect of age (slope) for each group (sex)

47 Parametrisations, November Now we include the Placebo group as Dose=0. Does the Placebo treatment fit in here? No, obviously not. Either we have a placebo effect or a threshold effect. But how could we make a formal test?

48 Parametrisations, November data drug; set drug; active_drug=(dose>0); run; proc glm data=drug; where group ne aspirin ; class active_drug; model index= dose active_drug / solution; run; Class Level Information Class Levels Values active_drug Number of Observations Used 34

49 Parametrisations, November Dependent Variable: index Sum of Source DF Squares Mean Square F Value Pr > F Model <.0001 Error Corrected Total R-Square Coeff Var Root MSE index Mean Source DF Type I SS Mean Square F Value Pr > F dose <.0001 active_drug <.0001 Source DF Type III SS Mean Square F Value Pr > F dose <.0001 active_drug <.0001

50 Parametrisations, November Standard Parameter Estimate Error t Value Pr > t Intercept B <.0001 dose <.0001 active_drug B <.0001 active_drug B... NOTE: The X X matrix has been found to be singular, and a generalized inverse was used to solve the normal equations. Terms whose estimates are followed by the letter B are not uniquely estimable. Interpretation: The placebo group (active_drug=0) lies an estimated above the expected 0-dose level.

51 Parametrisations, November Example: Serum IGF-1 for boys We have seen different age dependencies in separate Tanner stages. Is this simply due to a nonlinear age effect?

52 Parametrisations, November How can we model non-linear effects? Polynomials tend to be very wiggly (use i=rq or i=rc in the symbol-statement) Splines piecewise interpolations, often linear or cubic

53 Parametrisations, November Separate lines for each age group: Subdivide age into groups, using appropriate thresholds Fit linear effect of age in each age group The result is a 5 unconnected lines:

54 Parametrisations, November Linear splines: Subdivide age into groups, using appropriate thresholds Fit linear effect of age in each age group Make the linear pieces meet at the thresholds The result is a bended line:

55 Parametrisations, November data juul; set juul; /* define new intercept value */ age5=age-5; /* define number of years above certain threshold ages */ extra_age1=max(age-10,0); extra_age2=max(age-12,0); extra_age3=max(age-13,0); extra_age4=max(age-15,0); run; proc sort data=juul; by sex; run; proc reg data=juul; where age ge 5 and age le 20; by sex; model ssigf1=age5 extra_age10 extra_age12 extra_age13 extra_age15; run;

56 Parametrisations, November Definition af extra_age: age_ extra_ extra_ extra_ extra_ Obs age group age10 age12 age13 age

57 Parametrisations, November sex=male The REG Procedure Dependent Variable: ssigf1 Number of Observations Read 506 Number of Observations Used 362 Number of Observations with Missing Values 144 Analysis of Variance Sum of Mean Source DF Squares Square F Value Pr > F Model <.0001 Error Corrected Total Parameter Estimates Parameter Standard Variable DF Estimate Error t Value Pr > t Intercept <.0001 age extra_age extra_age extra_age extra_age <.0001

58 Parametrisations, November Can we reduce the model to a simpler one? To simple linearity... not reasonable Quadratic age effect... Why? proc glm data=juul; where age ge 5 and age le 20; by sex; model ssigf1=age5 extra_age10 extra_age12 extra_age13 extra_age15 / solution; contrast all extra_age10 1, extra_age12 1, extra_age13 1, extra_age15 1; run; Contrast DF Contrast SS Mean Square F Value Pr > F all <.0001

59 Parametrisations, November Test of adequacy of linear spline? We have to test it against a more complicated model, e.g. separate regressions for each age group Inclusion of tanner as a Class-variable...

60 Parametrisations, November Dose-response curves Example of a typical dose-response relation, for moderate doses We have almost linearity in this dose range:

61 Parametrisations, November For extreme doses we see a clear deviation from linearity and: smaller variation in the ends

62 Parametrisations, November Y axis: Logit transformed response, i.e. log(response/(100 response)) X axis: Logarithmic transformed dose proc gplot; plot logit*dose / haxis=axis1 vaxis=axis2 frame; axis1 logbase=2 logstyle=expand value=(h=2) minor=none label=(h=3 Dose on log scale ); axis2 value=(h=2) minor=none label=(a=90 R=0 H=3 Logit transformed response ); symbol v=circle i=sm60 h=3 c=black l=1 w=2; run; /* i=sm60 indicates a 60% smoothing */ We get a reasonable linearity on this scale

63 Parametrisations, November Theoretical dose response relation: This looks like a cumulative normal distribution a sigmoid shape a logistic curve

64 Parametrisations, November group=halothane Example from anaesthesia: 47 patients to be operated with two different anesthetics Halothane Neurolept Y: Twitch response at the ulnar nerve (at the thumb), in % X: Dose of muscle relaxantia patient dose depression group=neurolept patient dose depression

65 Parametrisations, November

66 Parametrisations, November Neurolept

67 Parametrisations, November Transformation to linearity in the usual way: ( ) twitch y : logit_twitch = log 100 twitch x : logdose = log(dose) Linear relation: logit_twitch = α + β logdose produces estimates of α and β, with corresponding standard errors (s.e.)

68 Parametrisations, November group=neurolept The REG Procedure Dependent Variable: logity Number of Observations Used 32 Analysis of Variance Sum of Mean Source DF Squares Square F Value Pr > F Model <.0001 Error Corrected Total Root MSE R-Square Dependent Mean Adj R-Sq Coeff Var

69 Parametrisations, November Parameter Estimates Parameter Standard Variable DF Estimate Error t Value Pr > t Intercept <.0001 logdose <.0001 Parameter Estimates Variable DF 95% Confidence Limits Intercept logdose

70 Parametrisations, November We get estimates of α and β from the equation logit_twitch = α + β logdose But: What about the parameters of interest, i.e. ED 50 and ED 90? ( ED ˆ 50 = exp ˆαˆβ ) How do we calculate s.e.( ˆ ED 50 )? Reparametrisation: γ 1 = log(ed 50 ) γ 2 = log(ed 90 )

71 Parametrisations, November The model may then be written as: y= logit_twitch = logit(0.9) x γ 1 γ 2 γ 1 = x γ 1 γ 2 γ 1 This function is nonlinear in γ 1 and γ 2! Direct estimation of γ 1 and γ 2 using non-linear regression... more about this next week

Parametrisations, splines

Parametrisations, splines / 7 Parametrisations, splines Analysis of variance and regression course http://staff.pubhealth.ku.dk/~lts/regression_2 Marc Andersen, mja@statgroup.dk Analysis of variance and regression for health researchers,

More information

Answer to exercise 'height vs. age' (Juul)

Answer to exercise 'height vs. age' (Juul) Answer to exercise 'height vs. age' (Juul) Question 1 Fitting a straight line to height for males in the age range 5-20 and making the corresponding illustration is performed by writing: proc reg data=juul;

More information

Nonlinear regression. Nonlinear regression analysis. Polynomial regression. How can we model non-linear effects?

Nonlinear regression. Nonlinear regression analysis. Polynomial regression. How can we model non-linear effects? Nonlinear regression Nonlinear regression analysis Peter Dalgaard (orig. Lene Theil Skovgaard) Department of Biostatistics University of Copenhagen Simple kinetic model Compartment models Michaelis Menten

More information

Analysis of variance and regression. November 29, 2007

Analysis of variance and regression. November 29, 2007 Analysis of variance and regression November 29, 2007 Nonlinear regression: Simple kinetic model Compartment models Michaelis Menten reaction Dose-response relationships Lene Theil Skovgaard, Dept. of

More information

6. Multiple regression - PROC GLM

6. Multiple regression - PROC GLM Use of SAS - November 2016 6. Multiple regression - PROC GLM Karl Bang Christensen Department of Biostatistics, University of Copenhagen. http://biostat.ku.dk/~kach/sas2016/ kach@biostat.ku.dk, tel: 35327491

More information

Analysis of variance and regression. April 17, Contents Comparison of several groups One-way ANOVA. Two-way ANOVA Interaction Model checking

Analysis of variance and regression. April 17, Contents Comparison of several groups One-way ANOVA. Two-way ANOVA Interaction Model checking Analysis of variance and regression Contents Comparison of several groups One-way ANOVA April 7, 008 Two-way ANOVA Interaction Model checking ANOVA, April 008 Comparison of or more groups Julie Lyng Forman,

More information

Analysis of variance. April 16, Contents Comparison of several groups

Analysis of variance. April 16, Contents Comparison of several groups Contents Comparison of several groups Analysis of variance April 16, 2009 One-way ANOVA Two-way ANOVA Interaction Model checking Acknowledgement for use of presentation Julie Lyng Forman, Dept. of Biostatistics

More information

Analysis of variance. April 16, 2009

Analysis of variance. April 16, 2009 Analysis of variance April 16, 2009 Contents Comparison of several groups One-way ANOVA Two-way ANOVA Interaction Model checking Acknowledgement for use of presentation Julie Lyng Forman, Dept. of Biostatistics

More information

Outline. Analysis of Variance. Acknowledgements. Comparison of 2 or more groups. Comparison of serveral groups

Outline. Analysis of Variance. Acknowledgements. Comparison of 2 or more groups. Comparison of serveral groups Outline Analysis of Variance Analysis of variance and regression course http://staff.pubhealth.ku.dk/~lts/regression10_2/index.html Comparison of serveral groups Model checking Marc Andersen, mja@statgroup.dk

More information

Outline. Analysis of Variance. Comparison of 2 or more groups. Acknowledgements. Comparison of serveral groups

Outline. Analysis of Variance. Comparison of 2 or more groups. Acknowledgements. Comparison of serveral groups Outline Analysis of Variance Analysis of variance and regression course http://staff.pubhealth.ku.dk/~jufo/varianceregressionf2011.html Comparison of serveral groups Model checking Marc Andersen, mja@statgroup.dk

More information

Analysis of Variance

Analysis of Variance 1 / 70 Analysis of Variance Analysis of variance and regression course http://staff.pubhealth.ku.dk/~lts/regression11_2 Marc Andersen, mja@statgroup.dk Analysis of variance and regression for health researchers,

More information

General Linear Model (Chapter 4)

General Linear Model (Chapter 4) General Linear Model (Chapter 4) Outcome variable is considered continuous Simple linear regression Scatterplots OLS is BLUE under basic assumptions MSE estimates residual variance testing regression coefficients

More information

Linear models Analysis of Covariance

Linear models Analysis of Covariance Esben Budtz-Jørgensen November 20, 2007 Linear models Analysis of Covariance Confounding Interactions Parameterizations Analysis of Covariance group comparisons can become biased if an important predictor

More information

Linear models Analysis of Covariance

Linear models Analysis of Covariance Esben Budtz-Jørgensen April 22, 2008 Linear models Analysis of Covariance Confounding Interactions Parameterizations Analysis of Covariance group comparisons can become biased if an important predictor

More information

The General Linear Model. November 20, 2007

The General Linear Model. November 20, 2007 The General Linear Model. November 20, 2007 Multiple regression Data: The Faroese Mercury Study Simple linear regression Confounding The multiple linear regression model Interpretation of parameters Model

More information

The General Linear Model. April 22, 2008

The General Linear Model. April 22, 2008 The General Linear Model. April 22, 2008 Multiple regression Data: The Faroese Mercury Study Simple linear regression Confounding The multiple linear regression model Interpretation of parameters Model

More information

Statistics for exp. medical researchers Regression and Correlation

Statistics for exp. medical researchers Regression and Correlation Faculty of Health Sciences Regression analysis Statistics for exp. medical researchers Regression and Correlation Lene Theil Skovgaard Sept. 28, 2015 Linear regression, Estimation and Testing Confidence

More information

Correlated data. Introduction. We expect students to... Aim of the course. Faculty of Health Sciences. NFA, May 19, 2014.

Correlated data. Introduction. We expect students to... Aim of the course. Faculty of Health Sciences. NFA, May 19, 2014. Faculty of Health Sciences Introduction Correlated data NFA, May 19, 2014 Introduction Julie Lyng Forman & Lene Theil Skovgaard Department of Biostatistics University of Copenhagen The idea of the course

More information

Chapter 1 Linear Regression with One Predictor

Chapter 1 Linear Regression with One Predictor STAT 525 FALL 2018 Chapter 1 Linear Regression with One Predictor Professor Min Zhang Goals of Regression Analysis Serve three purposes Describes an association between X and Y In some applications, the

More information

ANALYSES OF NCGS DATA FOR ALCOHOL STATUS CATEGORIES 1 22:46 Sunday, March 2, 2003

ANALYSES OF NCGS DATA FOR ALCOHOL STATUS CATEGORIES 1 22:46 Sunday, March 2, 2003 ANALYSES OF NCGS DATA FOR ALCOHOL STATUS CATEGORIES 1 22:46 Sunday, March 2, 2003 The MEANS Procedure DRINKING STATUS=1 Analysis Variable : TRIGL N Mean Std Dev Minimum Maximum 164 151.6219512 95.3801744

More information

Overview Scatter Plot Example

Overview Scatter Plot Example Overview Topic 22 - Linear Regression and Correlation STAT 5 Professor Bruce Craig Consider one population but two variables For each sampling unit observe X and Y Assume linear relationship between variables

More information

unadjusted model for baseline cholesterol 22:31 Monday, April 19,

unadjusted model for baseline cholesterol 22:31 Monday, April 19, unadjusted model for baseline cholesterol 22:31 Monday, April 19, 2004 1 Class Level Information Class Levels Values TRETGRP 3 3 4 5 SEX 2 0 1 Number of observations 916 unadjusted model for baseline cholesterol

More information

Answer to exercise: Blood pressure lowering drugs

Answer to exercise: Blood pressure lowering drugs Answer to exercise: Blood pressure lowering drugs The data set bloodpressure.txt contains data from a cross-over trial, involving three different formulations of a drug for lowering of blood pressure:

More information

SAS Procedures Inference about the Line ffl model statement in proc reg has many options ffl To construct confidence intervals use alpha=, clm, cli, c

SAS Procedures Inference about the Line ffl model statement in proc reg has many options ffl To construct confidence intervals use alpha=, clm, cli, c Inference About the Slope ffl As with all estimates, ^fi1 subject to sampling var ffl Because Y jx _ Normal, the estimate ^fi1 _ Normal A linear combination of indep Normals is Normal Simple Linear Regression

More information

1. (Problem 3.4 in OLRT)

1. (Problem 3.4 in OLRT) STAT:5201 Homework 5 Solutions 1. (Problem 3.4 in OLRT) The relationship of the untransformed data is shown below. There does appear to be a decrease in adenine with increased caffeine intake. This is

More information

T-test: means of Spock's judge versus all other judges 1 12:10 Wednesday, January 5, judge1 N Mean Std Dev Std Err Minimum Maximum

T-test: means of Spock's judge versus all other judges 1 12:10 Wednesday, January 5, judge1 N Mean Std Dev Std Err Minimum Maximum T-test: means of Spock's judge versus all other judges 1 The TTEST Procedure Variable: pcwomen judge1 N Mean Std Dev Std Err Minimum Maximum OTHER 37 29.4919 7.4308 1.2216 16.5000 48.9000 SPOCKS 9 14.6222

More information

Lecture 11: Simple Linear Regression

Lecture 11: Simple Linear Regression Lecture 11: Simple Linear Regression Readings: Sections 3.1-3.3, 11.1-11.3 Apr 17, 2009 In linear regression, we examine the association between two quantitative variables. Number of beers that you drink

More information

Topic 28: Unequal Replication in Two-Way ANOVA

Topic 28: Unequal Replication in Two-Way ANOVA Topic 28: Unequal Replication in Two-Way ANOVA Outline Two-way ANOVA with unequal numbers of observations in the cells Data and model Regression approach Parameter estimates Previous analyses with constant

More information

Analysis of variance and regression. May 13, 2008

Analysis of variance and regression. May 13, 2008 Analysis of variance and regression May 13, 2008 Repeated measurements over time Presentation of data Traditional ways of analysis Variance component model (the dogs revisited) Random regression Baseline

More information

171:162 Design and Analysis of Biomedical Studies, Summer 2011 Exam #3, July 16th

171:162 Design and Analysis of Biomedical Studies, Summer 2011 Exam #3, July 16th Name 171:162 Design and Analysis of Biomedical Studies, Summer 2011 Exam #3, July 16th Use the selected SAS output to help you answer the questions. The SAS output is all at the back of the exam on pages

More information

a. The least squares estimators of intercept and slope are (from JMP output): b 0 = 6.25 b 1 =

a. The least squares estimators of intercept and slope are (from JMP output): b 0 = 6.25 b 1 = Stat 28 Fall 2004 Key to Homework Exercise.10 a. There is evidence of a linear trend: winning times appear to decrease with year. A straight-line model for predicting winning times based on year is: Winning

More information

Chapter 8 Quantitative and Qualitative Predictors

Chapter 8 Quantitative and Qualitative Predictors STAT 525 FALL 2017 Chapter 8 Quantitative and Qualitative Predictors Professor Dabao Zhang Polynomial Regression Multiple regression using X 2 i, X3 i, etc as additional predictors Generates quadratic,

More information

Correlated data. Repeated measurements over time. Typical set-up for repeated measurements. Traditional presentation of data

Correlated data. Repeated measurements over time. Typical set-up for repeated measurements. Traditional presentation of data Faculty of Health Sciences Repeated measurements over time Correlated data NFA, May 22, 2014 Longitudinal measurements Julie Lyng Forman & Lene Theil Skovgaard Department of Biostatistics University of

More information

Chapter 12: Multiple Regression

Chapter 12: Multiple Regression Chapter 12: Multiple Regression 12.1 a. A scatterplot of the data is given here: Plot of Drug Potency versus Dose Level Potency 0 5 10 15 20 25 30 0 5 10 15 20 25 30 35 Dose Level b. ŷ = 8.667 + 0.575x

More information

Swabs, revisited. The families were subdivided into 3 groups according to the factor crowding, which describes the space available for the household.

Swabs, revisited. The families were subdivided into 3 groups according to the factor crowding, which describes the space available for the household. Swabs, revisited 18 families with 3 children each (in well defined age intervals) were followed over a certain period of time, during which repeated swabs were taken. The variable swabs indicates how many

More information

Chapter 2 Inferences in Simple Linear Regression

Chapter 2 Inferences in Simple Linear Regression STAT 525 SPRING 2018 Chapter 2 Inferences in Simple Linear Regression Professor Min Zhang Testing for Linear Relationship Term β 1 X i defines linear relationship Will then test H 0 : β 1 = 0 Test requires

More information

SAS Commands. General Plan. Output. Construct scatterplot / interaction plot. Run full model

SAS Commands. General Plan. Output. Construct scatterplot / interaction plot. Run full model Topic 23 - Unequal Replication Data Model Outline - Fall 2013 Parameter Estimates Inference Topic 23 2 Example Page 954 Data for Two Factor ANOVA Y is the response variable Factor A has levels i = 1, 2,...,

More information

Statistical Modelling in Stata 5: Linear Models

Statistical Modelling in Stata 5: Linear Models Statistical Modelling in Stata 5: Linear Models Mark Lunt Arthritis Research UK Epidemiology Unit University of Manchester 07/11/2017 Structure This Week What is a linear model? How good is my model? Does

More information

Topic 20: Single Factor Analysis of Variance

Topic 20: Single Factor Analysis of Variance Topic 20: Single Factor Analysis of Variance Outline Single factor Analysis of Variance One set of treatments Cell means model Factor effects model Link to linear regression using indicator explanatory

More information

Data Set 8: Laysan Finch Beak Widths

Data Set 8: Laysan Finch Beak Widths Data Set 8: Finch Beak Widths Statistical Setting This handout describes an analysis of covariance (ANCOVA) involving one categorical independent variable (with only two levels) and one quantitative covariate.

More information

Linear Combinations of Group Means

Linear Combinations of Group Means Linear Combinations of Group Means Look at the handicap example on p. 150 of the text. proc means data=mth567.disability; class handicap; var score; proc sort data=mth567.disability; by handicap; proc

More information

Topic 17 - Single Factor Analysis of Variance. Outline. One-way ANOVA. The Data / Notation. One way ANOVA Cell means model Factor effects model

Topic 17 - Single Factor Analysis of Variance. Outline. One-way ANOVA. The Data / Notation. One way ANOVA Cell means model Factor effects model Topic 17 - Single Factor Analysis of Variance - Fall 2013 One way ANOVA Cell means model Factor effects model Outline Topic 17 2 One-way ANOVA Response variable Y is continuous Explanatory variable is

More information

Analysis of Covariance

Analysis of Covariance Analysis of Covariance (ANCOVA) Bruce A Craig Department of Statistics Purdue University STAT 514 Topic 10 1 When to Use ANCOVA In experiment, there is a nuisance factor x that is 1 Correlated with y 2

More information

STAT 7030: Categorical Data Analysis

STAT 7030: Categorical Data Analysis STAT 7030: Categorical Data Analysis 5. Logistic Regression Peng Zeng Department of Mathematics and Statistics Auburn University Fall 2012 Peng Zeng (Auburn University) STAT 7030 Lecture Notes Fall 2012

More information

3 Variables: Cyberloafing Conscientiousness Age

3 Variables: Cyberloafing Conscientiousness Age title 'Cyberloafing, Mike Sage'; run; PROC CORR data=sage; var Cyberloafing Conscientiousness Age; run; quit; The CORR Procedure 3 Variables: Cyberloafing Conscientiousness Age Simple Statistics Variable

More information

df=degrees of freedom = n - 1

df=degrees of freedom = n - 1 One sample t-test test of the mean Assumptions: Independent, random samples Approximately normal distribution (from intro class: σ is unknown, need to calculate and use s (sample standard deviation)) Hypotheses:

More information

Outline. Review regression diagnostics Remedial measures Weighted regression Ridge regression Robust regression Bootstrapping

Outline. Review regression diagnostics Remedial measures Weighted regression Ridge regression Robust regression Bootstrapping Topic 19: Remedies Outline Review regression diagnostics Remedial measures Weighted regression Ridge regression Robust regression Bootstrapping Regression Diagnostics Summary Check normality of the residuals

More information

Lecture 3: Inference in SLR

Lecture 3: Inference in SLR Lecture 3: Inference in SLR STAT 51 Spring 011 Background Reading KNNL:.1.6 3-1 Topic Overview This topic will cover: Review of hypothesis testing Inference about 1 Inference about 0 Confidence Intervals

More information

Booklet of Code and Output for STAC32 Final Exam

Booklet of Code and Output for STAC32 Final Exam Booklet of Code and Output for STAC32 Final Exam December 7, 2017 Figure captions are below the Figures they refer to. LowCalorie LowFat LowCarbo Control 8 2 3 2 9 4 5 2 6 3 4-1 7 5 2 0 3 1 3 3 Figure

More information

Introduction to SAS proc mixed

Introduction to SAS proc mixed Faculty of Health Sciences Introduction to SAS proc mixed Analysis of repeated measurements, 2017 Julie Forman Department of Biostatistics, University of Copenhagen 2 / 28 Preparing data for analysis The

More information

VIII. ANCOVA. A. Introduction

VIII. ANCOVA. A. Introduction VIII. ANCOVA A. Introduction In most experiments and observational studies, additional information on each experimental unit is available, information besides the factors under direct control or of interest.

More information

data proc sort proc corr run proc reg run proc glm run proc glm run proc glm run proc reg CONMAIN CONINT run proc reg DUMMAIN DUMINT run proc reg

data proc sort proc corr run proc reg run proc glm run proc glm run proc glm run proc reg CONMAIN CONINT run proc reg DUMMAIN DUMINT run proc reg data one; input id Y group X; I1=0;I2=0;I3=0;if group=1 then I1=1;if group=2 then I2=1;if group=3 then I3=1; IINT1=I1*X;IINT2=I2*X;IINT3=I3*X; *************************************************************************;

More information

Linear Modelling in Stata Session 6: Further Topics in Linear Modelling

Linear Modelling in Stata Session 6: Further Topics in Linear Modelling Linear Modelling in Stata Session 6: Further Topics in Linear Modelling Mark Lunt Arthritis Research UK Epidemiology Unit University of Manchester 14/11/2017 This Week Categorical Variables Categorical

More information

In Class Review Exercises Vartanian: SW 540

In Class Review Exercises Vartanian: SW 540 In Class Review Exercises Vartanian: SW 540 1. Given the following output from an OLS model looking at income, what is the slope and intercept for those who are black and those who are not black? b SE

More information

STA441: Spring Multiple Regression. This slide show is a free open source document. See the last slide for copyright information.

STA441: Spring Multiple Regression. This slide show is a free open source document. See the last slide for copyright information. STA441: Spring 2018 Multiple Regression This slide show is a free open source document. See the last slide for copyright information. 1 Least Squares Plane 2 Statistical MODEL There are p-1 explanatory

More information

Introduction to SAS proc mixed

Introduction to SAS proc mixed Faculty of Health Sciences Introduction to SAS proc mixed Analysis of repeated measurements, 2017 Julie Forman Department of Biostatistics, University of Copenhagen Outline Data in wide and long format

More information

Regression: Main Ideas Setting: Quantitative outcome with a quantitative explanatory variable. Example, cont.

Regression: Main Ideas Setting: Quantitative outcome with a quantitative explanatory variable. Example, cont. TCELL 9/4/205 36-309/749 Experimental Design for Behavioral and Social Sciences Simple Regression Example Male black wheatear birds carry stones to the nest as a form of sexual display. Soler et al. wanted

More information

Use of Dummy (Indicator) Variables in Applied Econometrics

Use of Dummy (Indicator) Variables in Applied Econometrics Chapter 5 Use of Dummy (Indicator) Variables in Applied Econometrics Section 5.1 Introduction Use of Dummy (Indicator) Variables Model specifications in applied econometrics often necessitate the use of

More information

STAT 350. Assignment 4

STAT 350. Assignment 4 STAT 350 Assignment 4 1. For the Mileage data in assignment 3 conduct a residual analysis and report your findings. I used the full model for this since my answers to assignment 3 suggested we needed the

More information

PLS205 Lab 2 January 15, Laboratory Topic 3

PLS205 Lab 2 January 15, Laboratory Topic 3 PLS205 Lab 2 January 15, 2015 Laboratory Topic 3 General format of ANOVA in SAS Testing the assumption of homogeneity of variances by "/hovtest" by ANOVA of squared residuals Proc Power for ANOVA One-way

More information

STA 303 H1S / 1002 HS Winter 2011 Test March 7, ab 1cde 2abcde 2fghij 3

STA 303 H1S / 1002 HS Winter 2011 Test March 7, ab 1cde 2abcde 2fghij 3 STA 303 H1S / 1002 HS Winter 2011 Test March 7, 2011 LAST NAME: FIRST NAME: STUDENT NUMBER: ENROLLED IN: (circle one) STA 303 STA 1002 INSTRUCTIONS: Time: 90 minutes Aids allowed: calculator. Some formulae

More information

Stat 500 Midterm 2 12 November 2009 page 0 of 11

Stat 500 Midterm 2 12 November 2009 page 0 of 11 Stat 500 Midterm 2 12 November 2009 page 0 of 11 Please put your name on the back of your answer book. Do NOT put it on the front. Thanks. Do not start until I tell you to. The exam is closed book, closed

More information

STAT 350: Summer Semester Midterm 1: Solutions

STAT 350: Summer Semester Midterm 1: Solutions Name: Student Number: STAT 350: Summer Semester 2008 Midterm 1: Solutions 9 June 2008 Instructor: Richard Lockhart Instructions: This is an open book test. You may use notes, text, other books and a calculator.

More information

STA 302 H1F / 1001 HF Fall 2007 Test 1 October 24, 2007

STA 302 H1F / 1001 HF Fall 2007 Test 1 October 24, 2007 STA 302 H1F / 1001 HF Fall 2007 Test 1 October 24, 2007 LAST NAME: SOLUTIONS FIRST NAME: STUDENT NUMBER: ENROLLED IN: (circle one) STA 302 STA 1001 INSTRUCTIONS: Time: 90 minutes Aids allowed: calculator.

More information

Simple, Marginal, and Interaction Effects in General Linear Models

Simple, Marginal, and Interaction Effects in General Linear Models Simple, Marginal, and Interaction Effects in General Linear Models PRE 905: Multivariate Analysis Lecture 3 Today s Class Centering and Coding Predictors Interpreting Parameters in the Model for the Means

More information

Stat 302 Statistical Software and Its Applications SAS: Simple Linear Regression

Stat 302 Statistical Software and Its Applications SAS: Simple Linear Regression 1 Stat 302 Statistical Software and Its Applications SAS: Simple Linear Regression Fritz Scholz Department of Statistics, University of Washington Winter Quarter 2015 February 16, 2015 2 The Spirit of

More information

Acknowledgements. Outline. Marie Diener-West. ICTR Leadership / Team INTRODUCTION TO CLINICAL RESEARCH. Introduction to Linear Regression

Acknowledgements. Outline. Marie Diener-West. ICTR Leadership / Team INTRODUCTION TO CLINICAL RESEARCH. Introduction to Linear Regression INTRODUCTION TO CLINICAL RESEARCH Introduction to Linear Regression Karen Bandeen-Roche, Ph.D. July 17, 2012 Acknowledgements Marie Diener-West Rick Thompson ICTR Leadership / Team JHU Intro to Clinical

More information

36-309/749 Experimental Design for Behavioral and Social Sciences. Sep. 22, 2015 Lecture 4: Linear Regression

36-309/749 Experimental Design for Behavioral and Social Sciences. Sep. 22, 2015 Lecture 4: Linear Regression 36-309/749 Experimental Design for Behavioral and Social Sciences Sep. 22, 2015 Lecture 4: Linear Regression TCELL Simple Regression Example Male black wheatear birds carry stones to the nest as a form

More information

Week 7.1--IES 612-STA STA doc

Week 7.1--IES 612-STA STA doc Week 7.1--IES 612-STA 4-573-STA 4-576.doc IES 612/STA 4-576 Winter 2009 ANOVA MODELS model adequacy aka RESIDUAL ANALYSIS Numeric data samples from t populations obtained Assume Y ij ~ independent N(μ

More information

Chapter 1: Linear Regression with One Predictor Variable also known as: Simple Linear Regression Bivariate Linear Regression

Chapter 1: Linear Regression with One Predictor Variable also known as: Simple Linear Regression Bivariate Linear Regression BSTT523: Kutner et al., Chapter 1 1 Chapter 1: Linear Regression with One Predictor Variable also known as: Simple Linear Regression Bivariate Linear Regression Introduction: Functional relation between

More information

UNIVERSITY OF TORONTO. Faculty of Arts and Science APRIL 2010 EXAMINATIONS STA 303 H1S / STA 1002 HS. Duration - 3 hours. Aids Allowed: Calculator

UNIVERSITY OF TORONTO. Faculty of Arts and Science APRIL 2010 EXAMINATIONS STA 303 H1S / STA 1002 HS. Duration - 3 hours. Aids Allowed: Calculator UNIVERSITY OF TORONTO Faculty of Arts and Science APRIL 2010 EXAMINATIONS STA 303 H1S / STA 1002 HS Duration - 3 hours Aids Allowed: Calculator LAST NAME: FIRST NAME: STUDENT NUMBER: There are 27 pages

More information

Topic 29: Three-Way ANOVA

Topic 29: Three-Way ANOVA Topic 29: Three-Way ANOVA Outline Three-way ANOVA Data Model Inference Data for three-way ANOVA Y, the response variable Factor A with levels i = 1 to a Factor B with levels j = 1 to b Factor C with levels

More information

STAT 3A03 Applied Regression With SAS Fall 2017

STAT 3A03 Applied Regression With SAS Fall 2017 STAT 3A03 Applied Regression With SAS Fall 2017 Assignment 2 Solution Set Q. 1 I will add subscripts relating to the question part to the parameters and their estimates as well as the errors and residuals.

More information

Analysis of Variance. Source DF Squares Square F Value Pr > F. Model <.0001 Error Corrected Total

Analysis of Variance. Source DF Squares Square F Value Pr > F. Model <.0001 Error Corrected Total Math 221: Linear Regression and Prediction Intervals S. K. Hyde Chapter 23 (Moore, 5th Ed.) (Neter, Kutner, Nachsheim, and Wasserman) The Toluca Company manufactures refrigeration equipment as well as

More information

Correlation and Simple Linear Regression

Correlation and Simple Linear Regression Correlation and Simple Linear Regression Sasivimol Rattanasiri, Ph.D Section for Clinical Epidemiology and Biostatistics Ramathibodi Hospital, Mahidol University E-mail: sasivimol.rat@mahidol.ac.th 1 Outline

More information

Multiple Regression: Chapter 13. July 24, 2015

Multiple Regression: Chapter 13. July 24, 2015 Multiple Regression: Chapter 13 July 24, 2015 Multiple Regression (MR) Response Variable: Y - only one response variable (quantitative) Several Predictor Variables: X 1, X 2, X 3,..., X p (p = # predictors)

More information

ST Correlation and Regression

ST Correlation and Regression Chapter 5 ST 370 - Correlation and Regression Readings: Chapter 11.1-11.4, 11.7.2-11.8, Chapter 12.1-12.2 Recap: So far we ve learned: Why we want a random sample and how to achieve it (Sampling Scheme)

More information

Lecture 11 Multiple Linear Regression

Lecture 11 Multiple Linear Regression Lecture 11 Multiple Linear Regression STAT 512 Spring 2011 Background Reading KNNL: 6.1-6.5 11-1 Topic Overview Review: Multiple Linear Regression (MLR) Computer Science Case Study 11-2 Multiple Regression

More information

Lecture 3. Experiments with a Single Factor: ANOVA Montgomery 3.1 through 3.3

Lecture 3. Experiments with a Single Factor: ANOVA Montgomery 3.1 through 3.3 Lecture 3. Experiments with a Single Factor: ANOVA Montgomery 3.1 through 3.3 Fall, 2013 Page 1 Tensile Strength Experiment Investigate the tensile strength of a new synthetic fiber. The factor is the

More information

Lecture 3. Experiments with a Single Factor: ANOVA Montgomery 3-1 through 3-3

Lecture 3. Experiments with a Single Factor: ANOVA Montgomery 3-1 through 3-3 Lecture 3. Experiments with a Single Factor: ANOVA Montgomery 3-1 through 3-3 Page 1 Tensile Strength Experiment Investigate the tensile strength of a new synthetic fiber. The factor is the weight percent

More information

Correlation. Bivariate normal densities with ρ 0. Two-dimensional / bivariate normal density with correlation 0

Correlation. Bivariate normal densities with ρ 0. Two-dimensional / bivariate normal density with correlation 0 Correlation Bivariate normal densities with ρ 0 Example: Obesity index and blood pressure of n people randomly chosen from a population Two-dimensional / bivariate normal density with correlation 0 Correlation?

More information

Chapter 14 Student Lecture Notes 14-1

Chapter 14 Student Lecture Notes 14-1 Chapter 14 Student Lecture Notes 14-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter 14 Multiple Regression Analysis and Model Building Chap 14-1 Chapter Goals After completing this

More information

Least Squares Analyses of Variance and Covariance

Least Squares Analyses of Variance and Covariance Least Squares Analyses of Variance and Covariance One-Way ANOVA Read Sections 1 and 2 in Chapter 16 of Howell. Run the program ANOVA1- LS.sas, which can be found on my SAS programs page. The data here

More information

Biological Applications of ANOVA - Examples and Readings

Biological Applications of ANOVA - Examples and Readings BIO 575 Biological Applications of ANOVA - Winter Quarter 2010 Page 1 ANOVA Pac Biological Applications of ANOVA - Examples and Readings One-factor Model I (Fixed Effects) This is the same example for

More information

Outline. Topic 22 - Interaction in Two Factor ANOVA. Interaction Not Significant. General Plan

Outline. Topic 22 - Interaction in Two Factor ANOVA. Interaction Not Significant. General Plan Topic 22 - Interaction in Two Factor ANOVA - Fall 2013 Outline Strategies for Analysis when interaction not present when interaction present when n ij = 1 when factor(s) quantitative Topic 22 2 General

More information

EXST Regression Techniques Page 1. We can also test the hypothesis H :" œ 0 versus H :"

EXST Regression Techniques Page 1. We can also test the hypothesis H : œ 0 versus H : EXST704 - Regression Techniques Page 1 Using F tests instead of t-tests We can also test the hypothesis H :" œ 0 versus H :" Á 0 with an F test.! " " " F œ MSRegression MSError This test is mathematically

More information

A Re-Introduction to General Linear Models (GLM)

A Re-Introduction to General Linear Models (GLM) A Re-Introduction to General Linear Models (GLM) Today s Class: You do know the GLM Estimation (where the numbers in the output come from): From least squares to restricted maximum likelihood (REML) Reviewing

More information

" M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2

 M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2 Notation and Equations for Final Exam Symbol Definition X The variable we measure in a scientific study n The size of the sample N The size of the population M The mean of the sample µ The mean of the

More information

Answer Keys to Homework#10

Answer Keys to Homework#10 Answer Keys to Homework#10 Problem 1 Use either restricted or unrestricted mixed models. Problem 2 (a) First, the respective means for the 8 level combinations are listed in the following table A B C Mean

More information

Lecture notes on Regression & SAS example demonstration

Lecture notes on Regression & SAS example demonstration Regression & Correlation (p. 215) When two variables are measured on a single experimental unit, the resulting data are called bivariate data. You can describe each variable individually, and you can also

More information

STOR 455 STATISTICAL METHODS I

STOR 455 STATISTICAL METHODS I STOR 455 STATISTICAL METHODS I Jan Hannig Mul9variate Regression Y=X β + ε X is a regression matrix, β is a vector of parameters and ε are independent N(0,σ) Es9mated parameters b=(x X) - 1 X Y Predicted

More information

PubH 7470: STATISTICS FOR TRANSLATIONAL & CLINICAL RESEARCH

PubH 7470: STATISTICS FOR TRANSLATIONAL & CLINICAL RESEARCH PubH 7470: STATISTICS FOR TRANSLATIONAL & CLINICAL RESEARCH From Basic to Translational: INDIRECT BIOASSAYS INDIRECT ASSAYS In indirect assays, the doses of the standard and test preparations are are applied

More information

Assignment 9 Answer Keys

Assignment 9 Answer Keys Assignment 9 Answer Keys Problem 1 (a) First, the respective means for the 8 level combinations are listed in the following table A B C Mean 26.00 + 34.67 + 39.67 + + 49.33 + 42.33 + + 37.67 + + 54.67

More information

SplineLinear.doc 1 # 9 Last save: Saturday, 9. December 2006

SplineLinear.doc 1 # 9 Last save: Saturday, 9. December 2006 SplineLinear.doc 1 # 9 Problem:... 2 Objective... 2 Reformulate... 2 Wording... 2 Simulating an example... 3 SPSS 13... 4 Substituting the indicator function... 4 SPSS-Syntax... 4 Remark... 4 Result...

More information

1. (Rao example 11.15) A study measures oxygen demand (y) (on a log scale) and five explanatory variables (see below). Data are available as

1. (Rao example 11.15) A study measures oxygen demand (y) (on a log scale) and five explanatory variables (see below). Data are available as ST 51, Summer, Dr. Jason A. Osborne Homework assignment # - Solutions 1. (Rao example 11.15) A study measures oxygen demand (y) (on a log scale) and five explanatory variables (see below). Data are available

More information

Statistics 512: Applied Linear Models. Topic 1

Statistics 512: Applied Linear Models. Topic 1 Topic Overview This topic will cover Course Overview & Policies SAS Statistics 512: Applied Linear Models Topic 1 KNNL Chapter 1 (emphasis on Sections 1.3, 1.6, and 1.7; much should be review) Simple linear

More information

Basic Business Statistics, 10/e

Basic Business Statistics, 10/e Chapter 4 4- Basic Business Statistics th Edition Chapter 4 Introduction to Multiple Regression Basic Business Statistics, e 9 Prentice-Hall, Inc. Chap 4- Learning Objectives In this chapter, you learn:

More information

Statistics 5100 Spring 2018 Exam 1

Statistics 5100 Spring 2018 Exam 1 Statistics 5100 Spring 2018 Exam 1 Directions: You have 60 minutes to complete the exam. Be sure to answer every question, and do not spend too much time on any part of any question. Be concise with all

More information

ST505/S697R: Fall Homework 2 Solution.

ST505/S697R: Fall Homework 2 Solution. ST505/S69R: Fall 2012. Homework 2 Solution. 1. 1a; problem 1.22 Below is the summary information (edited) from the regression (using R output); code at end of solution as is code and output for SAS. a)

More information

PLS205!! Lab 9!! March 6, Topic 13: Covariance Analysis

PLS205!! Lab 9!! March 6, Topic 13: Covariance Analysis PLS205!! Lab 9!! March 6, 2014 Topic 13: Covariance Analysis Covariable as a tool for increasing precision Carrying out a full ANCOVA Testing ANOVA assumptions Happiness! Covariable as a Tool for Increasing

More information