(1) The explanatory or predictor variables may be qualitative. (We ll focus on examples where this is the case.)
|
|
- Preston Watson
- 6 years ago
- Views:
Transcription
1 Introduction to Analysis of Variance Analysis of variance models are similar to regression models, in that we re interested in learning about the relationship between a dependent variable (a response) and one or more independent variables. They differ from regression models in two ways: (1) The explanatory or predictor variables may be qualitative. (We ll focus on examples where this is the case.) (2) Even if the predictor variables are quantitative, no assumption is made regarding the nature of the statistical relationship between these variables and the response variable. Regression: E{Y} increases (or decreases) linearly as X increases, etc. Y X 1
2 ANOVA: E{Y} is different at different values of X. Y X = Group Number The numbers in the horizontal axis have no meaning here: they could just as well be labels like Group 1, etc. In analysis of variance, the explanatory or predictor variables are called factors or treatments (we ll use factors ). The value of a factor is called a factor level, and could be either quantitative or qualitative. Read pgs. 666 through 671. Seriously. Know the terms & concepts. That s a hint. Some analysis of variance notation: r The number of levels of the factor under study Y ij The value of the jth observation at factor level i µ i Mean response at factor level i n i The number of observations (cases) at factor level i n T The total number of observations: n T = n1 + n nr The Model I or fixed effects ANOVA Model: Yij = µ + ε i ij 2
3 Analysis of Variance Example Fifteen students enrolled in an honors course in mathematics were divided at random into three groups of five. Each group was randomly assigned one of three instructional modes which augmented the traditional course materials: (1) programmed text; (2) video tapes; and (3) interactive computer programs. At the end of the course each student was given the same achievement test. The results are as follows: Factor Level (i) Observation (j) Programmed Text Video Tapes Software Sum Average Boxplots of Response by Factor_L (means are indicated by solid circles) 90 Response Factor_Level ProgTxt Software VideoTape 3
4 One-way Analysis of Variance Analysis of Variance for Response Source DF SS MS F P Factor_L Error Total Individual 95% CIs For Mean Based on Pooled StDev Level N Mean StDev ProgTxt ( * ) Software ( * ) VideoTap ( * ) Pooled StDev =
5 Analysis of Variance Example II A senior partner in a brokerage firm wishes to determine if there is really any difference between long-run performance of different categories of people hired as customers' representatives. The junior members of the firm are classified into four groups: professionals who have changed careers, recent business school graduates, former salespersons (PC!), and brokers hired from competing firms. A random sample of five individuals is selected from each of these categories and a "detailed performance score" is obtained. The scores are as follows: Professionals Business School Grads Salespersons Brokers To address the partner's question we test: H 0 : µ 1 = µ 2 = µ 3 = µ 4 least one vs. H 1 : At Minitab Data Layout & Results: Score Background 85 Professional 95 Professional 96 Professional 91 Professional 88 Professional 73 Grads 54 Grads 72 Grads 81 Grads 69 Grads 67 Salespersons 74 Salespersons 65 Salespersons 68 Salespersons 77 Salespersons 87 Brokers 90 Brokers 84 Brokers 92 Brokers 94 Brokers 5
6 Boxplots of Score by Backgrou (means are indicated by solid circles) Score Background Brokers Grads Professional Salespersons One-way Analysis of Variance Analysis of Variance Source DF SS MS F P Factor Error Total Individual 95% CIs For Mean Based on Pooled StDev Level N Mean StDev Professi (-----*-----) Grads (-----*-----) Salesper (-----*-----) Brokers (-----*-----) Pooled StDev =
7 Tukey's pairwise comparisons Family error rate = Individual error rate = Critical value = 4.05 Intervals for (column level mean) - (row level mean) Brokers Grads Professi Grads Professi Salesper We re 95% confident that ALL of these intervals contain the true differences in population means! (This is a family confidence statement.) Homogeneity of Variance Test for Score 95% Confidence Intervals for Sigmas Factor Levels Brokers Bartlett's Test Grads Test Statistic: P-Value : Professional Lev ene's Test Test Statistic: P-Value : Salespersons
8 Analysis of Variance vs. Two-Sample t-test (r = 2) Response = score on a general knowledge test given to high school students Factor = type of high school attended: public or private Boxplots of B_Score by School (means are indicated by solid circles) B_Score School Priv_School Pub_School One-way Analysis of Variance Analysis of Variance for B_Score Source DF SS MS F P School Error Total Individual 95% CIs For Mean Based on Pooled StDev Level N Mean StDev Priv_Sch ( * ) Pub_Scho ( * ) Pooled StDev = Two Sample T-Test and Confidence Interval (Equal variances assumed!) Two sample T for B_Score School N Mean StDev SE Mean Priv_Schoo Pub_School % CI for mu (Priv_Schoo) - mu (Pub_School): ( 1.11, 4.59) T-Test mu (Priv_Schoo) = mu (Pub_School) (vs not =): T = 3.28 P = DF = 57 Both use Pooled StDev =
9 Regression approach to analysis of variance (Section 16.11) (Not on test!) Consider the brokerage firm example and set up indicator (dummy) variables for each factor level: Score Background Prof_Ind Grad_Ind Sales_Ind Broker_Ind 85 Professional Professional Professional Professional Professional Grads Grads Grads Grads Grads Salespersons Salespersons Salespersons Salespersons Salespersons Brokers Brokers Brokers Brokers Brokers
10 One-way Analysis of Variance Analysis of Variance for Score Source DF SS MS F P Backgrou Error Total Individual 95% CIs For Mean Based on Pooled StDev Level N Mean StDev Brokers (-----*-----) Grads (-----*-----) Professi (-----*-----) Salesper (-----*-----) Pooled StDev = Regression Analysis (X s are the four indicator variables) * Broker_Ind is highly correlated with other X variables * Broker_Ind has been removed from the equation The regression equation is Score = Prof_Ind Grad_Ind Sales_Ind Predictor Coef StDev T P Constant Prof_Ind Grad_Ind Sales_In S = R-Sq = 76.1% R-Sq(adj) = 71.6% Analysis of Variance Source DF SS MS F P Regression Residual Error Total Source DF Seq SS Prof_Ind Grad_Ind Sales_In Unusual Observations Obs Prof_Ind Score Fit StDev Fit Residual St Resid R R denotes an observation with a large standardized residual 10
11 Selected ANOVA Topics, Concepts, & Comments (NOT ON TEST!) Alternative model formulation (Section 16.10): Yij = µ. + τ + ε What do the parameters represent? What are the appropriate null and alternative hypotheses? Estimation (17.3): Estimating a specific factor level mean (original formulation) µ i : i ij Y ± t(1 α / 2, n r) i. T MSE n i Estimating a difference D between two factor level means: D = µ µ i i' ( Y Y ) ± t(1 α / 2; n i. i'. T 1 r) MSE ni 1 + n i' 1/ 2 You can use Bonnferroni s logic if you need to perform simultaneous estimation. Fixed (Model I) vs. Random (Model II) Effects Models (16.6) Fixed effects: A company produces widgets on five different machines. The question of interest is whether the mean strength of the widgets is the same for all machines. To test, we obtain responses from each machine and perform ANOVA. Random effects: A company produces widgets on 100 different machines, and once again the question of interest is whether the mean strength of the widgets is the same for all machines. To test, we randomly select five machines, obtain responses from these machines, and perform ANOVA. In the first case we have data for all five possible factor levels, and our interest is restricted to these five. In the second case for have data for five factor levels that represent a sample from a larger population of factor levels. Our interest is in the larger population. 11
12 Power and Sample Size (Section 26.4) ni ( µ i µ 1. ) i= 1 Power is a function of the noncentrality parameter: φ =, a σ r measure of how much the µ s vary from their mean (when H 0 is true, this value is zero). Sample sizes can be chosen so that the test will have a desired power value. As an example, suppose that r = 4, σ = 10, and we d like a 90% chance of rejecting H 0 when H 1 is true and, specifically, µ 1 = µ 2 = µ 3 but µ 4 = µ Minitab tells us that we d need a sample size of n 1 = n 2 = n 3 = n 4 = 20 to achieve this power level. Here the output: Power and Sample Size One-way ANOVA Sigma = 10 Alpha = 0.05 Number of Levels = 4 Corrected Sum of Squares of Means = 75 Means = 10, 10, 10, 20 Sample Target Actual Size Power Power Note that the choice of a base mean of 10 is arbitrary and irrelevant. The key factor is the relationship among the means. Now suppose we want the power to be 0.90 when H 1 is true and, specifically, µ 1 = µ 2 = 10 and µ 3 = µ 4 = (again assume that r = 4, σ = 10). Minitab says: Power and Sample Size One-way ANOVA Sigma = 10 Alpha = 0.05 Number of Levels = 4 Corrected Sum of Squares of Means = Means = 10, 10, 18.66, Sample Target Actual Size Power Power This is the same result because the µ-values in H 1 were chosen so that the noncentrality parameter ϕ is the same in both cases. r 2 12
13 Two Factor (Two-Way) Analysis of Variance (Chapter 19) This is, conceptually, the same as going from simple linear regression to multiple regression with two independent variables. There are now two factors of interest in the study. Here s an example (text Problem 19.20). The response is the prediction error obtained when a programmer at a large computer software firm is asked to predict the number of programmer-days required to complete a large project. The two factors are: Sys_Exp: The type of experience that the programmer has. A 1 means that the programmer s experience is limited to working with small systems, a 2 means that the programmer has experience on small and large systems. Yrs_Exp: The number of years of experience for the programmer. A 1 means under 5 years., a 2 means >= 5 but < 10 years, and a 3 means 10 or more years. The experiment was performed with 24 programmers (4 per cell). Here s the Minitab printout: Two-way Analysis of Variance Analysis of Variance for Error Source DF SS MS F P Sys_Expe Yrs_Exp Error Total Individual 95% CI Sys_Expe Mean (-----*------) (-----*------) Individual 95% CI Yrs_Exp Mean (------*------) 2 76 (------*------) 3 50 (------*------) As is the case for multiple regression, we could also add an interaction term to the model! 13
14 A Famous Two-Factor Design: the randomized block design (Chapter 27) Research Question: Is there a difference in night vision effectiveness for four different headlamp designs (these are the treatments or factor levels )? To test this, we obtain 12 test subjects and desire a balanced design. In the completely randomized design approach, we would randomly assign three subjects to each treatment (to each headlamp design). Now consider the fact that our subjects vary in age. Let Y = young driver; M = middle age driver, O = old driver. Here s a possible result with the completely randomized approach: Treatment (Headlamp Design) Y M Y O Y O M Y O O Y M Response will be the distance (in feet) at which a sign can be read. There's a possible problem here. If, for example, test results indicate that Design 1 is the best design, is it because it really is best or because it was tested by younger drivers who had better night vision? The solution: a randomized block design which will equalize the effects of age, if any exist. Arrange subjects into similar groups called blocks (in this case "similar" means in the same age group), then randomly assign treatments within the blocks. Treatment (Headlamp Design) Y Y Y Y M M M M O O O O The goal is to choose blocks which have minimal variation within blocks (they re as homogenous as possible) and maximal variation between blocks. 14
15 The model looks like: Yij =.. µ + ρ + τ + ε i j ij The ρ s are block effects and the τ s are treatment effects. Here s a Minitab result: Two-way Analysis of Variance Analysis of Variance for Distance Source DF SS MS F P Age Grou Treatmen Error Total Individual 95% CI Age Grou Mean M (---*--) O (---*---) Y (---*--) Individual 95% CI Treatmen Mean ( * ) ( * ) ( * ) ( * )
Confidence Interval for the mean response
Week 3: Prediction and Confidence Intervals at specified x. Testing lack of fit with replicates at some x's. Inference for the correlation. Introduction to regression with several explanatory variables.
More informationAnalysis of Covariance. The following example illustrates a case where the covariate is affected by the treatments.
Analysis of Covariance In some experiments, the experimental units (subjects) are nonhomogeneous or there is variation in the experimental conditions that are not due to the treatments. For example, a
More informationSMAM 314 Practice Final Examination Winter 2003
SMAM 314 Practice Final Examination Winter 2003 You may use your textbook, one page of notes and a calculator. Please hand in the notes with your exam. 1. Mark the following statements True T or False
More informationMultiple Regression Examples
Multiple Regression Examples Example: Tree data. we have seen that a simple linear regression of usable volume on diameter at chest height is not suitable, but that a quadratic model y = β 0 + β 1 x +
More informationANOVA: Analysis of Variation
ANOVA: Analysis of Variation The basic ANOVA situation Two variables: 1 Categorical, 1 Quantitative Main Question: Do the (means of) the quantitative variables depend on which group (given by categorical
More informationBusiness 320, Fall 1999, Final
Business 320, Fall 1999, Final name You may use a calculator and two cheat sheets. You have 3 hours. I pledge my honor that I have not violated the Honor Code during this examination. Obvioiusly, you may
More informationExamination paper for TMA4255 Applied statistics
Department of Mathematical Sciences Examination paper for TMA4255 Applied statistics Academic contact during examination: Anna Marie Holand Phone: 951 38 038 Examination date: 16 May 2015 Examination time
More informationCh 13 & 14 - Regression Analysis
Ch 3 & 4 - Regression Analysis Simple Regression Model I. Multiple Choice:. A simple regression is a regression model that contains a. only one independent variable b. only one dependent variable c. more
More informationANOVA Situation The F Statistic Multiple Comparisons. 1-Way ANOVA MATH 143. Department of Mathematics and Statistics Calvin College
1-Way ANOVA MATH 143 Department of Mathematics and Statistics Calvin College An example ANOVA situation Example (Treating Blisters) Subjects: 25 patients with blisters Treatments: Treatment A, Treatment
More informationSTA 108 Applied Linear Models: Regression Analysis Spring Solution for Homework #6
STA 8 Applied Linear Models: Regression Analysis Spring 011 Solution for Homework #6 6. a) = 11 1 31 41 51 1 3 4 5 11 1 31 41 51 β = β1 β β 3 b) = 1 1 1 1 1 11 1 31 41 51 1 3 4 5 β = β 0 β1 β 6.15 a) Stem-and-leaf
More informationInference for Regression Inference about the Regression Model and Using the Regression Line
Inference for Regression Inference about the Regression Model and Using the Regression Line PBS Chapter 10.1 and 10.2 2009 W.H. Freeman and Company Objectives (PBS Chapter 10.1 and 10.2) Inference about
More information1-Way ANOVA MATH 143. Spring Department of Mathematics and Statistics Calvin College
1-Way ANOVA MATH 143 Department of Mathematics and Statistics Calvin College Spring 2010 The basic ANOVA situation Two variables: 1 Categorical, 1 Quantitative Main Question: Do the (means of) the quantitative
More information22s:152 Applied Linear Regression. Take random samples from each of m populations.
22s:152 Applied Linear Regression Chapter 8: ANOVA NOTE: We will meet in the lab on Monday October 10. One-way ANOVA Focuses on testing for differences among group means. Take random samples from each
More informationPART I. (a) Describe all the assumptions for a normal error regression model with one predictor variable,
Concordia University Department of Mathematics and Statistics Course Number Section Statistics 360/2 01 Examination Date Time Pages Final December 2002 3 hours 6 Instructors Course Examiner Marks Y.P.
More informationThe simple linear regression model discussed in Chapter 13 was written as
1519T_c14 03/27/2006 07:28 AM Page 614 Chapter Jose Luis Pelaez Inc/Blend Images/Getty Images, Inc./Getty Images, Inc. 14 Multiple Regression 14.1 Multiple Regression Analysis 14.2 Assumptions of the Multiple
More information1 Introduction to Minitab
1 Introduction to Minitab Minitab is a statistical analysis software package. The software is freely available to all students and is downloadable through the Technology Tab at my.calpoly.edu. When you
More informationBasic Business Statistics, 10/e
Chapter 4 4- Basic Business Statistics th Edition Chapter 4 Introduction to Multiple Regression Basic Business Statistics, e 9 Prentice-Hall, Inc. Chap 4- Learning Objectives In this chapter, you learn:
More information22s:152 Applied Linear Regression. There are a couple commonly used models for a one-way ANOVA with m groups. Chapter 8: ANOVA
22s:152 Applied Linear Regression Chapter 8: ANOVA NOTE: We will meet in the lab on Monday October 10. One-way ANOVA Focuses on testing for differences among group means. Take random samples from each
More informationOne-Way Analysis of Variance (ANOVA)
1 One-Way Analysis of Variance (ANOVA) One-Way Analysis of Variance (ANOVA) is a method for comparing the means of a populations. This kind of problem arises in two different settings 1. When a independent
More informationModel Building Chap 5 p251
Model Building Chap 5 p251 Models with one qualitative variable, 5.7 p277 Example 4 Colours : Blue, Green, Lemon Yellow and white Row Blue Green Lemon Insects trapped 1 0 0 1 45 2 0 0 1 59 3 0 0 1 48 4
More information23. Inference for regression
23. Inference for regression The Practice of Statistics in the Life Sciences Third Edition 2014 W. H. Freeman and Company Objectives (PSLS Chapter 23) Inference for regression The regression model Confidence
More informationSMAM 314 Exam 42 Name
SMAM 314 Exam 42 Name Mark the following statements True (T) or False (F) (10 points) 1. F A. The line that best fits points whose X and Y values are negatively correlated should have a positive slope.
More information2.4.3 Estimatingσ Coefficient of Determination 2.4. ASSESSING THE MODEL 23
2.4. ASSESSING THE MODEL 23 2.4.3 Estimatingσ 2 Note that the sums of squares are functions of the conditional random variables Y i = (Y X = x i ). Hence, the sums of squares are random variables as well.
More informationSchool of Mathematical Sciences. Question 1. Best Subsets Regression
School of Mathematical Sciences MTH5120 Statistical Modelling I Practical 9 and Assignment 8 Solutions Question 1 Best Subsets Regression Response is Crime I n W c e I P a n A E P U U l e Mallows g E P
More informationModels with qualitative explanatory variables p216
Models with qualitative explanatory variables p216 Example gen = 1 for female Row gpa hsm gen 1 3.32 10 0 2 2.26 6 0 3 2.35 8 0 4 2.08 9 0 5 3.38 8 0 6 3.29 10 0 7 3.21 8 0 8 2.00 3 0 9 3.18 9 0 10 2.34
More informationBasic Business Statistics 6 th Edition
Basic Business Statistics 6 th Edition Chapter 12 Simple Linear Regression Learning Objectives In this chapter, you learn: How to use regression analysis to predict the value of a dependent variable based
More informationDisadvantages of using many pooled t procedures. The sampling distribution of the sample means. The variability between the sample means
Stat 529 (Winter 2011) Analysis of Variance (ANOVA) Reading: Sections 5.1 5.3. Introduction and notation Birthweight example Disadvantages of using many pooled t procedures The analysis of variance procedure
More informationSMAM 319 Exam 1 Name. 1.Pick the best choice for the multiple choice questions below (10 points 2 each)
SMAM 319 Exam 1 Name 1.Pick the best choice for the multiple choice questions below (10 points 2 each) A b In Metropolis there are some houses for sale. Superman and Lois Lane are interested in the average
More informationUnit 12: Analysis of Single Factor Experiments
Unit 12: Analysis of Single Factor Experiments Statistics 571: Statistical Methods Ramón V. León 7/16/2004 Unit 12 - Stat 571 - Ramón V. León 1 Introduction Chapter 8: How to compare two treatments. Chapter
More informationAnalysis of Variance
Statistical Techniques II EXST7015 Analysis of Variance 15a_ANOVA_Introduction 1 Design The simplest model for Analysis of Variance (ANOVA) is the CRD, the Completely Randomized Design This model is also
More informationInstitutionen för matematik och matematisk statistik Umeå universitet November 7, Inlämningsuppgift 3. Mariam Shirdel
Institutionen för matematik och matematisk statistik Umeå universitet November 7, 2011 Inlämningsuppgift 3 Mariam Shirdel (mash0007@student.umu.se) Kvalitetsteknik och försöksplanering, 7.5 hp 1 Uppgift
More informationMultiple Linear Regression
Andrew Lonardelli December 20, 2013 Multiple Linear Regression 1 Table Of Contents Introduction: p.3 Multiple Linear Regression Model: p.3 Least Squares Estimation of the Parameters: p.4-5 The matrix approach
More informationStat 231 Final Exam. Consider first only the measurements made on housing number 1.
December 16, 1997 Stat 231 Final Exam Professor Vardeman 1. The first page of printout attached to this exam summarizes some data (collected by a student group) on the diameters of holes bored in certain
More informationThis document contains 3 sets of practice problems.
P RACTICE PROBLEMS This document contains 3 sets of practice problems. Correlation: 3 problems Regression: 4 problems ANOVA: 8 problems You should print a copy of these practice problems and bring them
More information28. SIMPLE LINEAR REGRESSION III
28. SIMPLE LINEAR REGRESSION III Fitted Values and Residuals To each observed x i, there corresponds a y-value on the fitted line, y = βˆ + βˆ x. The are called fitted values. ŷ i They are the values of
More informationSix Sigma Black Belt Study Guides
Six Sigma Black Belt Study Guides 1 www.pmtutor.org Powered by POeT Solvers Limited. Analyze Correlation and Regression Analysis 2 www.pmtutor.org Powered by POeT Solvers Limited. Variables and relationships
More informationSMAM 319 Exam1 Name. a B.The equation of a line is 3x + y =6. The slope is a. -3 b.3 c.6 d.1/3 e.-1/3
SMAM 319 Exam1 Name 1. Pick the best choice. (10 points-2 each) _c A. A data set consisting of fifteen observations has the five number summary 4 11 12 13 15.5. For this data set it is definitely true
More information[4+3+3] Q 1. (a) Describe the normal regression model through origin. Show that the least square estimator of the regression parameter is given by
Concordia University Department of Mathematics and Statistics Course Number Section Statistics 360/1 40 Examination Date Time Pages Final June 2004 3 hours 7 Instructors Course Examiner Marks Y.P. Chaubey
More informationDESAIN EKSPERIMEN Analysis of Variances (ANOVA) Semester Genap 2017/2018 Jurusan Teknik Industri Universitas Brawijaya
DESAIN EKSPERIMEN Analysis of Variances (ANOVA) Semester Jurusan Teknik Industri Universitas Brawijaya Outline Introduction The Analysis of Variance Models for the Data Post-ANOVA Comparison of Means Sample
More informationTopic 17 - Single Factor Analysis of Variance. Outline. One-way ANOVA. The Data / Notation. One way ANOVA Cell means model Factor effects model
Topic 17 - Single Factor Analysis of Variance - Fall 2013 One way ANOVA Cell means model Factor effects model Outline Topic 17 2 One-way ANOVA Response variable Y is continuous Explanatory variable is
More informationSMAM 314 Computer Assignment 5 due Nov 8,2012 Data Set 1. For each of the following data sets use Minitab to 1. Make a scatterplot.
SMAM 314 Computer Assignment 5 due Nov 8,2012 Data Set 1. For each of the following data sets use Minitab to 1. Make a scatterplot. 2. Fit the linear regression line. Regression Analysis: y versus x y
More informationANALYSIS OF VARIANCE OF BALANCED DAIRY SCIENCE DATA USING SAS
ANALYSIS OF VARIANCE OF BALANCED DAIRY SCIENCE DATA USING SAS Ravinder Malhotra and Vipul Sharma National Dairy Research Institute, Karnal-132001 The most common use of statistics in dairy science is testing
More informationOrthogonal contrasts for a 2x2 factorial design Example p130
Week 9: Orthogonal comparisons for a 2x2 factorial design. The general two-factor factorial arrangement. Interaction and additivity. ANOVA summary table, tests, CIs. Planned/post-hoc comparisons for the
More informationIn ANOVA the response variable is numerical and the explanatory variables are categorical.
1 ANOVA ANOVA means ANalysis Of VAriance. The ANOVA is a tool for studying the influence of one or more qualitative variables on the mean of a numerical variable in a population. In ANOVA the response
More information1. Least squares with more than one predictor
Statistics 1 Lecture ( November ) c David Pollard Page 1 Read M&M Chapter (skip part on logistic regression, pages 730 731). Read M&M pages 1, for ANOVA tables. Multiple regression. 1. Least squares with
More information1. An article on peanut butter in Consumer reports reported the following scores for various brands
SMAM 314 Review Exam 1 1. An article on peanut butter in Consumer reports reported the following scores for various brands Creamy 56 44 62 36 39 53 50 65 45 40 56 68 41 30 40 50 50 56 65 56 45 40 Crunchy
More informationSection 4.6 Simple Linear Regression
Section 4.6 Simple Linear Regression Objectives ˆ Basic philosophy of SLR and the regression assumptions ˆ Point & interval estimation of the model parameters, and how to make predictions ˆ Point and interval
More informationn i n T Note: You can use the fact that t(.975; 10) = 2.228, t(.95; 10) = 1.813, t(.975; 12) = 2.179, t(.95; 12) =
MAT 3378 3X Midterm Examination (Solutions) 1. An experiment with a completely randomized design was run to determine whether four specific firing temperatures affect the density of a certain type of brick.
More informationCategorical Predictor Variables
Categorical Predictor Variables We often wish to use categorical (or qualitative) variables as covariates in a regression model. For binary variables (taking on only 2 values, e.g. sex), it is relatively
More informationSchool of Mathematical Sciences. Question 1
School of Mathematical Sciences MTH5120 Statistical Modelling I Practical 8 and Assignment 7 Solutions Question 1 Figure 1: The residual plots do not contradict the model assumptions of normality, constant
More informationInference for Regression Simple Linear Regression
Inference for Regression Simple Linear Regression IPS Chapter 10.1 2009 W.H. Freeman and Company Objectives (IPS Chapter 10.1) Simple linear regression p Statistical model for linear regression p Estimating
More informationMBA Statistics COURSE #4
MBA Statistics 51-651-00 COURSE #4 Simple and multiple linear regression What should be the sales of ice cream? Example: Before beginning building a movie theater, one must estimate the daily number of
More informationFactorial designs. Experiments
Chapter 5: Factorial designs Petter Mostad mostad@chalmers.se Experiments Actively making changes and observing the result, to find causal relationships. Many types of experimental plans Measuring response
More informationChapter 5 Introduction to Factorial Designs Solutions
Solutions from Montgomery, D. C. (1) Design and Analysis of Experiments, Wiley, NY Chapter 5 Introduction to Factorial Designs Solutions 5.1. The following output was obtained from a computer program that
More informationDepartment of Mathematics & Statistics STAT 2593 Final Examination 17 April, 2000
Department of Mathematics & Statistics STAT 2593 Final Examination 17 April, 2000 TIME: 3 hours. Total marks: 80. (Marks are indicated in margin.) Remember that estimate means to give an interval estimate.
More informationdf=degrees of freedom = n - 1
One sample t-test test of the mean Assumptions: Independent, random samples Approximately normal distribution (from intro class: σ is unknown, need to calculate and use s (sample standard deviation)) Hypotheses:
More information1 Use of indicator random variables. (Chapter 8)
1 Use of indicator random variables. (Chapter 8) let I(A) = 1 if the event A occurs, and I(A) = 0 otherwise. I(A) is referred to as the indicator of the event A. The notation I A is often used. 1 2 Fitting
More informationAnalysis of Variance (ANOVA)
Analysis of Variance ANOVA) Compare several means Radu Trîmbiţaş 1 Analysis of Variance for a One-Way Layout 1.1 One-way ANOVA Analysis of Variance for a One-Way Layout procedure for one-way layout Suppose
More informationSTAT 360-Linear Models
STAT 360-Linear Models Instructor: Yogendra P. Chaubey Sample Test Questions Fall 004 Note: The following questions are from previous tests and exams. The final exam will be for three hours and will contain
More informationW&M CSCI 688: Design of Experiments Homework 2. Megan Rose Bryant
W&M CSCI 688: Design of Experiments Homework 2 Megan Rose Bryant September 25, 201 3.5 The tensile strength of Portland cement is being studied. Four different mixing techniques can be used economically.
More informationχ test statistics of 2.5? χ we see that: χ indicate agreement between the two sets of frequencies.
I. T or F. (1 points each) 1. The χ -distribution is symmetric. F. The χ may be negative, zero, or positive F 3. The chi-square distribution is skewed to the right. T 4. The observed frequency of a cell
More informationINFERENCE FOR REGRESSION
CHAPTER 3 INFERENCE FOR REGRESSION OVERVIEW In Chapter 5 of the textbook, we first encountered regression. The assumptions that describe the regression model we use in this chapter are the following. We
More informationSimple Linear Regression: One Qualitative IV
Simple Linear Regression: One Qualitative IV 1. Purpose As noted before regression is used both to explain and predict variation in DVs, and adding to the equation categorical variables extends regression
More informationMultiple Regression. Inference for Multiple Regression and A Case Study. IPS Chapters 11.1 and W.H. Freeman and Company
Multiple Regression Inference for Multiple Regression and A Case Study IPS Chapters 11.1 and 11.2 2009 W.H. Freeman and Company Objectives (IPS Chapters 11.1 and 11.2) Multiple regression Data for multiple
More informationPLS205 Lab 2 January 15, Laboratory Topic 3
PLS205 Lab 2 January 15, 2015 Laboratory Topic 3 General format of ANOVA in SAS Testing the assumption of homogeneity of variances by "/hovtest" by ANOVA of squared residuals Proc Power for ANOVA One-way
More informationFinal Exam Bus 320 Spring 2000 Russell
Name Final Exam Bus 320 Spring 2000 Russell Do not turn over this page until you are told to do so. You will have 3 hours minutes to complete this exam. The exam has a total of 100 points and is divided
More informationUnit 27 One-Way Analysis of Variance
Unit 27 One-Way Analysis of Variance Objectives: To perform the hypothesis test in a one-way analysis of variance for comparing more than two population means Recall that a two sample t test is applied
More informationWhat Is ANOVA? Comparing Groups. One-way ANOVA. One way ANOVA (the F ratio test)
What Is ANOVA? One-way ANOVA ANOVA ANalysis Of VAriance ANOVA compares the means of several groups. The groups are sometimes called "treatments" First textbook presentation in 95. Group Group σ µ µ σ µ
More informationChapter 4. Regression Models. Learning Objectives
Chapter 4 Regression Models To accompany Quantitative Analysis for Management, Eleventh Edition, by Render, Stair, and Hanna Power Point slides created by Brian Peterson Learning Objectives After completing
More informationOne-Way ANOVA. Some examples of when ANOVA would be appropriate include:
One-Way ANOVA 1. Purpose Analysis of variance (ANOVA) is used when one wishes to determine whether two or more groups (e.g., classes A, B, and C) differ on some outcome of interest (e.g., an achievement
More informationData Set 8: Laysan Finch Beak Widths
Data Set 8: Finch Beak Widths Statistical Setting This handout describes an analysis of covariance (ANCOVA) involving one categorical independent variable (with only two levels) and one quantitative covariate.
More informationConditions for Regression Inference:
AP Statistics Chapter Notes. Inference for Linear Regression We can fit a least-squares line to any data relating two quantitative variables, but the results are useful only if the scatterplot shows a
More informationApart from this page, you are not permitted to read the contents of this question paper until instructed to do so by an invigilator.
B. Sc. Examination by course unit 2014 MTH5120 Statistical Modelling I Duration: 2 hours Date and time: 16 May 2014, 1000h 1200h Apart from this page, you are not permitted to read the contents of this
More informationGroup comparison test for independent samples
Group comparison test for independent samples The purpose of the Analysis of Variance (ANOVA) is to test for significant differences between means. Supposing that: samples come from normal populations
More informationResidual Analysis for two-way ANOVA The twoway model with K replicates, including interaction,
Residual Analysis for two-way ANOVA The twoway model with K replicates, including interaction, is Y ijk = µ ij + ɛ ijk = µ + α i + β j + γ ij + ɛ ijk with i = 1,..., I, j = 1,..., J, k = 1,..., K. In carrying
More informationStat 6640 Solution to Midterm #2
Stat 6640 Solution to Midterm #2 1. A study was conducted to examine how three statistical software packages used in a statistical course affect the statistical competence a student achieves. At the end
More information1 Introduction to One-way ANOVA
Review Source: Chapter 10 - Analysis of Variance (ANOVA). Example Data Source: Example problem 10.1 (dataset: exp10-1.mtw) Link to Data: http://www.auburn.edu/~carpedm/courses/stat3610/textbookdata/minitab/
More informationChapter 15 Multiple Regression
Multiple Regression Learning Objectives 1. Understand how multiple regression analysis can be used to develop relationships involving one dependent variable and several independent variables. 2. Be able
More informationHarvard University. Rigorous Research in Engineering Education
Statistical Inference Kari Lock Harvard University Department of Statistics Rigorous Research in Engineering Education 12/3/09 Statistical Inference You have a sample and want to use the data collected
More informationSTATISTICS 110/201 PRACTICE FINAL EXAM
STATISTICS 110/201 PRACTICE FINAL EXAM Questions 1 to 5: There is a downloadable Stata package that produces sequential sums of squares for regression. In other words, the SS is built up as each variable
More informationSTAT 212 Business Statistics II 1
STAT 1 Business Statistics II 1 KING FAHD UNIVERSITY OF PETROLEUM & MINERALS DEPARTMENT OF MATHEMATICAL SCIENCES DHAHRAN, SAUDI ARABIA STAT 1: BUSINESS STATISTICS II Semester 091 Final Exam Thursday Feb
More informationLec 1: An Introduction to ANOVA
Ying Li Stockholm University October 31, 2011 Three end-aisle displays Which is the best? Design of the Experiment Identify the stores of the similar size and type. The displays are randomly assigned to
More informationOne-Way Analysis of Variance. With regression, we related two quantitative, typically continuous variables.
One-Way Analysis of Variance With regression, we related two quantitative, typically continuous variables. Often we wish to relate a quantitative response variable with a qualitative (or simply discrete)
More informationConcordia University (5+5)Q 1.
(5+5)Q 1. Concordia University Department of Mathematics and Statistics Course Number Section Statistics 360/1 40 Examination Date Time Pages Mid Term Test May 26, 2004 Two Hours 3 Instructor Course Examiner
More informationNotes for Week 13 Analysis of Variance (ANOVA) continued WEEK 13 page 1
Notes for Wee 13 Analysis of Variance (ANOVA) continued WEEK 13 page 1 Exam 3 is on Friday May 1. A part of one of the exam problems is on Predictiontervals : When randomly sampling from a normal population
More informationHomework 2: Simple Linear Regression
STAT 4385 Applied Regression Analysis Homework : Simple Linear Regression (Simple Linear Regression) Thirty (n = 30) College graduates who have recently entered the job market. For each student, the CGPA
More informationBusiness Statistics. Chapter 14 Introduction to Linear Regression and Correlation Analysis QMIS 220. Dr. Mohammad Zainal
Department of Quantitative Methods & Information Systems Business Statistics Chapter 14 Introduction to Linear Regression and Correlation Analysis QMIS 220 Dr. Mohammad Zainal Chapter Goals After completing
More informationDESAIN EKSPERIMEN BLOCKING FACTORS. Semester Genap 2017/2018 Jurusan Teknik Industri Universitas Brawijaya
DESAIN EKSPERIMEN BLOCKING FACTORS Semester Genap Jurusan Teknik Industri Universitas Brawijaya Outline The Randomized Complete Block Design The Latin Square Design The Graeco-Latin Square Design Balanced
More informationChap The McGraw-Hill Companies, Inc. All rights reserved.
11 pter11 Chap Analysis of Variance Overview of ANOVA Multiple Comparisons Tests for Homogeneity of Variances Two-Factor ANOVA Without Replication General Linear Model Experimental Design: An Overview
More informationCorrelation & Simple Regression
Chapter 11 Correlation & Simple Regression The previous chapter dealt with inference for two categorical variables. In this chapter, we would like to examine the relationship between two quantitative variables.
More informationDepartment of Mathematics & Statistics Stat 2593 Final Examination 19 April 2001
Department of Mathematics & Statistics Stat 2593 Final Examination 19 April 2001 TIME: 3 hours. Total Marks: 60. Indicate your answers clearly. Show all work. Remember to answer as a statistician should.
More informationSTATISTICS FOR ECONOMISTS: A BEGINNING. John E. Floyd University of Toronto
STATISTICS FOR ECONOMISTS: A BEGINNING John E. Floyd University of Toronto July 2, 2010 PREFACE The pages that follow contain the material presented in my introductory quantitative methods in economics
More informationCorrelation Analysis
Simple Regression Correlation Analysis Correlation analysis is used to measure strength of the association (linear relationship) between two variables Correlation is only concerned with strength of the
More informationInference with Simple Regression
1 Introduction Inference with Simple Regression Alan B. Gelder 06E:071, The University of Iowa 1 Moving to infinite means: In this course we have seen one-mean problems, twomean problems, and problems
More informationQ Lecture Introduction to Regression
Q3 2009 1 Before/After Transformation 2 Construction Role of T-ratios Formally, even under Null Hyp: H : 0, ˆ, being computed from k t k SE ˆ ˆ y values themselves containing random error, will sometimes
More informationStatistics for Managers using Microsoft Excel 6 th Edition
Statistics for Managers using Microsoft Excel 6 th Edition Chapter 13 Simple Linear Regression 13-1 Learning Objectives In this chapter, you learn: How to use regression analysis to predict the value of
More informationIntroduction to Regression
Introduction to Regression Using Mult Lin Regression Derived variables Many alternative models Which model to choose? Model Criticism Modelling Objective Model Details Data and Residuals Assumptions 1
More informationEXAM IN TMA4255 EXPERIMENTAL DESIGN AND APPLIED STATISTICAL METHODS
Norges teknisk naturvitenskapelige universitet Institutt for matematiske fag Side 1 av 8 Contact during exam: Bo Lindqvist Tel. 975 89 418 EXAM IN TMA4255 EXPERIMENTAL DESIGN AND APPLIED STATISTICAL METHODS
More informationReview of Statistics 101
Review of Statistics 101 We review some important themes from the course 1. Introduction Statistics- Set of methods for collecting/analyzing data (the art and science of learning from data). Provides methods
More informationAnalysis of Bivariate Data
Analysis of Bivariate Data Data Two Quantitative variables GPA and GAES Interest rates and indices Tax and fund allocation Population size and prison population Bivariate data (x,y) Case corr® 2 Independent
More information