COMBINED ANALYSIS OF DATA

Similar documents
MULTIVARIATE ANALYSIS OF VARIANCE

SAMPLING IN FIELD EXPERIMENTS

COMPLETELY RANDOM DESIGN (CRD) -Design can be used when experimental units are essentially homogeneous.

One-way ANOVA. Experimental Design. One-way ANOVA

PLS205 Lab 2 January 15, Laboratory Topic 3

Summary of Chapter 7 (Sections ) and Chapter 8 (Section 8.1)

BLOCK DESIGNS WITH FACTORIAL STRUCTURE

Using SPSS for One Way Analysis of Variance

Outline. PubH 5450 Biostatistics I Prof. Carlin. Confidence Interval for the Mean. Part I. Reviews

Independent Samples ANOVA

Topic 12. The Split-plot Design and its Relatives (continued) Repeated Measures

Hypothesis Tests and Estimation for Population Variances. Copyright 2014 Pearson Education, Inc.

Prepared by: Prof. Dr Bahaman Abu Samah Department of Professional Development and Continuing Education Faculty of Educational Studies Universiti

Analysis of variance, multivariate (MANOVA)

Chap The McGraw-Hill Companies, Inc. All rights reserved.

Designing Multilevel Models Using SPSS 11.5 Mixed Model. John Painter, Ph.D.

16.3 One-Way ANOVA: The Procedure

1 One-way Analysis of Variance

An inferential procedure to use sample data to understand a population Procedures

COVARIANCE ANALYSIS. Rajender Parsad and V.K. Gupta I.A.S.R.I., Library Avenue, New Delhi

Analysis of variance

Introduction to Business Statistics QM 220 Chapter 12

Three Factor Completely Randomized Design with One Continuous Factor: Using SPSS GLM UNIVARIATE R. C. Gardner Department of Psychology

df=degrees of freedom = n - 1

BIOMETRICS INFORMATION

Lecture 41 Sections Mon, Apr 7, 2008

Retrieve and Open the Data

Repeated-Measures ANOVA in SPSS Correct data formatting for a repeated-measures ANOVA in SPSS involves having a single line of data for each

4.8 Alternate Analysis as a Oneway ANOVA

One-Way ANOVA. Some examples of when ANOVA would be appropriate include:

1 Introduction to Minitab

Unit 27 One-Way Analysis of Variance

Using Tables and Graphing Calculators in Math 11

SAS Commands. General Plan. Output. Construct scatterplot / interaction plot. Run full model

MANOVA is an extension of the univariate ANOVA as it involves more than one Dependent Variable (DV). The following are assumptions for using MANOVA:

CE3502. Environmental Measurements, Monitoring & Data Analysis. ANOVA: Analysis of. T-tests: Excel options

EXST 7015 Fall 2014 Lab 11: Randomized Block Design and Nested Design

13: Additional ANOVA Topics

The legacy of Sir Ronald A. Fisher. Fisher s three fundamental principles: local control, replication, and randomization.

SRI RAMAKRISHNA INSTITUTE OF TECHNOLOGY DEPARTMENT OF SCIENCE & HUMANITIES STATISTICS & NUMERICAL METHODS TWO MARKS

EXPERIMENTS WITH MIXTURES

Sociology 6Z03 Review II

MBA 605, Business Analytics Donald D. Conant, Ph.D. Master of Business Administration

G E INTERACTION USING JMP: AN OVERVIEW

ANOVA Analysis of Variance

Randomized Complete Block Designs

Chapter 4. Latin Square Design

A discussion on multiple regression models

22s:152 Applied Linear Regression. Chapter 8: 1-Way Analysis of Variance (ANOVA) 2-Way Analysis of Variance (ANOVA)

BIOL Biometry LAB 6 - SINGLE FACTOR ANOVA and MULTIPLE COMPARISON PROCEDURES

Topic 13. Analysis of Covariance (ANCOVA) [ST&D chapter 17] 13.1 Introduction Review of regression concepts

ANCOVA. Lecture 9 Andrew Ainsworth

Factorial Independent Samples ANOVA

Taguchi Method and Robust Design: Tutorial and Guideline

TUTORIAL 8 SOLUTIONS #

The Chi-Square Distributions

AMS7: WEEK 7. CLASS 1. More on Hypothesis Testing Monday May 11th, 2015

Contrasts (in general)

Hypothesis Testing. Hypothesis: conjecture, proposition or statement based on published literature, data, or a theory that may or may not be true

13. The Cochran-Satterthwaite Approximation for Linear Combinations of Mean Squares

Cross-Over Design Experiment (Using SAS)

Advanced Experimental Design

psyc3010 lecture 2 factorial between-ps ANOVA I: omnibus tests

PLSC PRACTICE TEST ONE

Topic 21 Goodness of Fit

STAT 115:Experimental Designs

PLS205!! Lab 9!! March 6, Topic 13: Covariance Analysis

22s:152 Applied Linear Regression. 1-way ANOVA visual:

Practical Meta-Analysis -- Lipsey & Wilson

One-way ANOVA (Single-Factor CRD)

PLS205 KEY Winter Homework Topic 3. The following represents one way to program SAS for this question:

Department of Economics. Business Statistics. Chapter 12 Chi-square test of independence & Analysis of Variance ECON 509. Dr.

Solutions to Final STAT 421, Fall 2008

Statistical Analysis of Engineering Data The Bare Bones Edition. Precision, Bias, Accuracy, Measures of Precision, Propagation of Error

Lecture 6: Single-classification multivariate ANOVA (k-group( MANOVA)

Topic 20: Single Factor Analysis of Variance

McGill University. Faculty of Science MATH 204 PRINCIPLES OF STATISTICS II. Final Examination

1. (Rao example 11.15) A study measures oxygen demand (y) (on a log scale) and five explanatory variables (see below). Data are available as

Data Analysis and Statistical Methods Statistics 651

N J SS W /df W N - 1

Statistical methods for comparing multiple groups. Lecture 7: ANOVA. ANOVA: Definition. ANOVA: Concepts

Statistics Lab #6 Factorial ANOVA

Keppel, G. & Wickens, T. D. Design and Analysis Chapter 4: Analytical Comparisons Among Treatment Means

Logistic Regression. Interpretation of linear regression. Other types of outcomes. 0-1 response variable: Wound infection. Usual linear regression

Repeated Measures ANOVA Multivariate ANOVA and Their Relationship to Linear Mixed Models

Topic 12. The Split-plot Design and its Relatives (Part II) Repeated Measures [ST&D Ch. 16] 12.9 Repeated measures analysis

Simple, Marginal, and Interaction Effects in General Linear Models: Part 1

A Re-Introduction to General Linear Models (GLM)

Contrasts and Multiple Comparisons Supplement for Pages

ANALYSIS OF VARIANCE OF BALANCED DAIRY SCIENCE DATA USING SAS

Chapter 10. Design of Experiments and Analysis of Variance

Notes for Week 13 Analysis of Variance (ANOVA) continued WEEK 13 page 1

4.1 Hypothesis Testing

Chapter 8 Student Lecture Notes 8-1. Department of Economics. Business Statistics. Chapter 12 Chi-square test of independence & Analysis of Variance

Aquatic Toxicology Lab 10 Pimephales promelas Larval Survival and Growth Test Data Analysis 1. Complete test initiated last week 1.

Difference in two or more average scores in different groups

2 Hand-out 2. Dr. M. P. M. M. M c Loughlin Revised 2018

Split-Plot Designs. David M. Allen University of Kentucky. January 30, 2014

Application of Parametric Homogeneity of Variances Tests under Violation of Classical Assumption

SEVERAL μs AND MEDIANS: MORE ISSUES. Business Statistics

Transcription:

COMBINED ANALYSIS OF DATA Krishan Lal I.A.S.R.I., Library Avenue, New Delhi- 110 01 klkalra@iasri.res.in 1. Introduction In large-scale experimental programmes it is necessary to repeat the trial of a t of treatments like varieties or manures at a number of places or in a number of asons. The places where the trial is repeated are usually experimental stations located in the tract. The aim of repetition is to study the susceptibility of treatment effects to place variation. More generally, the aim of repetition is to find out treatments suitable for particular tracts in which ca the trials are carried out simultaneous on a reprentative lection of sites. Further, the purpo of the rearch carried out at experimental stations is to formulate the recommendations for the practitioners which consist of a population quite extensive either in space or time or both. Therefore, it becomes necessary to ensure that the results obtained from rearches are valid for at least veral places in the future and over a reasonably heterogeneous space. A single experiment will precily furnish information about only one place where the experiment is conducted and about the ason in which the experiment is conducted. It has, thus, become a common practice to repeat an experiment at different places or over a number of occasions to obtain valid recommendations taking into account place to place variation or variation over time or both. In such cas of repeated experiments appropriate statistical procedures for a combined analysis of data would have to be followed by the analysis of individual experiments varying with their objectives. In combined analysis of data, the main points of interest are i) to estimate the average respon to given treatments and ii) to test consistency of the respons from place to place or occasion to occasion i. e. interaction of the treatment effects with places or years. The utility and the significance of the estimates of average respon depend on whether the respon is consistence from place to place or changes with it, in other words it depends on the abnce or the prence of interaction. The results of a t of trials may, therefore, be considered as belonging to one of the following four types: i) the experimental errors are homogeneous and the interaction is abnt, ii) the experimental errors are homogeneous and the interaction is prent, iii) the experimental errors are heterogeneous and the interaction is abnt, and iv) the experimental errors are heterogeneous and the interaction is prent. The meaningfulness of average estimates of treatment respons would therefore, depend largely upon the abnce or prence of this interaction analysis. Combined analysis of data and analysis for groups of experiments are synonymous to each other becau in both the

situations we have to perform pooled analysis of the data obtained from the experiments conducted at different places or over times.. Analysis Procedure For combined analysis of data, we follow the steps given below: Step I. Construct an outline of combined analysis of variance over years or for places or environment, bad on the basic design ud. For example, the data of grain yield for four years, four treatments each treatment replicated five times is given in Table-1. Step II. Perform usual analysis of variance (ANOVA) for the given data. Here the experiment conducted is in randomized complete block design. So perform analysis of four years parately. This may be done in SAS, SPSS or EXCEL software. Step III. In general, let we have data of p years and the design ud is randomized block designs (RBD). So we have p error mean squares that corresponding to p RBD conducted. Before pooling the data, we test the homogeneity of error variances. For this we have the following two situations: Situation I. When p = In this situation, we apply F-test for testing the homogeneity of variances. Here null and alternate hypothesis are H 0 : σ 1 = σ and H 1 : σ 1 σ. Let Se 1 and Se are the mean square errors (m) for the two years. Then the value of F statistics will be Se 1 / Se and this value will be tested against the Table F value at n 1 and n degrees of freedom at 5 % level of significance, where n 1 and n are degrees of freedom (df) for error for the two years, respectively. If the calculated value of F is greater than tabulated F value then the null hypothesis of homogeneity of variance is rejected and the data is heterogeneous in different years, otherwi it is homogeneous. Situation II. When p > In this situation, we apply Bartlett's Chi-square test. Here null and alternate hypothesis are H 0 : σ = σ = L = against the alternative hypothesis 1 σp H 1 : at least two of the location. i σ ' s are not equal, where σ i is the error variance for i th year/ Let Se 1, Se,..., Se p are the m of p years respectively and n 1, n,, n p are the df for p years, respectively. Then the test statistics for testing homogeneity of variances is log log where ni ni ni i i χ, = p 1 = 1 1 1 ni 1+ ( ) 3( p 1) ni ni and if n i = n II-1

where χ p 1 χp 1 n[ p log log s = ( p + 1) 1+ 3np follows χ ei ]. distribution with p - 1 df. If the calculated value of χp 1 is greater than tabulated χ p 1 value at p-1df then the null hypothesis of homogeneity of variance is rejected and the data is heterogeneous in different years, otherwi it is homogeneous. Step IV. If error variances are not homogeneous means heterogeneous, then for performing the combined analysis of weighted least square is required, the weights being the reciprocals of the root mean square error. The weighted analysis is carried out by defining a new variable as newres = res/ square root mean square error. This transformation is similar to Aitken s transformation. This new variable is thus homogeneous and thus combined analysis of variance can performed on this new variable. If error variance variances are homogeneous then there is no need to transform the data. Step V. Now one can view the combined analysis as a nested design with veral factors nested within one another. The years/ locations are treated as big blocks, with the experiments nested within the. The combined analysis of data, therefore, can be done as that a nested design. For doing the analysis, the replication wi data of treatments at each year/ location provide uful information. An advantage of this analysis is that there is a further reduction in error sum of squares becau one more source of variability is taken out from the experimental error thus reducing the experimental error. This may also lead to the reduction in the value of coefficient of variation. Step VI. Next step in the analysis is to test for the significance of year treatment interaction. It can be en that the question whether the interaction year treatment is significant, that is whether the difference between treatments tend to vary from year to year can be ttled by comparing the mean square for year treatment with the estimate of error variance by the F-test. If the mean square is found to be non-significant it means interaction is abnt. If this interaction is assumed to be non-existence, sum of squares for treatments years and the error sum of squares can be pooled and a more preci estimate of error can be obtained for testing the significance of treatment differences. If, however, interaction is significant i. e. treatment effects are varying with years, then the appropriate mean square for testing the significance of treatments is the mean square due to year treatment. II-13

3. Illustration Now we shall proceed for the combined analysis of a t of data. Table-1: Data for grain yield (kg/ plot) with four treatments in five replications Year Replication Treatment I II III IV V 1 1 33.6 33.7 30.9 33.3 15.0 34.0 7. 46. 36.7 11.6 3 30.5 33. 15.1 33.3 9.7 4 30.8 14.4 14. 9.5 1.0 1 8.8 8.8 35. 41.6 43. 46.4 43. 38.4 54.4 57.6 3 35. 3.0 3.0 5.6 33.6 4 51. 40.0 49.6 51. 49.6 3 1 30.1 38.1 1.4 17.6 14.3 36.1 18.3 38.0 31.0 6.6 3 7. 40.7 15.5 18.1 1.3 4 37.8 54.5 13. 18.1 7.3 4 1 3.8 48.8 19.5 8.8 34.4 15. 39.0 39.8 5.0 31. 3 40. 5.0 33.0 41. 35.0 4 43. 46.8 34.5 44.5 38.0 A) By using SPSS First we analyze the data for each year parately by using SPSS. Make four files in SPSS Data Editor, the first column of which is replication, cond column as treatment, third column as yield. Now the SPSS commands are o Analyze General Linear model Univariate yield button [Put yield under Dependent variable] rep button [Put rep under Fixed Factor(s)] treat button [Put treat under Fixed Factor(s)] Model Custom rep Build term[s] treat Build term[s] Main effects Continue OK Similarly, we run the data of all the four years. We will get four ANOVA tables. The different ANOVA tables give the significance of treatments for different years parately. Now we have four mean square errors with their respective degrees of freedom. The output screen of one of the t is as below: II-14

Thus the error mean squares of four years along with df are year Degrees of freedom Error mean square 1 1 78.34 1 8.309 3 1 108.466 4 1 67.903 Now we test the homogeneity of error variances using analyze Bartlett's Chi-square test as described in Step III. In our ca value of χ is non-significant. So we can perform the combined analysis without transformation of the data. For combined analysis of data using SPSS we proceed as follows Make a combined file in SPSS Data Editor with some name say gpxpt-comb; the first column of which is year, cond column as replication, third column as replication and fourth column as yield. The SPSS commands are UNIANOVA yield BY year rep treat /METHOD = SSTYPE(3) /INTERCEPT = INCLUDE /CRITERIA = ALPHA(.05) /DESIGN = year rep(year) treat year*treat /test treat vs year*treat. We will get the output screen as II-15

In the above analysis year treatment interaction is significant. Therefore, treatment is tested against the year treatment interaction. B) By using SAS Now we give the steps ud while analyzing the data using SAS software. First we analyze the data for four years parately using proc glm. data klkrbd1; input yr rep trt yld; cards; ; class rep trt; model yld = rep trt/ss3; We test the homogeneity of error variances using analyze Bartlett's Chi-square test as described in Step III. Since value of χ is non-significant. So we can perform the combined analysis. For combined analysis of data using SAS we proceed as follows data klkgps1; input yr rep trt yld; cards; ; II-16

model yld = yr rep(yr) trt yr*trt/ss3; test h=trt e=trt*yr; model yld = yr rep(yr) trt/ss3; Further, the different sites or years are natural environments. The natural environments are generally considered as random. All other effects in the model involve the environment either as nested or as crosd classification are considered as random. The assumption of the random effects helps in identifying the proper error terms for testing the significance of various effects. The combined analysis of data can easily be carried out using PROC GLM of SAS with Random statement with TEST option. The RANDOM statement produces a table of expected mean squares which can be ud to determine appropriate denominators of F- statistics for all terms in the MODEL statement. The tests are produced by the TEST option at the end of the RANDOM statement. The steps to be followed are given below: data klkgps1; input yr rep trt yld; cards; ; model yld = yr rep(yr) trt yr*trt/ss3; random yr yr*trt rep(yr) /test; model yld = yr rep(yr) trt/ss3; random yr rep(yr) /test; References Gomez, K. W. and Gomez A. A. (1984). Statistical procedures fro agricultural rearch. John Wiley & Sons. Pan V. and Sukhatme, P. V. (1976) Statistical methods for agricultural workers. ICAR, New Delhi. Nigam, A. K. and Gupta, V. K. (1979). Handbook on analysis of agricultural Experiments. IASRI, New Delhi II-17