G E INTERACTION USING JMP: AN OVERVIEW

Similar documents
Lecture 08: Standard methods. Bruce Walsh lecture notes Tucson Winter Institute 9-11 Jan 2013

MULTIVARIATE ANALYSIS OF VARIANCE

SOME ASPECTS OF STABILITY OF CROP VARIETIES

PRINCIPAL COMPONENTS ANALYSIS (PCA)

How to Run the Analysis: To run a principal components factor analysis, from the menus choose: Analyze Dimension Reduction Factor...

Package stability. R topics documented: August 23, Type Package

Genotype by Environment Interaction of Soybean (Glycine Max L. Merrill) For Yield and Yield Components in the Savanna

Design & Analysis of Experiments 7E 2009 Montgomery

From Practical Data Analysis with JMP, Second Edition. Full book available for purchase here. About This Book... xiii About The Author...

SUPPLEMENTARY INFORMATION

CHAPTER 10. Regression and Correlation

Current advances in GxE analysis models and their interpretations, an implication for genotype selection for multiple environments.

Principal Component Analysis, A Powerful Scoring Technique

COMPLEX PRINCIPAL COMPONENT SPECTRA EXTRACTION

Construction of designs for twophase experiments with Tabu search

Factorial designs. Experiments

COVARIANCE ANALYSIS. Rajender Parsad and V.K. Gupta I.A.S.R.I., Library Avenue, New Delhi

CS 147: Computer Systems Performance Analysis

Experimental Design and Data Analysis for Biologists

progeny. Observe the phenotypes of the F1 progeny flies resulting from this reciprocal cross.

Selection on Correlated Characters (notes only)

INTRODUCTION TO ANIMAL BREEDING. Lecture Nr 3. The genetic evaluation (for a single trait) The Estimated Breeding Values (EBV) The accuracy of EBVs

Methods for Cryptic Structure. Methods for Cryptic Structure

Multivariate T-Squared Control Chart

Multivariate analysis of genetic data: an introduction

Principal Component Analysis. Applied Multivariate Statistics Spring 2012

Lecture 2: Linear and Mixed Models

Non-Inferiority Tests for the Ratio of Two Proportions in a Cluster- Randomized Design

Principal Component Analysis-I Geog 210C Introduction to Spatial Data Analysis. Chris Funk. Lecture 17

Application of GGE biplot analysis to evaluate Genotype (G), Environment (E) and GxE interaction on P. radiata: a case study

EXST 7015 Fall 2014 Lab 11: Randomized Block Design and Nested Design

Didacticiel - Études de cas

Principle Components Analysis (PCA) Relationship Between a Linear Combination of Variables and Axes Rotation for PCA

LAB 5 INSTRUCTIONS LINEAR REGRESSION AND CORRELATION

Pharmaceutical Experimental Design and Interpretation

Lecture 32: Infinite-dimensional/Functionvalued. Functions and Random Regressions. Bruce Walsh lecture notes Synbreed course version 11 July 2013

Lecture 15 Topic 11: Unbalanced Designs (missing data)

Principal Components. Summary. Sample StatFolio: pca.sgp

Handbook of Formulas and Software for Plant Geneticists and Breeders

Quantitative Genomics and Genetics BTRY 4830/6830; PBSB

Lecture 11: Multiple trait models for QTL analysis

Multiphysics Modeling

THE PRINCIPLES AND PRACTICE OF STATISTICS IN BIOLOGICAL RESEARCH. Robert R. SOKAL and F. James ROHLF. State University of New York at Stony Brook

Hypothesis Testing for Var-Cov Components

MIXED MODELS THE GENERAL MIXED MODEL

Variance Components: Phenotypic, Environmental and Genetic

1 Springer. Nan M. Laird Christoph Lange. The Fundamentals of Modern Statistical Genetics

Lecture 2. Fisher s Variance Decomposition

Me n d e l s P e a s Exer c i se 1 - Par t 1

PLS205 Lab 2 January 15, Laboratory Topic 3

Chap The McGraw-Hill Companies, Inc. All rights reserved.

DESAIN EKSPERIMEN Analysis of Variances (ANOVA) Semester Genap 2017/2018 Jurusan Teknik Industri Universitas Brawijaya

Chapter 4: Randomized Blocks and Latin Squares

The concept of breeding value. Gene251/351 Lecture 5

Introduction to multivariate analysis Outline

TASK 6.3 Modelling and data analysis support

One-way ANOVA (Single-Factor CRD)

Eigenvalues, Eigenvectors, and an Intro to PCA

Eigenvalues, Eigenvectors, and an Intro to PCA

Example Resistive Heating

Principal Components Analysis. Sargur Srihari University at Buffalo

Ratio of Polynomials Fit One Variable

Topic 13. Analysis of Covariance (ANCOVA) [ST&D chapter 17] 13.1 Introduction Review of regression concepts

Unconstrained Ordination

Tutorial. Getting started. Sample to Insight. March 31, 2016

Multivariate Data Analysis a survey of data reduction and data association techniques: Principal Components Analysis

Quantitative Understanding in Biology Principal Components Analysis

Comparing whole genomes

Quantitative characters - exercises

Mixed-Model Estimation of genetic variances. Bruce Walsh lecture notes Uppsala EQG 2012 course version 28 Jan 2012

LAB 3 INSTRUCTIONS SIMPLE LINEAR REGRESSION

DOE Wizard Screening Designs

Design of Engineering Experiments Part 5 The 2 k Factorial Design

Applying MapCalc Map Analysis Software

... x. Variance NORMAL DISTRIBUTIONS OF PHENOTYPES. Mice. Fruit Flies CHARACTERIZING A NORMAL DISTRIBUTION MEAN VARIANCE

Genetics and Genetic Prediction in Plant Breeding

Latent random variables

M E R C E R W I N WA L K T H R O U G H

Basics of Multivariate Modelling and Data Analysis

VARIANCE COMPONENT ESTIMATION & BEST LINEAR UNBIASED PREDICTION (BLUP)

Lecture 2: Introduction to Quantitative Genetics

Partitioning the Genetic Variance

Statistics Toolbox 6. Apply statistical algorithms and probability models

Passing-Bablok Regression for Method Comparison

Principal component analysis

SPSS LAB FILE 1

1 Introduction to Minitab

Lecture 13 Family Selection. Bruce Walsh lecture notes Synbreed course version 4 July 2013

Multidimensional heritability analysis of neuroanatomical shape. Jingwei Li

Data Preprocessing Tasks

Lecture 2: Diversity, Distances, adonis. Lecture 2: Diversity, Distances, adonis. Alpha- Diversity. Alpha diversity definition(s)

Biplot Analysis of Genotype Environment Interaction: Proceed with Caution

Lecture 3. Introduction on Quantitative Genetics: I. Fisher s Variance Decomposition

Dr. Junchao Xia Center of Biophysics and Computational Biology. Fall /8/2016 1/38

A mixed model based QTL / AM analysis of interactions (G by G, G by E, G by treatment) for plant breeding

Tutorial 23 Back Analysis of Material Properties

PRINCIPAL COMPONENTS ANALYSIS

Principal Component Analysis Utilizing R and SAS Software s

M M Cross-Over Designs

I Have the Power in QTL linkage: single and multilocus analysis

Transcription:

G E INTERACTION USING JMP: AN OVERVIEW Sukanta Dash I.A.S.R.I., Library Avenue, New Delhi-110012 sukanta@iasri.res.in 1. Introduction Genotype Environment interaction (G E) is a common phenomenon in agricultural research. Differences between genotypic values may increase or decrease from one environment to another which might cause genotypes to even rank differently between environments. The G E studies are somewhat complicated as they require integrated approaches which combine many fields including agriculture, biology, statistics, computer, and genetics. A basic principle indicated by the G E interaction is that even if all animals or plants were created equal (same genotypes), they will not necessarily express their genetic potential in the same way when environmental conditions (drought, temperature, disease pressure, stress, etc.) varies. This important concept may require genetic engineering of plants or animals specifically tailored to their environmental conditions. Statistically, G E interactions occur if the performance of genotypes varies significantly across environments. Assuming 2 genotypes (G1 and G2) tested in 2 environments (E1 and E2). Figure1 indicates the presence of GXE interaction since G1 is phenotypically superior to G2 in Environment 1 (E1) but inferior to G2 in E2. The phenotypic difference between G1 and G2 remains the same in the two environments representing no interaction between the genotype and the environment in Figure 2. G1 G2 G1 G2 E1 Environment E2 E1 Environment E2 (Figure 1) (Figure 2) 2. Model Expressing phenotypic value (P) as a function of the genotype (G) and the environment (E), the equation, P= G + E indicates the situation when environmental factors influence each genotype equally (Fig. 2). However, when environment influences some genotypes more than others (Fig. 1), the phenotypic relationship changes to P=G + E + I GE and the expression includes the G E interaction term I GE. The variance (V) of the effects follows V(P) = V(G) + V(E) + 2 Cov(GE) showing that variance components analyses could be used to partition the phenotypic variance into its genotypic, environmental, and their interaction components. One way to determine the

importance of V(G) or V(E) is to experimentally minimize one of the two effects (minimizing V(G) by using identical genotypes, or minimizing V(E) by using controlled environment chambers and random allocation of genotypes to environmental conditions). Genotypeenvironment covariance (Cov) occurs when better genotypes are provided better environments. For a simple analysis of variance of a randomized complete block design the model: Y G E GE B ijk i j ij jk The above can be applied where µ is the mean, G i is the effect of the i th genotype, E j is the effect of the j th environment, GE ij is the interaction of the i th genotype with the j th environment, B jk is the effect of the k th replication in the j th environment, and ε ijk is the random error. 3. Methods A variety of statistical methods have been proposed to analyze G E interaction data. These methods include Analysis of Variance (e.g., Least Squares, Restricted Maximum Likelihood=REML), Regression (e.g., Joint Regression Analysis, Partial Least-Squares Regression, Factorial Regression), Shifted Multiplicative Model (SHMM), Variance Components, Cluster Analysis, Factor Analysis, and Additive Main Effects and Multiplicative Interaction effects (AMMI model). The G E data can often be arranged in a two-way layout designating genotypes in rows and environments in columns. To apply AMMI model, the conventional analysis of variance for the additive main effects (µ + G i + E j ) is combined with the principal component analysis for the multiplicative interaction (non-additive residual) effects to analyze the matrix of two-way means. Example 3.1: Experimenter is interested in analyzing the performance of different genotypes in a multi-environment trial, based on 10 genotypes of 15 environments each with 4 replications displaying stability measures, genotype and G E least square means from linear-bilinear models, PCA biplots, and heritability. This data is available in sample data sets of JMP. Solution: The G E interaction module used for analyzing the performance of different genotypes in a multi-environment trial. Also it can perform many more analysis like stability measures, genotype and G E least square means from linear-bilinear models, PCA biplots, and heritability. Select Genomics > Genetics > Breeding Analysis > G E interaction to open the dialog illustrated. The dialog is made up of 2 tabs: General, Option as shown in Figure 3. ijk Step 1: Figure 3

Click on genetics example as the data based study. Click Choose to select the input data set. An Open Data window opens. Navigate into the data file peanut.sas7bdat. Click Open for see the dataset for which the path is given. Note: The peanut.sas7bdat file has been selected. If you click Open, you will see the data set. Examine the variables listed in the Available Variables field. Select the yield variable as quantitative trait variable. Select Genotype, Environment and rep as Genotype variable, environment variables and replicate variable. To specify an output folder, complete the following steps: Click Choose. Navigate to the folder where you wish to place the output or create a new folder. Click OK. The completed General tab appears as shown in Figure 3. Step 2: Click on Option tab Click on linear-bilinear and choose the method (AMMI). Write the alpha value for LS means confidence interval ranges from 0 to 1. The completed Option tab appears as shown in Figure 4. Figure 4 Click Run. Results

Running this process using the Peanut Example sample setting generates the tabbed Results window as shown below. Refer to the G E Interaction process description for more information. Output from the process is organized into tabs. Each tab contains one or more plots, data panels, data filters, and so on, that facilitate your analysis. Figure 5 The following tabs are generated by this process as shown in Figure 5. Stability: This tab shows a plot of various stability measures for each of the Genotypes. G LSMeans: This tab displays estimates of LS means for the trait at each of the genotypes with confidence limits. G E LSMeans: This tab displays estimates of LS means for the trait for the Genotype Environment interaction effect, with a line for each environment colored distinctly 2D Biplots: This tab displays 2-D plots of PC2 by PC1, PC3 by PC1, and PC3 by PC2, where PC1-PC3 are the first three principal components from the linear-bilinear model (AMMI, for example). In these plots and the ones on the next two tabs, genotypes are represented by blue triangles and environments are represented by red circles. 3D Biplot: This tab shows a 3-D plot of PC1, PC2, and PC3, the first 3 principal components from the linear-bilinear model. PC1 Mean Trait: This tab displays a biplot of PC1, the first principal component from the linear-bilinear model, by the mean of the Trait Variable.

Scree Plot (G E Interaction): This tab displays the scree plot for the principal components from the linear-bilinear model, plotting the eigenvalue by principal component in red, and the cumulative proportion explained by the PCs in blue. SAS Output: When there is only one Quantitative Trait selected, this tab shows the heritability and the ANOVA results from fitting the G E model as shown in Figure 6. Figure 6 Output Data This process generates the following data set(s) as shown in Figure 7 and 8. ANOVA Data Set: This data set contains the ANOVA statistics from fitting the G E model PCA Data Set: This data set contains the principal component eigenvectors from the linearbilinear model. ANOVA for Linear-Bilinear Model Data Set: This data set contains the ANOVA from fitting the linear-bilinear model that includes principal components to represent the G E

component. Figure 7 Figure 8 Click View Data to reveal the underlying data table associated with the current tab Click Reopen Dialog to reopen the completed process dialog used to generate this output Click Create Report to generate a pdf- or rtf-formatted report containing the plots and charts of selected tabs Click Close all to close all graphics windows and underlying data sets associated with the output.