Community surveys through space and time: testing the space-time interaction in the absence of replication

Size: px
Start display at page:

Download "Community surveys through space and time: testing the space-time interaction in the absence of replication"

Transcription

1 Community surveys through space and time: testing the space-time interaction in the absence of replication Pierre Legendre, Miquel De Cáceres & Daniel Borcard Département de sciences biologiques, Université de Montréal College of the Environment and Ecology, Xiamen University, Xiamen, Fujian, China, May 7, 206

2 . The general problem Ecologists sample portions of the environment repeatedly across time for ecological monitoring, or to test hypotheses about changes in the environment induced by man, including climatic change. Problem: space-time ecological studies are usually done without replication to maximize the study extent and minimize effort. Example: 0 Time coordinates (T) Spatial coordinates along transect (S)

3 . The general problem This talk will describe a method for testing a space time interaction in repeated ecological survey data, when there is no replication at the level of individual sampling units (sites). This methodological development is important for the analysis of long-term monitoring data, including systems under anthropogenic influence. In these systems, an interaction may indicate - that the spatial structure of the community composition has changed in the course of time - or that the temporal evolution is not the same at all sites.

4 . The general problem This talk will describe a method for testing a space time interaction in repeated ecological survey data, when there is no replication at the level of individual sampling units (sites). This methodological development is important for the analysis of long-term monitoring data, including systems under anthropogenic influence. In these systems, an interaction may indicate - that the spatial structure of the community composition has changed in the course of time - or that the temporal evolution is not the same at all sites. => Classical statistics tells us that one cannot test an interaction without replication.

5 2. Two-way anova for space (S) and time (T) crossed factors by canonical analysis (RDA)

6 Canonical redundancy analysis (RDA) The most common application of RDA in ecology is to test the relationship between a response Y and explanatory variables X: Simple RDA Response table Explanatory table Y Community composition data X Example: Environmental variables Partial RDA Response table Explanatory table Covariables Y Community composition data X Example: Environmental variables W Example: Spatial base functions

7 RDA as multivariate Anova RDA can be used to test the relationship between a response data matrix Y containing, for example, community composition data, and one or several experimental factors (crossed balanced designs) in matrix X. It is then a form of multivariate anova. Legendre, P. and M. J. Anderson Distance-based redundancy analysis: testing multispecies responses in multifactorial ecological experiments. Ecological Monographs 69: -24.

8 The anova factors can be coded in two different ways: () Coding for a single factor A with 3 levels: binary dummy variables Matrix X Factor A (3 levels) Factor A (3 levels) y = y y 2 y 3 y 4 y 2 y 22 y 23 y 24 y 3 y 32 y 33 y

9 The anova factors can be coded in two different ways: (2) Helmert contrasts Factor with 2 levels: variable Factor with 3 levels: 2 variables Factor with 4 levels: 3 variables Factor with 5 levels: 4 variables

10 The anova factors can be coded in two different ways: () Coding for a single (2) Coding for two crossed factors factor A with 3 levels: and their interaction: dummy variables Helmert contrasts A y 0 y 2 0 y 3 0 y 4 0 y 2 0 y = y 22 y 23 X = 0 0 y 24 0 y y y y

11 Tests of significance Test of a single factor Response table Y Community composition data Explanatory table X Factor to be tested Two crossed factors. Example: test of the interaction Response table Explanatory table Covariables Y Community composition data X Interaction to be tested W Main factors and other interactions

12 Analyze Y against space and time without replication First method: write matrices coding for space and time using dummy variables. Example:

13 Y = Species presence-absence or abundance (columns) X = Sampling times W = Covariables: dummy variables coding for sites Sites Sites s s Time Time etc etc. Sites s Time etc. Sites s Time etc. Sites s Time etc.

14 Why can t we test the space-time interaction?

15 We would still like to test the space-time interaction because a significant interaction would indicate that the temporal structures differ from site to site, or that the spatial structures differ from time to time. When the interaction is significant, we have to carry out separate analyses of the temporal variance for the different points in space, or separate analyses of the spatial variance for the different times. The absence of a significant interaction would indicate either that the differences among times can be modelled in the same way at all points in space, and conversely; or that there were not enough data to obtain a significant result for the test of the interaction (n too small, lack of power; type II error).

16 3. How can we test the space-time interaction in analyses of Y against space (S) and time (T) without replication?

17 Analyze Y against space and time without replication Using dummy variables to code for space and time, we did not have enough degrees of freedom, in the no-replication case, to test the S-T interaction.

18 Analyze Y against space and time without replication Using dummy variables to code for space and time, we did not have enough degrees of freedom, in the no-replication case, to test the S-T interaction. We can solve that problem by using a more parsimonious way of coding for space and time. We will use Moran s eigenvector maps (MEM). The type of MEM that we will use in this method were formerly called Principal Coordinates of Neighbour Matrices (PCNM),2. Dray, S., P. Legendre and P. R. Peres-Neto Spatial modelling: a comprehensive framework for principal coordinate analysis of neighbour matrices (PCNM). Ecological Modelling 96: Borcard, D. and P. Legendre All-scale spatial analysis of ecological data by means of principal coordinates of neighbour matrices. Ecological Modelling 53: 5-68.

19 MEM eigenfunctions represent a spectral decomposition of the spatial (or temporal) relationships among sites (or times). Example dbmem along a transect: what do they look like? dbmem dbmem dbmem dbmem dbmem dbmem

20 dbmem dbmem 2 Example of a regular grid with 8 2 = 96 points. Maps showing ten of the 48 dbmem eigenfunctions that display positive spatial correlation. Shades of grey: values in each eigenvector, from white (largest negative value) to black (largest positive value). dbmem 3 dbmem 4 dbmem 5 dbmem 6 dbmem 0 dbmem 20 dbmem 30 dbmem 40

21 MEM eigenfunctions represent a spectral decomposition of the spatial (or temporal) relationships among sampling sites (or times). They are orthogonal to one another, and fewer in number than dummy variables coding for the same sites (or times). Borcard, D. and P. Legendre All-scale spatial analysis of ecological data by means of principal coordinates of neighbour matrices. Ecological Modelling 53: Borcard, D., P. Legendre, C. Avois-Jacquet and H. Tuomisto Dissecting the spatial structure of ecological data at multiple scales. Ecology 85:

22 MEM eigenfunctions represent a spectral decomposition of the spatial (or temporal) relationships among sampling sites (or times). They are orthogonal to one another, and fewer in number than dummy variables coding for the same sites (or times). To model the Space and Time variation, we will use s/2 or t/2 MEM functions. These MEM are those that are modelling positive spatial (or temporal) correlation. We leave out the MEM modelling negative spatial (or temporal) correlation. For example, 0 equispaced sampling times are modelled by the following 5 MEM functions: Borcard, D. and P. Legendre All-scale spatial analysis of ecological data by means of principal coordinates of neighbour matrices. Ecological Modelling 53: Borcard, D., P. Legendre, C. Avois-Jacquet and H. Tuomisto Dissecting the spatial structure of ecological data at multiple scales. Ecology 85:

23

24 Sampling each point in space (S) during each sampling campaign (T) creates an orthogonal design. For that reason, the MEM eigenfunctions, which are orthogonal within each set (S, T), are also orthogonal between sets. The S-T interaction can be modelled by creating variables that are the products of each S-MEM by each T-MEM. The S-T interaction variables are orthogonal to the S-MEMs and the T-MEMs.

25 Sampling each point in space (S) during each sampling campaign (T) creates an orthogonal design. For that reason, the MEM eigenfunctions, which are orthogonal within each set (S, T), are also orthogonal between sets. The S-T interaction can be modelled by creating variables that are the products of each S-MEM by each T-MEM. The S-T interaction variables are orthogonal to the S-MEMs and the T-MEMs. Orthogonality of factors = happiness for statisticians Each portion of the model (main factor, interaction) accounts for a linearly independent fraction of variation, which is easier to interpret.

26 Testing the interaction in cross-designs without replication While it takes (s ) dummy variables to represent s sites, fewer MEM variables are necessary to analyze the spatial variation; likewise for time.

27 Simulated univariate data. Data with an S-T interaction Transect of s = 25 points, t = 5 sampling campaigns. Spatially autocorrelated data were generated in a 00 x 00 pixel field using the program SimSSD. A transect of 25 equidistant points (spacing = 4 units) was sampled in the middle of the field. A transect variable was the sum of two simulated vectors, with spatial ranges of 0 and 30 units respectively. 5 independent time realizations of the transect were created. Legendre, Dale, Fortin, Gurevitch, Hohn and Myers Ecography 25: Legendre, Dale, Fortin, Casgrain and Gurevitch Ecology 85: Legendre, Borcard and Peres-Neto Ecological Monographs 75:

28 5 Time Time Time Time Time Coordinates along the transect There is an S-T interaction because the 5 time realizations were created independently of one another.

29 3 S-MEM functions were created to model the spatial variation along the 25 points of the transect. 3 T-MEM functions were created to model the temporal variation across the 5 sampling times. To model the interaction, 39 ST functions were obtained by multiplying each S-MEM by each T-MEM. Canonical RDA was used to test the interaction in the presence of the main factors S and T. Results The S-T interaction was significant: p = (after 9999 perm.)

30 3 S-MEM functions were created to model the spatial variation along the 25 points of the transect. 3 T-MEM functions were created to model the temporal variation across the 5 sampling times. To model the interaction, 39 ST functions were obtained by multiplying each S-MEM by each T-MEM. Canonical RDA was used to test the interaction in the presence of the main factors S and T. Results The S-T interaction was significant: p = (after 9999 perm.) ü Correct answer The spatial structures differed from time to time. One would have to test the spatial structure of each sampling time separately.

31 Simulated univariate data 2. Data without an S-T interaction Transect of s = 25 points, t = 5 sampling campaigns. We took one of the transects (Time 5) from the previous data set and created 5 new time replicates by adding N(0, 0.3) error to all data values.

32 5 Time Time Time Time Time Coordinates along the transect There is no S-T interaction because the 5 time realizations were all constructed from the same, common spatial structure.

33 Results The S-T interaction was not significant: p =.0000 (9999 perm.) The main factor Time was not significant: p = (9999 perm.) The main factor Space was significant: p = (9999 perm.) ü Correct answer

34 A paper was published in 200 describing the method, reporting extensive simulation results (Appendices A and B), as well as two examples involving real ecological survey data: Legendre, P., M. De Cáceres and D. Borcard Community surveys through space and time: testing the space-time interaction in the absence of replication. Ecology 9:

35 ANOVA models Model Space Time Interaction SS(X s ) SS(X t ) SS(X Int ) Residuals SS Res Model 2 SS(X s ) SS(X t ) SS Res Model 3 SS(X u ) SS(X t ) SS(X Int3 ) SS Res Model 4 SS(X u ) SS(X v ) SS(X Int4 ) SS Res Model 5 SS(X s ) SS(X t ) SS(X Int4 ) SS Res Sum of squares partitioning in five Anova models for space-time analysis (Legendre et al. 200, Fig. ).

36 Comparison of Anova models for testing the space-time interaction Type I error of Model 5 was always correct for univariate response data and asymptotically correct for multivariate data, while type I error rates of Models 3 and 4 were too low. Hence the power of Model 5 was always equal to or higher than those of Models 3 and 4. Recommendations. Perform first a test of the S-T interaction using Anova Model Then proceed as follows: If the hypothesis of no interaction is not rejected at a high significance level (subject to the possibility of type II error), i.e. the presence of an interaction is not demonstrated: => Use Model 2 to test for space and time effects without replication. If the hypothesis of no interaction is rejected: => Model the spatial structure of each time period in a separate way; model the temporal structure of each sampling site separately.

37 We can test the significance of the temporal structures that were independently generated at the 30 sites along the transect, as follows: Times Times t t Y Simulated response variable Site Site 2 T-PCNM X Model of independent temporal structures at the various sites T-PCNM Times t Site 3 0 T-PCNM Times t Site s T-PCNM Result: p = (block permutations in Canoco, 9999 perm.) The independent temporal structures, at the 30 sites, are significant.

38 Likewise, we can test the significance of the spatial structures that were independently generated at the 0 sampling times, as follows: Sites Sites Sites s s s Y Simulated response variable Time Time 2 Time 3 S-PCNM X Model of independent spatial structures at the various times S-PCNM S-PCNM Sites s Time t S-PCNM Result: p = (block permutations in Canoco, 9999 perm.) The independent spatial structures, at the 0 times, are significant.

39 Example Trichoptera captured in 22 emergence traps sampled during summer 984 in outflow stream of lac Cromwell, Québec Google Maps

40 22 emergence traps visited daily during 00 days Data: 56 Trichoptera species abundances The insect counts were pooled in 0-day periods (many zeros) Data transformation: y' = log(y + ) Analysis: analysis of space-time interaction by RDA => The interaction was highly significant (R 2 = 0.205, p = after 9999 permutations) Analysis: K-means partioning to visualize the interaction => 5 groups of observations.

41 0-day time periods Current direction Traps

42 Example 2 Barro Colorado Island permanent forest plot in Panama. Fully censused tropical forest plot, 50 ha. We used the stem-based plot data covering four censuses (982-83, 985, 990, and 995) and counted the trees with cm diameter at breast height (dbh) or more, by species, in grid cells of m. 250 grid cells 35 tree species A map of the Center for Tropical Forest Science (CTFS) forest plots, and details about each plot, are available on the Web page

43

44 Example 2 Barro Colorado Island permanent forest plot in Panama. Fully censused tropical forest plot, 50 ha. We used the stem-based plot data covering four censuses (982-83, 985, 990, and 995) and counted the trees with 0 mm diameter at breast height (dbh) or more, by species, in grid cells of m. 250 grid cells 35 tree species Data transformation: y' = log(y + ). Test of S-T interaction on the multivariate data (35 species): significant result (p = 0.00 after 999 permutations). 2. Test of S-T interaction for each species separately: about 43% of the tree species had significantly changed their spatial structures across the four censuses. A map of the Center for Tropical Forest Science (CTFS) forest plots, and details about each plot, are available on the Web page

45 a) Poulsenia armata 0 Meters b) Beilschmiedia pendula Fig. 4a Changes in numbers of individuals for two species associated with slopes that significantly changed their distributions between the and 995 censuses. Light gray squares: loss; dark gray squares: gain of individuals. 500

46 b) Beilschmiedia pendula 0 Meters Fig. 4b Changes in numbers of individuals for two species associated with slopes that significantly changed their distributions between the and 995 censuses. Light gray squares: loss; dark gray squares: gain of individuals.

47 An R package, called STI, is available on Ecological Archives E09-09-S of the Ecological Society of America ( year 200), and on the page

48 Avez-vous des questions? Do you have questions?

Community surveys through space and time: testing the space-time interaction in the absence of replication

Community surveys through space and time: testing the space-time interaction in the absence of replication Community surveys through space and time: testing the space-time interaction in the absence of replication Pierre Legendre Département de sciences biologiques Université de Montréal http://www.numericalecology.com/

More information

Community surveys through space and time: testing the space time interaction

Community surveys through space and time: testing the space time interaction Suivi spatio-temporel des écosystèmes : tester l'interaction espace-temps pour identifier les impacts sur les communautés Community surveys through space and time: testing the space time interaction Pierre

More information

Analysis of Multivariate Ecological Data

Analysis of Multivariate Ecological Data Analysis of Multivariate Ecological Data School on Recent Advances in Analysis of Multivariate Ecological Data 24-28 October 2016 Prof. Pierre Legendre Dr. Daniel Borcard Département de sciences biologiques

More information

Temporal eigenfunction methods for multiscale analysis of community composition and other multivariate data

Temporal eigenfunction methods for multiscale analysis of community composition and other multivariate data Temporal eigenfunction methods for multiscale analysis of community composition and other multivariate data Pierre Legendre Département de sciences biologiques Université de Montréal Pierre.Legendre@umontreal.ca

More information

Spatial eigenfunction modelling: recent developments

Spatial eigenfunction modelling: recent developments Spatial eigenfunction modelling: recent developments Pierre Legendre Département de sciences biologiques Université de Montréal http://www.numericalecology.com/ Pierre Legendre 2018 Outline of the presentation

More information

Figure 43 - The three components of spatial variation

Figure 43 - The three components of spatial variation Université Laval Analyse multivariable - mars-avril 2008 1 6.3 Modeling spatial structures 6.3.1 Introduction: the 3 components of spatial structure For a good understanding of the nature of spatial variation,

More information

Partial regression and variation partitioning

Partial regression and variation partitioning Partial regression and variation partitioning Pierre Legendre Département de sciences biologiques Université de Montréal http://www.numericalecology.com/ Pierre Legendre 2017 Outline of the presentation

More information

Supplementary Material

Supplementary Material Supplementary Material The impact of logging and forest conversion to oil palm on soil bacterial communities in Borneo Larisa Lee-Cruz 1, David P. Edwards 2,3, Binu Tripathi 1, Jonathan M. Adams 1* 1 Department

More information

Should the Mantel test be used in spatial analysis?

Should the Mantel test be used in spatial analysis? 1 1 Accepted for publication in Methods in Ecology and Evolution Should the Mantel test be used in spatial analysis? 3 Pierre Legendre 1 *, Marie-Josée Fortin and Daniel Borcard 1 4 5 1 Département de

More information

Appendix A : rational of the spatial Principal Component Analysis

Appendix A : rational of the spatial Principal Component Analysis Appendix A : rational of the spatial Principal Component Analysis In this appendix, the following notations are used : X is the n-by-p table of centred allelic frequencies, where rows are observations

More information

Comparison of two samples

Comparison of two samples Comparison of two samples Pierre Legendre, Université de Montréal August 009 - Introduction This lecture will describe how to compare two groups of observations (samples) to determine if they may possibly

More information

What are the important spatial scales in an ecosystem?

What are the important spatial scales in an ecosystem? What are the important spatial scales in an ecosystem? Pierre Legendre Département de sciences biologiques Université de Montréal Pierre.Legendre@umontreal.ca http://www.bio.umontreal.ca/legendre/ Seminar,

More information

Should the Mantel test be used in spatial analysis?

Should the Mantel test be used in spatial analysis? Methods in Ecology and Evolution 2015, 6, 1239 1247 doi: 10.1111/2041-210X.12425 Should the Mantel test be used in spatial analysis? Pierre Legendre 1 *, Marie-Josee Fortin 2 and Daniel Borcard 1 1 Departement

More information

Estimating and controlling for spatial structure in the study of ecological communitiesgeb_

Estimating and controlling for spatial structure in the study of ecological communitiesgeb_ Global Ecology and Biogeography, (Global Ecol. Biogeogr.) (2010) 19, 174 184 MACROECOLOGICAL METHODS Estimating and controlling for spatial structure in the study of ecological communitiesgeb_506 174..184

More information

Species Associations: The Kendall Coefficient of Concordance Revisited

Species Associations: The Kendall Coefficient of Concordance Revisited Species Associations: The Kendall Coefficient of Concordance Revisited Pierre LEGENDRE The search for species associations is one of the classical problems of community ecology. This article proposes to

More information

The discussion Analyzing beta diversity contains the following papers:

The discussion Analyzing beta diversity contains the following papers: The discussion Analyzing beta diversity contains the following papers: Legendre, P., D. Borcard, and P. Peres-Neto. 2005. Analyzing beta diversity: partitioning the spatial variation of community composition

More information

Modelling directional spatial processes in ecological data

Modelling directional spatial processes in ecological data ecological modelling 215 (2008) 325 336 available at www.sciencedirect.com journal homepage: www.elsevier.com/locate/ecolmodel Modelling directional spatial processes in ecological data F. Guillaume Blanchet

More information

Chapter 11 Canonical analysis

Chapter 11 Canonical analysis Chapter 11 Canonical analysis 11.0 Principles of canonical analysis Canonical analysis is the simultaneous analysis of two, or possibly several data tables. Canonical analyses allow ecologists to perform

More information

Analysis of Multivariate Ecological Data

Analysis of Multivariate Ecological Data Analysis of Multivariate Ecological Data School on Recent Advances in Analysis of Multivariate Ecological Data 24-28 October 2016 Prof. Pierre Legendre Dr. Daniel Borcard Département de sciences biologiques

More information

VarCan (version 1): Variation Estimation and Partitioning in Canonical Analysis

VarCan (version 1): Variation Estimation and Partitioning in Canonical Analysis VarCan (version 1): Variation Estimation and Partitioning in Canonical Analysis Pedro R. Peres-Neto March 2005 Department of Biology University of Regina Regina, SK S4S 0A2, Canada E-mail: Pedro.Peres-Neto@uregina.ca

More information

Testing the significance of canonical axes in redundancy analysis

Testing the significance of canonical axes in redundancy analysis Methods in Ecology and Evolution 2011, 2, 269 277 doi: 10.1111/j.2041-210X.2010.00078.x Testing the significance of canonical axes in redundancy analysis Pierre Legendre 1 *, Jari Oksanen 2 and Cajo J.

More information

Use R! Series Editors:

Use R! Series Editors: Use R! Series Editors: Robert Gentleman Kurt Hornik Giovanni G. Parmigiani Use R! Albert: Bayesian Computation with R Bivand/Pebesma/Gómez-Rubio: Applied Spatial Data Analysis with R Cook/Swayne: Interactive

More information

Experimental Design and Data Analysis for Biologists

Experimental Design and Data Analysis for Biologists Experimental Design and Data Analysis for Biologists Gerry P. Quinn Monash University Michael J. Keough University of Melbourne CAMBRIDGE UNIVERSITY PRESS Contents Preface page xv I I Introduction 1 1.1

More information

Spatial modelling: a comprehensive framework for principal coordinate analysis of neighbour matrices (PCNM)

Spatial modelling: a comprehensive framework for principal coordinate analysis of neighbour matrices (PCNM) ecological modelling 196 (2006) 483 493 available at www.sciencedirect.com journal homepage: www.elsevier.com/locate/ecolmodel Spatial modelling: a comprehensive framework for principal coordinate analysis

More information

QUANTIFYING PHYLOGENETICALLY STRUCTURED ENVIRONMENTAL VARIATION

QUANTIFYING PHYLOGENETICALLY STRUCTURED ENVIRONMENTAL VARIATION BRIEF COMMUNICATIONS 2647 Evolution, 57(11), 2003, pp. 2647 2652 QUANTIFYING PHYLOGENETICALLY STRUCTURED ENVIRONMENTAL VARIATION YVES DESDEVISES, 1,2 PIERRE LEGENDRE, 3,4 LAMIA AZOUZI, 5,6 AND SERGE MORAND

More information

Analyse canonique, partition de la variation et analyse CPMV

Analyse canonique, partition de la variation et analyse CPMV Analyse canonique, partition de la variation et analyse CPMV Legendre, P. 2005. Analyse canonique, partition de la variation et analyse CPMV. Sémin-R, atelier conjoint GREFi-CRBF d initiation au langage

More information

MEMGENE package for R: Tutorials

MEMGENE package for R: Tutorials MEMGENE package for R: Tutorials Paul Galpern 1,2 and Pedro Peres-Neto 3 1 Faculty of Environmental Design, University of Calgary 2 Natural Resources Institute, University of Manitoba 3 Département des

More information

MEMGENE: Spatial pattern detection in genetic distance data

MEMGENE: Spatial pattern detection in genetic distance data Methods in Ecology and Evolution 2014, 5, 1116 1120 doi: 10.1111/2041-210X.12240 APPLICATION MEMGENE: Spatial pattern detection in genetic distance data Paul Galpern 1,2 *, Pedro R. Peres-Neto 3, Jean

More information

Canonical Correlation Analysis of Longitudinal Data

Canonical Correlation Analysis of Longitudinal Data Biometrics Section JSM 2008 Canonical Correlation Analysis of Longitudinal Data Jayesh Srivastava Dayanand N Naik Abstract Studying the relationship between two sets of variables is an important multivariate

More information

Spatial-scale partitioning of in situ turbulent flow data over a pebble cluster in a gravel-bed river

Spatial-scale partitioning of in situ turbulent flow data over a pebble cluster in a gravel-bed river WATER RESOURCES RESEARCH, VOL. 43, W03416, doi:10.109/006wr005044, 007 Spatial-scale partitioning of in situ turbulent flow data over a pebble cluster in a gravel-bed river R. W. Jay Lacey, 1 Pierre Legendre,

More information

Wavelet methods and null models for spatial pattern analysis

Wavelet methods and null models for spatial pattern analysis Wavelet methods and null models for spatial pattern analysis Pavel Dodonov Part of my PhD thesis, by the Federal University of São Carlos (São Carlos, SP, Brazil), supervised by Dr Dalva M. Silva-Matos

More information

Canonical analysis. Pierre Legendre Département de sciences biologiques Université de Montréal

Canonical analysis. Pierre Legendre Département de sciences biologiques Université de Montréal Canonical analysis Pierre Legendre Département de sciences biologiques Université de Montréal http://www.numericalecology.com/ Pierre Legendre 2017 Outline of the presentation 1. Canonical analysis: definition

More information

8. FROM CLASSICAL TO CANONICAL ORDINATION

8. FROM CLASSICAL TO CANONICAL ORDINATION Manuscript of Legendre, P. and H. J. B. Birks. 2012. From classical to canonical ordination. Chapter 8, pp. 201-248 in: Tracking Environmental Change using Lake Sediments, Volume 5: Data handling and numerical

More information

6. Spatial analysis of multivariate ecological data

6. Spatial analysis of multivariate ecological data Université Laval Analyse multivariable - mars-avril 2008 1 6. Spatial analysis of multivariate ecological data 6.1 Introduction 6.1.1 Conceptual importance Ecological models have long assumed, for simplicity,

More information

Modelling the effect of directional spatial ecological processes at different scales

Modelling the effect of directional spatial ecological processes at different scales Oecologia (2011) 166:357 368 DOI 10.1007/s00442-010-1867-y POPULATION ECOLOGY - ORIGINAL PAPER Modelling the effect of directional spatial ecological processes at different scales F. Guillaume Blanchet

More information

Multivariate Data Analysis a survey of data reduction and data association techniques: Principal Components Analysis

Multivariate Data Analysis a survey of data reduction and data association techniques: Principal Components Analysis Multivariate Data Analysis a survey of data reduction and data association techniques: Principal Components Analysis For example Data reduction approaches Cluster analysis Principal components analysis

More information

Dissimilarity and transformations. Pierre Legendre Département de sciences biologiques Université de Montréal

Dissimilarity and transformations. Pierre Legendre Département de sciences biologiques Université de Montréal and transformations Pierre Legendre Département de sciences biologiques Université de Montréal http://www.numericalecology.com/ Pierre Legendre 2017 Definitions An association coefficient is a function

More information

Compare several group means: ANOVA

Compare several group means: ANOVA 1 - Introduction Compare several group means: ANOVA Pierre Legendre, Université de Montréal August 009 Objective: compare the means of several () groups for a response variable of interest. The groups

More information

DETECTING BIOLOGICAL AND ENVIRONMENTAL CHANGES: DESIGN AND ANALYSIS OF MONITORING AND EXPERIMENTS (University of Bologna, 3-14 March 2008)

DETECTING BIOLOGICAL AND ENVIRONMENTAL CHANGES: DESIGN AND ANALYSIS OF MONITORING AND EXPERIMENTS (University of Bologna, 3-14 March 2008) Dipartimento di Biologia Evoluzionistica Sperimentale Centro Interdipartimentale di Ricerca per le Scienze Ambientali in Ravenna INTERNATIONAL WINTER SCHOOL UNIVERSITY OF BOLOGNA DETECTING BIOLOGICAL AND

More information

VARIATION PARTITIONING OF SPECIES DATA MATRICES: ESTIMATION AND COMPARISON OF FRACTIONS

VARIATION PARTITIONING OF SPECIES DATA MATRICES: ESTIMATION AND COMPARISON OF FRACTIONS Ecology, 87(10), 2006, pp. 2614 2625 Ó 2006 by the Ecological Society of America VARIATION PARTITIONING OF SPECIES DATA MATRICES: ESTIMATION AND COMPARISON OF FRACTIONS PEDRO R. PERES-NETO, 1 PIERRE LEGENDRE,

More information

Multivariate Analysis of Ecological Data using CANOCO

Multivariate Analysis of Ecological Data using CANOCO Multivariate Analysis of Ecological Data using CANOCO JAN LEPS University of South Bohemia, and Czech Academy of Sciences, Czech Republic Universitats- uric! Lanttesbibiiothek Darmstadt Bibliothek Biologie

More information

REVIEWS. Villeurbanne, France. des Plantes (AMAP), Boulevard de la Lironde, TA A-51/PS2, F Montpellier cedex 5, France. Que bec H3C 3P8 Canada

REVIEWS. Villeurbanne, France. des Plantes (AMAP), Boulevard de la Lironde, TA A-51/PS2, F Montpellier cedex 5, France. Que bec H3C 3P8 Canada Ecological Monographs, 82(3), 2012, pp. 257 275 Ó 2012 by the Ecological Society of America Community ecology in the age of multivariate multiscale spatial analysis S. DRAY, 1,16 R. PE LISSIER, 2,3 P.

More information

Community ecology in the age of multivariate multiscale spatial analysis

Community ecology in the age of multivariate multiscale spatial analysis TSPACE RESEARCH REPOSITORY tspace.library.utoronto.ca 2012 Community ecology in the age of multivariate multiscale spatial analysis Published version Stéphane Dray Raphaël Pélissier Pierre Couteron Marie-Josée

More information

BIOL 458 BIOMETRY Lab 8 - Nested and Repeated Measures ANOVA

BIOL 458 BIOMETRY Lab 8 - Nested and Repeated Measures ANOVA BIOL 458 BIOMETRY Lab 8 - Nested and Repeated Measures ANOVA PART 1: NESTED ANOVA Nested designs are used when levels of one factor are not represented within all levels of another factor. Often this is

More information

NONLINEAR REDUNDANCY ANALYSIS AND CANONICAL CORRESPONDENCE ANALYSIS BASED ON POLYNOMIAL REGRESSION

NONLINEAR REDUNDANCY ANALYSIS AND CANONICAL CORRESPONDENCE ANALYSIS BASED ON POLYNOMIAL REGRESSION Ecology, 8(4),, pp. 4 by the Ecological Society of America NONLINEAR REDUNDANCY ANALYSIS AND CANONICAL CORRESPONDENCE ANALYSIS BASED ON POLYNOMIAL REGRESSION VLADIMIR MAKARENKOV, AND PIERRE LEGENDRE, Département

More information

Bringing multivariate support to multiscale codependence analysis: Assessing the drivers of community structure across spatial scales

Bringing multivariate support to multiscale codependence analysis: Assessing the drivers of community structure across spatial scales Received: February 7 Accepted: July 7 DOI:./-X. RESEARCH ARTICLE Bringing multivariate support to multiscale codependence analysis: Assessing the drivers of community structure across spatial scales Guillaume

More information

Lecture 5: Ecological distance metrics; Principal Coordinates Analysis. Univariate testing vs. community analysis

Lecture 5: Ecological distance metrics; Principal Coordinates Analysis. Univariate testing vs. community analysis Lecture 5: Ecological distance metrics; Principal Coordinates Analysis Univariate testing vs. community analysis Univariate testing deals with hypotheses concerning individual taxa Is this taxon differentially

More information

Bootstrapping, Randomization, 2B-PLS

Bootstrapping, Randomization, 2B-PLS Bootstrapping, Randomization, 2B-PLS Statistics, Tests, and Bootstrapping Statistic a measure that summarizes some feature of a set of data (e.g., mean, standard deviation, skew, coefficient of variation,

More information

Chapter 1 Statistical Inference

Chapter 1 Statistical Inference Chapter 1 Statistical Inference causal inference To infer causality, you need a randomized experiment (or a huge observational study and lots of outside information). inference to populations Generalizations

More information

Image Data Compression

Image Data Compression Image Data Compression Image data compression is important for - image archiving e.g. satellite data - image transmission e.g. web data - multimedia applications e.g. desk-top editing Image data compression

More information

Generalized Linear Models (GLZ)

Generalized Linear Models (GLZ) Generalized Linear Models (GLZ) Generalized Linear Models (GLZ) are an extension of the linear modeling process that allows models to be fit to data that follow probability distributions other than the

More information

Statistical comparison of univariate tests of homogeneity of variances

Statistical comparison of univariate tests of homogeneity of variances Submitted to the Journal of Statistical Computation and Simulation Statistical comparison of univariate tests of homogeneity of variances Pierre Legendre* and Daniel Borcard Département de sciences biologiques,

More information

Principal component analysis

Principal component analysis Principal component analysis Motivation i for PCA came from major-axis regression. Strong assumption: single homogeneous sample. Free of assumptions when used for exploration. Classical tests of significance

More information

Rejection regions for the bivariate case

Rejection regions for the bivariate case Rejection regions for the bivariate case The rejection region for the T 2 test (and similarly for Z 2 when Σ is known) is the region outside of an ellipse, for which there is a (1-α)% chance that the test

More information

Correlation. A statistics method to measure the relationship between two variables. Three characteristics

Correlation. A statistics method to measure the relationship between two variables. Three characteristics Correlation Correlation A statistics method to measure the relationship between two variables Three characteristics Direction of the relationship Form of the relationship Strength/Consistency Direction

More information

Multivariate Analysis of Ecological Data

Multivariate Analysis of Ecological Data Multivariate Analysis of Ecological Data MICHAEL GREENACRE Professor of Statistics at the Pompeu Fabra University in Barcelona, Spain RAUL PRIMICERIO Associate Professor of Ecology, Evolutionary Biology

More information

Multilevel Models in Matrix Form. Lecture 7 July 27, 2011 Advanced Multivariate Statistical Methods ICPSR Summer Session #2

Multilevel Models in Matrix Form. Lecture 7 July 27, 2011 Advanced Multivariate Statistical Methods ICPSR Summer Session #2 Multilevel Models in Matrix Form Lecture 7 July 27, 2011 Advanced Multivariate Statistical Methods ICPSR Summer Session #2 Today s Lecture Linear models from a matrix perspective An example of how to do

More information

Species associations

Species associations Species associations Pierre Legendre 1 and F. Guillaume Blanchet 2 1 Département de sciences biologiques, Université de Montréal 2 Department of Renewable Resources, University of Alberta Introduction

More information

The One-Way Repeated-Measures ANOVA. (For Within-Subjects Designs)

The One-Way Repeated-Measures ANOVA. (For Within-Subjects Designs) The One-Way Repeated-Measures ANOVA (For Within-Subjects Designs) Logic of the Repeated-Measures ANOVA The repeated-measures ANOVA extends the analysis of variance to research situations using repeated-measures

More information

Applied Multivariate and Longitudinal Data Analysis

Applied Multivariate and Longitudinal Data Analysis Applied Multivariate and Longitudinal Data Analysis Chapter 2: Inference about the mean vector(s) II Ana-Maria Staicu SAS Hall 5220; 919-515-0644; astaicu@ncsu.edu 1 1 Compare Means from More Than Two

More information

Introduction to ordination. Gary Bradfield Botany Dept.

Introduction to ordination. Gary Bradfield Botany Dept. Introduction to ordination Gary Bradfield Botany Dept. Ordination there appears to be no word in English which one can use as an antonym to classification ; I would like to propose the term ordination.

More information

A Likelihood Ratio Test

A Likelihood Ratio Test A Likelihood Ratio Test David Allen University of Kentucky February 23, 2012 1 Introduction Earlier presentations gave a procedure for finding an estimate and its standard error of a single linear combination

More information

Reports. Using codispersion analysis to characterize spatial patterns in species co- occurrences

Reports. Using codispersion analysis to characterize spatial patterns in species co- occurrences Ecology, 97(1), 2016, pp. 32 39 2016 by the Ecological Society of America Using codispersion analysis to characterize spatial patterns in species co- occurrences H ANNAH L. BUCKLEY, 1,2,4 B RADLEY S. CASE,

More information

PROJECT PARTNER: PROJECT PARTNER:

PROJECT PARTNER: PROJECT PARTNER: PROJECT TITLE: Development of a Statewide Freshwater Mussel Monitoring Program in Arkansas PROJECT SUMMARY: A large data set exists for Arkansas' large river mussel fauna that is maintained in the Arkansas

More information

Analysis of Variance: Part 1

Analysis of Variance: Part 1 Analysis of Variance: Part 1 Oneway ANOVA When there are more than two means Each time two means are compared the probability (Type I error) =α. When there are more than two means Each time two means are

More information

Review: General Approach to Hypothesis Testing. 1. Define the research question and formulate the appropriate null and alternative hypotheses.

Review: General Approach to Hypothesis Testing. 1. Define the research question and formulate the appropriate null and alternative hypotheses. 1 Review: Let X 1, X,..., X n denote n independent random variables sampled from some distribution might not be normal!) with mean µ) and standard deviation σ). Then X µ σ n In other words, X is approximately

More information

Topic 12. The Split-plot Design and its Relatives (continued) Repeated Measures

Topic 12. The Split-plot Design and its Relatives (continued) Repeated Measures 12.1 Topic 12. The Split-plot Design and its Relatives (continued) Repeated Measures 12.9 Repeated measures analysis Sometimes researchers make multiple measurements on the same experimental unit. We have

More information

Confidence Intervals, Testing and ANOVA Summary

Confidence Intervals, Testing and ANOVA Summary Confidence Intervals, Testing and ANOVA Summary 1 One Sample Tests 1.1 One Sample z test: Mean (σ known) Let X 1,, X n a r.s. from N(µ, σ) or n > 30. Let The test statistic is H 0 : µ = µ 0. z = x µ 0

More information

An Introduction to Path Analysis

An Introduction to Path Analysis An Introduction to Path Analysis PRE 905: Multivariate Analysis Lecture 10: April 15, 2014 PRE 905: Lecture 10 Path Analysis Today s Lecture Path analysis starting with multivariate regression then arriving

More information

sphericity, 5-29, 5-32 residuals, 7-1 spread and level, 2-17 t test, 1-13 transformations, 2-15 violations, 1-19

sphericity, 5-29, 5-32 residuals, 7-1 spread and level, 2-17 t test, 1-13 transformations, 2-15 violations, 1-19 additive tree structure, 10-28 ADDTREE, 10-51, 10-53 EXTREE, 10-31 four point condition, 10-29 ADDTREE, 10-28, 10-51, 10-53 adjusted R 2, 8-7 ALSCAL, 10-49 ANCOVA, 9-1 assumptions, 9-5 example, 9-7 MANOVA

More information

A UNIFIED STRATEGY FOR ESTIMATING AND CONTROLLING SPATIAL, TEMPORAL AND PHYLOGENETIC AUTOCORRELATION IN ECOLOGICAL MODELS

A UNIFIED STRATEGY FOR ESTIMATING AND CONTROLLING SPATIAL, TEMPORAL AND PHYLOGENETIC AUTOCORRELATION IN ECOLOGICAL MODELS A UNIFIED STRATEGY FOR ESTIMATING AND CONTROLLING SPATIAL, TEMPORAL AND PHYLOGENETIC AUTOCORRELATION IN ECOLOGICAL MODELS Pedro R. Peres-Neto Université du Québec à Montréal. Case postale 8888, succursale

More information

4/4/2018. Stepwise model fitting. CCA with first three variables only Call: cca(formula = community ~ env1 + env2 + env3, data = envdata)

4/4/2018. Stepwise model fitting. CCA with first three variables only Call: cca(formula = community ~ env1 + env2 + env3, data = envdata) 0 Correlation matrix for ironmental matrix 1 2 3 4 5 6 7 8 9 10 11 12 0.087451 0.113264 0.225049-0.13835 0.338366-0.01485 0.166309-0.11046 0.088327-0.41099-0.19944 1 1 2 0.087451 1 0.13723-0.27979 0.062584

More information

Lecture 5: Ecological distance metrics; Principal Coordinates Analysis. Univariate testing vs. community analysis

Lecture 5: Ecological distance metrics; Principal Coordinates Analysis. Univariate testing vs. community analysis Lecture 5: Ecological distance metrics; Principal Coordinates Analysis Univariate testing vs. community analysis Univariate testing deals with hypotheses concerning individual taxa Is this taxon differentially

More information

Principal Components Analysis (PCA)

Principal Components Analysis (PCA) Principal Components Analysis (PCA) Principal Components Analysis (PCA) a technique for finding patterns in data of high dimension Outline:. Eigenvectors and eigenvalues. PCA: a) Getting the data b) Centering

More information

Statistical foundations

Statistical foundations Statistical foundations Michael K. Tippett International Research Institute for Climate and Societ The Earth Institute, Columbia Universit ERFS Climate Predictabilit Tool Training Workshop Ma 4-9, 29 Ideas

More information

Introduction to Machine Learning

Introduction to Machine Learning 10-701 Introduction to Machine Learning PCA Slides based on 18-661 Fall 2018 PCA Raw data can be Complex, High-dimensional To understand a phenomenon we measure various related quantities If we knew what

More information

Correlation 1. December 4, HMS, 2017, v1.1

Correlation 1. December 4, HMS, 2017, v1.1 Correlation 1 December 4, 2017 1 HMS, 2017, v1.1 Chapter References Diez: Chapter 7 Navidi, Chapter 7 I don t expect you to learn the proofs what will follow. Chapter References 2 Correlation The sample

More information

An Introduction to Multivariate Statistical Analysis

An Introduction to Multivariate Statistical Analysis An Introduction to Multivariate Statistical Analysis Third Edition T. W. ANDERSON Stanford University Department of Statistics Stanford, CA WILEY- INTERSCIENCE A JOHN WILEY & SONS, INC., PUBLICATION Contents

More information

Course in Data Science

Course in Data Science Course in Data Science About the Course: In this course you will get an introduction to the main tools and ideas which are required for Data Scientist/Business Analyst/Data Analyst. The course gives an

More information

Categorical Predictor Variables

Categorical Predictor Variables Categorical Predictor Variables We often wish to use categorical (or qualitative) variables as covariates in a regression model. For binary variables (taking on only 2 values, e.g. sex), it is relatively

More information

Algebra of Principal Component Analysis

Algebra of Principal Component Analysis Algebra of Principal Component Analysis 3 Data: Y = 5 Centre each column on its mean: Y c = 7 6 9 y y = 3..6....6.8 3. 3.8.6 Covariance matrix ( variables): S = -----------Y n c ' Y 8..6 c =.6 5.8 Equation

More information

Stat 5101 Lecture Notes

Stat 5101 Lecture Notes Stat 5101 Lecture Notes Charles J. Geyer Copyright 1998, 1999, 2000, 2001 by Charles J. Geyer May 7, 2001 ii Stat 5101 (Geyer) Course Notes Contents 1 Random Variables and Change of Variables 1 1.1 Random

More information

Statistics for Managers using Microsoft Excel 6 th Edition

Statistics for Managers using Microsoft Excel 6 th Edition Statistics for Managers using Microsoft Excel 6 th Edition Chapter 13 Simple Linear Regression 13-1 Learning Objectives In this chapter, you learn: How to use regression analysis to predict the value of

More information

Mean Ellenberg indicator values as explanatory variables in constrained ordination. David Zelený

Mean Ellenberg indicator values as explanatory variables in constrained ordination. David Zelený Mean Ellenberg indicator values as explanatory variables in constrained ordination David Zelený Heinz Ellenberg Use of mean Ellenberg indicator values in vegetation analysis species composition observed

More information

2.830 Homework #6. April 2, 2009

2.830 Homework #6. April 2, 2009 2.830 Homework #6 Dayán Páez April 2, 2009 1 ANOVA The data for four different lithography processes, along with mean and standard deviations are shown in Table 1. Assume a null hypothesis of equality.

More information

Structure in Data. A major objective in data analysis is to identify interesting features or structure in the data.

Structure in Data. A major objective in data analysis is to identify interesting features or structure in the data. Structure in Data A major objective in data analysis is to identify interesting features or structure in the data. The graphical methods are very useful in discovering structure. There are basically two

More information

Mixed-Models. version 30 October 2011

Mixed-Models. version 30 October 2011 Mixed-Models version 30 October 2011 Mixed models Mixed models estimate a vector! of fixed effects and one (or more) vectors u of random effects Both fixed and random effects models always include a vector

More information

Applied Multivariate Statistical Analysis Richard Johnson Dean Wichern Sixth Edition

Applied Multivariate Statistical Analysis Richard Johnson Dean Wichern Sixth Edition Applied Multivariate Statistical Analysis Richard Johnson Dean Wichern Sixth Edition Pearson Education Limited Edinburgh Gate Harlow Essex CM20 2JE England and Associated Companies throughout the world

More information

Is the sampling strategy interfering with the study of spatial variability of zooplankton communities?

Is the sampling strategy interfering with the study of spatial variability of zooplankton communities? 1940 Is the sampling strategy interfering with the study of spatial variability of zooplankton communities? Carol Avois, Pierre Legendre, Stéphane Masson, and Bernadette Pinel-Alloul Abstract: Surveys

More information

Introduction to the Design and Analysis of Experiments

Introduction to the Design and Analysis of Experiments Introduction to the Design and Analysis of Experiments Geoffrey M. Clarke, MA,Dip.stats.,c.stat. Honorary Reader in Applied Statistics, University of Kent at Canterbury and Consultant to the Applied Statistics

More information

WELCOME! Lecture 13 Thommy Perlinger

WELCOME! Lecture 13 Thommy Perlinger Quantitative Methods II WELCOME! Lecture 13 Thommy Perlinger Parametrical tests (tests for the mean) Nature and number of variables One-way vs. two-way ANOVA One-way ANOVA Y X 1 1 One dependent variable

More information

DESAIN EKSPERIMEN Analysis of Variances (ANOVA) Semester Genap 2017/2018 Jurusan Teknik Industri Universitas Brawijaya

DESAIN EKSPERIMEN Analysis of Variances (ANOVA) Semester Genap 2017/2018 Jurusan Teknik Industri Universitas Brawijaya DESAIN EKSPERIMEN Analysis of Variances (ANOVA) Semester Jurusan Teknik Industri Universitas Brawijaya Outline Introduction The Analysis of Variance Models for the Data Post-ANOVA Comparison of Means Sample

More information

An Introduction to Mplus and Path Analysis

An Introduction to Mplus and Path Analysis An Introduction to Mplus and Path Analysis PSYC 943: Fundamentals of Multivariate Modeling Lecture 10: October 30, 2013 PSYC 943: Lecture 10 Today s Lecture Path analysis starting with multivariate regression

More information

22s:152 Applied Linear Regression

22s:152 Applied Linear Regression 22s:152 Applied Linear Regression Chapter 7: Dummy Variable Regression So far, we ve only considered quantitative variables in our models. We can integrate categorical predictors by constructing artificial

More information

Principal Component Analysis -- PCA (also called Karhunen-Loeve transformation)

Principal Component Analysis -- PCA (also called Karhunen-Loeve transformation) Principal Component Analysis -- PCA (also called Karhunen-Loeve transformation) PCA transforms the original input space into a lower dimensional space, by constructing dimensions that are linear combinations

More information

Selection on Multiple Traits

Selection on Multiple Traits Selection on Multiple Traits Bruce Walsh lecture notes Uppsala EQG 2012 course version 7 Feb 2012 Detailed reading: Chapter 30 Genetic vs. Phenotypic correlations Within an individual, trait values can

More information

Spatial autocorrelation: robustness of measures and tests

Spatial autocorrelation: robustness of measures and tests Spatial autocorrelation: robustness of measures and tests Marie Ernst and Gentiane Haesbroeck University of Liege London, December 14, 2015 Spatial Data Spatial data : geographical positions non spatial

More information

Longitudinal Data Analysis of Health Outcomes

Longitudinal Data Analysis of Health Outcomes Longitudinal Data Analysis of Health Outcomes Longitudinal Data Analysis Workshop Running Example: Days 2 and 3 University of Georgia: Institute for Interdisciplinary Research in Education and Human Development

More information