4/4/2018. Stepwise model fitting. CCA with first three variables only Call: cca(formula = community ~ env1 + env2 + env3, data = envdata)
|
|
- Dale Atkins
- 5 years ago
- Views:
Transcription
1 0 Correlation matrix for ironmental matrix vif.cca (Variance Inflation Scores) > vif.cca(base_cca) spatialb spatialc Eliminate variable 10: spatialb spatialc Eliminate variable spatial: cca(formula = community ~ spatial + 7, data = data) Total Constrained Unconstrained Eigalues, and their contribution to the mean squared contingency coefficient CCA3 CA1 CA2 CA3 CA4 CA5 CA6 CA7 CA8 CA9 Eigalue Proportion Explained Cumulative Proportion Accumulated constrained eigalues CCA3 Eigalue Proportion Explained Cumulative Proportion sample16 spatialb spatialc > vif.cca(model_fit) spatialb spatialc CCA with first three variables only Call: cca(formula = community ~ , data = data) Total Constrained Unconstrained Eigalues, and their contribution to the mean squared contingency coefficient CCA3 CA1 CA2 CA3 CA4 CA5 CA6 CA7 CA8 CA9 Eigalue Prop Explain Cum. Prop Accumulated constrained eigalues CCA Eigalue Proportion Explained Cumulative Proportion sample Stepwise model fitting Start: community ~ ** + spatial ** ** Step: community ~ spatial ** - spatial ** Step: community ~ spatial + 7 Note that exact output changes each time you run it (P values permutational). Adjust Pin option to 0.2 and Pout to 0.3: Step: community ~ spatial ** spatial ** Step: community ~ spatial ** - spatial **
2 sample16 sample3 Tests of significance What to Present > anova(model_fit) ** Model > anova(model_fit, by = "terms", perm = 1000) Terms added sequentially (first to last) ** spatial ** > anova(model_fit, by = "axis", perm = 1000) ** ** CCA CCA CCA aligned with CA, species relationships assumed to be unimodal A CCA with no constraints = CA RDA aligned with PCA, species relationships assumed to be linear An RDA with no constraints = PCA CCA is more common for the same reasons PCA not usually used for species data. Function rda (vegan package) works similarly to cca. rda(community ~ spatial + 7, data) Total inertia differs because PCA begins with variance/covariance matrix, not X 2 Total Constrained Unconstrained Eigalues, and their contribution to the correlations RDA spatialb RDA1 RDA2 RDA3 PC1 PC2 PC3 PC4 PC5 PC6 PC7 Eigalue Proportion Explained Cumulative Proportion RDA1 spatialc
3 sample3 0 PDA without ironmental matrix = PCA rda(x = community, scale = T) Partitioning of correlations: Total 10 1 Unconstrained 10 1 Eigalues, and their contribution to the correlations PC sample3 sample PC1 Example RDA application How do various ironmental variables affect morphology or behavior (plasticity question)? Assumption is that you know and have measured the ironmental variables that are going to impact the morphology/behavior measured. Morphology/Behavior Env. Variables PC1 PC2 PC3 PC4 PC5 PC6 PC7 PC8 PC9 PC10 Eigalue Proportion Explained Cumulative Proportion Individuals Samples Constrained Analysis of Principal Coordinates (capscale) Function capscale (vegan package) Similar to RDA but based on non-euclidean distances Function works with raw data (indicate which distance metric to use) or with a distance matrix (e.g. genetic distance) Same anova function tests for significance Various options for dealing with data standardization Also called distance based redundancy analysis in literature Anderson, M.J. & Willis, T.J. (2003). Canonical analysis of principal coordinates: a useful method of constrained ordination for ecology. Ecology 84, CAP spatialb sample16 spatialc Call: CAP1 capscale(formula = community ~ spatial + 7, data = data, distance = "bray") Partitioning of squared Bray distance: Total Constrained Unconstrained Gradients unknown, want to describe data or imply gradients from species data. No yes Important gradients are known and measured. Goal is to describe variability in species data as it relates to the measured gradients. Variability due to other unmeasured factors is ignored. Direct gradient analysis: CCA or RDA. Data are thought to be distinctly structured. Goal is to put in distinct groups. no yes Cluster hierarchical (agglomerative or divisive) or non hierarchial (k means). Goal is to look at the relationship among samples. Indirect gradient analysis: NMDS or CA preferred, PCA or PCoA for noncommunity datasets. 3
4 Unconstrained NMDS approach (exploratory) NMDS sample sample Used when you want to control (or partial out) a variable. Eg. larger spatial variable or something traditionally considered a covariate. Typically not something you re interested in quantifying as much as controlling for. Recall that CCA (and RDA, capscale) use multiple regression to optimize constraints to species or sample scores. Partial Regression Regression coefficient of X after controlling for influence of Z on both X and Y. NMDS1 Dementia Score Dementia Score Age Wanted relationship between X (age) and y (degree of dementia). Numerous other variables confound the relationship, such as IQ and mean number of hours spent reading daily. Residualized dementia score, effects of IQ have been partialed out. IQ Residualized Dimentia Age 4
5 0 0 All the same precautions apply as with CCA. Output will be nearly identical with the inclusion of the variance accounted for by the partialed variable. Total inertial in the ordination same as CA Inertia partitioned out comparison of the model with and without partial variable Inertia constrained by variables in model Unconstrained inertia Uses the same function as CCA or RDA Code cca_example<-cca(community ~ Condition(spatial),ironmental) There is no significance test for the partialed variable(s), just the % variance accounted for. cca(formula = community ~ 7 + Condition(spatial), data = data) Total Conditioned Constrained Unconstrained CA II a b c sample16 spatialb spatialc Test for influence of 7 while controlling for spatial variable. Eigalues, and their contribution to the mean squared contingency coefficient after removing the contribution of conditiniong variables CA1 CA2 CA3 CA4 CA5 CA6 CA7 CA8 CA9 Eigalue Proportion Explained Cumulative Proportion CA sample > anova(partial_cca) Model: cca(formula = community ~ 7 + Condition(spatial), data = data) Model ** Residual > anova(partial_cca, by = "terms", perm = 1000) Terms added sequentially (first to last) Model: cca(formula = community ~ 7 + Condition(spatial), data = data) ** Residual CA I 5
4/2/2018. Canonical Analyses Analysis aimed at identifying the relationship between two multivariate datasets. Cannonical Correlation.
GAL50.44 0 7 becki 2 0 chatamensis 0 darwini 0 ephyppium 0 guntheri 3 0 hoodensis 0 microphyles 0 porteri 2 0 vandenburghi 0 vicina 4 0 Multiple Response Variables? Univariate Statistics Questions Individual
More informationMultivariate Statistics Summary and Comparison of Techniques. Multivariate Techniques
Multivariate Statistics Summary and Comparison of Techniques P The key to multivariate statistics is understanding conceptually the relationship among techniques with regards to: < The kinds of problems
More information-Principal components analysis is by far the oldest multivariate technique, dating back to the early 1900's; ecologists have used PCA since the
1 2 3 -Principal components analysis is by far the oldest multivariate technique, dating back to the early 1900's; ecologists have used PCA since the 1950's. -PCA is based on covariance or correlation
More informationINTRODUCTION TO MULTIVARIATE ANALYSIS OF ECOLOGICAL DATA
INTRODUCTION TO MULTIVARIATE ANALYSIS OF ECOLOGICAL DATA David Zelený & Ching-Feng Li INTRODUCTION TO MULTIVARIATE ANALYSIS Ecologial similarity similarity and distance indices Gradient analysis regression,
More informationMultivariate Analysis of Ecological Data using CANOCO
Multivariate Analysis of Ecological Data using CANOCO JAN LEPS University of South Bohemia, and Czech Academy of Sciences, Czech Republic Universitats- uric! Lanttesbibiiothek Darmstadt Bibliothek Biologie
More informationMean Ellenberg indicator values as explanatory variables in constrained ordination. David Zelený
Mean Ellenberg indicator values as explanatory variables in constrained ordination David Zelený Heinz Ellenberg Use of mean Ellenberg indicator values in vegetation analysis species composition observed
More informationChapter 11 Canonical analysis
Chapter 11 Canonical analysis 11.0 Principles of canonical analysis Canonical analysis is the simultaneous analysis of two, or possibly several data tables. Canonical analyses allow ecologists to perform
More informationAn Introduction to R for the Geosciences: Ordination I
An Introduction to R for the Geosciences: Ordination I Gavin Simpson April 29, 2013 Summary This practical will use the PONDS dataset to demonstrate methods of indirect gradient analysis (PCA, CA, and
More informationDETECTING BIOLOGICAL AND ENVIRONMENTAL CHANGES: DESIGN AND ANALYSIS OF MONITORING AND EXPERIMENTS (University of Bologna, 3-14 March 2008)
Dipartimento di Biologia Evoluzionistica Sperimentale Centro Interdipartimentale di Ricerca per le Scienze Ambientali in Ravenna INTERNATIONAL WINTER SCHOOL UNIVERSITY OF BOLOGNA DETECTING BIOLOGICAL AND
More informationIndirect Gradient Analysis
Indirect Gradient Analysis Gavin Simpson May 12, 2006 Summary This practical will use the PONDS dataset to demonstrate methods of indirect gradient analysis (PCA, CA, and DCA) of species and environmental
More informationMultivariate Statistics 101. Ordination (PCA, NMDS, CA) Cluster Analysis (UPGMA, Ward s) Canonical Correspondence Analysis
Multivariate Statistics 101 Ordination (PCA, NMDS, CA) Cluster Analysis (UPGMA, Ward s) Canonical Correspondence Analysis Multivariate Statistics 101 Copy of slides and exercises PAST software download
More informationAnalysis of Multivariate Ecological Data
Analysis of Multivariate Ecological Data School on Recent Advances in Analysis of Multivariate Ecological Data 24-28 October 2016 Prof. Pierre Legendre Dr. Daniel Borcard Département de sciences biologiques
More informationLab 7. Direct & Indirect Gradient Analysis
Lab 7 Direct & Indirect Gradient Analysis Direct and indirect gradient analysis refers to a case where you have two datasets with variables that have cause-and-effect or mutual influences on each other.
More informationExperimental Design and Data Analysis for Biologists
Experimental Design and Data Analysis for Biologists Gerry P. Quinn Monash University Michael J. Keough University of Melbourne CAMBRIDGE UNIVERSITY PRESS Contents Preface page xv I I Introduction 1 1.1
More informationAnalysis of community ecology data in R
Analysis of community ecology data in R Jinliang Liu ( 刘金亮 ) Institute of Ecology, College of Life Science Zhejiang University Email: jinliang.liu@foxmail.com http://jinliang.weebly.com R packages ###
More informationUnconstrained Ordination
Unconstrained Ordination Sites Species A Species B Species C Species D Species E 1 0 (1) 5 (1) 1 (1) 10 (4) 10 (4) 2 2 (3) 8 (3) 4 (3) 12 (6) 20 (6) 3 8 (6) 20 (6) 10 (6) 1 (2) 3 (2) 4 4 (5) 11 (5) 8 (5)
More informationCAP. Canonical Analysis of Principal coordinates. A computer program by Marti J. Anderson. Department of Statistics University of Auckland (2002)
CAP Canonical Analysis of Principal coordinates A computer program by Marti J. Anderson Department of Statistics University of Auckland (2002) 2 DISCLAIMER This FORTRAN program is provided without any
More information4. Ordination in reduced space
Université Laval Analyse multivariable - mars-avril 2008 1 4.1. Generalities 4. Ordination in reduced space Contrary to most clustering techniques, which aim at revealing discontinuities in the data, ordination
More informationANOVA approach. Investigates interaction terms. Disadvantages: Requires careful sampling design with replication
ANOVA approach Advantages: Ideal for evaluating hypotheses Ideal to quantify effect size (e.g., differences between groups) Address multiple factors at once Investigates interaction terms Disadvantages:
More information1.3. Principal coordinate analysis. Pierre Legendre Département de sciences biologiques Université de Montréal
1.3. Pierre Legendre Département de sciences biologiques Université de Montréal http://www.numericalecology.com/ Pierre Legendre 2018 Definition of principal coordinate analysis (PCoA) An ordination method
More informationLecture 2: Diversity, Distances, adonis. Lecture 2: Diversity, Distances, adonis. Alpha- Diversity. Alpha diversity definition(s)
Lecture 2: Diversity, Distances, adonis Lecture 2: Diversity, Distances, adonis Diversity - alpha, beta (, gamma) Beta- Diversity in practice: Ecological Distances Unsupervised Learning: Clustering, etc
More informationIntroduction to ordination. Gary Bradfield Botany Dept.
Introduction to ordination Gary Bradfield Botany Dept. Ordination there appears to be no word in English which one can use as an antonym to classification ; I would like to propose the term ordination.
More informationMultivariate Analysis of Ecological Data
Multivariate Analysis of Ecological Data MICHAEL GREENACRE Professor of Statistics at the Pompeu Fabra University in Barcelona, Spain RAUL PRIMICERIO Associate Professor of Ecology, Evolutionary Biology
More informationOrdination & PCA. Ordination. Ordination
Ordination & PCA Introduction to Ordination Purpose & types Shepard diagrams Principal Components Analysis (PCA) Properties Computing eigenvalues Computing principal components Biplots Covariance vs. Correlation
More information8. FROM CLASSICAL TO CANONICAL ORDINATION
Manuscript of Legendre, P. and H. J. B. Birks. 2012. From classical to canonical ordination. Chapter 8, pp. 201-248 in: Tracking Environmental Change using Lake Sediments, Volume 5: Data handling and numerical
More informationBIO 682 Multivariate Statistics Spring 2008
BIO 682 Multivariate Statistics Spring 2008 Steve Shuster http://www4.nau.edu/shustercourses/bio682/index.htm Lecture 11 Properties of Community Data Gauch 1982, Causton 1988, Jongman 1995 a. Qualitative:
More informationMultivariate analysis
Multivariate analysis Prof dr Ann Vanreusel -Multidimensional scaling -Simper analysis -BEST -ANOSIM 1 2 Gradient in species composition 3 4 Gradient in environment site1 site2 site 3 site 4 site species
More informationMultivariate analysis of genetic data: an introduction
Multivariate analysis of genetic data: an introduction Thibaut Jombart MRC Centre for Outbreak Analysis and Modelling Imperial College London XXIV Simposio Internacional De Estadística Bogotá, 25th July
More informationMultivariate Ordination Analyses: Principal Component Analysis. Dilys Vela
Multivariate Ordination Analyses: Principal Component Analysis Dilys Vela Tatiana Boza Multivariate Analyses A multivariate data set includes more than one variable ibl recorded dd from a number of replicate
More informationG E INTERACTION USING JMP: AN OVERVIEW
G E INTERACTION USING JMP: AN OVERVIEW Sukanta Dash I.A.S.R.I., Library Avenue, New Delhi-110012 sukanta@iasri.res.in 1. Introduction Genotype Environment interaction (G E) is a common phenomenon in agricultural
More informationCanonical analysis. Pierre Legendre Département de sciences biologiques Université de Montréal
Canonical analysis Pierre Legendre Département de sciences biologiques Université de Montréal http://www.numericalecology.com/ Pierre Legendre 2017 Outline of the presentation 1. Canonical analysis: definition
More informationFactors affecting the Power and Validity of Randomization-based Multivariate Tests for Difference among Ecological Assemblages
Factors affecting the Power and Validity of Randomization-based Multivariate Tests for Difference among Ecological Assemblages Cameron Hurst B.Sc. (Hons) This thesis was submitted in fulfillment of the
More informationPartial regression and variation partitioning
Partial regression and variation partitioning Pierre Legendre Département de sciences biologiques Université de Montréal http://www.numericalecology.com/ Pierre Legendre 2017 Outline of the presentation
More informationDesign decisions and implementation details in vegan
Design decisions and implementation details in vegan Jari Oksanen processed with vegan 2.3-5 in R version 3.2.4 Patched (2016-03-28 r70390) on April 4, 2016 Abstract This document describes design decisions,
More informationShort Answer Questions: Answer on your separate blank paper. Points are given in parentheses.
ISQS 6348 Final exam solutions. Name: Open book and notes, but no electronic devices. Answer short answer questions on separate blank paper. Answer multiple choice on this exam sheet. Put your name on
More informationIntroduction to Regression
Regression Introduction to Regression If two variables covary, we should be able to predict the value of one variable from another. Correlation only tells us how much two variables covary. In regression,
More informationFigure 43 - The three components of spatial variation
Université Laval Analyse multivariable - mars-avril 2008 1 6.3 Modeling spatial structures 6.3.1 Introduction: the 3 components of spatial structure For a good understanding of the nature of spatial variation,
More information2/19/2018. Dataset: 85,122 islands 19,392 > 1km 2 17,883 with data
The group numbers are arbitrary. Remember that you can rotate dendrograms around any node and not change the meaning. So, the order of the clusters is not meaningful. Taking a subset of the data changes
More informationdiversity(datamatrix, index= shannon, base=exp(1))
Tutorial 11: Diversity, Indicator Species Analysis, Cluster Analysis Calculating Diversity Indices The vegan package contains the command diversity() for calculating Shannon and Simpson diversity indices.
More informationPrincipal Component Analysis (PCA) Theory, Practice, and Examples
Principal Component Analysis (PCA) Theory, Practice, and Examples Data Reduction summarization of data with many (p) variables by a smaller set of (k) derived (synthetic, composite) variables. p k n A
More informationUse R! Series Editors:
Use R! Series Editors: Robert Gentleman Kurt Hornik Giovanni G. Parmigiani Use R! Albert: Bayesian Computation with R Bivand/Pebesma/Gómez-Rubio: Applied Spatial Data Analysis with R Cook/Swayne: Interactive
More informationMultivariate Data Analysis a survey of data reduction and data association techniques: Principal Components Analysis
Multivariate Data Analysis a survey of data reduction and data association techniques: Principal Components Analysis For example Data reduction approaches Cluster analysis Principal components analysis
More informationDIMENSION REDUCTION AND CLUSTER ANALYSIS
DIMENSION REDUCTION AND CLUSTER ANALYSIS EECS 833, 6 March 2006 Geoff Bohling Assistant Scientist Kansas Geological Survey geoff@kgs.ku.edu 864-2093 Overheads and resources available at http://people.ku.edu/~gbohling/eecs833
More informationHierarchical, Multi-scale decomposition of species-environment relationships
Landscape Ecology 17: 637 646, 2002. 2002 Kluwer Academic Publishers. Printed in the Netherlands. 637 Hierarchical, Multi-scale decomposition of species-environment relationships Samuel A. Cushman* and
More informationDescribing Change over Time: Adding Linear Trends
Describing Change over Time: Adding Linear Trends Longitudinal Data Analysis Workshop Section 7 University of Georgia: Institute for Interdisciplinary Research in Education and Human Development Section
More informationPrinciple Components Analysis (PCA) Relationship Between a Linear Combination of Variables and Axes Rotation for PCA
Principle Components Analysis (PCA) Relationship Between a Linear Combination of Variables and Axes Rotation for PCA Principle Components Analysis: Uses one group of variables (we will call this X) In
More informationMultivariate analysis of genetic data: exploring groups diversity
Multivariate analysis of genetic data: exploring groups diversity T. Jombart Imperial College London Bogota 01-12-2010 1/42 Outline Introduction Clustering algorithms Hierarchical clustering K-means Multivariate
More informationEXAM PRACTICE. 12 questions * 4 categories: Statistics Background Multivariate Statistics Interpret True / False
EXAM PRACTICE 12 questions * 4 categories: Statistics Background Multivariate Statistics Interpret True / False Stats 1: What is a Hypothesis? A testable assertion about how the world works Hypothesis
More informationDistance Measures. Objectives: Discuss Distance Measures Illustrate Distance Measures
Distance Measures Objectives: Discuss Distance Measures Illustrate Distance Measures Quantifying Data Similarity Multivariate Analyses Re-map the data from Real World Space to Multi-variate Space Distance
More informationPrincipal Component Analysis (PCA) Principal Component Analysis (PCA)
Recall: Eigenvectors of the Covariance Matrix Covariance matrices are symmetric. Eigenvectors are orthogonal Eigenvectors are ordered by the magnitude of eigenvalues: λ 1 λ 2 λ p {v 1, v 2,..., v n } Recall:
More informationPharmaceutical Experimental Design and Interpretation
Pharmaceutical Experimental Design and Interpretation N. ANTHONY ARMSTRONG, B. Pharm., Ph.D., F.R.Pharm.S., MCPP. KENNETH C. JAMES, M. Pharm., Ph.D., D.Sc, FRSC, F.R.Pharm.S., C.Chem. Welsh School of Pharmacy,
More informationstatistical methods for tailoring seasonal climate forecasts Andrew W. Robertson, IRI
statistical methods for tailoring seasonal climate forecasts Andrew W. Robertson, IRI tailored seasonal forecasts why do we make probabilistic forecasts? to reduce our uncertainty about the (unknown) future
More informationDimensionality Reduction Techniques (DRT)
Dimensionality Reduction Techniques (DRT) Introduction: Sometimes we have lot of variables in the data for analysis which create multidimensional matrix. To simplify calculation and to get appropriate,
More informationInter Item Correlation Matrix (R )
7 1. I have the ability to influence my child s well-being. 2. Whether my child avoids injury is just a matter of luck. 3. Luck plays a big part in determining how healthy my child is. 4. I can do a lot
More informationEcography. Supplementary material
Ecography ECOG-02663 Dambros, C. S., Moraris, J. W., Azevedo, R. A. and Gotelli, N. J. 2016. Isolation by distance, not rivers, control the distribution of termite species in the Amazonian rain forest.
More informationCourse in Data Science
Course in Data Science About the Course: In this course you will get an introduction to the main tools and ideas which are required for Data Scientist/Business Analyst/Data Analyst. The course gives an
More informationA User's Guide To Principal Components
A User's Guide To Principal Components J. EDWARD JACKSON A Wiley-Interscience Publication JOHN WILEY & SONS, INC. New York Chichester Brisbane Toronto Singapore Contents Preface Introduction 1. Getting
More information2/7/2018. Strata. Strata
The strata option allows you to control how permutations are done. Specifically, to constrain permutations. Why would you want to do this? In this dataset, there are clear differences in area (A vs. B),
More informationStructural Equation Modeling and Confirmatory Factor Analysis. Types of Variables
/4/04 Structural Equation Modeling and Confirmatory Factor Analysis Advanced Statistics for Researchers Session 3 Dr. Chris Rakes Website: http://csrakes.yolasite.com Email: Rakes@umbc.edu Twitter: @RakesChris
More information1km. the estuary (J). The spatial extent of a site (50m x25m) is approximately as tall as each letter and twice as wide
1km Fig. 5. Map of sampling sites within the Orewa estuary. Sites are labeled alphabetically and sequentially from the estuary mouth (A) to the inner reaches of the estuary (J). The spatial extent of a
More informationVector Space Models. wine_spectral.r
Vector Space Models 137 wine_spectral.r Latent Semantic Analysis Problem with words Even a small vocabulary as in wine example is challenging LSA Reduce number of columns of DTM by principal components
More informationLECTURE 4 PRINCIPAL COMPONENTS ANALYSIS / EXPLORATORY FACTOR ANALYSIS
LECTURE 4 PRINCIPAL COMPONENTS ANALYSIS / EXPLORATORY FACTOR ANALYSIS NOTES FROM PRE- LECTURE RECORDING ON PCA PCA and EFA have similar goals. They are substantially different in important ways. The goal
More informationPrincipal component analysis
Principal component analysis Motivation i for PCA came from major-axis regression. Strong assumption: single homogeneous sample. Free of assumptions when used for exploration. Classical tests of significance
More informationAnalysis of Multivariate Ecological Data
Analysis of Multivariate Ecological Data School on Recent Advances in Analysis of Multivariate Ecological Data 24-28 October 2016 Prof. Pierre Legendre Dr. Daniel Borcard Département de sciences biologiques
More informationCommunity surveys through space and time: testing the space-time interaction in the absence of replication
Community surveys through space and time: testing the space-time interaction in the absence of replication Pierre Legendre, Miquel De Cáceres & Daniel Borcard Département de sciences biologiques, Université
More informationTesting for group differences in brain functional connectivity
Testing for group differences in brain functional connectivity Junghi Kim, Wei Pan, for ADNI Division of Biostatistics, School of Public Health, University of Minnesota, Minneapolis, MN 55455 Banff Feb
More informationVisualizing Tests for Equality of Covariance Matrices Supplemental Appendix
Visualizing Tests for Equality of Covariance Matrices Supplemental Appendix Michael Friendly and Matthew Sigal September 18, 2017 Contents Introduction 1 1 Visualizing mean differences: The HE plot framework
More informationPrincipal Component Analysis
I.T. Jolliffe Principal Component Analysis Second Edition With 28 Illustrations Springer Contents Preface to the Second Edition Preface to the First Edition Acknowledgments List of Figures List of Tables
More informationNoise & Data Reduction
Noise & Data Reduction Paired Sample t Test Data Transformation - Overview From Covariance Matrix to PCA and Dimension Reduction Fourier Analysis - Spectrum Dimension Reduction 1 Remember: Central Limit
More informationPRINCIPAL COMPONENT ANALYSIS
PRINCIPAL COMPONENT ANALYSIS 1 INTRODUCTION One of the main problems inherent in statistics with more than two variables is the issue of visualising or interpreting data. Fortunately, quite often the problem
More informationStatistics: A review. Why statistics?
Statistics: A review Why statistics? What statistical concepts should we know? Why statistics? To summarize, to explore, to look for relations, to predict What kinds of data exist? Nominal, Ordinal, Interval
More informationHow rural the EU RDP is? An analysis through spatial funds allocation
How rural the EU RDP is? An analysis through spatial funds allocation Beatrice Camaioni, Roberto Esposti, Antonello Lobianco, Francesco Pagliacci, Franco Sotte Department of Economics and Social Sciences
More informationTemporal eigenfunction methods for multiscale analysis of community composition and other multivariate data
Temporal eigenfunction methods for multiscale analysis of community composition and other multivariate data Pierre Legendre Département de sciences biologiques Université de Montréal Pierre.Legendre@umontreal.ca
More informationIntroduction to Machine Learning
10-701 Introduction to Machine Learning PCA Slides based on 18-661 Fall 2018 PCA Raw data can be Complex, High-dimensional To understand a phenomenon we measure various related quantities If we knew what
More informationOverview of clustering analysis. Yuehua Cui
Overview of clustering analysis Yuehua Cui Email: cuiy@msu.edu http://www.stt.msu.edu/~cui A data set with clear cluster structure How would you design an algorithm for finding the three clusters in this
More informationFitted vectors. Ordination and environment. Alternatives to vectors. Example: River bryophytes
Vegetation Analysis Ordination and environment Slide 1 Vegetation Analysis Ordination and environment Slide Ordination and environment We take granted that vegetation is controlled by environment, so 1.
More informationPreface. List of examples
Contents Preface List of examples i xix 1 LISREL models and methods 1 1.1 The general LISREL model 1 Assumptions 2 The covariance matrix of the observations as implied by the LISREL model 3 Fixed, free,
More informationVarCan (version 1): Variation Estimation and Partitioning in Canonical Analysis
VarCan (version 1): Variation Estimation and Partitioning in Canonical Analysis Pedro R. Peres-Neto March 2005 Department of Biology University of Regina Regina, SK S4S 0A2, Canada E-mail: Pedro.Peres-Neto@uregina.ca
More informationCPSC 340: Machine Learning and Data Mining. Sparse Matrix Factorization Fall 2018
CPSC 340: Machine Learning and Data Mining Sparse Matrix Factorization Fall 2018 Last Time: PCA with Orthogonal/Sequential Basis When k = 1, PCA has a scaling problem. When k > 1, have scaling, rotation,
More informationGeneralized Linear Models (GLZ)
Generalized Linear Models (GLZ) Generalized Linear Models (GLZ) are an extension of the linear modeling process that allows models to be fit to data that follow probability distributions other than the
More informationTextbook Examples of. SPSS Procedure
Textbook s of IBM SPSS Procedures Each SPSS procedure listed below has its own section in the textbook. These sections include a purpose statement that describes the statistical test, identification of
More informationPrincipal Component Analysis-I Geog 210C Introduction to Spatial Data Analysis. Chris Funk. Lecture 17
Principal Component Analysis-I Geog 210C Introduction to Spatial Data Analysis Chris Funk Lecture 17 Outline Filters and Rotations Generating co-varying random fields Translating co-varying fields into
More informationPCA and admixture models
PCA and admixture models CM226: Machine Learning for Bioinformatics. Fall 2016 Sriram Sankararaman Acknowledgments: Fei Sha, Ameet Talwalkar, Alkes Price PCA and admixture models 1 / 57 Announcements HW1
More informationDistance-based multivariate analyses confound location and dispersion effects
Methods in Ecology and Evolution 2012, 3, 89 101 doi: 10.1111/j.2041-210X.2011.00127.x Distance-based multivariate analyses confound location and dispersion effects David I. Warton 1 *, Stephen T. Wright
More informationIntroduction to multivariate analysis Outline
Introduction to multivariate analysis Outline Why do a multivariate analysis Ordination, classification, model fitting Principal component analysis Discriminant analysis, quickly Species presence/absence
More informationCorrespondence Analysis of Longitudinal Data
Correspondence Analysis of Longitudinal Data Mark de Rooij* LEIDEN UNIVERSITY, LEIDEN, NETHERLANDS Peter van der G. M. Heijden UTRECHT UNIVERSITY, UTRECHT, NETHERLANDS *Corresponding author (rooijm@fsw.leidenuniv.nl)
More informationsphericity, 5-29, 5-32 residuals, 7-1 spread and level, 2-17 t test, 1-13 transformations, 2-15 violations, 1-19
additive tree structure, 10-28 ADDTREE, 10-51, 10-53 EXTREE, 10-31 four point condition, 10-29 ADDTREE, 10-28, 10-51, 10-53 adjusted R 2, 8-7 ALSCAL, 10-49 ANCOVA, 9-1 assumptions, 9-5 example, 9-7 MANOVA
More informationChapter 1 Statistical Inference
Chapter 1 Statistical Inference causal inference To infer causality, you need a randomized experiment (or a huge observational study and lots of outside information). inference to populations Generalizations
More informationB. Weaver (24-Mar-2005) Multiple Regression Chapter 5: Multiple Regression Y ) (5.1) Deviation score = (Y i
B. Weaver (24-Mar-2005) Multiple Regression... 1 Chapter 5: Multiple Regression 5.1 Partial and semi-partial correlation Before starting on multiple regression per se, we need to consider the concepts
More informationStatistical Analysis. G562 Geometric Morphometrics PC 2 PC 2 PC 3 PC 2 PC 1. Department of Geological Sciences Indiana University
PC 2 PC 2 G562 Geometric Morphometrics Statistical Analysis PC 2 PC 1 PC 3 Basic components of GMM Procrustes Whenever shapes are analyzed together, they must be superimposed together This aligns shapes
More informationAlgebra of Principal Component Analysis
Algebra of Principal Component Analysis 3 Data: Y = 5 Centre each column on its mean: Y c = 7 6 9 y y = 3..6....6.8 3. 3.8.6 Covariance matrix ( variables): S = -----------Y n c ' Y 8..6 c =.6 5.8 Equation
More informationNoise & Data Reduction
Noise & Data Reduction Andreas Wichert - Teóricas andreas.wichert@inesc-id.pt 1 Paired Sample t Test Data Transformation - Overview From Covariance Matrix to PCA and Dimension Reduction Fourier Analysis
More informationAn Introduction to Applied Multivariate Analysis with R
~ Snrinuer Brian Everitt Torsten Hathorn An Introduction to Applied Multivariate Analysis with R > Preface........................................................ vii 1 Multivariate Data and Multivariate
More informationCharacterizing and predicting cyanobacterial blooms in an 8-year
1 2 3 4 5 Characterizing and predicting cyanobacterial blooms in an 8-year amplicon sequencing time-course Authors Nicolas Tromas 1*, Nathalie Fortin 2, Larbi Bedrani 1, Yves Terrat 1, Pedro Cardoso 4,
More informationANCOVA. ANCOVA allows the inclusion of a 3rd source of variation into the F-formula (called the covariate) and changes the F-formula
ANCOVA Workings of ANOVA & ANCOVA ANCOVA, Semi-Partial correlations, statistical control Using model plotting to think about ANCOVA & Statistical control You know how ANOVA works the total variation among
More informationToo good to be true: pitfalls of using mean Ellenberg indicator values in vegetation analyses
&& (2) Too good to be true: pitfalls of using mean Ellenberg indicator values in vegetation analyses David Zelený & André P. Schaffers Keywords Bio-indication; Circularity of reasoning; Compositional similarity;
More informationNONLINEAR REDUNDANCY ANALYSIS AND CANONICAL CORRESPONDENCE ANALYSIS BASED ON POLYNOMIAL REGRESSION
Ecology, 8(4),, pp. 4 by the Ecological Society of America NONLINEAR REDUNDANCY ANALYSIS AND CANONICAL CORRESPONDENCE ANALYSIS BASED ON POLYNOMIAL REGRESSION VLADIMIR MAKARENKOV, AND PIERRE LEGENDRE, Département
More informationR in Linguistic Analysis. Wassink 2012 University of Washington Week 6
R in Linguistic Analysis Wassink 2012 University of Washington Week 6 Overview R for phoneticians and lab phonologists Johnson 3 Reading Qs Equivalence of means (t-tests) Multiple Regression Principal
More informationTesting the significance of canonical axes in redundancy analysis
Methods in Ecology and Evolution 2011, 2, 269 277 doi: 10.1111/j.2041-210X.2010.00078.x Testing the significance of canonical axes in redundancy analysis Pierre Legendre 1 *, Jari Oksanen 2 and Cajo J.
More informationData Handling Methods
Data Handling Methods W a v e l e n g t h ( n m ) Data Handling Methods A Peak picking Fluorescence indices Regional Integration Data reduction 4 0 0. 0 0 3 7 5. 0 0 3 5 0. 0 0 3 2 5. 0 0 3 0 0. 0 0 2
More information