Multivariate Statistics Summary and Comparison of Techniques. Multivariate Techniques
|
|
- Gyles Tobias Nichols
- 5 years ago
- Views:
Transcription
1 Multivariate Statistics Summary and Comparison of Techniques P The key to multivariate statistics is understanding conceptually the relationship among techniques with regards to: < The kinds of problems each technique is suited for < The objective(s) of each technique < The data structure required for each technique < Sampling considerations for each technique < Underlying mathematical model, or lack thereof, of each technique < Potential for complementary use of techniques 1 Model y1 + y yi y 1 + y y i = x y 1 + y y i = x 1 + x x j Techniques Unconstrained Ordination Multivariate ANOVA Multi-Response Permutation Analysis of Similarities Mantel Test Discriminant Analysis Logistic Regression Classification Trees Indicator Species Analysis Constrained Ordination Canonical Correlation Multivariate Regression Trees 2
2 Technique Objective Unconstrained Ordination (PCA, MDS, CA, DCA, NMDS) (Family of techinques) Discrimination (MANOVA, MRPP, ANOSIM, Mantel, DA, LR, CART, ISA) Constrained Ordination (RDA, CCA, CAP) 3 Extract gradients of maximum variation Establish groups of similar entities Test for & describe differences among groups of entities or predict group membership Extract gradients of variation in dependent variables explainable by independent variables Technique Unconstrained Ordination (PCA, MDS, CA, DCA, NMDS) (Family of techinques) Discrimination (MANOVA, MRPP, ANOSIM, Mantel, DA, LR, CART, ISA) Constrained Ordination (RDA, CCA, CAP) Variance Emphasis Emphasizes variation among individual sampling entities by defining gradients of maximum total sample variance; describes the interentity variance structure. 4
3 Technique Unconstrained Ordination (PCA, MDS, CA, DCA, NMDS) (Family of techinques) Discrimination (MANOVA, MRPP, ANOSIM, Mantel, DA, LR, CART, ISA) Constrained Ordination (RDA, CCA, CAP) Variance Emphasis Emphasizes both differences and similarities among individual sampling entities by clustering entities based on inter-entity resemblance. 5 Technique Unconstrained Ordination (PCA, MDS, CA, DCA, NMDS) (Family of techinques) Discrimination (MANOVA, MRPP, ANOSIM, Mantel, DA, LR, CART, ISA) Constrained Ordination (RDA, CCA, CAP) Variance Emphasis Emphasizes variation among groups of sampling entities; describes the inter-group variance structure. 6
4 Technique Unconstrained Ordination (PCA, MDS, CA, DCA, NMDS) (Family of techinques) Discrimination (MANOVA, MRPP, ANOSIM, Mantel, DA, LR, CART, ISA) Constrained Ordination (RDA, CCA, CAP) Variance Emphasis Emphasizes variation among individual sampling entities by defining gradients of maximum total sample variance explainable by environmental variables 7 Technique Unconstrained Ordination (PCA, MDS, CA, DCA, NMDS) (Family of techinques) Discrimination (MANOVA, MRPP, ANOSIM, Mantel, DA, LR, CART, ISA) Constrained Ordination (RDA, CCA, CAP) Dependence Type Interdependence Interdependence Dependence Dependence (& interdependence?) 8
5 Technique Unconstrained Ordination (PCA, MDS, CA, DCA, NMDS) (Family of techinques) Discrimination (MANOVA, MRPP, ANOSIM, Mantel, DA, LR, CART, ISA) Constrained Ordination (RDA, CCA, CAP) Data Structure One set; >>2 variables (Y) One set; >>2 varibles (Y) Two sets; 1 grouping variable (X), >>2 discriminating variables (Y) Two sets; >>2 response (Y) variables, $1 explanatory (X) variables 9 Obs Group Y-set X-set 1 A a 11 a 12 a a 1p b 11 b 12 b b 1m 2 A a 21 a 22 a a 2p b 21 b 22 b b 2m 3 A a 31 a 32 a a 3p b 31 b 32 b b 3m n A a n1 a n2 a n3... a np b n1 b n2 b n3... b nm n+1 C c 11 c 12 c c 1p n+2 C c 21 c 22 c c 2p n+3 C c 31 c 32 c c 3p N C c n1 c n2 c n3... c np Unconstrained Ordination (PCA, PCO, CA, DCA, NMDS) (Family of techinques) 10
6 Obs Group Y-set X-set 1 A a 11 a 12 a a 1p b 11 b 12 b b 1m 2 A a 21 a 22 a a 2p b 21 b 22 b b 2m 3 A a 31 a 32 a a 3p b 31 b 32 b b 3m n A a n1 a n2 a n3... a np b n1 b n2 b n3... b nm n+1 C c 11 c 12 c c 1p n+2 C c 21 c 22 c c 2p n+3 C c 31 c 32 c c 3p N C c n1 c n2 c n3... c np Discrimination Techniques (MANOVA, MRPP, ANOSIM, Mantel; DA, LR, CART, ISA) 11 Obs Group Y-set X-set 1 A a 11 a 12 a a 1p b 11 b 12 b b 1m 2 A a 21 a 22 a a 2p b 21 b 22 b b 2m 3 A a 31 a 32 a a 3p b 31 b 32 b b 3m n A a n1 a n2 a n3... a np b n1 b n2 b n3... b nm n+1 C c 11 c 12 c c 1p n+2 C c 21 c 22 c c 2p n+3 C c 31 c 32 c c 3p N C c n1 c n2 c n3... c np Constrained Ordination (RDA, CCA, CAP, COR) MRT 12
7 Technique Unconstrained Ordination (PCA, MDS, CA, DCA, NMDS) (Family of techinques) Discrimination (MANOVA, MRPP, ANOSIM, Mantel, DA, LR, CART, ISA) Constrained Ordination (RDA, CCA, CAP) Sample Characteristics N (from one or unknown # pops) N (from unknown # pop's) N (from known # pop's) or N1, N2,... (from separate pop's) N (from one pop) 13 P Describe the major ecological gradients of variation among individual sampling entities, and/or to portray sampling entities along "continuous" gradients of maximum sample variation, then use... Unconstrained Ordination 14
8 P Describe the major ecological gradients of variation among individual sampling entities, and/or to portray sampling entities along "continuous" gradients of maximum sample variation, then use... P Assume linear relationship to ecological gradients... Unconstrained Ordination PCA, PCO(MDS) 15 P Describe the major ecological gradients of variation among individual sampling entities, and/or to portray sampling entities along "continuous" gradients of maximum sample variation, then use... P Assume linear relationship to ecological gradients... P Assume unimodal relationship to ecological gradients... Unconstrained Ordination PCA, PCO(MDS) CA(RA), DCA 16
9 P Describe the major ecological gradients of variation among individual sampling entities, and/or to portray sampling entities along "continuous" gradients of maximum sample variation, then use... P Assume linear relationship to ecological gradients... P Assume unimodal relationship to ecological gradients... P Assume no particular relationship; only monotonic relationship between input and output dissimilarities... Unconstrained Ordination PCA, PCO(MDS) CA(RA), DCA Alt: PCA* NMDS 17 P Establish artificial classes or groups of similar entities where pre-specified, welldefined groups do not already exist, and/or to portray sampling entities in "discrete" groups, then use... 18
10 P Establish artificial classes or groups of similar entities where pre-specified, welldefined groups do not already exist, and/or to portray sampling entities in "discrete" groups, then use... P Assign entities to a specified number of groups to maximize within-group similarity or form composite clusters... Non-hierarchical 19 P Establish artificial classes or groups of similar entities where pre-specified, welldefined groups do not already exist, and/or to portray sampling entities in "discrete" groups, then use... P Assign entities to a specified number of groups to maximize within-group similarity or form composite clusters... P Assign entities to groups and display relationships among groups as they form Non-hierarchical Hierarchical
11 P Establish artificial classes or groups of entities with similar species composition and abundance where pre-specified, well-defined groups do not already exist, based on measured environmental variables, and/or to portray sampling entities in "discrete" groups representing species assemblages with distinct environmental affinities, then use... Constrained (MRT) 21 P Differentiate among pre-specified, well-defined classes or groups of sampling entities, and to: P Test for significant differences among groups... < Parametric test... MANOVA / DA 22
12 P Differentiate among pre-specified, well-defined classes or groups of sampling entities, and to: P Test for significant differences among groups... < Parametric test... < Nonparametric tests... MANOVA / DA MRPP, ANOSIM, Mantel 23 P Differentiate among pre-specified, well-defined classes or groups of sampling entities, and to: P Describe the major ecological differences among groups... < Assume a linear discrimination function... DA (CAD) 24
13 P Differentiate among pre-specified, well-defined classes or groups of sampling entities, and to: P Describe the major ecological differences among groups... < Assume a linear discrimination function... < Assume a logistic discrimination function... DA (CAD) LR (MLR) 25 P Differentiate among pre-specified, well-defined classes or groups of sampling entities, and to: P Describe the major ecological differences among groups... < Assume a linear discrimination function... < Assume a logistic discrimination function... < Do not assume any particular function... DA (CAD) LR (MLR) CART (UCT) 26
14 P Differentiate among pre-specified, well-defined classes or groups of sampling entities, and to: P Describe the major ecological differences among groups... < Assume a linear discrimination function... < Assume a logistic discrimination function... < Do not assume any particular function... < Identify indicators for each group... DA (CAD) LR (MLR) CART (UCT) ISA 27 P Differentiate among pre-specified, well-defined classes or groups of sampling entities, and to: P Predict group membership of future observations... < Linear classification function... DA (LDF) 28
15 P Differentiate among pre-specified, well-defined classes or groups of sampling entities, and to: P Predict group membership of future observations... < Linear classification function... < Logistic classification function... DA (LDF) LR (MLR) 29 P Differentiate among pre-specified, well-defined classes or groups of sampling entities, and to: P Predict group membership of future observations... < Linear classification function... < Logistic classification function... < Decision tree classifier... DA (LDF) LR (MLR) CART (UCT) 30
16 P Differentiate among pre-specified, well-defined classes or groups of sampling entities, and to: P Predict group membership of future observations... < Linear classification function... < Logistic classification function... < Decision tree classifier... < Other nonparametric classifiers... DA (LDF) LR (MLR) CART (UCT) Kernel K nearest-neighbor 31 P Explain the variation in a single continuous dependent variable using two or more continuous independent variables, and/or to develop a model for predicting the value of the dependent variable from the values of the independent variables, then use... Alternatives: Multiple Linear Regression CART (URT) 32
17 P Explain the variation in a single dichotomous dependent (grouping) variable using two or more continuous and/or categorical independent variables, and/or to develop a model for predicting the group membership of a sampling entity from the values of the independent variables, then use... Alternatives: Multiple Logistic Regression DA CART (UCT) 33 P Describe the major ecological patterns in one set of (response) variables explainable by another set of (explanatory) variables, then use... Constrained Ordination or MRT 34
18 P Describe the major ecological patterns in one set of (response) variables explainable by another set of (explanatory) variables, then use... P Assume linear response function of response variables (species) along linear gradients defined by the explanatory variables (environment)... Constrained Ordination or MRT RDA, CAP 35 P Describe the major ecological patterns in one set of (response) variables explainable by another set of (explanatory) variables, then use... P Assume linear response function of response variables (species) along linear gradients defined by the explanatory variables (environment)... P Assume unimodal response function of response variables (species) along linear gradients defined by the explanatory variables (environment)... Constrained Ordination or MRT RDA, CAP CCA, DCCA Alt: RDA* 36
19 P Describe the major ecological patterns in one set of (response) variables explainable by another set of (explanatory) variables, then use... P Assume linear response function of response variables (species) along linear gradients defined by the explanatory variables (environment)... P Assume unimodal response function of response variables (species) along linear gradients defined by the explanatory variables (environment)... P Do not assume any response function... Constrained Ordination or MRT RDA, CAP CCA, DCCA MRT 37 P Determine the magnitude of the ecological relationships between two sets of variables expressed as distance matrices; i.e., dissimilarities between samples, then use... P Determine the magnitude of the ecological relationships between two sets of variables expressed as distance matrices after accounting for a third set of variables (i.e., Y~X Z), then use... Mantel Test Partial Mantel Test 38
20 Dependence Techniques Independent Variables 39 Dependence Techniques CT = Contingency tables SLR = Simple logistic regression MLR = Multiple logistic regression SRA = Simple linear regression MRA = Multiple linear regression T-test = T-test ANOVA = Analysis of variance UCT = Univar. classification trees URT = Univar. regression trees T 2 -test MANOVA = Multivariate analysis of variance DA = Discriminant analysis ISA RDA = Redundancy analysis CCA = Can. correspond. analysis COR = Canonical corr. analysis MRT = Hotelling s T 2 = Indicator species analysis CAP = Can. prin. coord. analysis = Multivar. regression trees 40
21 Dependence Techniques Independent Variables CT CT CT SLR SLR MLR UCT CT CT CT UCT DA DA SLR MLR UCT UCT DA DA CT RDA CT RDA CT RDA RDA DA CAP DA CAP MRT CAP CAP MRT CCA CCA COR CCA CCA COR SRA SRA MRA SRA MRA T-test ANOVA ANOVA URT URT T 2 -test RDA Manova RDA Manova RDA RDA DA CAP DA CAP MRT CAP CAP MRT ISA CCA ISA CCA COR CCA CCA COR RDA CAP CCA RDA CAP CCA 41 Advantages of Multivariate Statistics P Reflect more accurately the true multidimensional, multivariate nature of natural systems. P Provide a way to handle large data sets with large numbers of variables. P Provide a way of summarizing redundancy in large data sets. P Provide rules for combining variables in an "optimal" way. 42
22 Advantages of Multivariate Statistics P Provide a solution to the multiple comparison problem by controlling experimentwise error rate. P Provide a means of detecting and quantifying truly multivariate patterns that arise out of the correlational structure of the variable set. P Provide a means of exploring complex data sets for patterns and relationships from which hypotheses can be generated and subsequently tested experimentally. 43
4/2/2018. Canonical Analyses Analysis aimed at identifying the relationship between two multivariate datasets. Cannonical Correlation.
GAL50.44 0 7 becki 2 0 chatamensis 0 darwini 0 ephyppium 0 guntheri 3 0 hoodensis 0 microphyles 0 porteri 2 0 vandenburghi 0 vicina 4 0 Multiple Response Variables? Univariate Statistics Questions Individual
More informationMultivariate Analysis of Ecological Data using CANOCO
Multivariate Analysis of Ecological Data using CANOCO JAN LEPS University of South Bohemia, and Czech Academy of Sciences, Czech Republic Universitats- uric! Lanttesbibiiothek Darmstadt Bibliothek Biologie
More informationDETECTING BIOLOGICAL AND ENVIRONMENTAL CHANGES: DESIGN AND ANALYSIS OF MONITORING AND EXPERIMENTS (University of Bologna, 3-14 March 2008)
Dipartimento di Biologia Evoluzionistica Sperimentale Centro Interdipartimentale di Ricerca per le Scienze Ambientali in Ravenna INTERNATIONAL WINTER SCHOOL UNIVERSITY OF BOLOGNA DETECTING BIOLOGICAL AND
More informationExperimental Design and Data Analysis for Biologists
Experimental Design and Data Analysis for Biologists Gerry P. Quinn Monash University Michael J. Keough University of Melbourne CAMBRIDGE UNIVERSITY PRESS Contents Preface page xv I I Introduction 1 1.1
More informationDiscrimination Among Groups. Discrimination Among Groups
Discrimination Among Groups Id Species Canopy Snag Canopy Cover Density Height 1 A 80 1.2 35 2 A 75 0.5 32 3 A 72 2.8 28..... 31 B 35 3.3 15 32 B 75 4.1 25 60 B 15 5.0 3..... 61 C 5 2.1 5 62 C 8 3.4 2
More information4/4/2018. Stepwise model fitting. CCA with first three variables only Call: cca(formula = community ~ env1 + env2 + env3, data = envdata)
0 Correlation matrix for ironmental matrix 1 2 3 4 5 6 7 8 9 10 11 12 0.087451 0.113264 0.225049-0.13835 0.338366-0.01485 0.166309-0.11046 0.088327-0.41099-0.19944 1 1 2 0.087451 1 0.13723-0.27979 0.062584
More informationChapter 11 Canonical analysis
Chapter 11 Canonical analysis 11.0 Principles of canonical analysis Canonical analysis is the simultaneous analysis of two, or possibly several data tables. Canonical analyses allow ecologists to perform
More informationINTRODUCTION TO MULTIVARIATE ANALYSIS OF ECOLOGICAL DATA
INTRODUCTION TO MULTIVARIATE ANALYSIS OF ECOLOGICAL DATA David Zelený & Ching-Feng Li INTRODUCTION TO MULTIVARIATE ANALYSIS Ecologial similarity similarity and distance indices Gradient analysis regression,
More informationSTATISTICS 407 METHODS OF MULTIVARIATE ANALYSIS TOPICS
STATISTICS 407 METHODS OF MULTIVARIATE ANALYSIS TOPICS Principal Component Analysis (PCA): Reduce the, summarize the sources of variation in the data, transform the data into a new data set where the variables
More informationFactors affecting the Power and Validity of Randomization-based Multivariate Tests for Difference among Ecological Assemblages
Factors affecting the Power and Validity of Randomization-based Multivariate Tests for Difference among Ecological Assemblages Cameron Hurst B.Sc. (Hons) This thesis was submitted in fulfillment of the
More informationUnconstrained Ordination
Unconstrained Ordination Sites Species A Species B Species C Species D Species E 1 0 (1) 5 (1) 1 (1) 10 (4) 10 (4) 2 2 (3) 8 (3) 4 (3) 12 (6) 20 (6) 3 8 (6) 20 (6) 10 (6) 1 (2) 3 (2) 4 4 (5) 11 (5) 8 (5)
More informationDiscrimination Among Groups. Classification (and Regression) Trees
Discrimination Among Groups P Are groups significantly different? (How valid are the groups?) < Multivariate Analysis of Variance [(NP)MANOVA] < Multi-Response Permutation Procedures [MRPP] < Analysis
More informationCourse in Data Science
Course in Data Science About the Course: In this course you will get an introduction to the main tools and ideas which are required for Data Scientist/Business Analyst/Data Analyst. The course gives an
More information-Principal components analysis is by far the oldest multivariate technique, dating back to the early 1900's; ecologists have used PCA since the
1 2 3 -Principal components analysis is by far the oldest multivariate technique, dating back to the early 1900's; ecologists have used PCA since the 1950's. -PCA is based on covariance or correlation
More informationMultivariate Data Analysis a survey of data reduction and data association techniques: Principal Components Analysis
Multivariate Data Analysis a survey of data reduction and data association techniques: Principal Components Analysis For example Data reduction approaches Cluster analysis Principal components analysis
More informationMean Ellenberg indicator values as explanatory variables in constrained ordination. David Zelený
Mean Ellenberg indicator values as explanatory variables in constrained ordination David Zelený Heinz Ellenberg Use of mean Ellenberg indicator values in vegetation analysis species composition observed
More informationTextbook Examples of. SPSS Procedure
Textbook s of IBM SPSS Procedures Each SPSS procedure listed below has its own section in the textbook. These sections include a purpose statement that describes the statistical test, identification of
More informationDiscriminant Analysis and Statistical Pattern Recognition
Discriminant Analysis and Statistical Pattern Recognition GEOFFREY J. McLACHLAN Department of Mathematics The University of Queensland St. Lucia, Queensland, Australia A Wiley-Interscience Publication
More informationIntroduction to ordination. Gary Bradfield Botany Dept.
Introduction to ordination Gary Bradfield Botany Dept. Ordination there appears to be no word in English which one can use as an antonym to classification ; I would like to propose the term ordination.
More informationBasics of Multivariate Modelling and Data Analysis
Basics of Multivariate Modelling and Data Analysis Kurt-Erik Häggblom 2. Overview of multivariate techniques 2.1 Different approaches to multivariate data analysis 2.2 Classification of multivariate techniques
More informationMultivariate analysis
Multivariate analysis Prof dr Ann Vanreusel -Multidimensional scaling -Simper analysis -BEST -ANOSIM 1 2 Gradient in species composition 3 4 Gradient in environment site1 site2 site 3 site 4 site species
More informationPrincipal component analysis
Principal component analysis Motivation i for PCA came from major-axis regression. Strong assumption: single homogeneous sample. Free of assumptions when used for exploration. Classical tests of significance
More informationIntroduction to Spatial Analysis. Spatial Analysis. Session organization. Learning objectives. Module organization. GIS and spatial analysis
Introduction to Spatial Analysis I. Conceptualizing space Session organization Module : Conceptualizing space Module : Spatial analysis of lattice data Module : Spatial analysis of point patterns Module
More informationANOVA approach. Investigates interaction terms. Disadvantages: Requires careful sampling design with replication
ANOVA approach Advantages: Ideal for evaluating hypotheses Ideal to quantify effect size (e.g., differences between groups) Address multiple factors at once Investigates interaction terms Disadvantages:
More informationMultivariate Statistics Fundamentals Part 1: Rotation-based Techniques
Multivariate Statistics Fundamentals Part 1: Rotation-based Techniques A reminded from a univariate statistics courses Population Class of things (What you want to learn about) Sample group representing
More informationData Mining. 3.6 Regression Analysis. Fall Instructor: Dr. Masoud Yaghini. Numeric Prediction
Data Mining 3.6 Regression Analysis Fall 2008 Instructor: Dr. Masoud Yaghini Outline Introduction Straight-Line Linear Regression Multiple Linear Regression Other Regression Models References Introduction
More informationClassification for High Dimensional Problems Using Bayesian Neural Networks and Dirichlet Diffusion Trees
Classification for High Dimensional Problems Using Bayesian Neural Networks and Dirichlet Diffusion Trees Rafdord M. Neal and Jianguo Zhang Presented by Jiwen Li Feb 2, 2006 Outline Bayesian view of feature
More informationApplied Multivariate Statistical Analysis Richard Johnson Dean Wichern Sixth Edition
Applied Multivariate Statistical Analysis Richard Johnson Dean Wichern Sixth Edition Pearson Education Limited Edinburgh Gate Harlow Essex CM20 2JE England and Associated Companies throughout the world
More informationAnalysis of Multivariate Ecological Data
Analysis of Multivariate Ecological Data School on Recent Advances in Analysis of Multivariate Ecological Data 24-28 October 2016 Prof. Pierre Legendre Dr. Daniel Borcard Département de sciences biologiques
More informationSmall n, σ known or unknown, underlying nongaussian
READY GUIDE Summary Tables SUMMARY-1: Methods to compute some confidence intervals Parameter of Interest Conditions 95% CI Proportion (π) Large n, p 0 and p 1 Equation 12.11 Small n, any p Figure 12-4
More informationMultivariate Analysis of Ecological Data
Multivariate Analysis of Ecological Data MICHAEL GREENACRE Professor of Statistics at the Pompeu Fabra University in Barcelona, Spain RAUL PRIMICERIO Associate Professor of Ecology, Evolutionary Biology
More informationCHAPTER 2. Types of Effect size indices: An Overview of the Literature
CHAPTER Types of Effect size indices: An Overview of the Literature There are different types of effect size indices as a result of their different interpretations. Huberty (00) names three different types:
More information9/26/17. Ridge regression. What our model needs to do. Ridge Regression: L2 penalty. Ridge coefficients. Ridge coefficients
What our model needs to do regression Usually, we are not just trying to explain observed data We want to uncover meaningful trends And predict future observations Our questions then are Is β" a good estimate
More informationINFORMATION THEORY AND STATISTICS
INFORMATION THEORY AND STATISTICS Solomon Kullback DOVER PUBLICATIONS, INC. Mineola, New York Contents 1 DEFINITION OF INFORMATION 1 Introduction 1 2 Definition 3 3 Divergence 6 4 Examples 7 5 Problems...''.
More information10-810: Advanced Algorithms and Models for Computational Biology. Optimal leaf ordering and classification
10-810: Advanced Algorithms and Models for Computational Biology Optimal leaf ordering and classification Hierarchical clustering As we mentioned, its one of the most popular methods for clustering gene
More informationStatistics Toolbox 6. Apply statistical algorithms and probability models
Statistics Toolbox 6 Apply statistical algorithms and probability models Statistics Toolbox provides engineers, scientists, researchers, financial analysts, and statisticians with a comprehensive set of
More informationGeneralized Linear Models (GLZ)
Generalized Linear Models (GLZ) Generalized Linear Models (GLZ) are an extension of the linear modeling process that allows models to be fit to data that follow probability distributions other than the
More informationEXAM PRACTICE. 12 questions * 4 categories: Statistics Background Multivariate Statistics Interpret True / False
EXAM PRACTICE 12 questions * 4 categories: Statistics Background Multivariate Statistics Interpret True / False Stats 1: What is a Hypothesis? A testable assertion about how the world works Hypothesis
More informationPrerequisite: STATS 7 or STATS 8 or AP90 or (STATS 120A and STATS 120B and STATS 120C). AP90 with a minimum score of 3
University of California, Irvine 2017-2018 1 Statistics (STATS) Courses STATS 5. Seminar in Data Science. 1 Unit. An introduction to the field of Data Science; intended for entering freshman and transfers.
More informationToo good to be true: pitfalls of using mean Ellenberg indicator values in vegetation analyses
&& (2) Too good to be true: pitfalls of using mean Ellenberg indicator values in vegetation analyses David Zelený & André P. Schaffers Keywords Bio-indication; Circularity of reasoning; Compositional similarity;
More informationMultivariate Statistics 101. Ordination (PCA, NMDS, CA) Cluster Analysis (UPGMA, Ward s) Canonical Correspondence Analysis
Multivariate Statistics 101 Ordination (PCA, NMDS, CA) Cluster Analysis (UPGMA, Ward s) Canonical Correspondence Analysis Multivariate Statistics 101 Copy of slides and exercises PAST software download
More informationTechniques and Applications of Multivariate Analysis
Techniques and Applications of Multivariate Analysis Department of Statistics Professor Yong-Seok Choi E-mail: yschoi@pusan.ac.kr Home : yschoi.pusan.ac.kr Contents Multivariate Statistics (I) in Spring
More information5. Discriminant analysis
5. Discriminant analysis We continue from Bayes s rule presented in Section 3 on p. 85 (5.1) where c i is a class, x isap-dimensional vector (data case) and we use class conditional probability (density
More informationBivariate Relationships Between Variables
Bivariate Relationships Between Variables BUS 735: Business Decision Making and Research 1 Goals Specific goals: Detect relationships between variables. Be able to prescribe appropriate statistical methods
More informationData Analysis as a Decision Making Process
Data Analysis as a Decision Making Process I. Levels of Measurement A. NOIR - Nominal Categories with names - Ordinal Categories with names and a logical order - Intervals Numerical Scale with logically
More informationMachine learning comes from Bayesian decision theory in statistics. There we want to minimize the expected value of the loss function.
Bayesian learning: Machine learning comes from Bayesian decision theory in statistics. There we want to minimize the expected value of the loss function. Let y be the true label and y be the predicted
More informationStatistical Machine Learning
Statistical Machine Learning Christoph Lampert Spring Semester 2015/2016 // Lecture 12 1 / 36 Unsupervised Learning Dimensionality Reduction 2 / 36 Dimensionality Reduction Given: data X = {x 1,..., x
More informationClassification: Linear Discriminant Analysis
Classification: Linear Discriminant Analysis Discriminant analysis uses sample information about individuals that are known to belong to one of several populations for the purposes of classification. Based
More informationA User's Guide To Principal Components
A User's Guide To Principal Components J. EDWARD JACKSON A Wiley-Interscience Publication JOHN WILEY & SONS, INC. New York Chichester Brisbane Toronto Singapore Contents Preface Introduction 1. Getting
More informationsphericity, 5-29, 5-32 residuals, 7-1 spread and level, 2-17 t test, 1-13 transformations, 2-15 violations, 1-19
additive tree structure, 10-28 ADDTREE, 10-51, 10-53 EXTREE, 10-31 four point condition, 10-29 ADDTREE, 10-28, 10-51, 10-53 adjusted R 2, 8-7 ALSCAL, 10-49 ANCOVA, 9-1 assumptions, 9-5 example, 9-7 MANOVA
More informationAn Introduction to Ordination Connie Clark
An Introduction to Ordination Connie Clark Ordination is a collective term for multivariate techniques that adapt a multidimensional swarm of data points in such a way that when it is projected onto a
More informationMachine Learning Practice Page 2 of 2 10/28/13
Machine Learning 10-701 Practice Page 2 of 2 10/28/13 1. True or False Please give an explanation for your answer, this is worth 1 pt/question. (a) (2 points) No classifier can do better than a naive Bayes
More informationReview of some concepts in predictive modeling
Harvard-MIT Division of Health Sciences and Technology HST.951J: Medical Decision Support, Fall 2005 Instructors: Professor Lucila Ohno-Machado and Professor Staal Vinterbo Review of some concepts in predictive
More informationLecture 2: Diversity, Distances, adonis. Lecture 2: Diversity, Distances, adonis. Alpha- Diversity. Alpha diversity definition(s)
Lecture 2: Diversity, Distances, adonis Lecture 2: Diversity, Distances, adonis Diversity - alpha, beta (, gamma) Beta- Diversity in practice: Ecological Distances Unsupervised Learning: Clustering, etc
More informationCAP. Canonical Analysis of Principal coordinates. A computer program by Marti J. Anderson. Department of Statistics University of Auckland (2002)
CAP Canonical Analysis of Principal coordinates A computer program by Marti J. Anderson Department of Statistics University of Auckland (2002) 2 DISCLAIMER This FORTRAN program is provided without any
More informationLogistic Regression: Regression with a Binary Dependent Variable
Logistic Regression: Regression with a Binary Dependent Variable LEARNING OBJECTIVES Upon completing this chapter, you should be able to do the following: State the circumstances under which logistic regression
More informationI L L I N O I S UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN
Introduction Edps/Psych/Stat/ 584 Applied Multivariate Statistics Carolyn J Anderson Department of Educational Psychology I L L I N O I S UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN c Board of Trustees,
More information4. Ordination in reduced space
Université Laval Analyse multivariable - mars-avril 2008 1 4.1. Generalities 4. Ordination in reduced space Contrary to most clustering techniques, which aim at revealing discontinuities in the data, ordination
More informationI L L I N O I S UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN
Canonical Edps/Soc 584 and Psych 594 Applied Multivariate Statistics Carolyn J. Anderson Department of Educational Psychology I L L I N O I S UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN Canonical Slide
More informationOutline Week 1 PCA Challenge. Introduction. Multivariate Statistical Analysis. Hung Chen
Introduction Multivariate Statistical Analysis Hung Chen Department of Mathematics https://ceiba.ntu.edu.tw/972multistat hchen@math.ntu.edu.tw, Old Math 106 2009.02.16 1 Outline 2 Week 1 3 PCA multivariate
More informationWolfgang Karl Härdle Leopold Simar. Applied Multivariate. Statistical Analysis. Fourth Edition. ö Springer
Wolfgang Karl Härdle Leopold Simar Applied Multivariate Statistical Analysis Fourth Edition ö Springer Contents Part I Descriptive Techniques 1 Comparison of Batches 3 1.1 Boxplots 4 1.2 Histograms 11
More informationMachine Learning. CUNY Graduate Center, Spring Lectures 11-12: Unsupervised Learning 1. Professor Liang Huang.
Machine Learning CUNY Graduate Center, Spring 2013 Lectures 11-12: Unsupervised Learning 1 (Clustering: k-means, EM, mixture models) Professor Liang Huang huang@cs.qc.cuny.edu http://acl.cs.qc.edu/~lhuang/teaching/machine-learning
More informationDiscriminant Analysis with High Dimensional. von Mises-Fisher distribution and
Athens Journal of Sciences December 2014 Discriminant Analysis with High Dimensional von Mises - Fisher Distributions By Mario Romanazzi This paper extends previous work in discriminant analysis with von
More informationMark your answers ON THE EXAM ITSELF. If you are not sure of your answer you may wish to provide a brief explanation.
CS 189 Spring 2015 Introduction to Machine Learning Midterm You have 80 minutes for the exam. The exam is closed book, closed notes except your one-page crib sheet. No calculators or electronic items.
More informationTypes of Statistical Tests DR. MIKE MARRAPODI
Types of Statistical Tests DR. MIKE MARRAPODI Tests t tests ANOVA Correlation Regression Multivariate Techniques Non-parametric t tests One sample t test Independent t test Paired sample t test One sample
More informationPattern Recognition and Machine Learning
Christopher M. Bishop Pattern Recognition and Machine Learning ÖSpri inger Contents Preface Mathematical notation Contents vii xi xiii 1 Introduction 1 1.1 Example: Polynomial Curve Fitting 4 1.2 Probability
More informationLab 7. Direct & Indirect Gradient Analysis
Lab 7 Direct & Indirect Gradient Analysis Direct and indirect gradient analysis refers to a case where you have two datasets with variables that have cause-and-effect or mutual influences on each other.
More informationMultivariate statistical methods and data mining in particle physics
Multivariate statistical methods and data mining in particle physics RHUL Physics www.pp.rhul.ac.uk/~cowan Academic Training Lectures CERN 16 19 June, 2008 1 Outline Statement of the problem Some general
More informationPATTERN CLASSIFICATION
PATTERN CLASSIFICATION Second Edition Richard O. Duda Peter E. Hart David G. Stork A Wiley-lnterscience Publication JOHN WILEY & SONS, INC. New York Chichester Weinheim Brisbane Singapore Toronto CONTENTS
More informationUse R! Series Editors:
Use R! Series Editors: Robert Gentleman Kurt Hornik Giovanni G. Parmigiani Use R! Albert: Bayesian Computation with R Bivand/Pebesma/Gómez-Rubio: Applied Spatial Data Analysis with R Cook/Swayne: Interactive
More informationOverview of clustering analysis. Yuehua Cui
Overview of clustering analysis Yuehua Cui Email: cuiy@msu.edu http://www.stt.msu.edu/~cui A data set with clear cluster structure How would you design an algorithm for finding the three clusters in this
More information1. Introduction to Multivariate Analysis
1. Introduction to Multivariate Analysis Isabel M. Rodrigues 1 / 44 1.1 Overview of multivariate methods and main objectives. WHY MULTIVARIATE ANALYSIS? Multivariate statistical analysis is concerned with
More informationMultivariate analysis of genetic data: an introduction
Multivariate analysis of genetic data: an introduction Thibaut Jombart MRC Centre for Outbreak Analysis and Modelling Imperial College London XXIV Simposio Internacional De Estadística Bogotá, 25th July
More informationClassification techniques focus on Discriminant Analysis
Classification techniques focus on Discriminant Analysis Seminar: Potentials of advanced image analysis technology in the cereal science research 2111 2005 Ulf Indahl/IMT - 14.06.2010 Task: Supervised
More information2/19/2018. Dataset: 85,122 islands 19,392 > 1km 2 17,883 with data
The group numbers are arbitrary. Remember that you can rotate dendrograms around any node and not change the meaning. So, the order of the clusters is not meaningful. Taking a subset of the data changes
More informationday month year documentname/initials 1
ECE471-571 Pattern Recognition Lecture 13 Decision Tree Hairong Qi, Gonzalez Family Professor Electrical Engineering and Computer Science University of Tennessee, Knoxville http://www.eecs.utk.edu/faculty/qi
More informationData Mining Techniques
Data Mining Techniques CS 6220 - Section 3 - Fall 2016 Lecture 12 Jan-Willem van de Meent (credit: Yijun Zhao, Percy Liang) DIMENSIONALITY REDUCTION Borrowing from: Percy Liang (Stanford) Linear Dimensionality
More informationSTA414/2104. Lecture 11: Gaussian Processes. Department of Statistics
STA414/2104 Lecture 11: Gaussian Processes Department of Statistics www.utstat.utoronto.ca Delivered by Mark Ebden with thanks to Russ Salakhutdinov Outline Gaussian Processes Exam review Course evaluations
More informationFinal Overview. Introduction to ML. Marek Petrik 4/25/2017
Final Overview Introduction to ML Marek Petrik 4/25/2017 This Course: Introduction to Machine Learning Build a foundation for practice and research in ML Basic machine learning concepts: max likelihood,
More informationA Least Squares Formulation for Canonical Correlation Analysis
A Least Squares Formulation for Canonical Correlation Analysis Liang Sun, Shuiwang Ji, and Jieping Ye Department of Computer Science and Engineering Arizona State University Motivation Canonical Correlation
More informationOrdination & PCA. Ordination. Ordination
Ordination & PCA Introduction to Ordination Purpose & types Shepard diagrams Principal Components Analysis (PCA) Properties Computing eigenvalues Computing principal components Biplots Covariance vs. Correlation
More informationMachine Learning Support Vector Machines. Prof. Matteo Matteucci
Machine Learning Support Vector Machines Prof. Matteo Matteucci Discriminative vs. Generative Approaches 2 o Generative approach: we derived the classifier from some generative hypothesis about the way
More informationAn Alternative Algorithm for Classification Based on Robust Mahalanobis Distance
Dhaka Univ. J. Sci. 61(1): 81-85, 2013 (January) An Alternative Algorithm for Classification Based on Robust Mahalanobis Distance A. H. Sajib, A. Z. M. Shafiullah 1 and A. H. Sumon Department of Statistics,
More informationMultivariate Statistical Analysis
Multivariate Statistical Analysis Fall 2011 C. L. Williams, Ph.D. Lecture 3 for Applied Multivariate Analysis Outline 1 Reprise-Vectors, vector lengths and the angle between them 2 3 Partial correlation
More informationLecture 4 Discriminant Analysis, k-nearest Neighbors
Lecture 4 Discriminant Analysis, k-nearest Neighbors Fredrik Lindsten Division of Systems and Control Department of Information Technology Uppsala University. Email: fredrik.lindsten@it.uu.se fredrik.lindsten@it.uu.se
More informationContents Lecture 4. Lecture 4 Linear Discriminant Analysis. Summary of Lecture 3 (II/II) Summary of Lecture 3 (I/II)
Contents Lecture Lecture Linear Discriminant Analysis Fredrik Lindsten Division of Systems and Control Department of Information Technology Uppsala University Email: fredriklindsten@ituuse Summary of lecture
More informationPrinciples of Pattern Recognition. C. A. Murthy Machine Intelligence Unit Indian Statistical Institute Kolkata
Principles of Pattern Recognition C. A. Murthy Machine Intelligence Unit Indian Statistical Institute Kolkata e-mail: murthy@isical.ac.in Pattern Recognition Measurement Space > Feature Space >Decision
More informationIntroduction to multivariate analysis Outline
Introduction to multivariate analysis Outline Why do a multivariate analysis Ordination, classification, model fitting Principal component analysis Discriminant analysis, quickly Species presence/absence
More informationEconometrics I. Lecture 10: Nonparametric Estimation with Kernels. Paul T. Scott NYU Stern. Fall 2018
Econometrics I Lecture 10: Nonparametric Estimation with Kernels Paul T. Scott NYU Stern Fall 2018 Paul T. Scott NYU Stern Econometrics I Fall 2018 1 / 12 Nonparametric Regression: Intuition Let s get
More informationShort Note: Naive Bayes Classifiers and Permanence of Ratios
Short Note: Naive Bayes Classifiers and Permanence of Ratios Julián M. Ortiz (jmo1@ualberta.ca) Department of Civil & Environmental Engineering University of Alberta Abstract The assumption of permanence
More informationDiscriminant Analysis and Statistical Pattern Recognition
Discriminant Analysis and Statistical Pattern Recognition GEOFFRY J. McLACHLAN The University of Queensland @EEC*ENCE A JOHN WILEY & SONS, INC., PUBLICATION This Page Intentionally Left Blank Discriminant
More information8. FROM CLASSICAL TO CANONICAL ORDINATION
Manuscript of Legendre, P. and H. J. B. Birks. 2012. From classical to canonical ordination. Chapter 8, pp. 201-248 in: Tracking Environmental Change using Lake Sediments, Volume 5: Data handling and numerical
More informationAnalysis of community ecology data in R
Analysis of community ecology data in R Jinliang Liu ( 刘金亮 ) Institute of Ecology, College of Life Science Zhejiang University Email: jinliang.liu@foxmail.com http://jinliang.weebly.com R packages ###
More informationEXAM IN STATISTICAL MACHINE LEARNING STATISTISK MASKININLÄRNING
EXAM IN STATISTICAL MACHINE LEARNING STATISTISK MASKININLÄRNING DATE AND TIME: August 30, 2018, 14.00 19.00 RESPONSIBLE TEACHER: Niklas Wahlström NUMBER OF PROBLEMS: 5 AIDING MATERIAL: Calculator, mathematical
More information6. Let C and D be matrices conformable to multiplication. Then (CD) =
Quiz 1. Name: 10 points per correct answer. (20 points for attendance). 1. Let A = 3 and B = [3 yy]. When is A equal to B? xx A. When x = 3 B. When y = 3 C. When x = y D. Never 2. See 1. What is the dimension
More informationCS534 Machine Learning - Spring Final Exam
CS534 Machine Learning - Spring 2013 Final Exam Name: You have 110 minutes. There are 6 questions (8 pages including cover page). If you get stuck on one question, move on to others and come back to the
More informationStructure in Data. A major objective in data analysis is to identify interesting features or structure in the data.
Structure in Data A major objective in data analysis is to identify interesting features or structure in the data. The graphical methods are very useful in discovering structure. There are basically two
More information2 D wavelet analysis , 487
Index 2 2 D wavelet analysis... 263, 487 A Absolute distance to the model... 452 Aligned Vectors... 446 All data are needed... 19, 32 Alternating conditional expectations (ACE)... 375 Alternative to block
More informationProbabilistic Methods in Bioinformatics. Pabitra Mitra
Probabilistic Methods in Bioinformatics Pabitra Mitra pabitra@cse.iitkgp.ernet.in Probability in Bioinformatics Classification Categorize a new object into a known class Supervised learning/predictive
More information