Applied Multivariate Analysis
|
|
- Gwendolyn Haynes
- 6 years ago
- Views:
Transcription
1 Department of Mathematics and Statistics, University of Vaasa, Finland Spring 2017
2 Dimension reduction Principal Component Analysis (PCA)
3 The problem in exploratory multivariate data analysis usually is the large number of variables. Consentration of the number of variables to fewer new variables is one form of data reduction. Major tools in this process is principal component analysis (PCA) and exploratory factor analysis (FA). PCA is a technical transformation and FA is model based.
4 The aim in PCA is to replace the original variables, x 1, x 2,..., x p, by few new variables, y 1,..., y k, that are linear combinations of the x-variables, preserve essentially all the information in the x-variables, and are uncorrelated with each other.
5 More formally: The first principal component is y 1 = a 11 x 1 + a 12 x a 1p x p, (1) where the coefficients, a 1j (j = 1,..., p) are defined such that var[y 1 ] = under the restriction (scaling constraint) max var[a 11x a 1p x p ] (2) (a 11,...,a 1p ) a a 2 1p = 1. (3)
6 The second principal component is with a 2j defined such that var[y 2 ] = y 2 = a 21 x 1 + a 22 x a 2p x p (4) max var[a 21x a 2p x p ], (5) (a 21,...,a 2p ) a a 2 2p = 1, (6) and cov[y 1, y 2 ] = 0. (7)
7 Altogether there are p principal components, but not all of them are important. Thus, through the principal components a set of correlated variables are transformed a set of uncorrelated variables.
8 Mathematically the principal components are a solution of the eigenvalues of the covariance matrix of x-variables. The coefficients of the first PC are the elements of the eigenvector corresponding to the largest eigenvalue, the coefficients of the second PC are the elements of the eigenvector of the second largest eigenvalue, and so on. Remark 3.1: The principal component analysis is usually in practice obtained from the correlation matrix rather than the covariance matrix. Correlations are scale free, while covariances are not. Remark 3.2: PC solution from a correlation matrix is different from that of a covariance matrix.
9 Let l i denote the ith iegenvalue of or correlation matrix (or covariance matrix) of the the x-variables, such that l 1 l 2 l p, then p p var[x i ] = l i (8) and i=1 i=1 var[y i ] = l i. (9) Thus, the ith component explains l i 100 p j=1 var[x j] % (10) of the total variance of the x-variables. Remark 3.3: In the case of correlation matrix, the variables are standarized with unit variance, i.e., var[x j ] = 1 and p j=1 var[x j] = p. Thus the explanatory power of the ith component extracted from the correlation matix is lapplied i Multivariate Analysis
10 Assuming that the components are extracted form the correlatin matrix, correlation of the original variable x i with the component y j are given by corr[x i, y j ] = a ji lj, (12) and are called loadings. Thus, the loadings (correlations) are just scaled the eigenvector coefficients, but may be easier to interpret, because correlations are between 1 and 1. If varibales with high correlation have something common that can be used as the basis for the naming. Remark 3.4: If the components are extracted from the covariance matrix the loadings are where s i is the standard deviation of x i. corr[x i, y j ] = a ji lj s i, (13)
11 Example 1 Crime rates in the USA in 2005 per 100,000 people by states. Source: Violent crimes: murder and nonnegligent manslaughter, forcible rape, robbery, and aggarvated assault. Property crimes: burglary, larceny-theft, and motor vehicle theft. Using SAS PROC PRINCOMP, the results are: proc princomp data = uscrime2005 out = uscrime_components; title US crime rates per 100,000 population by state ; var murder rape robbery assault burglary larceny auto; run;
12 US crime rates per 100,000 population by state, year 2005 Simple Statistics murder rape robbery assault burglary larceny auto Mean StD Correlation Matrix murder rape robbery assault burglary larceny auto murder rape robbery assault burglary larceny auto Eigenvalues of the Correlation Matrix Eigenvalue Difference Proportion Cumulative
13 Eigenvectors Prin1 Prin2 Prin3 Prin4 Prin5 Prin6 Prin7 murder rape robbery assault burglary larceny auto The eigenvalues indicate that two (or three) components provide a good summary of the data. Of the total variance 76% is accounted by the first two components and 85% by the first three components.
14 The loadings matrix for the first three components: Principal component loadings Prin1 Prin2 Prin3 murder rape robbery assault burglary larceny auto All loadings for the first component are about the same and fairly high except for rape. Thus, the first component describes general criminality. The second component loads (positive) high on rape, larceny, and burglary and negative high on murder and assault. Thus this component seems to measure the preponderance of property and sexual crime over violent crimes (other than sexual) and vice versa (sign of an eigenvector can be changed). These kinds of components are called bipolar. Here it means that high
15 The third component is not that clear but high values of the component indicate those states where rape and assault crimes are high while property crimes tend to be below average. On the other hand, again high negative value indicate high level of property crime.
16 Number of components A rule of thumb to decide the number of meaningful components is select those for which the eigenvalue is equal or greater then 1 (e.g. SPSS uses this as an automatic rule). Another criterion is the so called Cattell s scree test. The rule is to retain all the eigenvalues (hence, the number of components) in the sharp descent (before the elbow point ) in the plot of eigenvalues against the their ordinal number. Usually there is a discernible drop (break point) before the eigenvalues start to level in the plot.
17 Cattell s Scree Plot for the Crime 2005 Data The eigenvalue criterion supports two components and the scree test two or three. We have selected three.
18 Significant coefficients The loadings (scaled component coefficients) are correlations. It can be shown that if the population correlation is zero, the sample correlation is asymptotically normally distributed with zero mean and variance 1/(n 1), where n is the sample size. Using this we can use the rule that those coefficients are statistically significant that are plus/minus two standard errors away from zero. stderr = 1/ n 1 (14)
19 In the crime data n = 52, thus those coefficients are statistically significant whose loadings are on absolute value larger than 2 n 1 = (15) Thus, for the first component all the coefficients are statistically significant, for the second all but assault and auto, and for the third rape and larceny, while assault and burglary are on the borderline.
20 Recap The main usage of principal components are for indexes and for new variables in subsequent studies. PCA is not a statistical model. It is merely a linear transformation of original variables to new variables for the purpose of reducing the dimensionality of the problem (concentrate information).
PRINCIPAL COMPONENTS ANALYSIS (PCA)
PRINCIPAL COMPONENTS ANALYSIS (PCA) Introduction PCA is considered an exploratory technique that can be used to gain a better understanding of the interrelationships between variables. PCA is performed
More informationPrincipal Component Analysis & Factor Analysis. Psych 818 DeShon
Principal Component Analysis & Factor Analysis Psych 818 DeShon Purpose Both are used to reduce the dimensionality of correlated measurements Can be used in a purely exploratory fashion to investigate
More informationPrincipal Component Analysis. Applied Multivariate Statistics Spring 2012
Principal Component Analysis Applied Multivariate Statistics Spring 2012 Overview Intuition Four definitions Practical examples Mathematical example Case study 2 PCA: Goals Goal 1: Dimension reduction
More informationAll data is subject to further review and change.
All data is subject to further review and change. 5500 5000 4500 32% Increase Number or Incidents 4000 3500 3000 2500 2000 1500 21% Decrease 3742 4936 2008 2009 1000 500 0 1052 Part I 827 Part II 800 700
More informationU.C. Davis FBI Part I & Part II Crime Offenses 2008 to 2010
U.C. Davis FBI Part I & Part II Crime Offenses 2008 to 2010 2010 PART I OFFENSES 2008 Number of Actual Offenses 2009 Number of Actual Offenses 2010 Number of Actual Offenses 2009 to 2010 Percent Change
More informationPrincipal Component Analysis (PCA) Principal Component Analysis (PCA)
Recall: Eigenvectors of the Covariance Matrix Covariance matrices are symmetric. Eigenvectors are orthogonal Eigenvectors are ordered by the magnitude of eigenvalues: λ 1 λ 2 λ p {v 1, v 2,..., v n } Recall:
More informationFBI Part I & Part II Crime Offenses Arrests Miscellaneous Activity Value of Stolen Property Crime Pie Charts Crime Line Charts Crime Rate Charts
U.C. Davis Medical Center Crime Statistics (Medical Center) PDF Version FBI Part I & Part II Crime Offenses Arrests Miscellaneous Activity Value of Stolen Property Crime Pie Charts Crime Line Charts Crime
More informationApplied Multivariate Analysis
Department of Mathematics and Statistics, University of Vaasa, Finland Spring 2017 Dimension reduction Exploratory (EFA) Background While the motivation in PCA is to replace the original (correlated) variables
More informationPrinciple Components Analysis (PCA) Relationship Between a Linear Combination of Variables and Axes Rotation for PCA
Principle Components Analysis (PCA) Relationship Between a Linear Combination of Variables and Axes Rotation for PCA Principle Components Analysis: Uses one group of variables (we will call this X) In
More informationMultivariate Statistics (I) 2. Principal Component Analysis (PCA)
Multivariate Statistics (I) 2. Principal Component Analysis (PCA) 2.1 Comprehension of PCA 2.2 Concepts of PCs 2.3 Algebraic derivation of PCs 2.4 Selection and goodness-of-fit of PCs 2.5 Algebraic derivation
More informationUC POLICE DEPARTMENT REPORTS DASHBOARD
UC POLICE DEPARTMENT REPORTS DASHBOARD UC MERCED Annual 1. UC Merced FBI Part I Crime Offenses 2 2. UC Merced FBI Part II Crime Offenses 3 3. UC Merced Arrests - FBI Crime Offenses 4 4. UC Merced Value
More informationBelton Police Department
Belton Police Department Uniform Crime Report for June 2013 RETURN-A - MONTHLY RETURN OF OFFENSES KNOWN TO THE POLICE 1 Classification of offenses 1 Criminal Homicide A Murder/nonnegligent homicide 2 Offenses
More informationUCLA STAT 233 Statistical Methods in Biomedical Imaging
UCLA STAT 233 Statistical Methods in Biomedical Imaging Instructor: Ivo Dinov, Asst. Prof. In Statistics and Neurology University of California, Los Angeles, Spring 2004 http://www.stat.ucla.edu/~dinov/
More informationPrincipal Components Analysis (PCA)
Principal Components Analysis (PCA) Principal Components Analysis (PCA) a technique for finding patterns in data of high dimension Outline:. Eigenvectors and eigenvalues. PCA: a) Getting the data b) Centering
More informationPrincipal Component Analysis, A Powerful Scoring Technique
Principal Component Analysis, A Powerful Scoring Technique George C. J. Fernandez, University of Nevada - Reno, Reno NV 89557 ABSTRACT Data mining is a collection of analytical techniques to uncover new
More informationLecture 4: Principal Component Analysis and Linear Dimension Reduction
Lecture 4: Principal Component Analysis and Linear Dimension Reduction Advanced Applied Multivariate Analysis STAT 2221, Fall 2013 Sungkyu Jung Department of Statistics University of Pittsburgh E-mail:
More informationPrincipal Component Analysis (PCA) Theory, Practice, and Examples
Principal Component Analysis (PCA) Theory, Practice, and Examples Data Reduction summarization of data with many (p) variables by a smaller set of (k) derived (synthetic, composite) variables. p k n A
More informationChapter 4: Factor Analysis
Chapter 4: Factor Analysis In many studies, we may not be able to measure directly the variables of interest. We can merely collect data on other variables which may be related to the variables of interest.
More informationData reduction for multivariate analysis
Data reduction for multivariate analysis Using T 2, m-cusum, m-ewma can help deal with the multivariate detection cases. But when the characteristic vector x of interest is of high dimension, it is difficult
More informationCrime and Fire Statistics
Monmouth University Police Department Crime Statistics Murder Negligent Manslaughter Forcible Sex Offenses Rape Criminal Sexual Contact Non-Forced Sex Offenses Incest Statutory Rape Robbery Aggravated
More informationCrime and Fire Statistics
Crime and Fire Statistics Monmouth University Police Department Crime Statistics Murder Negligent Manslaughter Forcible Sex Offenses Rape Criminal Sexual Contact Non-Forced Sex Offenses Incest Statutory
More informationPrincipal Components Analysis
Principal Components Analysis Lecture 9 August 2, 2011 Advanced Multivariate Statistical Methods ICPSR Summer Session #2 Lecture #9-8/2/2011 Slide 1 of 54 Today s Lecture Principal Components Analysis
More informationITT Technical Institute, Salem, Oregon Safety and Security Policies with Crime Statistics Report CRIME STATISTICS
ITT Technical Institute, Salem, Oregon Safety and Security Policies with Crime Statistics Report CRIME STATISTICS In each of the specified calendar years, the following number of crimes were reported to
More informationLECTURE 4 PRINCIPAL COMPONENTS ANALYSIS / EXPLORATORY FACTOR ANALYSIS
LECTURE 4 PRINCIPAL COMPONENTS ANALYSIS / EXPLORATORY FACTOR ANALYSIS NOTES FROM PRE- LECTURE RECORDING ON PCA PCA and EFA have similar goals. They are substantially different in important ways. The goal
More informationVAR2 VAR3 VAR4 VAR5. Or, in terms of basic measurement theory, we could model it as:
1 Neuendorf Factor Analysis Assumptions: 1. Metric (interval/ratio) data 2. Linearity (in the relationships among the variables) -Factors are linear constructions of the set of variables (see #8 under
More informationDriving Forces of Houston s Burglary Hotspots During Hurricane Rita
Driving Forces of Houston s Burglary Hotspots During Hurricane Rita Marco Helbich Department of Geography University of Heidelberg Heidelberg, Germany & Michael Leitner Department of Geography and Anthropology
More informationOr, in terms of basic measurement theory, we could model it as:
1 Neuendorf Factor Analysis Assumptions: 1. Metric (interval/ratio) data 2. Linearity (in relationships among the variables--factors are linear constructions of the set of variables; the critical source
More information2/26/2017. This is similar to canonical correlation in some ways. PSY 512: Advanced Statistics for Psychological and Behavioral Research 2
PSY 512: Advanced Statistics for Psychological and Behavioral Research 2 What is factor analysis? What are factors? Representing factors Graphs and equations Extracting factors Methods and criteria Interpreting
More informationAnalysis of Violent Crime in Los Angeles County
Analysis of Violent Crime in Los Angeles County Xiaohong Huang UID: 004693375 March 20, 2017 Abstract Violent crime can have a negative impact to the victims and the neighborhoods. It can affect people
More informationDimensionality Reduction Techniques (DRT)
Dimensionality Reduction Techniques (DRT) Introduction: Sometimes we have lot of variables in the data for analysis which create multidimensional matrix. To simplify calculation and to get appropriate,
More informationClusters. Unsupervised Learning. Luc Anselin. Copyright 2017 by Luc Anselin, All Rights Reserved
Clusters Unsupervised Learning Luc Anselin http://spatial.uchicago.edu 1 curse of dimensionality principal components multidimensional scaling classical clustering methods 2 Curse of Dimensionality 3 Curse
More informationCh. 10 Principal Components Analysis (PCA) Outline
Ch. 10 Principal Components Analysis (PCA) Outline 1. Why use PCA? 2. Calculating Principal Components 3. Using Principal Components in Regression 4. PROC FACTOR This material is loosely related to Section
More informationPrincipal Component Analysis-I Geog 210C Introduction to Spatial Data Analysis. Chris Funk. Lecture 17
Principal Component Analysis-I Geog 210C Introduction to Spatial Data Analysis Chris Funk Lecture 17 Outline Filters and Rotations Generating co-varying random fields Translating co-varying fields into
More informationHow to Run the Analysis: To run a principal components factor analysis, from the menus choose: Analyze Dimension Reduction Factor...
The principal components method of extraction begins by finding a linear combination of variables that accounts for as much variation in the original variables as possible. This method is most often used
More informationLEWISVILLE POLICE DEPARTMENT
STATISTICAL REPORT FOR THE MONTH OF RUSSELL KERBOW CHIEF OF POLICE PREPARED BY: ROBIN BRIZENDINE PUBLIC SAFETY DATA TECHNICIAN DATE PREPARED: September 20, 2017 ORI: TX0610600 AGENCY: LEWISVILLE STATE:
More informationPrincipal component analysis (PCA) for clustering gene expression data
Principal component analysis (PCA) for clustering gene expression data Ka Yee Yeung Walter L. Ruzzo Bioinformatics, v17 #9 (2001) pp 763-774 1 Outline of talk Background and motivation Design of our empirical
More informationB. Weaver (18-Oct-2001) Factor analysis Chapter 7: Factor Analysis
B Weaver (18-Oct-2001) Factor analysis 1 Chapter 7: Factor Analysis 71 Introduction Factor analysis (FA) was developed by C Spearman It is a technique for examining the interrelationships in a set of variables
More informationPrincipal Component Analysis of Crime Rate in Nigeria: A Case Study of Ekiti and Osun State
American Journal of Mathematics and Statistics 2018, 8(4): 79-88 DOI: 10.5923/j.ajms.20180804.01 Principal Component Analysis of Crime Rate in Nigeria: A Case Study of Ekiti and Osun State Faweya O. 1,*,
More informationPrincipal component analysis
Principal component analysis Angela Montanari 1 Introduction Principal component analysis (PCA) is one of the most popular multivariate statistical methods. It was first introduced by Pearson (1901) and
More informationPrincipal Components Analysis using R Francis Huang / November 2, 2016
Principal Components Analysis using R Francis Huang / huangf@missouri.edu November 2, 2016 Principal components analysis (PCA) is a convenient way to reduce high dimensional data into a smaller number
More informationApplied Multivariate Analysis
Department of Mathematics and Statistics, University of Vaasa, Finland Spring 2017 Discriminant Analysis Background 1 Discriminant analysis Background General Setup for the Discriminant Analysis Descriptive
More informationPrincipal Component Analysis
Principal Component Analysis 1 Principal Component Analysis Principal component analysis is a technique used to construct composite variable(s) such that composite variable(s) are weighted combination
More information1 A factor can be considered to be an underlying latent variable: (a) on which people differ. (b) that is explained by unknown variables
1 A factor can be considered to be an underlying latent variable: (a) on which people differ (b) that is explained by unknown variables (c) that cannot be defined (d) that is influenced by observed variables
More informationExploratory Factor Analysis and Principal Component Analysis
Exploratory Factor Analysis and Principal Component Analysis Today s Topics: What are EFA and PCA for? Planning a factor analytic study Analysis steps: Extraction methods How many factors Rotation and
More informationIntroduction to Factor Analysis
to Factor Analysis Lecture 10 August 2, 2011 Advanced Multivariate Statistical Methods ICPSR Summer Session #2 Lecture #10-8/3/2011 Slide 1 of 55 Today s Lecture Factor Analysis Today s Lecture Exploratory
More informationUnconstrained Ordination
Unconstrained Ordination Sites Species A Species B Species C Species D Species E 1 0 (1) 5 (1) 1 (1) 10 (4) 10 (4) 2 2 (3) 8 (3) 4 (3) 12 (6) 20 (6) 3 8 (6) 20 (6) 10 (6) 1 (2) 3 (2) 4 4 (5) 11 (5) 8 (5)
More informationExploratory Factor Analysis and Principal Component Analysis
Exploratory Factor Analysis and Principal Component Analysis Today s Topics: What are EFA and PCA for? Planning a factor analytic study Analysis steps: Extraction methods How many factors Rotation and
More informationILLUSTRATIVE EXAMPLES OF PRINCIPAL COMPONENTS ANALYSIS
ILLUSTRATIVE EXAMPLES OF PRINCIPAL COMPONENTS ANALYSIS W. T. Federer, C. E. McCulloch and N. J. Miles-McDermott Biometrics Unit, Cornell University, Ithaca, New York 14853-7801 BU-901-MA December 1986
More informationBasics of Multivariate Modelling and Data Analysis
Basics of Multivariate Modelling and Data Analysis Kurt-Erik Häggblom 6. Principal component analysis (PCA) 6.1 Overview 6.2 Essentials of PCA 6.3 Numerical calculation of PCs 6.4 Effects of data preprocessing
More informationPrinciples of factor analysis. Roger Watson
Principles of factor analysis Roger Watson Factor analysis Factor analysis Factor analysis Factor analysis is a multivariate statistical method for reducing large numbers of variables to fewer underlying
More informationPRINCIPAL COMPONENTS ANALYSIS
121 CHAPTER 11 PRINCIPAL COMPONENTS ANALYSIS We now have the tools necessary to discuss one of the most important concepts in mathematical statistics: Principal Components Analysis (PCA). PCA involves
More informationPrincipal Components Theory Notes
Principal Components Theory Notes Charles J. Geyer August 29, 2007 1 Introduction These are class notes for Stat 5601 (nonparametrics) taught at the University of Minnesota, Spring 2006. This not a theory
More informationFactor Analysis Continued. Psy 524 Ainsworth
Factor Analysis Continued Psy 524 Ainsworth Equations Extraction Principal Axis Factoring Variables Skiers Cost Lift Depth Powder S1 32 64 65 67 S2 61 37 62 65 S3 59 40 45 43 S4 36 62 34 35 S5 62 46 43
More informationQuantitative Understanding in Biology Principal Components Analysis
Quantitative Understanding in Biology Principal Components Analysis Introduction Throughout this course we have seen examples of complex mathematical phenomena being represented as linear combinations
More informationSTP 226 ELEMENTARY STATISTICS NOTES
STP 226 ELEMENTARY STATISTICS NOTES PART 1V INFERENTIAL STATISTICS CHAPTER 12 CHI SQUARE PROCEDURES 12.1 The Chi Square Distribution A variable has a chi square distribution if the shape of its distribution
More informationMultivariate Data Analysis a survey of data reduction and data association techniques: Principal Components Analysis
Multivariate Data Analysis a survey of data reduction and data association techniques: Principal Components Analysis For example Data reduction approaches Cluster analysis Principal components analysis
More informationGEOG 4110/5100 Advanced Remote Sensing Lecture 15
GEOG 4110/5100 Advanced Remote Sensing Lecture 15 Principal Component Analysis Relevant reading: Richards. Chapters 6.3* http://www.ce.yildiz.edu.tr/personal/songul/file/1097/principal_components.pdf *For
More informationMultivariate Statistics
Multivariate Statistics Chapter 3: Principal Component Analysis Pedro Galeano Departamento de Estadística Universidad Carlos III de Madrid pedro.galeano@uc3m.es Course 2017/2018 Master in Mathematical
More informationCity of Newcastle Police Services Report. Third Quarter 2017
City of Newcastle Police Services Report Third Quarter 2017 Prepared by: The Research, Planning and Information Services Unit of the King County Sheriff s Office CITY OF NEWCASTLE CHIEF OF POLICE SERVICES
More informationSeasonality in recorded crime: preliminary findings
Seasonality in recorded crime: preliminary findings Celia Hird Chandni Ruparel Home Office Online Report /7 The views expressed in this report are those of the authors, not necessarily those of the Home
More informationECE 501b Homework #6 Due: 11/26
ECE 51b Homework #6 Due: 11/26 1. Principal Component Analysis: In this assignment, you will explore PCA as a technique for discerning whether low-dimensional structure exists in a set of data and for
More informationPrincipal Component Analysis -- PCA (also called Karhunen-Loeve transformation)
Principal Component Analysis -- PCA (also called Karhunen-Loeve transformation) PCA transforms the original input space into a lower dimensional space, by constructing dimensions that are linear combinations
More informationMachine Learning 2nd Edition
INTRODUCTION TO Lecture Slides for Machine Learning 2nd Edition ETHEM ALPAYDIN, modified by Leonardo Bobadilla and some parts from http://www.cs.tau.ac.il/~apartzin/machinelearning/ The MIT Press, 2010
More informationTMA4267 Linear Statistical Models V2017 [L3]
TMA4267 Linear Statistical Models V2017 [L3] Part 1: Multivariate RVs and normal distribution (L3) Covariance and positive definiteness [H:2.2,2.3,3.3], Principal components [H11.1-11.3] Quiz with Kahoot!
More informationPrincipal Component Analysis Utilizing R and SAS Software s
International Journal of Current Microbiology and Applied Sciences ISSN: 2319-7706 Volume 7 Number 05 (2018) Journal homepage: http://www.ijcmas.com Original Research Article https://doi.org/10.20546/ijcmas.2018.705.441
More informationPrincipal Components Analysis (PCA) and Singular Value Decomposition (SVD) with applications to Microarrays
Principal Components Analysis (PCA) and Singular Value Decomposition (SVD) with applications to Microarrays Prof. Tesler Math 283 Fall 2015 Prof. Tesler Principal Components Analysis Math 283 / Fall 2015
More informationIntroduction to Factor Analysis
to Factor Analysis Lecture 11 November 2, 2005 Multivariate Analysis Lecture #11-11/2/2005 Slide 1 of 58 Today s Lecture Factor Analysis. Today s Lecture Exploratory factor analysis (EFA). Confirmatory
More informationStructure in Data. A major objective in data analysis is to identify interesting features or structure in the data.
Structure in Data A major objective in data analysis is to identify interesting features or structure in the data. The graphical methods are very useful in discovering structure. There are basically two
More informationStatistical Analysis of Factors that Influence Voter Response Using Factor Analysis and Principal Component Analysis
Statistical Analysis of Factors that Influence Voter Response Using Factor Analysis and Principal Component Analysis 1 Violet Omuchira, John Kihoro, 3 Jeremiah Kiingati Jomo Kenyatta University of Agriculture
More informationDimension Reduction Techniques. Presented by Jie (Jerry) Yu
Dimension Reduction Techniques Presented by Jie (Jerry) Yu Outline Problem Modeling Review of PCA and MDS Isomap Local Linear Embedding (LLE) Charting Background Advances in data collection and storage
More informationMajor Crime Map Help Documentation
Major Crime Map Help Documentation This web application is designed to make it easier to visualize and understand crime trends in Overland Park. The data for this application are generally limited to relatively
More informationSecond-Order Inference for Gaussian Random Curves
Second-Order Inference for Gaussian Random Curves With Application to DNA Minicircles Victor Panaretos David Kraus John Maddocks Ecole Polytechnique Fédérale de Lausanne Panaretos, Kraus, Maddocks (EPFL)
More informationPrincipal Component Analysis (PCA) Our starting point consists of T observations from N variables, which will be arranged in an T N matrix R,
Principal Component Analysis (PCA) PCA is a widely used statistical tool for dimension reduction. The objective of PCA is to find common factors, the so called principal components, in form of linear combinations
More informationData Preprocessing Tasks
Data Tasks 1 2 3 Data Reduction 4 We re here. 1 Dimensionality Reduction Dimensionality reduction is a commonly used approach for generating fewer features. Typically used because too many features can
More informationFactor Analysis. -Applied Multivariate Analysis- Lecturer: Darren Homrighausen, PhD
Factor Analysis -Applied Multivariate Analysis- Lecturer: Darren Homrighausen, PhD 1 From PCA to factor analysis Remember: PCA tries to estimate a transformation of the data such that: 1. The maximum amount
More informationSTATISTICAL SHAPE MODELS (SSM)
STATISTICAL SHAPE MODELS (SSM) Medical Image Analysis Serena Bonaretti serena.bonaretti@istb.unibe.ch ISTB - Institute for Surgical Technology and Biomechanics University of Bern Overview > Introduction
More informationI L L I N O I S UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN
Principal Analysis Edps/Soc 584 and Psych 594 Applied Multivariate Statistics Carolyn J. Anderson Department of Educational Psychology I L L I N O I S UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN c Board
More informationMEMPHIS POLICE DEPARTMENT
MEMPHIS POLICE DEPARTMENT Sentinel Camera Project City Council Districts (Data Dates Jan. 1 Dec. 31, 2016) District 1 - Councilman Bill Morrison District 2 - Councilman Frank Colvett District 3 Councilwoman
More information18.S096 Problem Set 7 Fall 2013 Factor Models Due Date: 11/14/2013. [ ] variance: E[X] =, and Cov[X] = Σ = =
18.S096 Problem Set 7 Fall 2013 Factor Models Due Date: 11/14/2013 1. Consider a bivariate random variable: [ ] X X = 1 X 2 with mean and co [ ] variance: [ ] [ α1 Σ 1,1 Σ 1,2 σ 2 ρσ 1 σ E[X] =, and Cov[X]
More informationMultivariate Analysis and Likelihood Inference
Multivariate Analysis and Likelihood Inference Outline 1 Joint Distribution of Random Variables 2 Principal Component Analysis (PCA) 3 Multivariate Normal Distribution 4 Likelihood Inference Joint density
More informationMultivariate and Multivariable Regression. Stella Babalola Johns Hopkins University
Multivariate and Multivariable Regression Stella Babalola Johns Hopkins University Session Objectives At the end of the session, participants will be able to: Explain the difference between multivariable
More informationPRINCIPAL COMPONENTS ANALYSIS
PRINCIPAL COMPONENTS ANALYSIS Iris Data Let s find Principal Components using the iris dataset. This is a well known dataset, often used to demonstrate the effect of clustering algorithms. It contains
More informationPackage paramap. R topics documented: September 20, 2017
Package paramap September 20, 2017 Type Package Title paramap Version 1.4 Date 2017-09-20 Author Brian P. O'Connor Maintainer Brian P. O'Connor Depends R(>= 1.9.0), psych, polycor
More informationPrincipal Components. Summary. Sample StatFolio: pca.sgp
Principal Components Summary... 1 Statistical Model... 4 Analysis Summary... 5 Analysis Options... 7 Scree Plot... 8 Component Weights... 9 D and 3D Component Plots... 10 Data Table... 11 D and 3D Component
More informationFactor analysis. George Balabanis
Factor analysis George Balabanis Key Concepts and Terms Deviation. A deviation is a value minus its mean: x - mean x Variance is a measure of how spread out a distribution is. It is computed as the average
More informationDidacticiel - Études de cas
1 Topic New features for PCA (Principal Component Analysis) in Tanagra 1.4.45 and later: tools for the determination of the number of factors. Principal Component Analysis (PCA) 1 is a very popular dimension
More informationIntermediate Social Statistics
Intermediate Social Statistics Lecture 5. Factor Analysis Tom A.B. Snijders University of Oxford January, 2008 c Tom A.B. Snijders (University of Oxford) Intermediate Social Statistics January, 2008 1
More informationEigenvalues, Eigenvectors, and an Intro to PCA
Eigenvalues, Eigenvectors, and an Intro to PCA Eigenvalues, Eigenvectors, and an Intro to PCA Changing Basis We ve talked so far about re-writing our data using a new set of variables, or a new basis.
More informationEigenvalues, Eigenvectors, and an Intro to PCA
Eigenvalues, Eigenvectors, and an Intro to PCA Eigenvalues, Eigenvectors, and an Intro to PCA Changing Basis We ve talked so far about re-writing our data using a new set of variables, or a new basis.
More informationEigenfaces. Face Recognition Using Principal Components Analysis
Eigenfaces Face Recognition Using Principal Components Analysis M. Turk, A. Pentland, "Eigenfaces for Recognition", Journal of Cognitive Neuroscience, 3(1), pp. 71-86, 1991. Slides : George Bebis, UNR
More informationPROCESS MONITORING OF THREE TANK SYSTEM. Outline Introduction Automation system PCA method Process monitoring with T 2 and Q statistics Conclusions
PROCESS MONITORING OF THREE TANK SYSTEM Outline Introduction Automation system PCA method Process monitoring with T 2 and Q statistics Conclusions Introduction Monitoring system for the level and temperature
More informationA Peak to the World of Multivariate Statistical Analysis
A Peak to the World of Multivariate Statistical Analysis Real Contents Real Real Real Why is it important to know a bit about the theory behind the methods? Real 5 10 15 20 Real 10 15 20 Figure: Multivariate
More informationTAMS39 Lecture 10 Principal Component Analysis Factor Analysis
TAMS39 Lecture 10 Principal Component Analysis Factor Analysis Martin Singull Department of Mathematics Mathematical Statistics Linköping University, Sweden Content - Lecture Principal component analysis
More informationPercent Change. Last YTD
Part I L Month Month HOMICIDE 1 0 5 3 66.7% 1 4 3 100.0% 80.0% 100.0% 0 0 0 2 6 8 MURDER 0 0 4 1 300.0% 0 3 1-75.0% 100.0% 0 0 0 2 6 8 MANSLAUGHTER 0 0 0 2-100.0% 0 0 2 - - 100.0% 0 0 0 0 0 0 NEG. HOMICIDE
More informationMedford Police Department April 2018 Consolidated Incident Report(includes all nature of call codes)
Part I L Month Month HOMICIDE 0 0 2 1 100.0% 0 3 1-150.0% 100.0% 0 0 0 0 5 5 MURDER 0 0 2 1 100.0% 0 2 1-100.0% 100.0% 0 0 0 0 2 2 MANSLAUGHTER 0 0 0 0 0% 0 0 0 - - - 0 0 0 0 0 0 NEG. HOMICIDE - TRAFFIC
More informationPrincipal components
Principal components Principal components is a general analysis technique that has some application within regression, but has a much wider use as well. Technical Stuff We have yet to define the term covariance,
More informationPercent Change. Last YTD MANSLAUGHTER % % 100.0% JUST. HOMICIDE %
Part I L Month Month HOMICIDE 1 0 3 1 200.0% 2 4 1 200.0% 133.3% 100.0% 0 2 2 0 4 4 MURDER 0 0 1 1-1 2 1-200.0% 100.0% 0 0 0 0 1 1 MANSLAUGHTER 1 0 2 0 200% 1 2 0 100.0% 100.0% - 0 2 2 0 3 3 JUST. HOMICIDE
More informationCarapace Measurements for Female Turtles
Carapace Measurements for Female Turtles Data on three dimensions of female turtle carapaces (shells): X 1 =log(carapace length) X 2 =log(carapace width) X 3 =log(carapace height) ince the measurements
More informationDate: Wed 12 July 2017 Reference Number: FOI Category: Stats - Crime
Date: Wed 12 July 2017 Reference Number: 20171145 FOI Category: Stats - Crime Title: Hate Crimes Request Date: Friday, 23 June, 2017 Response Date: Wednesday, 12 July, 2017 Request Details: Q1: The total
More informationPrincipal Components Analysis
Principal Components Analysis Nathaniel E. Helwig Assistant Professor of Psychology and Statistics University of Minnesota (Twin Cities) Updated 16-Mar-2017 Nathaniel E. Helwig (U of Minnesota) Principal
More information