Intermediate Social Statistics

Size: px
Start display at page:

Download "Intermediate Social Statistics"

Transcription

1 Intermediate Social Statistics Lecture 5. Factor Analysis Tom A.B. Snijders University of Oxford January, 2008 c Tom A.B. Snijders (University of Oxford) Intermediate Social Statistics January, / 31 This course is taught by Raymond Duch and Tom Snijders. Computer classes by David Armstrong and Mark Pickup. Course websites: (see teaching) snijders/iss.htm Today: Factor Analysis. c Tom A.B. Snijders (University of Oxford) Intermediate Social Statistics January, / 31

2 Factor analysis Factor analysis is used for two broad purposes. 1. Measurement (confirmatory factor analysis). The classical example is the measurement of intelligence by Spearman (1904). 2. Compression of information (exploratory factor analysis). Reduction of several variables to a much smaller number that contain the same information. Here we primarily treat confirmatory factor analysis. c Tom A.B. Snijders (University of Oxford) Intermediate Social Statistics January, / 31 Confirmatory factor analysis The researcher postulates a latent variable which cannot be observed directly, but of which indicators can be observed. Examples : 1. Intelligence measurement Indicators: scores on tasks depending on intelligence. 2. Left-right political attitudes Indicators: agreement with policy-related statements. The simplest FA model has one factor F and a number (say p) of observed indicators X i ; they are related by the equation X i = a i0 + a i1 F + U i (i = 1,..., p) where U i are the residuals. c Tom A.B. Snijders (University of Oxford) Intermediate Social Statistics January, / 31

3 Latent Observed Error a 1 x 1 U 1 a 2 x 2 U 2 F a 3... x 3 U ap x p U p One-factor model c Tom A.B. Snijders (University of Oxford) Intermediate Social Statistics January, / 31 The factor F and the residuals u i are unobserved. Is it possible to estimate and test this model? Distributional assumptions 1. The factor F and residuals U i all are uncorrelated, and have expected values The factor F has unit variance. 3. Possibly: the factor F and residuals U i all have normal distributions. The assumption of linear relations between the factors and observed variables is essential for FA. The assumption of normal distributions is necessary for statistical testing. c Tom A.B. Snijders (University of Oxford) Intermediate Social Statistics January, / 31

4 A consequence of the linearity and non-correlation assumptions, for the one-factor model, is Cov(X i, X j ) = Cov(a i F + U i, a j F + U j ) = a i a j (i j) and hence a i a j Corr(X i, X j ) = S.D.(X i ) S.D.(X j ). This means that all the rows of the correlation matrix of X are proportional, and similarly all the columns are proportional. Also, perhaps after multiplying some of the variables by 1 to obtain the correct polarity, all correlations are positive. c Tom A.B. Snijders (University of Oxford) Intermediate Social Statistics January, / 31 The general linear factor model It is a very strong requirement to explain all in a set of variables (except for very small sets) by one common factor. A more applicable model is the linear factor model with a general number of q factors: X i = a i0 + a i1 F 1 + a i2 F a iq F q + U i (i = 1,..., p). It is usual to standardize observed variables to have zero means (a i0 = 0 )and unit variances. In matrix notation, X = AF + U, where Cov(F ) = I and Cov(U) is diagonal. c Tom A.B. Snijders (University of Oxford) Intermediate Social Statistics January, / 31

5 The parameters a ij are called the factor loadings. Put in a matrix they are called the pattern matrix: a 11 a a 1q a 21 a a 2q A = a p1 a p2... a pq The default models have uncorrelated factors : Cov(F) = I; but interpretation may become more attractive when allowing correlated factors. c Tom A.B. Snijders (University of Oxford) Intermediate Social Statistics January, / 31 The standard distributional assumptions now are: 1. The factors F j are uncorrelated with the residuals U i, and the residuals are mutually uncorrelated. 2. The factors F j and residuals U i all have expected values The factors F j have unit variances. 4. Possibly 1: The factors F j are mutually uncorrelated. 5. Possibly 2: the factors F j and residuals U i have multivariate normal distributions. However, the assumptions about zero correlations may partially be dropped (see later example). c Tom A.B. Snijders (University of Oxford) Intermediate Social Statistics January, / 31

6 The variance of X i is given by q aij 2 + σi 2, where σi 2 = Var(U i ). j=1 This is 1 under the unit variance assumption. The first part of this, q hi 2 = aij 2, j=1 is explained by the factors and is called the communality of X i ; the remainder, σi 2 = Var(U i ) is the unique variance of X i. c Tom A.B. Snijders (University of Oxford) Intermediate Social Statistics January, / 31 More generally: Cov(Y ) = Cov(AF + U) = AA + Σ. Thus, we have (p q) + q parameters in A and Σ to model the p(p + 1)/2 parameters in Cov(Y ). However, for 2 or more factors, the pattern matrix can be rotated and yield exactly the same fit: AR has the same fit as A if RR = I. The resulting number of restrictions on the covariance matrix implied by the standard factor model is ( (p q) 2 (p + q) ) /2. These restrictions can be tested by a likelihood ratio chi-squared test. c Tom A.B. Snijders (University of Oxford) Intermediate Social Statistics January, / 31

7 Explained variance For orthogonal (i.e., uncorrelated) factors, the total variance explained by factor F j is p i=1 a2 ij. This is usually referred to as the eigenvalue for factor F j. For all relevant estimation methods, this is largest for the first factor F 1, and decreases with j. The number of factors will be determined so, that a large total explained variance q p aij 2 j=1 i=1 is obtained for a small number q of factors. c Tom A.B. Snijders (University of Oxford) Intermediate Social Statistics January, / 31 Rotation of factors The rotation of factors is a linear transformation such that the factors remain uncorrelated and retain unit variances. This changes the pattern matrix, but leaves the model as a whole unchanged: it is a reparametrization of the same model. The best interpretation usually is obtained when some of the loadings are high, and others are close to 0: simple structure. Procedures for rotation in view of obtaining simple structure are varimax (max. variance of squared loadings in columns) and quartimax (max. variance of squared loadings in rows). Oblique rotation methods try to do this c without Tom A.B. Snijders the(university restriction of Oxford) of uncorrelated Intermediate Socialfactors. Statistics January, / 31

8 Example: measurement of social capital From Petr Matějů and Anna Vitásková: Interpersonal Trust and Mutually Beneficial Exchanges. Czech Sociological Review, 2006, Vol. 42, No. 3: In a study on social capital, Matějů and Vitásková (2006) remarked that social capital tends to be conceptualized in two different ways: as a characteristic of the social environment based on interpersonal and institutional trust; and as a characteristic of individuals embedded in mutually beneficial social exchanges. They proposed to measure these two kinds of social capital by means of the following questions. c Tom A.B. Snijders (University of Oxford) Intermediate Social Statistics January, / TRUST1 = TRUST: There are only a few people I can trust completely. 2. TRUST2 = BEST: Most of the time you can be sure that other people want the best for you. 3. TRUST3 = ADVNT: If you are not careful, other people will take advantage of you. 4. EXNET1 = PRVHLP: How often, because of your job, the office you hold, or contacts you have, do other people (relatives, friends, acquaintances) turn to you to help them solve some problems, cope with difficult situations, or apply your influence for their benefit? 5. EXNET2 = GETHLP: And what about you? When you are in a difficult situation, do you think there are people who could intervene on your behalf? 6. EXNET3 = IMPORT: How important a role do useful contacts play in your life? c Tom A.B. Snijders (University of Oxford) Intermediate Social Statistics January, / 31

9 These questions were used in a survey of 1200 adult inhabitants of the Czech republic, held in 2001 as an extension of the International Social Survey Programme (ISSP). The answer categories were 5-point scales. c Tom A.B. Snijders (University of Oxford) Intermediate Social Statistics January, / 31 As you see, the correlations do not follow the pattern for a one-factor model. c Tom A.B. Snijders (University of Oxford) Intermediate Social Statistics January, / 31

10 Nevertheless a one-factor model was fitted as a first step. (Factor loadings in figure are incorrect.) c Tom A.B. Snijders (University of Oxford) Intermediate Social Statistics January, / 31 Note the number of parameters: 1 factor, 6 variables: 6 factor loadings, 6 residual variances; 6 variables: 15 parameters for a totally free covariance matrix. Hence 9 residual degrees of freedom. The likelihood ratio test with χ 2 = 214.3, d.f. = 9, indicates an extremely poor fit. c Tom A.B. Snijders (University of Oxford) Intermediate Social Statistics January, / 31

11 The underlying theory pointed to a two-factor model with loadings of the first three variables on Factor 1, and of the last three on Factor 2. This model had χ 2 = 49.7, d.f. = 9, a substantial improvement but still not a good fit. Therefore some of the zero-correlation assumptions were dropped. This led to the following model, which had χ 2 = 3.6, d.f. = 6. This model fits too well, it is not parsimonious. Dropping the between-factor correlation leads to χ 2 = 6.0, d.f. = 7. c Tom A.B. Snijders (University of Oxford) Intermediate Social Statistics January, / 31 c Tom A.B. Snijders (University of Oxford) Intermediate Social Statistics January, / 31

12 Assessment of fit There are various ways in which the fit of a FA model can be assessed. 1. The likelihood ratio (LR) test of the implied restrictions on the covariance matrix of Y. 2. Compare the fitted with the observed covariance (correlation) matrix. 3. The LR test has high power for large sample sizes; this may lead to overly complicated (non-parsimonious) models. Various fit indices have been developed which take into account fit between model and data, degrees of freedom (df ), and sample size (N). c Tom A.B. Snijders (University of Oxford) Intermediate Social Statistics January, / 31 Root Mean Square Error of Approximation The Root Mean Square Error of Approximation (RMSEA) (Steiger and Lind, 1980) is a descriptive measure indicating relative lack of fit: (χ RMSEA = 2 /df ) 1 N 1 where N = sample size. Rule of thumb: RMSE.05 signals good fit, >.10 poor fit. Note that all model selection procedures not based on a test for the null hypothesis that the model holds (such as the LR test) imply that the researcher is satisfied with a model that is not true, but a good approximation. c Tom A.B. Snijders (University of Oxford) Intermediate Social Statistics January, / 31

13 Many other fit indices have been developed; see the literature, e.g. overviews in Karin Schermelleh-Engel, Helfried Moosbrugger, and Hans Müller, Evaluating the Fit of Structural Equation Models: Tests of Significance and Descriptive Goodness-of-Fit Measures. Methods of Psychological Research Online 2003, 8.2, Xitao Fan and Stephen A. Sivo, Sensitivity of Fit Indexes to Misspecified Structural or Measurement Model Components: Rationale of Two-Index Strategy Revisited. Structural Equation Modeling, 12, (see course website). c Tom A.B. Snijders (University of Oxford) Intermediate Social Statistics January, / 31 Determination of the number of factors The major way of obtaining a well-fitting but parsimonious model is by determining the number of factors. This is done by considering the various fit indices. Often a scree plot is made of the explained variance obtained by each consecutive (orthogonal) factor. Fine-tuning is done by setting some factor loadings to 0, and by allowing correlations between factors or between residuals. c Tom A.B. Snijders (University of Oxford) Intermediate Social Statistics January, / 31

14 Factor scores Often it is important to estimate the values of the latent variables. Since statisticians use the term estimation for trying to approximate parameters and prediction for trying to approximate values of random variables, this is technically called the prediction of the latent variables. The predictors are called factor scores. c Tom A.B. Snijders (University of Oxford) Intermediate Social Statistics January, / 31 The factor scores can be predicted by the conditional means of the latent variables, given the observed variables. Interpretation of factors can be done using factor loadings and using factor scores; for orthogonal factors this will lead to the same picture, for correlated factors there are differences in interpretation. c Tom A.B. Snijders (University of Oxford) Intermediate Social Statistics January, / 31

15 Principal Component Analysis (PCA) A method related to Factor Analysis, but without the assumptions of the existence of latent variables, is Principal Component Analysis. Here the observed variables are again X 1, X 2,..., X p; now the purpose of the method is to obtain q variables that are linear combinations of X 1,..., X p and from which the original X i can be optimally predicted. This is similar to Factor Analysis without residuals: the vector of components is defined as A X ; the observed covariance matrix of Y is approximated by a covariance matrix of the form A A, and the number q of components is determined such that the total explained variance is high, for a low q. c Tom A.B. Snijders (University of Oxford) Intermediate Social Statistics January, / 31 Factor Analysis Glossary (1) Thanks to Dave Armstrong! Factor Loading Coefficient aij relating the unobserved variable Fj to the observed variable Xi. Factor Pattern Matrix Matrix of factor loadings, a11 a12 a1q a21 a22 a2q.... ap1 ap2 apq Communality Amount of variance of observed variable Xi shared with the other variables; usually denoted as hi 2 = q j=1 aij. Uniqueness Amount of an observed variable s variance not shared with the other variables. c Tom A.B. Snijders (University of Oxford) Intermediate Social Statistics January, / 31

16 Factor Analysis Glossary (2) Eigenvalue In an unrotated factor solution, the amount of variance explained by each factor. Rotation Factor solutions are only identified up to a rotation, meaning there are infinitely many solutions that are equally good in terms of variance explained (ability to reproduce the correlation matrix). Rotating means moving the factors around in space often so they explain the same amount of variance, but so they also have other desirable properties. Factor Structure Matrix Asymmetric matrix of correlations between the observed variables and the factors. This is the same as the Factor Pattern Matrix for orthogonal factors, but these are not the same when we allow the factors to be correlated. c Tom A.B. Snijders (University of Oxford) Intermediate Social Statistics January, / 31

Chapter 4: Factor Analysis

Chapter 4: Factor Analysis Chapter 4: Factor Analysis In many studies, we may not be able to measure directly the variables of interest. We can merely collect data on other variables which may be related to the variables of interest.

More information

Applied Multivariate Analysis

Applied Multivariate Analysis Department of Mathematics and Statistics, University of Vaasa, Finland Spring 2017 Dimension reduction Exploratory (EFA) Background While the motivation in PCA is to replace the original (correlated) variables

More information

Multivariate Fundamentals: Rotation. Exploratory Factor Analysis

Multivariate Fundamentals: Rotation. Exploratory Factor Analysis Multivariate Fundamentals: Rotation Exploratory Factor Analysis PCA Analysis A Review Precipitation Temperature Ecosystems PCA Analysis with Spatial Data Proportion of variance explained Comp.1 + Comp.2

More information

Exploratory Factor Analysis and Principal Component Analysis

Exploratory Factor Analysis and Principal Component Analysis Exploratory Factor Analysis and Principal Component Analysis Today s Topics: What are EFA and PCA for? Planning a factor analytic study Analysis steps: Extraction methods How many factors Rotation and

More information

Exploratory Factor Analysis and Principal Component Analysis

Exploratory Factor Analysis and Principal Component Analysis Exploratory Factor Analysis and Principal Component Analysis Today s Topics: What are EFA and PCA for? Planning a factor analytic study Analysis steps: Extraction methods How many factors Rotation and

More information

LECTURE 4 PRINCIPAL COMPONENTS ANALYSIS / EXPLORATORY FACTOR ANALYSIS

LECTURE 4 PRINCIPAL COMPONENTS ANALYSIS / EXPLORATORY FACTOR ANALYSIS LECTURE 4 PRINCIPAL COMPONENTS ANALYSIS / EXPLORATORY FACTOR ANALYSIS NOTES FROM PRE- LECTURE RECORDING ON PCA PCA and EFA have similar goals. They are substantially different in important ways. The goal

More information

Introduction to Confirmatory Factor Analysis

Introduction to Confirmatory Factor Analysis Introduction to Confirmatory Factor Analysis Multivariate Methods in Education ERSH 8350 Lecture #12 November 16, 2011 ERSH 8350: Lecture 12 Today s Class An Introduction to: Confirmatory Factor Analysis

More information

Introduction to Factor Analysis

Introduction to Factor Analysis to Factor Analysis Lecture 10 August 2, 2011 Advanced Multivariate Statistical Methods ICPSR Summer Session #2 Lecture #10-8/3/2011 Slide 1 of 55 Today s Lecture Factor Analysis Today s Lecture Exploratory

More information

Dimensionality Reduction Techniques (DRT)

Dimensionality Reduction Techniques (DRT) Dimensionality Reduction Techniques (DRT) Introduction: Sometimes we have lot of variables in the data for analysis which create multidimensional matrix. To simplify calculation and to get appropriate,

More information

Factor analysis. George Balabanis

Factor analysis. George Balabanis Factor analysis George Balabanis Key Concepts and Terms Deviation. A deviation is a value minus its mean: x - mean x Variance is a measure of how spread out a distribution is. It is computed as the average

More information

TAMS39 Lecture 10 Principal Component Analysis Factor Analysis

TAMS39 Lecture 10 Principal Component Analysis Factor Analysis TAMS39 Lecture 10 Principal Component Analysis Factor Analysis Martin Singull Department of Mathematics Mathematical Statistics Linköping University, Sweden Content - Lecture Principal component analysis

More information

Introduction to Factor Analysis

Introduction to Factor Analysis to Factor Analysis Lecture 11 November 2, 2005 Multivariate Analysis Lecture #11-11/2/2005 Slide 1 of 58 Today s Lecture Factor Analysis. Today s Lecture Exploratory factor analysis (EFA). Confirmatory

More information

1 A factor can be considered to be an underlying latent variable: (a) on which people differ. (b) that is explained by unknown variables

1 A factor can be considered to be an underlying latent variable: (a) on which people differ. (b) that is explained by unknown variables 1 A factor can be considered to be an underlying latent variable: (a) on which people differ (b) that is explained by unknown variables (c) that cannot be defined (d) that is influenced by observed variables

More information

Principal Component Analysis & Factor Analysis. Psych 818 DeShon

Principal Component Analysis & Factor Analysis. Psych 818 DeShon Principal Component Analysis & Factor Analysis Psych 818 DeShon Purpose Both are used to reduce the dimensionality of correlated measurements Can be used in a purely exploratory fashion to investigate

More information

The Common Factor Model. Measurement Methods Lecture 15 Chapter 9

The Common Factor Model. Measurement Methods Lecture 15 Chapter 9 The Common Factor Model Measurement Methods Lecture 15 Chapter 9 Today s Class Common Factor Model Multiple factors with a single test ML Estimation Methods New fit indices because of ML Estimation method

More information

Factor Analysis Continued. Psy 524 Ainsworth

Factor Analysis Continued. Psy 524 Ainsworth Factor Analysis Continued Psy 524 Ainsworth Equations Extraction Principal Axis Factoring Variables Skiers Cost Lift Depth Powder S1 32 64 65 67 S2 61 37 62 65 S3 59 40 45 43 S4 36 62 34 35 S5 62 46 43

More information

Structural Equation Modeling and Confirmatory Factor Analysis. Types of Variables

Structural Equation Modeling and Confirmatory Factor Analysis. Types of Variables /4/04 Structural Equation Modeling and Confirmatory Factor Analysis Advanced Statistics for Researchers Session 3 Dr. Chris Rakes Website: http://csrakes.yolasite.com Email: Rakes@umbc.edu Twitter: @RakesChris

More information

Confirmatory Factor Analysis: Model comparison, respecification, and more. Psychology 588: Covariance structure and factor models

Confirmatory Factor Analysis: Model comparison, respecification, and more. Psychology 588: Covariance structure and factor models Confirmatory Factor Analysis: Model comparison, respecification, and more Psychology 588: Covariance structure and factor models Model comparison 2 Essentially all goodness of fit indices are descriptive,

More information

STAT 730 Chapter 9: Factor analysis

STAT 730 Chapter 9: Factor analysis STAT 730 Chapter 9: Factor analysis Timothy Hanson Department of Statistics, University of South Carolina Stat 730: Multivariate Data Analysis 1 / 15 Basic idea Factor analysis attempts to explain the

More information

I L L I N O I S UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN

I L L I N O I S UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN Canonical Edps/Soc 584 and Psych 594 Applied Multivariate Statistics Carolyn J. Anderson Department of Educational Psychology I L L I N O I S UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN Canonical Slide

More information

2/26/2017. This is similar to canonical correlation in some ways. PSY 512: Advanced Statistics for Psychological and Behavioral Research 2

2/26/2017. This is similar to canonical correlation in some ways. PSY 512: Advanced Statistics for Psychological and Behavioral Research 2 PSY 512: Advanced Statistics for Psychological and Behavioral Research 2 What is factor analysis? What are factors? Representing factors Graphs and equations Extracting factors Methods and criteria Interpreting

More information

STRUCTURAL EQUATION MODELING. Khaled Bedair Statistics Department Virginia Tech LISA, Summer 2013

STRUCTURAL EQUATION MODELING. Khaled Bedair Statistics Department Virginia Tech LISA, Summer 2013 STRUCTURAL EQUATION MODELING Khaled Bedair Statistics Department Virginia Tech LISA, Summer 2013 Introduction: Path analysis Path Analysis is used to estimate a system of equations in which all of the

More information

FACTOR ANALYSIS AND MULTIDIMENSIONAL SCALING

FACTOR ANALYSIS AND MULTIDIMENSIONAL SCALING FACTOR ANALYSIS AND MULTIDIMENSIONAL SCALING Vishwanath Mantha Department for Electrical and Computer Engineering Mississippi State University, Mississippi State, MS 39762 mantha@isip.msstate.edu ABSTRACT

More information

Chapter 3: Testing alternative models of data

Chapter 3: Testing alternative models of data Chapter 3: Testing alternative models of data William Revelle Northwestern University Prepared as part of course on latent variable analysis (Psychology 454) and as a supplement to the Short Guide to R

More information

Introduction to Structural Equation Modeling

Introduction to Structural Equation Modeling Introduction to Structural Equation Modeling Notes Prepared by: Lisa Lix, PhD Manitoba Centre for Health Policy Topics Section I: Introduction Section II: Review of Statistical Concepts and Regression

More information

Factor Analysis & Structural Equation Models. CS185 Human Computer Interaction

Factor Analysis & Structural Equation Models. CS185 Human Computer Interaction Factor Analysis & Structural Equation Models CS185 Human Computer Interaction MoodPlay Recommender (Andjelkovic et al, UMAP 2016) Online system available here: http://ugallery.pythonanywhere.com/ 2 3 Structural

More information

Factor Analysis. Qian-Li Xue

Factor Analysis. Qian-Li Xue Factor Analysis Qian-Li Xue Biostatistics Program Harvard Catalyst The Harvard Clinical & Translational Science Center Short course, October 7, 06 Well-used latent variable models Latent variable scale

More information

Factor Analysis. Robert L. Wolpert Department of Statistical Science Duke University, Durham, NC, USA

Factor Analysis. Robert L. Wolpert Department of Statistical Science Duke University, Durham, NC, USA Factor Analysis Robert L. Wolpert Department of Statistical Science Duke University, Durham, NC, USA 1 Factor Models The multivariate regression model Y = XB +U expresses each row Y i R p as a linear combination

More information

Confirmatory Factor Analysis. Psych 818 DeShon

Confirmatory Factor Analysis. Psych 818 DeShon Confirmatory Factor Analysis Psych 818 DeShon Purpose Takes factor analysis a few steps further. Impose theoretically interesting constraints on the model and examine the resulting fit of the model with

More information

The 3 Indeterminacies of Common Factor Analysis

The 3 Indeterminacies of Common Factor Analysis The 3 Indeterminacies of Common Factor Analysis James H. Steiger Department of Psychology and Human Development Vanderbilt University James H. Steiger (Vanderbilt University) The 3 Indeterminacies of Common

More information

Short Answer Questions: Answer on your separate blank paper. Points are given in parentheses.

Short Answer Questions: Answer on your separate blank paper. Points are given in parentheses. ISQS 6348 Final exam solutions. Name: Open book and notes, but no electronic devices. Answer short answer questions on separate blank paper. Answer multiple choice on this exam sheet. Put your name on

More information

Exploratory Factor Analysis and Canonical Correlation

Exploratory Factor Analysis and Canonical Correlation Exploratory Factor Analysis and Canonical Correlation 3 Dec 2010 CPSY 501 Dr. Sean Ho Trinity Western University Please download: SAQ.sav Outline for today Factor analysis Latent variables Correlation

More information

Chapter 8. Models with Structural and Measurement Components. Overview. Characteristics of SR models. Analysis of SR models. Estimation of SR models

Chapter 8. Models with Structural and Measurement Components. Overview. Characteristics of SR models. Analysis of SR models. Estimation of SR models Chapter 8 Models with Structural and Measurement Components Good people are good because they've come to wisdom through failure. Overview William Saroyan Characteristics of SR models Estimation of SR models

More information

B. Weaver (18-Oct-2001) Factor analysis Chapter 7: Factor Analysis

B. Weaver (18-Oct-2001) Factor analysis Chapter 7: Factor Analysis B Weaver (18-Oct-2001) Factor analysis 1 Chapter 7: Factor Analysis 71 Introduction Factor analysis (FA) was developed by C Spearman It is a technique for examining the interrelationships in a set of variables

More information

Robustness of factor analysis in analysis of data with discrete variables

Robustness of factor analysis in analysis of data with discrete variables Aalto University School of Science Degree programme in Engineering Physics and Mathematics Robustness of factor analysis in analysis of data with discrete variables Student Project 26.3.2012 Juha Törmänen

More information

Factor Analysis. -Applied Multivariate Analysis- Lecturer: Darren Homrighausen, PhD

Factor Analysis. -Applied Multivariate Analysis- Lecturer: Darren Homrighausen, PhD Factor Analysis -Applied Multivariate Analysis- Lecturer: Darren Homrighausen, PhD 1 From PCA to factor analysis Remember: PCA tries to estimate a transformation of the data such that: 1. The maximum amount

More information

Dimension Reduction and Classification Using PCA and Factor. Overview

Dimension Reduction and Classification Using PCA and Factor. Overview Dimension Reduction and Classification Using PCA and - A Short Overview Laboratory for Interdisciplinary Statistical Analysis Department of Statistics Virginia Tech http://www.stat.vt.edu/consult/ March

More information

Factor Analysis Edpsy/Soc 584 & Psych 594

Factor Analysis Edpsy/Soc 584 & Psych 594 Factor Analysis Edpsy/Soc 584 & Psych 594 Carolyn J. Anderson University of Illinois, Urbana-Champaign April 29, 2009 1 / 52 Rotation Assessing Fit to Data (one common factor model) common factors Assessment

More information

Key Algebraic Results in Linear Regression

Key Algebraic Results in Linear Regression Key Algebraic Results in Linear Regression James H. Steiger Department of Psychology and Human Development Vanderbilt University James H. Steiger (Vanderbilt University) 1 / 30 Key Algebraic Results in

More information

Principal Component Analysis (PCA) Theory, Practice, and Examples

Principal Component Analysis (PCA) Theory, Practice, and Examples Principal Component Analysis (PCA) Theory, Practice, and Examples Data Reduction summarization of data with many (p) variables by a smaller set of (k) derived (synthetic, composite) variables. p k n A

More information

Psychology 454: Latent Variable Modeling How do you know if a model works?

Psychology 454: Latent Variable Modeling How do you know if a model works? Psychology 454: Latent Variable Modeling How do you know if a model works? William Revelle Department of Psychology Northwestern University Evanston, Illinois USA November, 2012 1 / 18 Outline 1 Goodness

More information

Multivariate Statistics

Multivariate Statistics Multivariate Statistics Chapter 4: Factor analysis Pedro Galeano Departamento de Estadística Universidad Carlos III de Madrid pedro.galeano@uc3m.es Course 2017/2018 Master in Mathematical Engineering Pedro

More information

Inference using structural equations with latent variables

Inference using structural equations with latent variables This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this

More information

Confirmatory Factor Analysis

Confirmatory Factor Analysis Confirmatory Factor Analysis Latent Trait Measurement and Structural Equation Models Lecture #6 February 13, 2013 PSYC 948: Lecture #6 Today s Class An introduction to confirmatory factor analysis The

More information

Principle Components Analysis (PCA) Relationship Between a Linear Combination of Variables and Axes Rotation for PCA

Principle Components Analysis (PCA) Relationship Between a Linear Combination of Variables and Axes Rotation for PCA Principle Components Analysis (PCA) Relationship Between a Linear Combination of Variables and Axes Rotation for PCA Principle Components Analysis: Uses one group of variables (we will call this X) In

More information

Hypothesis Testing for Var-Cov Components

Hypothesis Testing for Var-Cov Components Hypothesis Testing for Var-Cov Components When the specification of coefficients as fixed, random or non-randomly varying is considered, a null hypothesis of the form is considered, where Additional output

More information

Factor Analysis (10/2/13)

Factor Analysis (10/2/13) STA561: Probabilistic machine learning Factor Analysis (10/2/13) Lecturer: Barbara Engelhardt Scribes: Li Zhu, Fan Li, Ni Guan Factor Analysis Factor analysis is related to the mixture models we have studied.

More information

Or, in terms of basic measurement theory, we could model it as:

Or, in terms of basic measurement theory, we could model it as: 1 Neuendorf Factor Analysis Assumptions: 1. Metric (interval/ratio) data 2. Linearity (in relationships among the variables--factors are linear constructions of the set of variables; the critical source

More information

Principal Component Analysis-I Geog 210C Introduction to Spatial Data Analysis. Chris Funk. Lecture 17

Principal Component Analysis-I Geog 210C Introduction to Spatial Data Analysis. Chris Funk. Lecture 17 Principal Component Analysis-I Geog 210C Introduction to Spatial Data Analysis Chris Funk Lecture 17 Outline Filters and Rotations Generating co-varying random fields Translating co-varying fields into

More information

VAR2 VAR3 VAR4 VAR5. Or, in terms of basic measurement theory, we could model it as:

VAR2 VAR3 VAR4 VAR5. Or, in terms of basic measurement theory, we could model it as: 1 Neuendorf Factor Analysis Assumptions: 1. Metric (interval/ratio) data 2. Linearity (in the relationships among the variables) -Factors are linear constructions of the set of variables (see #8 under

More information

An Introduction to Mplus and Path Analysis

An Introduction to Mplus and Path Analysis An Introduction to Mplus and Path Analysis PSYC 943: Fundamentals of Multivariate Modeling Lecture 10: October 30, 2013 PSYC 943: Lecture 10 Today s Lecture Path analysis starting with multivariate regression

More information

Unconstrained Ordination

Unconstrained Ordination Unconstrained Ordination Sites Species A Species B Species C Species D Species E 1 0 (1) 5 (1) 1 (1) 10 (4) 10 (4) 2 2 (3) 8 (3) 4 (3) 12 (6) 20 (6) 3 8 (6) 20 (6) 10 (6) 1 (2) 3 (2) 4 4 (5) 11 (5) 8 (5)

More information

Structural Model Equivalence

Structural Model Equivalence Structural Model Equivalence James H. Steiger Department of Psychology and Human Development Vanderbilt University James H. Steiger (Vanderbilt University) Structural Model Equivalence 1 / 34 Structural

More information

9.1 Orthogonal factor model.

9.1 Orthogonal factor model. 36 Chapter 9 Factor Analysis Factor analysis may be viewed as a refinement of the principal component analysis The objective is, like the PC analysis, to describe the relevant variables in study in terms

More information

An Introduction to Path Analysis

An Introduction to Path Analysis An Introduction to Path Analysis PRE 905: Multivariate Analysis Lecture 10: April 15, 2014 PRE 905: Lecture 10 Path Analysis Today s Lecture Path analysis starting with multivariate regression then arriving

More information

Factor Analysis. Statistical Background. Chapter. Herb Stenson and Leland Wilkinson

Factor Analysis. Statistical Background. Chapter. Herb Stenson and Leland Wilkinson Chapter 12 Herb Stenson and Leland Wilkinson FACTOR provides principal components analysis and common factor analysis (maximum likelihood and iterated principal axis). SYSTAT has options to rotate, sort,

More information

Part 2: EFA Outline. Exploratory and Confirmatory Factor Analysis. Basic ideas: 1. Linear regression on common factors. Basic Ideas of Factor Analysis

Part 2: EFA Outline. Exploratory and Confirmatory Factor Analysis. Basic ideas: 1. Linear regression on common factors. Basic Ideas of Factor Analysis Exploratory and Confirmatory Factor Analysis Part 2: EFA and Factor Rotation Michael Friendly Psychology 6140 Part 2: EFA Outline 1 Linear regression on common factors Partial linear independence Partial

More information

Principal Components Theory Notes

Principal Components Theory Notes Principal Components Theory Notes Charles J. Geyer August 29, 2007 1 Introduction These are class notes for Stat 5601 (nonparametrics) taught at the University of Minnesota, Spring 2006. This not a theory

More information

Inter Item Correlation Matrix (R )

Inter Item Correlation Matrix (R ) 7 1. I have the ability to influence my child s well-being. 2. Whether my child avoids injury is just a matter of luck. 3. Luck plays a big part in determining how healthy my child is. 4. I can do a lot

More information

UNIVERSITY OF CALGARY. The Influence of Model Components and Misspecification Type on the Performance of the

UNIVERSITY OF CALGARY. The Influence of Model Components and Misspecification Type on the Performance of the UNIVERSITY OF CALGARY The Influence of Model Components and Misspecification Type on the Performance of the Comparative Fit Index (CFI) and the Root Mean Square Error of Approximation (RMSEA) in Structural

More information

Maximum Likelihood Estimation; Robust Maximum Likelihood; Missing Data with Maximum Likelihood

Maximum Likelihood Estimation; Robust Maximum Likelihood; Missing Data with Maximum Likelihood Maximum Likelihood Estimation; Robust Maximum Likelihood; Missing Data with Maximum Likelihood PRE 906: Structural Equation Modeling Lecture #3 February 4, 2015 PRE 906, SEM: Estimation Today s Class An

More information

Principal Components Analysis and Exploratory Factor Analysis

Principal Components Analysis and Exploratory Factor Analysis Principal Components Analysis and Exploratory Factor Analysis PRE 905: Multivariate Analysis Lecture 12: May 6, 2014 PRE 905: PCA and EFA (with CFA) Today s Class Advanced matrix operations Principal Components

More information

Factor Analysis: An Introduction. What is Factor Analysis? 100+ years of Factor Analysis FACTOR ANALYSIS AN INTRODUCTION NILAM RAM

Factor Analysis: An Introduction. What is Factor Analysis? 100+ years of Factor Analysis FACTOR ANALYSIS AN INTRODUCTION NILAM RAM NILAM RAM 2018 PSYCHOLOGY R BOOTCAMP PENNSYLVANIA STATE UNIVERSITY AUGUST 16, 2018 FACTOR ANALYSIS https://psu-psychology.github.io/r-bootcamp-2018/index.html WITH ADDITIONAL MATERIALS AT https://quantdev.ssri.psu.edu/tutorials

More information

Principles of factor analysis. Roger Watson

Principles of factor analysis. Roger Watson Principles of factor analysis Roger Watson Factor analysis Factor analysis Factor analysis Factor analysis is a multivariate statistical method for reducing large numbers of variables to fewer underlying

More information

Package paramap. R topics documented: September 20, 2017

Package paramap. R topics documented: September 20, 2017 Package paramap September 20, 2017 Type Package Title paramap Version 1.4 Date 2017-09-20 Author Brian P. O'Connor Maintainer Brian P. O'Connor Depends R(>= 1.9.0), psych, polycor

More information

Machine Learning 2nd Edition

Machine Learning 2nd Edition INTRODUCTION TO Lecture Slides for Machine Learning 2nd Edition ETHEM ALPAYDIN, modified by Leonardo Bobadilla and some parts from http://www.cs.tau.ac.il/~apartzin/machinelearning/ The MIT Press, 2010

More information

Structure in Data. A major objective in data analysis is to identify interesting features or structure in the data.

Structure in Data. A major objective in data analysis is to identify interesting features or structure in the data. Structure in Data A major objective in data analysis is to identify interesting features or structure in the data. The graphical methods are very useful in discovering structure. There are basically two

More information

Factor Analysis (FA) Non-negative Matrix Factorization (NMF) CSE Artificial Intelligence Grad Project Dr. Debasis Mitra

Factor Analysis (FA) Non-negative Matrix Factorization (NMF) CSE Artificial Intelligence Grad Project Dr. Debasis Mitra Factor Analysis (FA) Non-negative Matrix Factorization (NMF) CSE 5290 - Artificial Intelligence Grad Project Dr. Debasis Mitra Group 6 Taher Patanwala Zubin Kadva Factor Analysis (FA) 1. Introduction Factor

More information

Pollution Sources Detection via Principal Component Analysis and Rotation

Pollution Sources Detection via Principal Component Analysis and Rotation Pollution Sources Detection via Principal Component Analysis and Rotation Vanessa Kuentz 1 in collaboration with : Marie Chavent 1 Hervé Guégan 2 Brigitte Patouille 1 Jérôme Saracco 1,3 1 IMB, Université

More information

EFA. Exploratory Factor Analysis

EFA. Exploratory Factor Analysis EFA Exploratory Factor Analysis EFA Today s goal: Teach Exploratory Factor Analysis (EFA) This will not be on the test :-) Outline - EFA theory - EFA in R EFA theory Exploratory Factor Analysis Why EFA?

More information

R = µ + Bf Arbitrage Pricing Model, APM

R = µ + Bf Arbitrage Pricing Model, APM 4.2 Arbitrage Pricing Model, APM Empirical evidence indicates that the CAPM beta does not completely explain the cross section of expected asset returns. This suggests that additional factors may be required.

More information

WELCOME! Lecture 14: Factor Analysis, part I Måns Thulin

WELCOME! Lecture 14: Factor Analysis, part I Måns Thulin Quantitative methods II WELCOME! Lecture 14: Factor Analysis, part I Måns Thulin The first factor analysis C. Spearman (1904). General intelligence, objectively determined and measured. The American Journal

More information

Factor Analysis. Summary. Sample StatFolio: factor analysis.sgp

Factor Analysis. Summary. Sample StatFolio: factor analysis.sgp Factor Analysis Summary... 1 Data Input... 3 Statistical Model... 4 Analysis Summary... 5 Analysis Options... 7 Scree Plot... 9 Extraction Statistics... 10 Rotation Statistics... 11 D and 3D Scatterplots...

More information

Multivariate and Multivariable Regression. Stella Babalola Johns Hopkins University

Multivariate and Multivariable Regression. Stella Babalola Johns Hopkins University Multivariate and Multivariable Regression Stella Babalola Johns Hopkins University Session Objectives At the end of the session, participants will be able to: Explain the difference between multivariable

More information

Penalized varimax. Abstract

Penalized varimax. Abstract Penalized varimax 1 Penalized varimax Nickolay T. Trendafilov and Doyo Gragn Department of Mathematics and Statistics, The Open University, Walton Hall, Milton Keynes MK7 6AA, UK Abstract A common weakness

More information

Statistics Introductory Correlation

Statistics Introductory Correlation Statistics Introductory Correlation Session 10 oscardavid.barrerarodriguez@sciencespo.fr April 9, 2018 Outline 1 Statistics are not used only to describe central tendency and variability for a single variable.

More information

Quantitative Understanding in Biology Principal Components Analysis

Quantitative Understanding in Biology Principal Components Analysis Quantitative Understanding in Biology Principal Components Analysis Introduction Throughout this course we have seen examples of complex mathematical phenomena being represented as linear combinations

More information

Wooldridge, Introductory Econometrics, 4th ed. Chapter 15: Instrumental variables and two stage least squares

Wooldridge, Introductory Econometrics, 4th ed. Chapter 15: Instrumental variables and two stage least squares Wooldridge, Introductory Econometrics, 4th ed. Chapter 15: Instrumental variables and two stage least squares Many economic models involve endogeneity: that is, a theoretical relationship does not fit

More information

Data Mining. Dimensionality reduction. Hamid Beigy. Sharif University of Technology. Fall 1395

Data Mining. Dimensionality reduction. Hamid Beigy. Sharif University of Technology. Fall 1395 Data Mining Dimensionality reduction Hamid Beigy Sharif University of Technology Fall 1395 Hamid Beigy (Sharif University of Technology) Data Mining Fall 1395 1 / 42 Outline 1 Introduction 2 Feature selection

More information

Dr. Junchao Xia Center of Biophysics and Computational Biology. Fall /1/2016 1/46

Dr. Junchao Xia Center of Biophysics and Computational Biology. Fall /1/2016 1/46 BIO5312 Biostatistics Lecture 10:Regression and Correlation Methods Dr. Junchao Xia Center of Biophysics and Computational Biology Fall 2016 11/1/2016 1/46 Outline In this lecture, we will discuss topics

More information

Repeated Measures ANOVA Multivariate ANOVA and Their Relationship to Linear Mixed Models

Repeated Measures ANOVA Multivariate ANOVA and Their Relationship to Linear Mixed Models Repeated Measures ANOVA Multivariate ANOVA and Their Relationship to Linear Mixed Models EPSY 905: Multivariate Analysis Spring 2016 Lecture #12 April 20, 2016 EPSY 905: RM ANOVA, MANOVA, and Mixed Models

More information

STA 431s17 Assignment Eight 1

STA 431s17 Assignment Eight 1 STA 43s7 Assignment Eight The first three questions of this assignment are about how instrumental variables can help with measurement error and omitted variables at the same time; see Lecture slide set

More information

A Study of Statistical Power and Type I Errors in Testing a Factor Analytic. Model for Group Differences in Regression Intercepts

A Study of Statistical Power and Type I Errors in Testing a Factor Analytic. Model for Group Differences in Regression Intercepts A Study of Statistical Power and Type I Errors in Testing a Factor Analytic Model for Group Differences in Regression Intercepts by Margarita Olivera Aguilar A Thesis Presented in Partial Fulfillment of

More information

w. T. Federer, z. D. Feng and c. E. McCulloch

w. T. Federer, z. D. Feng and c. E. McCulloch ILLUSTRATIVE EXAMPLES OF PRINCIPAL COMPONENT ANALYSIS USING GENSTATIPCP w. T. Federer, z. D. Feng and c. E. McCulloch BU-~-M November 98 ~ ABSTRACT In order to provide a deeper understanding of the workings

More information

STATS 200: Introduction to Statistical Inference. Lecture 29: Course review

STATS 200: Introduction to Statistical Inference. Lecture 29: Course review STATS 200: Introduction to Statistical Inference Lecture 29: Course review Course review We started in Lecture 1 with a fundamental assumption: Data is a realization of a random process. The goal throughout

More information

Principal Component Analysis (PCA) Our starting point consists of T observations from N variables, which will be arranged in an T N matrix R,

Principal Component Analysis (PCA) Our starting point consists of T observations from N variables, which will be arranged in an T N matrix R, Principal Component Analysis (PCA) PCA is a widely used statistical tool for dimension reduction. The objective of PCA is to find common factors, the so called principal components, in form of linear combinations

More information

Ross (1976) introduced the Arbitrage Pricing Theory (APT) as an alternative to the CAPM.

Ross (1976) introduced the Arbitrage Pricing Theory (APT) as an alternative to the CAPM. 4.2 Arbitrage Pricing Model, APM Empirical evidence indicates that the CAPM beta does not completely explain the cross section of expected asset returns. This suggests that additional factors may be required.

More information

Feature Transformation

Feature Transformation Página 1 de 31 On this page Introduction to Nonnegative Matrix Factorization Principal Component Analysis (PCA) Quality of Life in U.S. Cities Factor Analysis Introduction to Feature transformation is

More information

STT 843 Key to Homework 1 Spring 2018

STT 843 Key to Homework 1 Spring 2018 STT 843 Key to Homework Spring 208 Due date: Feb 4, 208 42 (a Because σ = 2, σ 22 = and ρ 2 = 05, we have σ 2 = ρ 2 σ σ22 = 2/2 Then, the mean and covariance of the bivariate normal is µ = ( 0 2 and Σ

More information

1 Principal Components Analysis

1 Principal Components Analysis Lecture 3 and 4 Sept. 18 and Sept.20-2006 Data Visualization STAT 442 / 890, CM 462 Lecture: Ali Ghodsi 1 Principal Components Analysis Principal components analysis (PCA) is a very popular technique for

More information

Statistical Analysis of Factors that Influence Voter Response Using Factor Analysis and Principal Component Analysis

Statistical Analysis of Factors that Influence Voter Response Using Factor Analysis and Principal Component Analysis Statistical Analysis of Factors that Influence Voter Response Using Factor Analysis and Principal Component Analysis 1 Violet Omuchira, John Kihoro, 3 Jeremiah Kiingati Jomo Kenyatta University of Agriculture

More information

Introduction to Structural Equation Modeling Dominique Zephyr Applied Statistics Lab

Introduction to Structural Equation Modeling Dominique Zephyr Applied Statistics Lab Applied Statistics Lab Introduction to Structural Equation Modeling Dominique Zephyr Applied Statistics Lab SEM Model 3.64 7.32 Education 2.6 Income 2.1.6.83 Charac. of Individuals 1 5.2e-06 -.62 2.62

More information

More PCA; and, Factor Analysis

More PCA; and, Factor Analysis More PCA; and, Factor Analysis 36-350, Data Mining 26 September 2008 Reading: Principles of Data Mining, section 14.3.3 on latent semantic indexing. 1 Latent Semantic Analysis: Yet More PCA and Yet More

More information

sempower Manual Morten Moshagen

sempower Manual Morten Moshagen sempower Manual Morten Moshagen 2018-03-22 Power Analysis for Structural Equation Models Contact: morten.moshagen@uni-ulm.de Introduction sempower provides a collection of functions to perform power analyses

More information

Phenotypic factor analysis

Phenotypic factor analysis 1 Phenotypic factor analysis Conor V. Dolan & Michel Nivard VU, Amsterdam Boulder Workshop - March 2018 2 Phenotypic factor analysis A statistical technique to investigate the dimensionality of correlated

More information

Canonical Correlation & Principle Components Analysis

Canonical Correlation & Principle Components Analysis Canonical Correlation & Principle Components Analysis Aaron French Canonical Correlation Canonical Correlation is used to analyze correlation between two sets of variables when there is one set of IVs

More information

Condition 9 and 10 Tests of Model Confirmation with SEM Techniques

Condition 9 and 10 Tests of Model Confirmation with SEM Techniques Condition 9 and 10 Tests of Model Confirmation with SEM Techniques Dr. Larry J. Williams CARMA Director Donald and Shirley Clifton Chair of Survey Science Professor of Management University of Nebraska

More information

What is Structural Equation Modelling?

What is Structural Equation Modelling? methods@manchester What is Structural Equation Modelling? Nick Shryane Institute for Social Change University of Manchester 1 Topics Where SEM fits in the families of statistical models Causality SEM is

More information

Estimation of Curvilinear Effects in SEM. Rex B. Kline, September 2009

Estimation of Curvilinear Effects in SEM. Rex B. Kline, September 2009 Estimation of Curvilinear Effects in SEM Supplement to Principles and Practice of Structural Equation Modeling (3rd ed.) Rex B. Kline, September 009 Curvlinear Effects of Observed Variables Consider the

More information

Wavelet Transform And Principal Component Analysis Based Feature Extraction

Wavelet Transform And Principal Component Analysis Based Feature Extraction Wavelet Transform And Principal Component Analysis Based Feature Extraction Keyun Tong June 3, 2010 As the amount of information grows rapidly and widely, feature extraction become an indispensable technique

More information