The European Commission s science and knowledge service. Joint Research Centre
|
|
- Ralph Buck Carpenter
- 5 years ago
- Views:
Transcription
1 The Euroean Commission s science and knowledge service Joint Research Centre
2 Ste 7: Statistical coherence (II) PCA, Exloratory Factor Analysis, Cronbach alha Hedvig Norlén COIN th JRC Annual Training on Comosite Indicators & Scoreboards 06-08/11/2017, Isra (IT)
3 Ste 7: Statistical coherence (II) The CI measures a multifaceted henomenon - a combination of different asects (SUB-PILLARS/PILLARS) (all related to the henomenon but different from each other) Each asect can be measured by a set of observable variables (INDICATORS) Framework of the Global Innovation Index 2017 CI PILLARS SUB-PILLARS INDICATORS 3 JRC-COIN Ste 7: Statistical Coherence (II)
4 Ste 7: Statistical coherence (II) CI PCA, FA are used to verify the internal consistency within each SUB-PILLAR (across INDICATORS) PILLARS SUB-PILLARS INDICATORS Framework of the Global Innovation Index JRC-COIN Ste 7: Statistical Coherence (II)
5 Ste 7: Statistical coherence (II) CI PCA, FA are used to verify the internal consistency within each SUB-PILLAR (across INDICATORS) within each PILLAR (across SUB-PILLARS) PILLARS SUB-PILLARS INDICATORS Framework of the Global Innovation Index JRC-COIN Ste 7: Statistical Coherence (II)
6 Ste 7: Statistical coherence (II) CI PCA, FA are used to verify the internal consistency within each SUB-PILLAR (across INDICATORS) within each PILLAR (across SUB-PILLARS) within the CI (across PILLARS) Search for 1-DIMENSIONALITY within each subset Framework of the Global Innovation Index 2017 PILLARS SUB-PILLARS INDICATORS 6 JRC-COIN Ste 7: Statistical Coherence (II)
7 Ste 7: Statistical coherence (II) Want to answer the following questions by using a multivariate method (PCA, FA, ) Is the hierarchical structure of our comosite indicator statistically well-defined? Is the set of available indicators sufficient/aroriate to describe the illars (dimensions) and the sub-illars (sub-dimensions) of the comosite indicator? 7 JRC-COIN Ste 7: Statistical Coherence (II)
8 Multivariate data The tye of analysis to be erformed deends on the data of the indicators Data Indicators Quantitative Qualitative Mixed/ structured PCA/FA PCA Princial Comonent Analysis FA Factor Analysis dispca/fa,(m)ca FAMD/MFA PCA for discrete data (based on olychoric correlations) (M)CA (Multile) Corresondence Analysis. FAMD Factor Analysis of Mixed Data MFA Multile Factor Analysis. 8 JRC-COIN Ste 7: Statistical Coherence (II)
9 Data matrix of indicators x11 x12... x1 x21 x22... x2 X =.. xn 1xn2... x n n = number of objects (countries, ) = number of variables (indicators) The revious stes in building our CI - Ste 3 Outlier detection and missing data estimation - Ste 4 Normalisation/Standardisation X correlated data set (without outliers and (without or a few) missing values) PCA, FA are based on correlations 9 JRC-COIN Ste 7: Statistical Coherence (II)
10 Exloratory FA Two main tyes of FA: Exloratory (EFA) and Confirmatory (CFA) EFA examining the relationshis among the variables/indicators and do not have an à riori fixed number of factors CFA a clear idea about the number of factors you will encounter, and about which variables will most likely load onto each factor 10 JRC-COIN Ste 7: Statistical Coherence (II)
11 Is PCA = Exloratory FA? PCA FA Similar in many ways but an imortant difference that has effects on how to use them 1) Both are data reduction techniques - they allow you to cature the variance in variables in a smaller set 2) Same (or similar) rocedures used to run in many software (same stes extraction, interretation, rotation, choosing the number of comonents/factors) 11 JRC-COIN Ste 7: Statistical Coherence (II)
12 Is PCA = Exloratory FA? NO! PCA FA Similar in many ways but an imortant difference that has effects on how to use them 1) Both are data reduction techniques - they allow you to cature the variance in variables in a smaller set 2) Same (or similar) rocedures used to run in many software (same stes extraction, interretation, choosing the number of comonents/factors) Difference: PCA is a linear combination of variables FA is a measurement model of a latent variable (statistical model) 12 JRC-COIN Ste 7: Statistical Coherence (II)
13 Is PCA = Exloratory FA? NO! PCA Observed indicators are reduced into comonents FA Latent factors drive the observed variables Item 1 X 1 w 1 λ 1 Item 1 X 1 ε 1 Item 2 X 2... w 2 Comonent Factor λ 2 Item 2 X 2... ε 2 ε i Error terms Item X w λ Item X ε PCA summarizes information of all indicators and reduces it into a fewer number of comonents Each PC i is a new variable comuted as a linear combination of the original variables PC = w x + w x w x Model of the measurement of a latent variable. This latent variable cannot be directly measured with a single variable. Seen through the relationshis it causes in a set of X variables. X X X 1 i = λ F + ε = λif + ε 1 i = λ F + ε 1 13 JRC-COIN Ste 7: Statistical Coherence (II)
14 Some historical notes PCA FA PCA robably the oldest of multivariate analysis methods. Pearson (1901) said that his methods can be easily alied to numerical roblems and although he says that the calculations become cumbersome for four or more variables he suggests that they are still quite feasible Hotelling (1933) chose his comonents so as to maximize their successive contributions to the total of the variances of the original variables, rincial comonents FA was develoed by Searman (1904) in the field of sychology (discovered that school children's scores on a wide variety of seemingly unrelated subjects were ositively correlated). Cattell (1952) exanded on Searman's idea of a two-factor theory of intelligence FA - Charles Edward Searman ( ) FA - Raymond Bernard Cattell ( ) PCA - Karl Pearson ( ) PCA - Harold Hotelling ( ) 14 JRC-COIN Ste 7: Statistical Coherence (II)
15 First stes in PCA FA Check the correlation structure of the data and do 1) and 2) 1) Bartlett s shericity test The Bartlett s test checks if the observed correlation matrix R diverges significantly from the identity matrix. H 0 : R = 1, H 1 : R 1 Test statistic 2 χ = n 1 6 log Under H 0, it follows a χ² distribution with a (-1)/2 degrees of freedom. R n = samle size = number of indicators Want to reject H 0 to be able to do PCA,FA! (check n/<5) (Bartlett (1937)) 15 JRC-COIN Ste 7: Statistical Coherence (II)
16 First stes in PCA FA Check the correlation structure of the data and do 1) and 2) 2) Kaiser-Meyer-Olkin (KMO) Measure for Samling Adequacy Indicators are more or less correlated, but the correlation between two indicators can be influenced by the others. Use artial correlation in order to measure the relation between two indicators by removing the effect of the remaining indicators (controlling for the confounding variable) KMO 2 rij 2 r + ij = 2 aij r ij Pearson correlation coefficient a ij artial correlation coefficient Want KMO close to 1 to be able to do PCA,FA! KMO>0.6 ok (Kaiser (1970), Kaiser-Meyer (1974)) 16 JRC-COIN Ste 7: Statistical Coherence (II)
17 First stes in PCA FA Check the correlation structure of the data and do 1) and 2) 2) Kaiser-Meyer-Olkin (KMO) Measure for Samling Adequacy Kaiser ut the following values on the results: 0.00 to 0.49 unaccetable 0.50 to 0.59 miserable 0.60 to 0.69 mediocre 0.70 to 0.79 middling 0.80 to 0.89 meritorious 0.90 to 1.00 marvelous 1) Bartlett s shericity test and 2) KMO test rovide info whether it is ossible to do PCA, FA but do not give info of the magic number - how comonents/factors are needed 17 JRC-COIN Ste 7: Statistical Coherence (II)
18 How to get the PCs Eigenvalue decomosition of a data correlation matrix (case of CI), (or covariance matrix) Singular Value Decomosition (SVD) of a data matrix after mean centering (normalizing) the data matrix 18 JRC-COIN Ste 7: Statistical Coherence (II)
19 Finding the magic number - determining how many comonents in PCA and factors in FA Many methods exist. The 3 most common are: 1) Kaiser Guttman Eigenvalues greater than one criterion (Guttman (1954), Kaiser (1960)). Select all comonents with eigenvalues over 1 (or 0.9) 2) Cattell s scree test (Cattell (1966)) Above the elbow aroach Scree 3) Certain ercentage of exlained variance e.g., >2/3, 75%, 80%, 19 JRC-COIN Ste 7: Statistical Coherence (II)
20 Examle PCA: The Global Talent Cometitiveness Index 2017 (GTCI 2017) 1. Enable Use PCA to exlore the dimensionality of the first sub-illar Regulatory Landscae The 5 variables should be correlated. Suorts the assumtion that they are all measuring the same concet! 20 JRC-COIN Ste 7: Statistical Coherence (II)
21 GTCI 2017 Sub-illar Regulatory Landscae 1) Bartlett s shericity test χ² - test statistic 588, df= (-1)/2=10 -value <<< 0.01 Reject H 0 2) KMO Measure for Samling Adequacy Overall MSA = 0.86 KMO>0.6 Check the correlation structure MSA for each variable Government Effectiveness = 0.79 Business-Government Relations = 0.96 Political Stability = 0.94 Regulatory Quality = 0.87 Corrution = JRC-COIN Ste 7: Statistical Coherence (II)
22 GTCI 2017 Sub-illar Regulatory Landscae PCA results Total variance exlained Comonent Eigenvalue Variance Cumulative variance 3,85 = 0, , , , ,95 2 Pearson correlation coefficients between illar and rincial comonent 0,77 = 3,85/5 PC1 PC2 PC3 PC4 PC5 1 3,85 0,77 0,77 1. Government Effectiveness 0,96-0,09-0,17-0,06-0,2 2 0,65 0,13 0,90 2. Business-Government Relations 0,67 0,73 0,14 0,00-0,01 3 0,31 0,06 0,96 3. Political Stability 0,84-0,31 0,44 0,06-0,01 4 0,12 0,02 0,99 4. Regulatory Quality 0,94-0,05-0,23 0,24 0,08 5 0,06 0,01 1,00 5. Corrution 0,95-0,10-0,09-0,24 0,13 Sum 5 1 SS 3,85 0,65 0,31 0,12 0,06 Stoing criterion Eigenvalue>0,90 (or cumulative variance over 0,70) Comonent loadings = 0.96X + GE 0.67X + BGR 0.84X + 1 PS 0.94X + RQ PC 0. 95X C One dimension verified! 22 JRC-COIN Ste 7: Statistical Coherence (II)
23 Examle 2 PCA: The Global Talent Cometitiveness Index 2017 (GTCI 2017) 5. VT Skills Use PCA to exlore the dimensionality of the sub-illar Emloyability 4 variables correlated? 1. Ease of finding skilled emloyees 2. Rel of education system to the eco 3. Availability of scientists and engineers 4. Skills ga as major constraint 23 JRC-COIN Ste 7: Statistical Coherence (II)
24 GTCI 2017 Sub-illar Emloyability 1) Bartlett s shericity test χ² - test statistic 264, df= (-1)/2=6 -value <<< 0.01 Reject H 0 2) KMO Measure for Samling Adequacy Overall MSA = 0.73 KMO>0.6 Check the correlation structure MSA for each variable 1. Ease of finding skilled emloyees = Relev of education system to the eco = Availability of scientists and engineers = Skills ga as major constraint = JRC-COIN Ste 7: Statistical Coherence (II)
25 GTCI 2017 Sub-illar Emloyability PCA results Total variance exlained Comonent Eigenvalue Variance Cumulative variance Pearson correlation coefficients between illar and rincial comonent PC1 PC2 PC3 PC4 1 2,66 0,66 0,66 Ind 1 0,91-0,28-0,08 0,30 2 0,91 0,23 0,89 Ind 2 0,90-0,09 0,41-0,12 3 0,28 0,07 0,96 Ind 3 0,92-0,04-0,33-0,21 4 0,15 0,04 1,00 Ind 4 0,42 0,91 0,00 0,07 SS 2,66 0,91 0,28 0,15 Stoing criterion Eigenvalue>0,90 (or cumulative variance over 0,70) PC PC 1 2 = 0,91X + 0,90X + 0,92X + 0,42X 2 PCs retrieved but Indicator 4 Skills 1 = 0.28X X 2 3 0,04X ,91X 4 ga as major constraint correlates considerably better with the PC2 25 JRC-COIN Ste 7: Statistical Coherence (II)
26 GTCI 2017 Sub-illar Emloyability PCA results Indicator 4 Skills ga as major constraint roblematic JRC Audit reort 2017: In the 2017 data set, the indicator Skills ga as major constraint has a very low imact on the GTCI (Index) not found to be imortant at the overall index level. It is suggested that the GTCI develoment team reconsider the inclusion of this variable (or relace it with other variable/s) in next year s release. Correlation structure Indicators, Sub-illar, Pillar and Index GTCI 2018 reort (to be ublished, January 2018): Indicator Skills ga as a major constraint has been deleted entirely from the framework, as ointed out by the JRC last year. 26 JRC-COIN Ste 7: Statistical Coherence (II)
27 Cronbach s α Cronbach s α is measure of internal consistency and reliability Based uon classical test theory. Most common estimate of reliability, Cronbach (1951) Cronbach s α is defined as Ratio of the Total Covariance in the test to the Total Variance in the test. By test we mean the set of indicators constituting a sub-illar/ illar/ci. Let X ) be the variance of indicator i and Var X total variance in sub-illar/illar/ci Var ( ) ( i i = 1 Average covariance of an indicator with any other indicator is α = ( Var( X ) ) i= i Var( X i= i ) Var( X i )/ i= 1 / ( 1) = Var 1 i Var ( X i ) i= 1 i= 1 ( 1) ( X ) i= i Var( X i= i ) 1 1 ( ) Var X i= 1 i Var( X i ) = number of indicators 27 JRC-COIN Ste 7: Statistical Coherence (II)
28 Cronbach s α Cronbach s α is measure of internal consistency and reliability α = 1 1 i= 1 Var Var( X ) ( ) X i= 1 i i = number of indicators α coefficient ranges from 0 to 1. If all indicators are entirely indeendent from one another (i.e. are not correlated or share no covariance) α = 0. If all of the items have high covariances, then α will aroach 1. Caution! α increases when the number of indicators increases Cronbach (1951) 28 JRC-COIN Ste 7: Statistical Coherence (II)
29 Cronbach s α cut off values Cronbach s α is measure of internal consistency and reliability α = 1 1 i= 1 Var Var( X ) ( ) X i= 1 i i Cronbach's alha Internal consistency 0.9 α Excellent 0.8 α < 0.9 Good 0.7 α < 0.8 Accetable 0.6 α < 0.7 Questionable 0.5 α < 0.6 Poor α < 0.5 Unaccetable 29 JRC-COIN Ste 7: Statistical Coherence (II)
30 Cronbach s α GTCI 2017 Sub-illar Regulatory Landscae Cronbach s α is measure of internal consistency and reliability GTCI Sub-illar "Regulatory Landscae" Indicator Cronbach's Alha Corrected Item-Total Correlation 0,92 Cronbach s Alha if indicator is droed Ind 1. Government Effectiveness 0,93 0,88 Ind 2. Business- Government Relations 0,55 0,95 Ind 3. Political Stability 0,75 0,92 Ind 4. Regulatory Quality 0,89 0,89 Ind 5. Corrution 0,92 0,88 Cronbach's alha Internal consistency 0.9 α Excellent 0.8 α < 0.9 Good 0.7 α < 0.8 Accetable 0.6 α < 0.7 Questionable 0.5 α < 0.6 Poor α < 0.5 Unaccetable 30 JRC-COIN Ste 7: Statistical Coherence (II)
31 Cronbach s α GTCI 2017 Sub-illar Regulatory Landscae Cronbach s α is measure of internal consistency and reliability "Emloyability" GTCI Sub-illar Indicator Cronbach's Alha Corrected Item-Total Correlation 0,73 Cronbach s Alha if indicator is droed Ind 1. Ease of finding skilled emloyees 0,68 0,62 Ind 2. Relev of educ system to the economy 0,67 0,58 Ind 3. Availability of scientists and engineers 0,75 0,58 Ind 4. Skills ga as major constraint 0,23 0,90 Cronbach's alha Internal consistency 0.9 α Excellent 0.8 α < 0.9 Good 0.7 α < 0.8 Accetable 0.6 α < 0.7 Questionable 0.5 α < 0.6 Poor α < 0.5 Unaccetable Ind 4 roblematic and Cronbach s α may hence be used as another measure to include/exclude indicators! 31 JRC-COIN Ste 7: Statistical Coherence (II)
32 Last... PCA for discrete data A ractically imortant violation of the normality assumtion underlying the PCA occurs when the data are discrete! Several kinds of discrete data (binary, ordinal, count, nominal ) Different ways on how to roceed: 1) Simle aroach, use the discrete x s as if they were continuous in the PCA but instead of using the Pearson moment correlation coefficients use Searman or Kendall rank correlation coefficients. 2) PCA using olychoric correlations. Pearson (1901) founder of this aroach (as well!), develoed further by Olsson (1979) and Jöreskog (2004) 32 JRC-COIN Ste 7: Statistical Coherence (II)
33 Last... PCA using olychoric correlations The olychoric correlation (PCC) coefficient is a measure of association for ordinal variables which rests uon an assumtion of an underlying joint continuous normal distribution. (Later generalizations of different continuous distributions). PCC coefficient is MLE of the roduct-moment correlation between the underlying normal variables 33 JRC-COIN Ste 7: Statistical Coherence (II)
34 References Books Johnson, Alied Multivariate Statistical Analysis (6th Revised edition), Pearson Education Limited (2013) Jolliffe, Princial Comonent Analysis, Second Edition, Sringer (2002) Leandre, Fabrigar, Duane, Exloratory Factor Analysis, Oxford (2011) Rencher, Methods of multivariate analysis, Wiley (1995) Tinsley and Brown, Handbook of Alied Multivariate Statistics and Mathematical Modeling, Academic Press (2000) Tucker, MacCallum, Exloratory Factor Analysis (1997) 34 JRC-COIN Ste 7: Statistical Coherence (II)
35 THANK YOU Any questions? Welcome to us at: You may contact us & The Euroean Commission s Cometence Centre on Comosite Indicators and Scoreboards COIN in the EU Science Hub htts://ec.euroa.eu/jrc/en/coin COIN tools are available at: htts://comosite-indicators.jrc.ec.euroa.eu/
The European Commission s science and knowledge service. Joint Research Centre
The European Commission s science and knowledge service Joint Research Centre Step 7: Statistical coherence (II) Principal Component Analysis and Reliability Analysis Hedvig Norlén COIN 2018-16th JRC Annual
More informationUsing Factor Analysis to Study the Effecting Factor on Traffic Accidents
Using Factor Analysis to Study the Effecting Factor on Traffic Accidents Abstract Layla A. Ahmed Deartment of Mathematics, College of Education, University of Garmian, Kurdistan Region Iraq This aer is
More informationApplied Multivariate Analysis
Department of Mathematics and Statistics, University of Vaasa, Finland Spring 2017 Dimension reduction Exploratory (EFA) Background While the motivation in PCA is to replace the original (correlated) variables
More information2. Sample representativeness. That means some type of probability/random sampling.
1 Neuendorf Cluster Analysis Assumes: 1. Actually, any level of measurement (nominal, ordinal, interval/ratio) is accetable for certain tyes of clustering. The tyical methods, though, require metric (I/R)
More informationHotelling s Two- Sample T 2
Chater 600 Hotelling s Two- Samle T Introduction This module calculates ower for the Hotelling s two-grou, T-squared (T) test statistic. Hotelling s T is an extension of the univariate two-samle t-test
More informationNotes on Instrumental Variables Methods
Notes on Instrumental Variables Methods Michele Pellizzari IGIER-Bocconi, IZA and frdb 1 The Instrumental Variable Estimator Instrumental variable estimation is the classical solution to the roblem of
More information4. Score normalization technical details We now discuss the technical details of the score normalization method.
SMT SCORING SYSTEM This document describes the scoring system for the Stanford Math Tournament We begin by giving an overview of the changes to scoring and a non-technical descrition of the scoring rules
More informationFactor analysis. George Balabanis
Factor analysis George Balabanis Key Concepts and Terms Deviation. A deviation is a value minus its mean: x - mean x Variance is a measure of how spread out a distribution is. It is computed as the average
More informationMULTIVARIATE SHEWHART QUALITY CONTROL FOR STANDARD DEVIATION
MULTIVARIATE SHEWHART QUALITY CONTROL FOR STANDARD DEVIATION M. Jabbari Nooghabi, Deartment of Statistics, Faculty of Mathematical Sciences, Ferdowsi University of Mashhad, Mashhad-Iran. and H. Jabbari
More informationResearch of power plant parameter based on the Principal Component Analysis method
Research of ower lant arameter based on the Princial Comonent Analysis method Yang Yang *a, Di Zhang b a b School of Engineering, Bohai University, Liaoning Jinzhou, 3; Liaoning Datang international Jinzhou
More informationUsing a Computational Intelligence Hybrid Approach to Recognize the Faults of Variance Shifts for a Manufacturing Process
Journal of Industrial and Intelligent Information Vol. 4, No. 2, March 26 Using a Comutational Intelligence Hybrid Aroach to Recognize the Faults of Variance hifts for a Manufacturing Process Yuehjen E.
More informationHidden Predictors: A Factor Analysis Primer
Hidden Predictors: A Factor Analysis Primer Ryan C Sanchez Western Washington University Factor Analysis is a owerful statistical method in the modern research sychologist s toolbag When used roerly, factor
More informationEstimation of the large covariance matrix with two-step monotone missing data
Estimation of the large covariance matrix with two-ste monotone missing data Masashi Hyodo, Nobumichi Shutoh 2, Takashi Seo, and Tatjana Pavlenko 3 Deartment of Mathematical Information Science, Tokyo
More informationFeedback-error control
Chater 4 Feedback-error control 4.1 Introduction This chater exlains the feedback-error (FBE) control scheme originally described by Kawato [, 87, 8]. FBE is a widely used neural network based controller
More informationUniform Law on the Unit Sphere of a Banach Space
Uniform Law on the Unit Shere of a Banach Sace by Bernard Beauzamy Société de Calcul Mathématique SA Faubourg Saint Honoré 75008 Paris France Setember 008 Abstract We investigate the construction of a
More informationA STUDY ON THE UTILIZATION OF COMPATIBILITY METRIC IN THE AHP: APPLYING TO SOFTWARE PROCESS ASSESSMENTS
ISAHP 2005, Honolulu, Hawaii, July 8-10, 2003 A SUDY ON HE UILIZAION OF COMPAIBILIY MERIC IN HE AHP: APPLYING O SOFWARE PROCESS ASSESSMENS Min-Suk Yoon Yosu National University San 96-1 Dundeok-dong Yeosu
More informationEcological Resemblance. Ecological Resemblance. Modes of Analysis. - Outline - Welcome to Paradise
Ecological Resemblance - Outline - Ecological Resemblance Mode of analysis Analytical saces Association Coefficients Q-mode similarity coefficients Symmetrical binary coefficients Asymmetrical binary coefficients
More informationCHAPTER 5 STATISTICAL INFERENCE. 1.0 Hypothesis Testing. 2.0 Decision Errors. 3.0 How a Hypothesis is Tested. 4.0 Test for Goodness of Fit
Chater 5 Statistical Inference 69 CHAPTER 5 STATISTICAL INFERENCE.0 Hyothesis Testing.0 Decision Errors 3.0 How a Hyothesis is Tested 4.0 Test for Goodness of Fit 5.0 Inferences about Two Means It ain't
More informationPrinciples of factor analysis. Roger Watson
Principles of factor analysis Roger Watson Factor analysis Factor analysis Factor analysis Factor analysis is a multivariate statistical method for reducing large numbers of variables to fewer underlying
More informationarxiv: v1 [physics.data-an] 26 Oct 2012
Constraints on Yield Parameters in Extended Maximum Likelihood Fits Till Moritz Karbach a, Maximilian Schlu b a TU Dortmund, Germany, moritz.karbach@cern.ch b TU Dortmund, Germany, maximilian.schlu@cern.ch
More informationMultivariate and Multivariable Regression. Stella Babalola Johns Hopkins University
Multivariate and Multivariable Regression Stella Babalola Johns Hopkins University Session Objectives At the end of the session, participants will be able to: Explain the difference between multivariable
More informationNumerical Linear Algebra
Numerical Linear Algebra Numerous alications in statistics, articularly in the fitting of linear models. Notation and conventions: Elements of a matrix A are denoted by a ij, where i indexes the rows and
More informationTests for Two Proportions in a Stratified Design (Cochran/Mantel-Haenszel Test)
Chater 225 Tests for Two Proortions in a Stratified Design (Cochran/Mantel-Haenszel Test) Introduction In a stratified design, the subects are selected from two or more strata which are formed from imortant
More informationVAR2 VAR3 VAR4 VAR5. Or, in terms of basic measurement theory, we could model it as:
1 Neuendorf Factor Analysis Assumptions: 1. Metric (interval/ratio) data 2. Linearity (in the relationships among the variables) -Factors are linear constructions of the set of variables (see #8 under
More informationCHAPTER-II Control Charts for Fraction Nonconforming using m-of-m Runs Rules
CHAPTER-II Control Charts for Fraction Nonconforming using m-of-m Runs Rules. Introduction: The is widely used in industry to monitor the number of fraction nonconforming units. A nonconforming unit is
More informationOr, in terms of basic measurement theory, we could model it as:
1 Neuendorf Factor Analysis Assumptions: 1. Metric (interval/ratio) data 2. Linearity (in relationships among the variables--factors are linear constructions of the set of variables; the critical source
More informationA Qualitative Event-based Approach to Multiple Fault Diagnosis in Continuous Systems using Structural Model Decomposition
A Qualitative Event-based Aroach to Multile Fault Diagnosis in Continuous Systems using Structural Model Decomosition Matthew J. Daigle a,,, Anibal Bregon b,, Xenofon Koutsoukos c, Gautam Biswas c, Belarmino
More informationCombining Logistic Regression with Kriging for Mapping the Risk of Occurrence of Unexploded Ordnance (UXO)
Combining Logistic Regression with Kriging for Maing the Risk of Occurrence of Unexloded Ordnance (UXO) H. Saito (), P. Goovaerts (), S. A. McKenna (2) Environmental and Water Resources Engineering, Deartment
More informationANALYSIS ON PROBLEMS AND MOTIVATION OF POST 90S UNDERGRADUATES' ETIQUETTE
ANALYSIS ON PROBLEMS AND MOTIVATION OF POST 90S UNDERGRADUATES' ETIQUETTE WEIHONG KONG Deartment of Social Sciences, Handan College, Handan 056005, Hebei, China ABSTRACT At resent, the ost 90s has become
More informationUncorrelated Multilinear Principal Component Analysis for Unsupervised Multilinear Subspace Learning
TNN-2009-P-1186.R2 1 Uncorrelated Multilinear Princial Comonent Analysis for Unsuervised Multilinear Subsace Learning Haiing Lu, K. N. Plataniotis and A. N. Venetsanooulos The Edward S. Rogers Sr. Deartment
More informationMATHEMATICS ELEMENTARY STATISTICAL TABLES. F D J Dunstan, A B J Nix, J F Reynolds, R J Rowlands.
MATHEMATICS ELEMENTARY STATISTICAL TABLES F D J Dunstan, A B J Nix, J F Reynolds, R J Rowlands www.wjec.co.uk MATHEMATICS Elementary Statistical Tables F D J Dunstan, A B J Nix, J F Reynolds, R J Rowlands
More informationMULTIVARIATE STATISTICAL PROCESS OF HOTELLING S T CONTROL CHARTS PROCEDURES WITH INDUSTRIAL APPLICATION
Journal of Statistics: Advances in heory and Alications Volume 8, Number, 07, Pages -44 Available at htt://scientificadvances.co.in DOI: htt://dx.doi.org/0.864/jsata_700868 MULIVARIAE SAISICAL PROCESS
More informationScaling Multiple Point Statistics for Non-Stationary Geostatistical Modeling
Scaling Multile Point Statistics or Non-Stationary Geostatistical Modeling Julián M. Ortiz, Steven Lyster and Clayton V. Deutsch Centre or Comutational Geostatistics Deartment o Civil & Environmental Engineering
More informationDeriving Indicator Direct and Cross Variograms from a Normal Scores Variogram Model (bigaus-full) David F. Machuca Mory and Clayton V.
Deriving ndicator Direct and Cross Variograms from a Normal Scores Variogram Model (bigaus-full) David F. Machuca Mory and Clayton V. Deutsch Centre for Comutational Geostatistics Deartment of Civil &
More informationChapter 13 Variable Selection and Model Building
Chater 3 Variable Selection and Model Building The comlete regsion analysis deends on the exlanatory variables ent in the model. It is understood in the regsion analysis that only correct and imortant
More informationA MIXED CONTROL CHART ADAPTED TO THE TRUNCATED LIFE TEST BASED ON THE WEIBULL DISTRIBUTION
O P E R A T I O N S R E S E A R C H A N D D E C I S I O N S No. 27 DOI:.5277/ord73 Nasrullah KHAN Muhammad ASLAM 2 Kyung-Jun KIM 3 Chi-Hyuck JUN 4 A MIXED CONTROL CHART ADAPTED TO THE TRUNCATED LIFE TEST
More informationTowards understanding the Lorenz curve using the Uniform distribution. Chris J. Stephens. Newcastle City Council, Newcastle upon Tyne, UK
Towards understanding the Lorenz curve using the Uniform distribution Chris J. Stehens Newcastle City Council, Newcastle uon Tyne, UK (For the Gini-Lorenz Conference, University of Siena, Italy, May 2005)
More informationPlotting the Wilson distribution
, Survey of English Usage, University College London Setember 018 1 1. Introduction We have discussed the Wilson score interval at length elsewhere (Wallis 013a, b). Given an observed Binomial roortion
More informationA Factor Analysis of Key Decision Factors for Implementing SOA-based Enterprise level Business Intelligence Systems
icbst 2014 International Conference on Business, Science and Technology which will be held at Hatyai, Thailand on the 25th and 26th of April 2014. AENSI Journals Australian Journal of Basic and Applied
More informationUnsupervised Hyperspectral Image Analysis Using Independent Component Analysis (ICA)
Unsuervised Hyersectral Image Analysis Using Indeendent Comonent Analysis (ICA) Shao-Shan Chiang Chein-I Chang Irving W. Ginsberg Remote Sensing Signal and Image Processing Laboratory Deartment of Comuter
More informationBayesian Spatially Varying Coefficient Models in the Presence of Collinearity
Bayesian Satially Varying Coefficient Models in the Presence of Collinearity David C. Wheeler 1, Catherine A. Calder 1 he Ohio State University 1 Abstract he belief that relationshis between exlanatory
More informationMATHEMATICAL MODELLING OF THE WIRELESS COMMUNICATION NETWORK
Comuter Modelling and ew Technologies, 5, Vol.9, o., 3-39 Transort and Telecommunication Institute, Lomonosov, LV-9, Riga, Latvia MATHEMATICAL MODELLIG OF THE WIRELESS COMMUICATIO ETWORK M. KOPEETSK Deartment
More informationOne-way ANOVA Inference for one-way ANOVA
One-way ANOVA Inference for one-way ANOVA IPS Chater 12.1 2009 W.H. Freeman and Comany Objectives (IPS Chater 12.1) Inference for one-way ANOVA Comaring means The two-samle t statistic An overview of ANOVA
More informationDimensionality Reduction Techniques (DRT)
Dimensionality Reduction Techniques (DRT) Introduction: Sometimes we have lot of variables in the data for analysis which create multidimensional matrix. To simplify calculation and to get appropriate,
More informationElements of Asymptotic Theory. James L. Powell Department of Economics University of California, Berkeley
Elements of Asymtotic Theory James L. Powell Deartment of Economics University of California, Berkeley Objectives of Asymtotic Theory While exact results are available for, say, the distribution of the
More informationPrincipal Components Analysis and Unsupervised Hebbian Learning
Princial Comonents Analysis and Unsuervised Hebbian Learning Robert Jacobs Deartment of Brain & Cognitive Sciences University of Rochester Rochester, NY 1467, USA August 8, 008 Reference: Much of the material
More informationEcon 3790: Business and Economics Statistics. Instructor: Yogesh Uppal
Econ 379: Business and Economics Statistics Instructor: Yogesh Ual Email: yual@ysu.edu Chater 9, Part A: Hyothesis Tests Develoing Null and Alternative Hyotheses Tye I and Tye II Errors Poulation Mean:
More informationFinite Mixture EFA in Mplus
Finite Mixture EFA in Mlus November 16, 2007 In this document we describe the Mixture EFA model estimated in Mlus. Four tyes of deendent variables are ossible in this model: normally distributed, ordered
More informationAdaptive estimation with change detection for streaming data
Adative estimation with change detection for streaming data A thesis resented for the degree of Doctor of Philosohy of the University of London and the Diloma of Imerial College by Dean Adam Bodenham Deartment
More informationA New Asymmetric Interaction Ridge (AIR) Regression Method
A New Asymmetric Interaction Ridge (AIR) Regression Method by Kristofer Månsson, Ghazi Shukur, and Pär Sölander The Swedish Retail Institute, HUI Research, Stockholm, Sweden. Deartment of Economics and
More informationChapter 1: PROBABILITY BASICS
Charles Boncelet, obability, Statistics, and Random Signals," Oxford University ess, 0. ISBN: 978-0-9-0005-0 Chater : PROBABILITY BASICS Sections. What Is obability?. Exeriments, Outcomes, and Events.
More informationLECTURE 7 NOTES. x n. d x if. E [g(x n )] E [g(x)]
LECTURE 7 NOTES 1. Convergence of random variables. Before delving into the large samle roerties of the MLE, we review some concets from large samle theory. 1. Convergence in robability: x n x if, for
More informationEstimating Internal Consistency Using Bayesian Methods
Journal of Modern Alied Statistical Methods Volume 0 Issue Article 25 5--20 Estimating Internal Consistency Using Bayesian Methods Miguel A. Padilla Old Dominion University, maadill@odu.edu Guili Zhang
More informationChapter 7 Sampling and Sampling Distributions. Introduction. Selecting a Sample. Introduction. Sampling from a Finite Population
Chater 7 and s Selecting a Samle Point Estimation Introduction to s of Proerties of Point Estimators Other Methods Introduction An element is the entity on which data are collected. A oulation is a collection
More informationAI*IA 2003 Fusion of Multiple Pattern Classifiers PART III
AI*IA 23 Fusion of Multile Pattern Classifiers PART III AI*IA 23 Tutorial on Fusion of Multile Pattern Classifiers by F. Roli 49 Methods for fusing multile classifiers Methods for fusing multile classifiers
More informationSTA 250: Statistics. Notes 7. Bayesian Approach to Statistics. Book chapters: 7.2
STA 25: Statistics Notes 7. Bayesian Aroach to Statistics Book chaters: 7.2 1 From calibrating a rocedure to quantifying uncertainty We saw that the central idea of classical testing is to rovide a rigorous
More informationEvaluating Circuit Reliability Under Probabilistic Gate-Level Fault Models
Evaluating Circuit Reliability Under Probabilistic Gate-Level Fault Models Ketan N. Patel, Igor L. Markov and John P. Hayes University of Michigan, Ann Arbor 48109-2122 {knatel,imarkov,jhayes}@eecs.umich.edu
More informationBenoît MULKAY Université de Montpellier. January Preliminary, Do not quote!
Bivariate Probit Estimation for Panel Data: a two-ste Gauss-Hermite Quadrature Aroach with an alication to roduct and rocess innovations for France Benoît MULKAY Université de Montellier January 05 Preliminary,
More informationAn Improved Generalized Estimation Procedure of Current Population Mean in Two-Occasion Successive Sampling
Journal of Modern Alied Statistical Methods Volume 15 Issue Article 14 11-1-016 An Imroved Generalized Estimation Procedure of Current Poulation Mean in Two-Occasion Successive Samling G. N. Singh Indian
More informationStructural Equation Modeling and Confirmatory Factor Analysis. Types of Variables
/4/04 Structural Equation Modeling and Confirmatory Factor Analysis Advanced Statistics for Researchers Session 3 Dr. Chris Rakes Website: http://csrakes.yolasite.com Email: Rakes@umbc.edu Twitter: @RakesChris
More information7.2 Inference for comparing means of two populations where the samples are independent
Objectives 7.2 Inference for comaring means of two oulations where the samles are indeendent Two-samle t significance test (we give three examles) Two-samle t confidence interval htt://onlinestatbook.com/2/tests_of_means/difference_means.ht
More informationMetrics Performance Evaluation: Application to Face Recognition
Metrics Performance Evaluation: Alication to Face Recognition Naser Zaeri, Abeer AlSadeq, and Abdallah Cherri Electrical Engineering Det., Kuwait University, P.O. Box 5969, Safat 6, Kuwait {zaery, abeer,
More informationMODELING THE RELIABILITY OF C4ISR SYSTEMS HARDWARE/SOFTWARE COMPONENTS USING AN IMPROVED MARKOV MODEL
Technical Sciences and Alied Mathematics MODELING THE RELIABILITY OF CISR SYSTEMS HARDWARE/SOFTWARE COMPONENTS USING AN IMPROVED MARKOV MODEL Cezar VASILESCU Regional Deartment of Defense Resources Management
More informationMorten Frydenberg Section for Biostatistics Version :Friday, 05 September 2014
Morten Frydenberg Section for Biostatistics Version :Friday, 05 Setember 204 All models are aroximations! The best model does not exist! Comlicated models needs a lot of data. lower your ambitions or get
More informationVersion 1.0. klm. General Certificate of Education June Mathematics. Statistics 4. Mark Scheme
Version.0 klm General Certificate of Education June 00 Mathematics MS04 Statistics 4 Mark Scheme Mark schemes are reared by the Princial Examiner and considered, together with the relevant questions, by
More informationASYMPTOTIC RESULTS OF A HIGH DIMENSIONAL MANOVA TEST AND POWER COMPARISON WHEN THE DIMENSION IS LARGE COMPARED TO THE SAMPLE SIZE
J Jaan Statist Soc Vol 34 No 2004 9 26 ASYMPTOTIC RESULTS OF A HIGH DIMENSIONAL MANOVA TEST AND POWER COMPARISON WHEN THE DIMENSION IS LARGE COMPARED TO THE SAMPLE SIZE Yasunori Fujikoshi*, Tetsuto Himeno
More informationMultivariate analyses in microbial ecology
MINIREVIEW Multivariate analyses in microbial ecology Alban Ramette Microbial habitat grou, Max Planck Institute for Marine Microbiology, Bremen, Germany OnlineOen: This article is available free online
More information2/26/2017. This is similar to canonical correlation in some ways. PSY 512: Advanced Statistics for Psychological and Behavioral Research 2
PSY 512: Advanced Statistics for Psychological and Behavioral Research 2 What is factor analysis? What are factors? Representing factors Graphs and equations Extracting factors Methods and criteria Interpreting
More informationThe Binomial Approach for Probability of Detection
Vol. No. (Mar 5) - The e-journal of Nondestructive Testing - ISSN 45-494 www.ndt.net/?id=7498 The Binomial Aroach for of Detection Carlos Correia Gruo Endalloy C.A. - Caracas - Venezuela www.endalloy.net
More informationOn the Chvatál-Complexity of Knapsack Problems
R u t c o r Research R e o r t On the Chvatál-Comlexity of Knasack Problems Gergely Kovács a Béla Vizvári b RRR 5-08, October 008 RUTCOR Rutgers Center for Oerations Research Rutgers University 640 Bartholomew
More informationThe analysis and representation of random signals
The analysis and reresentation of random signals Bruno TOÉSNI Bruno.Torresani@cmi.univ-mrs.fr B. Torrésani LTP Université de Provence.1/30 Outline 1. andom signals Introduction The Karhunen-Loève Basis
More informationEcon 3790: Business and Economics Statistics. Instructor: Yogesh Uppal
Econ 379: Business and Economics Statistics Instructor: Yogesh Ual Email: yual@ysu.edu Chater 9, Part A: Hyothesis Tests Develoing Null and Alternative Hyotheses Tye I and Tye II Errors Poulation Mean:
More informationOn split sample and randomized confidence intervals for binomial proportions
On slit samle and randomized confidence intervals for binomial roortions Måns Thulin Deartment of Mathematics, Usala University arxiv:1402.6536v1 [stat.me] 26 Feb 2014 Abstract Slit samle methods have
More informationCharacterizing the Behavior of a Probabilistic CMOS Switch Through Analytical Models and Its Verification Through Simulations
Characterizing the Behavior of a Probabilistic CMOS Switch Through Analytical Models and Its Verification Through Simulations PINAR KORKMAZ, BILGE E. S. AKGUL and KRISHNA V. PALEM Georgia Institute of
More informationExploratory Factor Analysis and Canonical Correlation
Exploratory Factor Analysis and Canonical Correlation 3 Dec 2010 CPSY 501 Dr. Sean Ho Trinity Western University Please download: SAQ.sav Outline for today Factor analysis Latent variables Correlation
More informationStatics and dynamics: some elementary concepts
1 Statics and dynamics: some elementary concets Dynamics is the study of the movement through time of variables such as heartbeat, temerature, secies oulation, voltage, roduction, emloyment, rices and
More informationBrownian Motion and Random Prime Factorization
Brownian Motion and Random Prime Factorization Kendrick Tang June 4, 202 Contents Introduction 2 2 Brownian Motion 2 2. Develoing Brownian Motion.................... 2 2.. Measure Saces and Borel Sigma-Algebras.........
More informationBiostat Methods STAT 5500/6500 Handout #12: Methods and Issues in (Binary Response) Logistic Regression
Biostat Methods STAT 5500/6500 Handout #12: Methods and Issues in (Binary Resonse) Logistic Regression Recall general χ 2 test setu: Y 0 1 Trt 0 a b Trt 1 c d I. Basic logistic regression Previously (Handout
More informationThe spectrum of kernel random matrices
The sectrum of kernel random matrices Noureddine El Karoui Deartment of Statistics, University of California, Berkeley Abstract We lace ourselves in the setting of high-dimensional statistical inference,
More informationOpen Research Online The Open University s repository of research publications and other research outputs
Oen Research Online The Oen University s reository of research ublications and other research oututs A modified rincial comonent technique based on the LASSO Journal Item How to cite: Jolliffe, I.T.; Trendafilov,
More informationdn i where we have used the Gibbs equation for the Gibbs energy and the definition of chemical potential
Chem 467 Sulement to Lectures 33 Phase Equilibrium Chemical Potential Revisited We introduced the chemical otential as the conjugate variable to amount. Briefly reviewing, the total Gibbs energy of a system
More informationProbability Estimates for Multi-class Classification by Pairwise Coupling
Probability Estimates for Multi-class Classification by Pairwise Couling Ting-Fan Wu Chih-Jen Lin Deartment of Comuter Science National Taiwan University Taiei 06, Taiwan Ruby C. Weng Deartment of Statistics
More information16.2. Infinite Series. Introduction. Prerequisites. Learning Outcomes
Infinite Series 6. Introduction We extend the concet of a finite series, met in section, to the situation in which the number of terms increase without bound. We define what is meant by an infinite series
More informationImpact Damage Detection in Composites using Nonlinear Vibro-Acoustic Wave Modulations and Cointegration Analysis
11th Euroean Conference on Non-Destructive esting (ECND 214), October 6-1, 214, Prague, Czech Reublic More Info at Oen Access Database www.ndt.net/?id=16448 Imact Damage Detection in Comosites using Nonlinear
More informationApproximating min-max k-clustering
Aroximating min-max k-clustering Asaf Levin July 24, 2007 Abstract We consider the roblems of set artitioning into k clusters with minimum total cost and minimum of the maximum cost of a cluster. The cost
More information16.2. Infinite Series. Introduction. Prerequisites. Learning Outcomes
Infinite Series 6.2 Introduction We extend the concet of a finite series, met in Section 6., to the situation in which the number of terms increase without bound. We define what is meant by an infinite
More informationPreconditioning techniques for Newton s method for the incompressible Navier Stokes equations
Preconditioning techniques for Newton s method for the incomressible Navier Stokes equations H. C. ELMAN 1, D. LOGHIN 2 and A. J. WATHEN 3 1 Deartment of Comuter Science, University of Maryland, College
More informationGeneral Linear Model Introduction, Classes of Linear models and Estimation
Stat 740 General Linear Model Introduction, Classes of Linear models and Estimation An aim of scientific enquiry: To describe or to discover relationshis among events (variables) in the controlled (laboratory)
More informationA Comparison between Biased and Unbiased Estimators in Ordinary Least Squares Regression
Journal of Modern Alied Statistical Methods Volume Issue Article 7 --03 A Comarison between Biased and Unbiased Estimators in Ordinary Least Squares Regression Ghadban Khalaf King Khalid University, Saudi
More informationMulti-Operation Multi-Machine Scheduling
Multi-Oeration Multi-Machine Scheduling Weizhen Mao he College of William and Mary, Williamsburg VA 3185, USA Abstract. In the multi-oeration scheduling that arises in industrial engineering, each job
More informationSlides Prepared by JOHN S. LOUCKS St. Edward s s University Thomson/South-Western. Slide
s Preared by JOHN S. LOUCKS St. Edward s s University 1 Chater 11 Comarisons Involving Proortions and a Test of Indeendence Inferences About the Difference Between Two Poulation Proortions Hyothesis Test
More informationChapter 10. Supplemental Text Material
Chater 1. Sulemental Tet Material S1-1. The Covariance Matri of the Regression Coefficients In Section 1-3 of the tetbook, we show that the least squares estimator of β in the linear regression model y=
More informationJohn Weatherwax. Analysis of Parallel Depth First Search Algorithms
Sulementary Discussions and Solutions to Selected Problems in: Introduction to Parallel Comuting by Viin Kumar, Ananth Grama, Anshul Guta, & George Karyis John Weatherwax Chater 8 Analysis of Parallel
More informationUnobservable Selection and Coefficient Stability: Theory and Evidence
Unobservable Selection and Coefficient Stability: Theory and Evidence Emily Oster Brown University and NBER August 9, 016 Abstract A common aroach to evaluating robustness to omitted variable bias is to
More informationEstimating function analysis for a class of Tweedie regression models
Title Estimating function analysis for a class of Tweedie regression models Author Wagner Hugo Bonat Deartamento de Estatística - DEST, Laboratório de Estatística e Geoinformação - LEG, Universidade Federal
More informationMultivariate Statistics (I) 2. Principal Component Analysis (PCA)
Multivariate Statistics (I) 2. Principal Component Analysis (PCA) 2.1 Comprehension of PCA 2.2 Concepts of PCs 2.3 Algebraic derivation of PCs 2.4 Selection and goodness-of-fit of PCs 2.5 Algebraic derivation
More informationSystem Reliability Estimation and Confidence Regions from Subsystem and Full System Tests
009 American Control Conference Hyatt Regency Riverfront, St. Louis, MO, USA June 0-, 009 FrB4. System Reliability Estimation and Confidence Regions from Subsystem and Full System Tests James C. Sall Abstract
More information1 Extremum Estimators
FINC 9311-21 Financial Econometrics Handout Jialin Yu 1 Extremum Estimators Let θ 0 be a vector of k 1 unknown arameters. Extremum estimators: estimators obtained by maximizing or minimizing some objective
More informationBERNOULLI TRIALS and RELATED PROBABILITY DISTRIBUTIONS
BERNOULLI TRIALS and RELATED PROBABILITY DISTRIBUTIONS A BERNOULLI TRIALS Consider tossing a coin several times It is generally agreed that the following aly here ) Each time the coin is tossed there are
More informationUNCERTAINLY MEASUREMENT
UNCERTAINLY MEASUREMENT Jan Čaek, Martin Ibl Institute of System Engineering and Informatics, University of Pardubice, Pardubice, Czech Reublic caek@uce.cz, martin.ibl@uce.cz In recent years, a series
More information