Multivariate Statistics

Size: px
Start display at page:

Download "Multivariate Statistics"

Transcription

1 Multivariate Statistics Chapter 4: Factor analysis Pedro Galeano Departamento de Estadística Universidad Carlos III de Madrid Course 2017/2018 Master in Mathematical Engineering Pedro Galeano (Course 2017/2018) Multivariate Statistics - Chapter 4 Master in Mathematical Engineering 1 / 61

2 1 Introduction 2 The factor model 3 Estimation of the loading matrix and the factor scores Pedro Galeano (Course 2017/2018) Multivariate Statistics - Chapter 4 Master in Mathematical Engineering 2 / 61

3 Introduction Sometimes, it is not always possible to measure the quantities of interest directly. For instance, in Psychology, intelligence is a prime example. Usually, scores in subjects such as Mathematics, Language and Literature or comprehensive tests are used to describe a child s intelligence. One may wonder whether is possible to define a certain score to measure the person s intelligence from these measurements. Pedro Galeano (Course 2017/2018) Multivariate Statistics - Chapter 4 Master in Mathematical Engineering 3 / 61

4 Introduction Factor analysis (FA) is a statistical technique that recognizes that there is an association between certain measured quantities and some hidden quantities. More precisely, the main aims of FA is: 1 to exhibit the relationship between the measured and the underlying variables; and 2 to estimate the underlying variables, usually called hidden or latent variables. Pedro Galeano (Course 2017/2018) Multivariate Statistics - Chapter 4 Master in Mathematical Engineering 4 / 61

5 Introduction The model-based nature of FA has invited, and resulted in, many theoretical and statistical advances. Particularly, the factor model allows an elegant description of: 1 the underlying structure among the observable variables; 2 the hidden variables; and 3 the relationship between both type of variables. Pedro Galeano (Course 2017/2018) Multivariate Statistics - Chapter 4 Master in Mathematical Engineering 5 / 61

6 Introduction FA applies to data with fewer hidden variables than measured quantities, but we do not need to know explicitly the number of hidden variables. It is not possible to determine the hidden variables uniquely, and typically, we focus on expressing the observed variables in terms of a smaller number of uncorrelated factors. There is a strong connection with Principal Component Analysis, and we shall see that a principal component decomposition leads to one possible FA solution. However, there are important distinctions between PCA and FA. Pedro Galeano (Course 2017/2018) Multivariate Statistics - Chapter 4 Master in Mathematical Engineering 6 / 61

7 Introduction The rest of this chapter is devoted: to present the factor model and its main characteristics; and to see how to estimate the parameters of the model and the hidden factors. Pedro Galeano (Course 2017/2018) Multivariate Statistics - Chapter 4 Master in Mathematical Engineering 7 / 61

8 The factor model Let x = (x 1,..., x p ) be a multivariate random variable with mean vector µ x and covariance matrix Σ x. Note that we do not assume any distribution of the random variable x. However, one of the estimation methods presented later assumes Gaussianity of x. Pedro Galeano (Course 2017/2018) Multivariate Statistics - Chapter 4 Master in Mathematical Engineering 8 / 61

9 The factor model The factor model establishes that x can be written as follows: where: 1 µ x = E [x] is a p-dimensional vector. x = µ x + Lf + ɛ 2 L is a p r matrix of unknown constants called the loading matrix. 3 f is a r-dimensional random vector of latent variables, where r < p, with mean vector 0 r and covariance matrix I r (the factors). 4 ɛ is a p-dimensional unobserved vector of errors with mean vector 0 r and diagonal covariance matrix Σ ɛ. 5 It is assumed that the perturbations are uncorrelated with the factors, i.e., Cov [f, ɛ] = E [f ɛ ] = 0 r p and Cov [ɛ, f ] = E [ɛf ] = 0 p r. Pedro Galeano (Course 2017/2018) Multivariate Statistics - Chapter 4 Master in Mathematical Engineering 9 / 61

10 The factor model As in PCA, the driving force of the factor model is Σ x. From the factor model, it is not difficult to see that: Σ x = Cov [x] = E [ (x µ x ) (x µ x ) ] = LL + Σ ɛ Therefore, Σ x can be written in terms of L and Σ ɛ. Pedro Galeano (Course 2017/2018) Multivariate Statistics - Chapter 4 Master in Mathematical Engineering 10 / 61

11 The factor model Consequently, the variances of the variables in x are given by: σx,j 2 = L j L j + σɛ,j 2 = τ jj + σɛ,j 2 for j = 1,..., p, where: 1 τ jj is called the j-th communality; and 2 σɛ,j 2 is the variance of the j-th element of ɛ and is called the j-th uniqueness. Additionally, the covariances of the variables in x are given by: for j, k = 1,..., p with j k. σ x,jk = L j L k = τ jk Pedro Galeano (Course 2017/2018) Multivariate Statistics - Chapter 4 Master in Mathematical Engineering 11 / 61

12 The factor model The loading matrix, L, has a simple interpretation. Note that: Cov [x, f ] = E [(x µ x ) f ] = E [(Lf + ɛ) f ] = L Thus, L is the covariance between x (the multivariate random variable of interest) and f (the latent factors). Pedro Galeano (Course 2017/2018) Multivariate Statistics - Chapter 4 Master in Mathematical Engineering 12 / 61

13 The factor model In the factor model, neither the loading matrix, L, nor the factors, f, are observable. This poses the problem of indeterminacy: if H is a r r orthogonal matrix, then, the factor model can be written as: x = µ x + Lf + ɛ = µ x + LHH f + ɛ = µ x + L f + ɛ where L = LH and f = H f. Both models contain uncorrelated factors, with an identity covariance matrix. As a consequence, at best, the loading matrix and the factors are unique up to an orthogonal transformation because an orthogonal transformation of the factors leads to another factors, and similar relationships hold for the loading matrix. Indeed, we cannot uniquely recover the loading matrix or the factors from knowledge of the covariance matrix Σ x, as the next example illustrates. Pedro Galeano (Course 2017/2018) Multivariate Statistics - Chapter 4 Master in Mathematical Engineering 13 / 61

14 The factor model Assume a one factor model for a bivariate random variable x = (x 1, x 2 ) with mean 0 2 and covariance matrix: ( ) Σ x = The one-factor model is: x = so that, f is a univariate random variable. ( ) ( ) ( ) x1 L11 ɛ1 = f + x 2 L 21 ɛ 2 Pedro Galeano (Course 2017/2018) Multivariate Statistics - Chapter 4 Master in Mathematical Engineering 14 / 61

15 The factor model Therefore, Σ x can be written as follows: ( ) ( ) ( L11 ( ) σ 2 Σ x = = L11 L L 21 + ɛ, σɛ,22 2 ( ) ( ) L 2 = 11 L 11 L 21 σ 2 L 11 L 21 L 2 + ɛ, σɛ,22 2 ) = Consequently, 1.25 = L σɛ, = L 11 L = L σɛ,22 2 A solution for L is L 11 = 1 and L 21 = 0.5. Another option is L 11 = 0.75 and L 21 = Pedro Galeano (Course 2017/2018) Multivariate Statistics - Chapter 4 Master in Mathematical Engineering 15 / 61

16 The factor model Unless other information is available, it is not clear which solution one should pick. From the intuitive point of view, we might prefer the solution which has the smallest covariance matrix Σ ɛ, as measured by the trace of Σ ɛ. In this case, the solution with L 11 = 1 and L 21 = 0.5 would be preferable. The varimax criterion is a method for distinguishing between loading matrices which is easy to calculate and interpret. Pedro Galeano (Course 2017/2018) Multivariate Statistics - Chapter 4 Master in Mathematical Engineering 16 / 61

17 The factor model Let, VC (L) = r 1 p k=1 p L 4 jk 1 p j=1 p j=1 L 2 jk 2 The quantity VC (L) reminds us of a sample variance, with the difference that it is applied to the squared entries L 2 jk of L. Starting with a loading matrix L, we can consider rotated loading matrices L = LH, where H is an orthogonal r r matrix. The varimax criterion selects that orthogonal r r matrix which: H = arg maxvc (LH) H that leads to the loading matrix L = LH. As we shall see, varimax optimal rotations lead to visualizations of the loading matrix that admit an easier interpretation than unrotated loadings matrices. Pedro Galeano (Course 2017/2018) Multivariate Statistics - Chapter 4 Master in Mathematical Engineering 17 / 61

18 The factor model In the previous example, it is easy to show that VC (L) = , for the first solution, while VC (L) = , for the second solution. Then, the first solution is the selected by the varimax criterion. The two ways of choosing the loading matrix, namely, finding the loading matrix L with the smaller trace of Σ ɛ or finding that with the larger VC, are not equivalent. However, in this example, both ways resulted in the same solution. Note that, in this example, we have not found the optimal orthogonal matrix H, we have just compared two possible solutions. Pedro Galeano (Course 2017/2018) Multivariate Statistics - Chapter 4 Master in Mathematical Engineering 18 / 61

19 The factor model If the univariate variables in x = (x 1,..., x p ) have different units of measurement, it is preferable to consider the scaled variables. Then, the univariate standardization of the variables in x leads to the multivariate random variable y = 1/2 x (x µ x ), where x is a diagonal matrix with the variances of the variables in x. The multivariate random variable y has mean 0 p and covariance matrix ϱ x, the correlation matrix of x. Pedro Galeano (Course 2017/2018) Multivariate Statistics - Chapter 4 Master in Mathematical Engineering 19 / 61

20 The factor model Then, if x follows the factor model given by: x = µ x + Lf + ɛ with the properties given in slide 9, then y follows the factor model given by: with covariance matrix decomposition: y = 1/2 x Lf + 1/2 x ɛ ϱ x = 1/2 x LL 1/2 x + 1/2 x Σ ɛ 1/2 x Pedro Galeano (Course 2017/2018) Multivariate Statistics - Chapter 4 Master in Mathematical Engineering 20 / 61

21 The factor model As a consequence, the factor model for y is similar to the factor model for x with: 1 a loading matrix M = 1/2 x L; 2 a set of factors f (the same factors); and 3 a set of errors ε = 1/2 x ɛ with diagonal covariance matrix Σ ε = 1/2 x Σ ɛ 1/2 x. In other words, we have the factor model for y given by: with covariance matrix decomposition: y = Mf + ε ϱ x = MM + Σ ε Working with ϱ x is advantageous because the diagonal entries of ϱ x are 1, then the sum of communalities and uniquenesses are 1, and their interpretations become easier. Pedro Galeano (Course 2017/2018) Multivariate Statistics - Chapter 4 Master in Mathematical Engineering 21 / 61

22 Estimation of the loading matrix and the factor scores In practice, we have a data matrix X of dimension n p such that each row of X, x i, for i = 1,..., n, has been generated by the factor model. Given the data matrix X, the goal is to estimate the loading matrix, L, the covariance matrix of the errors, Σ ɛ, and the values of the factors for each observation x i, for i = 1,..., n, which are called the factor scores. There are two main approaches for such goals: non-distributional methods; and distributional methods. For Gaussian data, we expect that distributional methods that explicitly assumes Gaussianity of the data works better than non-distributional methods. Indeed methods based on Gaussian assumptions still work well if the distribution of the data does not deviate too much from Gaussianity. Pedro Galeano (Course 2017/2018) Multivariate Statistics - Chapter 4 Master in Mathematical Engineering 22 / 61

23 Estimation of the loading matrix and the factor scores There are two main non-distributional methods: 1 Principal component factor analysis; and 2 Principal factor analysis. Both methods are based on: 1 the covariance matrix decomposition given by Σ x = LL + Σ ɛ, if we work with x; or 2 the correlation matrix decomposition given by ϱ x = MM + Φ, if we work with the scaled variable y. For simplicity, we present the methods for x and the covariance decomposition but similar results are obtained for y and the correlation decomposition. Pedro Galeano (Course 2017/2018) Multivariate Statistics - Chapter 4 Master in Mathematical Engineering 23 / 61

24 Estimation of the loading matrix and the factor scores We start with the principal component factor analysis method. First, the spectral decomposition of Σ x is given by: Σ x = V p Λ p V p where: Vp is the matrix that contains the eigenvectors of Σ p; and Λp is the p p diagonal matrix that contains the eigenvalues of Σ p. Consequently, from the covariance matrix decomposition, we have that: V p Λ p V p = LL + Σ ɛ Pedro Galeano (Course 2017/2018) Multivariate Statistics - Chapter 4 Master in Mathematical Engineering 24 / 61

25 Estimation of the loading matrix and the factor scores If we assume that Σ ɛ = 0 p p, then V p Λ p V p = LL. Now, as L has dimension p r, LL is a matrix with rank r < p. As a consequence, Λ p contains p r eigenvalues equal to 0, so that, we can put: L = V r Λ 1/2 r where: Vr is the matrix that contains the eigenvectors of Σ x associated with the r eigenvalues of Σ x different of 0; and Λr is the r r diagonal matrix that contains these eigenvalues. Pedro Galeano (Course 2017/2018) Multivariate Statistics - Chapter 4 Master in Mathematical Engineering 25 / 61

26 Estimation of the loading matrix and the factor scores Now, given the data matrix X, we can compute the sample covariance matrix S x (or the sample correlation matrix R x, if the data have been scaled). Thus, eigenvectors and eigenvalues of Σ x (or ϱ x ) are replaced with those of S x (or R x ). The idea is to select r as in PCA, i.e., using the variance explained by the principal components obtained from S x or R x. Of course, in most of the situations the assumption Σ ɛ = 0 p p is unrealistic but the method use to provide with good estimates of the loading matrix. Pedro Galeano (Course 2017/2018) Multivariate Statistics - Chapter 4 Master in Mathematical Engineering 26 / 61

27 Illustrative example (I) Consider the eight univariate variables measured on the 50 states of the USA. We took the logarithm for the first, third and eighth variables. Additionally, we consider the sample correlation matrix rather than the sample covariance matrix. The first three eigenvectors explain the 76.69% of the total variability. Then, using the three largest eigenvalues of the correlation matrix and the associated eigenvectors, we can estimate the loading matrix, M (note that we are using the correlation matrix). For that we use the formula M = V Rx 3 ( Λ Rx 3 ) 1/2, where V R x 3 is the matrix with the first three eigenvectors of R x; and Λ R x 3 is the diagonal matrix with the three largest eigenvalues of R x. Pedro Galeano (Course 2017/2018) Multivariate Statistics - Chapter 4 Master in Mathematical Engineering 27 / 61

28 Illustrative example (I) The estimated loading matrix is given by: M = Pedro Galeano (Course 2017/2018) Multivariate Statistics - Chapter 4 Master in Mathematical Engineering 28 / 61

29 Illustrative example (I) Now, we can use the varimax rotation to obtain the final estimate of the loading matrix, that is given by: M = Pedro Galeano (Course 2017/2018) Multivariate Statistics - Chapter 4 Master in Mathematical Engineering 29 / 61

30 Illustrative example (I) The first factor distinguishes between cold states with rich, educated, long-lived and non-violent states, from warm states with poor, ill-educated, short-lived and violent states. The second factor distinguishes big states with rich and educated populations but violent and short-lived, from small states with long-lived people. The third factor distinguishes populated and violent states from less-populated, cold, non-violent, but short-lived, states. Pedro Galeano (Course 2017/2018) Multivariate Statistics - Chapter 4 Master in Mathematical Engineering 30 / 61

31 Illustrative example (I) Once that we have estimated the loading matrix, it is possible to estimate the covariance matrix of the errors, Σ ε, with the diagonal matrix of R x M ( M ). In this case, the estimated uniquenesses are given by 0.28, 0.35, 0.21, 0.24, 0.12, 0.17, 0.29 and Therefore, the variables better explained by the factors are murder, HS graduates and log-area. Estimation of the value of the factors (the factor scores) for each state will be given later. Pedro Galeano (Course 2017/2018) Multivariate Statistics - Chapter 4 Master in Mathematical Engineering 31 / 61

32 Estimation of the loading matrix and the factor scores In principal factor analysis, we also start with the equality: Σ x = LL + Σ ɛ Therefore, LL = Σ x Σ ɛ has to be a matrix with range r < p because L has dimension p r. As a consequence, Σ x Σ ɛ has p r eigenvalues equal to 0. Thus, the spectral decomposition of Σ x Σ ɛ is given by: where: Σ x Σ ɛ = U r Ω r U r Ur is the matrix that contains the eigenvectors of Σ x Σ ɛ associated with the r eigenvalues of Σ x Σ ɛ different of 0; and Ωr is the r r diagonal matrix that contains these eigenvalues. Therefore, we can put: L = U r Ω 1/2 r Pedro Galeano (Course 2017/2018) Multivariate Statistics - Chapter 4 Master in Mathematical Engineering 32 / 61

33 Estimation of the loading matrix and the factor scores As in the principal components factor analysis method, the covariance matrix Σ x (or the correlation matrix ϱ x, if the data have been scaled) is replaced with the sample covariance matrix S x (or the sample correlation matrix R x, if the data have been scaled). The problem with this method is that Σ ɛ is unknown and should be estimated. For that, we can use the estimate of Σ ɛ obtained with the principal components factor analysis method. Thus, eigenvectors and eigenvalues of Σ x Σ ɛ (or ϱ x Φ) are replaced with those of S x Σ ɛ (or R x Σ ε ). Pedro Galeano (Course 2017/2018) Multivariate Statistics - Chapter 4 Master in Mathematical Engineering 33 / 61

34 Illustrative example (I) Consider again the eight univariate variables measured on the 50 states of the USA. Note that we are working with the sample correlation matrix, R x. As with the principal components factor analysis method, we consider three factors because the first three eigenvectors of R x Σ ε explains the 88.8% of the total variability. Then, using the three largest eigenvalues of the correlation matrix and the associated eigenvectors, we can estimate the loading matrix, M. As in the previous method, we use the formula where: M = U Rx Σ ε 3 ( Ω Rx Σ ε 3 U R x Σ ε 3 is the matrix with the first three eigenvectors of R x Σ ε; and ) 1/2, Ω R x Σ ε 3 is the diagonal matrix with the three largest eigenvalues of R x Σ ε. Pedro Galeano (Course 2017/2018) Multivariate Statistics - Chapter 4 Master in Mathematical Engineering 34 / 61

35 Illustrative example (I) The estimated loading matrix is given by: M = Pedro Galeano (Course 2017/2018) Multivariate Statistics - Chapter 4 Master in Mathematical Engineering 35 / 61

36 Illustrative example (I) Now, we can use the varimax rotation to obtain the final estimate of the loading matrix, that is given by: M = Pedro Galeano (Course 2017/2018) Multivariate Statistics - Chapter 4 Master in Mathematical Engineering 36 / 61

37 Illustrative example (I) Note that both matrices are very close to those estimated with the principal components factor analysis method. Consequently, the interpretation of the factor is similar to that the obtained for the previous estimation method. Pedro Galeano (Course 2017/2018) Multivariate Statistics - Chapter 4 Master in Mathematical Engineering 37 / 61

38 Illustrative example (I) As in the previous case, once that we have estimated the loading matrix, it is possible to estimate the covariance matrix of the errors, Σ ɛ, with the diagonal matrix of R x M ( M ). In this case, the estimated uniquenesses are 0.42, 0.52, 0.26, 0.31, 0.15, 0.21, 0.38 and Therefore, the variables better explained by the factors are log-illiteracy, murder, HS Graduate and log-area. Pedro Galeano (Course 2017/2018) Multivariate Statistics - Chapter 4 Master in Mathematical Engineering 38 / 61

39 Estimation of the loading matrix and the factor scores We focus now in estimating the factor scores. The Bartlett factor scores method can be used with these non-parametric methods. Essentially, the Bartlett method assumes that the factors are parameters. Then, from the factor model, the p 1 vector x i, for i = 1,..., n, is given by: Therefore, by generalized least squares: x i = µ x + Lf i + ɛ i f i = ( L Σ 1 ɛ L ) 1 L Σ 1 ɛ (x i µ x ) The final estimate is obtained after replacing µ x, L and Σ ɛ by their respective estimates. Pedro Galeano (Course 2017/2018) Multivariate Statistics - Chapter 4 Master in Mathematical Engineering 39 / 61

40 Estimation of the loading matrix and the factor scores If we have used the scaled data, the estimated factor scores are given by: where y i are the scaled observations. f i = ( M Σ 1 ε M ) 1 M Σ 1 ε y i Note that L and M are replaced with L and M if we have used the varimax rotation. Pedro Galeano (Course 2017/2018) Multivariate Statistics - Chapter 4 Master in Mathematical Engineering 40 / 61

41 Illustrative example (I) The next three slides show scatterplots of the three factors estimated with the previous methods. For a better comparison the sign of the second factor for the principal factor analysis method has been changed. The three plots appear to confirm the interpretation of the factors given before. Note that Alaska can be seen as a kind of outlier. Pedro Galeano (Course 2017/2018) Multivariate Statistics - Chapter 4 Master in Mathematical Engineering 41 / 61

42 Illustrative example (I) Scores with first method Second factor South Carolina Mississippi West Virginia Arkansas Kentucky Louisiana North Carolina AlabamaGeorgia Tennessee New Mexico Rhode Island Delaware New Jersey New Hampshire Vermont Maine Maryland North Dakota Oklahoma South Pennsylvania Indiana Dakota Virginia Ohio Missouri Idaho Florida New York Michigan Arizona Illinois Montana Texas Wyoming Nevada Alaska Connecticut Massachusetts Hawaii Wisconsin Iowa Nebraska Minnesota Kansas Utah Oregon Colorado Washington Californi First factor Scores with second method Rhode Island Second factor South Carolina Mississippi West Virginia Kentucky Arkansas Louisiana North Carolina Alabama Georgia Tennessee New Mexico Delaware New Jersey New Hampshire Vermont Maine Maryland North Dakota Oklahoma Pennsylvania Virginia South Dakota Indiana Ohio Missouri Idaho New Florida York Michigan Illinois Arizona TexasMontana Wyoming Nevada Alaska Connecticut Massachusetts Hawaii Wisconsin Iowa Nebraska Minnesota KansasOregon Utah Colorado Washington Californi First factor Pedro Galeano (Course 2017/2018) Multivariate Statistics - Chapter 4 Master in Mathematical Engineering 42 / 61

43 Illustrative example (I) Scores with first method Californi Third factor Mississippi South Carolina Louisiana Alabama Georgia North Carolina Arkansas Kentucky West Virginia Tennessee New Mexico Texas Florida New York Virginia Arizona Illinois Maryland Michigan New Jersey Pennsylvania Ohio Missouri Oklahoma Indiana Rhode Island Delaware Alaska Idaho New Hampshire Nevada North Dakota MaineVermont Montana WyomingSouth Dakota Hawaii Massachusetts Connecticut Oregon Wisconsin Colorado Kansas Iowa Minnesota Nebraska Utah Washington First factor Scores with second method Californi Third factor Mississippi South Carolina Alabama Louisiana Georgia North Carolina Kentucky Arkansas West Virginia Tennessee New Mexico Texas New Florida York Arizona Virginia Maryland Michigan Illinois New Jersey Pennsylvania Ohio Missouri Oklahoma Indiana Rhode Island Delaware Alaska Idaho New Hampshire North Dakota Maine Vermont Nevada Montana Wyoming South Dakota Hawaii Massachusetts Connecticut Oregon Wisconsin Kansas Colorado Iowa Minnesota Nebraska Utah Washington First factor Pedro Galeano (Course 2017/2018) Multivariate Statistics - Chapter 4 Master in Mathematical Engineering 43 / 61

44 Illustrative example (I) Scores with first method California Third factor Alaska Nevada Texas New Florida York Hawaii Alabama Louisiana Arizona Georgia Michigan Illinois Virginia New Jersey Washington Maryland Massachusetts North Ohio Tennessee Carolina Pennsylvania Connecticut Missouri Mississippi Oregon South Indiana Arkansas Kentucky Carolina Oklahoma Colorado Kansas Wisconsin Minnesota New Mexico Iowa West Virginia Delaware Utah Nebraska Idaho North Dakota New Hampshire Montana Maine Vermont Wyoming South Dakota Rhode Is Second factor Scores with second method California Third factor Alaska Nevada Texas New Florida York Hawaii Arizona Alabama Louisiana Michigan Illinois Georgia Washington Virginia New Jersey Maryland Massachusetts Ohio North Tennessee Connecticut Oregon Carolina Pennsylvania Missouri Mississippi Oklahoma Indiana South Kentucky Arkansas Carolina Kansas Colorado New Mexico Minnesota Wisconsin Iowa Delaware Utah Nebraska West Virginia Idaho North Dakota New Hampshire Montana Maine Vermont Wyoming South Dakota Rhode Is Second factor Pedro Galeano (Course 2017/2018) Multivariate Statistics - Chapter 4 Master in Mathematical Engineering 44 / 61

45 Estimation of the loading matrix and the factor scores Principal component factor analysis and principal factor analysis are non-parametric procedures and can therefore be applied to data without requiring knowledge of the underlying distribution of the data. If we know that the data are Gaussian or not very different from Gaussian, exploiting this extra knowledge may lead to better estimators for the loading matrix and the factor scores. Now, we consider the case in which x and ɛ are Gaussian distributed and use maximum likelihood to estimate the model parameters. Pedro Galeano (Course 2017/2018) Multivariate Statistics - Chapter 4 Master in Mathematical Engineering 45 / 61

46 Estimation of the loading matrix and the factor scores Given a data matrix generated from the Gaussian factor model, the log-likelihood function of the parameters of the model is given by: l (µ x, Σ x X ) = np 2 log 2π n 2 log Σ x 1 2 n i=1 (x i µ x ) Σ 1 x (x i µ x ) where Σ x = LL + Σ ɛ. As seen in Chapter 2, the MLE of µ x is given by µ x = x. Replacing this quantity in l (µ x, Σ x X ), leads to: l (Σ x X, µ x = x) = np 2 log 2π n 2 log Σ x n 1 2 Tr [ Σ 1 x S x ] Now, we replace Σ x = LL + Σ ɛ leading to: l (L, Σ ɛ X, µ x = x) = np 2 log 2π n 2 log LL + Σ ɛ n 1 2 Tr [ (LL + Σ ɛ ) 1 S x ] Pedro Galeano (Course 2017/2018) Multivariate Statistics - Chapter 4 Master in Mathematical Engineering 46 / 61

47 Estimation of the loading matrix and the factor scores There are not explicit form for the MLE of L and Σ ɛ, denoted by L and Σ ɛ, respectively, unless some restrictions of the form of these matrices are imposed. Then, optimization methods are required to obtain the MLEs numerically. In any case, the MLE is invariant under linear transformations of the variables. Therefore, the solutions obtained using the original or the scaled variables are equivalent. Indeed, the MLE, as implemented in R, provides with the solution for the scaled data. Pedro Galeano (Course 2017/2018) Multivariate Statistics - Chapter 4 Master in Mathematical Engineering 47 / 61

48 Estimation of the loading matrix and the factor scores In order to determine the number of factors with MLE (assuming Gaussianity), we can use the likelihood ratio test for the following hypothesis: H 0 : the number of factors is r H 1 : the number of factors is not r The likelihood ratio test for these hypothesis is given by: L L + Σ ( ɛ ( L L λ = n log np + (n 1) Tr + Σ ) ) 1 ɛ Sx Σ x where Σ x is the MLE of Σ x and S x is the sample covariance matrix. Pedro Galeano (Course 2017/2018) Multivariate Statistics - Chapter 4 Master in Mathematical Engineering 48 / 61

49 Estimation of the loading matrix and the factor scores The ( test statistic λ under ) the null hypothesis H 0 has a χ 2 distribution with (p r) 2 (p + r) degrees of freedom. 1 2 The idea is to apply sequentially the LR test, i.e., start with r = 1 and, if the test is rejected, then consider the case of r = 2, and so on. Note however that the maximum number of factors that we can consider values of r that verifies (p r) 2 (p + r) > 0. Thus, there exists a maximum value of r that can be used with MLE. Pedro Galeano (Course 2017/2018) Multivariate Statistics - Chapter 4 Master in Mathematical Engineering 49 / 61

50 Estimation of the loading matrix and the factor scores The regression factor scores method is used usually with MLE to estimate the factor scores. The method assumes that the factors are random variables, and to look for a linear predictor that minimizes the mean squared prediction error. The pair (f i, x i ) has a multivariate Gaussian distribution. Therefore, it is possible to show that the linear predictor that minimizes the mean squared prediction error is just: E [f i x i ] = ( I r + L Σ 1 ɛ L ) 1 L Σ 1 ɛ (x i µ x ) The final estimate is obtained after replacing µ x, L and Σ ɛ by their respective ML estimates. Pedro Galeano (Course 2017/2018) Multivariate Statistics - Chapter 4 Master in Mathematical Engineering 50 / 61

51 Illustrative example (I) Consider the eight univariate variables measured on the 50 states of the USA. Note that this dataset is rarely Gaussian distributed. However, we apply the MLE approach to estimate the factor model. We estimate the factor model using MLE. The values of the LRT statistic λ are 89.63, 46.53, and 7.39, respectively, with associated p-values , , and , respectively. Therefore, four factors are adequate using the LRT. However, note that the data are highly non-gaussian, so these p-values should be taken with caution. Pedro Galeano (Course 2017/2018) Multivariate Statistics - Chapter 4 Master in Mathematical Engineering 51 / 61

52 Illustrative example (I) The estimated loading matrix using the scaled data and using the varimax rotation is given by: M = The estimated uniquenesses are , 0.538, 0.258, 0.255, , , and Therefore, the variables better explained by the factors are log-population, murder, HS graduates and frost. Pedro Galeano (Course 2017/2018) Multivariate Statistics - Chapter 4 Master in Mathematical Engineering 52 / 61

53 Illustrative example (I) The first factor distinguishes between states with long-lived and rich and educated populations from violent states with ill-educated people. The second factor distinguishes between violent states from long-lived states. The third factor distinguishes between cold states with educated people from warm states with ill-educated people. The fourth factor distinguishes large-populated states from less-populated states. The following slides show the estimated factor scores. Pedro Galeano (Course 2017/2018) Multivariate Statistics - Chapter 4 Master in Mathematical Engineering 53 / 61

54 Illustrative example (I) Scores with MLE Nevada Alaska Second factor Alabama Georgia Mississippi Louisiana South Carolina KentuckyNorth Carolina Tennessee Arkansas West Virginia North Dakota Michigan New Mexico Illinois New York Texas Wyoming Missouri Virginia Maryland Florida Vermont Ohio Indiana Montana Delaware Idaho Arizona Pennsylvania Oklahoma New Hampshire New Jersey Kansas Hawaii Maine Nebraska Wisconsin Connecticut Minnesota South Dakota Iowa Massachusetts Oregon Colorado Washington Utah Californi Rhode Island First factor Pedro Galeano (Course 2017/2018) Multivariate Statistics - Chapter 4 Master in Mathematical Engineering 54 / 61

55 Illustrative example (I) Scores with MLE North Dakota Nevada Third factor Kentucky Rhode Island West Virginia North Carolina Tennessee South Carolina Georgia Arkansas Mississippi Alabama Louisiana South Dakota New Hampshire Minnesota Maine Vermont Wisconsin Illinois Wyoming Pennsylvania Michigan Ohio Montana Connecticut Indiana Iowa Missouri New Jersey Nebraska New Mexico Maryland Idaho New York Kansas Virginia Massachusetts Delaware Oklahoma Texas Oregon Florida Arizona Colorado Alaska Utah Washington Californi Hawaii First factor Pedro Galeano (Course 2017/2018) Multivariate Statistics - Chapter 4 Master in Mathematical Engineering 55 / 61

56 Illustrative example (I) Scores with MLE Pennsylvania New York Californi Ohio Illinois Texas New Jersey Michigan Fourth factor North Carolina Georgia Tennessee Kentucky South Carolina Louisiana Alabama West Virginia Arkansas Mississippi Rhode Island North Dakota Missouri Virginia Oklahoma Maine South Dakota Delaware Wisconsin Indiana Minnesota Florida Maryland Connecticut Iowa Nebraska New Hampshire New Mexico Montana Idaho Vermont Massachusetts Kansas Oregon Arizona Hawaii Wyoming Colorado Washington Utah Nevada Alaska First factor Pedro Galeano (Course 2017/2018) Multivariate Statistics - Chapter 4 Master in Mathematical Engineering 56 / 61

57 Illustrative example (I) Scores with MLE North Dakota Nevada Third factor Rhode Island South Minnesota Dakota Wisconsin Maine Connecticut Iowa Nebraska Massachusetts Oregon Washington New Hampshire Colorado Vermont Wyoming Illinois Pennsylvania Montana Ohio Indiana New Jersey Missouri Utah Kentucky New Mexico Idaho Maryland West Virginia Kansas North Carolina New York Virginia Tennessee Delaware South Carolina Oklahoma Arkansas Mississippi Texas California Louisiana Florida Arizona Michigan Georgia Alabama Alaska Hawaii Second factor Pedro Galeano (Course 2017/2018) Multivariate Statistics - Chapter 4 Master in Mathematical Engineering 57 / 61

58 Illustrative example (I) Scores with MLE Pennsylvania California New York Ohio Illinois New Jersey Texas Michigan Fourth factor Massachusetts Wisconsin Minnesota Connecticut Iowa Washington Oregon Nebraska Maine Rhode Island North South Dakota Dakota Hawaii Indiana Florida Missouri North Carolina Virginia Maryland Tennessee Kentucky Colorado Kansas Oklahoma South Carolina Louisiana West Virginia Arkansas Mississippi Arizona Utah New Hampshire New Mexico Idaho Montana Vermont Delaware Wyoming Georgia Alabama Alaska Nevada Second factor Pedro Galeano (Course 2017/2018) Multivariate Statistics - Chapter 4 Master in Mathematical Engineering 58 / 61

59 Illustrative example (I) Scores with MLE California New York Ohio Pennsylvania Illinois Texas New Jersey Michigan Fourth factor Hawaii Arizona Florida Washington Louisiana Oregon Alabama Massachusetts Wisconsin Indiana Minnesota North Carolina Missouri Virginia Maryland Connecticut Georgia Iowa Tennessee Kentucky Colorado Oklahoma Kansas South Carolina Nebraska West Virginia Mississippi Arkansas Maine Utah Rhode Island New Hampshire New Mexico South Dakota North Dakota Idaho Montana Nevada Vermont Delaware Wyoming Alaska Third factor Pedro Galeano (Course 2017/2018) Multivariate Statistics - Chapter 4 Master in Mathematical Engineering 59 / 61

60 Chapter outline We are ready now for: Chapter 5: Multidimensional scaling Pedro Galeano (Course 2017/2018) Multivariate Statistics - Chapter 4 Master in Mathematical Engineering 60 / 61

61 1 Introduction 2 The factor model 3 Estimation of the loading matrix and the factor scores Pedro Galeano (Course 2017/2018) Multivariate Statistics - Chapter 4 Master in Mathematical Engineering 61 / 61

Multivariate Statistics

Multivariate Statistics Multivariate Statistics Chapter 3: Principal Component Analysis Pedro Galeano Departamento de Estadística Universidad Carlos III de Madrid pedro.galeano@uc3m.es Course 2017/2018 Master in Mathematical

More information

Multivariate Statistics

Multivariate Statistics Multivariate Statistics Chapter 6: Cluster Analysis Pedro Galeano Departamento de Estadística Universidad Carlos III de Madrid pedro.galeano@uc3m.es Course 2017/2018 Master in Mathematical Engineering

More information

Multivariate Analysis

Multivariate Analysis Multivariate Analysis Chapter 5: Cluster analysis Pedro Galeano Departamento de Estadística Universidad Carlos III de Madrid pedro.galeano@uc3m.es Course 2015/2016 Master in Business Administration and

More information

New Educators Campaign Weekly Report

New Educators Campaign Weekly Report Campaign Weekly Report Conversations and 9/24/2017 Leader Forms Emails Collected Text Opt-ins Digital Journey 14,661 5,289 4,458 7,124 317 13,699 1,871 2,124 Pro 13,924 5,175 4,345 6,726 294 13,086 1,767

More information

Standard Indicator That s the Latitude! Students will use latitude and longitude to locate places in Indiana and other parts of the world.

Standard Indicator That s the Latitude! Students will use latitude and longitude to locate places in Indiana and other parts of the world. Standard Indicator 4.3.1 That s the Latitude! Purpose Students will use latitude and longitude to locate places in Indiana and other parts of the world. Materials For the teacher: graph paper, globe showing

More information

Jakarta International School 6 th Grade Formative Assessment Graphing and Statistics -Black

Jakarta International School 6 th Grade Formative Assessment Graphing and Statistics -Black Jakarta International School 6 th Grade Formative Assessment Graphing and Statistics -Black Name: Date: Score : 42 Data collection, presentation and application Frequency tables. (Answer question 1 on

More information

Lecture 5: Ecological distance metrics; Principal Coordinates Analysis. Univariate testing vs. community analysis

Lecture 5: Ecological distance metrics; Principal Coordinates Analysis. Univariate testing vs. community analysis Lecture 5: Ecological distance metrics; Principal Coordinates Analysis Univariate testing vs. community analysis Univariate testing deals with hypotheses concerning individual taxa Is this taxon differentially

More information

Challenge 1: Learning About the Physical Geography of Canada and the United States

Challenge 1: Learning About the Physical Geography of Canada and the United States 60ºN S T U D E N T H A N D O U T Challenge 1: Learning About the Physical Geography of Canada and the United States 170ºE 10ºW 180º 20ºW 60ºN 30ºW 1 40ºW 160ºW 50ºW 150ºW 60ºW 140ºW N W S E 0 500 1,000

More information

Lecture 5: Ecological distance metrics; Principal Coordinates Analysis. Univariate testing vs. community analysis

Lecture 5: Ecological distance metrics; Principal Coordinates Analysis. Univariate testing vs. community analysis Lecture 5: Ecological distance metrics; Principal Coordinates Analysis Univariate testing vs. community analysis Univariate testing deals with hypotheses concerning individual taxa Is this taxon differentially

More information

, District of Columbia

, District of Columbia State Capitals These are the State Seals of each state. Fill in the blank with the name of each states capital city. (Hint: You may find it helpful to do the word search first to refresh your memory.),

More information

Correction to Spatial and temporal distributions of U.S. winds and wind power at 80 m derived from measurements

Correction to Spatial and temporal distributions of U.S. winds and wind power at 80 m derived from measurements JOURNAL OF GEOPHYSICAL RESEARCH, VOL. 109,, doi:10.1029/2004jd005099, 2004 Correction to Spatial and temporal distributions of U.S. winds and wind power at 80 m derived from measurements Cristina L. Archer

More information

Cooperative Program Allocation Budget Receipts Southern Baptist Convention Executive Committee May 2018

Cooperative Program Allocation Budget Receipts Southern Baptist Convention Executive Committee May 2018 Cooperative Program Allocation Budget Receipts May 2018 Cooperative Program Allocation Budget Current Current $ Change % Change Month Month from from Contribution Sources 2017-2018 2016-2017 Prior Year

More information

Cooperative Program Allocation Budget Receipts Southern Baptist Convention Executive Committee October 2017

Cooperative Program Allocation Budget Receipts Southern Baptist Convention Executive Committee October 2017 Cooperative Program Allocation Budget Receipts October 2017 Cooperative Program Allocation Budget Current Current $ Change % Change Month Month from from Contribution Sources 2017-2018 2016-2017 Prior

More information

Cooperative Program Allocation Budget Receipts Southern Baptist Convention Executive Committee October 2018

Cooperative Program Allocation Budget Receipts Southern Baptist Convention Executive Committee October 2018 Cooperative Program Allocation Budget Receipts October 2018 Cooperative Program Allocation Budget Current Current $ Change % Change Month Month from from Contribution Sources 2018-2019 2017-2018 Prior

More information

Hourly Precipitation Data Documentation (text and csv version) February 2016

Hourly Precipitation Data Documentation (text and csv version) February 2016 I. Description Hourly Precipitation Data Documentation (text and csv version) February 2016 Hourly Precipitation Data (labeled Precipitation Hourly in Climate Data Online system) is a database that gives

More information

Club Convergence and Clustering of U.S. State-Level CO 2 Emissions

Club Convergence and Clustering of U.S. State-Level CO 2 Emissions Methodological Club Convergence and Clustering of U.S. State-Level CO 2 Emissions J. Wesley Burnett Division of Resource Management West Virginia University Wednesday, August 31, 2013 Outline Motivation

More information

Preview: Making a Mental Map of the Region

Preview: Making a Mental Map of the Region Preview: Making a Mental Map of the Region Draw an outline map of Canada and the United States on the next page or on a separate sheet of paper. Add a compass rose to your map, showing where north, south,

More information

Abortion Facilities Target College Students

Abortion Facilities Target College Students Target College Students By Kristan Hawkins Executive Director, Students for Life America Ashleigh Weaver Researcher Abstract In the Fall 2011, Life Dynamics released a study entitled, Racial Targeting

More information

Additional VEX Worlds 2019 Spot Allocations

Additional VEX Worlds 2019 Spot Allocations Overview VEX Worlds 2019 Spot s Qualifying spots for the VEX Robotics World Championship are calculated twice per year. On the following table, the number in the column is based on the number of teams

More information

Outline. Administrivia and Introduction Course Structure Syllabus Introduction to Data Mining

Outline. Administrivia and Introduction Course Structure Syllabus Introduction to Data Mining Outline Administrivia and Introduction Course Structure Syllabus Introduction to Data Mining Dimensionality Reduction Introduction Principal Components Analysis Singular Value Decomposition Multidimensional

More information

A. Geography Students know the location of places, geographic features, and patterns of the environment.

A. Geography Students know the location of places, geographic features, and patterns of the environment. Learning Targets Elementary Social Studies Grade 5 2014-2015 A. Geography Students know the location of places, geographic features, and patterns of the environment. A.5.1. A.5.2. A.5.3. A.5.4. Label North

More information

Last time: PCA. Statistical Data Mining and Machine Learning Hilary Term Singular Value Decomposition (SVD) Eigendecomposition and PCA

Last time: PCA. Statistical Data Mining and Machine Learning Hilary Term Singular Value Decomposition (SVD) Eigendecomposition and PCA Last time: PCA Statistical Data Mining and Machine Learning Hilary Term 2016 Dino Sejdinovic Department of Statistics Oxford Slides and other materials available at: http://www.stats.ox.ac.uk/~sejdinov/sdmml

More information

QF (Build 1010) Widget Publishing, Inc Page: 1 Batch: 98 Test Mode VAC Publisher's Statement 03/15/16, 10:20:02 Circulation by Issue

QF (Build 1010) Widget Publishing, Inc Page: 1 Batch: 98 Test Mode VAC Publisher's Statement 03/15/16, 10:20:02 Circulation by Issue QF 1.100 (Build 1010) Widget Publishing, Inc Page: 1 Circulation by Issue Qualified Non-Paid Circulation Qualified Paid Circulation Individual Assoc. Total Assoc. Total Total Requester Group Qualified

More information

Printable Activity book

Printable Activity book Printable Activity book 16 Pages of Activities Printable Activity Book Print it Take it Keep them busy Print them out Laminate them or Put them in page protectors Put them in a binder Bring along a dry

More information

Intercity Bus Stop Analysis

Intercity Bus Stop Analysis by Karalyn Clouser, Research Associate and David Kack, Director of the Small Urban and Rural Livability Center Western Transportation Institute College of Engineering Montana State University Report prepared

More information

2005 Mortgage Broker Regulation Matrix

2005 Mortgage Broker Regulation Matrix 2005 Mortgage Broker Regulation Matrix Notes on individual states follow the table REG EXEMPTIONS LIC-EDU LIC-EXP LIC-EXAM LIC-CONT-EDU NET WORTH BOND MAN-LIC MAN-EDU MAN-EXP MAN-EXAM Alabama 1 0 2 0 0

More information

Online Appendix: Can Easing Concealed Carry Deter Crime?

Online Appendix: Can Easing Concealed Carry Deter Crime? Online Appendix: Can Easing Concealed Carry Deter Crime? David Fortunato University of California, Merced dfortunato@ucmerced.edu Regulations included in institutional context measure As noted in the main

More information

Parametric Test. Multiple Linear Regression Spatial Application I: State Homicide Rates Equations taken from Zar, 1984.

Parametric Test. Multiple Linear Regression Spatial Application I: State Homicide Rates Equations taken from Zar, 1984. Multiple Linear Regression Spatial Application I: State Homicide Rates Equations taken from Zar, 984. y ˆ = a + b x + b 2 x 2K + b n x n where n is the number of variables Example: In an earlier bivariate

More information

Chapter. Organizing and Summarizing Data. Copyright 2013, 2010 and 2007 Pearson Education, Inc.

Chapter. Organizing and Summarizing Data. Copyright 2013, 2010 and 2007 Pearson Education, Inc. Chapter 2 Organizing and Summarizing Data Section 2.1 Organizing Qualitative Data Objectives 1. Organize Qualitative Data in Tables 2. Construct Bar Graphs 3. Construct Pie Charts When data is collected

More information

What Lies Beneath: A Sub- National Look at Okun s Law for the United States.

What Lies Beneath: A Sub- National Look at Okun s Law for the United States. What Lies Beneath: A Sub- National Look at Okun s Law for the United States. Nathalie Gonzalez Prieto International Monetary Fund Global Labor Markets Workshop Paris, September 1-2, 2016 What the paper

More information

Office of Special Education Projects State Contacts List - Part B and Part C

Office of Special Education Projects State Contacts List - Part B and Part C Office of Special Education Projects State Contacts List - Part B and Part C Source: http://www.ed.gov/policy/speced/guid/idea/monitor/state-contactlist.html Alabama Customer Specialist: Jill Harris 202-245-7372

More information

North American Geography. Lesson 2: My Country tis of Thee

North American Geography. Lesson 2: My Country tis of Thee North American Geography Lesson 2: My Country tis of Thee Unit Overview: As students work through the activities in this unit they will be introduced to the United States in general, different regions

More information

RELATIONSHIPS BETWEEN THE AMERICAN BROWN BEAR POPULATION AND THE BIGFOOT PHENOMENON

RELATIONSHIPS BETWEEN THE AMERICAN BROWN BEAR POPULATION AND THE BIGFOOT PHENOMENON RELATIONSHIPS BETWEEN THE AMERICAN BROWN BEAR POPULATION AND THE BIGFOOT PHENOMENON ETHAN A. BLIGHT Blight Investigations, Gainesville, FL ABSTRACT Misidentification of the American brown bear (Ursus arctos,

More information

Summary of Natural Hazard Statistics for 2008 in the United States

Summary of Natural Hazard Statistics for 2008 in the United States Summary of Natural Hazard Statistics for 2008 in the United States This National Weather Service (NWS) report summarizes fatalities, injuries and damages caused by severe weather in 2008. The NWS Office

More information

Multivariate Classification Methods: The Prevalence of Sexually Transmitted Diseases

Multivariate Classification Methods: The Prevalence of Sexually Transmitted Diseases Multivariate Classification Methods: The Prevalence of Sexually Transmitted Diseases Summer Undergraduate Mathematical Sciences Research Institute (SUMSRI) Lindsay Kellam, Queens College kellaml@queens.edu

More information

Osteopathic Medical Colleges

Osteopathic Medical Colleges Osteopathic Medical Colleges Matriculants by U.S. States and Territories Entering Class 0 Prepared by the Research Department American Association of Colleges of Osteopathic Medicine Copyright 0, AAM All

More information

SUPPLEMENTAL NUTRITION ASSISTANCE PROGRAM QUALITY CONTROL ANNUAL REPORT FISCAL YEAR 2008

SUPPLEMENTAL NUTRITION ASSISTANCE PROGRAM QUALITY CONTROL ANNUAL REPORT FISCAL YEAR 2008 SUPPLEMENTAL NUTRITION ASSISTANCE PROGRAM QUALITY CONTROL ANNUAL REPORT FISCAL YEAR 2008 U.S. DEPARTMENT OF AGRICULTURE FOOD AND NUTRITION SERVICE PROGRAM ACCOUNTABILITY AND ADMINISTRATION DIVISION QUALITY

More information

Alpine Funds 2017 Tax Guide

Alpine Funds 2017 Tax Guide Alpine s 2017 Guide Alpine Dynamic Dividend ADVDX 1/30/17 1/31/17 1/31/17 0.020000000 0.019248130 0.000000000 0.00000000 0.019248130 0.013842273 0.000000000 0.000000000 0.000751870 0.000000000 0.00 0.00

More information

Alpine Funds 2016 Tax Guide

Alpine Funds 2016 Tax Guide Alpine s 2016 Guide Alpine Dynamic Dividend ADVDX 01/28/2016 01/29/2016 01/29/2016 0.020000000 0.017621842 0.000000000 0.00000000 0.017621842 0.013359130 0.000000000 0.000000000 0.002378158 0.000000000

More information

JAN/FEB MAR/APR MAY/JUN

JAN/FEB MAR/APR MAY/JUN QF 1.100 (Build 1010) Widget Publishing, Inc Page: 1 Circulation Breakdown by Issue Qualified Non-Paid Qualified Paid Previous This Previous This Total Total issue Removals Additions issue issue Removals

More information

Crop Progress. Corn Mature Selected States [These 18 States planted 92% of the 2017 corn acreage]

Crop Progress. Corn Mature Selected States [These 18 States planted 92% of the 2017 corn acreage] Crop Progress ISSN: 00 Released October, 0, by the National Agricultural Statistics Service (NASS), Agricultural Statistics Board, United s Department of Agriculture (USDA). Corn Mature Selected s [These

More information

High School World History Cycle 2 Week 2 Lifework

High School World History Cycle 2 Week 2 Lifework Name: Advisory: Period: High School World History Cycle 2 Week 2 Lifework This packet is due Monday, November 7 Complete and turn in on Friday for 10 points of EXTRA CREDIT! Lifework Assignment Complete

More information

extreme weather, climate & preparedness in the american mind

extreme weather, climate & preparedness in the american mind extreme weather, climate & preparedness in the american mind Extreme Weather, Climate & Preparedness In the American Mind Interview dates: March 12, 2012 March 30, 2012. Interviews: 1,008 Adults (18+)

More information

Meteorology 110. Lab 1. Geography and Map Skills

Meteorology 110. Lab 1. Geography and Map Skills Meteorology 110 Name Lab 1 Geography and Map Skills 1. Geography Weather involves maps. There s no getting around it. You must know where places are so when they are mentioned in the course it won t be

More information

LABORATORY REPORT. If you have any questions concerning this report, please do not hesitate to call us at (800) or (574)

LABORATORY REPORT. If you have any questions concerning this report, please do not hesitate to call us at (800) or (574) LABORATORY REPORT If you have any questions concerning this report, please do not hesitate to call us at (800) 332-4345 or (574) 233-4777. This report may not be reproduced, except in full, without written

More information

LABORATORY REPORT. If you have any questions concerning this report, please do not hesitate to call us at (800) or (574)

LABORATORY REPORT. If you have any questions concerning this report, please do not hesitate to call us at (800) or (574) LABORATORY REPORT If you have any questions concerning this report, please do not hesitate to call us at (800) 332-4345 or (574) 233-4777. This report may not be reproduced, except in full, without written

More information

Non-iterative, regression-based estimation of haplotype associations

Non-iterative, regression-based estimation of haplotype associations Non-iterative, regression-based estimation of haplotype associations Benjamin French, PhD Department of Biostatistics and Epidemiology University of Pennsylvania bcfrench@upenn.edu National Cancer Center

More information

Grand Total Baccalaureate Post-Baccalaureate Masters Doctorate Professional Post-Professional

Grand Total Baccalaureate Post-Baccalaureate Masters Doctorate Professional Post-Professional s by Location of Permanent Home Address and Degree Level Louisiana Acadia 19 13 0 3 0 3 0 0 0 Allen 5 5 0 0 0 0 0 0 0 Ascension 307 269 2 28 1 6 0 1 0 Assumption 14 12 0 1 0 1 0 0 0 Avoyelles 6 4 0 1 0

More information

MINERALS THROUGH GEOGRAPHY

MINERALS THROUGH GEOGRAPHY MINERALS THROUGH GEOGRAPHY INTRODUCTION Minerals are related to rock type, not political definition of place. So, the minerals are to be found in a variety of locations that doesn t depend on population

More information

Grand Total Baccalaureate Post-Baccalaureate Masters Doctorate Professional Post-Professional

Grand Total Baccalaureate Post-Baccalaureate Masters Doctorate Professional Post-Professional s by Location of Permanent Home Address and Degree Level Louisiana Acadia 26 19 0 6 1 0 0 0 0 Allen 7 7 0 0 0 0 0 0 0 Ascension 275 241 3 23 1 6 0 1 0 Assumption 13 12 0 1 0 0 0 0 0 Avoyelles 15 11 0 3

More information

BlackRock Core Bond Trust (BHK) BlackRock Enhanced International Dividend Trust (BGY) 2 BlackRock Defined Opportunity Credit Trust (BHL) 3

BlackRock Core Bond Trust (BHK) BlackRock Enhanced International Dividend Trust (BGY) 2 BlackRock Defined Opportunity Credit Trust (BHL) 3 MUNICIPAL FUNDS Arizona (MZA) California Municipal Income Trust (BFZ) California Municipal 08 Term Trust (BJZ) California Quality (MCA) California Quality (MUC) California (MYC) Florida Municipal 00 Term

More information

OUT-OF-STATE 965 SUBTOTAL OUT-OF-STATE U.S. TERRITORIES FOREIGN COUNTRIES UNKNOWN GRAND TOTAL

OUT-OF-STATE 965 SUBTOTAL OUT-OF-STATE U.S. TERRITORIES FOREIGN COUNTRIES UNKNOWN GRAND TOTAL Report ID: USSR8072-V3 Page No. 1 Jurisdiction: ON-CAMPUS IL Southern Illinois University - Carb 1 0 0 0 Black Hawk College Quad-Cities 0 0 1 0 John A Logan College 1 0 0 0 Rend Lake College 1 0 0 0 Aurora

More information

An Analysis of Regional Income Variation in the United States:

An Analysis of Regional Income Variation in the United States: Modern Economy, 2017, 8, 232-248 http://www.scirp.org/journal/me ISSN Online: 2152-7261 ISSN Print: 2152-7245 An Analysis of Regional Income Variation in the United States: 1969-2013 Orley M. Amos Jr.

More information

Office of Budget & Planning 311 Thomas Boyd Hall Baton Rouge, LA Telephone 225/ Fax 225/

Office of Budget & Planning 311 Thomas Boyd Hall Baton Rouge, LA Telephone 225/ Fax 225/ Louisiana Acadia 20 17 3 0 0 0 Allen 2 2 0 0 0 0 Ascension 226 185 37 2 1 1 Assumption 16 15 1 0 0 0 Avoyelles 20 19 1 0 0 0 Beauregard 16 11 4 0 0 1 Bienville 2 2 0 0 0 0 Bossier 22 18 4 0 0 0 Caddo 91

More information

Insurance Department Resources Report Volume 1

Insurance Department Resources Report Volume 1 2014 Insurance Department Resources Report Volume 1 201 Insurance Department Resources Report Volume One 201 The NAIC is the authoritative source for insurance industry information. Our expert solutions

More information

MINERALS THROUGH GEOGRAPHY. General Standard. Grade level K , resources, and environmen t

MINERALS THROUGH GEOGRAPHY. General Standard. Grade level K , resources, and environmen t Minerals through Geography 1 STANDARDS MINERALS THROUGH GEOGRAPHY See summary of National Science Education s. Original: http://books.nap.edu/readingroom/books/nses/ Concept General Specific General Specific

More information

GIS use in Public Health 1

GIS use in Public Health 1 Geographic Information Systems (GIS) use in Public Health Douglas Morales, MPH Epidemiologist/GIS Coordinator Office of Health Assessment and Epidemiology Epidemiology Unit Objectives Define GIS and justify

More information

JAN/FEB MAR/APR MAY/JUN

JAN/FEB MAR/APR MAY/JUN QF 1.100 (Build 1010) Widget Publishing, Inc Page: 1 Circulation Breakdown by Issue Analyzed Nonpaid and Verified Paid Previous This Previous This Total Total issue Removals Additions issue issue Removals

More information

Chapter 5 Linear least squares regression

Chapter 5 Linear least squares regression Chapter 5 Linear least squares regression Consider first simple linear regression. For now, the model of interest is only the model of the conditional expectations; the distribution of the residuals is

More information

Pima Community College Students who Enrolled at Top 200 Ranked Universities

Pima Community College Students who Enrolled at Top 200 Ranked Universities Pima Community College Students who Enrolled at Top 200 Ranked Universities Institutional Research, Planning and Effectiveness Project #20170814-MH-60-CIR August 2017 Students who Attended Pima Community

More information

Infant Mortality: Cross Section study of the United State, with Emphasis on Education

Infant Mortality: Cross Section study of the United State, with Emphasis on Education Illinois State University ISU ReD: Research and edata Stevenson Center for Community and Economic Development Arts and Sciences Fall 12-15-2014 Infant Mortality: Cross Section study of the United State,

More information

National Organization of Life and Health Insurance Guaranty Associations

National Organization of Life and Health Insurance Guaranty Associations National Organization of and Health Insurance Guaranty Associations November 21, 2005 Dear Chief Executive Officer: Consistent with prior years, NOLHGA is providing the enclosed data regarding insolvency

More information

FLOOD/FLASH FLOOD. Lightning. Tornado

FLOOD/FLASH FLOOD. Lightning. Tornado 2004 Annual Summaries National Oceanic and Atmospheric Administration National Environmental Satellite Data Information Service National Climatic Data Center FLOOD/FLASH FLOOD Lightning Tornado Hurricane

More information

Rank University AMJ AMR ASQ JAP OBHDP OS PPSYCH SMJ SUM 1 University of Pennsylvania (T) Michigan State University

Rank University AMJ AMR ASQ JAP OBHDP OS PPSYCH SMJ SUM 1 University of Pennsylvania (T) Michigan State University Rank University AMJ AMR ASQ JAP OBHDP OS PPSYCH SMJ SUM 1 University of Pennsylvania 4 1 2 0 2 4 0 9 22 2(T) Michigan State University 2 0 0 9 1 0 0 4 16 University of Michigan 3 0 2 5 2 0 0 4 16 4 Harvard

More information

Chapter 4: Factor Analysis

Chapter 4: Factor Analysis Chapter 4: Factor Analysis In many studies, we may not be able to measure directly the variables of interest. We can merely collect data on other variables which may be related to the variables of interest.

More information

A Summary of State DOT GIS Activities

A Summary of State DOT GIS Activities A Summary of State DOT GIS Activities Prepared for the 2006 AASHTO GIS-T Symposium Columbus, OH Introduction This is the 11 th year that the GIS-T Symposium has conducted a survey of GIS activities at

More information

Package ZIM. R topics documented: August 29, Type Package. Title Statistical Models for Count Time Series with Excess Zeros. Version 1.

Package ZIM. R topics documented: August 29, Type Package. Title Statistical Models for Count Time Series with Excess Zeros. Version 1. Package ZIM August 29, 2013 Type Package Title Statistical Models for Count Time Series with Excess Zeros Version 1.0 Date 2013-06-15 Author Ming Yang, Gideon K. D. Zamba, and Joseph E. Cavanaugh Maintainer

More information

Statistical Methods for Data Mining

Statistical Methods for Data Mining Statistical Methods for Data Mining Kuangnan Fang Xiamen University Email: xmufkn@xmu.edu.cn Linear Model Selection and Regularization Recall the linear model Y = 0 + 1 X 1 + + p X p +. In the lectures

More information

Fungal conservation in the USA

Fungal conservation in the USA The following supplements accompany the article Fungal conservation in the USA Jessica L. Allen*, James C. Lendemer *Corresponding author: jlendemer@nybg.org Endangered Species Research 28: 33 42 (2015)

More information

KS PUBL 4YR Kansas State University Pittsburg State University SUBTOTAL-KS

KS PUBL 4YR Kansas State University Pittsburg State University SUBTOTAL-KS Report ID: USSR8072-V3 Page No. 1 Jurisdiction: ON-CAMPUS IL PUBL TCH DeVry University Addison 1 0 0 0 Eastern Illinois University 1 0 0 0 Illinois State University 0 0 2 0 Northern Illinois University

More information

Chapter 11 : State SAT scores for 1982 Data Listing

Chapter 11 : State SAT scores for 1982 Data Listing EXST3201 Chapter 12a Geaghan Fall 2005: Page 1 Chapter 12 : Variable selection An example: State SAT scores In 1982 there was concern for scores of the Scholastic Aptitude Test (SAT) scores that varied

More information

Stem-and-Leaf Displays

Stem-and-Leaf Displays 3.2 Displaying Numerical Data: Stem-and-Leaf Displays 107 casts in your area? (San Luis Obispo Tribune, June 15, 2005). The responses are summarized in the table below. Extremely 4% Very 27% Somewhat 53%

More information

United States Geography Unit 1

United States Geography Unit 1 United States Geography Unit 1 I WANT YOU TO STUDY YOUR GEORGAPHY Name: Period: Due Date: Geography Key Terms Absolute Location: Relative Location: Demographic Map: Population Density: Sun-Belt: Archipelago:

More information

112th U.S. Senate ACEC Scorecard

112th U.S. Senate ACEC Scorecard 112th U.S. Senate ACEC Scorecard HR 658 FAA Funding Alaska Alabama Arkansas Arizona California Colorado Connecticut S1 Lisa Murkowski R 100% Y Y Y Y Y Y Y Y Y S2 Mark Begich D 89% Y Y Y Y N Y Y Y Y S1

More information

A GUIDE TO THE CARTOGRAPHIC PRODUCTS OF

A GUIDE TO THE CARTOGRAPHIC PRODUCTS OF A GUIDE TO THE CARTOGRAPHIC PRODUCTS OF THE FEDERAL DEPOSITORY LIBRARY PROGRAM (FDLP) This guide was designed for use as a collection development tool by map selectors of depository libraries that participate

More information

Evidence for increasingly extreme and variable drought conditions in the contiguous United States between 1895 and 2012

Evidence for increasingly extreme and variable drought conditions in the contiguous United States between 1895 and 2012 Evidence for increasingly extreme and variable drought conditions in the contiguous United States between 1895 and 2012 Sierra Rayne a,, Kaya Forest b a Chemologica Research, 318 Rose Street, PO Box 74,

More information

Introducing North America

Introducing North America Introducing North America I. Quick Stats Includes U.S. & Canada U.S consists of 50 States Federal Government Democracy 4 th in world w/ land area 3 rd in population Economic leader of free world II. Major

More information

MO PUBL 4YR 2090 Missouri State University SUBTOTAL-MO

MO PUBL 4YR 2090 Missouri State University SUBTOTAL-MO Report ID: USSR8072-V3 Page No. 1 Jurisdiction: ON-CAMPUS IL American Intercontinental Universit 0 0 1 0 Northern Illinois University 0 0 4 0 Southern Illinois Univ - Edwardsvil 2 0 2 0 Southern Illinois

More information

Multivariate Statistics

Multivariate Statistics Multivariate Statistics Chapter 2: Multivariate distributions and inference Pedro Galeano Departamento de Estadística Universidad Carlos III de Madrid pedro.galeano@uc3m.es Course 2016/2017 Master in Mathematical

More information

All-Time Conference Standings

All-Time Conference Standings All-Time Conference Standings Pac 12 Conference Conf. Matches Sets Overall Matches Team W L Pct W L Pct. Score Opp Last 10 Streak Home Away Neutral W L Pct. Arizona 6 5.545 22 19.537 886 889 6-4 W5 4-2

More information

October 2016 v1 12/10/2015 Page 1 of 10

October 2016 v1 12/10/2015 Page 1 of 10 State Section S s Effective October 1, 2016 Overview The tables list the Section S items that will be active on records with a target date on or after October 1, 2016. The active on each item subset code

More information

Office of Budget & Planning 311 Thomas Boyd Hall Baton Rouge, LA Telephone 225/ Fax 225/

Office of Budget & Planning 311 Thomas Boyd Hall Baton Rouge, LA Telephone 225/ Fax 225/ Louisiana Acadia 25 19 4 2 0 0 Allen 8 7 1 0 0 0 Ascension 173 143 26 1 0 3 Assumption 14 12 2 0 0 0 Avoyelles 51 41 9 0 0 1 Beauregard 18 14 3 0 0 1 Bienville 5 0 4 0 1 0 Bossier 28 27 0 1 0 0 Caddo 95

More information

The Heterogeneous Effects of the Minimum Wage on Employment Across States

The Heterogeneous Effects of the Minimum Wage on Employment Across States The Heterogeneous Effects of the Minimum Wage on Employment Across States Wuyi Wang a, Peter C.B. Phillips b, Liangjun Su c a Institute for Economic and Social Research, Jinan University b Yale University,

More information

This project is supported by a National Crime Victims' Right Week Community Awareness Project subgrant awarded by the National Association of VOCA

This project is supported by a National Crime Victims' Right Week Community Awareness Project subgrant awarded by the National Association of VOCA TABLE OF CONTENTS ACTIVITY PAGES Coloring Pages Younger Children (6) Coloring Pages Older (2) Connect the Dots (1 English, 1 Spanish) Word Search 1 (1 English, 1 Spanish) Younger Children Word Search 2

More information

CCC-A Survey Summary Report: Number and Type of Responses

CCC-A Survey Summary Report: Number and Type of Responses CCC-A Survey Summary Report: Number and Type of Responses Suggested Citation: American Speech-Language-Hearing Association. (2011). 2011 Membership survey. CCC-A survey summary report: Number and type

More information

Division I Sears Directors' Cup Final Standings As of 6/20/2001

Division I Sears Directors' Cup Final Standings As of 6/20/2001 1 Stanford (Cal.) 1359 2 90 9 75 20 62.5 0 0 0 0 0 0 3 75 1 100 5 60 8 76 4 80 0 0 2 70 2 California-Los Angeles 1138 0 0 5 78.5 17 67 0 0 0 0 0 0 2 90 9 50 5 60 2 0 33 48.5 2 70 1 100 3 Georgia 890.5

More information

Green Building Criteria in Low-Income Housing Tax Credit Programs Analysis

Green Building Criteria in Low-Income Housing Tax Credit Programs Analysis Green Building Criteria in Low-Income Housing Tax Credit Programs 2010 Analysis www.globalgreen.org September 2010 Green Building Criteria in State Low-Income Housing Tax Credit Programs 2 Introduction

More information

PHYSICS BOWL - APRIL 22, 1998

PHYSICS BOWL - APRIL 22, 1998 AAPT/Metrologic High School Physics Contest PHYSICS BOWL - APRIL 22, 1998 40 QUESTIONS 45 MINUTES This contest is sponsored by the American Association of Physics Teachers (AAPT) and Metrologic Instruments

More information

Direct Selling Association 1

Direct Selling Association 1 Total Number of Firms 1,100 104 58 46 14 41 22 27 44 60 1 U.S. Sales Volume & Growth 1 Average Retail Sales per Firm ($thousands) 2015 $32,836 $210,572 $320,820 $69,556 $1,041 $11,788 $100,216 $721,897

More information

Physical Features of Canada and the United States

Physical Features of Canada and the United States I VIUAL Physical Features of Canada and the United tates 170 ARCTIC OCA Aleutian s 1 Bering ea ALAKA Yukon R. Mt. McKinley (20,320 ft. 6,194 m) Gulf of Alaska BROOK RAG RAG Queen Charlotte s R Vancouver

More information

State Section S Items Effective October 1, 2017

State Section S Items Effective October 1, 2017 State Section S Items Effective October 1, 2017 Overview The tables list the Section S items that will be active on records with a target date on or after October 1, 2017. The active item on each item

More information

Physical Features of Canada and the United States

Physical Features of Canada and the United States Physical Features of Canada and the United tates 170 ARCTIC OCA Aleutian s 1 1 Bering ea ALAKA Yukon R. Mt. McKinley (20,320 ft. 6,194 m) Gulf of Alaska BROOK RAG RAG Queen Charlotte s R Vancouver O C

More information

William Battye * EC/R Incorporated, Chapel Hill, NC. William Warren-Hicks, Ph.D. EcoStat, Inc., Mebane, NC

William Battye * EC/R Incorporated, Chapel Hill, NC. William Warren-Hicks, Ph.D. EcoStat, Inc., Mebane, NC MODULATING EMISSIONS FROM ELECTRIC GENERATING UNITS AS A FUNCTION OF METEOROLOGICAL VARIABLES William Battye * EC/R Incorporated, Chapel Hill, NC William Warren-Hicks, Ph.D. EcoStat, Inc., Mebane, NC Steve

More information

Summary of Terminal Master s Degree Programs in Philosophy

Summary of Terminal Master s Degree Programs in Philosophy Summary of Terminal Master s Degree Programs in Philosophy Faculty and Student Demographics All data collected by the ican Philosophical Association. The data in this publication have been provided by

More information

2011 NATIONAL SURVEY ON DRUG USE AND HEALTH

2011 NATIONAL SURVEY ON DRUG USE AND HEALTH 2011 NATIONAL SURVEY ON DRUG USE AND HEALTH PERSON-LEVEL SAMPLING WEIGHT CALIBRATION Prepared for the 2011 Methodological Resource Book Contract No. HHSS283200800004C RTI Project No. 0211838.207.004 Phase

More information

DOWNLOAD OR READ : USA PLANNING MAP PDF EBOOK EPUB MOBI

DOWNLOAD OR READ : USA PLANNING MAP PDF EBOOK EPUB MOBI DOWNLOAD OR READ : USA PLANNING MAP PDF EBOOK EPUB MOBI Page 1 Page 2 usa planning map usa planning map pdf usa planning map Printable USA Blank Map, USA Blank Map PDF, Blank US State Map. Thursday, 19

More information

BOWL - APRIL 27, QUESTIONS 45 MINUTES

BOWL - APRIL 27, QUESTIONS 45 MINUTES PHYSICSBOWL AAPT/Metrologic High School Physics Contest PHYSICS BOWL - APRIL 27, 1995 40 QUESTIONS 45 MINUTES This contest is sponsored by the American Association of Physics Teachers (AAPT) and Metrologic

More information

St. Luke s College Institutional Snapshot

St. Luke s College Institutional Snapshot 1. Student Headcount St. Luke s College Institutional Snapshot Enrollment # % # % # % Full Time 139 76% 165 65% 132 54% Part Time 45 24% 89 35% 112 46% Total 184 254 244 2. Average Age Average Age Average

More information

If you have any questions concerning this report, please feel free to contact me. REPORT OF LABORATORY ANALYSIS

If you have any questions concerning this report, please feel free to contact me. REPORT OF LABORATORY ANALYSIS #=CL# LI USE: FR - JILL LAVOIE LI OBJECT ID: August 22, 2014 Jill Lavoie Merrimack Village District - UCMR3 2 Greens Pond Road Merrimack, NH 03054 RE: Dear Jill Lavoie: Enclosed are the analytical results

More information

Earl E. ~rabhl/ This report is preliminary and has not been reviewed for conformity with U.S. Geological Survey editoral standards GEOLOGICAL SURVEY

Earl E. ~rabhl/ This report is preliminary and has not been reviewed for conformity with U.S. Geological Survey editoral standards GEOLOGICAL SURVEY UNITED STATES DEPARTMENT OF THE INTERIOR GEOLOGICAL SURVEY Minimum landslide damage in the United States, 1973-1983 by Earl E. ~rabhl/ Open-File Report 84-486 This report is preliminary and has not been

More information