LISREL Workshop at M3 Conference 2015

Size: px
Start display at page:

Download "LISREL Workshop at M3 Conference 2015"

Transcription

1 LISREL Workshop at M3 Conference 2015 Karl G Jöreskog Norwegian Business School & Uppsala University May 11, 2015 Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

2 Ways of Working with LISREL Use LISREL s Graphical Interface (GUI) Use Syntax Files PRELIS Syntax SIMPLIS Syntax LISREL Syntax Other Syntax Syntax for Generalized Linear Models Syntax for Multilevel Analysis Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

3 Learning by Examples Real Examples and Real Data Learning How to Do it with LISREL Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

4 Topics to be Covered Creating the LISREL system data file (LSF file) Exploratory and Confirmatory Factor Analysis Estimating Latent Mean Differences Estimating Latent Growth Curves from Data with Missing Values Latent Variable Scores Multilevel Analysis Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

5 Example: Holzinger & Swineford Two Schools Data In this example we learn 1 how to import data into a LISREL system data file.lsf 2 how to graph and screen the data 3 how to divide the data into two groups (schools) 4 how to do exploratory factor analysis in one school 5 how to do confirmatory factor analysis in the other school 6 how to deal with non-normality 7 how to modify the model by means of path diagrams 8 how to interpret chi-square differences 9 how to estimate and test latent variable differences Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

6 Holzinger & Swineford (1939) collected data on twenty-six psychological tests administered to seventh- and eighth-grade children in two schools in Chicago: the Pasteur School and the Grant-White School. The Pasteur School had students whose parents had immigrated from Europe, mostly France and Germany. The students of the Grant-White School came from middle American white families. Nine of these tests are selected for this example. The nine tests are (with the original variable number in parenthesis): VISPERC Visual Perception (V1) CUBES Cubes (V2) LOZENGES Lozenges (V4) PARCOMP Paragraph Comprehension (V6) SENCOMP Sentence Completion (V7) WORDMEAN Word meaning (V9) Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

7 ADDITION Addition (V10) COUNTDOT Counting dots (V12) S-C CAPS Straight-curved capitals (V13) The nine test scores are preceded by four variables: SCHOOL 0 for Pasteur School; 1 for Grant-White School GENDER 0 for Boy; 1 for Girl AGEYEAR Age in years BIRTHMON Birthmonth: 1 = January, 2 = Februray = December The data file is an SPSS data file hsschools.sav. There are no missing values. Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

8 Importing, Graphing and Screening the Data It is important to understand what you can do before you learn to measure how well you seem to have done it. (Tukey,1977). Importing hsschools.sav to hsschools.lsf Defining the attributes of the data Variable Type: Ordinal or Continuou Missing Values Histogram: Plot WORDMEAN Bar Chart: Plot AGEYEAR Scatter Plot: Plot WORDMEAN on VISPERC Three-Dimensional Bar Chart: Plot AGEYEAR on SCHOOL Box-Whisker Plot: Plot WORDMEAN on SCHOOL Data Screening Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

9 The Box-Whisker Plot As implemented in LISREL the Box and Whisker plot gives the most important characteristics of the distributions of the continuous variable for each value of the categorical variable. The box indicates the area between lower and upper quartile so that 25% of the observations is below the bottom border of the box and 25% of the observations is above the top border of the box. The border line between the two colored areas in the box is the mean. If the areas above and below the mean are of different sizes, this indicates that the distribution of the observations is skewed. The two unfilled circles are the largest and the smallest value in the data. The wiskers are located ±2 IQR, where IQR is the interquartile range, i.e. the distance between the upper and lower quartile which is the length of the box. Observations that are outside of the whiskers are sometimes called outliers. Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

10 Splitting the Data into Groups Selecting Cases Selecting or Deleting Variables Tests of Univariate and Multivariate Normality Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

11 Exploratory Factor Analysis of the Pasteur School Sample Number of Factors Rotation Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

12 Confirmatory Factor Analysis of the Grant-White School Sample The distinction between exploratory and confirmatory factor analysis is illustrated in the following figures Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

13 To illustrate confirmatory factor analysis we use the data on the nine psychological variables for the Grant White school. For simplicity here we will call the data npv.lsf and use the short form npv in all files. Bare in mind that this refer to the Grant White school sample. It is hypothesized that these variables have three correlated common factors: visual perception here called Visual, verbal ability here called Verbal and speed here called Speed such that the first three variables measure Visual, the next three measure Verbal, and the last three measure Speed. A path diagram of the model to be estimated is shown in the next slide. Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

14 Confirmatory Factor Analysis Model for Nine Psychological Variables Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

15 Estimating and Testing the Model by Maximum Likelihood (ML) Estimating and Testing the Model by Robust Maximum Likelihood (RML) Evaluating the Fit Modifying the Model Estimating and Testing the Modified Model by Robust Maximum Likelihood (RML) Evaluating Chi-Square Differences Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

16 Parameter Estimates, Normal Standard Errors, and Robust Standard Errors Parameter Parameter Standard Errors Factor Loadings Estimates Normal Robust VISPERC on Visual λ (x) CUBES on Visual λ (x) LOZENGES on Visual λ (x) PARCOMP on Verbal λ (x) SENCOMP on Verbal λ (x) WORDMEAN on Verbal λ (x) ADDITION on Speed λ (x) COUNTDOT on Speed SCCAPS on Speed Factor Correlations Verbal vs Visual φ Verbal vs Speed φ Verbal vs Speed φ λ (x) λ (x) Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

17 Modified Confirmatory Factor Analysis Model for Nine Psychological Variables Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

18 Example: Stability of Alienation Wheaton, et al. (1977) report on a study concerned with the stability over time of attitudes such as alienation, and the relation to background variables such as education and occupation. Data on attitude scales were collected from 932 persons in two rural regions in Illinois at three points in time: 1966, 1967, and The variables used for the present example are the Anomia subscale and the Powerlessness subscale, taken to be indicators of Alienation. This example uses data from 1967 and 1971 only. The background variables are the respondent s education (years of schooling completed) and Duncan s Socioeconomic Index (SEI). These are taken to be indicators of the respondent s socioeconomic status (Ses). The data file is stability.lsf. Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

19 The research questions are: Has the feeling of alienation increased or degreased between 1967 and 1971? Is the feeling of alienation stable over time? Stability is indicated by the variance of alienation. If the variance has changed much between 1967 and 1971 it means that for some people the feeling of alienation has increased while it has decreased for other people. Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

20 The model to be considered here is shown in Slide 22. For Models A and B we estimate this model as covariance strcture. Model A we assume uncorrelated measurement errors. For Model B we assume that the measurement errors on ANOMIA67 and ANOMIA71 and the mesurement errors on POWERL67 and POWERL71 are correlated as in Slide 22. For Model C we add a mean structure to estimate the latent mean difference between Alien71 and Alien67. Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

21 The error terms of ANOMIA and POWERL are assumed to be correlated over time to take specific factors into account. The four one-way arrows on the right side represent the measurement errors in ANOMIA67, POWERL67, ANOMIA71, and POWERL71, respectively. The two-way arrows on the right side indicate that some of these measurement errors are correlated. The covariance between the two error terms for each variable can be interpreted as a specific error variance. Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

22 Stability of Alienation Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

23 Using the notation y 1 = ANOMIA67, y 2 = POWERL67, y 3 = ANOMIA71, y 4 = POWERL71, x 1 = EDUC, x 2 = SEI, ξ = Ses, η 1 = Alien67, η 2 = Alien71, the model is y 1 y 2 y 3 y 4 ( η1 = η 2 ( x1 x 2 ) = τ (y) 1 τ (y) 2 τ (y) 3 τ (y) 4 ) ( = ( 0 0 β 0 + τ (x) 1 τ (x) 2 ) ) ( η1 1 0 λ λ 2 ( 1 + η 2 ) + λ 3 ( γ1 ( η1 η 2 ) + ) ( δ1 ξ + γ 2 ) ξ + ) δ 2 ( ζ1 ζ 2 ɛ 1 ɛ 2 ɛ 3 ɛ 4 ) (1) (2) (3) Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

24 Estimating a Specific Variance The unique part of a measurement (the error terms δ i and ɛ i ) usually consists of two components: a specific factor s i and a pure random measurement error e i. These are indistinguishable, unless the measurements are designed in such a way that they can be separately identified (panel designs and multitrait-multimethod). The error terms δ i and ɛ i are often called measurement errors even though it is widely recognized that these terms may also contain a specific factor. Since ANOMIA67 and ANOMIA71 are the same measurement they may contain a specific factor s, so that the error terms ɛ 1 = s + e 1, (4) ɛ 3 = s + e 3, (5) where e 1 and e 3 are pure random measurement error. Then Cov(ɛ 1, ɛ 3 = Var(s). (6) Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

25 Intercepts and Means in SIMPLIS The variable CONST is a variable which is equal to 1 for every case. This variable is always available in SIMPLIS; it need not be in the data. It is used to estimate an intercept term or a mean. For example, Y = CONST X is used to specify the regression of Y on X: Y = α + γx. α is the coefficient of CONST just like γ is the coefficient of X. One can also use CONST to estimate a mean. For example, Y = CONST will estimate the mean of Y as the coefficient of CONST. Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

26 Analysis of Longitudinal Data with Missing Values Examples of Longitudinal Data Psychology The develepment of cognitive abilities in children Economy Growth of nations or sales of companies Sociology Change in crime rates across communities Time Intervals: Dayly, Weekly, Monthly, or Annually Unit of Analysis: Individuals, Businesses, Regions, Countries Regardless of the subject area or the time interval, social and behavioral scientists have a keen interest in describing and explaining the time trajectories of their variables. (Bollen & Curran (2006, p. 1) Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

27 Types of Incomplete Data Unit nonresponse Subject attrition Item nonresponse Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

28 Mechanisms of Nonresponse MCAR Missing completely at random MAR Missing at random MNAR Missing not at random Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

29 Probability of Missing Data MCAR Pr(z ij = missing) does not depend on any variable in the data. MAR Pr(z ij = missing) may depend on other variables in the data but not on z ij. Example: A missing value of a person s income may depend on his/her age and education but not on his/her actual income. MNAR Pr(z ij = missing) depends on z ij. Example: In a questionnaire people with higher income tend not to report their income. Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

30 Solutions to Incomplete Data Listwise deletion Pairwise deletion Imputation by matching Multiple imputation EM MCMC Direct analysis (FIML) Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

31 Maximum Likelihood with Missing Values Let z be a random vector of order k with a multivariate normal distribution with mean vector µ and covariance matrix Σ both of which are functions of a parameter vector θ. Suppose we have a sample of N independent observations of z. If there are missing data we assume they ara missing at random. Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

32 Let z i of order k i be the subset of z observed in unit i. Then z i N(µ i, Σ i ), where µ i and Σ i consist of the rows and columns of µ and Σ for which there are observations in z i. The logarithm of the likeliihod for the total sample is ln L = 1 2 N i=1 {ln(2π) + ln Σ i + tr[σ 1 i (z i µ i )(z i µ i ) ]} Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

33 Instead of maximizing ln L, maximum likelihood estimates of θ can be obtained by minimizing the function F (θ) = N i=1 {ln Σ i + trσ 1 i G i }, where G i = (z i µ i )(z i µ i ). The first and second derivatives of F (θ) required for the minimization can be obtained in the usual way. The deviance for the saturated model can be obtained by minimizing with respect to µ and Σ. N i=1 {ln Σ i + trσ 1 i G i }, Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

34 The Linear Curve Model y it = a i + b i T t + e it (7) i = 1, 2,..., N persons (8) T t = Time at occasion t = 1, 2,..., n i (9) a i = α + u i (10) ( ui b i = β + v i (11) ) N(0, Φ) (12) v i e it N(0, σe) 2 (13) y it = (α + βt t ) + (u i + v i T t + e it ) (14) Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

35 Trajectory for Person 1 y Time Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

36 Fitted Regression Line for Person 1 y Time Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

37 Fitted Regression Lines for Four Persons y Time The thick line is the average regression line. Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

38 A Quadratic Curve Model y it = a i + b i T t + c i T 2 t + e it (15) i = 1, 2,..., N persons (16) T t = Time at occasion t = 1, 2,..., n i (17) a i = α + u i (18) b i = β + v i (19) u i v i w i c i = γ + w i (20) N(0, Φ), e it N(0, σ 2 e) (21) y it = (α + βt t + γt 2 t ) + (u i + v i T t + w i T 2 t + e it ) (22) Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

39 The Linear Curve Model with Covariate y it = a i + b i T t + e it (23) i = 1, 2,..., N persons (24) T t = Time at occasion t = 1, 2,..., n i (25) a i = α + γ a z i + u i (26) b i = β + γ b z i + v i (27) z i = Covariate observed on person i (28) ( ) ui N(0, Φ) (29) v i e it N(0, σe) 2 (30) y it = (α + βt t + γ a z i + γ b T t z i ) + (u i + v i T t + e it ) (31) Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

40 PD of the Linear Curve Model with Covariate y 1 y 2 y 3 y a b z Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

41 Example: Treatment of Prostate Cancer A medical doctor offered all his patients diagnosed with prostate cancer a treatment aimed at reducing the cancer activity in the prostate. The severity of prostate cancer is often assessed by a plasma component known as prostate specific antigen (PSA), an enzyme that is elevated in the presence of prostate cancer. The PSA level was measured regularly every three months. The data contains five repeated measurements of PSA. The age of the patient is also included in the data. Not every patient accepted the offer initially and several patients chose to enter the program after the first occasion. Some patients, who accepted the initial offer, are absent at some later occasions for various reasons. Thus there are missing values in the data. Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

42 The aim of this study is to answer the following questions: What is the average initial PSA value? Do all patients have the same initial PSA value? Is there an overall effect of treatment. Is there a decline of PSA values over time, and, if so, what is the average rate of decline? Do all patients have the same rate of decline? Does the individual initial PSA value and/or the rate of decline depend on the patient s age? This is a typical example of repeated measurements data, the analysis of which is sometimes done within the framework of multilevel analysis. It represents the simplest type of two-level model but it can also be analyzed as a structural equation model, see Bollen & Curran (2006). In this context it illustrates a mean and covariance structure model estimated from longitudinal data with missing values. Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

43 Two Ways of Organizing the Data As multilevel data As longitudinal data Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

44 PSA Data in Multilevel Form Patient Months PSA Age Patients 9 and 10 are missing at 3 months Patient 15 is missing at 3 and 6 months Patient 16 is missing at 0, 3, and 12 months Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

45 PSA Data in Longitudinal Form Patient PSA0 PSA3 PSA6 PSA9 PSA12 Age Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

46 The data file for this example is psavar.lsf, where missing values are shown as If the data is imported from an external source which already have a missing value code, the missing values will show up in the lsf file as , which is the global missing data code in LISREL. Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

47 In this kind of data it is inevitable that there are missing values. For example, a patient may be on vacation or ill or unable to come to the doctor for any reason at some occasion or a patient may die and therefore will not come to the doctor after a certain occasion. It is seen in that Patients 9 and 10 are missing at 3 months Patient 15 is missing at 3 and 6 months Patient 16 is missing at 0, 3, and 12 months In the following analysis it is assumed that data are missing at random (MAR), although there may be a small probability that a patient will be missing because his PSA value is high. Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

48 The model to be estimated is y it = a i + b i T t + e it (32) i = 1, 2,..., N individuals (33) T t = Time at occasion t = 1, 2,..., n i (34) a i = α + γ a z i + u i (35) b i = β + γ b z i + v i (36) z i = Covariate observed on individual i (37) ( ) ui N(0, Φ) (38) v i e it N(0, σ 2 e) (39) Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

49 An interpretation of this is as follows. Each patient has his own linear growth curve 2, represented by (32) which is the regression of y it on time with intercept a i and slope b i varying across patients. In principle, the intercepts a i and slopes b i could all be different across patients. It is of interest to know if the intercepts and/or the slopes are equal across patients. The four cases are illustrated in the next Figure. If there is variation in intercepts and/or the slopes across patients, one is interested in whether a covariate z i (in this case age) can predict the intercept and/or the slope. 2 In general, the growth curves are not restricted to be linear, but can be quadratic, cubic, or other types of functions of time, see Jöreskog, Sörbom, Du Toit, & Du Toit (2003). Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

50 Four Cases of Intercepts and Slopes Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

51 Linear Growth Curve for psavar Data Raw Data from File psavar.lsf Latent Variables: a b Relationships PSA0 = 1*a 0*b PSA3 = 1*a 3*b PSA6 = 1*a 6*b PSA9 = 1*a 9*b PSA12 = 1*a 12*b a b = CONST Equal Error Variances: PSA0 - PSA12 Path Diagram End of Problem Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

52 Linear Growth Curve with Covariate for psavar Data Raw Data from File psavar.lsf Latent Variables: a b Relationships PSA0 = 1*a 0*b PSA3 = 1*a 3*b PSA6 = 1*a 6*b PSA9 = 1*a 9*b PSA12 = 1*a 12*b a b = CONST Age Let the Errors on a and b correlate Equal Error Variances: PSA0 - PSA12 Path Diagram End of Problem Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

53 Example: Training of Air Traffic Controllers Kanfer & Ackerman (1989) analyzed data on 141 U.S. Air Force enlisted personnel who carried out a computerized air traffic controller task developed by Kanfer and Ackerman. 3 The subjects were instructed to accept planes into their hold pattern and land them safely and efficiently on one of four runways, varying in length and compass directions, according to rules governing plane movements and landing requirements. For each subject, the success of a series of between three and six 10-minute trials was recorded. The measurement employed was the number of correct landings per trial. A measure of cognitive ability was also obtained from each controller at the begin of the trials. 3 Permission for SSI to use the copyrighted raw data was provided by R. Kanfer and P.L. Ackerman. The data remain the copyrighted property of Ruth Kanfer and Phillip L. Ackerman. Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

54 In this example we learn 1 how to analyze longitudinal data with missing values 2 alternative ways to deal with missing values 3 how to plot and interpret a box-plot 4 how to decide the shape of the growth curve 5 how to estimate and test so called latent growth curves using FIML Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

55 Questions 1 Should the learning curve be linear, quadratic, cubic, or other? 2 Do the shape of the curve differ between controllers? 3 Do the coefficients describing the learning curve depend on cognitive ability? Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

56 Box Whisker Plot of Air Controller Data Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

57 A Cubic Fitted to Three Controllers Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

58 Latent Variable Scores There are several methods available for estimating latent variable scores 4, e.g., Lawley & Maxwell (1971, Chapter 8) or Bartholomew & Knott (1999, pp ). The two most commonly used methods for estimating latent variable scores are the regression method of Thomson (1939) and the Bartlett method (Bartlett, 1937). In LISREL we use the procedure of Anderson & Rubin (1956) as described in Jöreskog (2000). This procedure has the advantage of producing latent variable scores that have the same covariance matrix as the latent variables themselves. The results reported here are based on latent variable scores estimated by the Anderson & Rubin method. This section describes how latent variable scores can be obtained with LISREL 9 and illustrates their use with one example. 4 In classical exploratory factor analysis these are usually called factor scores Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

59 Example: Panel Model for Political Democracy Bollen (1989, p. 17) presents a panel model of political democracy and industrialization in 75 countries. The model is shown in the path diagram in the following Figure Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

60 The variables in the model are y 1 Freedom of press 1960 y 2 Freedom of political opposition 1960 y 3 Fairness of elections 1960 y 4 Effectiveness of legislature 1960 y 5 Freedom of press 1965 y 6 Freedom of political opposition 1965 y 7 Fairness of elections 1965 y 8 Effectiveness of legislature 1965 x 1 GNP per capita 1960 x 2 Energy consumtion per capita 1960 x 3 Percentage of labor force in industry 1960 η 1 Democracy in 1960 (Latent variable Dem60) η 2 Democracy in 1965 (Latent variable Dem65) ξ Level of industrialization in 1960 (Latent variable Indus) Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

61 y 1 y 4 are taken as indicators of the latent variable Dem60 (Democracy 1960) and y 5 y 8 are taken as indicators of the latent variable Dem65 (Democracy 1965). x 1 x 3 are taken as indicators of the latent variable Indus (Industrialization 1960). Data on y 1 x 3 are available for 75 developing countries. The datafile is POLIDEM.LSF. The following SIMPLIS syntax file (BA1a.SPL) will estimate scores on Demo60, Demo65, and Indus for each country in file POLIDEMnew.LSF. Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

62 Industrialization-Democracy Example Raw Data from file POLIDEM.LSF Latent Variables: Dem60 Dem65 Indus Relationships: Y1= 1*Dem60 Y2-Y4 = Dem60 Y5 = 1*Dem65 Y6-Y8 = Dem65 X9 = 1*Indus X10-X11 = Indus Dem60 = Indus Dem65 = Dem60 Indus Set Dem60 -> Y2 = Dem65 -> Y6 Set Dem60 -> Y3 = Dem65 -> Y7 Set Dem60 -> Y4 = Dem65 -> Y8 Let the errors of Y5 and Y1 be correlated let the errors of Y6 and Y2 be correlated Let the errors of Y7 and Y3 be correlated Let the errors of Y8 and Y4 be correlated LSFfile POLIDEMnew.LSF Path Diagram End of Problem Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

63 The variables y 1 y 4 are the same variables as y 5 y 8 measured at two points in time. So Bollen & Arminger (1991) assume that their loadings on η 1 and η 2 are the same. This is specified by the lines Set Dem60 -> Y2 = Dem65 -> Y6 Set Dem60 -> Y3 = Dem65 -> Y7 Set Dem60 -> Y4 = Dem65 -> Y8 The loadings of y 1 and y 5 are set to 1. Furthermore, they assume that the measurement errors of corresponding y-variables are correlated. This is specified by the lines Let the errors of Y5 and Y1 be correlated let the errors of Y6 and Y2 be correlated Let the errors of Y7 and Y3 be correlated Let the errors of Y8 and Y4 be correlated Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

64 After this run is completed the file POLIDEMnew.LSF contains the following variables Y1 Y2 Y3 Y4 Y5 Y6 Y7 Y8 X1 X2 X3 Dem60 Dem65 Indus It would be useful to know which country each row belongs to. Unfortunately, this information is not available. However, one can construct a variable COUNTRY which runs from 1 to 75 5 This is done with the following PRELIS syntax file (BA2.PRL) which at the same time constructs scores on another latent variable Diff = Dem65 - Dem60. SY=POLIDEMnew.LSF New COUNTRY=TIME New Diff=Dem65-Dem60 CO ALL Select COUNTRY Dem60 Dem65 Diff OU RA=DEMDIFF.LSF 5 The variable TIME is always available in PRELIS. It assigns values 1, 2,..., N to the cases. It is intended mainly for time series. Hence its name TIME. Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

65 The file DEMDIFF.LSF contains the following variables COUNTRY Dem60 Dem65 Diff With this file one can do various things to find out which countries have most democracy or which countries increased or decreased their democracy between 1960 and For example, do a bivariate line plot of Diff against COUNTRY. This shows that country 2 increased democracy most and country 30 had the largest decrease. This can also be seen by sorting Diff in descending order. This shows that country 2 has a Diff value of 1.69 and country 30 has a Diff value of These are the best and worst countries. The second best and second worst countries are the countries 22 and 34. These have Diff values of 1.39 and -1.38, respectively. Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

66 Example: Attitudes to Drinking and Driving (DRINK) In this example we learn 1 how to test each part of a model one at a time 2 how to build the model successively by adding one part at a time 3 how to estimate and test each model using SIMPLIS syntax 4 how to evaluate the fit of the final model 5 how to estimate the final model using LISREL syntax Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

67 A study designed to determine the predictors of drinking and driving behavior used the theory of planned behavior of Ajzen (1991) to setup the model shown in the path diagram in Figure 68. The latent variables shown in the figure are as follows: Attitude attitude toward drinking and driving Norms social norms pertaining to drinking and driving Control percieved control over drinking and driving Intention intention to drink and drive Behavior drinking and driving behavior Attitude is measured by five indicators X 2 -X 5, Norms is measured by three indicators X 6 -X 8, Control is measured by four indicators X 9 -X 12, Intention is measured by two indicators Y 1 and Y 2, and behavior is measured by two indicators Y 3 and Y 4. Data on 756 male drivers aged years is given in file drinkdata.lsf. Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

68 Figure: Conceptual Path Diagram for Attitudes to Drinking and Driving Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

69 This example illustrates a possible strategy of analysis. Begin by testing the measurement model for Attitude, see file drink11a.spl. Drinking and Driving Testing the Measurement Model for Attitude Raw Data from File drinkdata.lsf Latent Variables Attitude Relationships X1-X5 = Attitude Path Diagram End of Problem Note that although the data file drinkdata.lsf contains many variables, LISREL automatically selects the subset of variables used in the model. Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

70 To test the measurement model for Attitude and Norms simultaneously, add Norms in the list of Latent Variables and add the line, see file drink12a.spl. X6-X8 = Norms To test the measurement model for Attitude, Norms, and Control simultaneously, add Control in the list of Latent Variables and add the line, see file drink13a.spl. X9-X12= Control Finally, to test the measurement model for all latent variables simultaneosly, add Intention and Behavior in the list of Latent Variables and add the two lines, see file drink14a.spl. Y1-Y2 = Intention Y3-Y4 = Behavior Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

71 If any of these analysis shows a large modification index for an indicator, the measurement model must be reconsidered and modified. For example, suppose there is a large modification index for the path from Attitude to X8. This might mean that X8 is not entirely an indicator of Norms but to some extent also a measure of Attitude. If this idea makes sense then the model should be modified by letting X8 be a composite measure of both Norms and Attitude. One can now test the full model in Figure 68 by adding the two lines, see file drink15a.spl. Intention = Attitude - Control Behavior = Intention defining the structural relationships among the latent variables Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

72 The full SIMPLIS command file is now Drinking and Driving Raw Data from File drinkdata.lsf Latent Variables Attitude Norms Control Intention Behavior Relationships Y1-Y2 = Intention Y3-Y4 = Behavior X1-X5 = Attitude X6-X8 = Norms X9-X12= Control Intention = Attitude - Control Behavior = Intention Robust Estimation Path Diagram End of Problem Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

73 The output gives the following goodness-of-fit statistics Degrees of Freedom for (C1)-(C2) 96 Maximum Likelihood Ratio Chi-Square (C1) (P = ) Browne s (1984) ADF Chi-Square (C2_NT) (P = ) which suggests that the model does not fit. However, this is a test of exact fit. According to Browne & Cudeck (1993) one can use the following fit measures to test if the model fits approximately Root Mean Square Error of Approximation (RMSEA) Percent Confidence Interval for RMSEA ( ; ) P-Value for Test of Close Fit (RMSEA < 0.05) 1.00 which suggests that the model fits at least approximately. Karl G Jöreskog ( ) LISREL Workshop at M3 Conference 2015 May 11, / 73

STRUCTURAL EQUATION MODELING. Khaled Bedair Statistics Department Virginia Tech LISA, Summer 2013

STRUCTURAL EQUATION MODELING. Khaled Bedair Statistics Department Virginia Tech LISA, Summer 2013 STRUCTURAL EQUATION MODELING Khaled Bedair Statistics Department Virginia Tech LISA, Summer 2013 Introduction: Path analysis Path Analysis is used to estimate a system of equations in which all of the

More information

Specifying Latent Curve and Other Growth Models Using Mplus. (Revised )

Specifying Latent Curve and Other Growth Models Using Mplus. (Revised ) Ronald H. Heck 1 University of Hawai i at Mānoa Handout #20 Specifying Latent Curve and Other Growth Models Using Mplus (Revised 12-1-2014) The SEM approach offers a contrasting framework for use in analyzing

More information

What is Statistics? Statistics is the science of understanding data and of making decisions in the face of variability and uncertainty.

What is Statistics? Statistics is the science of understanding data and of making decisions in the face of variability and uncertainty. What is Statistics? Statistics is the science of understanding data and of making decisions in the face of variability and uncertainty. Statistics is a field of study concerned with the data collection,

More information

Time-Invariant Predictors in Longitudinal Models

Time-Invariant Predictors in Longitudinal Models Time-Invariant Predictors in Longitudinal Models Today s Class (or 3): Summary of steps in building unconditional models for time What happens to missing predictors Effects of time-invariant predictors

More information

Categorical and Zero Inflated Growth Models

Categorical and Zero Inflated Growth Models Categorical and Zero Inflated Growth Models Alan C. Acock* Summer, 2009 *Alan C. Acock, Department of Human Development and Family Sciences, Oregon State University, Corvallis OR 97331 (alan.acock@oregonstate.edu).

More information

Structural equation modeling

Structural equation modeling Structural equation modeling Rex B Kline Concordia University Montréal ISTQL Set B B1 Data, path models Data o N o Form o Screening B2 B3 Sample size o N needed: Complexity Estimation method Distributions

More information

WELCOME! Lecture 14: Factor Analysis, part I Måns Thulin

WELCOME! Lecture 14: Factor Analysis, part I Måns Thulin Quantitative methods II WELCOME! Lecture 14: Factor Analysis, part I Måns Thulin The first factor analysis C. Spearman (1904). General intelligence, objectively determined and measured. The American Journal

More information

Jan. 22, An Overview of Structural Equation. Karl Joreskog and LISREL: A personal story

Jan. 22, An Overview of Structural Equation. Karl Joreskog and LISREL: A personal story Jan., 8 Karl Joreskog and LISREL: A personal stor Modeling HC Weng //8 95, /5, born in Amal, Swedan Uppsala: 955-96 Ambition was to stud mathematics and phsics and become a high school teacher Princeton:

More information

Exploring Cultural Differences with Structural Equation Modelling

Exploring Cultural Differences with Structural Equation Modelling Exploring Cultural Differences with Structural Equation Modelling Wynne W. Chin University of Calgary and City University of Hong Kong 1996 IS Cross Cultural Workshop slide 1 The objectives for this presentation

More information

ECLT 5810 Data Preprocessing. Prof. Wai Lam

ECLT 5810 Data Preprocessing. Prof. Wai Lam ECLT 5810 Data Preprocessing Prof. Wai Lam Why Data Preprocessing? Data in the real world is imperfect incomplete: lacking attribute values, lacking certain attributes of interest, or containing only aggregate

More information

AP Final Review II Exploring Data (20% 30%)

AP Final Review II Exploring Data (20% 30%) AP Final Review II Exploring Data (20% 30%) Quantitative vs Categorical Variables Quantitative variables are numerical values for which arithmetic operations such as means make sense. It is usually a measure

More information

Multiple group models for ordinal variables

Multiple group models for ordinal variables Multiple group models for ordinal variables 1. Introduction In practice, many multivariate data sets consist of observations of ordinal variables rather than continuous variables. Most statistical methods

More information

INTRODUCTION TO STRUCTURAL EQUATION MODELS

INTRODUCTION TO STRUCTURAL EQUATION MODELS I. Description of the course. INTRODUCTION TO STRUCTURAL EQUATION MODELS A. Objectives and scope of the course. B. Logistics of enrollment, auditing, requirements, distribution of notes, access to programs.

More information

Stat 101 Exam 1 Important Formulas and Concepts 1

Stat 101 Exam 1 Important Formulas and Concepts 1 1 Chapter 1 1.1 Definitions Stat 101 Exam 1 Important Formulas and Concepts 1 1. Data Any collection of numbers, characters, images, or other items that provide information about something. 2. Categorical/Qualitative

More information

SEM for Categorical Outcomes

SEM for Categorical Outcomes This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this

More information

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages:

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages: Glossary The ISI glossary of statistical terms provides definitions in a number of different languages: http://isi.cbs.nl/glossary/index.htm Adjusted r 2 Adjusted R squared measures the proportion of the

More information

Multilevel Structural Equation Modeling

Multilevel Structural Equation Modeling Multilevel Structural Equation Modeling Joop Hox Utrecht University j.hox@uu.nl http://www.joophox.net 14_15_mlevsem Multilevel Regression Three level data structure Groups at different levels may have

More information

Statistical Methods. Missing Data snijders/sm.htm. Tom A.B. Snijders. November, University of Oxford 1 / 23

Statistical Methods. Missing Data  snijders/sm.htm. Tom A.B. Snijders. November, University of Oxford 1 / 23 1 / 23 Statistical Methods Missing Data http://www.stats.ox.ac.uk/ snijders/sm.htm Tom A.B. Snijders University of Oxford November, 2011 2 / 23 Literature: Joseph L. Schafer and John W. Graham, Missing

More information

TOPIC: Descriptive Statistics Single Variable

TOPIC: Descriptive Statistics Single Variable TOPIC: Descriptive Statistics Single Variable I. Numerical data summary measurements A. Measures of Location. Measures of central tendency Mean; Median; Mode. Quantiles - measures of noncentral tendency

More information

Assessing the relation between language comprehension and performance in general chemistry. Appendices

Assessing the relation between language comprehension and performance in general chemistry. Appendices Assessing the relation between language comprehension and performance in general chemistry Daniel T. Pyburn a, Samuel Pazicni* a, Victor A. Benassi b, and Elizabeth E. Tappin c a Department of Chemistry,

More information

Chapter 2: Tools for Exploring Univariate Data

Chapter 2: Tools for Exploring Univariate Data Stats 11 (Fall 2004) Lecture Note Introduction to Statistical Methods for Business and Economics Instructor: Hongquan Xu Chapter 2: Tools for Exploring Univariate Data Section 2.1: Introduction What is

More information

Fixed effects results...32

Fixed effects results...32 1 MODELS FOR CONTINUOUS OUTCOMES...7 1.1 MODELS BASED ON A SUBSET OF THE NESARC DATA...7 1.1.1 The data...7 1.1.1.1 Importing the data and defining variable types...8 1.1.1.2 Exploring the data...12 Univariate

More information

Spring 2012 Student Performance Analysis

Spring 2012 Student Performance Analysis Spring 2012 Student Performance Analysis Algebra I Standards of Learning Presentation may be paused and resumed using the arrow keys or the mouse. 1 Representing and Evaluating Cube Roots and Square Roots

More information

Basics of Modern Missing Data Analysis

Basics of Modern Missing Data Analysis Basics of Modern Missing Data Analysis Kyle M. Lang Center for Research Methods and Data Analysis University of Kansas March 8, 2013 Topics to be Covered An introduction to the missing data problem Missing

More information

Mixed- Model Analysis of Variance. Sohad Murrar & Markus Brauer. University of Wisconsin- Madison. Target Word Count: Actual Word Count: 2755

Mixed- Model Analysis of Variance. Sohad Murrar & Markus Brauer. University of Wisconsin- Madison. Target Word Count: Actual Word Count: 2755 Mixed- Model Analysis of Variance Sohad Murrar & Markus Brauer University of Wisconsin- Madison The SAGE Encyclopedia of Educational Research, Measurement and Evaluation Target Word Count: 3000 - Actual

More information

Lab 3: Two levels Poisson models (taken from Multilevel and Longitudinal Modeling Using Stata, p )

Lab 3: Two levels Poisson models (taken from Multilevel and Longitudinal Modeling Using Stata, p ) Lab 3: Two levels Poisson models (taken from Multilevel and Longitudinal Modeling Using Stata, p. 376-390) BIO656 2009 Goal: To see if a major health-care reform which took place in 1997 in Germany was

More information

From Practical Data Analysis with JMP, Second Edition. Full book available for purchase here. About This Book... xiii About The Author...

From Practical Data Analysis with JMP, Second Edition. Full book available for purchase here. About This Book... xiii About The Author... From Practical Data Analysis with JMP, Second Edition. Full book available for purchase here. Contents About This Book... xiii About The Author... xxiii Chapter 1 Getting Started: Data Analysis with JMP...

More information

Systematic error, of course, can produce either an upward or downward bias.

Systematic error, of course, can produce either an upward or downward bias. Brief Overview of LISREL & Related Programs & Techniques (Optional) Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised April 6, 2015 STRUCTURAL AND MEASUREMENT MODELS:

More information

Factor analysis. George Balabanis

Factor analysis. George Balabanis Factor analysis George Balabanis Key Concepts and Terms Deviation. A deviation is a value minus its mean: x - mean x Variance is a measure of how spread out a distribution is. It is computed as the average

More information

Computationally Efficient Estimation of Multilevel High-Dimensional Latent Variable Models

Computationally Efficient Estimation of Multilevel High-Dimensional Latent Variable Models Computationally Efficient Estimation of Multilevel High-Dimensional Latent Variable Models Tihomir Asparouhov 1, Bengt Muthen 2 Muthen & Muthen 1 UCLA 2 Abstract Multilevel analysis often leads to modeling

More information

STRUCTURAL EQUATION MODEL (SEM)

STRUCTURAL EQUATION MODEL (SEM) STRUCTURAL EQUATION MODEL (SEM) V. Čekanavičius, G. Murauskas 1 PURPOSE OF SEM To check if the model of possible variable dependencies matches data. SEM can contain latent (directly unobservable) variables.

More information

Vocabulary: Samples and Populations

Vocabulary: Samples and Populations Vocabulary: Samples and Populations Concept Different types of data Categorical data results when the question asked in a survey or sample can be answered with a nonnumerical answer. For example if we

More information

Time Invariant Predictors in Longitudinal Models

Time Invariant Predictors in Longitudinal Models Time Invariant Predictors in Longitudinal Models Longitudinal Data Analysis Workshop Section 9 University of Georgia: Institute for Interdisciplinary Research in Education and Human Development Section

More information

Inference using structural equations with latent variables

Inference using structural equations with latent variables This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this

More information

4. Path Analysis. In the diagram: The technique of path analysis is originated by (American) geneticist Sewell Wright in early 1920.

4. Path Analysis. In the diagram: The technique of path analysis is originated by (American) geneticist Sewell Wright in early 1920. 4. Path Analysis The technique of path analysis is originated by (American) geneticist Sewell Wright in early 1920. The relationships between variables are presented in a path diagram. The system of relationships

More information

Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model

Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model EPSY 905: Multivariate Analysis Lecture 1 20 January 2016 EPSY 905: Lecture 1 -

More information

Research Design: Topic 18 Hierarchical Linear Modeling (Measures within Persons) 2010 R.C. Gardner, Ph.d.

Research Design: Topic 18 Hierarchical Linear Modeling (Measures within Persons) 2010 R.C. Gardner, Ph.d. Research Design: Topic 8 Hierarchical Linear Modeling (Measures within Persons) R.C. Gardner, Ph.d. General Rationale, Purpose, and Applications Linear Growth Models HLM can also be used with repeated

More information

Longitudinal Data Analysis of Health Outcomes

Longitudinal Data Analysis of Health Outcomes Longitudinal Data Analysis of Health Outcomes Longitudinal Data Analysis Workshop Running Example: Days 2 and 3 University of Georgia: Institute for Interdisciplinary Research in Education and Human Development

More information

An Introduction to Mplus and Path Analysis

An Introduction to Mplus and Path Analysis An Introduction to Mplus and Path Analysis PSYC 943: Fundamentals of Multivariate Modeling Lecture 10: October 30, 2013 PSYC 943: Lecture 10 Today s Lecture Path analysis starting with multivariate regression

More information

Chapter2 Description of samples and populations. 2.1 Introduction.

Chapter2 Description of samples and populations. 2.1 Introduction. Chapter2 Description of samples and populations. 2.1 Introduction. Statistics=science of analyzing data. Information collected (data) is gathered in terms of variables (characteristics of a subject that

More information

Short Answer Questions: Answer on your separate blank paper. Points are given in parentheses.

Short Answer Questions: Answer on your separate blank paper. Points are given in parentheses. ISQS 6348 Final exam solutions. Name: Open book and notes, but no electronic devices. Answer short answer questions on separate blank paper. Answer multiple choice on this exam sheet. Put your name on

More information

Descriptive Data Summarization

Descriptive Data Summarization Descriptive Data Summarization Descriptive data summarization gives the general characteristics of the data and identify the presence of noise or outliers, which is useful for successful data cleaning

More information

Time-Invariant Predictors in Longitudinal Models

Time-Invariant Predictors in Longitudinal Models Time-Invariant Predictors in Longitudinal Models Topics: What happens to missing predictors Effects of time-invariant predictors Fixed vs. systematically varying vs. random effects Model building strategies

More information

Introduction to Matrix Algebra and the Multivariate Normal Distribution

Introduction to Matrix Algebra and the Multivariate Normal Distribution Introduction to Matrix Algebra and the Multivariate Normal Distribution Introduction to Structural Equation Modeling Lecture #2 January 18, 2012 ERSH 8750: Lecture 2 Motivation for Learning the Multivariate

More information

Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model

Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 1: August 22, 2012

More information

Misspecification in Nonrecursive SEMs 1. Nonrecursive Latent Variable Models under Misspecification

Misspecification in Nonrecursive SEMs 1. Nonrecursive Latent Variable Models under Misspecification Misspecification in Nonrecursive SEMs 1 Nonrecursive Latent Variable Models under Misspecification Misspecification in Nonrecursive SEMs 2 Abstract A problem central to structural equation modeling is

More information

An Introduction to Path Analysis

An Introduction to Path Analysis An Introduction to Path Analysis PRE 905: Multivariate Analysis Lecture 10: April 15, 2014 PRE 905: Lecture 10 Path Analysis Today s Lecture Path analysis starting with multivariate regression then arriving

More information

Time-Invariant Predictors in Longitudinal Models

Time-Invariant Predictors in Longitudinal Models Time-Invariant Predictors in Longitudinal Models Today s Topics: What happens to missing predictors Effects of time-invariant predictors Fixed vs. systematically varying vs. random effects Model building

More information

LAB 3 INSTRUCTIONS SIMPLE LINEAR REGRESSION

LAB 3 INSTRUCTIONS SIMPLE LINEAR REGRESSION LAB 3 INSTRUCTIONS SIMPLE LINEAR REGRESSION In this lab you will first learn how to display the relationship between two quantitative variables with a scatterplot and also how to measure the strength of

More information

Preface. List of examples

Preface. List of examples Contents Preface List of examples i xix 1 LISREL models and methods 1 1.1 The general LISREL model 1 Assumptions 2 The covariance matrix of the observations as implied by the LISREL model 3 Fixed, free,

More information

Descriptive Statistics-I. Dr Mahmoud Alhussami

Descriptive Statistics-I. Dr Mahmoud Alhussami Descriptive Statistics-I Dr Mahmoud Alhussami Biostatistics What is the biostatistics? A branch of applied math. that deals with collecting, organizing and interpreting data using well-defined procedures.

More information

DETAILED CONTENTS PART I INTRODUCTION AND DESCRIPTIVE STATISTICS. 1. Introduction to Statistics

DETAILED CONTENTS PART I INTRODUCTION AND DESCRIPTIVE STATISTICS. 1. Introduction to Statistics DETAILED CONTENTS About the Author Preface to the Instructor To the Student How to Use SPSS With This Book PART I INTRODUCTION AND DESCRIPTIVE STATISTICS 1. Introduction to Statistics 1.1 Descriptive and

More information

Midterm 2 - Solutions

Midterm 2 - Solutions Ecn 102 - Analysis of Economic Data University of California - Davis February 23, 2010 Instructor: John Parman Midterm 2 - Solutions You have until 10:20am to complete this exam. Please remember to put

More information

CIVL 7012/8012. Collection and Analysis of Information

CIVL 7012/8012. Collection and Analysis of Information CIVL 7012/8012 Collection and Analysis of Information Uncertainty in Engineering Statistics deals with the collection and analysis of data to solve real-world problems. Uncertainty is inherent in all real

More information

Ron Heck, Fall Week 8: Introducing Generalized Linear Models: Logistic Regression 1 (Replaces prior revision dated October 20, 2011)

Ron Heck, Fall Week 8: Introducing Generalized Linear Models: Logistic Regression 1 (Replaces prior revision dated October 20, 2011) Ron Heck, Fall 2011 1 EDEP 768E: Seminar in Multilevel Modeling rev. January 3, 2012 (see footnote) Week 8: Introducing Generalized Linear Models: Logistic Regression 1 (Replaces prior revision dated October

More information

Multilevel Analysis of Grouped and Longitudinal Data

Multilevel Analysis of Grouped and Longitudinal Data Multilevel Analysis of Grouped and Longitudinal Data Joop J. Hox Utrecht University Second draft, to appear in: T.D. Little, K.U. Schnabel, & J. Baumert (Eds.). Modeling longitudinal and multiple-group

More information

Lecture 1: Descriptive Statistics

Lecture 1: Descriptive Statistics Lecture 1: Descriptive Statistics MSU-STT-351-Sum 15 (P. Vellaisamy: MSU-STT-351-Sum 15) Probability & Statistics for Engineers 1 / 56 Contents 1 Introduction 2 Branches of Statistics Descriptive Statistics

More information

(quantitative or categorical variables) Numerical descriptions of center, variability, position (quantitative variables)

(quantitative or categorical variables) Numerical descriptions of center, variability, position (quantitative variables) 3. Descriptive Statistics Describing data with tables and graphs (quantitative or categorical variables) Numerical descriptions of center, variability, position (quantitative variables) Bivariate descriptions

More information

Model Estimation Example

Model Estimation Example Ronald H. Heck 1 EDEP 606: Multivariate Methods (S2013) April 7, 2013 Model Estimation Example As we have moved through the course this semester, we have encountered the concept of model estimation. Discussions

More information

Goals for the Morning

Goals for the Morning Introduction to Growth Curve Modeling: An Overview and Recommendations for Practice Patrick J. Curran & Daniel J. Bauer University of North Carolina at Chapel Hill Goals for the Morning Brief review of

More information

A Threshold-Free Approach to the Study of the Structure of Binary Data

A Threshold-Free Approach to the Study of the Structure of Binary Data International Journal of Statistics and Probability; Vol. 2, No. 2; 2013 ISSN 1927-7032 E-ISSN 1927-7040 Published by Canadian Center of Science and Education A Threshold-Free Approach to the Study of

More information

1-1. Chapter 1. Sampling and Descriptive Statistics by The McGraw-Hill Companies, Inc. All rights reserved.

1-1. Chapter 1. Sampling and Descriptive Statistics by The McGraw-Hill Companies, Inc. All rights reserved. 1-1 Chapter 1 Sampling and Descriptive Statistics 1-2 Why Statistics? Deal with uncertainty in repeated scientific measurements Draw conclusions from data Design valid experiments and draw reliable conclusions

More information

104 Business Research Methods - MCQs

104 Business Research Methods - MCQs 104 Business Research Methods - MCQs 1) Process of obtaining a numerical description of the extent to which a person or object possesses some characteristics a) Measurement b) Scaling c) Questionnaire

More information

Multilevel/Mixed Models and Longitudinal Analysis Using Stata

Multilevel/Mixed Models and Longitudinal Analysis Using Stata Multilevel/Mixed Models and Longitudinal Analysis Using Stata Isaac J. Washburn PhD Research Associate Oregon Social Learning Center Summer Workshop Series July 2010 Longitudinal Analysis 1 Longitudinal

More information

Quantitative Bivariate Data

Quantitative Bivariate Data Statistics 211 (L02) - Linear Regression Quantitative Bivariate Data Consider two quantitative variables, defined in the following way: X i - the observed value of Variable X from subject i, i = 1, 2,,

More information

Elementary Statistics

Elementary Statistics Elementary Statistics Q: What is data? Q: What does the data look like? Q: What conclusions can we draw from the data? Q: Where is the middle of the data? Q: Why is the spread of the data important? Q:

More information

Fundamentals to Biostatistics. Prof. Chandan Chakraborty Associate Professor School of Medical Science & Technology IIT Kharagpur

Fundamentals to Biostatistics. Prof. Chandan Chakraborty Associate Professor School of Medical Science & Technology IIT Kharagpur Fundamentals to Biostatistics Prof. Chandan Chakraborty Associate Professor School of Medical Science & Technology IIT Kharagpur Statistics collection, analysis, interpretation of data development of new

More information

Introducing Generalized Linear Models: Logistic Regression

Introducing Generalized Linear Models: Logistic Regression Ron Heck, Summer 2012 Seminars 1 Multilevel Regression Models and Their Applications Seminar Introducing Generalized Linear Models: Logistic Regression The generalized linear model (GLM) represents and

More information

What is statistics? Statistics is the science of: Collecting information. Organizing and summarizing the information collected

What is statistics? Statistics is the science of: Collecting information. Organizing and summarizing the information collected What is statistics? Statistics is the science of: Collecting information Organizing and summarizing the information collected Analyzing the information collected in order to draw conclusions Two types

More information

A study on new customer satisfaction index model of smart grid

A study on new customer satisfaction index model of smart grid 2nd Annual International Conference on Electronics, Electrical Engineering and Information Science (EEEIS 2016) A study on new index model of smart grid Ze-San Liu 1,a, Zhuo Yu 1,b and Ai-Qiang Dong 1,c

More information

Investigating Models with Two or Three Categories

Investigating Models with Two or Three Categories Ronald H. Heck and Lynn N. Tabata 1 Investigating Models with Two or Three Categories For the past few weeks we have been working with discriminant analysis. Let s now see what the same sort of model might

More information

Final Exam - Solutions

Final Exam - Solutions Ecn 102 - Analysis of Economic Data University of California - Davis March 19, 2010 Instructor: John Parman Final Exam - Solutions You have until 5:30pm to complete this exam. Please remember to put your

More information

A multivariate multilevel model for the analysis of TIMMS & PIRLS data

A multivariate multilevel model for the analysis of TIMMS & PIRLS data A multivariate multilevel model for the analysis of TIMMS & PIRLS data European Congress of Methodology July 23-25, 2014 - Utrecht Leonardo Grilli 1, Fulvia Pennoni 2, Carla Rampichini 1, Isabella Romeo

More information

Describing Change over Time: Adding Linear Trends

Describing Change over Time: Adding Linear Trends Describing Change over Time: Adding Linear Trends Longitudinal Data Analysis Workshop Section 7 University of Georgia: Institute for Interdisciplinary Research in Education and Human Development Section

More information

A Study of Statistical Power and Type I Errors in Testing a Factor Analytic. Model for Group Differences in Regression Intercepts

A Study of Statistical Power and Type I Errors in Testing a Factor Analytic. Model for Group Differences in Regression Intercepts A Study of Statistical Power and Type I Errors in Testing a Factor Analytic Model for Group Differences in Regression Intercepts by Margarita Olivera Aguilar A Thesis Presented in Partial Fulfillment of

More information

Keystone Exams: Algebra

Keystone Exams: Algebra KeystoneExams:Algebra TheKeystoneGlossaryincludestermsanddefinitionsassociatedwiththeKeystoneAssessmentAnchorsand Eligible Content. The terms and definitions included in the glossary are intended to assist

More information

LECTURE 4 PRINCIPAL COMPONENTS ANALYSIS / EXPLORATORY FACTOR ANALYSIS

LECTURE 4 PRINCIPAL COMPONENTS ANALYSIS / EXPLORATORY FACTOR ANALYSIS LECTURE 4 PRINCIPAL COMPONENTS ANALYSIS / EXPLORATORY FACTOR ANALYSIS NOTES FROM PRE- LECTURE RECORDING ON PCA PCA and EFA have similar goals. They are substantially different in important ways. The goal

More information

SEM Day 3 Lab Exercises SPIDA 2007 Dave Flora

SEM Day 3 Lab Exercises SPIDA 2007 Dave Flora SEM Day 3 Lab Exercises SPIDA 2007 Dave Flora 1 Today we will see how to estimate SEM conditional latent trajectory models and interpret output using both SAS and LISREL. Exercise 1 Using SAS PROC CALIS,

More information

WHAT IS STRUCTURAL EQUATION MODELING (SEM)?

WHAT IS STRUCTURAL EQUATION MODELING (SEM)? WHAT IS STRUCTURAL EQUATION MODELING (SEM)? 1 LINEAR STRUCTURAL RELATIONS 2 Terminología LINEAR LATENT VARIABLE MODELS T.W. Anderson (1989), Journal of Econometrics MULTIVARIATE LINEAR RELATIONS T.W. Anderson

More information

ESP 178 Applied Research Methods. 2/23: Quantitative Analysis

ESP 178 Applied Research Methods. 2/23: Quantitative Analysis ESP 178 Applied Research Methods 2/23: Quantitative Analysis Data Preparation Data coding create codebook that defines each variable, its response scale, how it was coded Data entry for mail surveys and

More information

CHAPTER 9 EXAMPLES: MULTILEVEL MODELING WITH COMPLEX SURVEY DATA

CHAPTER 9 EXAMPLES: MULTILEVEL MODELING WITH COMPLEX SURVEY DATA Examples: Multilevel Modeling With Complex Survey Data CHAPTER 9 EXAMPLES: MULTILEVEL MODELING WITH COMPLEX SURVEY DATA Complex survey data refers to data obtained by stratification, cluster sampling and/or

More information

Performance of fourth-grade students on an agility test

Performance of fourth-grade students on an agility test Starter Ch. 5 2005 #1a CW Ch. 4: Regression L1 L2 87 88 84 86 83 73 81 67 78 83 65 80 50 78 78? 93? 86? Create a scatterplot Find the equation of the regression line Predict the scores Chapter 5: Understanding

More information

Chapter Eight: Assessment of Relationships 1/42

Chapter Eight: Assessment of Relationships 1/42 Chapter Eight: Assessment of Relationships 1/42 8.1 Introduction 2/42 Background This chapter deals, primarily, with two topics. The Pearson product-moment correlation coefficient. The chi-square test

More information

For more information about how to cite these materials visit

For more information about how to cite these materials visit Author(s): Kerby Shedden, Ph.D., 2010 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution Share Alike 3.0 License: http://creativecommons.org/licenses/by-sa/3.0/

More information

STOCKHOLM UNIVERSITY Department of Economics Course name: Empirical Methods Course code: EC40 Examiner: Lena Nekby Number of credits: 7,5 credits Date of exam: Saturday, May 9, 008 Examination time: 3

More information

REVIEW 8/2/2017 陈芳华东师大英语系

REVIEW 8/2/2017 陈芳华东师大英语系 REVIEW Hypothesis testing starts with a null hypothesis and a null distribution. We compare what we have to the null distribution, if the result is too extreme to belong to the null distribution (p

More information

RANDOM INTERCEPT ITEM FACTOR ANALYSIS. IE Working Paper MK8-102-I 02 / 04 / Alberto Maydeu Olivares

RANDOM INTERCEPT ITEM FACTOR ANALYSIS. IE Working Paper MK8-102-I 02 / 04 / Alberto Maydeu Olivares RANDOM INTERCEPT ITEM FACTOR ANALYSIS IE Working Paper MK8-102-I 02 / 04 / 2003 Alberto Maydeu Olivares Instituto de Empresa Marketing Dept. C / María de Molina 11-15, 28006 Madrid España Alberto.Maydeu@ie.edu

More information

1 Introduction. 2 Example

1 Introduction. 2 Example Statistics: Multilevel modelling Richard Buxton. 2008. Introduction Multilevel modelling is an approach that can be used to handle clustered or grouped data. Suppose we are trying to discover some of the

More information

Revised: 2/19/09 Unit 1 Pre-Algebra Concepts and Operations Review

Revised: 2/19/09 Unit 1 Pre-Algebra Concepts and Operations Review 2/19/09 Unit 1 Pre-Algebra Concepts and Operations Review 1. How do algebraic concepts represent real-life situations? 2. Why are algebraic expressions and equations useful? 2. Operations on rational numbers

More information

Unit 2. Describing Data: Numerical

Unit 2. Describing Data: Numerical Unit 2 Describing Data: Numerical Describing Data Numerically Describing Data Numerically Central Tendency Arithmetic Mean Median Mode Variation Range Interquartile Range Variance Standard Deviation Coefficient

More information

Confirmatory Factor Analysis: Model comparison, respecification, and more. Psychology 588: Covariance structure and factor models

Confirmatory Factor Analysis: Model comparison, respecification, and more. Psychology 588: Covariance structure and factor models Confirmatory Factor Analysis: Model comparison, respecification, and more Psychology 588: Covariance structure and factor models Model comparison 2 Essentially all goodness of fit indices are descriptive,

More information

SPSS LAB FILE 1

SPSS LAB FILE  1 SPSS LAB FILE www.mcdtu.wordpress.com 1 www.mcdtu.wordpress.com 2 www.mcdtu.wordpress.com 3 OBJECTIVE 1: Transporation of Data Set to SPSS Editor INPUTS: Files: group1.xlsx, group1.txt PROCEDURE FOLLOWED:

More information

Analysis of Covariance. The following example illustrates a case where the covariate is affected by the treatments.

Analysis of Covariance. The following example illustrates a case where the covariate is affected by the treatments. Analysis of Covariance In some experiments, the experimental units (subjects) are nonhomogeneous or there is variation in the experimental conditions that are not due to the treatments. For example, a

More information

Mixed Effects Models

Mixed Effects Models Mixed Effects Models What is the effect of X on Y What is the effect of an independent variable on the dependent variable Independent variables are fixed factors. We want to measure their effect Random

More information

Structural Equation Modeling and Confirmatory Factor Analysis. Types of Variables

Structural Equation Modeling and Confirmatory Factor Analysis. Types of Variables /4/04 Structural Equation Modeling and Confirmatory Factor Analysis Advanced Statistics for Researchers Session 3 Dr. Chris Rakes Website: http://csrakes.yolasite.com Email: Rakes@umbc.edu Twitter: @RakesChris

More information

B. Weaver (24-Mar-2005) Multiple Regression Chapter 5: Multiple Regression Y ) (5.1) Deviation score = (Y i

B. Weaver (24-Mar-2005) Multiple Regression Chapter 5: Multiple Regression Y ) (5.1) Deviation score = (Y i B. Weaver (24-Mar-2005) Multiple Regression... 1 Chapter 5: Multiple Regression 5.1 Partial and semi-partial correlation Before starting on multiple regression per se, we need to consider the concepts

More information

Related Concepts: Lecture 9 SEM, Statistical Modeling, AI, and Data Mining. I. Terminology of SEM

Related Concepts: Lecture 9 SEM, Statistical Modeling, AI, and Data Mining. I. Terminology of SEM Lecture 9 SEM, Statistical Modeling, AI, and Data Mining I. Terminology of SEM Related Concepts: Causal Modeling Path Analysis Structural Equation Modeling Latent variables (Factors measurable, but thru

More information

UCLA STAT 10 Statistical Reasoning - Midterm Review Solutions Observational Studies, Designed Experiments & Surveys

UCLA STAT 10 Statistical Reasoning - Midterm Review Solutions Observational Studies, Designed Experiments & Surveys UCLA STAT 10 Statistical Reasoning - Midterm Review Solutions Observational Studies, Designed Experiments & Surveys.. 1. (i) The treatment being compared is: (ii). (5) 3. (3) 4. (4) Study 1: the number

More information

After completing this chapter, you should be able to:

After completing this chapter, you should be able to: Chapter 2 Descriptive Statistics Chapter Goals After completing this chapter, you should be able to: Compute and interpret the mean, median, and mode for a set of data Find the range, variance, standard

More information

Overview. Overview. Overview. Specific Examples. General Examples. Bivariate Regression & Correlation

Overview. Overview. Overview. Specific Examples. General Examples. Bivariate Regression & Correlation Bivariate Regression & Correlation Overview The Scatter Diagram Two Examples: Education & Prestige Correlation Coefficient Bivariate Linear Regression Line SPSS Output Interpretation Covariance ou already

More information