Evaluating sensitivity of parameters of interest to measurement invariance using the EPC-interest

Similar documents
Trends in Human Development Index of European Union

A Markov system analysis application on labour market dynamics: The case of Greece

Mathematics. Pre-Leaving Certificate Examination, Paper 2 Higher Level Time: 2 hours, 30 minutes. 300 marks L.20 NAME SCHOOL TEACHER

AD HOC DRAFTING GROUP ON TRANSNATIONAL ORGANISED CRIME (PC-GR-COT) STATUS OF RATIFICATIONS BY COUNCIL OF EUROPE MEMBER STATES

More formally, the Gini coefficient is defined as. with p(y) = F Y (y) and where GL(p, F ) the Generalized Lorenz ordinate of F Y is ( )

ECON Introductory Econometrics. Lecture 13: Internal and external validity

The Information Content of Capacity Utilisation Rates for Output Gap Estimates

Weighted Voting Games

Introduction A research example from ASR mltcooksd mlt2stage Outlook. Multi Level Tools. Influential cases in multi level modeling

Modelling structural change using broken sticks

An Introduction to Mplus and Path Analysis

STATISTICS Relationships between variables: Correlation

An Introduction to Path Analysis

Part A: Salmonella prevalence estimates. (Question N EFSA-Q ) Adopted by The Task Force on 28 March 2007

F M U Total. Total registrants at 31/12/2014. Profession AS 2, ,574 BS 15,044 7, ,498 CH 9,471 3, ,932

Annotated Exam of Statistics 6C - Prof. M. Romanazzi

Path Analysis. PRE 906: Structural Equation Modeling Lecture #5 February 18, PRE 906, SEM: Lecture 5 - Path Analysis

Applying ex-post harmonization of. in corruption research

How Well Are Recessions and Recoveries Forecast? Prakash Loungani, Herman Stekler and Natalia Tamirisa

data latent variable models with the EPC-interest

Appendix B: Detailed tables showing overall figures by country and measure

Structural equation modeling in evaluation of technological potential of European Union countries in the years

Chapter 6 Scatterplots, Association and Correlation

Variance estimation on SILC based indicators

PLUTO The Transport Response to the National Planning Framework. Dr. Aoife O Grady Department of Transport, Tourism and Sport

The School Geography Curriculum in European Geography Education. Similarities and differences in the United Europe.

Export Destinations and Input Prices. Appendix A

The Bootstrap: Theory and Applications. Biing-Shen Kuo National Chengchi University

Sustainability of balancing item of balance of payment for OECD countries: evidence from Fourier Unit Root Tests

A clustering view on ESS measures of political interest:

RISK ASSESSMENT METHODOLOGIES FOR LANDSLIDES

40 Years Listening to the Beat of the Earth

Confirmatory Factor Analysis: Model comparison, respecification, and more. Psychology 588: Covariance structure and factor models

A Prior Distribution of Bayesian Nonparametrics Incorporating Multiple Distances

Composition of capital ES060 ES060 POWSZECHNAES060 BANCO BILBAO VIZCAYA ARGENTARIA S.A. (BBVA)

Composition of capital DE025

Composition of capital CY007 CY007 POWSZECHNACY007 BANK OF CYPRUS PUBLIC CO LTD

Composition of capital NO051

Composition of capital LU045 LU045 POWSZECHNALU045 BANQUE ET CAISSE D'EPARGNE DE L'ETAT

Composition of capital DE028 DE028 POWSZECHNADE028 DekaBank Deutsche Girozentrale, Frankfurt

Composition of capital CY006 CY006 POWSZECHNACY006 CYPRUS POPULAR BANK PUBLIC CO LTD

A nonparametric test for path dependence in discrete panel data

Composition of capital DE017 DE017 POWSZECHNADE017 DEUTSCHE BANK AG

Composition of capital FR013

Composition of capital FR015

Composition of capital ES059

The Changing Nature of Gender Selection into Employment: Europe over the Great Recession

Online Appendix for Cultural Biases in Economic Exchange? Luigi Guiso Paola Sapienza Luigi Zingales

Walkthrough for Illustrations. Illustration 1

Course Econometrics I

Fittings and Pipe Connections

Composition of capital as of 30 September 2011 (CRD3 rules)

Composition of capital as of 30 September 2011 (CRD3 rules)

Composition of capital as of 30 September 2011 (CRD3 rules)

Composition of capital as of 30 September 2011 (CRD3 rules)

Composition of capital as of 30 September 2011 (CRD3 rules)

Composition of capital as of 30 September 2011 (CRD3 rules)

Lecture 2: Intermediate macroeconomics, autumn Lars Calmfors

North-South Gap Mapping Assignment Country Classification / Statistical Analysis

Interval-Based Composite Indicators

PIRLS 2011 The PIRLS 2011 Safe and Orderly School Scale

ORGANISATION FOR ECONOMIC CO-OPERATION AND DEVELOPMENT

WHO EpiData. A monthly summary of the epidemiological data on selected Vaccine preventable diseases in the WHO European Region

PIRLS 2011 The PIRLS 2011 Students Motivated to Read Scale

PIRLS 2011 The PIRLS 2011 Students Engaged in Reading Lessons Scale

STATISTICA MULTIVARIATA 2

PIRLS 2016 INTERNATIONAL RESULTS IN READING

Jay Magidson Statistical Innovations

Measuring Instruments Directive (MID) MID/EN14154 Short Overview

Economic and Social Council

The trade dispute between the US and China Who wins? Who loses?

Foundations of Modern Macroeconomics B. J. Heijdra & F. van der Ploeg Chapter 7: A Closer Look at the Labour Market

2006 Supplemental Tax Information for JennisonDryden and Strategic Partners Funds

WHO EpiData. A monthly summary of the epidemiological data on selected Vaccine preventable diseases in the European Region

Gravity Analysis of Regional Economic Interdependence: In case of Japan

WHO EpiData. A monthly summary of the epidemiological data on selected Vaccine preventable diseases in the European Region

WHO EpiData. A monthly summary of the epidemiological data on selected Vaccine preventable diseases in the WHO European Region

NASDAQ OMX Copenhagen A/S. 3 October Jyske Bank meets 9% Core Tier 1 ratio in EU capital exercise

10/27/2015. Content. Well-homogenized national datasets. Difference (national global) BEST (1800) Difference BEST (1911) Difference GHCN & GISS (1911)

Weekly price report on Pig carcass (Class S, E and R) and Piglet prices in the EU. Carcass Class S % + 0.3% % 98.

Title: Testing for Measurement Invariance with Latent Class Analysis. Abstract

Using Mixture Latent Markov Models for Analyzing Change in Longitudinal Data with the New Latent GOLD 5.0 GUI

Modelling and projecting the postponement of childbearing in low-fertility countries

This document is a preview generated by EVS

Chapter 1. International Student Achievement in Reading

The EuCheMS Division Chemistry and the Environment EuCheMS/DCE

Canadian Imports of Honey

Chapter 4 Data with Two Variables

c. {2, 3, 5, 7, 11, 13, 17, 19, 23, 29, 31, 37, 41, 43, 47, 53, 59, 61, 67, 71, 73, 79, 83, 89, 97}. Also {x x is a prime less than 100}.

EUMETSAT. A global operational satellite agency at the heart of Europe. Presentation for the Spanish Industry Day Madrid, 15 March 2012

WHO EpiData. A monthly summary of the epidemiological data on selected Vaccine preventable diseases in the European Region

Quick Guide QUICK GUIDE. Activity 1: Determine the Reaction Rate in the Presence or Absence of an Enzyme

Paul Krugman s New Economic Geography: past, present and future. J.-F. Thisse CORE-UCLouvain (Belgium)

This document is a preview generated by EVS

Logistic Regression. Some slides from Craig Burkett. STA303/STA1002: Methods of Data Analysis II, Summer 2016 Michael Guerzhoy

WHO EpiData. A monthly summary of the epidemiological data on selected vaccine preventable diseases in the European Region

Bayesian Analysis of Latent Variable Models using Mplus

Lecture 2: Intermediate macroeconomics, autumn Lars Calmfors

WHO EpiData. A monthly summary of the epidemiological data on selected Vaccine preventable diseases in the European Region

Chapter 4 Data with Two Variables

Transcription:

Evaluating sensitivity of parameters of interest to measurement invariance using the EPC-interest Department of methodology and statistics, Tilburg University WorkingGroupStructuralEquationModeling26-27.02.2015, FU Berlin

Conclusion This talk discusses the ``EPC-interest'' EPC-interest is like SEM's expected parameter change (``EPC-self'') but instead of measuring the change in the restricted parameter, it measures the change in the parameter of interest; Introduced for measurement invariance evaluation in continuous data SEM by Oberski (2014) and for categorical data by Oberski et al. (frth) Implemented in Latent Gold 5.0, experimental version of, working on main branch.

The problem of measurement invariance

p(y) = ξ J p(ξ x) p(y j ξ) j=1

p(y) = ξ J p(ξ x) p(y j ξ, x) j=1

Measurement invariance: the problem Problem of measurement invariance: we want to know γ, but δ 0 might bias this parameterofinterest.

Preceding solutions to the problem 1. Selecting one indicator as a reference indicator (``anchor item'') apriori; 2. Imposing a strong prior on differential functioning (Muthén&Asparouhov2012) 3. Test null hypothesis of full or partial invariance.

Preceding solutions to the problem 1. Selecting one indicator as a reference indicator (``anchor item'') apriori; + Don't need further restrictions; - No way of testing reference indicator. E.g.: Setting loadings to 1 in each group, ``alignment method''. 2. Imposing a strong prior on differential functioning (Muthén&Asparouhov2012) 3. Test null hypothesis of full or partial invariance.

Preceding solutions to the problem 1. Selecting one indicator as a reference indicator (``anchor item'') apriori; 2. Imposing a strong prior on differential functioning (Muthén&Asparouhov2012) + Attractive data-driven solution when prior is neither too informative nor too weak; - More research needed to figure out when prior is neither too informative nor too weak. 3. Test null hypothesis of full or partial invariance.

Preceding solutions to the problem 1. Selecting one indicator as a reference indicator (``anchor item'') apriori; 2. Imposing a strong prior on differential functioning (Muthén&Asparouhov2012) 3. Test null hypothesis of full or partial invariance. + When high-powered test of full measurement invariance is not rejected, may be safe to simply continue without the need for further modeling; - Rarely happens in practice. + Partial invariance looks at size and significance of δ, but not all big δ's are important nor are small ones necessarily unimportant. So does not guarantee that he parameter of interest in free of measurement differences (Oberski,2014) 4. Sensitivity analysis: allow partial violations when they matter.

Preceding solutions to the problem 1. Selecting one indicator as a reference indicator (``anchor item'') apriori; 2. Imposing a strong prior on differential functioning (Muthén&Asparouhov2012) 3. Test null hypothesis of full or partial invariance. 4. Sensitivity analysis: allow partial violations when they matter.

Measurement invariance: the problem and a solution Problem of measurement invariance: we want to know γ, but δ 0 might bias this parameterofinterest. Solution: Use EPC-interest: the expected change in γ when freeing δ. If EPC-interest is big (e.g. can change sign), incorporate δ; if EPC-interest is small, ignore it.

EPC-interest

EPC-interest, also for categorical data ( ) θ ( ˆδ) EPC-interest = P δ δ = PĤ 1 θθ Ĥθδ EPC-self = ˆγ a ˆγ + O(ψ ψ), where P selects the parameters of interest γ from the parameter vector θ, H is a Hessian, and O(ψ ψ) is an approximation term depending on the overall amount of misspecification (parameter differences).

Simulation: how good is the approximation? Setup: P(Y j = 1 x) = [1 + exp( x)] 1, with j {2, 3, 4}, and structural model x = γz + ϵ with γ = 1 and ϵ N(0, 1). We then introduced a violation of measurement invariance for the first indicator, P(Y 1 = 1 x) = [1 + exp( x δz)] 1. Nine conditions varied sample size, n {250, 500, 1000}, and the size of the invariance violation: δ = 0 (no violation), 0.5 (moderate), or 1 (extreme). Data were generated using R 3.1.2 and analyzed using Latent GOLD 5.0.0.14161.

n 250 500 True δ 0 0.5 1 0 0.5 1 0 Est. ˆγ 1.010 1.151 1.353 0.980 1.152 1.330 1.013 Bias ˆγ -0.010-0.151-0.353 0.020-0.152-0.330-0.013 EPC-int. 0.003-0.166-0.494-0.001-0.180-0.486 0.004 Table : Simulation study of EPC-interest. Shown is the average point estimate for the γ parameter of interest under full measurement invariance (``Est''), its difference from the true value γ = 1 (``Bias''), and the average EPC-interest.

Example with categorical data See

Figure : Graphical representation of the multilevel latent class regression model for (post)materialism measured by three partial ranking tasks. Observed variables are shown in rectangles while unobserved (``latent'') variables are shown in ellipses.

Multilevel latent class model w/ covariates for rankings L(θ) = P(A 1, A 2, B 1, B 2, C 1, C 2 Z 1, Z 2 ) = C n c P(G c ) P(X ic Z 1ic, Z 2ic, G c ) c=1 G i=1 X P(A 1ic, A 2ic X ic )P(B 1ic, B 2ic X ic )P(C 1ic, C 2ic X ic ), Goal: estimate γ (especially its sign). Possibleproblem: Violations of scalar and metric measurement invariance (DIF), parameterized respectively as τ and λ. Solution: See if these matter for the sign of γ.

Table : Full invariance multilevel latent class model: EPC-interest values. λ jkxg Estimates EPC-interest Est. s.e. Task 1 Task 2 Task 3 Class 1 GDP -0.035 (0.007) 0.073 0.252 0.005 Class 2 GDP -0.198 (0.012) -0.163-0.058 0.002 Class 1 Women 0.013 (0.001) -0.003 0.029 0.002 Class 2 Women -0.037 (0.001) -0.006-0.013 0.002 Free ``loadings'' for task 1 and task 2.

Table : Partially invariant multilevel latent class model: EPC-interest values. λ jkxg Estimates EPC-interest Est. s.e. Task 1 Task 2 Task 3 Class 1 GDP -0.127 (0.008) 0.097 Class 2 GDP 0.057 (0.011) 0.161 Class 1 Women 0.008 (0.001) 0.001 Class 2 Women 0.020 (0.001) 0.007

WhathasbeengainedbyusingEPC-interest: I am fairly confident here that there truly is "approximate measurement invariance", in the sense that any violations of measurement invariance do not bias the primary conclusions. I think attaining this goal is the main purpose of model fit evaluation.

Conclusion

Whatistheproblem? We do latent variable modeling with a goal in mind. But the latent variable model might be misspecified. The appropriate question: "will that affect my goal?" The actual question: "do the data fit the model in the population" (LR) or "are the model and the data far apart relative to model complexity" (RMSEA etc.) Whatisthesolution? Evaluatedirectlywhateffectpossiblemisspecifications haveonthegoaloftheanalysis.

References Oberski, D. (2014). Evaluating sensitivity of parameters of interest to measurement invariance in latent variable models. Political Analysis, 22(1):45--60. Oberski, D. and Vermunt, J. (2013). A model-based approach to goodness-of-fit evaluation in item response theory. Measurement: InterdisciplinaryResearch&Perspectives, 11:117--122. Oberski, D., Vermunt, J., and Moors, G. (frth). Evaluating measurement invariance in categorical data latent variable models with the EPC-interest. Vermunt, J. K. and Magidson, J. (2013). TechnicalguideforLatent GOLD 5.0: Basicandadvanced. Statistical Innovations Inc., Belmont, MA.

SEM regression coefficient example Two problems with invariance testing Conclusions Unaffected by misspecification Affected by misspecification Misspecified invariance model fit ``Good'' fit ``Bad'' fit (1) (2) Overparameterization or unnecessarily d i s c a r d e d i t e m, group, or scale. (3) Non-invariance invalidates conclusions. (4)

SEM regression coefficient example European Sociological Review 2008, 24(5), 583--599

SEM regression coefficient example Poland Czech Republic Greece United Kingdom Slovenia Belgium France Portugal Finland Hungary Norway Spain Ireland Germany Netherlands Switzerland Austria Danmark Sweden Poland Czech Republic Greece United Kingdom Slovenia Belgium France Portugal Finland Hungary Norway Spain Ireland Germany Netherlands Switzerland Austria Danmark Sweden Conservation Self transcendence 1.0 0.5 0.0 0.5 1.0 1.0 0.5 0.0 0.5 1.0 Regression coefficient ALLOW NOCOND

SEM regression coefficient example EPC-interest statistics of at least 0.1 in absolute value with respect to the latent variable regression coefficients. Metric invariance (loading) restriction ``Conditions Work skills'' in... Slovenia France Hungary Ireland EPC-interest w.r.t.: Conditions Self-transcendence -0.073-0.092-0.067 0.073 Conservation 0.144 0.139 0.123-0.113 SEPC-self 0.610 0.692 0.759-0.514

SEM regression coefficient example What has been gained by using EPC-interest Full metric invariance model: "close fit"; EPC-interest still detects threats to cross-country comparisons of regression coefficients;

SEM regression coefficient example What has been gained by using EPC-interest Full metric invariance model: "close fit"; EPC-interest still detects threats to cross-country comparisons of regression coefficients; MI and EPC-self do not detect these particular misspecifications;

SEM regression coefficient example What has been gained by using EPC-interest Full metric invariance model: "close fit"; EPC-interest still detects threats to cross-country comparisons of regression coefficients; MI and EPC-self do not detect these particular misspecifications; MI and EPC-self detect other misspecifications;

SEM regression coefficient example What has been gained by using EPC-interest Full metric invariance model: "close fit"; EPC-interest still detects threats to cross-country comparisons of regression coefficients; MI and EPC-self do not detect these particular misspecifications; MI and EPC-self detect other misspecifications; Looking at EPC-interest reveals that these do not affect the cross-country comparisons of regression coefficients.

SEM regression coefficient example What has been gained by using EPC-interest Full metric invariance model: "close fit"; EPC-interest still detects threats to cross-country comparisons of regression coefficients; MI and EPC-self do not detect these particular misspecifications; MI and EPC-self detect other misspecifications; Looking at EPC-interest reveals that these do not affect the cross-country comparisons of regression coefficients.

Example Goal: Estimate gender differences in "valuing Stimulation": (1) Very much like me; (2) Like me; (3) Somewhat like me; (4) A little like me; (5) Not like me; (6) Not like me at all. (S)he looks for adventures and likes to take risks. (S)he wants to have an exciting life. (S)he likes surprises and is always looking for new things to do. He thinks it is important to do lots of different things in life. Tool: Structural Equation Model for European Social Survey data (n = 18519 men and 16740 women). (OriginalstudybySchwarzetal. 2005)

Assume: Butreally(?): What difference does it make for the goal: true gender differences in values? (re-analysisofdatabyoberski2014) Latent mean difference estimate ± 2 s.e. 0.2 0.0 0.2 Men value more Women value more Model Scalar invariance Free intercept 'Adventure' ACPO ST SD HE COTR SE UN BE "Human value" factor

PROBLEM The original authors found that the conditional independence model fit the data "approximately" (p. 1013)... "Chi-squaredeterioratedsignificantly, χ 2 (19) = 3313, p <.001, butcfi didnotchange. Changeinchi-squareis highlysensitivewithlargesamplesizesandcomplex models. Theotherindicessuggestedthatscalarinvariance mightbeaccepted(cfi =.88, RMSEA =.04, CI =.039.040, PCLOSE =1.0).''

PROBLEM The original authors found that the conditional independence model fit the data "approximately" (p. 1013)... "Chi-squaredeterioratedsignificantly, χ 2 (19) = 3313, p <.001, butcfi didnotchange. Changeinchi-squareis highlysensitivewithlargesamplesizesandcomplex models. Theotherindicessuggestedthatscalarinvariance mightbeaccepted(cfi =.88, RMSEA =.04, CI =.039.040, PCLOSE =1.0).''... but unfortunately this "acceptable" misspecification could reversetheirconclusions!

EPC-interest applied to Stimulation example After fitting the full scalar invariance model, Effect size estimate of sex difference in Stimulation is +0.214 (s.e. 0.0139).

EPC-interest applied to Stimulation example After fitting the full scalar invariance model, Effect size estimate of sex difference in Stimulation is +0.214 (s.e. 0.0139). But EPC-interest of equal "Adventure" item intercept is -0.243.

EPC-interest applied to Stimulation example After fitting the full scalar invariance model, Effect size estimate of sex difference in Stimulation is +0.214 (s.e. 0.0139). But EPC-interest of equal "Adventure" item intercept is -0.243. So EPC-interest suggests conclusioncanbereversed by freeing a misspecified scalar invariance restriction

EPC-interest applied to Stimulation example After fitting the full scalar invariance model, Effect size estimate of sex difference in Stimulation is +0.214 (s.e. 0.0139). But EPC-interest of equal "Adventure" item intercept is -0.243. So EPC-interest suggests conclusioncanbereversed by freeing a misspecified scalar invariance restriction Actualchange when freeing this intercept is very close to EPC-interest: -0.235.

EPC-interest applied to Stimulation example After fitting the full scalar invariance model, Effect size estimate of sex difference in Stimulation is +0.214 (s.e. 0.0139). But EPC-interest of equal "Adventure" item intercept is -0.243. So EPC-interest suggests conclusioncanbereversed by freeing a misspecified scalar invariance restriction Actualchange when freeing this intercept is very close to EPC-interest: -0.235.