Inference with Heywood cases

Similar documents
Gramian Matrices in Covariance Structure Models

Confirmatory Factor Analysis: Model comparison, respecification, and more. Psychology 588: Covariance structure and factor models

Improper Solutions in Exploratory Factor Analysis: Causes and Treatments

Misspecification in Nonrecursive SEMs 1. Nonrecursive Latent Variable Models under Misspecification

The Davies Problem: A New Test for Random Slope in the Hierarchical Linear Model

Testing Structural Equation Models: The Effect of Kurtosis

Nesting and Equivalence Testing

Maximum Likelihood Estimation; Robust Maximum Likelihood; Missing Data with Maximum Likelihood

Causal Inference Using Nonnormality Yutaka Kano and Shohei Shimizu 1

Scaled and adjusted restricted tests in. multi-sample analysis of moment structures. Albert Satorra. Universitat Pompeu Fabra.

Confirmatory Factor Analysis. Psych 818 DeShon

Structural Equation Modeling and Confirmatory Factor Analysis. Types of Variables

RANDOM INTERCEPT ITEM FACTOR ANALYSIS. IE Working Paper MK8-102-I 02 / 04 / Alberto Maydeu Olivares

SEM with observed variables: parameterization and identification. Psychology 588: Covariance structure and factor models

Inference using structural equations with latent variables

Longitudinal Modeling with Logistic Regression

Title. Description. Remarks and examples. stata.com. stata.com. Variable notation. methods and formulas for sem Methods and formulas for sem

WHAT IS STRUCTURAL EQUATION MODELING (SEM)?

Chapter 8. Models with Structural and Measurement Components. Overview. Characteristics of SR models. Analysis of SR models. Estimation of SR models

Accounting for Population Uncertainty in Covariance Structure Analysis

Using Estimating Equations for Spatially Correlated A

Review of Classical Least Squares. James L. Powell Department of Economics University of California, Berkeley

Computationally Efficient Estimation of Multilevel High-Dimensional Latent Variable Models

Model Assumptions; Predicting Heterogeneity of Variance

Standard errors in covariance structure models: Asymptotics versus bootstrap

1 Procedures robust to weak instruments

Chapter 5. Introduction to Path Analysis. Overview. Correlation and causation. Specification of path models. Types of path models

Sample size determination for logistic regression: A simulation study

Using Mplus individual residual plots for. diagnostics and model evaluation in SEM

Strati cation in Multivariate Modeling

Some New Aspects of Dose-Response Models with Applications to Multistage Models Having Parameters on the Boundary

Assessing Factorial Invariance in Ordered-Categorical Measures

Bayesian Analysis of Latent Variable Models using Mplus

An Empirical Characteristic Function Approach to Selecting a Transformation to Normality

UNIVERSITY OF CALGARY. The Influence of Model Components and Misspecification Type on the Performance of the

Evaluating Small Sample Approaches for Model Test Statistics in Structural Equation Modeling

Least Absolute Value vs. Least Squares Estimation and Inference Procedures in Regression Models with Asymmetric Error Distributions

Streamlining Missing Data Analysis by Aggregating Multiple Imputations at the Data Level

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.

Bayesian Interpretations of Heteroskedastic Consistent Covariance Estimators Using the Informed Bayesian Bootstrap

EVALUATION OF STRUCTURAL EQUATION MODELS

Chapter 4: Constrained estimators and tests in the multiple linear regression model (Part III)

Factor Analysis & Structural Equation Models. CS185 Human Computer Interaction

1 The Robustness of LISREL Modeling Revisited

Introduction to Confirmatory Factor Analysis

G. S. Maddala Kajal Lahiri. WILEY A John Wiley and Sons, Ltd., Publication

simple if it completely specifies the density of x

Introduction to Structural Equation Modeling

POLI 8501 Introduction to Maximum Likelihood Estimation

SRMR in Mplus. Tihomir Asparouhov and Bengt Muthén. May 2, 2018

Latent variable interactions

sempower Manual Morten Moshagen

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS

Fractional Hot Deck Imputation for Robust Inference Under Item Nonresponse in Survey Sampling

The impact of covariance misspecification in multivariate Gaussian mixtures on estimation and inference

Bayesian Interpretations of Heteroskedastic Consistent Covariance. Estimators Using the Informed Bayesian Bootstrap

INTRODUCTION TO STRUCTURAL EQUATION MODELS

AN EMPIRICAL LIKELIHOOD RATIO TEST FOR NORMALITY

1/24/2008. Review of Statistical Inference. C.1 A Sample of Data. C.2 An Econometric Model. C.4 Estimating the Population Variance and Other Moments

An overview of applied econometrics

Testing Some Covariance Structures under a Growth Curve Model in High Dimension

Robust covariance estimator for small-sample adjustment in the generalized estimating equations: A simulation study

Comparing standardized coefficients in structural equation modeling: a model reparameterization approach

Statistical Models with Uncertain Error Parameters (G. Cowan, arxiv: )

Modification and Improvement of Empirical Likelihood for Missing Response Problem

Robustness and Distribution Assumptions

Comparison between conditional and marginal maximum likelihood for a class of item response models

Inference about the Indirect Effect: a Likelihood Approach

Robust Means Modeling vs Traditional Robust Tests 1

Testing structural equation models: the effect of kurtosis. Tron Foss BI Norwegian Business School. Karl G. Jøreskog BI Norwegian Business School

On the econometrics of the Koyck model

Evaluation of structural equation models. Hans Baumgartner Penn State University

Partial Identification and Confidence Intervals

A Practitioner s Guide to Cluster-Robust Inference

Introduction to Structural Equation Modeling Dominique Zephyr Applied Statistics Lab

Simple Linear Regression

SC705: Advanced Statistics Instructor: Natasha Sarkisian Class notes: Introduction to Structural Equation Modeling (SEM)

Testing Restrictions and Comparing Models

IV Estimation and its Limitations: Weak Instruments and Weakly Endogeneous Regressors

An Introduction to Causal Mediation Analysis. Xu Qin University of Chicago Presented at the Central Iowa R User Group Meetup Aug 10, 2016

Likelihood-Based Methods

Introduction to Random Effects of Time and Model Estimation

Fit Indices Versus Test Statistics

Confidence intervals for the variance component of random-effects linear models

Applied Statistics and Econometrics

General structural model Part 1: Covariance structure and identification. Psychology 588: Covariance structure and factor models

Reconciling factor-based and composite-based approaches to structural equation modeling

Model fit evaluation in multilevel structural equation models

A Course in Applied Econometrics Lecture 14: Control Functions and Related Methods. Jeff Wooldridge IRP Lectures, UW Madison, August 2008

A note on profile likelihood for exponential tilt mixture models

A Cautionary Note on the Use of LISREL s Automatic Start Values in Confirmatory Factor Analysis Studies R. L. Brown University of Wisconsin

Lawrence D. Brown* and Daniel McCarthy*

Estimating σ 2. We can do simple prediction of Y and estimation of the mean of Y at any value of X.

STRUCTURAL EQUATION MODELING. Khaled Bedair Statistics Department Virginia Tech LISA, Summer 2013

Central Bank of Chile October 29-31, 2013 Bruce Hansen (University of Wisconsin) Structural Breaks October 29-31, / 91. Bruce E.

STATS 200: Introduction to Statistical Inference. Lecture 29: Course review

You can compute the maximum likelihood estimate for the correlation

Quantile regression and heteroskedasticity

MA/ST 810 Mathematical-Statistical Modeling and Analysis of Complex Systems

Running head: TESTING INEQUALITY CONSTRAINED HYPOTHESES. Testing Inequality Constrained Hypotheses in SEM Models

Transcription:

Inference with Joint work with Kenneth Bollen (UNC) and Victoria Savalei (UBC) NSF support from SES-0617193 with funds from SSA September 18, 2009

What is a case? (1931) considered characterization of the correlation matrices: 1 r 1 r 2 r 1 r 3... r 1 r n r 1 r 2 1 r 2 r 3... r 2 r n......., r 1 r 2... r n 0 r 1 r n r 2 r n r 3 r n... 1 He showed that for this scheme r 2 1 1 + 1 r 2 2 1 r 2 2 + r 2 3 1 r 2 3 + r 2 n 1 r 2 n and hence r 1 can be greater than 1, which was an extension on the earlier results by Spearman and Garnett.

What is a case? analysis literature of early 1960s: correlations greater than 1? Modern SEM usage: improper solutions, correlation estimates greater than 1, variance estimates less than 0 The current project deals mostly with variance estimates. Indicator variances variances

Outliers (Bollen 1987). Causes of Non-convergence and underidentification (Van Driel 1978, Boomsma & Hoogland 2001). Empirical underidentification (Rindskopf 1984). Structurally misspecified models (Van Driel 1978, Dillon, Kumar & Mulani 1987, Sato 1987, Bollen 1989, & Bollen 2008). Sampling fluctuations (Van Driel 1978, Boomsma 1983, Anderson & Gerbing 1984).

Inference problem: H + : σ 2 > 0 H : σ 2 = 0 H : σ 2 < 0 Typology of Normal situation Perfect indicator (!?) or no factor present Misspecified model Sample estimate Population value ˆσ 2 > 0 ˆσ 2 < 0 σ 2 > 0?? σ 2 = 0?? σ 2 < 0 Type I error Misspecification detected!

Savalei & (2008) Breakdown of the papers case with an error variance. True situation is H. Should we consider H + or H + H as an alternative? & Bollen (2008) case with an error variance. True situation is H ; the model is structurally misspecified. Does it cause problems for inference? Work in progress with Vika case with an factor variance. True situation is H, and there are even more serious regularity condition violations. Does it cause problems for inference?

Is that really a variance? 1 σ 11 = λ 2 1 + θ 1 σ 12 = λ 1 λ 2 ξ 1 σ 13 = λ 1 λ 3 σ 22 = σ 23 = λ 2 2 + θ 2 λ 2 λ 3 σ 33 = λ 2 3 + θ 3 y 1 y 2 y 3 δ 1 δ 2 δ 3

Is that really a variance? 1 ˆλ 1 = s 12 s 13 /s 23 ˆλ 2 = s 12 s 23 /s 13 ξ 1 ˆλ 3 = s 13 s 23 /s 12 ˆθ 1 = s 11 s 12 s 13 /s 23 ˆθ 2 = s 22 s 12 s 23 /s 13 ˆθ 3 = s 33 s 13 s 23 /s 12 y 1 y 2 y 3 δ 1 δ 2 δ 3 Questions?

Savalei and (2008) Savalei, V. and, S. (2008), Constrained vs. unconstrained estimation in structural equation modeling, Psychological Methods 13, 150 170. Research question: if the truth is H, should we test against H + or against H H +?

Constrained estimation Irregular problem: the standard regularity condition of interior point is violated. Chernoff (1954): test with normal data of µ = 0 vs. µ > 0. Shapiro (1985): geometry of constrained parameter spaces in SEM. Stram & Lee (1994): variance components in mixed models. Jamshidian & Bentler (1994): algorithms for constrained estimation for SEM. Andrews (1999, 2001) established the most general results. Stoel, Garre, Dolan & van den Wittenboer (2006): computational procedure for testing H vs. H +.

Constrained estimation If there are k parameters and r boundaries present, the (asymptotic) distribution is T k j=k r w j χ 2 j, where k j=k r w j = 1 with weights w j that depend on information matrix, i.e., covariances between parameters.

Constrained estimation Chernoff (1954): X N(µ, 1); distribution of the test H vs. H + is 1 2 χ2 0 + 1 2 χ2 1 Density 0.0 0.2 0.4 0.6 0.8 1.0 x -3-2 -1 0 1 2 3 x

Constrained estimation Chernoff (1954): X N(µ, 1); distribution of the test H vs. H + is 1 2 χ2 0 + 1 2 χ2 1 Density 0.0 0.2 0.4 0.6 0.8 1.0 x μ^ -3-2 -1 0 1 2 3 x

Constrained estimation Chernoff (1954): X N(µ, 1); distribution of the test H vs. H + is 1 2 χ2 0 + 1 2 χ2 1 Density 0.0 0.2 0.4 0.6 0.8 1.0 x -3-2 -1 0 1 2 3 x

Constrained estimation Chernoff (1954): X N(µ, 1); distribution of the test H vs. H + is 1 2 χ2 0 + 1 2 χ2 1 Density 0.0 0.2 0.4 0.6 0.8 1.0 x μ^ -3-2 -1 0 1 2 3 x

Constrained estimation Chernoff (1954): X N(µ, 1); distribution of the test H vs. H + is 1 2 χ2 0 + 1 2 χ2 1 cdf 0.0 0.2 0.4 0.6 0.8 1.0-1 0 1 2 3 4 x

Savalei & (2008): Constrained estimation: cons Mixtures arise only when a combination of conditions about true parameters (unknown to the researcher) and estimation procedures (constrained estimation) occur together. Test of overall fit has distribution which is impossible to characterize (due to unknowns). Conservative upper bound: χ 2 k. Constrained estimation is only internally consistent when all the procedures are laid out ahead rather than followed the data! Ad hoc procedure of resetting to zero: effectively equivalent to constrained estimation, but type I error is likely inflated. Software implementation: explicit constraints are required; a far more difficult optimization procedure. Non-normal data and not asymptotically robust situations: just forget it.

Conditional approach Dijkstra (1992): start with the first stage estimation; if occur: 1 restrict parameter(s); 2 release degree(s) of freedom; 3 re-estimate the model and report p-value only. The test statistic itself is not meaningful. In this conditional approach, the unconditional distribution is again the mixture. It also leads to somewhat improved finite sample approximation when the parameter is near the boundary. Implementation: SAS PROC CALIS

Savalei & (2008): Unconstrained estimation Overall fit test has a known distribution, same for all points in parameter space! Decomposition of χ 2 into the fit and effect of the boundary components. Easier to implement than constrained estimation. More informative about sources of misfit. Provides power against a broader range of alternatives.

Software implementation Savalei & (2008) compared AMOS 7.0, EQS 6.1, LISREL 8.8, Mplus 4.2, and SAS PROC CALIS in SAS 9.1. Unconstrained estimation is default in all software packages except EQS. Release one d.f. for conditional inference: only in SAS PROC CALIS. Constrained Wald test: no software does that. Numerical and conceptual discrepancies between packages. Convergence diagnostic/missing s.e. problems with LISREL. Should the standard error on the constrained parameter be zero? correlations > 1: parameterization may matter.

Phantom variables 158 SAVALEI AND KOLENIKOV Rindskopf (1983): device to impose inequality constraints. Y1 1-7 I 1 Y2 1 Figure 4. A parameterization of a standard growth curve model to create an implicit inequality constraint on the variance of the slope V[S] = ψ factor. Variance of V is S = b 2 fixed to 1. I intercept factor; S slope factor; V phantom latent variable; Y1 Y4 observed variables. the issue that is somewhat concealed when the focus is on mixtures: that in order to gain power on one tail of the distribution we must explicitly ignore the other tail. Constrained and Unconstrained Estimation in S 3 Y3 1 b 13 V Y4 gives the right solution (Dijkstra samples and with larger models, th behave much worse and have a w local maxima; at what point this i not clear (Bentler & Tanaka, 198 Loken, 2005; Rubin & Thayer, 198 vantage of the ad hoc approach is t to see the inadmissible estimates a This makes it difficult to disregard is precisely what is required to ca mation correctly, as we discuss lat The approach of reparameteriz Rindskopf (1983, 1984) as a wa constraints at the time when SEM s allow for them (see also Bentler, 1 He suggested, for example, that to from going negative, the variances fixed to 1 and their regression coeffi Figure 4 illustrates how a standard be reparameterized to avoid negati ance of the slope factor by the in variable V (Rindskopf, 1983, 1984 have that var(s) b 2, and the est always nonnegative, regardless of t When ψ S = 0, b = 0, and l/ ψ S = 2b l/ b = 0, so information matrix is not invertible! Phantom variables substitute one irregularity for another.

Recommended procedures Commit to restricting the estimates to be in their proper range; use conditional approach to overall fit test and mixture distribution for χ 2 difference tests. Leave the choice to the software and don t intervene (EQS: constrained estimation) Allow inadmissible solutions and test them for significance. Use Wald tests, confidence intervals or χ 2 difference tests that all have their regular distributions. Questions?

and Bollen (2008), S. and Bollen, K. A. (2008), Testing negative error variances: Is a case a symptom of misspecification? Under review. Research question: testing specification, as expressed via H H + against H

Data generating process Population case 0.3 y 2 ζ 2 1 y 1 0.8 1 0.55 y 3 ζ 3 1 Jump to simulations

CFA model fit Population case 0.229 ξ 1 1 1.313 3.457 y 1 y 2 y 3 δ 1 δ 2 δ 3 0.771 0.696 0.467 Jump to simulations

CFA with restricted variance Population case 0.275 1 ξ 1 1.313 2.866 y 1 y 2 y 3 δ 1 δ 2 δ 3 0.724 0.615 0 Jump to simulations

Misspecified models Distributional misspecification: the distribution of the data is not normal, likelihood methods are not fully applicable (Satorra 1990, Satorra & Bentler 1994). Structural misspecification: the structure of the model, the number of latent variables, the relations between the variables in the model are not specified correctly (Yuan, Marshall & Bentler 2003). case: impossible value in the population; evidence of structural misspecification (or what??) If the model is misspecified, doesn t everything just fall apart??

Misspecified models Huber (1967) Point estimates are consistent for the minimizer of the population fit function: arg min F (S, Σ(θ)) arg min F (Σ, Σ(θ)) 0, F(Ω, Σ(θ)) = ln Σ(θ) + Σ(θ) 1 Ω ln S dim Σ(θ) = as.eq. = (ω σ(θ)) V (ω σ(θ)), ω = vech Ω, σ(θ) = vech Σ(θ) Uncertainty about these estimates is given by an asymptotic covariance matrix (sandwich estimator): n(ˆθn θ 0 ) d N ( 0, A 1 BA T ), A = E ψ(x, θ 0 ), B = E ψ(x, θ 0 )ψ(x, θ 0 ) T, ψ(x, θ 0 ) = σ(θ) V (s σ(θ))

Special : Misspecified models Eicker (1967) and White (1980), linear regression with heteroskedastic errors: ˆβ = (X X) 1 X y, ˆv( ˆβ) = (X X) 1 (X ee X)(X X) 1 Browne (1974): least squares estimator for SEM. White (1982): somewhat milder regularity conditions that are easier to check in practice; application to some common econometric models. Arminger & Schoenberg (1989): an econometrics paper in Psychometrika. Satorra & Bentler (1994): model based sandwich estimator with explicit expression for matrix B (the meat of the sandwich) as a function of the model parameters and the fourth order moments of data. Yuan & Hayashi (2006): comparison of empirical sandwich and the bootstrap standard errors for SEM.

Can standard errors be trusted? Distributional Structural specification specification Correct Incorrect Correct I, S-B, ES, EB, BSB I, ES, EB Incorrect S-B, ES, EB, BSB ES, EB Analytic standard errors: I observed or expected information matrix S-B Satorra-Bentler standard errors ES empirical (Huber) sandwich Resampling standard errors: EB empirical bootstrap BSB Bollen-Stine bootstrap with data rotation

Likelihood ratio type tests Overall fit Tests of H H + vs. H χ 2 -difference tests (from θ k = 0) Signed root tests for simple χ 2 difference and scaled Satorra & Bentler (2001) difference r(θ 0 ) = sign[ˆθ θ 0 ] T Wald type tests, using Information matrix standard errors Satorra-Bentler standard errors Huber empirical sandwich standard errors

Simulation study Saturated model with 3 variables and 6 parameters: χ 2 0 0, no way to test the model fit... unless you hit a case! Jump to the case in population example Main results: HUGE biases of the case in small samples. Information matrix standard errors are biased leading to undercoverage of CIs based on them when the data are non-normal.

Simulation study Larger model with 4 variables, 8 parameters and 2 d.f. for model fit test. Data generating process 0.8 y 1 1 0.6 y 3 0.4 y 4 0.1 y 2 0.1 ζ 3 1 ζ 4 1 ζ 2 1

Simulation study Larger model with 4 variables, 8 parameters and 2 d.f. for model fit test. CFA model fit 1.195 ξ 1 1 0.345 0.633 0.464 y 1 y 2 y 3 y 4 δ 1 δ 2 δ 3 δ 4 0.195 1.018 1.109 1.066

Simulation study Larger model with 4 variables, 8 parameters and 2 d.f. for model fit test. CFA with restricted variance 1 ξ 1 1 0.4 0.76 0.56 y 1 y 2 y 3 y 4 δ 1 δ 2 δ 3 δ 4 0 1 1.01 1.01

Simulation study Larger model with 4 variables, 8 parameters and 2 d.f. for model fit test. Main results: HUGE biases of the case in small samples. Information matrix standard errors are biased leading to undercoverage of CIs based on them when the data are non-normal. Satorra-Bentler standard errors are biased down leading to undercoverage even when the data are normal! Huber empirical sandwich standard errors are asymptotically accurate. Empirical bootstrap standard errors are even more accurate in smaller samples, and better centered to the true variability. Signed root of Satorra-Bentler scaled difference is the most accurate test in terms of the size, and has the greatest power among accurate tests!

Zero factor variances Koleinkov and Savalei: work in progress. Question: testing factor variances, as opposed to error variances. 1 1 1 2 x1 x2 x3 x4 x5 x6 x7 x8 1 1 Method 1 Method2

Zero factor variances Koleinkov and Savalei: work in progress. Question: testing factor variances, as opposed to error variances. Complications: If a factor variance is zero, what happens to its covariances with other factors and the loadings of observed variables? How many parameters are we actually testing? What are degrees of freedom?

Zero factor variances Technically speaking, zero factor variances lead to underidentified models. φ k = 0 Corr(ξ k, ξ j ) is underid, λ lk is underid Technical research into underidentified models (Davies 1977, Davies 1987, Andrews & Ploberger 1994, Hansen 1996): Asymptotic distribution still exists! It is characterized by max χ 2 (θ) over a range of θ Analytical work is very limited Simulation approach seem attractive

Simulation study Data generation: 1 factor multivariate normal model, 6 variables. Fitted models: extra one or two factors, loading on half of the variables each. 1 x1 x2 x3 x4 x5 x6 1 1 Method 1 Method 2 Numeric stability issues: hundreds of maximization iterations; ridge-like likelihoods; absurd parameter estimates and standard errors.

Test statistic distribution One extraneous factor, free loadings, N = 2000 0.2.4.6.8 1 χ 2(1) χ 2(2) χ 2(3) LR test 0 2 4 6 8 10 x Kolmogorov-Smirnov test vs. χ 2 3 : p-value = 0.022.

Test statistic distribution One extraneous factor, fixed loadings, N = 2000 0.2.4.6.8 1 χ 2(1) χ 2(2) χ 2(3) LR test 0 2 4 6 8 10 x Kolmogorov-Smirnov test vs. χ 2 1 : p-value = 0.115.

Test statistic distribution Two extraneous factors, free loadings, N = 2000 0.05.1.15 χ 2(6) χ 2(7) χ 2(8) LR test 0 5 10 15 20 x Kolmogorov-Smirnov test vs. χ 2 8 : p-value = 0.047.

I Anderson, J. C. & Gerbing, D. (1984), The effect of sampling error on convergence, improper solutions, and goodness-of-fit indices for maximum likelihood confirmatory factor analysis, Psychometrika 49, 155 173. Andrews, D. W. K. (1999), Estimation when a parameter is on a boundary, Econometrica 67(6), 1341 1383. Andrews, D. W. K. (2001), Testing when a parameter is on the boundary of the maintained hypothesis, Econometrica 69(3), 683 734. doi:10.1111/1468-0262.00210. Andrews, D. W. K. & Ploberger, W. (1994), Optimal tests when a nuisance parameter is present only under the alternative, Econometrica 62(6), 1383 1414. Arminger, G. & Schoenberg, R. J. (1989), Pseudo maximum likelihood estimation and a test for misspecification in mean and covariance structure models, Psychometrika 54, 409 426. Bollen, K. A. (1987), Outliers and improper solutions: A confirmatory factor analysis example, Sociological Methods and Research 15, 375 384.

II Bollen, K. A. (1989), Structural Equations with Latent Variables, Wiley, New York. Boomsma, A. (1983), On the Robustness of LISREL (Maximum Likelihood Estimation) Against Small Sample Size and Nonnormality, Sociometric Research Foundation, Amsterdam, the Netherlands. Boomsma, A. & Hoogland, J. J. (2001), The robustness of LISREL modeling revisited, Structural Equation Modeling: Present and Future. Browne, M. W. (1974), Generalized least squares estimators in the analysis of covariances structures, South African Statistical Journal 8, 1 24. Chernoff, H. (1954), On the distribution of the likelihood ratio, The Annals of Mathematical Statistics 25(3), 573 578. Davies, R. B. (1977), Hypothesis testing when a nuisance parameter is present only under the alternative, Biometrika 64(2), 247 254. Davies, R. B. (1987), Hypothesis testing when a nuisance parameter is present only under the alternatives, Biometrika 74(1), 33 43.

III Dijkstra, T. K. (1992), On statistical inference with parameter estimates on the boundary of the parameter space, British Journal of Mathematical and Statistical Psychology 45, 289 309. Dillon, W. R., Kumar, A. & Mulani, N. (1987), Offending estimates in covariance structure analysis: Comments on the causes and solutions to, Psychological Bulletin 101, 126 135. Eicker, F. (1967), Limit theorems for regressions with unequal and dependent errors, in Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, Vol. 1, University of California Press, Berkeley, pp. 59 82. Hansen, B. E. (1996), Inference when a nuisance parameter is not identified under the null hypothesis, Econometrica 64(2), 413 430., H. B. (1931), On finite sequences of real numbers, Proceedings of the Royal Society of London. Series A, Containing Papers of a Mathematical and Physical Character 134(824), 486 501.

IV Huber, P. (1967), The behavior of the maximum likelihood estimates under nonstandard conditions, in Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, Vol. 1, University of California Press, Berkeley, pp. 221 233. Jamshidian, M. & Bentler, P. M. (1994), Gramian matrices in covariance structure models, Applied Psychological Measurement 18(1), 79 94., S. & Bollen, K. A. (2008), Testing negative error variances: Is a case a symptom of misspecification? under review in Sociological Methods and Research. Rindskopf, D. (1983), Parameterizing inequality constraints on unique variances in linear structural models, Psychometrika 48(1), 73 83. Rindskopf, D. (1984), Structural equation models: Empirical identification,, and related problems, Sociological Methods and Research 13, 109 119. Sato, M. (1987), Pragmatic treatment of improper solutions in factor analysis, Annals of the Institute of Statistics and Mathematics, part B 39, 443 455.

V Satorra, A. (1990), Robustness issues in structural equation modeling: A review of recent developments, Quality and Quantity 24, 367 386. Satorra, A. & Bentler, P. (2001), A scaled difference chi-square test statistic for moment structure analysis, Psychometrika 66(4), 507 514. Satorra, A. & Bentler, P. M. (1994), Corrections to test statistics and standard errors in covariance structure analysis, in A. von Eye & C. C. Clogg, eds, Latent variables analysis, Sage, Thousands Oaks, CA, pp. 399 419. Savalei, V. &, S. (2008), Constrained vs. unconstrained estimation in structural equation modeling, Psychological Methods 13, 150 170. Shapiro, A. (1985), Asymptotic distribution of test statistic in the analysis of moment structures under inequality constraints, Biometrika 72(1), 133 144.

VI Stoel, R. D., Garre, F. G., Dolan, C. & van den Wittenboer, G. (2006), On the likelihood ratio test in structural equation modeling when parameters are subject to boundary constraints, Psychological Methods 11(4). Stram, D. O. & Lee, J. W. (1994), Variance components testing in the longitudinal mixed effects model, Biometrics 50(4), 1171 1177. Van Driel, O. P. (1978), On various causes of improper solutions in maximum likelihood factor analysis, Psychometrika 43, 225 43. White, H. (1980), A heteroskedasticity-consistent covariance-matrix estimator and a direct test for heteroskedasticity, Econometrica 48(4), 817 838. White, H. (1982), Maximum likelihood estimation of misspecified models, Econometrica 50(1), 1 26. Yuan, K.-H. & Hayashi, K. (2006), Standard errors in covariance structure models: Asymptotics versus bootstrap, British Journal of Mathematical and Statistical Psychology 59, 397 417.

VII Yuan, K. H., Marshall, L. L. & Bentler, P. M. (2003), Assessing the effect of model misspecifications on parameter estimates in structural equation models, Sociological Methodology 33(1), 241 265.