The In-and-Out-of-Sample (IOS) Likelihood Ratio Test for Model Misspecification p.1/27

Similar documents
The In-and-Out-of-Sample (IOS) Likelihood Ratio Test for Model Misspecification

Generalized Linear Models (GLZ)

Model Selection for Semiparametric Bayesian Models with Application to Overdispersion

Stat 5101 Lecture Notes

unadjusted model for baseline cholesterol 22:31 Monday, April 19,

Subject CS1 Actuarial Statistics 1 Core Principles

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.

Goodness-of-fit tests for randomly censored Weibull distributions with estimated parameters

Review. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 770: Categorical Data Analysis

STATS 200: Introduction to Statistical Inference. Lecture 29: Course review

Generalized Linear Models. Last time: Background & motivation for moving beyond linear

HANDBOOK OF APPLICABLE MATHEMATICS

Chapter 4: Generalized Linear Models-II

ST3241 Categorical Data Analysis I Generalized Linear Models. Introduction and Some Examples

Model Checking and Improvement

Parametric Modelling of Over-dispersed Count Data. Part III / MMath (Applied Statistics) 1

A NOTE ON ROBUST ESTIMATION IN LOGISTIC REGRESSION MODEL

Generalized Linear Models 1

Review: what is a linear model. Y = β 0 + β 1 X 1 + β 2 X 2 + A model of the following form:

Class Notes: Week 8. Probit versus Logit Link Functions and Count Data

STATISTICS ( CODE NO. 08 ) PAPER I PART - I

UNIVERSITY OF TORONTO. Faculty of Arts and Science APRIL 2010 EXAMINATIONS STA 303 H1S / STA 1002 HS. Duration - 3 hours. Aids Allowed: Calculator

Generalized Linear Models

Linear Regression Models P8111

Chapter 1 Statistical Inference

Generalized Linear Models I

Semiparametric Generalized Linear Models

Generalized Linear Models for Non-Normal Data

5.3 Three-Stage Nested Design Example

POWER AND TYPE I ERROR RATE COMPARISON OF MULTIVARIATE ANALYSIS OF VARIANCE

STA 216: GENERALIZED LINEAR MODELS. Lecture 1. Review and Introduction. Much of statistics is based on the assumption that random

Lecture 2: Linear Models. Bruce Walsh lecture notes Seattle SISG -Mixed Model Course version 23 June 2011

Model Estimation Example

Prediction of Bike Rental using Model Reuse Strategy

LISA Short Course Series Generalized Linear Models (GLMs) & Categorical Data Analysis (CDA) in R. Liang (Sally) Shan Nov. 4, 2014

Module 6: Model Diagnostics

Open Problems in Mixed Models

LOGISTIC REGRESSION Joseph M. Hilbe

COMPREHENSIVE WRITTEN EXAMINATION, PAPER III FRIDAY AUGUST 26, 2005, 9:00 A.M. 1:00 P.M. STATISTICS 174 QUESTION

The Adequate Bootstrap

STA 303 H1S / 1002 HS Winter 2011 Test March 7, ab 1cde 2abcde 2fghij 3

Generalized Linear. Mixed Models. Methods and Applications. Modern Concepts, Walter W. Stroup. Texts in Statistical Science.

Ronald Christensen. University of New Mexico. Albuquerque, New Mexico. Wesley Johnson. University of California, Irvine. Irvine, California

EPSY 905: Fundamentals of Multivariate Modeling Online Lecture #7

Contents. Preface to Second Edition Preface to First Edition Abbreviations PART I PRINCIPLES OF STATISTICAL THINKING AND ANALYSIS 1

Non-maximum likelihood estimation and statistical inference for linear and nonlinear mixed models

Estimators for the binomial distribution that dominate the MLE in terms of Kullback Leibler risk

Multinomial Logistic Regression Models

Biost 518 Applied Biostatistics II. Purpose of Statistics. First Stage of Scientific Investigation. Further Stages of Scientific Investigation

General Regression Model

STAT 501 Assignment 2 NAME Spring Chapter 5, and Sections in Johnson & Wichern.

Introduction to Statistical Analysis

Sections 4.1, 4.2, 4.3

Exam Applied Statistical Regression. Good Luck!

You can compute the maximum likelihood estimate for the correlation

Generalized Models: Part 1

Recall the Basics of Hypothesis Testing

Testing Goodness Of Fit Of The Geometric Distribution: An Application To Human Fecundability Data

MODEL SELECTION BASED ON QUASI-LIKELIHOOD WITH APPLICATION TO OVERDISPERSED DATA

STATISTICS SYLLABUS UNIT I

Logistic Regression. James H. Steiger. Department of Psychology and Human Development Vanderbilt University

A Course in Applied Econometrics Lecture 14: Control Functions and Related Methods. Jeff Wooldridge IRP Lectures, UW Madison, August 2008

A Course in Applied Econometrics Lecture 18: Missing Data. Jeff Wooldridge IRP Lectures, UW Madison, August Linear model with IVs: y i x i u i,

Stat 5102 Final Exam May 14, 2015

Generalized linear models

Poisson regression: Further topics

ESP 178 Applied Research Methods. 2/23: Quantitative Analysis

11. Generalized Linear Models: An Introduction

SAS Procedures Inference about the Line ffl model statement in proc reg has many options ffl To construct confidence intervals use alpha=, clm, cli, c

Stat/F&W Ecol/Hort 572 Review Points Ané, Spring 2010

Investigating Models with Two or Three Categories

Hypothesis testing, part 2. With some material from Howard Seltman, Blase Ur, Bilge Mutlu, Vibha Sazawal

Parametric Inference Maximum Likelihood Inference Exponential Families Expectation Maximization (EM) Bayesian Inference Statistical Decison Theory

Model Fitting and Model Selection. Model Fitting in Astronomy. Model Selection in Astronomy.

Investigation of goodness-of-fit test statistic distributions by random censored samples

Generalized Estimating Equations

Spline Density Estimation and Inference with Model-Based Penalities

ˆπ(x) = exp(ˆα + ˆβ T x) 1 + exp(ˆα + ˆβ T.

SCHOOL OF MATHEMATICS AND STATISTICS. Linear and Generalised Linear Models

Generalized linear models

Akaike Information Criterion

36-463/663: Multilevel & Hierarchical Models

1. Hypothesis testing through analysis of deviance. 3. Model & variable selection - stepwise aproaches

The Behaviour of the Akaike Information Criterion when Applied to Non-nested Sequences of Models

Outline of GLMs. Definitions

Categorical data analysis Chapter 5

On Properties of QIC in Generalized. Estimating Equations. Shinpei Imori

Introduction to Generalized Models

COMPLEMENTARY LOG-LOG MODEL

Estimating prediction error in mixed models

Variability within multi-component systems. Bayesian inference in probabilistic risk assessment The current state of the art

Hierarchical Generalized Linear Models. ERSH 8990 REMS Seminar on HLM Last Lecture!

Answer Keys to Homework#10

STAT 7030: Categorical Data Analysis

Bayesian Model Diagnostics and Checking

Statistics 203: Introduction to Regression and Analysis of Variance Course review

Discrete Dependent Variable Models

Large Sample Properties of Estimators in the Classical Linear Regression Model

Using Estimating Equations for Spatially Correlated A

REVISED PAGE PROOFS. Logistic Regression. Basic Ideas. Fundamental Data Analysis. bsa350

Transcription:

The In-and-Out-of-Sample (IOS) Likelihood Ratio Test for Model Misspecification Brett Presnell Dennis Boos Department of Statistics University of Florida and Department of Statistics North Carolina State University The In-and-Out-of-Sample (IOS) Likelihood Ratio Test for Model Misspecification p.1/27

The Problem Question: Given data Y 1,..., Y n, independent, with predictors x 1,..., x n, and a parametric model f(y i ; x i, θ) for the density of Y i, with θ a p-vector of unknown model parameters, how do we test for model misspecification? Answer: It depends. Are the data i.i.d.? (no x i s) Are there replications? Are the Y i s continuous or categorical? Univariate or multivariate? Is f of some specific form that I have a test for?... The In-and-Out-of-Sample (IOS) Likelihood Ratio Test for Model Misspecification p.2/27

Some Answers Kolmogorov-Smirnov, Cramér-von Mises, Anderson-Darling, Shapiro-Wilks, Lillefors,... Pearson chi-sq, deviance, power divergence,... Mardia s skewness and kurtosis tests,... Find (or contrive) a bigger model (but we are NOT trying to do variable selection here). However, there is often no obvious answer, esp. if we have x i s and no replication. Might try: Graphical methods. Information matrix test? The In-and-Out-of-Sample (IOS) Likelihood Ratio Test for Model Misspecification p.3/27

Motivation θ = MLE of θ. θ (i) = MLE of θ if ith obs. deleted from sample. f(y i ; x i, θ (i) ) measures how well the model predicts the ith observation. Compare to f(y i ; x i, θ ). f(y i ; x i, θ (i) ) f(y i ; x i, θ ) always: n l j ( θ ) l i ( θ ) n l j ( θ (i) ) l i ( θ (i) ) n l j ( θ ) l i ( θ (i) ) j=1 j=1 j=1 If f(y i ; x i, θ (i) ) << f(y i ; x i, θ ), then fitted model must shift appreciably to accomodate ith obs. The In-and-Out-of-Sample (IOS) Likelihood Ratio Test for Model Misspecification p.4/27

Compare the in-sample and out-of-sample likelihoods as a global measure of lack-of-fit: IOS = log ( n i=1 f(y i; x i, θ) ) n i=1 f(y i; x i, θ (i) ) An Idea = n { l(yi ; x i, θ) l(y i ; x i, θ (i) ) }, i=1 The In-and-Out-of-Sample (IOS) Likelihood Ratio Test for Model Misspecification p.5/27

Connections The cross-validated (out-of-sample) log-likelihood (Stone, 1977) or Bayesian analogues (Geisser and Eddy, 1979) are popular in model selection. Geisser (1990) uses a Bayesian s version of the individual terms of IOS to test for outliers (discordancy). Similar motivation for Cook s distance. Asymptotic form of IOS related to information matrix test (White, 1982). In simple examples, IOS approximates well known, intuitive test statistics. The In-and-Out-of-Sample (IOS) Likelihood Ratio Test for Model Misspecification p.6/27

For model X 1,..., X n i.i.d. Poisson(λ), IOS s2 Example: IID Poisson Obvious comparison of moments (Fisher, 1973). IOS P 1 if model correctly specified. Asymptotically normal. Y The In-and-Out-of-Sample (IOS) Likelihood Ratio Test for Model Misspecification p.7/27

For model X 1,..., X n i.i.d.exp(µ), Example: IID Exponential IOS s2 Y 2. Obvious comparison of moments again. IOS P 1 if model correctly specified. Asymptotically normal. The In-and-Out-of-Sample (IOS) Likelihood Ratio Test for Model Misspecification p.8/27

For model X 1,..., X n i.i.d.n(µ, σ 2 ), IOS 1 ( ) µ4 2 σ 4 + 1, where µ 4 is sample 4th central moment. IOS is approximately a kurtosis test. IOS P 2 if model correctly specified. Asymptotically normal. Example: IID Normal The In-and-Out-of-Sample (IOS) Likelihood Ratio Test for Model Misspecification p.9/27

Example: Normal Regression For model Y i indep N(x T i β, σ2 ), let β be the LSE. Then IOS n i=1 h iê 2 i σ 2 + µ 4 2 σ 4 1 2, where h i = x T i (XT X) 1 x i, the leverage of the ith obs., e i = Y i x T i β, σ 2 = n 1 n i=1 e 2 i, µ 4 = n 1 n i=1 e 4 i. Looks for heterogeneity of variance and kurtosis. IOS P p = dim(β) + 1 if model correctly specified. The In-and-Out-of-Sample (IOS) Likelihood Ratio Test for Model Misspecification p.10/27

Approximate Form For theory, regularity conditions are similar to those for consistency and asymptotic normality of MLEs. Details are provided for the i.i.d. case. IOS IOS A = o p (n 1/2 ), where (i.i.d. case) IOS A = 1 n with n i=1 l(y i ; θ) T Â( θ) 1 l(yi ; θ) = tr { Â( θ) 1 B( θ) }, Â( θ) = n 1 n { l(y i ; θ)}, B( θ) = n 1 n l(y i ; θ) l(y i ; θ) T i=1 i=1 The In-and-Out-of-Sample (IOS) Likelihood Ratio Test for Model Misspecification p.11/27

IOS P IOS = E { l(y1 ; θ 0 ) T A(θ 0 ) 1 l(y1 ; θ 0 ) } Some Theory where = tr { A(θ 0 ) 1 B(θ 0 ) } H = 0 p I(θ) = E{ l(y 1 ; θ)}, B(θ) = E{ l(y 1 ; θ) l(y 1 ; θ) T }. n 1/2 (IOS IOS ) is asymptotically normally distributed (but the asymptotic variance is complicated and convergence is slow). In i.i.d. location-scale models, the null hypothesis distributions of IOS and IOS A do not depend on parameter values. Parametric bootstrap p-values exact up to MC. The In-and-Out-of-Sample (IOS) Likelihood Ratio Test for Model Misspecification p.12/27

Discussion of IOS A IOS A /n is the trace term appearing in model selection criteria based on Kullback-Leibler discrepancy (Linhart and Zucchini, 1986)). This is related to the fact that AIC can be viewed as an approximation to the out-of-sample log-likelihood in which the trace term is replaced by p/n (Stone, 1977). IOS A is the trace of the ratio of the observed Fisher information matrix, Â( θ), a model-dependent estimate of cov{ l(y 1 ; θ 0 )}, and B( θ), the sample covariance of l(y i ; θ), a model-free estimate. The In-and-Out-of-Sample (IOS) Likelihood Ratio Test for Model Misspecification p.13/27

Information Matrix Test White s 1982 (668) information matrix (IM) test is based on a quadratic form in the vector of differences between elements of B( θ) and Â( θ). Similarities: Both IOS A and IM test looking at something important, e.g., should we use a sandwich estimator of var( θ), or can we trust the model and use inverse Fisher information? Asymptotic distributions not very useful for either IOS/IOS A or IM. Use parametric bootstrap instead. The In-and-Out-of-Sample (IOS) Likelihood Ratio Test for Model Misspecification p.14/27

IOS vs IM Consistency: IM test consistent against any alternative with A(θ 0 ) B(θ 0 )? IOS suggests how to look for alternatives that IOS will miss. Can I use it this afternoon? IM requires a lot of prior analysis/programming for each type of problem, even if we bootstrap. IOS is automatic if we bootstrap. IOS A requires much less analysis than IM. The In-and-Out-of-Sample (IOS) Likelihood Ratio Test for Model Misspecification p.15/27

Larsen and Marx (2001) Example: Hurricane Rainfall Max 24 hour precipitations for 36 hurricanes. Fit a gamma distribution (p = 2). shape = 2.2 ŝcale = 3.3 IOS = 3.6. (Largest obs. contributes 1.73.) Bootstrap p-value Test All data Drop largest Drop 2 largest IOS.028.061.101 IOS A.022.053.099 A-D.044.164.154 K-S.274.380.430 The In-and-Out-of-Sample (IOS) Likelihood Ratio Test for Model Misspecification p.16/27

Hurricane Rainfall Data 0 10 20 30 Max Rainfall (inches) The In-and-Out-of-Sample (IOS) Likelihood Ratio Test for Model Misspecification p.17/27

Johnson and Wichern (1998) Example: Board Stiffness Four measurements of stiffness on 30 boards. Model: 4-dimensional normal (p = 14). IOS = 30.7 p-value boot =.002 J&W identify two outliers (which contribute 5.1 and 13.5 to IOS). After deletion: IOS = 27.2 p-value boot =.006 For testing i.i.d. multivariate normal model: IOS with parametric bootstrap is exact. IOS is very much like Mardia s multivariate kurtosis test. The In-and-Out-of-Sample (IOS) Likelihood Ratio Test for Model Misspecification p.18/27

Feigl and Zelen (1965) Example: Leukemia Survival Survival times of 33 leukemia patients. Predictors: WBC = log of white blood count AG = a binary factor (17 AG pos., 16 AG neg.) Models include WBC:AG interaction (p = 5 for both): Gamma GLM with log link. IOS = 15.7, p-value boot =.03 Linear regression w/ log(survival) as response. IOS = 7.29, p-value boot =.22 The In-and-Out-of-Sample (IOS) Likelihood Ratio Test for Model Misspecification p.19/27

Leukemia Survival (ctd) Does it matter which model we use? Interaction non-significant in gamma model, marginally significant in lognormal model. Important scientific finding in original paper was that WBC slope for AG negative group not significantly different from zero. Simulations suggest that.05 level tests maintain nominal size for both models. If testing lognormal model, power against gamma model is only about.20. Not great, but maybe not bad for n = 33, p = 5. Where s the competition? The In-and-Out-of-Sample (IOS) Likelihood Ratio Test for Model Misspecification p.20/27

Agresti (1996) Example: Horseshoe Crabs No. of satellites counted for 173 nesting females. Model: Poisson GLM with log link Carapace width as predictor (p = 2) Agresti tries Pearson chi-sq and deviance tests: Must pool data over ranges of carapace width. Finds no evidence of lack of fit. Later finds other evidence of overdispersion. IOS = 5.6 p-value boot 0 Negbinom model (p = 3): IOS = 2.66, p-val =.91. The In-and-Out-of-Sample (IOS) Likelihood Ratio Test for Model Misspecification p.21/27

Slaton, Piegorsch and Durham (2000) 107 rat litters, 4 dose levels. Example: Toxicology Model: Heckman-Willis. Beta-binomial regression with implicit logit link. α = exp(a 0 + a 1 x), β = exp(b 0 + b 1 x). p = 4. Slaton, et al tested HW model against a larger (p = 6) model and found no evidence against HW. Larger model allows intralitter correlation to vary freely between the 4 dose levels, BUT still has beta-binomial response, and implicit logit link. The In-and-Out-of-Sample (IOS) Likelihood Ratio Test for Model Misspecification p.22/27

For HW model IOS = 6.34 p-value boot =.04 IOS A = 5.19 p-value boot =.03 Toxicology (ctd) Further analysis (using IOS and other more standard tests) suggests that: Logit link inappropriate. Response at three lowest doses adequately modeled as binomial with same p for all three doses. Response at highest dose NOT adequately modeled by beta-binomial. The In-and-Out-of-Sample (IOS) Likelihood Ratio Test for Model Misspecification p.23/27

Toxicology (ctd) Overall Proportion Dead 0.00 0.05 0.10 0.15 0.20 0.25 0.0 0.1 0.2 0.3 0.4 Dose The In-and-Out-of-Sample (IOS) Likelihood Ratio Test for Model Misspecification p.24/27

IOS is computer intensive (IOS A less so); automatic and easy to employ; and applicable to a variety of problems, where sometimes IM is the only obvious competitor. Other things to work on: dependent data (time series; spatial data). censored data. models without a fully specified likelihood. Conclusions The In-and-Out-of-Sample (IOS) Likelihood Ratio Test for Model Misspecification p.25/27

Some References AGRESTI, A. (1996). An Introduction to Categorical Data Analysis. Wiley, New York. FEIGL, P. and ZELEN, M. (1965). Estimation of exponential survival probabilities with concomitant information. Biometrics 21 826 838. FISHER, R. A. (1973). Statistical Methods for Research Workers. 14th ed. Hafner, New York. GEISSER, S. (1990). Predictive approaches to discordancy testing. In Bayesian and likelihood methods in statistics and econometrics: Essays in honor of George A. Barnard (S. Geisser, J. S. Hodges, S. J. Press and A. Zellner, eds.). North-Holland Publishing Co., Amsterdam. GEISSER, S. and EDDY, W. F. (1979). A predictive approach to model selection. J. Am. Statist. Ass. 74 153 160. JOHNSON, R. A. and WICHERN, D. W. (1998). Applied Multivariate Statistical Analysis. 4th ed. Prentice Hall, Upper Saddle River, NJ. The In-and-Out-of-Sample (IOS) Likelihood Ratio Test for Model Misspecification p.26/27

More References LARSEN, R. J. and MARX, M. L. (2001). An Introduction to Mathematical Statistics and Its Applications. 3rd ed. Prentice-Hall Inc., Englewood Cliffs, NJ. LINHART, H. and ZUCCHINI, W. (1986). Model Selection. Wiley, New York. SLATON, T. L., PIEGORSCH, W. W. and DURHAM, S. D. (2000). Estimation and testing with overdispersed proportions using the beta-logistic regression model of Heckman and Willis. Biometrics 56 125 133. STONE, M. (1977). An asymptotic equivalence of choice of model by cross-validation and Akaike s criterion. J. R. Statist. Soc. B 39 44 47. WHITE, H. (1982). Maximum likelihood estimation of misspecified models. Econometrica 50 1 26. The In-and-Out-of-Sample (IOS) Likelihood Ratio Test for Model Misspecification p.27/27