COMPARING TRANSFORMATIONS USING TESTS OF SEPARATE FAMILIES. Department of Biostatistics, University of North Carolina at Chapel Hill, NC.

Similar documents
LESLIE GODFREY LIST OF PUBLICATIONS

An Encompassing Test for Non-Nested Quantile Regression Models

INTERNALLY STUDENTIZED RESIDUALS TEST FOR MULTIPLE NON-NESTED REGRESSION MODELS

A Monte Carlo Sampling Approach to Testing Separate Families of Hypotheses: Monte Carlo Results

Large Sample Properties of Estimators in the Classical Linear Regression Model

FACULTY WORKING PAPER NO. 1130

Maximum Likelihood (ML) Estimation

G. S. Maddala Kajal Lahiri. WILEY A John Wiley and Sons, Ltd., Publication

Introduction to the Mathematical and Statistical Foundations of Econometrics Herman J. Bierens Pennsylvania State University

Preface Introduction to Statistics and Data Analysis Overview: Statistical Inference, Samples, Populations, and Experimental Design The Role of

Christopher Dougherty London School of Economics and Political Science

Binary choice 3.3 Maximum likelihood estimation

Testing Restrictions and Comparing Models

Models, Testing, and Correction of Heteroskedasticity. James L. Powell Department of Economics University of California, Berkeley

Marcia Gumpertz and Sastry G. Pantula Department of Statistics North Carolina State University Raleigh, NC

EC821: Time Series Econometrics, Spring 2003 Notes Section 9 Panel Unit Root Tests Avariety of procedures for the analysis of unit roots in a panel

Introduction to Eco n o m et rics

On robust and efficient estimation of the center of. Symmetry.

An Information Criteria for Order-restricted Inference

Introduction Large Sample Testing Composite Hypotheses. Hypothesis Testing. Daniel Schmierer Econ 312. March 30, 2007

1 Appendix A: Matrix Algebra

INFORMATION THEORY AND STATISTICS

GMM Based Tests for Locally Misspeci ed Models

Canonical Correlation Analysis of Longitudinal Data

Multivariate Versus Multinomial Probit: When are Binary Decisions Made Separately also Jointly Optimal?

Modification and Improvement of Empirical Likelihood for Missing Response Problem

Some Monte Carlo Evidence for Adaptive Estimation of Unit-Time Varying Heteroscedastic Panel Data Models

Testing Statistical Hypotheses

MAXIMUM LIKELIHOOD ESTIMATION FOR THE GROWTH CURVE MODEL WITH UNEQUAL DISPERSION MATRICES. S. R. Chakravorti

A COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky

Comparing non-nested models in the search for new physics

Analysis of the AIC Statistic for Optimal Detection of Small Changes in Dynamic Systems

Bayesian Interpretations of Heteroskedastic Consistent Covariance Estimators Using the Informed Bayesian Bootstrap

Directed acyclic graphs and the use of linear mixed models

GARCH Models Estimation and Inference

ESTIMATION OF NONLINEAR BERKSON-TYPE MEASUREMENT ERROR MODELS

Bayesian Interpretations of Heteroskedastic Consistent Covariance. Estimators Using the Informed Bayesian Bootstrap

Testing and Model Selection

GARCH Models Estimation and Inference

CHOOSING AMONG GENERALIZED LINEAR MODELS APPLIED TO MEDICAL DATA

Problem Set #6: OLS. Economics 835: Econometrics. Fall 2012

TIME SERIES DATA ANALYSIS USING EVIEWS

A better way to bootstrap pairs

Inferences on a Normal Covariance Matrix and Generalized Variance with Monotone Missing Data

A NOTE ON ROBUST ESTIMATION IN LOGISTIC REGRESSION MODEL

WEAKER MSE CRITERIA AND TESTS FOR LINEAR RESTRICTIONS IN REGRESSION MODELS WITH NON-SPHERICAL DISTURBANCES. Marjorie B. MCELROY *

ON THE USE OF GENERAL POSITIVE QUADRATIC FORMS IN SIMULTANEOUS INFERENCE

ON EFFICIENT FORECASTING IN LINEAR REGRESSION MODELS

Wooldridge, Introductory Econometrics, 4th ed. Appendix C: Fundamentals of mathematical statistics

LECTURE 10: NEYMAN-PEARSON LEMMA AND ASYMPTOTIC TESTING. The last equality is provided so this can look like a more familiar parametric test.

Derivation of Monotone Likelihood Ratio Using Two Sided Uniformly Normal Distribution Techniques

A note on profile likelihood for exponential tilt mixture models

MODEL SELECTION FOR NONNESTED LINEAR MIXED MODELS. Ché L. Smith

Economic modelling and forecasting

FULL LIKELIHOOD INFERENCES IN THE COX MODEL

Shu Yang and Jae Kwang Kim. Harvard University and Iowa State University

Business Economics BUSINESS ECONOMICS. PAPER No. : 8, FUNDAMENTALS OF ECONOMETRICS MODULE No. : 3, GAUSS MARKOV THEOREM

Graduate Econometrics I: Asymptotic Theory

Thomas J. Fisher. Research Statement. Preliminary Results

1. Introduction Over the last three decades a number of model selection criteria have been proposed, including AIC (Akaike, 1973), AICC (Hurvich & Tsa

388 Index Differencing test ,232 Distributed lags , 147 arithmetic lag.

MONTE CARLO ANALYSIS OF CHANGE POINT ESTIMATORS

BEBR FACULTY WORKING PAPER NO Joint Tests of Non-Nested Models and General Error Specifications. Michael McAleer M.

Applied Econometrics (QEM)

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at

NUCLEAR NORM PENALIZED ESTIMATION OF INTERACTIVE FIXED EFFECT MODELS. Incomplete and Work in Progress. 1. Introduction

Notes on Asymptotic Theory: Convergence in Probability and Distribution Introduction to Econometric Theory Econ. 770

Finite-sample quantiles of the Jarque-Bera test

GARCH Models Estimation and Inference. Eduardo Rossi University of Pavia

On a connection between the Bradley Terry model and the Cox proportional hazards model

Non-maximum likelihood estimation and statistical inference for linear and nonlinear mixed models

Functional Form. Econometrics. ADEi.

1/24/2008. Review of Statistical Inference. C.1 A Sample of Data. C.2 An Econometric Model. C.4 Estimating the Population Variance and Other Moments

Greene, Econometric Analysis (6th ed, 2008)

Econometrics of Panel Data

Chapter 1. GMM: Basic Concepts

Econometric Analysis of Cross Section and Panel Data

An Introduction to Multivariate Statistical Analysis

Generalized Linear Models

Statistics 262: Intermediate Biostatistics Model selection

Linear Models 1. Isfahan University of Technology Fall Semester, 2014

A Non-Parametric Approach of Heteroskedasticity Robust Estimation of Vector-Autoregressive (VAR) Models

AN EFFICIENT GLS ESTIMATOR OF TRIANGULAR MODELS WITH COVARIANCE RESTRICTIONS*

Theory and Methods of Statistical Inference. PART I Frequentist likelihood methods

Ch. 5 Hypothesis Testing

Open Problems in Mixed Models

A Recursive Formula for the Kaplan-Meier Estimator with Mean Constraints

Tests of transformation in nonlinear regression

1/34 3/ Omission of a relevant variable(s) Y i = α 1 + α 2 X 1i + α 3 X 2i + u 2i

Let us first identify some classes of hypotheses. simple versus simple. H 0 : θ = θ 0 versus H 1 : θ = θ 1. (1) one-sided

Discussion of the paper Inference for Semiparametric Models: Some Questions and an Answer by Bickel and Kwon

COLLABORATION OF STATISTICAL METHODS IN SELECTING THE CORRECT MULTIPLE LINEAR REGRESSIONS

Statistical Inference On the High-dimensional Gaussian Covarianc

Recall that in order to prove Theorem 8.8, we argued that under certain regularity conditions, the following facts are true under H 0 : 1 n

A Local Generalized Method of Moments Estimator

Least Absolute Value vs. Least Squares Estimation and Inference Procedures in Regression Models with Asymmetric Error Distributions

H-LIKELIHOOD ESTIMATION METHOOD FOR VARYING CLUSTERED BINARY MIXED EFFECTS MODEL

A Very Brief Summary of Statistical Inference, and Examples

Nonparametric Identification of a Binary Random Factor in Cross Section Data - Supplemental Appendix

Multiple hypothesis testing and testing one hypothesis multiple times: two sides of the same coin?

Transcription:

COMPARING TRANSFORMATIONS USING TESTS OF SEPARATE FAMILIES by Lloyd J. Edwards and Ronald W. Helms Department of Biostatistics, University of North Carolina at Chapel Hill, NC. Institute of Statistics Mimeo Series No. 1894 March 1992

1 Comparing Transformations Using Tests of Separate Families By Lloyd J. Edwards and Ronald W. Helms Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599 Summary Cox's method for testing separate hypotheses is applied to derive a decision rule for choosing between the models f(y) "'" NnC~~l' O'~!n) and g(y) "'" Nn(~~2' O'~!n)' The asymptotic distributions of the decision statistics are derived, subject to suitable regularity conditions. Some key words: Regression; Separate families; Transformation. 1. Introduction A number of situations arise in which one must choose between the following two hypothesized models for a set of data: (l.la) (l.2a) or written in standard regression notation as: (l.lb) (l.2b) where f(!) = [f(yi)] is an n x 1 vector of transformed values of a response variable, g(!) = [g(yi)], l and ~ are n x p and n x q matrices of values of independent variables, ~l and ~2 are unknown primary parameter vectors (p x 1 and q x 1, respectively), O'~ and O'~ are unknown variance parameters, and! "'" Nn(g, O'~!n)' y"'" Nn(g, O'~!n)' A diagnostic procedure may lead to a negative conclusion that one or both of these models does not fit the data well. Since diagnostic procedures do not combine information from both models they do not lead to a positive decision rule for selecting one model instead of the other.

2 Cox (1961, 1962) defined two composite hypotheses as separate if a simple hypothesis in one cannot be reached as a limit of simple hypotheses in the other. In particular, if one hypothesis is a subhypothesis of another the two hypotheses are not separate. Cox proposed a general methodology, based on likelihood ratios, for testing separate hypotheses after recognizing that ad hoc methods certainly existed for handling these types of problems. Cox left open many details and examples for subsequent investigation. In the ensuing years, several authors investigated testing separate families of hypotheses either by Cox's method or by proposing an alternative method (see Atkinson (1970), Sawyer (1983), Mizon (1984), Loh (1985)). Pesaran (1974) derived Cox's test for testing separate linear models with nonstochastic explanatory variables and later (Pesaran, 1982) studied power calculcations for these tests. Jackson (1968) derived the Cox test for comparing several families, including lognormal versus gamma, exponential versus lognormal, and others. Pereira (1977) studied consistency of the Cox test compared to Atkinson's (1970) test and Pereira (1978) applied Cox's method to derive a procedure for choosing between separate log-linear models. Numerous other references can be found in Mizon and Richard (1986) and Godfrey (1988). To our knowledge no one has applied Cox's method to derive a procedure for choosing between two different transformed regression models. In this paper we apply Cox's method to derh:e a decision procedure for choosing between two different transformed regression models with possibly different design matrices, along the lines of Pesaran (1974). 2. A Procedure Based On Cox's Method.. The hypotheses to be tested are given by (1.1) and (1.2). The Yi are assumed positive. Unlike typical likelihood ratio test situations, in this context neither HI nor H 2 is a "null hypothesis" with a "preferred status". At some points in the procedure HI will be the "null hypothesis" and at other points H 2 will be the null. We assume both f( ) and g(.) are one-to-one, continuously differentiable, real-valued functions. For convenience, define:!i = Row/~), i=l,...,n and b = [g({\!i~i))] is an n x 1 vector of the function g composed with the function {I, (2.1) [. ] will denote an n x 1 vector throughout the remainder of this paper, (2.2a) (2.2b)

3 We shall consider the asymptotic properties of test procedures as the number of observations, n, goes to infinity. We make the following regularity assumptions: (2.3a) (2.3b) lim (-n 1 ZTh) = E} (J. 0). n-oo - - - z z.,.. - (2.3c) The initial phase of Cox's method treats HI as a "null hypothesis" and leads to the statistic ( Cox, 1961, 1962 ) (2.4) where the log-likelihoods for HI and H 2 are given by Ll(~l) and L2(~2); L 2l =L l (f l ) - L 2 (f 2 ) is the maximized log-likelihood ratio; ~l = (O'~, ~f)t, ~2 = (O'~, ~nt with f l and f 2 denoting the maximum likelihood estimators of ~l and ~2 under HI and H 2. Plim( ) denotes the probability limit under the null hypothesis HI' where the limit is a function of the- unknown parameter ~l and ~l is replaced by fl' Cox's procedure would compare the maximized log-likelihood ratio with its expected value under the null hypothesis if the expected value were available. Here, the expected value is not available and has been replaced by the rightmost term in (2.4). Cox showed that when HI is true, the test statistic Ti is asymptotically normally distributed with mean zero and variance VI(Ti), under certain regularity conditions (see White (1982) ). Cox showed that the variance, which is a function of the unknown parameter ~l' can be given by: (2.5) where we define L 2l = Ll(~l) - L2(~2l) and ~2l is the asymptotic expectation of ~2 under HI ; II = Il(~l) is the information matrix of ~l under HI; 8{Plim (L2dn)} '1 = 'l(~l) = n n-a~l (2.6) Using the log-likelihood equations for HI and H 2, it is straightforward to show that

4 where Ll(~I) = - ~log(21r0'~) - 2;if(~) - ~~I)T(f(~) - ~~1) 1 and The maximum likelihood estimates of the parameters under HI and H 2 are given by and Under Hl' we find that (2.9) We use a first order Taylor expansion on the function h(fi) = g({\~~1 + fi» about the point fi = 0 for i=l,...,n to obtain the following: (2.10) where g'(. ) and gl/(. ) will denote the first and second derivatives of the function g(. ). Thus, under HI we have So, under suitable conditions (see Appendix A), u~ - fto-~1 -+ 0 in probability as n-+oo, where (2.12) -1 2 where y = diag{[g'(f (~~1» n

5 Assume fto-~1 -+ 0"~1 (~1) as n -+ 00. Since u~ is a consistent estimate of O"~ assuming HI is true, we have the following result: (2.13) Thus, we have a test statistic expression for Tl' which is an approximation of Tt: (2.14) To derive the asymptotic variance of Tt we need the asymptotic expectation of ~2 Under Hl' we can write under HI' again by using a first order Taylor expansion. We then have, by using the above expression, that ~2 -+ 1!21 in probability as n -+ 00 under HI' where (2.15) Assuming HI is true, we can write 2 L21 = ~log:2l- 2;if(l) - ~1!1)T(f(l) - ~1!1) + 2;2 (g(l) - ~1!21)T(g(l) - ~1!21) 1 1 21 n 0"~1 1 n 2 1 n -1-1 2 ~ 2"log-2 - -2.:~:>i +-2.L {g(f (~I!I» + g'(f (~I!I»fi -!ii!21}, 0"1 20"11=1 20"211=1 again by using a first order Taylor expansion. Now we have 1 n 1-1 2 1 2' ~ Vd 2".L (-2(g'(f (!ii!i))) - 2)fi 1=1 0"21 0"1 1 n -1-1 +:r.l 1 (g(f (!ii!i» -!ii!21)g'(f (!ii!i»fi] V211=

6 (2.16) since fi independent identically distributed N(O, 11'~) In addition, it can be shown that for i=l,...,n., (2.17a) where (2.17b) and (2.17c) It also can be shown that (2.18) Thus, we have everything we need to evaluate (2.19) To get V1(T1) we replace the unknown parameters by their consistent estimates. Hence, by Cox's methodology we have TdrY1(T1)]1/2 is asymptotically normally distributed with mean zero and variance one under H 1 This allows us to provide a large sample approximation of the associated p-value of the test. In Cox's formulation the two hypotheses, H 1 and H 2, are considered asymmetrically, where H 2 serves as the type of alternative for which high power is desired. Hence, it is instructive to compute T 2' Cox's test statistic assuming the null and alternative hypotheses have been interchanged. Appendix B gives the main results for interchanging H 1 with H 2

7 3. Discussion We have derived the Cox statistic for tests of separate families of hypotheses for comparing two different transformed regression models. The testing procedure allows great flexibility in choosing possibly different design matrices for selecting between transformations. In the derivation we assumed that both f(. ) and g(. ) were continuously differentiable. However, as can be seen in the derivation, we only need the existence of the first and second derivatives. The full utility of this method can only be assessed after further research into its asymptotic power and level of significance performance, small sample properties, and applications to particular testing scenarios by other researchers. Although some of the equations may look formidable, the computations can be carried out rather straight-forward using any computer software which has matrix handling capabilities such as SAS IML. One common example of application would be to use the results of this paper to compare an untransformed linear regression model to that of a log-transformed model (see Edwards and Helms (1990». Acknowledgement The authors would like to thank Dr. P. K. Sen for his helpful suggestions. This research was supported by NHLBI grant number P50HL19171. REFERENCES Atkinson, A. C. (1970), A method for discriminating between models. J. R. Statist. Soc. B, 32, 323-353. Box, G. E. P. and Cox, D. R. (1964), An analysis of transformations. J. R. Statist. Soc. B 26, 211-243, discussion 244-252. Cox, D. R. (1961), Tests of separate families of hypotheses. Proc. 4th Berkeley Symp. 1, 105-123. Cox, D. R. (1962), Further results on tests of separate families of hypothesis. J. R. Statist. Soc. B 24, 406-424. Edwards, L. J. and Helms, R. W. (1990), On choosing between a linear or log-linear regression model. Unpublished manuscript. Department of Biostatistics, University of North Carolina at Chapel Hill..Fisher, G. and McAleer, M. (1979), On the interpretation of the Cox test in econometrics. Economics Letters 4, 145-150.

8 Godfrey, L. G. (1988), Misspecification Tests In Econometrics: The Lagrange Multiplier Principle and Other Approaches. Cambridge University Press, New York. Jackson, O. A. Y. (1968), Some results on tests of separate families of hypotheses. Biometrika 55, 2, 355-363. Loh, Wei-Yin (1985), A new method for testing separate families of hypotheses. J. Amer. Statist. Assoc., 80, 362-368. Mizon, G. E. (1984), The encompassing principle in econometrics, Econometrics and Quantitative Economics, Ed. D. F. Hendry and K. F. Wallis, pp. 135-172. Oxford: Basil Blackwell. Mizon, G. E. and Richard, J.-F. (1986), The encompassing principle and its application to testing non-nested hypotheses. Econometrica 54, 3, pp 657-678. Pereira, B., (1977), A note on the consistency and on the finite sample comparisons of some tests ofseparate families of hypotheses. Biometrika 64, 1, 109-113. Pereira, B., (1978), Tests and efficiencies of separate regression models. Biometrika 65, 2, 319-327. Pesaran, M. H. (1974), On the general problem of model selection. Review of Economic Studies 126, 153-171. Pesaran, M. H. and Deaton, A. S. (1978), Testing non-nested nonlinear regression models. Econometrica, 46, 677-694. Pesaran, M. H. (1982), Comparison of local power of alternative tests of non-nested regression models. Econometrica, 50, 1287-1305. Rao, C. R. (1973) Linear Statistical Inference and Its Applications. Wiley, New York. Sawyer, K. R. (1983), Testing separate families of hypotheses: An information criterion. J. R. Statist. Soc. B, 45, 89-99. Searle, S. R. (1971), Linear Models. Wiley, New York. Searle, S. R. (1982), Matrix Algebra Useful for Statistics. Wiley, New York. Sertling, R. J. (1980), Approximation Theorems of Mathematical Statistics. Wiley, New York. White, H. (1982), Regularity conditions for Cox's test of non-nested hypotheses. J. Econometrics, 19, 301-318. I,.

9 Appendix A Sufficient condition for &~ - fto-~i - 0 in probability as n-oo (section 2, p 4): find Under HI' upon using the first order Taylor expansion and simplifying the quadratic form, we (A.I) (A.2) Since g'({\~~i))fi... independently distributed N(O, O'Ng'({\~~I))}2), for i = 1,...,n, we have (see Searle (1971) ) (A.3) Thus, by Chebychev's inequality, if Vl(&~) - 0 as n - 00, then &~ - fto-~l - 0 in probability as n-oo, where VI (&~) is given by the right-hand side of the above equation. So, the sufficient condition needed for the probability convergence is that the right-hand side of the above equation goes to zero as n -00. A completely analogous condition is obtained when interchanging the hypotheses.

10 Appendix B Interchanging Hypotheses In Cox's formulation the two hypotheses, HI and H 2, are considered asymmetrically, where H 2 serves as the type of alternative for which high power is desired. Hence, it is instructive to compute T 2' Cox's test statistic assuming the null and alternative hypotheses have been interchanged. Presenting T 2 will also help to illustrate why the hypotheses are considered asymmetrically. Using notation analogous to that useti in section 2 and corresponding expectations taken under H 2, we have the following: (B.l) (B.2a) (B.2h) (B.3) (BAa) (BAh) (BAc) and P~~~(n!;') = [ 20'4 2 g u,:,} 2-lz (B.5)

11 Again, by Cox's ~ethodology we have T2/[V2(T2)]1/2 is asymptotically normally distributed with mean zero and variance one under H 2 o