PRE-TEST ESTIMATION OF THE REGRESSION SCALE PARAMETER WITH MULTIVARIATE STUDENT-t ERRORS AND INDEPENDENT SUB-SAMPLES

Similar documents
TESTING FOR NORMALITY IN THE LINEAR REGRESSION MODEL: AN EMPIRICAL LIKELIHOOD RATIO TEST

MSE Performance of the Weighted Average. Estimators Consisting of Shrinkage Estimators

GARCH Models Estimation and Inference

Wooldridge, Introductory Econometrics, 4th ed. Appendix C: Fundamentals of mathematical statistics

Testing Equality of Two Intercepts for the Parallel Regression Model with Non-sample Prior Information

THE UNIVERSITY OF BRITISH COLUMBIA DEPARTMENT OF STATISTICS TECHNICAL REPORT #253 RIZVI-SOBEL SUBSET SELECTION WITH UNEQUAL SAMPLE SIZES

MSE Performance and Minimax Regret Significance Points for a HPT Estimator when each Individual Regression Coefficient is Estimated

Journal of Econometrics 3 (1975) North-Holland Publishing Company

Preliminary Statistics Lecture 5: Hypothesis Testing (Outline)

Shrinkage Estimation of the Slope Parameters of two Parallel Regression Lines Under Uncertain Prior Information

Mean squared error matrix comparison of least aquares and Stein-rule estimators for regression coefficients under non-normal disturbances

1 Motivation for Instrumental Variable (IV) Regression

Journal of Statistical Research 2007, Vol. 41, No. 1, pp Bangladesh

EC212: Introduction to Econometrics Review Materials (Wooldridge, Appendix)

Statistical Inference: Estimation and Confidence Intervals Hypothesis Testing

More Powerful Tests for Homogeneity of Multivariate Normal Mean Vectors under an Order Restriction

EC2001 Econometrics 1 Dr. Jose Olmo Room D309

Stein-Rule Estimation under an Extended Balanced Loss Function

Quantitative Methods for Economics, Finance and Management (A86050 F86050)

Sociedad de Estadística e Investigación Operativa

Research Article A Nonparametric Two-Sample Wald Test of Equality of Variances

Bayesian Semiparametric GARCH Models

Interpreting Regression Results

Bayesian Semiparametric GARCH Models

AN EMPIRICAL LIKELIHOOD RATIO TEST FOR NORMALITY

Questions 3.83, 6.11, 6.12, 6.17, 6.25, 6.29, 6.33, 6.35, 6.50, 6.51, 6.53, 6.55, 6.59, 6.60, 6.65, 6.69, 6.70, 6.77, 6.79, 6.89, 6.

An Introduction to Multivariate Statistical Analysis

Inferences on a Normal Covariance Matrix and Generalized Variance with Monotone Missing Data

PRE -TEST ESTIMATION IN A REGRESSION MODEL WITH A MIS-SPECIFIED ERROR COVARIANCE MATRIX

MA Advanced Econometrics: Applying Least Squares to Time Series

MARGINAL HOMOGENEITY MODEL FOR ORDERED CATEGORIES WITH OPEN ENDS IN SQUARE CONTINGENCY TABLES

Problem Selected Scores

Review of Classical Least Squares. James L. Powell Department of Economics University of California, Berkeley

COMPARISON OF FIVE TESTS FOR THE COMMON MEAN OF SEVERAL MULTIVARIATE NORMAL POPULATIONS

1 Appendix A: Matrix Algebra

Prof. Dr. Roland Füss Lecture Series in Applied Econometrics Summer Term Introduction to Time Series Analysis

Preliminary Statistics. Lecture 5: Hypothesis Testing

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.

State-space Model. Eduardo Rossi University of Pavia. November Rossi State-space Model Fin. Econometrics / 53

Introductory Econometrics. Review of statistics (Part II: Inference)

Department of Econometrics and Business Statistics

Kumaun University Nainital

Econometrics. 4) Statistical inference

Lecture 5: Omitted Variables, Dummy Variables and Multicollinearity

Econometrics. Week 4. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

Appendix from L. J. Revell, On the Analysis of Evolutionary Change along Single Branches in a Phylogeny

Minimax design criterion for fractional factorial designs

Inference in Normal Regression Model. Dr. Frank Wood

Bias Variance Trade-off

Analysis of the AIC Statistic for Optimal Detection of Small Changes in Dynamic Systems

Long-Run Covariability

Business Economics BUSINESS ECONOMICS. PAPER No. : 8, FUNDAMENTALS OF ECONOMETRICS MODULE No. : 3, GAUSS MARKOV THEOREM

Testing Some Covariance Structures under a Growth Curve Model in High Dimension

Tube formula approach to testing multivariate normality and testing uniformity on the sphere

Elliptically Contoured Distributions

A nonparametric two-sample wald test of equality of variances

July 31, 2009 / Ben Kedem Symposium

Marginal Specifications and a Gaussian Copula Estimation

Cost Efficiency, Asymmetry and Dependence in US electricity industry.

:Effects of Data Scaling We ve already looked at the effects of data scaling on the OLS statistics, 2, and R 2. What about test statistics?

Non-parametric Inference and Resampling

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages:

Testing for a unit root in an ar(1) model using three and four moment approximations: symmetric distributions

COMPARISON OF GMM WITH SECOND-ORDER LEAST SQUARES ESTIMATION IN NONLINEAR MODELS. Abstract

Using all observations when forecasting under structural breaks

SHRINKAGE ESTIMATOR AND TESTIMATORS FOR SHAPE PARAMETER OF CLASSICAL PARETO DISTRIBUTION

Volume 03, Issue 6. Comparison of Panel Cointegration Tests

One-way ANOVA. Experimental Design. One-way ANOVA

Least Absolute Value vs. Least Squares Estimation and Inference Procedures in Regression Models with Asymmetric Error Distributions

GARCH Models Estimation and Inference

Introduction to the Mathematical and Statistical Foundations of Econometrics Herman J. Bierens Pennsylvania State University

Robust Backtesting Tests for Value-at-Risk Models

Regression and Statistical Inference

Introduction to Econometrics. Review of Probability & Statistics

Statistical Inference of Covariate-Adjusted Randomized Experiments

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Panel Data?

A Bayesian Treatment of Linear Gaussian Regression

The regression model with one stochastic regressor.

Simultaneous Equation Models Learning Objectives Introduction Introduction (2) Introduction (3) Solving the Model structural equations

Kruskal-Wallis and Friedman type tests for. nested effects in hierarchical designs 1

Hypothesis Testing One Sample Tests

BTRY 4830/6830: Quantitative Genomics and Genetics Fall 2014

CHAPTER 6 SOME CONTINUOUS PROBABILITY DISTRIBUTIONS. 6.2 Normal Distribution. 6.1 Continuous Uniform Distribution

The regression model with one fixed regressor cont d

Math, Stats, and Mathstats Review ECONOMETRICS (ECON 360) BEN VAN KAMMEN, PHD

Estimation of parametric functions in Downton s bivariate exponential distribution

5 Introduction to the Theory of Order Statistics and Rank Statistics

Stat 231 Exam 2 Fall 2013

Wooldridge, Introductory Econometrics, 3d ed. Chapter 9: More on specification and data problems

Bayesian Estimation of Regression Coefficients Under Extended Balanced Loss Function

Hypothesis Testing hypothesis testing approach

Thomas J. Fisher. Research Statement. Preliminary Results

Volatility. Gerald P. Dwyer. February Clemson University

Test for Parameter Change in ARIMA Models

When is a copula constant? A test for changing relationships

Final Exam. Economics 835: Econometrics. Fall 2010

ON SMALL SAMPLE PROPERTIES OF PERMUTATION TESTS: INDEPENDENCE BETWEEN TWO SAMPLES

Multivariate Statistical Analysis

A characterization of consistency of model weights given partial information in normal linear models

Module 9: Stationary Processes

Transcription:

Sankhyā : The Indian Journal of Statistics 1994, Volume, Series B, Pt.3, pp. 334 343 PRE-TEST ESTIMATION OF THE REGRESSION SCALE PARAMETER WITH MULTIVARIATE STUDENT-t ERRORS AND INDEPENDENT SUB-SAMPLES By JUSTON ANDERSON NZ Ministry of Housing and JUDITH A. GILES University of Victoria SUMMARY. We consider the estimation of the error variance of a regression when additional information is available in the form of a second sample, which may be generated from a process with the same variance. This problem has received attention in the literature when the joint errors are members of the spherically symmetric family. We extend this assumption to one in which the errors in each sample are independent multivariate Student-t random vectors. We derive the exact risk under quadratic loss of the pre-test estimator which results after a test for homogeneity of the variances and we compare the risk of this estimator with that of its component estimators. 1. Introduction and model framework Suppose we have two linear regressions of the form : y i = X i β i + ɛ i, i =1, 2...(1) where y i is a (T i 1) vector of observations on the dependent variable; X i is a(t k i ) full column rank matrix of non-stochastic regressors with k i <T i ; β i is a (k i 1) vector of parameters and ɛ i is a (T i 1) vector of disturbance terms. We assume that ɛ 1 and ɛ 2 are independently generated from multivariate Student-t (Mt) distributions, with probability density functions (p.d.f.) f(ɛ i ν i,σ i )= ( )( ν ν i/2 i Γ((ν i + T i )/2) π T i/2 i ( ν i + ɛ iɛ i /σ 2 i ) (Ti+ν i)/2 ; i =1, 2. ) 1 Γ(ν i /2)σ Ti i Paper received. March 1993; revised December 1993. AMS (1980) subject classifications : Primary 62J05, 62F11; secondary 62P20. Key words and phrases. Preliminary testing, conditional inference, non-normal disturbances, testing for homogeneity.

pre-test estimation of regression scale parameters 335 ν i is the degrees of freedom parameter and σ i is the scale parameter of the distribution. If ν i > 2, then E(ɛ i )=0andvar(ɛ i )=σɛ 2 i = ν i σi 2/(ν i 2). When ν i =1, the p.d.f. is Cauchy for which no finite integral moments exist; when ν i = it is normal. We suppose that the researcher desires an estimate of σɛ 2 1, the variance of the first sample, when it is suspected that ɛ 2 may have the same variance. When the error variances are unequal we use the so-called never-pool estimator (NPE), s 2 N, to estimate σɛ 2 1 : s 2 N = s 2 1 = ɛ 1M 1 ɛ 1 /v 1...(2) where v i = T i k i ; M i = I Ti X i (X i X i) 1 X i ; i =1,2. s2 N uses only the information from the first sample. Alternatively, if σɛ 2 1 = σɛ 2 2 it is then more efficient to use the always-pool estimator (APE), s 2 A : s 2 A =(v 1 s 2 1 + v 2 s 2 2)/(v 1 + v 2 )...(3) where s 2 2 is defined analogously to s 2 1. Given the uncertainty about the equality of the error variances a typical strategy is to pre-test for homogeneity and then use s 2 N if we reject homogeneity or use s 2 A if we cannot reject homogeneity. The estimator actually reported after such a procedure is the so-called pre-test estimator (PTE) s 2 P : { s 2 s 2 P = A if J c s 2... (4) N if J>c where J = s 2 2/s 2 1 is the test statistic used to test the hypothesis 1 H 0 : σ 2 ɛ 1 = σ 2 ɛ 2 vs. H A : σ 2 ɛ 1 σ 2 ɛ 2 and c is the critical value of the test corresponding to an α% significance level. If ɛ =(ɛ 1ɛ 2) is distributed as a member of the elliptically symmetric family of distributions then it is well known that f(j) =φ 1 f ( F (v2,v 1)) where φ = σ 2 ɛ1 /σ 2 ɛ 2 is a measure of the hypothesis error and F (v2,v 1) is a central F random variable with v 2 and v 1 degrees of freedom (see King (1979) and Chmielewski (1981)). However, if ɛ 1 and ɛ 2 are independent Mt random vectors then J has a non-standard distribution. Ohtani (1993) derives the density of J in this case : 0 f(j) =[B(v 1 /2,v 2 /2)B(ν 1 /2,ν 2 /2)] 1 θ ν 1/2 J ν 1/2 1 t (v 2+ν 2 )/2 1 (1 t) (v 1+ν 1 )/2 1 (t + θj(1 t)) (ν 1+ν 2 )/2 dt where θ =(v 2 /v 1 )[ν 1 σ 2 1/(ν 2 σ 2 2)] and B(.,.) is the Beta function. Ohtani also tabulates a limited number of critical values for the test under this assumption, assuming a 5% significance level. The particular pre-test problem considered here has been well investigated in the literature but not under the above specified error term assumptions. 1 For simplicity we assume a one-sided alternative hypothesis. It is straightforward, though tedious, to extend our analysis to the two-sided case.

336 j. anderson and j. a. giles For example, Bancroft (1944), Toyoda and Wallace (1975), Ohtani and Toyoda (1978), and Bancroft and Han (1983) all consider this problem from various aspects under normal errors 2. Bancroft (1944) concludes from his numerical evaluations that the PTE which uses a critical value of unity strictly dominates the NPE. Toyoda and Wallace (1975) show that the risk of the APE always has two intersectios with the risk of the NPE and that one intersection always lies in the range φ (0, 1). This implies that the NPE cannot strictly dominate the APE and vice versa. Toyoda and Wallace also show that a PTE with a critical value in the range c (0, 2) strictly dominates the NPE, and that the PTE with c = 1 almost always dominates the APE, except in the neighbourhood of the null hypothesis. They consequently suggest c = 1 as a reasonable choice for the pre-test (p.399) and go on to show that this choice of critical value maximises relative average efficiency. Ohtani and Toyoda (1978) consider optimal critical values for this pre-test problem according to a minimax regret criterion. They show that the risk function for the PTE declines monotonically for c (0, 1], and obtains a local minimum at c = 1. Using a minimax regret criterion they solve numerically for an optimal critical value, c*. They find that c* depends on v 1 and v 2 and ranges from 1.7 to 2.8 (for the cases evaluated) which correspond to sizes from 6% to 22%. Bancroft and Han (1983) also consider the choice of an optimal critical value according to another criterion - so called relative efficiency. Their results suggest significance levels ranging from 24% to 48%, depending on v 1 and v 2. As the processes generating many time-series are non-normal, Giles (1990, 1992) extends the above cited work to one where the joint error term in the model is distributed according to the scale mixture of normal family of distributions, which are members of the elliptically symmetric family of distributions. It is then possible to consider error term assumptions which result in more or less kurtosis than under a normality assumption. One special member of this family is the Mt distribution which results in uncorrelated but dependent errors. She shows that the risk function for the PTE has a minimum when c = 1 and also that for small values of the Mt shape parameter, ν, (i.e. fat-tailed marginal distributions) this PTE strictly dominates both the NPE and the APE. Here we extend the error term assumptions further by assuming that the errors in each sample are Mt but are independent. This allows for the error distributions for the samples to have potentially different shape, as well as scale, parameters. The outline of this paper is as follows : Section 2 derives the risk functions of the NPE, APE and PTE under the independent Mt assumption and undertakes some comparisons of the risk properties of the three estimators. In Section 3 we consider some numerical evaluations of these risk functions. Here we use Ohtani s (1993) critical values where possible but we also consider 2 See Giles and Giles (1993) for a survey of this literature.

pre-test estimation of regression scale parameters 337 that a researcher may incorrectly assume normality of the errors and so use critical values from the central-f distribution. For the latter, of course, the true significance level will differ from the assigned nominal signmficance level. We conclude with some final remarks in Section 4. 2. The risk functions Let s 2 be any estimator of σɛ 2 1 and let its risk under quadratic loss be defined by R(s 2 )=E(s 2 σɛ 2 1 ) 2. Then, if ɛ 1 and ɛ 2 are generated from independent Mt distributions with ν 1 > 4 and ν 2 > 4wehave: Theorem 1. R(s 2 N )=2ν2 1σ 4 1(v 1 + ν 1 2) ( v 1 (ν 1 2) 2 (ν 1 4) )...(5) R(s 2 A ) = ν2 1σ1{(ν 4 2 4) [ v 1 (2v 1 + ν 1 )+(v2 2 + v 1 )(ν 1 4) ] +v 2 (v 2 + 2)(ν 2 2)(ν 1 4)/φ 2 2v2(ν 2 1 4)(ν 2 4)/φ}/ ( (v 1 + v 2 ) 2 (ν 1 2) 2 (ν 1 4)(ν 2 4) )...(6) R(s 2 P ) = ν1σ 2 1{(v 4 1 + v 2 ) 2 (v 1 + 2)(ν 1 2) +v 1 v 2 (v 2 +2)P 0440 (ν 1 2) 2 (ν 1 4) v 2 (2v 1 + v 2 )(v 1 +2)P 4004 (ν 1 2) 2 (ν 1 4) +2v1v 2 2 P 2222 (ν 1 2) 2 (ν 1 4) 2v 1 (v 1 + v 2 )((v 1 + v 2 )(ν 1 4) + v 2 P 0220 (ν 1 2)(ν 1 4) v ( 2 P 2002 (ν 1 2)(ν 1 4)) + v 1 (v 1 + v 2 ) 2 (ν 1 4)}/ v1 (v 1 + v 2 ) 2 (ν 1 2)(ν 1 4) )...(7) where ( P abij = Γ ν1+ν ) ( 2 a b 2 Γ( ν 1 2 )Γ( ν 2 2 ) ) ν2/2 ν 2 2 (ν φ v 2 b)/2 1 (cv 2 ) (ν1 a)/2 2 (a+b)/2 (ν 1 2) (ν1 a b)/2 1 0 t(ν2 b)/2 1 1 (1 t 1 ) (ν1 a)/2 1 (v 1 t 1 (ν 2 2)/φ +(ν 1 2)cv 2 (1 t 1 )) (a+b ν 1 ν 2 )/2 ( I 1 t1 2 (v 2 + i); 1 2 (v 1 + j) ) dt 1 Proof. Details are available in Anderson and Giles (1993) or from the authors on request. Remarks. (1) When c =0(α = 1) we always reject H 0 and the risk of the PTE collapses to that of the NPE. Conversely, if c (α 0) we never reject H 0 and R(s 2 P )=R(s2 A ).

338 j. anderson and j. a. giles (2) The risk functions of s 2 N and s2 A have two intersections with respect to φ. Let these be φ 1 and φ 2. Their values are φ 1 =(w+τ 1/2 )/ζ and φ 2 =(w τ 1/2 )/ζ, where w = 2v 1 v2(ν 2 1 4)(ν 2 4),τ =8v 1 v2(ν 2 2 4)(ν 1 4) ( [ 2v1(ν 2 2 2)(v 1 +2) v 1 v 2 2 (ν 1 ν 2 2) + v 2 (2 ν 4(ν 2 2) (ν 1 2)] +v 2 (ν 2 2)(ν 1 2)(ν 2 + 2)) and ζ = 2v 2 (ν 2 4) (4v1 2 v 1 (4(ν 1 2) v 2 (ν 1 6)) +2v 2 (ν 1 2)). As R(s 2 A ) is a quadratic in 1/φ, with an asymptote at φ = 0, one of these intersections will lie in the range (0, ) and the other in the range (0, + ). There are three possiblilities. First, one intersection lies in the range (0, 1). This implies that neither of the NPE or the APE strictly dominates the other, and accords with the results under spherically symmetric disturbances. The second alternative is that the positive intersection is greater than unity. Then, the NPE strictly dominates the APE over the range φ (0, 1]. The third possibility is that there are no real intersections. This occurs when τ is negative and in this case the NPE again strictly dominates the APE. We consider these cases further in the next section. (3) When ν 1 and ν 2, the risk functions collapse to their normal counterparts (see, for example, Toyada and Wallace (1975)) 3. (4) As φ 0,R(s 2 P ) R(s2 N ) while R(s2 A ). That is, pre-testing leads us to follow the correct strategy when the prior information is very false. (5) Typically R(s 2 A ) <R2 (s 2 N ) when φ = 1, although there are exceptions as noted in the above point 2. It is also possible for R(s 2 P ) <R(s2 A ) when φ =1. We illustrate such cases in the next section. Theorem 2. Extrema of R(s 2 P ) result when c =0,c, and c =1. Proof. Details are available in Anderson and Giles (1993) or from the authors on request. So, for any particular value of φ, the minimum risk estimator among those considered in this paper may be either the APE, the NPE, or the PTE with c =1. 3. Numerical evaluations of the risk functions To illustrate the results we have numerically evaluated the risk functions. Various values of the arguments were considered : v 1,v 2 = 10, 30, 40; ν 1,ν 2 = 5, 10, 50, 100, ; those critical values corresponding to a true size of 5% (Ohtani (1993)), those from the central-f distribution corresponding to nominal sizes of 1%, 5%, 30% and 75% and a critical value of unity. Full details of these results are available on request. The evaluations were undertaken using a FORTRAN program written by the authors, which utilises several subroutines from Press 3 Note that although the risks under normality are a special case of our results, those of Giles (1992) are not.

pre-test estimation of regression scale parameters 339 et al. (1986). We executed the program on a VAX 7610 and a VAXstation 4000. Figures 1 to 4 provide representative results. The horizontal axis, in each figure, measures the extent of the hypothesis error φ (0, 1]. The vertical axis measures risk and we have assumed, without loss of generality, that σ 2 1 =1.

340 j. anderson and j. a. giles

pre-test estimation of regression scale parameters 341 The following points can be noted : (1) The figures illustrate the possible cases referred to in the previous Section as point 2. Specifically, the risk functions of the NPE and APE can intersect at a value of φ (0, 1) (see for example Figure 1). Our results suggest that this will occur for all values of v 1 and v 2 when ν 2 ν 1. Figure 2 provides an example where the NPE strictly dominates the APE for all φ (0, 1]. The evaluations suggest that this case is likely when ν 2 is sufficiently smaller than ν 1. So, if the error term of the model for the second sample has marginal distributions which have fatter tails than that for the first sample then it is never optimal to pool the data, even if the variances are equal. This result contrasts to that found when the joint disturbance is spherically symmetric. Then it is always preferable to pool the samples when the variances are equal rather than to simply ignore the prior information. (2) An increase in v 1,v 2 shifts the risk functions downwards, as dose also an increase in ν 1,ν 2. (3) There is no strictly dominating estimator when the disturbances are normal or ν 1 and ν 2 are large (see for example Figure 3). Then, the APE has the smallest risk around the neighbourhood of H 0, while it is generally preferable to employ the PTE with c = 1 otherwise. (4) For small ν 1,ν 2 (e.g. 5, 10) the PTE can strictly dominate both the NPE and the APE (see for example Figure 4). Typically, the PTE which uses c = 1 strictly dominates all other PTE s. This result accords with those of Toyoda and Wallace (1975), Ohtani and Toyoda (1978), and Giles (1992). (5) Using the central-f critical values, as opposed to the values provided by Othani (1993), typically has a significant effect on the risk function; that is, there is a significant difference between the nominal and true sizes for the F -test. The distortion in size increases as ν 1,ν 2 decrease. (6) The PTE which uses c =1always strictly dominates the NPE. It is never optimal to ignore the prior information. 4. Concluding remarks In this paper we have examined the risks of estimators of the regression error variance after a preliminary test for homogeneity, when the disturbances in each sample are Mt but independent. In summary our investigation suggests that for large ν 1 and ν 2 the results under the independent Mt assumption are qualitatively similar to those under normal errors (e.g. Toyoda and Wallace (1975)) no strictly dominating estimator exists; the APE has the smallest

342 j. anderson and j. a. giles risk around the neighbourhood of the null hypothesis; and the PTE with c =1 strictly dominates the NPE. Secondly, our results suggest that for ν 2 ν 1 we have similar qualitative conclusions to Giles (1992) neither the NPE nor the APE can strictly dominate the other; and the PTE with c = 1 can strictly dominate both the APE and the NPE. Finally, if ν 1 >ν 2 the NPE strictly dominates the APE, but both are then strictly dominated by the PTE which uses c = 1. So, the optimal strategy when ν 1 >ν 2 is to pre-test with c =1. There remain a number of issues for future work. For example, it would be interesting to consider this problem with a different variance mixing distribution 4 ; to consider other pre-test problems under a similar disturbance assumption as used here; to investigate the choice of an optimal critical value according to some explicit optimality criterion; to assume that the disturbances are non-normal though identically independently distributed; and to consider the case where ν 1 and ν 2 are estimated rather than assumed to be known. References Anderson, J. and Giles, J. A. (1993). Pre-test estimation of the regression scale parameter with multivariate Student-t errors and independent sub-samples. Discussion Paper No. 9304, Department of Economics, University of Canterbury. Bancroft, T. A. (1944). On biases in estimation due to the use of preliminary tests of sgnificance. Ann. Math. Statist., 15, 190-204. Bancroft, T. A. and Han, C-P. (1983). A note on pooling variances. Jour. Amer. Statist. Assoc., 78, 981-983. Chmielewski, M. A. (1981). Invariant tests for the equality of K scale parameters under spherical symmetry. Jour. Statistical Planning and Inference, 5, 341-346. Clarke, J. A., Giles, D. E. A. and Wallace, T. D. (1987). Estimating the error variance in regression after a preliminary test of restrictions on the coefficients. Journal of Econometrics, 34, 293-304. Giles, J. A. (1990). Preliminary test estimation of a mis-specified linear model with spherically symmetric disturbances. Ph.D. thesis, University of Canterbury. (1992). Estimation of the error variance after a preliminary test of homogeneity in a regression model with spherically symmetric disturbances. Journal of Econometrics, 53, 345-361. Giles, J. A. and Giles, D. E. A. (1993). Pre-test estimation and testing in econometrics: Recent developments. Journal of Economic Surveys, 7, 145-197. King, M. L. (1979). Some aspects of statistical inference in the linear regression model. Ph.D. thesis, University of Canterbury. Ohtani, K. (1993). Testing for equality of error variables between two linear regression with independent multivariate t errors. Journal of Quantitative Economics, 9, 85-97. Ohtani, K. and Toyoda, T. (1978). Minimax regret critical values for a preliminary test in pooling variances. Journal of the Japan Statistical Society, 8, 15-20. 4 For example, Giles (1992) also considers the case where the mixing distribution is gamma.

pre-test estimation of regression scale parameters 343 Press, W. H., Flannery, B. P., Teukolsky, S. A. and Vetterling, W. T. (1986). Numerical Recipes : The Art of Scientific Computing, Cambridge University Press, New York. Toyoda, T. and Wallace, T. D. (1975). Estimation of variance after a preliminary test for homogeneity and optimal levels of significance for the pre-test, Journal of Econometrics, 3, 395-404. Department of Economics University of Victoria P. O. Box 3050, Victoria, BC Canada V8W 3P5.