Simultaneous Estimation in a Restricted Linear Model*

Similar documents
More Powerful Tests for Homogeneity of Multivariate Normal Mean Vectors under an Order Restriction

Stochastic Design Criteria in Linear Models

ORDER RESTRICTED STATISTICAL INFERENCE ON LORENZ CURVES OF PARETO DISTRIBUTIONS. Myongsik Oh. 1. Introduction

On the Efficiencies of Several Generalized Least Squares Estimators in a Seemingly Unrelated Regression Model and a Heteroscedastic Model

Estimation of two ordered normal means under modified Pitman nearness criterion

A Test for Order Restriction of Several Multivariate Normal Mean Vectors against all Alternatives when the Covariance Matrices are Unknown but Common

Testing Simple Hypotheses R.L. Wolpert Institute of Statistics and Decision Sciences Duke University, Box Durham, NC 27708, USA

arxiv: v1 [math.pr] 24 Sep 2009

Some Properties of the Augmented Lagrangian in Cone Constrained Optimization

COMPARISON OF FIVE TESTS FOR THE COMMON MEAN OF SEVERAL MULTIVARIATE NORMAL POPULATIONS

subroutine is demonstrated and its accuracy examined, and the nal Section is for concluding discussions The rst of two distribution functions given in

YONGDO LIM. 1. Introduction

Simulation of truncated normal variables. Christian P. Robert LSTA, Université Pierre et Marie Curie, Paris

On Sums of Conjugate Secondary Range k-hermitian Matrices

Abstract. In this article, several matrix norm inequalities are proved by making use of the Hiroshima 2003 result on majorization relations.

STRUCTURAL AND SPECTRAL PROPERTIES OF k-quasi- -PARANORMAL OPERATORS. Fei Zuo and Hongliang Zuo

Optimisation Problems for the Determinant of a Sum of 3 3 Matrices

Erich Lehmann's Contributions to Orderings of Probability Distributions

Journal of Multivariate Analysis. Sphericity test in a GMANOVA MANOVA model with normal error

A Proximal Method for Identifying Active Manifolds

Sociedad de Estadística e Investigación Operativa

y ˆ i = ˆ " T u i ( i th fitted value or i th fit)

Inferences on a Normal Covariance Matrix and Generalized Variance with Monotone Missing Data

OPTIMAL TRANSPORTATION PLANS AND CONVERGENCE IN DISTRIBUTION

McMaster University. Advanced Optimization Laboratory. Title: A Proximal Method for Identifying Active Manifolds. Authors: Warren L.

SEMIRINGS SATISFYING PROPERTIES OF DISTRIBUTIVE TYPE

A NOTE ON QUASI ISOMETRIES II. S.M. Patel Sardar Patel University, India

Estimation of parametric functions in Downton s bivariate exponential distribution

THE SHORTEST CONFIDENCE INTERVAL FOR PROPORTION IN FINITE POPULATIONS

FOR COMPACT CONVEX SETS

A note on the equality of the BLUPs for new observations under two linear models

SOME STABILITY RESULTS FOR THE SEMI-AFFINE VARIATIONAL INEQUALITY PROBLEM. 1. Introduction

On the simplest expression of the perturbed Moore Penrose metric generalized inverse

Fixed point of ϕ-contraction in metric spaces endowed with a graph

12x + 18y = 30? ax + by = m

Testing Statistical Hypotheses

CHARACTERIZATION OF (QUASI)CONVEX SET-VALUED MAPS

Rolle s Theorem and Negligibility of Points in Infinite Dimensional Banach Spaces*

SOME GENERALIZATION OF MINTY S LEMMA. Doo-Young Jung

On Some Estimates of the Remainder in Taylor s Formula

Lawrence D. Brown, T. Tony Cai and Anirban DasGupta

Mi-Hwa Ko. t=1 Z t is true. j=0

Goodness-of-Fit Tests for Time Series Models: A Score-Marked Empirical Process Approach

SHORTEST PERIODIC BILLIARD TRAJECTORIES IN CONVEX BODIES

A NOTE ON RATIONAL OPERATOR MONOTONE FUNCTIONS. Masaru Nagisa. Received May 19, 2014 ; revised April 10, (Ax, x) 0 for all x C n.

Mean squared error matrix comparison of least aquares and Stein-rule estimators for regression coefficients under non-normal disturbances

SOME VARIANTS OF LAGRANGE S FOUR SQUARES THEOREM

Inequalities Relating Addition and Replacement Type Finite Sample Breakdown Points

All Ramsey numbers for brooms in graphs

Studentization and Prediction in a Multivariate Normal Setting

COMMUTATIVITY OF OPERATORS

CLASSIFYING CAMINA GROUPS: ATHEOREMOFDARKANDSCOPPOLA

Large Sample Properties of Estimators in the Classical Linear Regression Model

An Introduction to Multivariate Statistical Analysis

ORTHOGONALLY INVARIANT ESTIMATION OF THE SKEW-SYMMETRIC NORMAL MEAN MATRIX SATOSHI KURIKI

SELECTING THE NORMAL POPULATION WITH THE SMALLEST COEFFICIENT OF VARIATION

Direction of Movement of the Element of Minimal Norm in a Moving Convex Set

THE HYPERBOLIC METRIC OF A RECTANGLE

Classifying Camina groups: A theorem of Dark and Scoppola

A CLASS OF SCHUR MULTIPLIERS ON SOME QUASI-BANACH SPACES OF INFINITE MATRICES

MINIMAL NUMBER OF GENERATORS AND MINIMUM ORDER OF A NON-ABELIAN GROUP WHOSE ELEMENTS COMMUTE WITH THEIR ENDOMORPHIC IMAGES

A Direct Proof of Caristi s Fixed Point Theorem

Journal of Inequalities in Pure and Applied Mathematics

du+ z f 2(u) , which generalized the result corresponding to the class of analytic functions given by Nash.

Stolz angle limit of a certain class of self-mappings of the unit disk

ORLICZ - PETTIS THEOREMS FOR MULTIPLIER CONVERGENT OPERATOR VALUED SERIES

TORIC WEAK FANO VARIETIES ASSOCIATED TO BUILDING SETS

DIRECTIONAL TESTS FOR ONE-SIDED ALTERNATIVES IN MULTIVARIATE MODELS 1. By Arthur Cohen and Harold B. Sackrowitz Rutgers University

Are Probabilities Used in Markets? 1

Singular Perturbation on a Subdomain*

A NOTE ON Q-ORDER OF CONVERGENCE

Research Article The Solution Set Characterization and Error Bound for the Extended Mixed Linear Complementarity Problem

Testing Statistical Hypotheses

CURVATURE VIA THE DE SITTER S SPACE-TIME

ON THE CONVERGENCE OF SERIES WITH RECURSIVELY DEFINED TERMS N. S. HOANG. 1. Introduction

Wiley. Methods and Applications of Linear Models. Regression and the Analysis. of Variance. Third Edition. Ishpeming, Michigan RONALD R.

Statistics & Decisions 7, (1989) R. Oldenbourg Verlag, München /89 $

Characterization of quadratic mappings through a functional inequality

Extending Theorems of Serret and Pavone

Testing Some Covariance Structures under a Growth Curve Model in High Dimension

PROJECTIONS ONTO CONES IN BANACH SPACES

The Hausdorff measure of a Sierpinski-like fractal

A TALE OF TWO CONFORMALLY INVARIANT METRICS

Some Results on the Multivariate Truncated Normal Distribution

Identifying Active Constraints via Partial Smoothness and Prox-Regularity

HILBERT l-class FIELD TOWERS OF. Hwanyup Jung

Abstract. We derive a sharp bound for the modulus of the Schwarzian derivative of concave univalent functions with opening angle at infinity

Arithmetic Analogues of Derivations

Spherically Symmetric Logistic Distribution

1 x i. i=1 EVEN NUMBERS RAFAEL ARCE-NAZARIO, FRANCIS N. CASTRO, AND RAÚL FIGUEROA

ON VARIANCE COVARIANCE COMPONENTS ESTIMATION IN LINEAR MODELS WITH AR(1) DISTURBANCES. 1. Introduction

ON POSITIVE SEMIDEFINITE PRESERVING STEIN TRANSFORMATION

On likelihood ratio tests

Bayesian Estimation of Regression Coefficients Under Extended Balanced Loss Function

SOME INEQUALITIES FOR THE EUCLIDEAN OPERATOR RADIUS OF TWO OPERATORS IN HILBERT SPACES

Topology Proceedings. COPYRIGHT c by Topology Proceedings. All rights reserved.

A Measure of Monotonicity of Two Random Variables

MULTIVARIATE ANALYSIS OF VARIANCE UNDER MULTIPLICITY José A. Díaz-García. Comunicación Técnica No I-07-13/ (PE/CIMAT)

Mohsen Pourahmadi. 1. A sampling theorem for multivariate stationary processes. J. of Multivariate Analysis, Vol. 13, No. 1 (1983),

ON D-OPTIMAL DESIGNS FOR ESTIMATING SLOPE

Transcription:

journal of multivariate analysis 61, 6166 (1997) article no. MV971657 Simultaneous Estimation in a Restricted Linear Model* C. Rueda, B. Salvador, and M. A. Ferna ndez Universidad de Valladolid, Spain We consider a linear normal model Y=X%+e with % verifying a linear restriction and the standard estimators % (unrestricted MLE) and %* (restricted MLE). We prove that %* is preferable to % using a new and strong criterion which implies the domination under other usual criteria; in particular it is proven that the standard simultaneous confidence intervals centered at %* have more confidence than those centered at %. 1997 Academic Press 1. INTRODUCTION We consider the estimation of the parameter vector in a linear model Y=X%+e, e ^ N k (0, _ 2 I), where X is a k_p complete range matrix and % is restricted to belong to a polyhedral cone ^/R p. The unrestricted maximum likelihood estimator (MLE) % for % is the standard estimator % =(X$X) &1 X$Y, % ^ N p (%, _ 2 (X$X) &1 ), and the restricted MLE, %*, is the projection of % on ^ for the metric given by 7=X$X: %*=P 7 (% ^). (See Robertson et al. [11] for this and other topics in restricted inference.) This general framework includes as a particular case the problem of comparing the means of independent normal populations, which is the standard model in the context of restricted inference, where % is the vector of the population means, % is the vector of sample means, and 7 is diagonal. We are interested in comparing the performance of % and %*. It is well known that when % is estimated globally, the restricted MLE is better than the unrestricted MLE under the quadratic error loss; however, using other loss functions we could obtain the opposite conclusion. Received October 18, 1995; revised July 31, 1996. AMS 1991 subject classifications: primary, 62F30; secondary, 62J05, 62J15. Key words and phrases: restricted estimator, simultaneous estimation, mean squared error, stochastic ordering, universal domination. * Research partially supported by the DGICYT (Direccio n General de Investigacio n Cient@ fica y Te cnica) under Grant PB 91-0197. 61 0047-259X97 25.00 Copyright 1997 by Academic Press All rights of reproduction in any form reserved.

62 RUEDA, SALVADOR, AND FERNA NDEZ For instance, when one is interested in estimating a single component % i or a linear function of the components, c$%, it depends upon ^ and c which one of the two estimators is better: in the case of independent normal populations and when ^ is the simple order cone, it has been proved that %* i dominates % i (Kelly [5] and Lee [7]); however, when ^ is the tree order cone, it has been proven by Lee [8] that % i * fails to dominate % i for estimating the root coordinate when p is large enough but also that for the other components %* i is better under suitable conditions. Other references, dealing with the estimation of c$% in normal restricted models, which may be also of interest are Cohen and Sackrowitz [1], Kushary and Cohen [6], Hwang and Peddada [4], and Rueda and Salvador [12]. On the other hand, in the context of linear models, the construction of confidence regions and simultaneous confidence intervals are of great interest and little is known regarding the problem of comparing the performance of %* and % in these cases. Some partial results for independent normal populations with a total order in their means are obtained by Marcus and Peritz [10] and Hayter [2], among others. The aim of this paper is to compare %* and % in a normal linear model, where ^ is a half-space, ^=[% # R p a$%0] under different criteria, including the performance, in simultaneous estimation. In Section 2 it is proven that %* performs better than % using a criterion which, as far as we know, is the strongest one used elsewhere. In fact, it implies that %* is preferable to % to estimate any linear combination of the components of the parameter vector individually or simultaneously. On the other hand, with the terminology introduced by Hwang [3], it implies that %* universally dominates %. For the model considered, the result obtained is very strong and definitely establishes the preference of the restricted MLE against the unrestricted MLE but we think that the number of constraints defining ^ is a factor of major importance and that a stronger result than what is proved here would be difficult to obtain in any other situation. 2. THE MAIN RESULT AND APPLICATIONS The main result in this section is Theorem 3, where it is proven that for any convex and symmetric set A/R p p(% &% # A)p(%*&% # A). This inequality, considered as a new domination criterion, establishes that %* is better than % to estimate %. It implies that % is dominated by %* for other usual criteria and is the basic result used to prove Corollary 4. It gives the necessary tool to show that the standard simultaneous estimation

SIMULTANEOUS ESTIMATION 63 procedures used in normal linear models can be improved by making use of the additional information about the parameter set. All these facts will be considered in the applications after the theorem. Before that we include two lemmas needed for the proof of Theorem 3. Lemma 1 is Theorem 4.2.2 in Tong [13]. It is quoted below for convenience. Lemma 2 proves an interesting property for the multivariate normal distribution that could have many other uses. Note that in the following we consider symmetric sets where the symmetry is always supposed to be about the origin. Lemma 1 (Tong, 1990). Let W ^ N p (%, 1), 1 nonsingular. Then for any finite m, any sets A 1,..., A m in R p, and any real numbers : 1,..., : m such that : i 0 and : i =1, we have p \W # : m i=1 : i A i+ `m i=1 [p(w # A i )] : i. Lemma 2. Let W ^ N p (%, 1), 1 nonsingular, a # R p and % verifying, a$%0. Then for any convex and symmetric set A/R p we have that g(z)=p(w&% # Aa$W=z) is nondecreasing for z0. Proof. Let V be the positive, symmetric square root of 1 &1 and make the change of variable U=V(W&%) to obtain p(w&% # Aa$W=z)=p(W&%# Aa$(W&%)=z&a$%) =p(u #Bb$U=z$), where B=VA is a convex and symmetric set, b=v &1 a, and z$= z&a$%0, from z0 and a$%0. Hence it suffices to prove the result for W ^ N p (0, I), in which case g(z) is a symmetric function. Now, consider an orthogonal transformation, Z=MW ^ N p (0, I), such that Z 1 =a$w and let be C=MA. Then g(z)=p(z #CZ 1 =z)=p((z 2,..., Z p )#C z ), where C z =[(z 2,..., z p )#R p&1 (z, z 2,..., z p )#C]. Let z<z$0, * # (0, 1), be such that z$=*z+(1&*)(&z) and B z$ = *C z +(1&*) C &z. From the convexity it is easy to prove that B z$ /C z$ ; therefore g(z$)=p((z 2,..., Z p )#C z$ )p((z 2,..., Z p )#B z$ ). (1)

64 RUEDA, SALVADOR, AND FERNA NDEZ Finally, from Lemma 1 and the symmetry of g(z) p((z 2,..., Z p )#B z$ )[p((z 2,..., Z p )#C z )] * [p((z 2,..., Z p )#C &z )] 1&* =p((z 2,..., Z p )#C z )=g(z) (2) and the result follows from (1) and (2). We now present the main result. K Theorem 3. For any convex and symmetric set A/R p we have p(% &% # A)p(%*&% # A). Proof. Let % a =P 7 (% a$x=0)=[i&7 &1 a(a$7 &1 a) &1 a$] % ; hence, %*= % a I [a$% <0] +% I [a$% 0]. It suffices to show that which follows from p(% &% # A, a$% <0)p(% a &% # A, a$% <0), p(% &% # A, a$% <0)= 0 p(% &% # Aa$% =z) df a$% (z) & 0 & p(% &% # Aa$% =0) df a$% (z) = 0 & p(% a &% # Aa$% =0) df a$% (z) = p(% a &% # A, a$% <0), where the inequality is obtained applying Lemma 2 and the last equality is a consequence of the independence between % a and a$%. K Corollary 4. Let R be the usual estimator of _ 2 and for each r>0, A(r) a convex and symmetric set, then p(% &% # A(R))p(%*&% # A(R)). Proof. The result follows directly from the fact that R is independent of % (and then also of %*) and from Theorem 3 conditioning in R. K The following applications resume the implications of Theorem 3 and Corollary 4 in order to establish the superiority of %* against % to estimate % or any linear function of % under different domination criteria. All these applications are valid for any vector a # R p, for example when it is known that % i % j for a pair of indexes i, j # [1, 2,..., p].

SIMULTANEOUS ESTIMATION 65 Application 1 (Coordinatewise estimation). Let A i =[x i # R x i t i ], t i >0, i # [1, 2,..., p]. Then from Theorem 3: p( % i&% i t i )p( % i *&% i t i ), i=1, 2,..., p. That is, % i *&% i is stochastically smaller than % i&% i for i=1, 2,..., p. From this it follows that for loss functions of the form l($; %)=\( $&% i ) ) with \(t) nonconstant and nondecreasing on t>0, l(% i *;% i ) is stochastically smaller than l(% i ; % i ) and, as a result, that %* is better than % under the quadratic error loss. In a similar way we also obtain that for any c # R p c$%*&c$% is stochastically smaller than c$% &c$%, which implies that %* is more ``concentrated'' about % than %, using the definition in Lehmann [9, p. 291]. (This result has been proved in Rueda and Salvador [12].) Application 2 (Simultaneous constant length confidence intervals). Let A= c # R p [ c$x t c ], where t c 0 for any c # R p. Then applying Theorem 3 to this set we have that p( c$% &c$% t c, \c#r p )p( c$%*&c$% t c, \c#r p ). Hence, the simultaneous constant length confidence intervals centered at the restricted estimator are superior to the intervals centered at the unrestricted estimator. Application 3 (Simultaneous confidence intervals of Scheffe type, Tukey type, or Bonferroni type). Let R be the standard estimator for _ 2 and for any r>0, define A(r)= c # R p [ c$x rt c ], where t c 0 for any c # R p. Now, applying Corollary 4 we have that p( c$% &c$% Rt c, \c # R p )p( c$%*&c$% Rt c, \c # R p ). Hence, the standard simultaneous confidence intervals are improved when a linear restriction holds for the parameter vector, using the restricted estimator, instead of the unrestricted estimator as the center point. Application 4 (Universal domination). Let B be a positive definite symmetric matrix and for any r>0, t>0 A(r)=[x # R p x$bxtr]. Again from Corollary 4 it follows that p((% &%)$ B(% &%)Rt)p((%*&%)$ B(%*&%)Rt). Using the terminology introduced by Hwang [3] of stochastic domination under generalized euclidean error and of universal domination, the result implies that %* universally dominates % and, therefore, that % is U-inadmissible.

66 RUEDA, SALVADOR, AND FERNA NDEZ ACKNOWLEDGMENTS The authors thank the referees for helpful comments. REFERENCES [1] Cohen, A., and Sackrowitz, H. B. (1970). Estimation of the last mean of a monotone sequence. Ann. Math. Statist. 41 20212034. [2] Hayter, A. J. (1990). A one-sided studientized range test for testing against a simple ordered alternative. J. Amer. Statist. Assoc. 85 778785. [3] Hwang, J. T. (1985). Universal domination and stochastic domination: Estimation simultaneously under a broad class of loss functions. Ann. Statist. 13 295314. [4] Hwang, J. T., and Peddada, S. D. (1994). Confidence interval estimation subject to order restrictions. Ann. Statist. 22 6793. [5] Kelly, R. E. (1989). Stochastic reduction of loss in estimating normal means by isotonic regression. Ann. Statist. 17 937940. [6] Kushary, D., and Cohen, A. (1989). Estimating ordered location and scale parameters. Statist. Decisions 7 201213. [7] Lee, C. C. (1981). The quadratic loss of isotonic regression under normality. Ann. Statist. 9 686688. [8] Lee, C. C. (1988). The quadratic loss of order restricted estimators for treatment means with a control. Ann. Statist. 16 751758. [9] Lehmann, E. L. (1983). Theory of Point Estimation. Wiley, New York. [10] Marcus, R., and Peritz, E. (1976). Some simultaneous confidence bounds in normal models with restricted alternatives. J. R. Statist. Soc. B 38 157165. [11] Robertson, T., Wright, F. T., and Dykstra, R. L. (1988). Order Restricted Statistical Inference. Wiley, New York. [12] Rueda, C., and Salvador, B. (1995). Reduction of risk using restricted estimators. Comm. Statistic. Theory Methods 24 10111022. [13] Tong, Y. L. (1990). The Multivariate Normal Distribution. Springer-Verlag, New York.