Multivariate Generalized Spatial Signed-Rank Methods
|
|
- Barry Curtis
- 6 years ago
- Views:
Transcription
1 Multivariate Generalized Spatial Signed-Rank Methods Jyrki Möttönen University of Tampere Hannu Oja University of Tampere and University of Jyvaskyla Robert Serfling 3 University of Texas at Dallas June 005 Final preprint version for Special Issue of Journal of Statistical Research 004, Volume 39, Number, pages 9-4 celebrating A. K. E. Saleh s tenure as Chief-Editor Department of Mathematics, Statistics and Philosophy, Statistics Unit, FIN-3304 University of Tampere, Finland. jyrki.mottonen@uta.fi. Tampere School of Public Health, FIN-3304 University of Tampere, Finland, and Department of Mathematics and Statistics, P.O. Box 35, MaD, FIN-4004 University of Jyvaskyla, Finland. Hannu.Oja@uta.fi. 3 Department of Mathematical Sciences, University of Texas at Dallas, Richardson, Texas , USA. serfling@utdallas.edu. Website: serfling. Support by National Science Foundation Grants DMS and CCF is gratefully acknowledged.
2 Abstract New multivariate generalized signed-rank tests for the one sample location model having favorable efficiency and robustness properties are introduced and studied. Limiting distributions of the tests and related estimates as well as formulae for asymptotic relative efficiencies are found. Relative efficiencies with respect to the classical Hotelling T test and the mean vector are evaluated for the multivariate normal, t, and Tukey models. While the tests estimates are only rotation invariant equivariant, versions that are affine invariant equivariant are discussed as well. AMS 000 Subject Classification: Primary 6G35 Secondary 6H05. Key words and phrases: Nonparametric; Multivariate; Spatial signed-ranks; One-sample location; Hodges-Lehmann estimation; Hypothesis tests; Asymptotic distributions; Asymptotic relative efficiency.
3 Introduction The purpose of this paper is to consider the robustness and efficiency properties of new multivariate generalized signed-rank tests that we introduce here and related multivariate generalized Hodges-Lehmann estimators that were proposed and studied by Chaudhuri 99. These tests and estimates provide more or less robust and efficient competitors of the classical Hotelling T test and the mean vector. Let Y {y,..., y n } be a random sample from a k-variate distribution with cdf F and pdf f symmetric around θ. We wish to estimate the unknown location centre θ and test the null hypothesis H 0 : θ 0. Möttönen and Oja 995 and later Möttönen, Oja, and Tienari 997 developed and studied multivariate spatial sign and signed-rank tests for this problem. The two test statistics they considered may be written in the form W mn n m n i n Sy i + + y im, i m for m giving the spatial sign test statistic or m giving the spatial signed-rank test statistic. Here the function { y Sy y if y 0 0 if y 0, where denotes the Euclidean norm in R k, denotes the spatial sign function of y R k. It is invariant under affine transformations except for those involving heterogeneous scale changes. In this paper, we consider the above statistics W mn for arbitrary m,,... and call them generalized signed-rank test statistics. Unlike the univariate case with m and, the test statistics W mn in general are only conditionally and asymptotically, but not strictly, distribution-free under the null hypothesis. The conditional sign-change tests are then based on the n under H 0 and symmetry equiprobable cases ±y,...,±y n. These test statistics are not affine invariant, which means that the obtained p-values depend on the chosen coordinate system. Related location estimators generalized Hodges-Lehmann estimators are obtained by a standard approach as follows. First, the observations are centered with respect to a candidate θ by setting z i y i θ, i,..., n. Next one calculates the value of the W mn statistic, say W mn θ, for the centered data. Finally, one obtains the estimator θ mn via the equation W mn θ mn 0. this yields the class of estimators studied by Chaudhuri 99. Two special cases are the multivariate spatial median m and multivariate spatial Hodges-Lehmann estimator m ; for these see also Möttönen and Oja 995. While the estimators ˆθ mn are found to have favorable efficiency and robustness properties, they are not fully affine equivariant, however.
4 In the paper, the efficiency and robustness properties of generalized spatial signed-rank tests and generalized Hodges-Lehmann estimates are considered in general, so as to include the cases m 3. It will be seen, for example, that efficiency at normal models increases with m, while robustness decreases. It is important, therefore, to establish benchmarks by which m may be selected so as to obtain a suitable trade-off between efficiency and robustness. The desired balance between these criteria depends, of course, upon the given context of application and the particular losses associated with lack of efficiency and lack of robustness. Numerical asymptotic relative efficiencies are provided for some specific models in Section 4. For m,,...,5, and for k,, 3, 4, 6 and 0, we compare the generalized spatial signed-rank test procedures with the classical Hotelling T test for the k-variate standard normal model and for k-variate t models with 3, 6, and 0 degrees of freedom see Tables -4. For example, for 6-variate standard normal, this ARE increases from 0.90 for m to for m 5. On the other hand, for the t distributions, the ARE s are all greater than, but do not follow a strict monotonicity pattern. We note, though, that for the 6-variate t distribution with 3 degrees of freedom, the ARE s decrease from.344 for m to.697 for m 5. Also, for the same range of m and k, we examine both the ARE of the tests and the bias of the estimators, for a family of Tukey models, i.e., contaminated multivariate normal models, with a range of contamination and spread parameters. Favorable robustness properties of our generalized signed-rank methods are indicated by their bounded influence functions and nonzero breakdown points see Section 3.. It is of interest that, independently of the dimension, the breakdown point decreases from 0.5 for m to 0.9 for m 5. The theoretical foundation underlying the above practical results is developed as follows. In Section we treat the asymptotic properties of our generalized signed-rank test procedures. These are obtained via an asymptotic equivalence Lemma with certain statistics of technically more convenient form expressed in terms of a multivariate generalized signed rank function, which we define and discuss. In this fashion we obtain chi-square limit distributions for Wald-type generalized signed-rank test statistics, both under the null hypothesis Theorem and under contiguous alternatives Theorem. The related estimators are discussed in Section 3: asymptotic normality of the estimators is provided by Theorem 3, and natural V-statistic estimators of the limiting covariance matrix are introduced. Proofs are relegated to the Appendix, which contains numerous technical lemmas taken from a more complete source, Möttönen 00. In Section 5 we discuss efficiency-robustness trade-offs, we describe the construction of fully affine invariant/equivariant versions, and we briefly make comparisons with selected other procedures. For the latter, an interesting finding is that the comparisons of competing procedures tend to depend upon the dimension.
5 Tests. Generalized spatial signed-ranks We first define what we mean by the multivariate generalized spatial signed-rank function: Definition Assume that y,...,y m, m,,..., is a random sample from a k-variate symmetrical distribution with symmetry centre 0 and cdf F. Then the population generalized spatial signed-rank function of order m is R m y E F {Sy y... y m }. If y,...,y n is a random sample from F, the corresponding empirical spatial signed-rank function of order m is R mn y ave{sy y i... y im }, where the average is taken over all possible m -sets of observations, that is, over all cases i <...<i m n. Note that, R y reduces to the spatial sign function and R y is the regular population spatial rank function F symmetric around the origin. Note also that R m y is the theoretical rank function for the distribution of y y m. See Möttönen and Oja 995. We could symmetrize R mn y, if desired, by taking the average over cases ±y,...,±y n, which would be useful when constructing sign-change tests. For spherical distributions, the generalized spatial signed-rank function at y r y and u y y is R m y q m ru, where q m r ϱ mr and ϱ m r E F y y... y m the expectation does not depend on u. As in Möttönen, Oja and Tienari 997, we can establish Proposition Assume that f is continuous and bounded. Then. Test statistics sup R mn y R m y P 0 y Definition The generalized signed-rank test statistic of order m for testing H 0 : θ 0 is the vector-valued V-statistic n n W mn n m... Sy i y im. i i m 3
6 An asymptotically equivalent version of this test statistic can be constructed using the generalized spatial ranks. Lemma Under the null hypothesis, W mn is asymptotically equivalent to T mn n n R m y i, i that is, nw mn T mn P 0. This in fact holds uniformly over all alternatives, since the model is a location model and the statistics are location equivariant. Under the null hypothesis, the expectation and variance of T mn are respectively, where E 0 T mn 0 and Var 0 T mn n B m, B m B m F E F [R m yr T my] is the generalized rank covariance matrix. For m and m, the regular spatial sign and rank covariance matrices are obtained. See Visuri et al The generalized rank covariance matrix B m may be consistently estimated by the corresponding sample counterpart sample rank covariance matrix, B mn ave{r mn y i R T mny i }. Note that related sign-change tests can be constructed, and that asymptotic tests may be constructed where the p-value is calculated using the limiting distribution. For brevity, we omit the details. Lemma Under the null hypothesis, B mn P B m, n. Theorem Under the null hypothesis, the limiting distribution of nt T mnb mnt mn is a chi-square distribution with k degrees of freedom. 4
7 .3 Formulae for Pitman efficiency For limiting efficiencies, we next consider a sequence of contiguous alternatives H n : θ n / δ. Write Ly logfy for the optimal location score function and assume that, under the null hypothesis, L n n [logfy i n / δ log fy i ] n / δ T U n δt I 0 δ + o P, i where U n n Ly i and I 0 E 0 LyL T y. i Note that L n is the log-likelihood test statistic H 0 vs. H, U n the optimal score statistic, and I 0 the expected Fisher information matrix for a single observation. Then, under the sequence of contiguous alternatives H n, the limiting distribution of n / T mn is N k A m δ,b m where A m E 0 R m yl T y and B m E 0 R m yr T m y. This can be derived as in Möttönen and Oja 995. Finally, the limiting distribution of the squared test statistic nt T mn B mn T mn follows. Theorem Under the sequence of alternatives H n, the limiting distribution of nt T mnb mnt mn is a noncentral chi-square distribution with k degrees of freedom and noncentrality parameter δ T A T mb m A m δ. Asymptotic Pitman efficiencies with respect to the optimal likelihood ratio test test are then simply δ T A T mbm A m δ. δ T I 0 δ This may be used in a standard way, as discussed, for example, in Hettmansperger and McKean
8 3 Estimators 3. Generalized Hodges-Lehmann estimators For completeness, we briefly treat the generalized Hodges-Lehmann estimators defined in Section. Lemma 3 Under general assumptions, nθmn na m W mn + o P Theorem 3 Under general assumptions, θ mn AN θ, n A m B m A m. The limiting variance can be estimated by estimating A m and B m separately. In Section. we gave a natural estimate of the generalized rank covariance matrix, namely the empirical rank covariance matrix B mn. To estimate A m, note that with ȳ m /my +...+y m we have [ ] A m E F ȳ m ȳm ȳ T m I k E. ȳ m 3 A natural estimate is then the V -statistic n n { [ȳiȳi A mn n m ȳi T I k ȳi 3 i i m where I i,...,i k and ȳi /my i y im. 3. Comparison of efficiency and robustness Comparison of asymptotic covariance matrices A m B ma m ]}, has been carried out in the multivariate normal case by Chaudhuri 99, in which case it is found that efficiency increases with m. The influence function of the functional θ m is found to be IFy; θ m,fa m R m y, which is bounded since R m y is bounded R m y. The breakdown point of ˆθ mn decreases, although slowly, as m increases, and is independent of the dimension. Specifically, BPˆθ mn / /m, which takes values 0.5, 0.93, 0.06, 0.59, 0.9 for m,, 3, 4, 5. 6
9 4 Asymptotic relative efficiency results for some specific models 4. Multivariate normal distributions The multivariate normal case has been considered earlier by Chaudhuri 99. We assume that y,...,y n is a random sample from the N0,I k distribution. If y r and u y y, then R m y q m ru where r q m r q m and qr r exp r / k+ Γ k + Γ k+ F ; k + ; r is the regular spatial signed-rank function Möttönen and Oja, 995. We then obtain see the Appendix for details A m Γ k+ k m Γ k I k, B m Γ k+ mk Γ k F, ; k + ; I m k, [ A T m B m A m F, ; k + ] ; I m k, [ ] ARE m, F, ; k + ; where ARE m is the Pitman efficiency of the generalized spatial signed-rank test with respect to Hotelling s T test. See Table for values of ARE m for selected values of k and m,,...,5. The notation F denotes the Gauss hypergeometric function. 4. Multivariate t distributions Let us now assume that y,...,y n is a random sample from a k-variate t-distribution with ν degrees of freedom, denoted by t k,ν. See the Appendix. The Pitman asymptotic relative efficiency of the generalized signed-rank test with respect to Hotelling s T is then ARE m µν + k kν [ { E q m r 7 r ν + r m }] [ E{q m r}],
10 Table : Asymptotic relative efficiencies of W mn with respect to T in the k-variate standard normal model, for selected values of k and m,,...,5. m k where r /k has a F k,ν distribution. See the Appendix for q m. Tables, 3 and 4 list the Pitman efficiencies of W mn with respect to T for t k,ν distributions with selected dimensions k and degrees of freedom ν, for m,,...,5. Table : Asymptotic relative efficiencies of W mn with respect to T in the case of the k- variate t distribution with ν 3 degrees of freedom, for selected values of k and m,,...,5. m k Multivariate Tukey distributions In this section we consider contaminated multivariate normal distributions, called here multivariate Tukey distributions. We say that the distribution of y is k-variate Tukeyɛ, µ, σ if its p.d.f. is fy ɛφy+ɛσ k φσ y µ, 8
11 Table 3: Asymptotic relative efficiencies of W mn with respect to T in the case of the k- variate t distribution with ν 6 degrees of freedom, for selected values of k and m,,...,5. m k Table 4: Asymptotic relative efficiencies of W mn with respect to T in the case of the k- variate t distribution with ν 0 degrees of freedom, for selected values of k and m,,...,5. m k where φy is the p.d.f. of N0,I k and ɛ [0, ]. This model proves useful in considering the asymptotic efficiency and asymptotic bias in the case that part of the data comes from a population of outliers. The problem is then to estimate the location vector of the population associated with the majority of the data. Now where q m r m h0 qr m h r exp r / ɛ h ɛ m h q k+ Γ k + Γ k+ F 9 r h µ m +σ h ; k + ; r,
12 is as in Section 4.. The optimal score function is then Ly ɛφy+ɛσ k φσ y ɛφy+ɛσ k φσ y y. See the Appendix for details on A m and B m. In Figure we exhibit the Pitman efficiencies of W mn with respect to T under k-variate Tukeyɛ, 0,σ models with various σ, ɛ, and k, for m,,...,5. It is seen that the ARE increases with dimension k and σ and exceeds for m. Figure exhibits the bias of the functional θ m, under k-variate Tukeyɛ, µ,σ models with various µ, σ, ɛ, and k, for m,,...,5. As expected, bias increases with µ. 5 Comparisons and further remarks 5. Trade-offs between efficiency and robustness We see that for low degrees of freedom, say ν 3, the ARE at t distributions becomes decreasing as m increases as soon as the dimension k is greater than. Thus the gain in ARE at the normal for higher values of m is paid for by a decrease in ARE at a heavy-tailed distribution such as the multivariate t with 3 degrees of freedom. For degrees of freedom ν 0, however, the dimension needs to be much higher for the case m to overpower the case m. This approach leads to a broad spectrum of trade-offs between efficiency at the normal and protection against heavier-tailed models. Depending on one s utility for efficiency versus protection, one might choose any one of the cases m,, 3, 4, or 5. Tables 4 provide some guidance for users to make such selections. For example, W mn for m 4 or 5 proves to be very attractive in comparison with the cases m or, in that a significant gain in efficiency at the normal model is achieved while also maintaining good efficiency at heaviertailed t distributions along with a reasonably high breakdown point. 5. Affine invariant/equivariant versions As mentioned earlier, the generalized signed-rank tests and related estimators are not fully affine invariant/equivariant, due to the use of the spatial sign function and spatial quantile function see Serfling, 004, for some discussion. Affine invariant/equivariant tests/estimates, however, may be constructed for the one-sample location problem using the well-known transformation-retransformation technique, as follows. Write C mn n m n i n Sy i y im S T y i y im. i m 0
13 a.3 k, epsilon0. b k, epsilon ARE ARE Sigma Sigma c.4 m m m3 m4 m5 k4, epsilon0. d.8 m m m3 m4 m5 k4, epsilon ARE.. ARE Sigma Sigma m m m3 m4 m5 m m m3 m4 m5 Figure : Asymptotic relative efficiencies of W mn with respect to T under k-variate Tukeyɛ, 0,σ models with various σ, ɛ, and k, for m,,...,5.
14 a 5 k, epsilon0., sigma b 5 k, epsilon0., sigma m5 m m3 bias m5 bias m4 m m3 m m m c 5 mu k4, epsilon0., sigma d 5 mu k4, epsilon0., sigma m5 m m3 bias m5 bias m4 m m3 m m m mu mu Figure : The bias of the functional θ m under k-variate Tukeyɛ, µ,σ models with various µ, σ, ɛ, and k, for m,,...,5.
15 For m, the spatial sign covariance matrix is obtained. See Visuri et al For invariant tests consider a symmetric positive definite k k-matrix V and standardize the observations by setting z i V / y i V / also symmetric. Then calculate C mn for the transformed data, and finally solve C mn V /ki k. The estimate ˆV mn is then unique up to a multiplicative constant and we can make the choice with Trace ˆV mn k. Finally, the invariant test statistic for the location problem is Ŵ mn, calculated from the ˆV / y i. For m, ˆV n is Tyler s scatter matrix estimate 987, and W n was introduced by Randles 000. Finally, for the affine equivariant location estimate, consider the standardized observations z i V / y i θ with location vector and scatter matrix candidates θ and V. Calculate the values of the W mn and C mn statistics for the standardized data and solve simultaneous equations W m,n θ,v0 and C mn θ,v k I k. The location estimate is then the generalized transformation-retransformation Hodges-Lehmann estimate. For m, the transformation-retransformation spatial median is obtained; Tyler s scatter matrix estimate is used in the transformation. See Chakraborty and Chaudhuri 998, Chakraborty, Chaudhuri, and Oja 998, and Hettmansperger and Randles Selected comparisons with other estimators Restricting to elliptically symmetric multivariate distributions, Hallin and Paindaveine 00 develop linear signed-rank type test statistics based on the interdirections of Randles 989. Model-based scores yield locally asymptotically maximin tests, which include the following: V: a new van der Wearden type statistic S: a sign statistic of Randles 989 W: a Wilcoxon statistic of Peters and Randles 990 t 3 : a statistic optimal for t with 3 degrees of freedom t 6 : a statistic optimal for t with 6 degrees of freedom t 5 : a statistic optimal for t with 5 degrees of freedom Let us staying within the elliptically symmetric framework compare V, S, W, t 3, t 6, and t 5 with our generalized spatial signed-rank tests GSSR,..., GSSR 5 corresponding to m,,...,5, respectively. Let us also include an affine invariant signed-rank test HMO of Hettmansperger, Möttönen, and Oja 997. As a quick-and-dirty criterion for comparison, we have compared ARE values for each test under the k-variate normal, t 3, and t 6 models and eliminate less competitive tests. We omit details and just indicate our findings. Interestingly, the comparisons depend upon the dimension k, as follows. 3
16 V, t 5, GSSR, GSSR 3, and GSSR 4 : good for all k HMO: good for k and k 6 GSSR 5 : good for k W: good for k GSSR : good for k 0 t 3 and t 6 : poor for all k Of course, many of these statistics including the GSSR statistics remain defined and attractive under broader distributional assumptions, such as central symmetry. It would be desirable to develop a more extensive comparison study to include other statistics, for example, the simpler, affine-invariant, multivariate, distribution-free sign test of Randles 000, to include other models such as the multivariate Tukey types, and to conclude robustness criteria such as breakdown points and influence functions. This is beyond the scope of the present paper and deferred to future work. 6 Acknowledgements Support of the third author by National Science Foundation Grants DMS and CCF is gratefully acknowledged. References [] Bose, A Bahadur representation of M m estimates. Ann. Statist [] Chakraborty, B., and Chaudhuri, P On an adaptive transformation retransformation estimate of multivariate location. J. Roy. Statist. Soc. B [3] Chakraborty, B., Chaudhuri, P., and Oja, H Operating transformation and retransformation on spatial median and angle test. Statist. Sinica [4] Chaudhuri, P. 99. Multivariate location estimation using extension of R-estimates through U-statistics type approach. Ann. Statist [5] Hallin, M. and Paindaveine, D. 00. Optimal tests for multivariate location based on interdirections and pseudo-mahalanobis ranks. Ann. Statist [6] Hettmansperger, T. P. and McKean, J. W Robust Nonparametric Statistical Methods. Arnold, London. 4
17 [7] Hettmansperger, T. P., M ott onen, J., and Oja, H Affine-invariant multivariate one-sample signed-rank tests J. Amer. Statist. Assoc [8] Hettmansperger, T. P., Nyblom, J. and Oja, H Affine invariant multivariate one-sample sign tests. J. R. Statist. Soc. B [9] Hettmansperger, T. P. and Randles, R. 00. A practical affine equivariant multivariate median. Biometrika [0] Möttönen, J. 00. Efficiency of the spatial rank test. Unpublished manuscript. [] Möttönen, J. and Oja, H Multivariate spatial sign and rank methods. J. Nonpar. Statist [] Möttönen, J., Oja, H. and Tienari, J On the efficiency of multivariate spatial sign and rank tests. Ann. Statist [3] Möttönen, J., Hettmansperger, T. P., Oja, H. and Tienari, J On the efficiency of affine invariant multivariate rank tests. J. Mult. Analy [4] Peters, D. and Randles, R. H A multivariate signed-rank test for the onesampled location problem. J. Amer. Statist. Assoc [5] Randles, R. H A distribution-free multivariate sign test based on interdirections. J. Amer. Statist. Assoc [6] Randles, R.000. A simpler, affine invariant, multivariate, distribution-free sign test. J. Amer. Statist. Assoc [7] Serfling, R Nonparametric multivariate descriptive measures based on spatial quantiles. J. Statist. Plann. and Inf [8] Visuri, S., Koivunen, V. and Oja, H Sign and rank covariance matrices. J. Statist. Plann. and Inf A Asymptotic relative efficiency evaluations for selected models Here we provide some of the basic lemmas underlying the ARE computations given in the paper. Due to space limitations, many details of proof and some further lemmas are omitted. An expanded version is available on the website serfling. Complete details are available in Möttönen 00. 5
18 A. Multivariate normal distribution Lemma 4 Let y,...,y m be independent N k 0,I k distributed random vectors and y 0,...,0,r T a fixed k-variate vector. Then r y s m χ k, m where s y y m. Proof. we have With s N k 0, m I k, [ s y s m m s k m + r s ] k m r m χ k. m Lemma 5 We have where ϱ m r r m ϱ, m ϱr / exp r k+ Γ k + Γ k F ; k ; r. Proof. ϱ m r E{ } r m χ k m m E{ } r χ k m r m ϱ m See Lemma of Möttönen 00. 6
19 Lemma 6 where and qr r exp r / R m ru q m ru r q m r q m k+ Γ k + Γ k+ F ; k + ; r Proof. q m r d dr ϱ mr m ϱ r m m r q m Lemma 7 Eq m rr k+ Γ /m Γ k 7
20 Proof. Eq m rr r E q r m E / r r exp m m / i0 Γ k+ + i Γ k+ + i i i! / Γ k+ + i Γ k+ + i i0 i i! k+ m Γ k k+ m m k+ Γ k k+ m m k+ /m m k+ Γ k+ Γ k i0 m E /+i Γ k+ + i Γ k+ + ii! r +i exp r m r Γ k ++i+i m /+i Γ k Γ k+ + i i i! m i0 k + k + Γ F 0 ; m Γ k+ Γ k m k+ m +k/++i m i Lemma 8 Eqmr Γ k+ mk Γ k F, ; k + ; m 8
21 Proof. Eqmr E q r m Γ k+ + iγ k+ [ + j r i+j+ ] Γ k+ + iγ k+ + j i0 j0 i+j+ i!j! E exp r m m Γ k+ + iγ k+ + j i+j+ Γ k + i + j + i+j+ Γ k+ + iγ k+ + j i0 j0 i+j+ i!j! m Γ k m +k/+i+j+ m Γ k m +k/+ Γ k+ + i + jγ k+ + iγ k+ + j i j Γ k+ + iγ k+ + ji!j! m + m + i0 j0 m Γ k F k + m +k/+ Γ k+, k + m Γ k F k + Γ k+ m kγ k, k + ; k + Γ k+ Γ k+, k + ; Γ k+ m +k/+ k Γ k, k + m+ m k+ m+ Γ k+ m k/ kγ k m k+ Γ k+ mk Γ k F ; k + ; m k/+ F m+ k + m+ m +,, k + m k+ k+ k+, ; k + ; m m + m + k+ ; k + ; m + m F, ; k + ; m k+ 9
22 Lemma 9 A m k Γ k+ m Γ k I k B m Γ k+ mk Γ k F A T mb m A m ARE m [ [ F, ; k + ; F, ; k + ;, ; k + ; ] I k m ] m I m k Proof. A m k Eq mrri k k m B m k Eq mri k Γ k+ mk Γ k F Γ k+ Γ k I k, ; k + ; I m k A T m B m A m k m Γ k+ [ Γ k k m F, ; k + ; kγ k [ Γ k+ ] I m k F, ; k + ; ] I m k ARE δt A T mbm A m δ δ T Σ δ δt A T mbm A m δ δ T Iδ [ ] δ T δ F, ; k+ ; m [ δ T δ F, ; k + ; ] m 0
23 A. Multivariate t distribution Definition 3 If x χ ν, z N k 0,I k, and x and z are independent, then y x/ν / z has k-variate t distribution with ν degrees of freedom y t ν,k. Lemma 0 Let y x /ν / z,...,y m x m /ν / z m be independent t ν,k distributed random vectors where x,...,x m,z,...,z m are independent with x i χ ν and z i N k 0,I k.ify 0,...,0,r T is a fixed k-variate vector and s y y m, then r y s x,...,x m σ χ k σ where σ σ x ν/x ν/x m. Proof. Suppose that x,...,x m are given. Then Es 0 and Vars σ I k where σ x /ν +...+x m /ν. Since the components of s are independent and normally distributed we get the result σ y s s σ s k σ + r s k r χ k σ σ Lemma ϱ m r / Γ k+ i0 + i r Γ k + ii! i i / ] E[ exp r σ σ Proof. E y s x,...,x m E σ i0 χ k r σ r exp r i Γ k+ + i σ σ σ i!γ k + i / i0 / Γ k+ + i r i Γ k + ii! exp r σ ϱ m r E[E y s x,...,x m ] Lemma R m ru q m ru where q m r d dr ϱ mr σ i /
24 A.3 Multivariate Tukey distribution Definition 4 We say that the distribution of y is k-variate T ukeyɛ, µ,σ distribution if the p.d.f. of y is gy ɛfy+ɛσ k fσ y µ, where fy is the p.d.f. of N k 0,I k -distribution and ɛ [0, ]. The c.d.f. of y is then Gy ɛf y+ɛf σ y µ. Lemma 3 Let v Bin,ɛ and x N k 0,I k be independent random variables. Then is T ukeyɛ, µ, σ distributed. y vx + vµ + σx vµ ++σ vx Lemma 4 Let y i v i µ++σ v i x i, i,...,m, be independent T ukeyɛ, µ,σ distributed random vectors. Then the conditional distribution of s m i y i given v,...,v m is N k hµ,a hi k where h m i v i and a h m +σ h. Proof. s m i m v i µ + +σ v i x i Es v,...,v m i m v i µ hµ i Vars v,...,v m m + σ v i I k i m + σ v i +σ vi I k i m + σ v i +σ v i I k i m + σ v i I k i m +σ hi k
25 Lemma 5 Let y 0,...,0,r T and µ 0,...,0,µ k T be fixed k-variate vectors. Then r y s v,...,v m a hχ hµk k ah Proof. a h y s s a h + + s k a h + r s k a h r sk E ah si E ah v,...,v m v,...,v m r hµ k ah 0 when i,...,k r a h y s v,...,v m χ hµk k ah Lemma 6 E y s v,...,v m / Γ k+ Γ k i0 + i exp r hµ k r hµk i ah + ii! a h a h Proof. Using Lemma 4 of Möttönen 00 we get exp r hµ k r hµ k i a E y s v,...,v m ah h a h Γ k+ + i i! Γ k / + i i0 / Γ k+ + i exp r hµ k r hµk i ah + ii! a h a h Lemma 7 i0 Γ k where ϱ m r E y s m m r ɛ h ɛ m h hµk ahϱ h ah h0 ϱr / exp r k+ Γ k + Γ k F ; k ; r 3
26 Proof. Since h Binm,ɛ we have ϱ m r E y s / Γ k+ + i Γ k + ii! E exp r hµ k r hµk i ah a h a h i0 / Γ k+ m + i Γ k + ii! exp r hµ k r hµk i ah a h a h i0 h0 m ɛ h ɛ m h h m m exp r hµ k r hµ k i ɛ h ɛ m h a ah h a h h i! h0 i0 m m r ɛ h ɛ m h hµk ahϱ h ah where h0 ϱr i0 exp r r i Γ k+ + i / i! + i / exp r See Lemma of Möttönen 00. Lemma 8 where Proof. h0 Γ k Γ k+ Γ k F k + ; k ; r m m r q m r ɛ h ɛ m h hµk q h ah qr r exp r / k+ Γ k + Γ k+ F ; k + ; r q m r d dr ϱ mr m m ɛ h ɛ m h ah d r h dr ϱ hµk ah h0 m m r ɛ h ɛ m h hµk q h ah h0 4 Γ k+ Γ k + i / + i
27 where qr / r exp r r See Lemma 3 of Möttönen 00. exp r / i0 Γ k+ + i + ii! r Γ k+ Γ k+ Γ k+ F k + Lemma 9 Let µ 0. The optimal score function is then Ly αy gy y i ; k + ; r where αy ɛfy+ɛσ k fσ y Proof. Ly d dy loggy d dy log ɛfy+ɛσ k fσ y ɛfy+ɛσ k fσ y [ ɛ d dy fy+ɛσ k d dy fσ y] ɛfy+ɛσ k fσ y [ ɛfyy ɛσ k fσ yy] ɛfy+ɛσ k fσ y ɛfy+ɛσ k fσ y y αy gy y Lemma 0 σr E q r ah σ σ + a h / Γ k+ Γ k 5
28 Proof. σr E q r ah [ σr E r exp σ r Γ k+ + i σ r i ] ah a h Γ k+ + ii! a h i0 Γ k+ + i i+ [ ] σ E r i+ exp σ r Γ k+ + ii! i0 ah a h Γ k+ + i i+ σ Γ k + i + i+ Γ k+ + ii! i0 ah Γ k σ a h +k/+i+ σ Γ k+ + i σ ah Γ k σ a h +k/+ i! σ + a h i0 σ Γ k+ k+ i σ i ah Γ k σ a h +k/+ i! σ + a h i0 σ Γ k+ k + σ ah Γ k σ a h +k/+ F 0 ; σ + a h σ ah Γ k σ Γ k+ ah Γ k σ σ + a h Γ k+ σ a h +k/+ / Γ k+ Γ k a h σ + a h σ k+ σ + a h k+ a k+ h σ + a h i See Lemma 8 of Möttönen 00. Lemma Let µ 0. Then A E[R m yl T y] ɛσ Eq m σrr+ ɛeq m rr k Γ k+ m { m ɛ h k Γ k ɛ m h h where r χ k 0. h0 I k ɛ σ + a h + ɛ / + a h / } I k 6
29 Proof. A E[R m yl T y] αy R gy R myy T gydy...dy k k αyr m yy T dy...dy k R k ɛ R m yy T fydy...dy k + ɛσ R m yy T σ k fσ ydy...dy k R k R k ɛ R m yy T fydy...dy k + ɛσ R k R m σyσy T fydy...dy k R k ɛer m yy T +ɛσ ER m σyy T ɛeq m ruru T +ɛσ Eq m σruru T ɛeq m rreuu T +ɛσ Eq m σrreuu T ɛ k Eq mrri k + ɛ k σ Eq m σrri k ɛeq mrr+ɛσ Eq m σrr I k k where r χ k 0. ɛσ Eq m σrr m m σr ɛ h+ ɛ m h E q r h ah σ h0 m m ɛ h+ ɛ m h σ / Γ k+ h σ + a h Γ k σ h0 m / m ɛ h+ ɛ m h Γ k+ h σ + a h Γ k h0 ɛeq m rr m m ɛ h ɛ m h E h h0 m m ɛ h ɛ m h h h0 r q ah +a h r / Γ k+ Γ k 7
30 A m m k+ ɛ h m h Γ ɛ { k h Γ k ɛ h0 Γ k+ m { m ɛ h k Γ k ɛ m h h h0 σ + a h Lemma The p.d.f of r is ft ɛgt+ɛ t σ g σ where gt is the p.d.f. of the χ k 0 distribution. Proof. where u Bin,ɛ and x T x χ k 0. r y T y +σ u x T x +σ ux T x / + ɛ +a h } I k ɛ σ + a h + ɛ / + a h / F t P r t P r t u P u +P r t u 0P u 0 ɛp σ χ k0 t+ ɛp χ k0 t t ɛg + ɛgt σ ft d dt F t ɛ t σ g + ɛgt σ Lemma 3 Let x be a random variable with the p.d.f. ft ɛgt+ɛ t σ g σ where gt is the p.d.f. of the χ k 0 distribution. Then where d> σ. Ex i Γ k exp dx ɛ + ii Γ k d + Γ k + ɛσi ii +k/+i Γ k dσ + k/+i 8 / } I k
31 Proof. Ex i exp dx 0 ɛ ɛ ɛ Lemma 4 When µ 0 Proof. B E[R m yr T my] x i exp dx ɛgx+ɛ σ xσ g dx x i exp dxgxdx + ɛ σ 0 x x i exp dxg dx σ x i exp dxgxdx + ɛ σ i y i exp dσ ygyσ dy σ 0 x i exp dxgxdx + ɛσ i y i exp dσ ygydy Γ k ɛ + ii Γ kd + Γ k + ɛσi ii +k/+i Γ k dσ + k/+i k E[q mr]i k m m m m ɛ h +h ɛ m h h k h h ah ah h 0 h 0 [ ɛ k+ x k+ y k+ + + a h a h k + F, k + ; k + ; x y + x y ɛσ k+ x k+ y k+ σ + σ + a h a h F k +, k + ; k + ; x y x y B E[R m yr T m y] E[q m ruq m ru T ] E[qmr]Euu T k E[q mr]i k 9 0 ] I k Γ k Γ k+ Γ k+
32 m qm r m h 0 h 0 m m h 0 h 0 m m h 0 h 0 m m h h m m h r ah ah exp i0 j0 m i0 h j0 ɛ h +h ɛ m h h q ɛ h +h ɛ m h h h r r ah exp r a h a h r i r Γ k+ + iγ k+ + j Γ k+ + iγ k+ m h + ji!j! a h ɛ h +h ɛ m h h a h ah ah Γ k+ + iγ k+ + j i Γ k+ + iγ k+ + ji!j! a h a h r i+j+ exp a h + r a h r q ah j j Eq mr m m h 0 h 0 m m i0 h j0 h ɛ h +h ɛ m h h Γ k+ + iγ k+ + j + iγ k+ + ji!j! Γ k+ [ ɛ ɛσ i+j+ ah ah i j a h a h Γ k + i + j + i+j+ k/+i+j+ + Γ k ah + ah + Γ k + i + j + ] i+j+ k/+i+j+ Γ k σ ah + σ ah + 30
33 where m Eqm r m h 0 h 0 m m h h [ ɛ k+ + + a h a h i0 j0 σ a h + i0 j0 ɛ h +h ɛ m h h Γ k+ + iγ k+ + jγ k+ ɛσ Γ k+ + iγ k+ + ji!j! k+ σ + a h + i + j x i y j + Γ k+ + iγ k+ + jγ k+ ] + i + j x i y j Γ k+ + iγ k+ + ji!j! ah ah Γ k x y x y a h a h + ah + a h a h + ah + σ a h a h + σ a h + σ a h a h + σ a h + 3
34 m Eqm r m h 0 h 0 m m [ h h ɛ k+ + + a h a h k + F, k + ɛσ σ + a h ɛ h +h ɛ m h h, k + σ +k+ a h k + F, k +, k + Γ k+ k+ Γ Γ k+ ; k +, k + ; x ; y + Γ k+ k+ Γ Γ k+ ; k +, k + ] ; x ; y ah ah Γ k m Eqm r m h 0 h 0 m m [ h h ɛ h +h ɛ m h h ah ah ɛ k+ x k+ y k+ + + a h a h k + F, k + ɛσ σ + a h F k + ; k + ; k+ σ + a h, k + ; k + ; x y + x y x k+ y k+ ] x y x y Γ k Γ k+ Γ k+ 3
Generalized Multivariate Rank Type Test Statistics via Spatial U-Quantiles
Generalized Multivariate Rank Type Test Statistics via Spatial U-Quantiles Weihua Zhou 1 University of North Carolina at Charlotte and Robert Serfling 2 University of Texas at Dallas Final revision for
More informationAFFINE INVARIANT MULTIVARIATE RANK TESTS FOR SEVERAL SAMPLES
Statistica Sinica 8(1998), 785-800 AFFINE INVARIANT MULTIVARIATE RANK TESTS FOR SEVERAL SAMPLES T. P. Hettmansperger, J. Möttönen and Hannu Oja Pennsylvania State University, Tampere University of Technology
More informationAsymptotic Relative Efficiency in Estimation
Asymptotic Relative Efficiency in Estimation Robert Serfling University of Texas at Dallas October 2009 Prepared for forthcoming INTERNATIONAL ENCYCLOPEDIA OF STATISTICAL SCIENCES, to be published by Springer
More informationPractical tests for randomized complete block designs
Journal of Multivariate Analysis 96 (2005) 73 92 www.elsevier.com/locate/jmva Practical tests for randomized complete block designs Ziyad R. Mahfoud a,, Ronald H. Randles b a American University of Beirut,
More informationTests Using Spatial Median
AUSTRIAN JOURNAL OF STATISTICS Volume 35 (2006), Number 2&3, 331 338 Tests Using Spatial Median Ján Somorčík Comenius University, Bratislava, Slovakia Abstract: The multivariate multi-sample location problem
More informationOn Invariant Within Equivalence Coordinate System (IWECS) Transformations
On Invariant Within Equivalence Coordinate System (IWECS) Transformations Robert Serfling Abstract In exploratory data analysis and data mining in the very common setting of a data set X of vectors from
More informationSymmetrised M-estimators of multivariate scatter
Journal of Multivariate Analysis 98 (007) 1611 169 www.elsevier.com/locate/jmva Symmetrised M-estimators of multivariate scatter Seija Sirkiä a,, Sara Taskinen a, Hannu Oja b a Department of Mathematics
More informationOn the limiting distributions of multivariate depth-based rank sum. statistics and related tests. By Yijun Zuo 2 and Xuming He 3
1 On the limiting distributions of multivariate depth-based rank sum statistics and related tests By Yijun Zuo 2 and Xuming He 3 Michigan State University and University of Illinois A depth-based rank
More informationNonparametric Methods for Multivariate Location Problems with Independent and Cluster Correlated Observations
JAAKKO NEVALAINEN Nonparametric Methods for Multivariate Location Problems with Independent and Cluster Correlated Observations ACADEMIC DISSERTATION To be presented, with the permission of the Faculty
More informationSigned-rank Tests for Location in the Symmetric Independent Component Model
Signed-rank Tests for Location in the Symmetric Independent Component Model Klaus Nordhausen a, Hannu Oja a Davy Paindaveine b a Tampere School of Public Health, University of Tampere, 33014 University
More informationInequalities Relating Addition and Replacement Type Finite Sample Breakdown Points
Inequalities Relating Addition and Replacement Type Finite Sample Breadown Points Robert Serfling Department of Mathematical Sciences University of Texas at Dallas Richardson, Texas 75083-0688, USA Email:
More informationMultivariate Distributions
IEOR E4602: Quantitative Risk Management Spring 2016 c 2016 by Martin Haugh Multivariate Distributions We will study multivariate distributions in these notes, focusing 1 in particular on multivariate
More informationMultivariate-sign-based high-dimensional tests for sphericity
Biometrika (2013, xx, x, pp. 1 8 C 2012 Biometrika Trust Printed in Great Britain Multivariate-sign-based high-dimensional tests for sphericity BY CHANGLIANG ZOU, LIUHUA PENG, LONG FENG AND ZHAOJUN WANG
More informationOptimal Procedures Based on Interdirections and pseudo-mahalanobis Ranks for Testing multivariate elliptic White Noise against ARMA Dependence
Optimal Procedures Based on Interdirections and pseudo-mahalanobis Rans for Testing multivariate elliptic White Noise against ARMA Dependence Marc Hallin and Davy Paindaveine Université Libre de Bruxelles,
More informationIndependent Component (IC) Models: New Extensions of the Multinormal Model
Independent Component (IC) Models: New Extensions of the Multinormal Model Davy Paindaveine (joint with Klaus Nordhausen, Hannu Oja, and Sara Taskinen) School of Public Health, ULB, April 2008 My research
More informationInvariant coordinate selection for multivariate data analysis - the package ICS
Invariant coordinate selection for multivariate data analysis - the package ICS Klaus Nordhausen 1 Hannu Oja 1 David E. Tyler 2 1 Tampere School of Public Health University of Tampere 2 Department of Statistics
More informationElliptically Contoured Distributions
Elliptically Contoured Distributions Recall: if X N p µ, Σ), then { 1 f X x) = exp 1 } det πσ x µ) Σ 1 x µ) So f X x) depends on x only through x µ) Σ 1 x µ), and is therefore constant on the ellipsoidal
More informationMultivariate Statistical Analysis
Multivariate Statistical Analysis Fall 2011 C. L. Williams, Ph.D. Lecture 9 for Applied Multivariate Analysis Outline Addressing ourliers 1 Addressing ourliers 2 Outliers in Multivariate samples (1) For
More informationOn Multivariate Runs Tests. for Randomness
On Multivariate Runs Tests for Randomness Davy Paindaveine Université Libre de Bruxelles, Brussels, Belgium Abstract This paper proposes several extensions of the concept of runs to the multivariate setup,
More informationPackage SpatialNP. June 5, 2018
Type Package Package SpatialNP June 5, 2018 Title Multivariate Nonparametric Methods Based on Spatial Signs and Ranks Version 1.1-3 Date 2018-06-05 Author Seija Sirkia, Jari Miettinen, Klaus Nordhausen,
More informationCommentary on Basu (1956)
Commentary on Basu (1956) Robert Serfling University of Texas at Dallas March 2010 Prepared for forthcoming Selected Works of Debabrata Basu (Anirban DasGupta, Ed.), Springer Series on Selected Works in
More informationMiscellanea Multivariate sign-based high-dimensional tests for sphericity
Biometrika (2014), 101,1,pp. 229 236 doi: 10.1093/biomet/ast040 Printed in Great Britain Advance Access publication 13 November 2013 Miscellanea Multivariate sign-based high-dimensional tests for sphericity
More informationAsymptotic Statistics-VI. Changliang Zou
Asymptotic Statistics-VI Changliang Zou Kolmogorov-Smirnov distance Example (Kolmogorov-Smirnov confidence intervals) We know given α (0, 1), there is a well-defined d = d α,n such that, for any continuous
More informationGeneralized nonparametric tests for one-sample location problem based on sub-samples
ProbStat Forum, Volume 5, October 212, Pages 112 123 ISSN 974-3235 ProbStat Forum is an e-journal. For details please visit www.probstat.org.in Generalized nonparametric tests for one-sample location problem
More informationAsymmetric least squares estimation and testing
Asymmetric least squares estimation and testing Whitney Newey and James Powell Princeton University and University of Wisconsin-Madison January 27, 2012 Outline ALS estimators Large sample properties Asymptotic
More informationKRUSKAL-WALLIS ONE-WAY ANALYSIS OF VARIANCE BASED ON LINEAR PLACEMENTS
Bull. Korean Math. Soc. 5 (24), No. 3, pp. 7 76 http://dx.doi.org/34/bkms.24.5.3.7 KRUSKAL-WALLIS ONE-WAY ANALYSIS OF VARIANCE BASED ON LINEAR PLACEMENTS Yicheng Hong and Sungchul Lee Abstract. The limiting
More informationOn robust and efficient estimation of the center of. Symmetry.
On robust and efficient estimation of the center of symmetry Howard D. Bondell Department of Statistics, North Carolina State University Raleigh, NC 27695-8203, U.S.A (email: bondell@stat.ncsu.edu) Abstract
More informationRobust Subspace DOA Estimation for Wireless Communications
Robust Subspace DOA Estimation for Wireless Communications Samuli Visuri Hannu Oja ¾ Visa Koivunen Laboratory of Signal Processing Computer Technology Helsinki Univ. of Technology P.O. Box 3, FIN-25 HUT
More informationOPERATING TRANSFORMATION RETRANSFORMATION ON SPATIAL MEDIAN AND ANGLE TEST
Statistica Sinica 8(1998), 767-784 OPERATING TRANSFORMATION RETRANSFORMATION ON SPATIAL MEDIAN AND ANGLE TEST Biman Chakraborty, Probal Chaudhuri and Hannu Oja Indian Statistical Institute and University
More informationApplication of Variance Homogeneity Tests Under Violation of Normality Assumption
Application of Variance Homogeneity Tests Under Violation of Normality Assumption Alisa A. Gorbunova, Boris Yu. Lemeshko Novosibirsk State Technical University Novosibirsk, Russia e-mail: gorbunova.alisa@gmail.com
More informationf(x µ, σ) = b 2σ a = cos t, b = sin t/t, π < t 0, a = cosh t, b = sinh t/t, t > 0,
R-ESTIMATOR OF LOCATION OF THE GENERALIZED SECANT HYPERBOLIC DIS- TRIBUTION O.Y.Kravchuk School of Physical Sciences and School of Land and Food Sciences University of Queensland Brisbane, Australia 3365-2171
More informationMULTICHANNEL SIGNAL PROCESSING USING SPATIAL RANK COVARIANCE MATRICES
MULTICHANNEL SIGNAL PROCESSING USING SPATIAL RANK COVARIANCE MATRICES S. Visuri 1 H. Oja V. Koivunen 1 1 Signal Processing Lab. Dept. of Statistics Tampere Univ. of Technology University of Jyväskylä P.O.
More informationTesting Statistical Hypotheses
E.L. Lehmann Joseph P. Romano Testing Statistical Hypotheses Third Edition 4y Springer Preface vii I Small-Sample Theory 1 1 The General Decision Problem 3 1.1 Statistical Inference and Statistical Decisions
More information5 Introduction to the Theory of Order Statistics and Rank Statistics
5 Introduction to the Theory of Order Statistics and Rank Statistics This section will contain a summary of important definitions and theorems that will be useful for understanding the theory of order
More informationMultivariate quantiles and conditional depth
M. Hallin a,b, Z. Lu c, D. Paindaveine a, and M. Šiman d a Université libre de Bruxelles, Belgium b Princenton University, USA c University of Adelaide, Australia d Institute of Information Theory and
More informationMultivariate Signed-Rank Tests in Vector Autoregressive Order Identification
Statistical Science 2004, Vol 9, No 4, 697 7 DOI 024/088342304000000602 Institute of Mathematical Statistics, 2004 Multivariate Signed-Ran Tests in Vector Autoregressive Order Identification Marc Hallin
More informationComputationally Easy Outlier Detection via Projection Pursuit with Finitely Many Directions
Computationally Easy Outlier Detection via Projection Pursuit with Finitely Many Directions Robert Serfling 1 and Satyaki Mazumder 2 University of Texas at Dallas and Indian Institute of Science, Education
More informationModel Specification Testing in Nonparametric and Semiparametric Time Series Econometrics. Jiti Gao
Model Specification Testing in Nonparametric and Semiparametric Time Series Econometrics Jiti Gao Department of Statistics School of Mathematics and Statistics The University of Western Australia Crawley
More informationSummary of Chapters 7-9
Summary of Chapters 7-9 Chapter 7. Interval Estimation 7.2. Confidence Intervals for Difference of Two Means Let X 1,, X n and Y 1, Y 2,, Y m be two independent random samples of sizes n and m from two
More informationApplication of Parametric Homogeneity of Variances Tests under Violation of Classical Assumption
Application of Parametric Homogeneity of Variances Tests under Violation of Classical Assumption Alisa A. Gorbunova and Boris Yu. Lemeshko Novosibirsk State Technical University Department of Applied Mathematics,
More informationScatter Matrices and Independent Component Analysis
AUSTRIAN JOURNAL OF STATISTICS Volume 35 (2006), Number 2&3, 175 189 Scatter Matrices and Independent Component Analysis Hannu Oja 1, Seija Sirkiä 2, and Jan Eriksson 3 1 University of Tampere, Finland
More informationA Very Brief Summary of Statistical Inference, and Examples
A Very Brief Summary of Statistical Inference, and Examples Trinity Term 2008 Prof. Gesine Reinert 1 Data x = x 1, x 2,..., x n, realisations of random variables X 1, X 2,..., X n with distribution (model)
More informationCONTRIBUTIONS TO THE THEORY AND APPLICATIONS OF STATISTICAL DEPTH FUNCTIONS
CONTRIBUTIONS TO THE THEORY AND APPLICATIONS OF STATISTICAL DEPTH FUNCTIONS APPROVED BY SUPERVISORY COMMITTEE: Robert Serfling, Chair Larry Ammann John Van Ness Michael Baron Copyright 1998 Yijun Zuo All
More informationMinimum distance tests and estimates based on ranks
Minimum distance tests and estimates based on ranks Authors: Radim Navrátil Department of Mathematics and Statistics, Masaryk University Brno, Czech Republic (navratil@math.muni.cz) Abstract: It is well
More informationTesting Statistical Hypotheses
E.L. Lehmann Joseph P. Romano, 02LEu1 ttd ~Lt~S Testing Statistical Hypotheses Third Edition With 6 Illustrations ~Springer 2 The Probability Background 28 2.1 Probability and Measure 28 2.2 Integration.........
More informationSupplementary Material for Wang and Serfling paper
Supplementary Material for Wang and Serfling paper March 6, 2017 1 Simulation study Here we provide a simulation study to compare empirically the masking and swamping robustness of our selected outlyingness
More informationA Multivariate Two-Sample Mean Test for Small Sample Size and Missing Data
A Multivariate Two-Sample Mean Test for Small Sample Size and Missing Data Yujun Wu, Marc G. Genton, 1 and Leonard A. Stefanski 2 Department of Biostatistics, School of Public Health, University of Medicine
More informationChernoff-Savage and Hodges-Lehmann results for Wilks test of multivariate independence
IMS Collections Beyond Parametrics in Interdisciplinary Research: Festschrift in Honor of Professor Pranab K. Sen Vol. 1 (008) 184 196 c Institute of Mathematical Statistics, 008 DOI: 10.114/193940307000000130
More informationarxiv: v1 [stat.me] 14 Jan 2019
arxiv:1901.04443v1 [stat.me] 14 Jan 2019 An Approach to Statistical Process Control that is New, Nonparametric, Simple, and Powerful W.J. Conover, Texas Tech University, Lubbock, Texas V. G. Tercero-Gómez,Tecnológico
More informationsimple if it completely specifies the density of x
3. Hypothesis Testing Pure significance tests Data x = (x 1,..., x n ) from f(x, θ) Hypothesis H 0 : restricts f(x, θ) Are the data consistent with H 0? H 0 is called the null hypothesis simple if it completely
More informationEfficient and Robust Scale Estimation
Efficient and Robust Scale Estimation Garth Tarr, Samuel Müller and Neville Weber School of Mathematics and Statistics THE UNIVERSITY OF SYDNEY Outline Introduction and motivation The robust scale estimator
More informationRecall that in order to prove Theorem 8.8, we argued that under certain regularity conditions, the following facts are true under H 0 : 1 n
Chapter 9 Hypothesis Testing 9.1 Wald, Rao, and Likelihood Ratio Tests Suppose we wish to test H 0 : θ = θ 0 against H 1 : θ θ 0. The likelihood-based results of Chapter 8 give rise to several possible
More informationrobustness, efficiency, breakdown point, outliers, rank-based procedures, least absolute regression
Robust Statistics robustness, efficiency, breakdown point, outliers, rank-based procedures, least absolute regression University of California, San Diego Instructor: Ery Arias-Castro http://math.ucsd.edu/~eariasca/teaching.html
More informationResearch Article A Nonparametric Two-Sample Wald Test of Equality of Variances
Advances in Decision Sciences Volume 211, Article ID 74858, 8 pages doi:1.1155/211/74858 Research Article A Nonparametric Two-Sample Wald Test of Equality of Variances David Allingham 1 andj.c.w.rayner
More informationPart IB Statistics. Theorems with proof. Based on lectures by D. Spiegelhalter Notes taken by Dexter Chua. Lent 2015
Part IB Statistics Theorems with proof Based on lectures by D. Spiegelhalter Notes taken by Dexter Chua Lent 2015 These notes are not endorsed by the lecturers, and I have modified them (often significantly)
More informationJournal of Statistical Software
JSS Journal of Statistical Software July 2011, Volume 43, Issue 5. http://www.jstatsoft.org/ Multivariate L 1 Methods: The Package MNM Klaus Nordhausen University of Tampere Hannu Oja University of Tampere
More informationChapter 7, continued: MANOVA
Chapter 7, continued: MANOVA The Multivariate Analysis of Variance (MANOVA) technique extends Hotelling T 2 test that compares two mean vectors to the setting in which there are m 2 groups. We wish to
More informationDistribution-Free Tests for Two-Sample Location Problems Based on Subsamples
3 Journal of Advanced Statistics Vol. No. March 6 https://dx.doi.org/.66/jas.6.4 Distribution-Free Tests for Two-Sample Location Problems Based on Subsamples Deepa R. Acharya and Parameshwar V. Pandit
More informationLecture 2: Basic Concepts and Simple Comparative Experiments Montgomery: Chapter 2
Lecture 2: Basic Concepts and Simple Comparative Experiments Montgomery: Chapter 2 Fall, 2013 Page 1 Random Variable and Probability Distribution Discrete random variable Y : Finite possible values {y
More informationON THE MAXIMUM BIAS FUNCTIONS OF MM-ESTIMATES AND CONSTRAINED M-ESTIMATES OF REGRESSION
ON THE MAXIMUM BIAS FUNCTIONS OF MM-ESTIMATES AND CONSTRAINED M-ESTIMATES OF REGRESSION BY JOSE R. BERRENDERO, BEATRIZ V.M. MENDES AND DAVID E. TYLER 1 Universidad Autonoma de Madrid, Federal University
More informationASSESSING A VECTOR PARAMETER
SUMMARY ASSESSING A VECTOR PARAMETER By D.A.S. Fraser and N. Reid Department of Statistics, University of Toronto St. George Street, Toronto, Canada M5S 3G3 dfraser@utstat.toronto.edu Some key words. Ancillary;
More information401 Review. 6. Power analysis for one/two-sample hypothesis tests and for correlation analysis.
401 Review Major topics of the course 1. Univariate analysis 2. Bivariate analysis 3. Simple linear regression 4. Linear algebra 5. Multiple regression analysis Major analysis methods 1. Graphical analysis
More informationSTAT 512 sp 2018 Summary Sheet
STAT 5 sp 08 Summary Sheet Karl B. Gregory Spring 08. Transformations of a random variable Let X be a rv with support X and let g be a function mapping X to Y with inverse mapping g (A = {x X : g(x A}
More informationMultivariate Distribution Models
Multivariate Distribution Models Model Description While the probability distribution for an individual random variable is called marginal, the probability distribution for multiple random variables is
More informationRegression and Statistical Inference
Regression and Statistical Inference Walid Mnif wmnif@uwo.ca Department of Applied Mathematics The University of Western Ontario, London, Canada 1 Elements of Probability 2 Elements of Probability CDF&PDF
More informationAsymptotic Statistics-III. Changliang Zou
Asymptotic Statistics-III Changliang Zou The multivariate central limit theorem Theorem (Multivariate CLT for iid case) Let X i be iid random p-vectors with mean µ and and covariance matrix Σ. Then n (
More informationMaster s Written Examination
Master s Written Examination Option: Statistics and Probability Spring 016 Full points may be obtained for correct answers to eight questions. Each numbered question which may have several parts is worth
More informationHypothesis Testing. 1 Definitions of test statistics. CB: chapter 8; section 10.3
Hypothesis Testing CB: chapter 8; section 0.3 Hypothesis: statement about an unknown population parameter Examples: The average age of males in Sweden is 7. (statement about population mean) The lowest
More informationComputational rank-based statistics
Article type: Advanced Review Computational rank-based statistics Joseph W. McKean, joseph.mckean@wmich.edu Western Michigan University Jeff T. Terpstra, jeff.terpstra@ndsu.edu North Dakota State University
More informationJyh-Jen Horng Shiau 1 and Lin-An Chen 1
Aust. N. Z. J. Stat. 45(3), 2003, 343 352 A MULTIVARIATE PARALLELOGRAM AND ITS APPLICATION TO MULTIVARIATE TRIMMED MEANS Jyh-Jen Horng Shiau 1 and Lin-An Chen 1 National Chiao Tung University Summary This
More informationSingle Index Quantile Regression for Heteroscedastic Data
Single Index Quantile Regression for Heteroscedastic Data E. Christou M. G. Akritas Department of Statistics The Pennsylvania State University JSM, 2015 E. Christou, M. G. Akritas (PSU) SIQR JSM, 2015
More informationBest linear unbiased and invariant reconstructors for the past records
Best linear unbiased and invariant reconstructors for the past records B. Khatib and Jafar Ahmadi Department of Statistics, Ordered and Spatial Data Center of Excellence, Ferdowsi University of Mashhad,
More informationMULTIVARIATE SYMMETRY AND ASYMME- TRY
MULTIVARIATE SYMMETRY AND ASYMME- TRY (By Robert Serfling, January 2003, for Encyclopedia of Statistical Sciences, 2nd ed.) Abstract. Univariate symmetry has interesting and diverse forms of generalization
More informationA nonparametric two-sample wald test of equality of variances
University of Wollongong Research Online Faculty of Informatics - Papers (Archive) Faculty of Engineering and Information Sciences 211 A nonparametric two-sample wald test of equality of variances David
More informationOptimum designs for model. discrimination and estimation. in Binary Response Models
Optimum designs for model discrimination and estimation in Binary Response Models by Wei-Shan Hsieh Advisor Mong-Na Lo Huang Department of Applied Mathematics National Sun Yat-sen University Kaohsiung,
More informationCharacteristics of multivariate distributions and the invariant coordinate system
Characteristics of multivariate distributions the invariant coordinate system Pauliina Ilmonen, Jaakko Nevalainen, Hannu Oja To cite this version: Pauliina Ilmonen, Jaakko Nevalainen, Hannu Oja. Characteristics
More informationThe purpose of this section is to derive the asymptotic distribution of the Pearson chi-square statistic. k (n j np j ) 2. np j.
Chapter 9 Pearson s chi-square test 9. Null hypothesis asymptotics Let X, X 2, be independent from a multinomial(, p) distribution, where p is a k-vector with nonnegative entries that sum to one. That
More informationComment. February 1, 2008
Persi Diaconis Stanford University Erich Lehmann UC Berkeley February 1, 2008 Sandy Zabell gives us a good feel for Student and his times. We supplement this with later developments showing that Student
More informationA Very Brief Summary of Statistical Inference, and Examples
A Very Brief Summary of Statistical Inference, and Examples Trinity Term 2009 Prof. Gesine Reinert Our standard situation is that we have data x = x 1, x 2,..., x n, which we view as realisations of random
More informationRobust estimation of scale and covariance with P n and its application to precision matrix estimation
Robust estimation of scale and covariance with P n and its application to precision matrix estimation Garth Tarr, Samuel Müller and Neville Weber USYD 2013 School of Mathematics and Statistics THE UNIVERSITY
More informationEmpirical Likelihood Tests for High-dimensional Data
Empirical Likelihood Tests for High-dimensional Data Department of Statistics and Actuarial Science University of Waterloo, Canada ICSA - Canada Chapter 2013 Symposium Toronto, August 2-3, 2013 Based on
More informationMultivariate Statistics
Multivariate Statistics Chapter 2: Multivariate distributions and inference Pedro Galeano Departamento de Estadística Universidad Carlos III de Madrid pedro.galeano@uc3m.es Course 2016/2017 Master in Mathematical
More informationRank-Based Estimation and Associated Inferences. for Linear Models with Cluster Correlated Errors
Rank-Based Estimation and Associated Inferences for Linear Models with Cluster Correlated Errors John D. Kloke Bucknell University Joseph W. McKean Western Michigan University M. Mushfiqur Rashid FDA Abstract
More informationHypothesis Testing Based on the Maximum of Two Statistics from Weighted and Unweighted Estimating Equations
Hypothesis Testing Based on the Maximum of Two Statistics from Weighted and Unweighted Estimating Equations Takeshi Emura and Hisayuki Tsukuma Abstract For testing the regression parameter in multivariate
More informationThomas J. Fisher. Research Statement. Preliminary Results
Thomas J. Fisher Research Statement Preliminary Results Many applications of modern statistics involve a large number of measurements and can be considered in a linear algebra framework. In many of these
More informationRobust Optimal Tests for Causality in Multivariate Time Series
Robust Optimal Tests for Causality in Multivariate Time Series Abdessamad Saidi and Roch Roy Abstract Here, we derive optimal rank-based tests for noncausality in the sense of Granger between two multivariate
More informationFall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.
1. Let P be a probability measure on a collection of sets A. (a) For each n N, let H n be a set in A such that H n H n+1. Show that P (H n ) monotonically converges to P ( k=1 H k) as n. (b) For each n
More informationStatistical Inference: Estimation and Confidence Intervals Hypothesis Testing
Statistical Inference: Estimation and Confidence Intervals Hypothesis Testing 1 In most statistics problems, we assume that the data have been generated from some unknown probability distribution. We desire
More informationROBUST ESTIMATION OF A CORRELATION COEFFICIENT: AN ATTEMPT OF SURVEY
ROBUST ESTIMATION OF A CORRELATION COEFFICIENT: AN ATTEMPT OF SURVEY G.L. Shevlyakov, P.O. Smirnov St. Petersburg State Polytechnic University St.Petersburg, RUSSIA E-mail: Georgy.Shevlyakov@gmail.com
More informationCOMPARISON OF FIVE TESTS FOR THE COMMON MEAN OF SEVERAL MULTIVARIATE NORMAL POPULATIONS
Communications in Statistics - Simulation and Computation 33 (2004) 431-446 COMPARISON OF FIVE TESTS FOR THE COMMON MEAN OF SEVERAL MULTIVARIATE NORMAL POPULATIONS K. Krishnamoorthy and Yong Lu Department
More informationJoseph W. McKean 1. INTRODUCTION
Statistical Science 2004, Vol. 19, No. 4, 562 570 DOI 10.1214/088342304000000549 Institute of Mathematical Statistics, 2004 Robust Analysis of Linear Models Joseph W. McKean Abstract. This paper presents
More informationStatistical Inference of Covariate-Adjusted Randomized Experiments
1 Statistical Inference of Covariate-Adjusted Randomized Experiments Feifang Hu Department of Statistics George Washington University Joint research with Wei Ma, Yichen Qin and Yang Li Email: feifang@gwu.edu
More informationGLM Repeated Measures
GLM Repeated Measures Notation The GLM (general linear model) procedure provides analysis of variance when the same measurement or measurements are made several times on each subject or case (repeated
More informationA NONPARAMETRIC TEST FOR HOMOGENEITY: APPLICATIONS TO PARAMETER ESTIMATION
Change-point Problems IMS Lecture Notes - Monograph Series (Volume 23, 1994) A NONPARAMETRIC TEST FOR HOMOGENEITY: APPLICATIONS TO PARAMETER ESTIMATION BY K. GHOUDI AND D. MCDONALD Universite' Lava1 and
More informationMaster s Written Examination
Master s Written Examination Option: Statistics and Probability Spring 05 Full points may be obtained for correct answers to eight questions Each numbered question (which may have several parts) is worth
More informationGeneralized Linear Model under the Extended Negative Multinomial Model and Cancer Incidence
Generalized Linear Model under the Extended Negative Multinomial Model and Cancer Incidence Sunil Kumar Dhar Center for Applied Mathematics and Statistics, Department of Mathematical Sciences, New Jersey
More informationAPPLICATION AND POWER OF PARAMETRIC CRITERIA FOR TESTING THE HOMOGENEITY OF VARIANCES. PART IV
DOI 10.1007/s11018-017-1213-4 Measurement Techniques, Vol. 60, No. 5, August, 2017 APPLICATION AND POWER OF PARAMETRIC CRITERIA FOR TESTING THE HOMOGENEITY OF VARIANCES. PART IV B. Yu. Lemeshko and T.
More informationLecture 12 Robust Estimation
Lecture 12 Robust Estimation Prof. Dr. Svetlozar Rachev Institute for Statistics and Mathematical Economics University of Karlsruhe Financial Econometrics, Summer Semester 2007 Copyright These lecture-notes
More informationMaster s Written Examination - Solution
Master s Written Examination - Solution Spring 204 Problem Stat 40 Suppose X and X 2 have the joint pdf f X,X 2 (x, x 2 ) = 2e (x +x 2 ), 0 < x < x 2
More informationYIJUN ZUO. Education. PhD, Statistics, 05/98, University of Texas at Dallas, (GPA 4.0/4.0)
YIJUN ZUO Department of Statistics and Probability Michigan State University East Lansing, MI 48824 Tel: (517) 432-5413 Fax: (517) 432-5413 Email: zuo@msu.edu URL: www.stt.msu.edu/users/zuo Education PhD,
More information