T 2 Type Test Statistic and Simultaneous Confidence Intervals for Sub-mean Vectors in k-sample Problem

Size: px
Start display at page:

Download "T 2 Type Test Statistic and Simultaneous Confidence Intervals for Sub-mean Vectors in k-sample Problem"

Transcription

1 T Type Test Statistic and Simultaneous Confidence Intervals for Sub-mean Vectors in k-sample Problem Toshiki aito a, Tamae Kawasaki b and Takashi Seo b a Department of Applied Mathematics, Graduate School of Science, Tokyo University of Science, Tokyo, Japan; b Department of Applied Mathematics, Faculty of Science, Tokyo University of Science, Tokyo, Japan Abstract In this paper, we consider tests for sub-mean vectors. In the one-sample and two-sample problems, we give the T type test statistic and the simultaneous confidence intervals by using the approximate upper percentiles of T type test statistic. In the k-sample problem, we give the simultaneous confidence intervals for pairwise multiple comparisons by using Bonferroni s approximation. Finally, we investigate the asymptotic behavior of the approximate upper percentiles of T type statistic by Monte Carlo simulation, and we give an example to illustrate the simultaneous confidence intervals. AMS 000 Mathematics Subject Classification. 6H0, 6H5. Key Words and Phrases: Bonferroni s approximation, Monte Carlo simulation, pairwise multiple comparisons, T max statistic, type I error Introduction Consider the tests of mean when the partial mean vector is given. That is, letting µ = (µ, µ be a mean vector, where µ = (µ, µ,..., µ r and µ = (µ r+, µ r+,..., µ p, the problem is the testing of µ when µ is given. For the problem of sub-mean vectors, Eaton and Kariya (983 derived tests for the independence of two normally distributed sub-mean vectors for the case that an additional random sample is available. Provost (990 obtained explicit expressions for the case that the maximum likelihood estimators (MLEs of all the parameters of the multi-normal random vector are given, and the likelihood ratio statistic for testing the independence between sub-mean vectors has been obtained. For the onesample problem, Rao (949 gave Rao s U-statistic and additional information. The null distribution of Rao s U-statistic has been introduced by Siotani et al. (985. A test for sub-mean vectors with two-step monotone missing data was discussed by Kawasaki and Seo (06. A test for sub-mean vectors in two-sample problem was introduced by Rencher (0. For the k-sample problem, Fujikoshi et al. (00 gave an asymptotic expansion of the distribution of the generalized U-statistic under normality. Gupta et al. (006 gave an asymptotic expansion of the distribution of the generalized U-statistic under

2 a general condition. However, the problem for sub-mean vectors in terms of simultaneous confidence intervals does not appear to have been discussed. In this paper, we give the T type test statistic and derive the simultaneous confidence intervals for sub-mean vectors in the one-sample and two-sample problems. For the k-sample problem, we give the simultaneous confidence intervals for pairwise multiple comparisons for sub-mean vectors. The remainder of this paper is organized. In Section, we propose the T type test statistic for the one-sample case (T test statistic, and its approximate upper percentiles and simultaneous confidence intervals. In Section 3, we propose the T type test statistic for the two-sample case (T test statistic, and its approximate upper percentiles and simultaneous confidence intervals. In Section 4, we present simultaneous confidence intervals for multiple comparisons in the k sample problem. In order to obtain the simultaneous confidence intervals, we derive the approximate upper percentiles of the T max statistic by Bonferroni s approximation. In Section 5, we investigate the asymptotic behavior of the approximate upper percentiles of the T test statistic and the T max statistic by Monte Carlo simulation. In Section 6, we give an example to illustrate simultaneous confidence intervals. One-sample problem. T type test statistic for a sub-mean vector Let x, x,..., x i.i.d. p (µ, Σ. In this section, we consider the following hypothesis H 0 : µ = µ 0 given µ = µ 0 vs. H : µ µ 0 given µ = µ 0, where µ 0 = (µ 0,, µ 0,,..., µ 0,p = (µ 0, µ 0 are given values, and µ = p ( µ µ, µ = r µ µ. µ r, µ = s µ r+ µ r+. µ p, Σ = Σ Σ. Σ Σ We partition x j into a r random vector, a s random vector as x j = (x j, x j, where x j : r, x j : s, p = r + s, j =,,...,. This hypothesis test is then the same as the following hypothesis: H 0 : µ = µ 0 vs. H : µ µ 0,

3 where µ = µ Σ Σ µ, µ 0 = µ 0 Σ Σ µ 0. We then derive the MLEs of µ and Σ. Employing the derivation of Siotani et al. (985, we use the transformed parameters as follows: ( η µ Σ η = = Σ µ 0 Ψ Ψ, Ψ = Σ Σ Σ = η µ 0 Ψ Ψ Σ Σ, Σ where Σ = Σ Σ Σ Σ. We note that (η, Ψ is in one-to-one correspondence with (µ, Σ. Using the transformed parameters (η, Ψ, the likelihood function is given by L(η, Ψ =(π p Ψ Ψ exp (x j Ψ x j η Ψ (x j Ψ x j η exp (x j µ 0 Ψ (x j µ 0. Therefore, the logarithm of L(η, Ψ can be expressed as log L(η, Ψ = p log(π log Ψ log Ψ (x j Ψ x j η Ψ (x j Ψ x j η (x j µ 0 Ψ (x j µ 0. Differentiating log L(η, Ψ with respect to η and Ψ, we have log L(η, Ψ η log L(η, Ψ Ψ log L(η, Ψ Ψ = log L(η, Ψ Ψ = Ψ (x j Ψ x j η, = Ψ + Ψ (x j Ψ x j η (x j Ψ x j η Ψ, Ψ { (xj x (x j x Ψ (x j x (x j x }, = Ψ + Ψ (x j µ 0 (x j µ 0 Ψ. Solving the partial derivative of log L(η, Ψ = 0, the MLEs of η and Ψ are given by η = x Ψ x, Ψ = V, Ψ = V V Ψ, Ψ = V, where x = x j = ( x x, V = (x j x(x j x V V =, V = V V V V V V. 3

4 Therefore, using the relation that (η, Ψ is in one-to-one correspondence with (µ, Σ, the MLEs of µ and Σ are given by ( ( µ x µ = = Σ Σ (x µ 0, µ µ 0 ( Σ Σ Σ = = V + Σ Σ Σ V V Σ Σ Σ V V V The T test statistic for sub-mean vectors is then given by Σ. T = ( µ µ 0 { Ĉov( µ µ 0 } ( µ µ 0, where µ, Ĉov( µ µ 0 are estimators of µ, Cov( µ µ 0. We note that under H 0, T is approximately distributed as a F distribution with r and p degrees of freedom. Using this result, the approximate upper 00α percentile of the T test statistic is given by t app(α = ( r p F r, p(α, where F r, p (α is the upper 00α percentile of the F distribution with r and p degrees of freedom.. Simultaneous confidence intervals for sub-mean vectors We consider the simultaneous confidence intervals for any and all linear compounds of the sub-mean. Using the approximate upper percentiles of T from Section., for any nonnull vector a = (a, a,, a p, the simultaneous approximate confidence intervals for a (µ µ 0 are given by a u where c R app a (µ µ 0 a u + c R app, a R r {0}, u = x µ 0 V V (x µ 0, R app = (t app(αa V a (, c = s. 3 Two-sample problem 3. T type test statistic for sub-mean vectors Let x, x,, x i.i.d. p (µ, Σ, i =,, = ( + (. In this section, we consider the following hypothesis H 0 : µ ( = µ ( given µ ( = µ ( vs. H : µ ( µ ( given µ ( = µ (, 4

5 where We partition x j µ = p µ µ, µ = r µ µ µ r., µ = s µ r+ µ r+. µ p Σ Σ, Σ =. Σ Σ into a r random vector and a s random vector, as x j = (x j, x j, where j =,,...,, and p = r + s. This hypothesis test is then the same as the following hypothesis H 0 : µ ( = µ( vs. H : µ ( µ(, where µ = µ Σ Σ µ, µ = µ ( = µ (. We derive the MLEs of µ and Σ as follows. As for the one-sample case, we use the following transformed parameters (η, Ψ η η = = η ( µ Σ Σ µ, Ψ = µ Ψ Ψ = Ψ Ψ Σ Σ Σ Σ Σ, Σ where Σ = Σ Σ Σ Σ, i =,. We note that (η, Ψ is in one-to-one correspondence with (µ, Σ. Using the transformed parameters (η, Ψ, the likelihood function is given by L(η (, η (, Ψ =(π p Ψ Ψ exp exp i= i= (x j Ψ x j η Ψ (x j Ψ x j η (x j η Ψ (x Therefore, the logarithm of L(η (, η (, Ψ can be expressed as j η. log L(η (, η (, Ψ = p log(π log Ψ log Ψ i= (x j Ψ x j η Ψ (x j Ψ x j η (x j η Ψ (x j η. i= 5

6 Differentiating log L(η (, η (, Ψ with respect to η, Ψ, we have log L(η (, η (, Ψ η log L(η (, η (, Ψ η log L(η (, η (, Ψ Ψ = Ψ + i= log L(η (, η (, Ψ Ψ = log L(η (, η (, Ψ Ψ = Ψ (x j Ψ x j η, = Ψ (x j η, i= Ψ (x j Ψ x j η (x j Ψ x j η Ψ, i= = Ψ + { } Ψ (x j η x j Ψ x x, i= Ψ (x j η (x j η Ψ. Solving the partial derivative of log L(η (, η (, Ψ = 0, the maximum likelihood estimates of η (, η (, and Ψ are given by η = x Ψ x, i =,, η = x, Ψ = V, Ψ = V V Ψ, { Ψ = V + (x (j x (x (j x }, where x = V = V = (x j x j = ( x x, x = x (x j x = V V V = V V i= i= x, ( V V V V, V = V V V V., Therefore, using the relation that (η (, η (, Ψ is in one-to-one correspondence with (µ (, µ (, Σ, the MLEs of µ (, µ (, and Σ are given by ( µ µ x = = Σ Σ (x x, i =,, µ x ( Σ Σ V + Σ Σ Σ V V Σ = = { Σ Σ Σ V V V + i= Σ (x x (x } x, 6

7 The T test statistic for the sub-mean vector is then given by T = ( µ ( µ( { Ĉov µ ( µ( ( } ( µ µ(, where µ and Ĉov µ ( µ( are the estimators of µ and Cov µ ( µ(, respectively. We note that under H 0, T is asymptotically distributed as a χ distribution with r degrees of freedom. However, when the sample is not large, the χ distribution is not a good approximation of the upper percentile of T. Let We can then rewrite T as u = x ( x ( V V (x( x (, c ( 3 = ( ( ( s 3. T = ( c u V u = ( z W z = ( z z z W z z, z where z = c Σ u, W = Σ V Σ. We note that u is distributed on r (µ ( µ(, c Σ when (, (. Therefore, the distribution of z z is a χ distribution with r degrees of freedom. We note that under H 0, ( p s T /( r is approximately distributed as a F distribution with r and p s degrees of freedom. Using this result, the approximate upper 00α percentile of the T statistic is given by t app(α = ( r p s F r, p s (α, where F r, p s (α is the upper 00α percentiles of the F distribution with r and p s degrees of freedom. 3. Simultaneous confidence intervals We consider the simultaneous confidence intervals for any and all linear compounds of the sub-mean. Using the upper percentiles of T from Section 3., for any nonnull vector a = (a, a,, a p, the simultaneous confidence intervals for a (µ ( µ( are given by a c u M a (µ ( µ( a u + c M, a Rr {0}, 7

8 where M = (t (αa V a, and t (α is the upper 00α percentiles of the T test statistic. However, it is not easy to obtain t (α. Therefore, using the approximate upper 00α percentiles of the T test statistic, t app(α, the approximate simultaneous confidence intervals for a (µ ( a c u M app a (µ ( µ( a u + where M app = (t app(αa V a. 4 k-sample problem 4. The T max type test statistic µ( c M app, a R r {0}, can obtained by In this section, we consider the T max type test statistic for testing any two sub-mean vectors and propose the approximate upper 00α percentiles of this statistic with Bonferroni s approximation. Let x, x,, x i.i.d. p (µ, Σ for i =,,, k, = k i=. Gupta et al. (006 has derived the likelihood ratio test (LRT statistic for testing the hypothesis H 0 : µ ( = µ ( = = µ (k given µ ( = µ ( = = µ (k vs. H : at least two µ s are an equal given µ( = µ ( = = µ (k. When H 0 is rejected, our interest is pairwise comparisons of sub-mean vectors. Under the assumption of the common population covariance matrix, for fixed a; b, we can use the T type statistic for the two-sample problem derived in Section3., that is, T ab = ( µ µ(j { Ĉov( µ µ(j } ( µ µ(j, where µ (= µ Σ Σ µ and Ĉov( µ µ(j are estimators of µ respectively. Similarly, as for the two-sample case, the MLEs are given by µ = ( ( µ x = Σ Σ µ x ( Σ Σ Σ = = Σ Σ (x x, V + Σ Σ Σ V V Σ V V { k V + i= Σ (x x (x and Cov( µ µ(j, x }, 8

9 where x = V = V = (x j x j = x, x x = x (x j x = k V V V = V V i= k i= x, ( V V V V, V = V V V V., Similarly, as Section 3., under the hypothesis that the two sub-mean vectors are equal, we have the following approximate upper 00α percentile of the Tab test statistic for fixed i = a, b: t ab app(α = ( kr r k(s + + F r, r k(s++(α, where F r, r k(s++ (α is the upper 00α percentile of the F distribution with r and r k(s++ degrees of freedom. Using the test statistic, the T type test statistic for H 0 : µ (a = µ (b for all a, b, a < b k given µ ( = µ ( = = µ (k vs. H : H 0 given µ ( = µ ( = = µ (k, is given by T max = max a<b k T ab. Generally, it is not easy to obtain the upper percentile of the T max statistic. Therefore, in this section, we adopt Bonferroni s approximation, which is one of the solutions to this problem. We first consider the T ab test statistic. We consider the upper 00α percentile of the T max statistic t max(α when ( = ( = = (k. An approximate upper percentile of T max, t BO(α, is a solution of the equation given by a<b k Pr(T ab > t BO(α = α. Using Bonferroni s approximation, t BO(α is given by t BO(α = t ab(α, 9

10 where α = α/k(k, t ab (α is the upper 00α percentiles of the Tab test statistic. However, it is not easy to obtain t ab (α. Using t ab app(α, the approximate upper 00α percentile of the T max statistic is given by t B app(α = ( kr k(s + r + F r, k(s+ r+(α, where the F r, k(s+ r+ (α is the upper approximate 00α percentile of F distribution with r and r k(s + + degrees of freedom. 4. Simultaneous confidence intervals for multiple comparisons among submean vectors In this section, we consider simultaneous confidence intervals for pairwise multiple comparisons among sub-mean vectors. Using the approximate upper percentile of T max, for any nonnull vector a = (a, a,, a p, the approximate simultaneous confidence intervals for a (µ a cab u ab k L app a (µ (a µ(b a u ab + µ(j, a < b k are given by cab k L app, a R r {0}, a < b k, where L app = ( t B app(αa V a, c ab = ( (a + (b ( k (a (b ( k s. 5 Simulation studies In this section, we perform a Monte Carlo simulation (with 0 6 runs in order to evaluate the asymptotic behavior of the F approximations and the accuracy of the approximate upper 00α percentiles of the T and T max test statistic. Tables a and b present the simulated upper 00α percentile of the T test statistic and the approximate upper 00α percentile of the T test statistic for the two-sample problem; (p, r, s = (4,, 3, (4,,, (4, 3,, (8,, 6, (8, 4, 4, (8, 6, ; α = 0.05, 0.0; and for the following two cases of ( (, ( : ( (, ( = { (l, l, l = 0, 40, 00, 00, 400 (l, l, l = 0, 40, 00, 00. Tables a and b present the type I errors for the upper 00α percentile of the χ distribution with r degrees of freedom and the approximate upper 00α percentile of the T test statistic given by α = Pr(T > χ r(α, α = Pr(T > t app(α, 0

11 respectively. Table gives the approximate upper 00α percentile of the Tmax statistic, t BO(α, and the approximate upper 00α percentile of the Tmax statistic, t B app(α, for the k-sample problem; k = 3, 4, 5, 6, 0; (p, r, s = (4,, ; α = 0.05, 0.0; (k = l, where l = 0, 40, 00, 00, 400. Table presents the type I error for the approximate upper 00α percentile of the Tmax statistic: α 3 = Pr(T max > t B app(α, where α = α/k(k. It may be noted from Tables a and b that the simulated values approach closer to the upper percentile of the χ distribution when both of the sample sizes ( and ( become large. In addition, it can be seen from both tables that the proposed approximation value is good for all cases even when the sample size is small. The results for the type I error of the proposed approximation value are closer than those of the χ value for all cases. In Table, the simulated values also approach the upper 00α percentile of the χ distribution when both of the sample sizes ( and ( become large. The results for the type I error of the proposed approximation value are always lower than α for all cases. That is, it is conservative in terms of the type I error when k is large. Therefore, in terms of the type I error, it can be concluded that the proposed approximation values are more accurate than the simulated values for all cases. In addition, we perform a Monte Carlo simulation (with 0 6 runs in order to evaluate the powers of the T test statistic. Table 3a presents the powers of the T test statistic for the two-sample problem; (p, r, s = (4,, ; α = 0.05, 0.0; δ = µ ( µ ( = 0, 0.,..., ; and for the following cases of ( (, ( : ( = ( = l = 0, 40, 50. Table 3b presents the powers of the T test statistic for the two-sample problem; r =, (p, r, s = (4,,, (5,, 3, (6,, 4, (0,, 8; α = 0.05, 0.0; δ = µ ( µ ( = 0, 0.,..., ; ( (, ( = (50, 50. Tables 3a and 3b present the powers of the T test statistic: β = Pr(T > χ r(α H, β = Pr(T > t app(α H, respectively. It may be noted from Table 3a that the power takes conservative values when both of the sample sizes ( and ( become small. In addition, it may be noted from Table 3b that the power takes values that are a little conservative when s become small. The reasons are that the power may depend on the noncentrality parameters only through θ = δ Σ δ /c, where δ = µ ( µ (.

12 6 umerical example In this section, we discuss an example to illustrate the approximation of t B app(α by comparing each of the simultaneous confidence intervals in Section 4.. In this example, we utilize the data in the iris plant taken from Fisher (936. Data are presented on three species of iris, setosa, versicolor, and virginica, comprising four different measurements: x : petal width, x : petal length, x 3 : sepal width, and x 4 : sepal length. The population mean vectors are µ = (µ, µ = (µ, µ, µ 3, µ 4, where µ : mean of petal width, µ : mean of petal length, µ 3 : mean of sepal width, µ 4 : mean of sepal length, µ = (µ, µ, and µ = (µ 3, µ 4. We assume that these data are distributed normality, and µ = µ ( = µ ( = µ (3. We consider the simultaneous approximate confidence intervals of pairwise comparisons for testing the sub-mean vectors hypothesis: H 0 : µ (a = µ (b for all a, b, a < b 3 given µ ( = µ ( = µ (3 vs. H : H 0 given µ ( = µ ( = µ (3. Table 4 presents the simultaneous confidence intervals obtained by using t max(α, t BO(α, and t B app(α. For example, let a = (0, and α = 0.05; then the simultaneous confidence intervals obtained by using t max(0.05, t BO(0.05, and t B app(0.05 for pairwise comparisons are constructed as a (µ ( µ( obtained by using t max(0.05 : (.74,.066, t BO(0.05 : (.75,.065, t B app(0.05 : (.77,.063, respectively. These simultaneous confidence intervals present the petal length between the score of the setosa population and the score of the versicolor population. The simultaneous confidence intervals obtained by using the approximate upper 00α percentile, t B app(0.05, are a little longer than those obtained by using t BO(0.05 and t max(0.05. The remaining results are the same. Therefore, it can be concluded that the simultaneous confidence intervals obtained by using t B app(α are useful.

13 Table a t (α and t app(α when p = 4 Sample Size α = 0.05 α = 0.0 ( ( t (α t app(α α α t (α t app(α α α (r, s = (, 3, χ (0.05 = 3.84, χ (0.0 = (r, s = (,, χ (0.05 = 5.99, χ (0.0 = (r, s = (3,, χ 3(0.05 = 7.8, χ 3(0.0 =

14 Table b t (α and t app(α when p = 8 Sample Size α = 0.05 α = 0.0 ( ( t (α t app(α α α t (α t app(α α α (r, s = (, 6, χ (0.05 = 5.99, χ (0.0 = (r, s = (4, 4, χ 4(0.05 = 9.49, χ 4(0.0 = (r, s = (6,, χ 6(0.05 =.59, χ 6(0.0 =

15 Table t BO(α and t B app(α when (p, r, s = (4,, Sample Size α = 0.05 α = 0.0 t BO(α t B app(α α 3 t BO(α t B app(α α 3 k = k = k = k = k = ote : χ = 8.9, χ =.4, χ = 9.57, χ =.79, χ = 0.60, χ = 3.8, χ =.4, χ = 4.63, χ = 3.60, χ =

16 Table 3a Power for T test statistic when (p, r, s = (4,, α = 0.05 α = 0.0 δ ( (, ( β β β β 0 (0, (40, (50, ote : δ = µ ( µ (, β = Pr(T > χ r(α H, β = Pr(T > t app(α H 6

17 Table 3b Power for T test statistic when ( (, ( = (50, 50, r = α = 0.05 α = 0.0 δ (p, r, s β β β β 0 (4,, (5,, (6,, (0,, ote : δ = µ ( µ (, β = Pr(T > χ r(α H, β = Pr(T > t app(α H 7

18 Table 4 Simultaneous confidence intervals obtained by using t max(0.05, t BO(0.05, and t B app(0.05 a (, 0 (0, (, a (µ ( µ( µ( µ ( µ ( µ ( (µ ( + µ ( (µ( + µ ( t max(0.05 (.77,.070 (.74,.066 ( 3.448, 3.40 t BO(0.05 (.77,.070 (.75,.065 ( 3.449, 3.39 t B app(0.05 (.8,.067 (.77,.063 ( 3.45, 3.37 a (µ ( µ(3 µ( µ (3 µ ( µ (3 (µ ( + µ ( (µ(3 + µ (3 t max(0.05 (.889,.68 ( 3.53,.945 ( 4.938, t BO(0.05 (.890,.679 ( 3.55,.944 ( 4.939, 4.79 t B app(0.05 (.89,.678 ( 3.56,.94 ( 4.940, 4.77 a (µ ( µ(3 µ( µ (3 µ ( µ (3 (µ ( + µ ( (µ(3 + µ (3 t max(0.05 ( 0.75, ( 0.983, (.594,.386 t BO(0.05 ( 0.76, ( 0.984, (.595,.385 t B app(0.05 ( 0.78, ( 0.986, 0.77 (.0, References [] Eaton, M. L. and Kariya, T. (983, Multivariate tests with incomplete data, The Anal. Stat., Vol., o., [] Fisher, R. A., (936, The use of multiple measurements in taxonomic problems, Ann. Eugen., 7, [3] Fujikoshi, Y., Ulyanov, V. V. and Shimizu, R. (00, Multivariate Statistics: High-Dimensional and Large-Sample Approximations, Hoboken, J: Wiley. [4] Gupta, A. K., Xu, J. and Fujikoshi, Y. (006, An asymptotic expansion of the distribution of Rao s U-statistic under a general condition, J. Multivariate Anal., 97, [5] Kawasaki, T. and Seo, T. (06, A test for subvector of mean vector with two-step monotone missing data, SUT J. Math., 5, -39. [6] Provost, S. B. (990, Estimators for the parameters of a multivariate normal random vector with incomplete data on two subvectors and test of independence, Comput. Statist. Data Anal., 9, [7] Rao, C. R. (949, On some problems arising out of discrimination with multiple characters, Sankhyā, 9, [8] Rencher, A. C. (0, Methods of Multivariate Analysis, Hoboken, J: Wiley. [9] Siotani, M., Hayakawa, T. and Fujikoshi, Y. (985, Modern Multivariate Statistical Analysis: A Graduate Course and Handbook, American Science Press Inc., Ohio. 8

SIMULTANEOUS CONFIDENCE INTERVALS AMONG k MEAN VECTORS IN REPEATED MEASURES WITH MISSING DATA

SIMULTANEOUS CONFIDENCE INTERVALS AMONG k MEAN VECTORS IN REPEATED MEASURES WITH MISSING DATA SIMULTANEOUS CONFIDENCE INTERVALS AMONG k MEAN VECTORS IN REPEATED MEASURES WITH MISSING DATA Kazuyuki Koizumi Department of Mathematics, Graduate School of Science Tokyo University of Science 1-3, Kagurazaka,

More information

Testing equality of two mean vectors with unequal sample sizes for populations with correlation

Testing equality of two mean vectors with unequal sample sizes for populations with correlation Testing equality of two mean vectors with unequal sample sizes for populations with correlation Aya Shinozaki Naoya Okamoto 2 and Takashi Seo Department of Mathematical Information Science Tokyo University

More information

On the conservative multivariate multiple comparison procedure of correlated mean vectors with a control

On the conservative multivariate multiple comparison procedure of correlated mean vectors with a control On the conservative multivariate multiple comparison procedure of correlated mean vectors with a control Takahiro Nishiyama a a Department of Mathematical Information Science, Tokyo University of Science,

More information

Expected probabilities of misclassification in linear discriminant analysis based on 2-Step monotone missing samples

Expected probabilities of misclassification in linear discriminant analysis based on 2-Step monotone missing samples Expected probabilities of misclassification in linear discriminant analysis based on 2-Step monotone missing samples Nobumichi Shutoh, Masashi Hyodo and Takashi Seo 2 Department of Mathematics, Graduate

More information

Inferences on a Normal Covariance Matrix and Generalized Variance with Monotone Missing Data

Inferences on a Normal Covariance Matrix and Generalized Variance with Monotone Missing Data Journal of Multivariate Analysis 78, 6282 (2001) doi:10.1006jmva.2000.1939, available online at http:www.idealibrary.com on Inferences on a Normal Covariance Matrix and Generalized Variance with Monotone

More information

On the conservative multivariate Tukey-Kramer type procedures for multiple comparisons among mean vectors

On the conservative multivariate Tukey-Kramer type procedures for multiple comparisons among mean vectors On the conservative multivariate Tukey-Kramer type procedures for multiple comparisons among mean vectors Takashi Seo a, Takahiro Nishiyama b a Department of Mathematical Information Science, Tokyo University

More information

EPMC Estimation in Discriminant Analysis when the Dimension and Sample Sizes are Large

EPMC Estimation in Discriminant Analysis when the Dimension and Sample Sizes are Large EPMC Estimation in Discriminant Analysis when the Dimension and Sample Sizes are Large Tetsuji Tonda 1 Tomoyuki Nakagawa and Hirofumi Wakaki Last modified: March 30 016 1 Faculty of Management and Information

More information

High-dimensional asymptotic expansions for the distributions of canonical correlations

High-dimensional asymptotic expansions for the distributions of canonical correlations Journal of Multivariate Analysis 100 2009) 231 242 Contents lists available at ScienceDirect Journal of Multivariate Analysis journal homepage: www.elsevier.com/locate/jmva High-dimensional asymptotic

More information

High-Dimensional AICs for Selection of Redundancy Models in Discriminant Analysis. Tetsuro Sakurai, Takeshi Nakada and Yasunori Fujikoshi

High-Dimensional AICs for Selection of Redundancy Models in Discriminant Analysis. Tetsuro Sakurai, Takeshi Nakada and Yasunori Fujikoshi High-Dimensional AICs for Selection of Redundancy Models in Discriminant Analysis Tetsuro Sakurai, Takeshi Nakada and Yasunori Fujikoshi Faculty of Science and Engineering, Chuo University, Kasuga, Bunkyo-ku,

More information

Applied Multivariate and Longitudinal Data Analysis

Applied Multivariate and Longitudinal Data Analysis Applied Multivariate and Longitudinal Data Analysis Chapter 2: Inference about the mean vector(s) Ana-Maria Staicu SAS Hall 5220; 919-515-0644; astaicu@ncsu.edu 1 In this chapter we will discuss inference

More information

Consistency of Test-based Criterion for Selection of Variables in High-dimensional Two Group-Discriminant Analysis

Consistency of Test-based Criterion for Selection of Variables in High-dimensional Two Group-Discriminant Analysis Consistency of Test-based Criterion for Selection of Variables in High-dimensional Two Group-Discriminant Analysis Yasunori Fujikoshi and Tetsuro Sakurai Department of Mathematics, Graduate School of Science,

More information

Consistency of Distance-based Criterion for Selection of Variables in High-dimensional Two-Group Discriminant Analysis

Consistency of Distance-based Criterion for Selection of Variables in High-dimensional Two-Group Discriminant Analysis Consistency of Distance-based Criterion for Selection of Variables in High-dimensional Two-Group Discriminant Analysis Tetsuro Sakurai and Yasunori Fujikoshi Center of General Education, Tokyo University

More information

Approximate interval estimation for EPMC for improved linear discriminant rule under high dimensional frame work

Approximate interval estimation for EPMC for improved linear discriminant rule under high dimensional frame work Hiroshima Statistical Research Group: Technical Report Approximate interval estimation for PMC for improved linear discriminant rule under high dimensional frame work Masashi Hyodo, Tomohiro Mitani, Tetsuto

More information

Multivariate Statistics

Multivariate Statistics Multivariate Statistics Chapter 2: Multivariate distributions and inference Pedro Galeano Departamento de Estadística Universidad Carlos III de Madrid pedro.galeano@uc3m.es Course 2016/2017 Master in Mathematical

More information

Statistical Inference On the High-dimensional Gaussian Covarianc

Statistical Inference On the High-dimensional Gaussian Covarianc Statistical Inference On the High-dimensional Gaussian Covariance Matrix Department of Mathematical Sciences, Clemson University June 6, 2011 Outline Introduction Problem Setup Statistical Inference High-Dimensional

More information

arxiv: v1 [math.st] 2 Apr 2016

arxiv: v1 [math.st] 2 Apr 2016 NON-ASYMPTOTIC RESULTS FOR CORNISH-FISHER EXPANSIONS V.V. ULYANOV, M. AOSHIMA, AND Y. FUJIKOSHI arxiv:1604.00539v1 [math.st] 2 Apr 2016 Abstract. We get the computable error bounds for generalized Cornish-Fisher

More information

Chapter 7, continued: MANOVA

Chapter 7, continued: MANOVA Chapter 7, continued: MANOVA The Multivariate Analysis of Variance (MANOVA) technique extends Hotelling T 2 test that compares two mean vectors to the setting in which there are m 2 groups. We wish to

More information

Likelihood Ratio Criterion for Testing Sphericity from a Multivariate Normal Sample with 2-step Monotone Missing Data Pattern

Likelihood Ratio Criterion for Testing Sphericity from a Multivariate Normal Sample with 2-step Monotone Missing Data Pattern The Korean Communications in Statistics Vol. 12 No. 2, 2005 pp. 473-481 Likelihood Ratio Criterion for Testing Sphericity from a Multivariate Normal Sample with 2-step Monotone Missing Data Pattern Byungjin

More information

z = β βσβ Statistical Analysis of MV Data Example : µ=0 (Σ known) consider Y = β X~ N 1 (β µ, β Σβ) test statistic for H 0β is

z = β βσβ Statistical Analysis of MV Data Example : µ=0 (Σ known) consider Y = β X~ N 1 (β µ, β Σβ) test statistic for H 0β is Example X~N p (µ,σ); H 0 : µ=0 (Σ known) consider Y = β X~ N 1 (β µ, β Σβ) H 0β : β µ = 0 test statistic for H 0β is y z = β βσβ /n And reject H 0β if z β > c [suitable critical value] 301 Reject H 0 if

More information

Likelihood Ratio Tests in Multivariate Linear Model

Likelihood Ratio Tests in Multivariate Linear Model Likelihood Ratio Tests in Multivariate Linear Model Yasunori Fujikoshi Department of Mathematics, Graduate School of Science, Hiroshima University, 1-3-1 Kagamiyama, Higashi Hiroshima, Hiroshima 739-8626,

More information

In most cases, a plot of d (j) = {d (j) 2 } against {χ p 2 (1-q j )} is preferable since there is less piling up

In most cases, a plot of d (j) = {d (j) 2 } against {χ p 2 (1-q j )} is preferable since there is less piling up THE UNIVERSITY OF MINNESOTA Statistics 5401 September 17, 2005 Chi-Squared Q-Q plots to Assess Multivariate Normality Suppose x 1, x 2,..., x n is a random sample from a p-dimensional multivariate distribution

More information

Hypothesis Testing. Robert L. Wolpert Department of Statistical Science Duke University, Durham, NC, USA

Hypothesis Testing. Robert L. Wolpert Department of Statistical Science Duke University, Durham, NC, USA Hypothesis Testing Robert L. Wolpert Department of Statistical Science Duke University, Durham, NC, USA An Example Mardia et al. (979, p. ) reprint data from Frets (9) giving the length and breadth (in

More information

ON CONSTRUCTING T CONTROL CHARTS FOR RETROSPECTIVE EXAMINATION. Gunabushanam Nedumaran Oracle Corporation 1133 Esters Road #602 Irving, TX 75061

ON CONSTRUCTING T CONTROL CHARTS FOR RETROSPECTIVE EXAMINATION. Gunabushanam Nedumaran Oracle Corporation 1133 Esters Road #602 Irving, TX 75061 ON CONSTRUCTING T CONTROL CHARTS FOR RETROSPECTIVE EXAMINATION Gunabushanam Nedumaran Oracle Corporation 33 Esters Road #60 Irving, TX 7506 Joseph J. Pignatiello, Jr. FAMU-FSU College of Engineering Florida

More information

An Introduction to Multivariate Methods

An Introduction to Multivariate Methods Chapter 12 An Introduction to Multivariate Methods Multivariate statistical methods are used to display, analyze, and describe data on two or more features or variables simultaneously. I will discuss multivariate

More information

Consistency of test based method for selection of variables in high dimensional two group discriminant analysis

Consistency of test based method for selection of variables in high dimensional two group discriminant analysis https://doi.org/10.1007/s42081-019-00032-4 ORIGINAL PAPER Consistency of test based method for selection of variables in high dimensional two group discriminant analysis Yasunori Fujikoshi 1 Tetsuro Sakurai

More information

Statistical Inference with Monotone Incomplete Multivariate Normal Data

Statistical Inference with Monotone Incomplete Multivariate Normal Data Statistical Inference with Monotone Incomplete Multivariate Normal Data p. 1/4 Statistical Inference with Monotone Incomplete Multivariate Normal Data This talk is based on joint work with my wonderful

More information

Comparing two independent samples

Comparing two independent samples In many applications it is necessary to compare two competing methods (for example, to compare treatment effects of a standard drug and an experimental drug). To compare two methods from statistical point

More information

MATH5745 Multivariate Methods Lecture 07

MATH5745 Multivariate Methods Lecture 07 MATH5745 Multivariate Methods Lecture 07 Tests of hypothesis on covariance matrix March 16, 2018 MATH5745 Multivariate Methods Lecture 07 March 16, 2018 1 / 39 Test on covariance matrices: Introduction

More information

Lecture 32: Asymptotic confidence sets and likelihoods

Lecture 32: Asymptotic confidence sets and likelihoods Lecture 32: Asymptotic confidence sets and likelihoods Asymptotic criterion In some problems, especially in nonparametric problems, it is difficult to find a reasonable confidence set with a given confidence

More information

COMPARISON OF FIVE TESTS FOR THE COMMON MEAN OF SEVERAL MULTIVARIATE NORMAL POPULATIONS

COMPARISON OF FIVE TESTS FOR THE COMMON MEAN OF SEVERAL MULTIVARIATE NORMAL POPULATIONS Communications in Statistics - Simulation and Computation 33 (2004) 431-446 COMPARISON OF FIVE TESTS FOR THE COMMON MEAN OF SEVERAL MULTIVARIATE NORMAL POPULATIONS K. Krishnamoorthy and Yong Lu Department

More information

Exercises Chapter 4 Statistical Hypothesis Testing

Exercises Chapter 4 Statistical Hypothesis Testing Exercises Chapter 4 Statistical Hypothesis Testing Advanced Econometrics - HEC Lausanne Christophe Hurlin University of Orléans December 5, 013 Christophe Hurlin (University of Orléans) Advanced Econometrics

More information

Solutions to Final STAT 421, Fall 2008

Solutions to Final STAT 421, Fall 2008 Solutions to Final STAT 421, Fall 2008 Fritz Scholz 1. (8) Two treatments A and B were randomly assigned to 8 subjects (4 subjects to each treatment) with the following responses: 0, 1, 3, 6 and 5, 7,

More information

STAT 461/561- Assignments, Year 2015

STAT 461/561- Assignments, Year 2015 STAT 461/561- Assignments, Year 2015 This is the second set of assignment problems. When you hand in any problem, include the problem itself and its number. pdf are welcome. If so, use large fonts and

More information

Final Exam. 1. (6 points) True/False. Please read the statements carefully, as no partial credit will be given.

Final Exam. 1. (6 points) True/False. Please read the statements carefully, as no partial credit will be given. 1. (6 points) True/False. Please read the statements carefully, as no partial credit will be given. (a) If X and Y are independent, Corr(X, Y ) = 0. (b) (c) (d) (e) A consistent estimator must be asymptotically

More information

Testing a Normal Covariance Matrix for Small Samples with Monotone Missing Data

Testing a Normal Covariance Matrix for Small Samples with Monotone Missing Data Applied Mathematical Sciences, Vol 3, 009, no 54, 695-70 Testing a Normal Covariance Matrix for Small Samples with Monotone Missing Data Evelina Veleva Rousse University A Kanchev Department of Numerical

More information

A Multivariate Two-Sample Mean Test for Small Sample Size and Missing Data

A Multivariate Two-Sample Mean Test for Small Sample Size and Missing Data A Multivariate Two-Sample Mean Test for Small Sample Size and Missing Data Yujun Wu, Marc G. Genton, 1 and Leonard A. Stefanski 2 Department of Biostatistics, School of Public Health, University of Medicine

More information

Semiparametric Discriminant Analysis of Mixture Populations Using Mahalanobis Distance. Probal Chaudhuri and Subhajit Dutta

Semiparametric Discriminant Analysis of Mixture Populations Using Mahalanobis Distance. Probal Chaudhuri and Subhajit Dutta Semiparametric Discriminant Analysis of Mixture Populations Using Mahalanobis Distance Probal Chaudhuri and Subhajit Dutta Indian Statistical Institute, Kolkata. Workshop on Classification and Regression

More information

An Introduction to Multivariate Statistical Analysis

An Introduction to Multivariate Statistical Analysis An Introduction to Multivariate Statistical Analysis Third Edition T. W. ANDERSON Stanford University Department of Statistics Stanford, CA WILEY- INTERSCIENCE A JOHN WILEY & SONS, INC., PUBLICATION Contents

More information

7.2 One-Sample Correlation ( = a) Introduction. Correlation analysis measures the strength and direction of association between

7.2 One-Sample Correlation ( = a) Introduction. Correlation analysis measures the strength and direction of association between 7.2 One-Sample Correlation ( = a) Introduction Correlation analysis measures the strength and direction of association between variables. In this chapter we will test whether the population correlation

More information

4.5.1 The use of 2 log Λ when θ is scalar

4.5.1 The use of 2 log Λ when θ is scalar 4.5. ASYMPTOTIC FORM OF THE G.L.R.T. 97 4.5.1 The use of 2 log Λ when θ is scalar Suppose we wish to test the hypothesis NH : θ = θ where θ is a given value against the alternative AH : θ θ on the basis

More information

Likelihood-based inference with missing data under missing-at-random

Likelihood-based inference with missing data under missing-at-random Likelihood-based inference with missing data under missing-at-random Jae-kwang Kim Joint work with Shu Yang Department of Statistics, Iowa State University May 4, 014 Outline 1. Introduction. Parametric

More information

You can compute the maximum likelihood estimate for the correlation

You can compute the maximum likelihood estimate for the correlation Stat 50 Solutions Comments on Assignment Spring 005. (a) _ 37.6 X = 6.5 5.8 97.84 Σ = 9.70 4.9 9.70 75.05 7.80 4.9 7.80 4.96 (b) 08.7 0 S = Σ = 03 9 6.58 03 305.6 30.89 6.58 30.89 5.5 (c) You can compute

More information

Lecture 17: Likelihood ratio and asymptotic tests

Lecture 17: Likelihood ratio and asymptotic tests Lecture 17: Likelihood ratio and asymptotic tests Likelihood ratio When both H 0 and H 1 are simple (i.e., Θ 0 = {θ 0 } and Θ 1 = {θ 1 }), Theorem 6.1 applies and a UMP test rejects H 0 when f θ1 (X) f

More information

Master s Written Examination

Master s Written Examination Master s Written Examination Option: Statistics and Probability Spring 016 Full points may be obtained for correct answers to eight questions. Each numbered question which may have several parts is worth

More information

Robustness of the Quadratic Discriminant Function to correlated and uncorrelated normal training samples

Robustness of the Quadratic Discriminant Function to correlated and uncorrelated normal training samples DOI 10.1186/s40064-016-1718-3 RESEARCH Open Access Robustness of the Quadratic Discriminant Function to correlated and uncorrelated normal training samples Atinuke Adebanji 1,2, Michael Asamoah Boaheng

More information

A Very Brief Summary of Statistical Inference, and Examples

A Very Brief Summary of Statistical Inference, and Examples A Very Brief Summary of Statistical Inference, and Examples Trinity Term 2009 Prof. Gesine Reinert Our standard situation is that we have data x = x 1, x 2,..., x n, which we view as realisations of random

More information

Introduction Large Sample Testing Composite Hypotheses. Hypothesis Testing. Daniel Schmierer Econ 312. March 30, 2007

Introduction Large Sample Testing Composite Hypotheses. Hypothesis Testing. Daniel Schmierer Econ 312. March 30, 2007 Hypothesis Testing Daniel Schmierer Econ 312 March 30, 2007 Basics Parameter of interest: θ Θ Structure of the test: H 0 : θ Θ 0 H 1 : θ Θ 1 for some sets Θ 0, Θ 1 Θ where Θ 0 Θ 1 = (often Θ 1 = Θ Θ 0

More information

STATS 200: Introduction to Statistical Inference. Lecture 29: Course review

STATS 200: Introduction to Statistical Inference. Lecture 29: Course review STATS 200: Introduction to Statistical Inference Lecture 29: Course review Course review We started in Lecture 1 with a fundamental assumption: Data is a realization of a random process. The goal throughout

More information

Illustration of the Varying Coefficient Model for Analyses the Tree Growth from the Age and Space Perspectives

Illustration of the Varying Coefficient Model for Analyses the Tree Growth from the Age and Space Perspectives TR-No. 14-06, Hiroshima Statistical Research Group, 1 11 Illustration of the Varying Coefficient Model for Analyses the Tree Growth from the Age and Space Perspectives Mariko Yamamura 1, Keisuke Fukui

More information

Analysis of the AIC Statistic for Optimal Detection of Small Changes in Dynamic Systems

Analysis of the AIC Statistic for Optimal Detection of Small Changes in Dynamic Systems Analysis of the AIC Statistic for Optimal Detection of Small Changes in Dynamic Systems Jeremy S. Conner and Dale E. Seborg Department of Chemical Engineering University of California, Santa Barbara, CA

More information

MISCELLANEOUS TOPICS RELATED TO LIKELIHOOD. Copyright c 2012 (Iowa State University) Statistics / 30

MISCELLANEOUS TOPICS RELATED TO LIKELIHOOD. Copyright c 2012 (Iowa State University) Statistics / 30 MISCELLANEOUS TOPICS RELATED TO LIKELIHOOD Copyright c 2012 (Iowa State University) Statistics 511 1 / 30 INFORMATION CRITERIA Akaike s Information criterion is given by AIC = 2l(ˆθ) + 2k, where l(ˆθ)

More information

Applied Multivariate Analysis

Applied Multivariate Analysis Department of Mathematics and Statistics, University of Vaasa, Finland Spring 2017 Discriminant Analysis Background 1 Discriminant analysis Background General Setup for the Discriminant Analysis Descriptive

More information

Testing linear hypotheses of mean vectors for high-dimension data with unequal covariance matrices

Testing linear hypotheses of mean vectors for high-dimension data with unequal covariance matrices Testing linear hypotheses of mean vectors for high-dimension data with unequal covariance matrices Takahiro Nishiyama a,, Masashi Hyodo b, Takashi Seo a, Tatjana Pavlenko c a Department of Mathematical

More information

Inference on reliability in two-parameter exponential stress strength model

Inference on reliability in two-parameter exponential stress strength model Metrika DOI 10.1007/s00184-006-0074-7 Inference on reliability in two-parameter exponential stress strength model K. Krishnamoorthy Shubhabrata Mukherjee Huizhen Guo Received: 19 January 2005 Springer-Verlag

More information

Journal of Statistical Research 2007, Vol. 41, No. 1, pp Bangladesh

Journal of Statistical Research 2007, Vol. 41, No. 1, pp Bangladesh Journal of Statistical Research 007, Vol. 4, No., pp. 5 Bangladesh ISSN 056-4 X ESTIMATION OF AUTOREGRESSIVE COEFFICIENT IN AN ARMA(, ) MODEL WITH VAGUE INFORMATION ON THE MA COMPONENT M. Ould Haye School

More information

Approximate Distributions of the Likelihood Ratio Statistic in a Structural Equation with Many Instruments

Approximate Distributions of the Likelihood Ratio Statistic in a Structural Equation with Many Instruments CIRJE-F-466 Approximate Distributions of the Likelihood Ratio Statistic in a Structural Equation with Many Instruments Yukitoshi Matsushita CIRJE, Faculty of Economics, University of Tokyo February 2007

More information

Hotelling s One- Sample T2

Hotelling s One- Sample T2 Chapter 405 Hotelling s One- Sample T2 Introduction The one-sample Hotelling s T2 is the multivariate extension of the common one-sample or paired Student s t-test. In a one-sample t-test, the mean response

More information

Hypothesis Testing. 1 Definitions of test statistics. CB: chapter 8; section 10.3

Hypothesis Testing. 1 Definitions of test statistics. CB: chapter 8; section 10.3 Hypothesis Testing CB: chapter 8; section 0.3 Hypothesis: statement about an unknown population parameter Examples: The average age of males in Sweden is 7. (statement about population mean) The lowest

More information

Loglikelihood and Confidence Intervals

Loglikelihood and Confidence Intervals Stat 504, Lecture 2 1 Loglikelihood and Confidence Intervals The loglikelihood function is defined to be the natural logarithm of the likelihood function, l(θ ; x) = log L(θ ; x). For a variety of reasons,

More information

University of Lisbon, Portugal

University of Lisbon, Portugal Development and comparative study of two near-exact approximations to the distribution of the product of an odd number of independent Beta random variables Luís M. Grilo a,, Carlos A. Coelho b, a Dep.

More information

Analysis of variance, multivariate (MANOVA)

Analysis of variance, multivariate (MANOVA) Analysis of variance, multivariate (MANOVA) Abstract: A designed experiment is set up in which the system studied is under the control of an investigator. The individuals, the treatments, the variables

More information

Sample size determination for logistic regression: A simulation study

Sample size determination for logistic regression: A simulation study Sample size determination for logistic regression: A simulation study Stephen Bush School of Mathematical Sciences, University of Technology Sydney, PO Box 123 Broadway NSW 2007, Australia Abstract This

More information

Likelihood Ratio tests

Likelihood Ratio tests Likelihood Ratio tests For general composite hypotheses optimality theory is not usually successful in producing an optimal test. instead we look for heuristics to guide our choices. The simplest approach

More information

Master s Written Examination

Master s Written Examination Master s Written Examination Option: Statistics and Probability Spring 05 Full points may be obtained for correct answers to eight questions Each numbered question (which may have several parts) is worth

More information

Lecture 5: Hypothesis tests for more than one sample

Lecture 5: Hypothesis tests for more than one sample 1/23 Lecture 5: Hypothesis tests for more than one sample Måns Thulin Department of Mathematics, Uppsala University thulin@math.uu.se Multivariate Methods 8/4 2011 2/23 Outline Paired comparisons Repeated

More information

Inferences about a Mean Vector

Inferences about a Mean Vector Inferences about a Mean Vector Edps/Soc 584, Psych 594 Carolyn J. Anderson Department of Educational Psychology I L L I N O I S university of illinois at urbana-champaign c Board of Trustees, University

More information

STAT 501 EXAM I NAME Spring 1999

STAT 501 EXAM I NAME Spring 1999 STAT 501 EXAM I NAME Spring 1999 Instructions: You may use only your calculator and the attached tables and formula sheet. You can detach the tables and formula sheet from the rest of this exam. Show your

More information

Statistical Inference with Monotone Incomplete Multivariate Normal Data

Statistical Inference with Monotone Incomplete Multivariate Normal Data Statistical Inference with Monotone Incomplete Multivariate Normal Data p. 1/4 Statistical Inference with Monotone Incomplete Multivariate Normal Data This talk is based on joint work with my wonderful

More information

Asymptotic Analysis of the Generalized Coherence Estimate

Asymptotic Analysis of the Generalized Coherence Estimate IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 49, NO. 1, JANUARY 2001 45 Asymptotic Analysis of the Generalized Coherence Estimate Axel Clausen, Member, IEEE, and Douglas Cochran, Senior Member, IEEE Abstract

More information

ST495: Survival Analysis: Hypothesis testing and confidence intervals

ST495: Survival Analysis: Hypothesis testing and confidence intervals ST495: Survival Analysis: Hypothesis testing and confidence intervals Eric B. Laber Department of Statistics, North Carolina State University April 3, 2014 I remember that one fateful day when Coach took

More information

ABOUT PRINCIPAL COMPONENTS UNDER SINGULARITY

ABOUT PRINCIPAL COMPONENTS UNDER SINGULARITY ABOUT PRINCIPAL COMPONENTS UNDER SINGULARITY José A. Díaz-García and Raúl Alberto Pérez-Agamez Comunicación Técnica No I-05-11/08-09-005 (PE/CIMAT) About principal components under singularity José A.

More information

(a) (3 points) Construct a 95% confidence interval for β 2 in Equation 1.

(a) (3 points) Construct a 95% confidence interval for β 2 in Equation 1. Problem 1 (21 points) An economist runs the regression y i = β 0 + x 1i β 1 + x 2i β 2 + x 3i β 3 + ε i (1) The results are summarized in the following table: Equation 1. Variable Coefficient Std. Error

More information

EMPIRICAL ENVELOPE MLE AND LR TESTS. Mai Zhou University of Kentucky

EMPIRICAL ENVELOPE MLE AND LR TESTS. Mai Zhou University of Kentucky EMPIRICAL ENVELOPE MLE AND LR TESTS Mai Zhou University of Kentucky Summary We study in this paper some nonparametric inference problems where the nonparametric maximum likelihood estimator (NPMLE) are

More information

Statement: With my signature I confirm that the solutions are the product of my own work. Name: Signature:.

Statement: With my signature I confirm that the solutions are the product of my own work. Name: Signature:. MATHEMATICAL STATISTICS Homework assignment Instructions Please turn in the homework with this cover page. You do not need to edit the solutions. Just make sure the handwriting is legible. You may discuss

More information

Goodness-of-Fit Testing with. Discrete Right-Censored Data

Goodness-of-Fit Testing with. Discrete Right-Censored Data Goodness-of-Fit Testing with Discrete Right-Censored Data Edsel A. Pe~na Department of Statistics University of South Carolina June 2002 Research support from NIH and NSF grants. The Problem T1, T2,...,

More information

Hypothesis Testing Based on the Maximum of Two Statistics from Weighted and Unweighted Estimating Equations

Hypothesis Testing Based on the Maximum of Two Statistics from Weighted and Unweighted Estimating Equations Hypothesis Testing Based on the Maximum of Two Statistics from Weighted and Unweighted Estimating Equations Takeshi Emura and Hisayuki Tsukuma Abstract For testing the regression parameter in multivariate

More information

ISyE 6416: Computational Statistics Spring Lecture 5: Discriminant analysis and classification

ISyE 6416: Computational Statistics Spring Lecture 5: Discriminant analysis and classification ISyE 6416: Computational Statistics Spring 2017 Lecture 5: Discriminant analysis and classification Prof. Yao Xie H. Milton Stewart School of Industrial and Systems Engineering Georgia Institute of Technology

More information

STAT 450: Final Examination Version 1. Richard Lockhart 16 December 2002

STAT 450: Final Examination Version 1. Richard Lockhart 16 December 2002 Name: Last Name 1, First Name 1 Stdnt # StudentNumber1 STAT 450: Final Examination Version 1 Richard Lockhart 16 December 2002 Instructions: This is an open book exam. You may use notes, books and a calculator.

More information

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A. 1. Let P be a probability measure on a collection of sets A. (a) For each n N, let H n be a set in A such that H n H n+1. Show that P (H n ) monotonically converges to P ( k=1 H k) as n. (b) For each n

More information

Lecture 28: Asymptotic confidence sets

Lecture 28: Asymptotic confidence sets Lecture 28: Asymptotic confidence sets 1 α asymptotic confidence sets Similar to testing hypotheses, in many situations it is difficult to find a confidence set with a given confidence coefficient or level

More information

THE STATISTICAL ANALYSIS OF MONOTONE INCOMPLETE MULTIVARIATE NORMAL DATA

THE STATISTICAL ANALYSIS OF MONOTONE INCOMPLETE MULTIVARIATE NORMAL DATA The Pennsylvania State University The Graduate School Department of Statistics THE STATISTICAL ANALYSIS OF MONOTONE INCOMPLETE MULTIVARIATE NORMAL DATA A Dissertation in Statistics by Megan M. Romer c

More information

Variations. ECE 6540, Lecture 10 Maximum Likelihood Estimation

Variations. ECE 6540, Lecture 10 Maximum Likelihood Estimation Variations ECE 6540, Lecture 10 Last Time BLUE (Best Linear Unbiased Estimator) Formulation Advantages Disadvantages 2 The BLUE A simplification Assume the estimator is a linear system For a single parameter

More information

Testing Hypothesis. Maura Mezzetti. Department of Economics and Finance Università Tor Vergata

Testing Hypothesis. Maura Mezzetti. Department of Economics and Finance Università Tor Vergata Maura Department of Economics and Finance Università Tor Vergata Hypothesis Testing Outline It is a mistake to confound strangeness with mystery Sherlock Holmes A Study in Scarlet Outline 1 The Power Function

More information

Bootstrap Procedures for Testing Homogeneity Hypotheses

Bootstrap Procedures for Testing Homogeneity Hypotheses Journal of Statistical Theory and Applications Volume 11, Number 2, 2012, pp. 183-195 ISSN 1538-7887 Bootstrap Procedures for Testing Homogeneity Hypotheses Bimal Sinha 1, Arvind Shah 2, Dihua Xu 1, Jianxin

More information

Ratio of Vector Lengths as an Indicator of Sample Representativeness

Ratio of Vector Lengths as an Indicator of Sample Representativeness Abstract Ratio of Vector Lengths as an Indicator of Sample Representativeness Hee-Choon Shin National Center for Health Statistics, 3311 Toledo Rd., Hyattsville, MD 20782 The main objective of sampling

More information

Analysis of Classification in Interval-Valued Information Systems

Analysis of Classification in Interval-Valued Information Systems Analysis of Classification in Interval-Valued Information Systems AMAURY CABALLERO, KANG YEN, EDUARDO CABALLERO Department of Electrical & Computer Engineering Florida International University 10555 W.

More information

Robustness of location estimators under t- distributions: a literature review

Robustness of location estimators under t- distributions: a literature review IOP Conference Series: Earth and Environmental Science PAPER OPEN ACCESS Robustness of location estimators under t- distributions: a literature review o cite this article: C Sumarni et al 07 IOP Conf.

More information

A Likelihood Ratio Test

A Likelihood Ratio Test A Likelihood Ratio Test David Allen University of Kentucky February 23, 2012 1 Introduction Earlier presentations gave a procedure for finding an estimate and its standard error of a single linear combination

More information

Lecture 5: LDA and Logistic Regression

Lecture 5: LDA and Logistic Regression Lecture 5: and Logistic Regression Hao Helen Zhang Hao Helen Zhang Lecture 5: and Logistic Regression 1 / 39 Outline Linear Classification Methods Two Popular Linear Models for Classification Linear Discriminant

More information

Introduction to Statistical Inference Lecture 10: ANOVA, Kruskal-Wallis Test

Introduction to Statistical Inference Lecture 10: ANOVA, Kruskal-Wallis Test Introduction to Statistical Inference Lecture 10: ANOVA, Kruskal-Wallis Test la Contents The two sample t-test generalizes into Analysis of Variance. In analysis of variance ANOVA the population consists

More information

Lecture 3. Inference about multivariate normal distribution

Lecture 3. Inference about multivariate normal distribution Lecture 3. Inference about multivariate normal distribution 3.1 Point and Interval Estimation Let X 1,..., X n be i.i.d. N p (µ, Σ). We are interested in evaluation of the maximum likelihood estimates

More information

Data Mining and Analysis: Fundamental Concepts and Algorithms

Data Mining and Analysis: Fundamental Concepts and Algorithms Data Mining and Analysis: Fundamental Concepts and Algorithms dataminingbook.info Mohammed J. Zaki 1 Wagner Meira Jr. 1 Department of Computer Science Rensselaer Polytechnic Institute, Troy, NY, USA Department

More information

Multivariate analysis of variance and covariance

Multivariate analysis of variance and covariance Introduction Multivariate analysis of variance and covariance Univariate ANOVA: have observations from several groups, numerical dependent variable. Ask whether dependent variable has same mean for each

More information

INFLUENCE OF USING ALTERNATIVE MEANS ON TYPE-I ERROR RATE IN THE COMPARISON OF INDEPENDENT GROUPS ABSTRACT

INFLUENCE OF USING ALTERNATIVE MEANS ON TYPE-I ERROR RATE IN THE COMPARISON OF INDEPENDENT GROUPS ABSTRACT Mirtagioğlu et al., The Journal of Animal & Plant Sciences, 4(): 04, Page: J. 344-349 Anim. Plant Sci. 4():04 ISSN: 08-708 INFLUENCE OF USING ALTERNATIVE MEANS ON TYPE-I ERROR RATE IN THE COMPARISON OF

More information

Classification Methods II: Linear and Quadratic Discrimminant Analysis

Classification Methods II: Linear and Quadratic Discrimminant Analysis Classification Methods II: Linear and Quadratic Discrimminant Analysis Rebecca C. Steorts, Duke University STA 325, Chapter 4 ISL Agenda Linear Discrimminant Analysis (LDA) Classification Recall that linear

More information

4 Statistics of Normally Distributed Data

4 Statistics of Normally Distributed Data 4 Statistics of Normally Distributed Data 4.1 One Sample a The Three Basic Questions of Inferential Statistics. Inferential statistics form the bridge between the probability models that structure our

More information

HYPOTHESIS TESTING: FREQUENTIST APPROACH.

HYPOTHESIS TESTING: FREQUENTIST APPROACH. HYPOTHESIS TESTING: FREQUENTIST APPROACH. These notes summarize the lectures on (the frequentist approach to) hypothesis testing. You should be familiar with the standard hypothesis testing from previous

More information

Bias Correction of Cross-Validation Criterion Based on Kullback-Leibler Information under a General Condition

Bias Correction of Cross-Validation Criterion Based on Kullback-Leibler Information under a General Condition Bias Correction of Cross-Validation Criterion Based on Kullback-Leibler Information under a General Condition Hirokazu Yanagihara 1, Tetsuji Tonda 2 and Chieko Matsumoto 3 1 Department of Social Systems

More information

Lecture 15. Hypothesis testing in the linear model

Lecture 15. Hypothesis testing in the linear model 14. Lecture 15. Hypothesis testing in the linear model Lecture 15. Hypothesis testing in the linear model 1 (1 1) Preliminary lemma 15. Hypothesis testing in the linear model 15.1. Preliminary lemma Lemma

More information

Goodness of Fit Goodness of fit - 2 classes

Goodness of Fit Goodness of fit - 2 classes Goodness of Fit Goodness of fit - 2 classes A B 78 22 Do these data correspond reasonably to the proportions 3:1? We previously discussed options for testing p A = 0.75! Exact p-value Exact confidence

More information