T 2 Type Test Statistic and Simultaneous Confidence Intervals for Sub-mean Vectors in k-sample Problem
|
|
- Godfrey Clark
- 6 years ago
- Views:
Transcription
1 T Type Test Statistic and Simultaneous Confidence Intervals for Sub-mean Vectors in k-sample Problem Toshiki aito a, Tamae Kawasaki b and Takashi Seo b a Department of Applied Mathematics, Graduate School of Science, Tokyo University of Science, Tokyo, Japan; b Department of Applied Mathematics, Faculty of Science, Tokyo University of Science, Tokyo, Japan Abstract In this paper, we consider tests for sub-mean vectors. In the one-sample and two-sample problems, we give the T type test statistic and the simultaneous confidence intervals by using the approximate upper percentiles of T type test statistic. In the k-sample problem, we give the simultaneous confidence intervals for pairwise multiple comparisons by using Bonferroni s approximation. Finally, we investigate the asymptotic behavior of the approximate upper percentiles of T type statistic by Monte Carlo simulation, and we give an example to illustrate the simultaneous confidence intervals. AMS 000 Mathematics Subject Classification. 6H0, 6H5. Key Words and Phrases: Bonferroni s approximation, Monte Carlo simulation, pairwise multiple comparisons, T max statistic, type I error Introduction Consider the tests of mean when the partial mean vector is given. That is, letting µ = (µ, µ be a mean vector, where µ = (µ, µ,..., µ r and µ = (µ r+, µ r+,..., µ p, the problem is the testing of µ when µ is given. For the problem of sub-mean vectors, Eaton and Kariya (983 derived tests for the independence of two normally distributed sub-mean vectors for the case that an additional random sample is available. Provost (990 obtained explicit expressions for the case that the maximum likelihood estimators (MLEs of all the parameters of the multi-normal random vector are given, and the likelihood ratio statistic for testing the independence between sub-mean vectors has been obtained. For the onesample problem, Rao (949 gave Rao s U-statistic and additional information. The null distribution of Rao s U-statistic has been introduced by Siotani et al. (985. A test for sub-mean vectors with two-step monotone missing data was discussed by Kawasaki and Seo (06. A test for sub-mean vectors in two-sample problem was introduced by Rencher (0. For the k-sample problem, Fujikoshi et al. (00 gave an asymptotic expansion of the distribution of the generalized U-statistic under normality. Gupta et al. (006 gave an asymptotic expansion of the distribution of the generalized U-statistic under
2 a general condition. However, the problem for sub-mean vectors in terms of simultaneous confidence intervals does not appear to have been discussed. In this paper, we give the T type test statistic and derive the simultaneous confidence intervals for sub-mean vectors in the one-sample and two-sample problems. For the k-sample problem, we give the simultaneous confidence intervals for pairwise multiple comparisons for sub-mean vectors. The remainder of this paper is organized. In Section, we propose the T type test statistic for the one-sample case (T test statistic, and its approximate upper percentiles and simultaneous confidence intervals. In Section 3, we propose the T type test statistic for the two-sample case (T test statistic, and its approximate upper percentiles and simultaneous confidence intervals. In Section 4, we present simultaneous confidence intervals for multiple comparisons in the k sample problem. In order to obtain the simultaneous confidence intervals, we derive the approximate upper percentiles of the T max statistic by Bonferroni s approximation. In Section 5, we investigate the asymptotic behavior of the approximate upper percentiles of the T test statistic and the T max statistic by Monte Carlo simulation. In Section 6, we give an example to illustrate simultaneous confidence intervals. One-sample problem. T type test statistic for a sub-mean vector Let x, x,..., x i.i.d. p (µ, Σ. In this section, we consider the following hypothesis H 0 : µ = µ 0 given µ = µ 0 vs. H : µ µ 0 given µ = µ 0, where µ 0 = (µ 0,, µ 0,,..., µ 0,p = (µ 0, µ 0 are given values, and µ = p ( µ µ, µ = r µ µ. µ r, µ = s µ r+ µ r+. µ p, Σ = Σ Σ. Σ Σ We partition x j into a r random vector, a s random vector as x j = (x j, x j, where x j : r, x j : s, p = r + s, j =,,...,. This hypothesis test is then the same as the following hypothesis: H 0 : µ = µ 0 vs. H : µ µ 0,
3 where µ = µ Σ Σ µ, µ 0 = µ 0 Σ Σ µ 0. We then derive the MLEs of µ and Σ. Employing the derivation of Siotani et al. (985, we use the transformed parameters as follows: ( η µ Σ η = = Σ µ 0 Ψ Ψ, Ψ = Σ Σ Σ = η µ 0 Ψ Ψ Σ Σ, Σ where Σ = Σ Σ Σ Σ. We note that (η, Ψ is in one-to-one correspondence with (µ, Σ. Using the transformed parameters (η, Ψ, the likelihood function is given by L(η, Ψ =(π p Ψ Ψ exp (x j Ψ x j η Ψ (x j Ψ x j η exp (x j µ 0 Ψ (x j µ 0. Therefore, the logarithm of L(η, Ψ can be expressed as log L(η, Ψ = p log(π log Ψ log Ψ (x j Ψ x j η Ψ (x j Ψ x j η (x j µ 0 Ψ (x j µ 0. Differentiating log L(η, Ψ with respect to η and Ψ, we have log L(η, Ψ η log L(η, Ψ Ψ log L(η, Ψ Ψ = log L(η, Ψ Ψ = Ψ (x j Ψ x j η, = Ψ + Ψ (x j Ψ x j η (x j Ψ x j η Ψ, Ψ { (xj x (x j x Ψ (x j x (x j x }, = Ψ + Ψ (x j µ 0 (x j µ 0 Ψ. Solving the partial derivative of log L(η, Ψ = 0, the MLEs of η and Ψ are given by η = x Ψ x, Ψ = V, Ψ = V V Ψ, Ψ = V, where x = x j = ( x x, V = (x j x(x j x V V =, V = V V V V V V. 3
4 Therefore, using the relation that (η, Ψ is in one-to-one correspondence with (µ, Σ, the MLEs of µ and Σ are given by ( ( µ x µ = = Σ Σ (x µ 0, µ µ 0 ( Σ Σ Σ = = V + Σ Σ Σ V V Σ Σ Σ V V V The T test statistic for sub-mean vectors is then given by Σ. T = ( µ µ 0 { Ĉov( µ µ 0 } ( µ µ 0, where µ, Ĉov( µ µ 0 are estimators of µ, Cov( µ µ 0. We note that under H 0, T is approximately distributed as a F distribution with r and p degrees of freedom. Using this result, the approximate upper 00α percentile of the T test statistic is given by t app(α = ( r p F r, p(α, where F r, p (α is the upper 00α percentile of the F distribution with r and p degrees of freedom.. Simultaneous confidence intervals for sub-mean vectors We consider the simultaneous confidence intervals for any and all linear compounds of the sub-mean. Using the approximate upper percentiles of T from Section., for any nonnull vector a = (a, a,, a p, the simultaneous approximate confidence intervals for a (µ µ 0 are given by a u where c R app a (µ µ 0 a u + c R app, a R r {0}, u = x µ 0 V V (x µ 0, R app = (t app(αa V a (, c = s. 3 Two-sample problem 3. T type test statistic for sub-mean vectors Let x, x,, x i.i.d. p (µ, Σ, i =,, = ( + (. In this section, we consider the following hypothesis H 0 : µ ( = µ ( given µ ( = µ ( vs. H : µ ( µ ( given µ ( = µ (, 4
5 where We partition x j µ = p µ µ, µ = r µ µ µ r., µ = s µ r+ µ r+. µ p Σ Σ, Σ =. Σ Σ into a r random vector and a s random vector, as x j = (x j, x j, where j =,,...,, and p = r + s. This hypothesis test is then the same as the following hypothesis H 0 : µ ( = µ( vs. H : µ ( µ(, where µ = µ Σ Σ µ, µ = µ ( = µ (. We derive the MLEs of µ and Σ as follows. As for the one-sample case, we use the following transformed parameters (η, Ψ η η = = η ( µ Σ Σ µ, Ψ = µ Ψ Ψ = Ψ Ψ Σ Σ Σ Σ Σ, Σ where Σ = Σ Σ Σ Σ, i =,. We note that (η, Ψ is in one-to-one correspondence with (µ, Σ. Using the transformed parameters (η, Ψ, the likelihood function is given by L(η (, η (, Ψ =(π p Ψ Ψ exp exp i= i= (x j Ψ x j η Ψ (x j Ψ x j η (x j η Ψ (x Therefore, the logarithm of L(η (, η (, Ψ can be expressed as j η. log L(η (, η (, Ψ = p log(π log Ψ log Ψ i= (x j Ψ x j η Ψ (x j Ψ x j η (x j η Ψ (x j η. i= 5
6 Differentiating log L(η (, η (, Ψ with respect to η, Ψ, we have log L(η (, η (, Ψ η log L(η (, η (, Ψ η log L(η (, η (, Ψ Ψ = Ψ + i= log L(η (, η (, Ψ Ψ = log L(η (, η (, Ψ Ψ = Ψ (x j Ψ x j η, = Ψ (x j η, i= Ψ (x j Ψ x j η (x j Ψ x j η Ψ, i= = Ψ + { } Ψ (x j η x j Ψ x x, i= Ψ (x j η (x j η Ψ. Solving the partial derivative of log L(η (, η (, Ψ = 0, the maximum likelihood estimates of η (, η (, and Ψ are given by η = x Ψ x, i =,, η = x, Ψ = V, Ψ = V V Ψ, { Ψ = V + (x (j x (x (j x }, where x = V = V = (x j x j = ( x x, x = x (x j x = V V V = V V i= i= x, ( V V V V, V = V V V V., Therefore, using the relation that (η (, η (, Ψ is in one-to-one correspondence with (µ (, µ (, Σ, the MLEs of µ (, µ (, and Σ are given by ( µ µ x = = Σ Σ (x x, i =,, µ x ( Σ Σ V + Σ Σ Σ V V Σ = = { Σ Σ Σ V V V + i= Σ (x x (x } x, 6
7 The T test statistic for the sub-mean vector is then given by T = ( µ ( µ( { Ĉov µ ( µ( ( } ( µ µ(, where µ and Ĉov µ ( µ( are the estimators of µ and Cov µ ( µ(, respectively. We note that under H 0, T is asymptotically distributed as a χ distribution with r degrees of freedom. However, when the sample is not large, the χ distribution is not a good approximation of the upper percentile of T. Let We can then rewrite T as u = x ( x ( V V (x( x (, c ( 3 = ( ( ( s 3. T = ( c u V u = ( z W z = ( z z z W z z, z where z = c Σ u, W = Σ V Σ. We note that u is distributed on r (µ ( µ(, c Σ when (, (. Therefore, the distribution of z z is a χ distribution with r degrees of freedom. We note that under H 0, ( p s T /( r is approximately distributed as a F distribution with r and p s degrees of freedom. Using this result, the approximate upper 00α percentile of the T statistic is given by t app(α = ( r p s F r, p s (α, where F r, p s (α is the upper 00α percentiles of the F distribution with r and p s degrees of freedom. 3. Simultaneous confidence intervals We consider the simultaneous confidence intervals for any and all linear compounds of the sub-mean. Using the upper percentiles of T from Section 3., for any nonnull vector a = (a, a,, a p, the simultaneous confidence intervals for a (µ ( µ( are given by a c u M a (µ ( µ( a u + c M, a Rr {0}, 7
8 where M = (t (αa V a, and t (α is the upper 00α percentiles of the T test statistic. However, it is not easy to obtain t (α. Therefore, using the approximate upper 00α percentiles of the T test statistic, t app(α, the approximate simultaneous confidence intervals for a (µ ( a c u M app a (µ ( µ( a u + where M app = (t app(αa V a. 4 k-sample problem 4. The T max type test statistic µ( c M app, a R r {0}, can obtained by In this section, we consider the T max type test statistic for testing any two sub-mean vectors and propose the approximate upper 00α percentiles of this statistic with Bonferroni s approximation. Let x, x,, x i.i.d. p (µ, Σ for i =,,, k, = k i=. Gupta et al. (006 has derived the likelihood ratio test (LRT statistic for testing the hypothesis H 0 : µ ( = µ ( = = µ (k given µ ( = µ ( = = µ (k vs. H : at least two µ s are an equal given µ( = µ ( = = µ (k. When H 0 is rejected, our interest is pairwise comparisons of sub-mean vectors. Under the assumption of the common population covariance matrix, for fixed a; b, we can use the T type statistic for the two-sample problem derived in Section3., that is, T ab = ( µ µ(j { Ĉov( µ µ(j } ( µ µ(j, where µ (= µ Σ Σ µ and Ĉov( µ µ(j are estimators of µ respectively. Similarly, as for the two-sample case, the MLEs are given by µ = ( ( µ x = Σ Σ µ x ( Σ Σ Σ = = Σ Σ (x x, V + Σ Σ Σ V V Σ V V { k V + i= Σ (x x (x and Cov( µ µ(j, x }, 8
9 where x = V = V = (x j x j = x, x x = x (x j x = k V V V = V V i= k i= x, ( V V V V, V = V V V V., Similarly, as Section 3., under the hypothesis that the two sub-mean vectors are equal, we have the following approximate upper 00α percentile of the Tab test statistic for fixed i = a, b: t ab app(α = ( kr r k(s + + F r, r k(s++(α, where F r, r k(s++ (α is the upper 00α percentile of the F distribution with r and r k(s++ degrees of freedom. Using the test statistic, the T type test statistic for H 0 : µ (a = µ (b for all a, b, a < b k given µ ( = µ ( = = µ (k vs. H : H 0 given µ ( = µ ( = = µ (k, is given by T max = max a<b k T ab. Generally, it is not easy to obtain the upper percentile of the T max statistic. Therefore, in this section, we adopt Bonferroni s approximation, which is one of the solutions to this problem. We first consider the T ab test statistic. We consider the upper 00α percentile of the T max statistic t max(α when ( = ( = = (k. An approximate upper percentile of T max, t BO(α, is a solution of the equation given by a<b k Pr(T ab > t BO(α = α. Using Bonferroni s approximation, t BO(α is given by t BO(α = t ab(α, 9
10 where α = α/k(k, t ab (α is the upper 00α percentiles of the Tab test statistic. However, it is not easy to obtain t ab (α. Using t ab app(α, the approximate upper 00α percentile of the T max statistic is given by t B app(α = ( kr k(s + r + F r, k(s+ r+(α, where the F r, k(s+ r+ (α is the upper approximate 00α percentile of F distribution with r and r k(s + + degrees of freedom. 4. Simultaneous confidence intervals for multiple comparisons among submean vectors In this section, we consider simultaneous confidence intervals for pairwise multiple comparisons among sub-mean vectors. Using the approximate upper percentile of T max, for any nonnull vector a = (a, a,, a p, the approximate simultaneous confidence intervals for a (µ a cab u ab k L app a (µ (a µ(b a u ab + µ(j, a < b k are given by cab k L app, a R r {0}, a < b k, where L app = ( t B app(αa V a, c ab = ( (a + (b ( k (a (b ( k s. 5 Simulation studies In this section, we perform a Monte Carlo simulation (with 0 6 runs in order to evaluate the asymptotic behavior of the F approximations and the accuracy of the approximate upper 00α percentiles of the T and T max test statistic. Tables a and b present the simulated upper 00α percentile of the T test statistic and the approximate upper 00α percentile of the T test statistic for the two-sample problem; (p, r, s = (4,, 3, (4,,, (4, 3,, (8,, 6, (8, 4, 4, (8, 6, ; α = 0.05, 0.0; and for the following two cases of ( (, ( : ( (, ( = { (l, l, l = 0, 40, 00, 00, 400 (l, l, l = 0, 40, 00, 00. Tables a and b present the type I errors for the upper 00α percentile of the χ distribution with r degrees of freedom and the approximate upper 00α percentile of the T test statistic given by α = Pr(T > χ r(α, α = Pr(T > t app(α, 0
11 respectively. Table gives the approximate upper 00α percentile of the Tmax statistic, t BO(α, and the approximate upper 00α percentile of the Tmax statistic, t B app(α, for the k-sample problem; k = 3, 4, 5, 6, 0; (p, r, s = (4,, ; α = 0.05, 0.0; (k = l, where l = 0, 40, 00, 00, 400. Table presents the type I error for the approximate upper 00α percentile of the Tmax statistic: α 3 = Pr(T max > t B app(α, where α = α/k(k. It may be noted from Tables a and b that the simulated values approach closer to the upper percentile of the χ distribution when both of the sample sizes ( and ( become large. In addition, it can be seen from both tables that the proposed approximation value is good for all cases even when the sample size is small. The results for the type I error of the proposed approximation value are closer than those of the χ value for all cases. In Table, the simulated values also approach the upper 00α percentile of the χ distribution when both of the sample sizes ( and ( become large. The results for the type I error of the proposed approximation value are always lower than α for all cases. That is, it is conservative in terms of the type I error when k is large. Therefore, in terms of the type I error, it can be concluded that the proposed approximation values are more accurate than the simulated values for all cases. In addition, we perform a Monte Carlo simulation (with 0 6 runs in order to evaluate the powers of the T test statistic. Table 3a presents the powers of the T test statistic for the two-sample problem; (p, r, s = (4,, ; α = 0.05, 0.0; δ = µ ( µ ( = 0, 0.,..., ; and for the following cases of ( (, ( : ( = ( = l = 0, 40, 50. Table 3b presents the powers of the T test statistic for the two-sample problem; r =, (p, r, s = (4,,, (5,, 3, (6,, 4, (0,, 8; α = 0.05, 0.0; δ = µ ( µ ( = 0, 0.,..., ; ( (, ( = (50, 50. Tables 3a and 3b present the powers of the T test statistic: β = Pr(T > χ r(α H, β = Pr(T > t app(α H, respectively. It may be noted from Table 3a that the power takes conservative values when both of the sample sizes ( and ( become small. In addition, it may be noted from Table 3b that the power takes values that are a little conservative when s become small. The reasons are that the power may depend on the noncentrality parameters only through θ = δ Σ δ /c, where δ = µ ( µ (.
12 6 umerical example In this section, we discuss an example to illustrate the approximation of t B app(α by comparing each of the simultaneous confidence intervals in Section 4.. In this example, we utilize the data in the iris plant taken from Fisher (936. Data are presented on three species of iris, setosa, versicolor, and virginica, comprising four different measurements: x : petal width, x : petal length, x 3 : sepal width, and x 4 : sepal length. The population mean vectors are µ = (µ, µ = (µ, µ, µ 3, µ 4, where µ : mean of petal width, µ : mean of petal length, µ 3 : mean of sepal width, µ 4 : mean of sepal length, µ = (µ, µ, and µ = (µ 3, µ 4. We assume that these data are distributed normality, and µ = µ ( = µ ( = µ (3. We consider the simultaneous approximate confidence intervals of pairwise comparisons for testing the sub-mean vectors hypothesis: H 0 : µ (a = µ (b for all a, b, a < b 3 given µ ( = µ ( = µ (3 vs. H : H 0 given µ ( = µ ( = µ (3. Table 4 presents the simultaneous confidence intervals obtained by using t max(α, t BO(α, and t B app(α. For example, let a = (0, and α = 0.05; then the simultaneous confidence intervals obtained by using t max(0.05, t BO(0.05, and t B app(0.05 for pairwise comparisons are constructed as a (µ ( µ( obtained by using t max(0.05 : (.74,.066, t BO(0.05 : (.75,.065, t B app(0.05 : (.77,.063, respectively. These simultaneous confidence intervals present the petal length between the score of the setosa population and the score of the versicolor population. The simultaneous confidence intervals obtained by using the approximate upper 00α percentile, t B app(0.05, are a little longer than those obtained by using t BO(0.05 and t max(0.05. The remaining results are the same. Therefore, it can be concluded that the simultaneous confidence intervals obtained by using t B app(α are useful.
13 Table a t (α and t app(α when p = 4 Sample Size α = 0.05 α = 0.0 ( ( t (α t app(α α α t (α t app(α α α (r, s = (, 3, χ (0.05 = 3.84, χ (0.0 = (r, s = (,, χ (0.05 = 5.99, χ (0.0 = (r, s = (3,, χ 3(0.05 = 7.8, χ 3(0.0 =
14 Table b t (α and t app(α when p = 8 Sample Size α = 0.05 α = 0.0 ( ( t (α t app(α α α t (α t app(α α α (r, s = (, 6, χ (0.05 = 5.99, χ (0.0 = (r, s = (4, 4, χ 4(0.05 = 9.49, χ 4(0.0 = (r, s = (6,, χ 6(0.05 =.59, χ 6(0.0 =
15 Table t BO(α and t B app(α when (p, r, s = (4,, Sample Size α = 0.05 α = 0.0 t BO(α t B app(α α 3 t BO(α t B app(α α 3 k = k = k = k = k = ote : χ = 8.9, χ =.4, χ = 9.57, χ =.79, χ = 0.60, χ = 3.8, χ =.4, χ = 4.63, χ = 3.60, χ =
16 Table 3a Power for T test statistic when (p, r, s = (4,, α = 0.05 α = 0.0 δ ( (, ( β β β β 0 (0, (40, (50, ote : δ = µ ( µ (, β = Pr(T > χ r(α H, β = Pr(T > t app(α H 6
17 Table 3b Power for T test statistic when ( (, ( = (50, 50, r = α = 0.05 α = 0.0 δ (p, r, s β β β β 0 (4,, (5,, (6,, (0,, ote : δ = µ ( µ (, β = Pr(T > χ r(α H, β = Pr(T > t app(α H 7
18 Table 4 Simultaneous confidence intervals obtained by using t max(0.05, t BO(0.05, and t B app(0.05 a (, 0 (0, (, a (µ ( µ( µ( µ ( µ ( µ ( (µ ( + µ ( (µ( + µ ( t max(0.05 (.77,.070 (.74,.066 ( 3.448, 3.40 t BO(0.05 (.77,.070 (.75,.065 ( 3.449, 3.39 t B app(0.05 (.8,.067 (.77,.063 ( 3.45, 3.37 a (µ ( µ(3 µ( µ (3 µ ( µ (3 (µ ( + µ ( (µ(3 + µ (3 t max(0.05 (.889,.68 ( 3.53,.945 ( 4.938, t BO(0.05 (.890,.679 ( 3.55,.944 ( 4.939, 4.79 t B app(0.05 (.89,.678 ( 3.56,.94 ( 4.940, 4.77 a (µ ( µ(3 µ( µ (3 µ ( µ (3 (µ ( + µ ( (µ(3 + µ (3 t max(0.05 ( 0.75, ( 0.983, (.594,.386 t BO(0.05 ( 0.76, ( 0.984, (.595,.385 t B app(0.05 ( 0.78, ( 0.986, 0.77 (.0, References [] Eaton, M. L. and Kariya, T. (983, Multivariate tests with incomplete data, The Anal. Stat., Vol., o., [] Fisher, R. A., (936, The use of multiple measurements in taxonomic problems, Ann. Eugen., 7, [3] Fujikoshi, Y., Ulyanov, V. V. and Shimizu, R. (00, Multivariate Statistics: High-Dimensional and Large-Sample Approximations, Hoboken, J: Wiley. [4] Gupta, A. K., Xu, J. and Fujikoshi, Y. (006, An asymptotic expansion of the distribution of Rao s U-statistic under a general condition, J. Multivariate Anal., 97, [5] Kawasaki, T. and Seo, T. (06, A test for subvector of mean vector with two-step monotone missing data, SUT J. Math., 5, -39. [6] Provost, S. B. (990, Estimators for the parameters of a multivariate normal random vector with incomplete data on two subvectors and test of independence, Comput. Statist. Data Anal., 9, [7] Rao, C. R. (949, On some problems arising out of discrimination with multiple characters, Sankhyā, 9, [8] Rencher, A. C. (0, Methods of Multivariate Analysis, Hoboken, J: Wiley. [9] Siotani, M., Hayakawa, T. and Fujikoshi, Y. (985, Modern Multivariate Statistical Analysis: A Graduate Course and Handbook, American Science Press Inc., Ohio. 8
SIMULTANEOUS CONFIDENCE INTERVALS AMONG k MEAN VECTORS IN REPEATED MEASURES WITH MISSING DATA
SIMULTANEOUS CONFIDENCE INTERVALS AMONG k MEAN VECTORS IN REPEATED MEASURES WITH MISSING DATA Kazuyuki Koizumi Department of Mathematics, Graduate School of Science Tokyo University of Science 1-3, Kagurazaka,
More informationTesting equality of two mean vectors with unequal sample sizes for populations with correlation
Testing equality of two mean vectors with unequal sample sizes for populations with correlation Aya Shinozaki Naoya Okamoto 2 and Takashi Seo Department of Mathematical Information Science Tokyo University
More informationOn the conservative multivariate multiple comparison procedure of correlated mean vectors with a control
On the conservative multivariate multiple comparison procedure of correlated mean vectors with a control Takahiro Nishiyama a a Department of Mathematical Information Science, Tokyo University of Science,
More informationExpected probabilities of misclassification in linear discriminant analysis based on 2-Step monotone missing samples
Expected probabilities of misclassification in linear discriminant analysis based on 2-Step monotone missing samples Nobumichi Shutoh, Masashi Hyodo and Takashi Seo 2 Department of Mathematics, Graduate
More informationInferences on a Normal Covariance Matrix and Generalized Variance with Monotone Missing Data
Journal of Multivariate Analysis 78, 6282 (2001) doi:10.1006jmva.2000.1939, available online at http:www.idealibrary.com on Inferences on a Normal Covariance Matrix and Generalized Variance with Monotone
More informationOn the conservative multivariate Tukey-Kramer type procedures for multiple comparisons among mean vectors
On the conservative multivariate Tukey-Kramer type procedures for multiple comparisons among mean vectors Takashi Seo a, Takahiro Nishiyama b a Department of Mathematical Information Science, Tokyo University
More informationEPMC Estimation in Discriminant Analysis when the Dimension and Sample Sizes are Large
EPMC Estimation in Discriminant Analysis when the Dimension and Sample Sizes are Large Tetsuji Tonda 1 Tomoyuki Nakagawa and Hirofumi Wakaki Last modified: March 30 016 1 Faculty of Management and Information
More informationHigh-dimensional asymptotic expansions for the distributions of canonical correlations
Journal of Multivariate Analysis 100 2009) 231 242 Contents lists available at ScienceDirect Journal of Multivariate Analysis journal homepage: www.elsevier.com/locate/jmva High-dimensional asymptotic
More informationHigh-Dimensional AICs for Selection of Redundancy Models in Discriminant Analysis. Tetsuro Sakurai, Takeshi Nakada and Yasunori Fujikoshi
High-Dimensional AICs for Selection of Redundancy Models in Discriminant Analysis Tetsuro Sakurai, Takeshi Nakada and Yasunori Fujikoshi Faculty of Science and Engineering, Chuo University, Kasuga, Bunkyo-ku,
More informationApplied Multivariate and Longitudinal Data Analysis
Applied Multivariate and Longitudinal Data Analysis Chapter 2: Inference about the mean vector(s) Ana-Maria Staicu SAS Hall 5220; 919-515-0644; astaicu@ncsu.edu 1 In this chapter we will discuss inference
More informationConsistency of Test-based Criterion for Selection of Variables in High-dimensional Two Group-Discriminant Analysis
Consistency of Test-based Criterion for Selection of Variables in High-dimensional Two Group-Discriminant Analysis Yasunori Fujikoshi and Tetsuro Sakurai Department of Mathematics, Graduate School of Science,
More informationConsistency of Distance-based Criterion for Selection of Variables in High-dimensional Two-Group Discriminant Analysis
Consistency of Distance-based Criterion for Selection of Variables in High-dimensional Two-Group Discriminant Analysis Tetsuro Sakurai and Yasunori Fujikoshi Center of General Education, Tokyo University
More informationApproximate interval estimation for EPMC for improved linear discriminant rule under high dimensional frame work
Hiroshima Statistical Research Group: Technical Report Approximate interval estimation for PMC for improved linear discriminant rule under high dimensional frame work Masashi Hyodo, Tomohiro Mitani, Tetsuto
More informationMultivariate Statistics
Multivariate Statistics Chapter 2: Multivariate distributions and inference Pedro Galeano Departamento de Estadística Universidad Carlos III de Madrid pedro.galeano@uc3m.es Course 2016/2017 Master in Mathematical
More informationStatistical Inference On the High-dimensional Gaussian Covarianc
Statistical Inference On the High-dimensional Gaussian Covariance Matrix Department of Mathematical Sciences, Clemson University June 6, 2011 Outline Introduction Problem Setup Statistical Inference High-Dimensional
More informationarxiv: v1 [math.st] 2 Apr 2016
NON-ASYMPTOTIC RESULTS FOR CORNISH-FISHER EXPANSIONS V.V. ULYANOV, M. AOSHIMA, AND Y. FUJIKOSHI arxiv:1604.00539v1 [math.st] 2 Apr 2016 Abstract. We get the computable error bounds for generalized Cornish-Fisher
More informationChapter 7, continued: MANOVA
Chapter 7, continued: MANOVA The Multivariate Analysis of Variance (MANOVA) technique extends Hotelling T 2 test that compares two mean vectors to the setting in which there are m 2 groups. We wish to
More informationLikelihood Ratio Criterion for Testing Sphericity from a Multivariate Normal Sample with 2-step Monotone Missing Data Pattern
The Korean Communications in Statistics Vol. 12 No. 2, 2005 pp. 473-481 Likelihood Ratio Criterion for Testing Sphericity from a Multivariate Normal Sample with 2-step Monotone Missing Data Pattern Byungjin
More informationz = β βσβ Statistical Analysis of MV Data Example : µ=0 (Σ known) consider Y = β X~ N 1 (β µ, β Σβ) test statistic for H 0β is
Example X~N p (µ,σ); H 0 : µ=0 (Σ known) consider Y = β X~ N 1 (β µ, β Σβ) H 0β : β µ = 0 test statistic for H 0β is y z = β βσβ /n And reject H 0β if z β > c [suitable critical value] 301 Reject H 0 if
More informationLikelihood Ratio Tests in Multivariate Linear Model
Likelihood Ratio Tests in Multivariate Linear Model Yasunori Fujikoshi Department of Mathematics, Graduate School of Science, Hiroshima University, 1-3-1 Kagamiyama, Higashi Hiroshima, Hiroshima 739-8626,
More informationIn most cases, a plot of d (j) = {d (j) 2 } against {χ p 2 (1-q j )} is preferable since there is less piling up
THE UNIVERSITY OF MINNESOTA Statistics 5401 September 17, 2005 Chi-Squared Q-Q plots to Assess Multivariate Normality Suppose x 1, x 2,..., x n is a random sample from a p-dimensional multivariate distribution
More informationHypothesis Testing. Robert L. Wolpert Department of Statistical Science Duke University, Durham, NC, USA
Hypothesis Testing Robert L. Wolpert Department of Statistical Science Duke University, Durham, NC, USA An Example Mardia et al. (979, p. ) reprint data from Frets (9) giving the length and breadth (in
More informationON CONSTRUCTING T CONTROL CHARTS FOR RETROSPECTIVE EXAMINATION. Gunabushanam Nedumaran Oracle Corporation 1133 Esters Road #602 Irving, TX 75061
ON CONSTRUCTING T CONTROL CHARTS FOR RETROSPECTIVE EXAMINATION Gunabushanam Nedumaran Oracle Corporation 33 Esters Road #60 Irving, TX 7506 Joseph J. Pignatiello, Jr. FAMU-FSU College of Engineering Florida
More informationAn Introduction to Multivariate Methods
Chapter 12 An Introduction to Multivariate Methods Multivariate statistical methods are used to display, analyze, and describe data on two or more features or variables simultaneously. I will discuss multivariate
More informationConsistency of test based method for selection of variables in high dimensional two group discriminant analysis
https://doi.org/10.1007/s42081-019-00032-4 ORIGINAL PAPER Consistency of test based method for selection of variables in high dimensional two group discriminant analysis Yasunori Fujikoshi 1 Tetsuro Sakurai
More informationStatistical Inference with Monotone Incomplete Multivariate Normal Data
Statistical Inference with Monotone Incomplete Multivariate Normal Data p. 1/4 Statistical Inference with Monotone Incomplete Multivariate Normal Data This talk is based on joint work with my wonderful
More informationComparing two independent samples
In many applications it is necessary to compare two competing methods (for example, to compare treatment effects of a standard drug and an experimental drug). To compare two methods from statistical point
More informationMATH5745 Multivariate Methods Lecture 07
MATH5745 Multivariate Methods Lecture 07 Tests of hypothesis on covariance matrix March 16, 2018 MATH5745 Multivariate Methods Lecture 07 March 16, 2018 1 / 39 Test on covariance matrices: Introduction
More informationLecture 32: Asymptotic confidence sets and likelihoods
Lecture 32: Asymptotic confidence sets and likelihoods Asymptotic criterion In some problems, especially in nonparametric problems, it is difficult to find a reasonable confidence set with a given confidence
More informationCOMPARISON OF FIVE TESTS FOR THE COMMON MEAN OF SEVERAL MULTIVARIATE NORMAL POPULATIONS
Communications in Statistics - Simulation and Computation 33 (2004) 431-446 COMPARISON OF FIVE TESTS FOR THE COMMON MEAN OF SEVERAL MULTIVARIATE NORMAL POPULATIONS K. Krishnamoorthy and Yong Lu Department
More informationExercises Chapter 4 Statistical Hypothesis Testing
Exercises Chapter 4 Statistical Hypothesis Testing Advanced Econometrics - HEC Lausanne Christophe Hurlin University of Orléans December 5, 013 Christophe Hurlin (University of Orléans) Advanced Econometrics
More informationSolutions to Final STAT 421, Fall 2008
Solutions to Final STAT 421, Fall 2008 Fritz Scholz 1. (8) Two treatments A and B were randomly assigned to 8 subjects (4 subjects to each treatment) with the following responses: 0, 1, 3, 6 and 5, 7,
More informationSTAT 461/561- Assignments, Year 2015
STAT 461/561- Assignments, Year 2015 This is the second set of assignment problems. When you hand in any problem, include the problem itself and its number. pdf are welcome. If so, use large fonts and
More informationFinal Exam. 1. (6 points) True/False. Please read the statements carefully, as no partial credit will be given.
1. (6 points) True/False. Please read the statements carefully, as no partial credit will be given. (a) If X and Y are independent, Corr(X, Y ) = 0. (b) (c) (d) (e) A consistent estimator must be asymptotically
More informationTesting a Normal Covariance Matrix for Small Samples with Monotone Missing Data
Applied Mathematical Sciences, Vol 3, 009, no 54, 695-70 Testing a Normal Covariance Matrix for Small Samples with Monotone Missing Data Evelina Veleva Rousse University A Kanchev Department of Numerical
More informationA Multivariate Two-Sample Mean Test for Small Sample Size and Missing Data
A Multivariate Two-Sample Mean Test for Small Sample Size and Missing Data Yujun Wu, Marc G. Genton, 1 and Leonard A. Stefanski 2 Department of Biostatistics, School of Public Health, University of Medicine
More informationSemiparametric Discriminant Analysis of Mixture Populations Using Mahalanobis Distance. Probal Chaudhuri and Subhajit Dutta
Semiparametric Discriminant Analysis of Mixture Populations Using Mahalanobis Distance Probal Chaudhuri and Subhajit Dutta Indian Statistical Institute, Kolkata. Workshop on Classification and Regression
More informationAn Introduction to Multivariate Statistical Analysis
An Introduction to Multivariate Statistical Analysis Third Edition T. W. ANDERSON Stanford University Department of Statistics Stanford, CA WILEY- INTERSCIENCE A JOHN WILEY & SONS, INC., PUBLICATION Contents
More information7.2 One-Sample Correlation ( = a) Introduction. Correlation analysis measures the strength and direction of association between
7.2 One-Sample Correlation ( = a) Introduction Correlation analysis measures the strength and direction of association between variables. In this chapter we will test whether the population correlation
More information4.5.1 The use of 2 log Λ when θ is scalar
4.5. ASYMPTOTIC FORM OF THE G.L.R.T. 97 4.5.1 The use of 2 log Λ when θ is scalar Suppose we wish to test the hypothesis NH : θ = θ where θ is a given value against the alternative AH : θ θ on the basis
More informationLikelihood-based inference with missing data under missing-at-random
Likelihood-based inference with missing data under missing-at-random Jae-kwang Kim Joint work with Shu Yang Department of Statistics, Iowa State University May 4, 014 Outline 1. Introduction. Parametric
More informationYou can compute the maximum likelihood estimate for the correlation
Stat 50 Solutions Comments on Assignment Spring 005. (a) _ 37.6 X = 6.5 5.8 97.84 Σ = 9.70 4.9 9.70 75.05 7.80 4.9 7.80 4.96 (b) 08.7 0 S = Σ = 03 9 6.58 03 305.6 30.89 6.58 30.89 5.5 (c) You can compute
More informationLecture 17: Likelihood ratio and asymptotic tests
Lecture 17: Likelihood ratio and asymptotic tests Likelihood ratio When both H 0 and H 1 are simple (i.e., Θ 0 = {θ 0 } and Θ 1 = {θ 1 }), Theorem 6.1 applies and a UMP test rejects H 0 when f θ1 (X) f
More informationMaster s Written Examination
Master s Written Examination Option: Statistics and Probability Spring 016 Full points may be obtained for correct answers to eight questions. Each numbered question which may have several parts is worth
More informationRobustness of the Quadratic Discriminant Function to correlated and uncorrelated normal training samples
DOI 10.1186/s40064-016-1718-3 RESEARCH Open Access Robustness of the Quadratic Discriminant Function to correlated and uncorrelated normal training samples Atinuke Adebanji 1,2, Michael Asamoah Boaheng
More informationA Very Brief Summary of Statistical Inference, and Examples
A Very Brief Summary of Statistical Inference, and Examples Trinity Term 2009 Prof. Gesine Reinert Our standard situation is that we have data x = x 1, x 2,..., x n, which we view as realisations of random
More informationIntroduction Large Sample Testing Composite Hypotheses. Hypothesis Testing. Daniel Schmierer Econ 312. March 30, 2007
Hypothesis Testing Daniel Schmierer Econ 312 March 30, 2007 Basics Parameter of interest: θ Θ Structure of the test: H 0 : θ Θ 0 H 1 : θ Θ 1 for some sets Θ 0, Θ 1 Θ where Θ 0 Θ 1 = (often Θ 1 = Θ Θ 0
More informationSTATS 200: Introduction to Statistical Inference. Lecture 29: Course review
STATS 200: Introduction to Statistical Inference Lecture 29: Course review Course review We started in Lecture 1 with a fundamental assumption: Data is a realization of a random process. The goal throughout
More informationIllustration of the Varying Coefficient Model for Analyses the Tree Growth from the Age and Space Perspectives
TR-No. 14-06, Hiroshima Statistical Research Group, 1 11 Illustration of the Varying Coefficient Model for Analyses the Tree Growth from the Age and Space Perspectives Mariko Yamamura 1, Keisuke Fukui
More informationAnalysis of the AIC Statistic for Optimal Detection of Small Changes in Dynamic Systems
Analysis of the AIC Statistic for Optimal Detection of Small Changes in Dynamic Systems Jeremy S. Conner and Dale E. Seborg Department of Chemical Engineering University of California, Santa Barbara, CA
More informationMISCELLANEOUS TOPICS RELATED TO LIKELIHOOD. Copyright c 2012 (Iowa State University) Statistics / 30
MISCELLANEOUS TOPICS RELATED TO LIKELIHOOD Copyright c 2012 (Iowa State University) Statistics 511 1 / 30 INFORMATION CRITERIA Akaike s Information criterion is given by AIC = 2l(ˆθ) + 2k, where l(ˆθ)
More informationApplied Multivariate Analysis
Department of Mathematics and Statistics, University of Vaasa, Finland Spring 2017 Discriminant Analysis Background 1 Discriminant analysis Background General Setup for the Discriminant Analysis Descriptive
More informationTesting linear hypotheses of mean vectors for high-dimension data with unequal covariance matrices
Testing linear hypotheses of mean vectors for high-dimension data with unequal covariance matrices Takahiro Nishiyama a,, Masashi Hyodo b, Takashi Seo a, Tatjana Pavlenko c a Department of Mathematical
More informationInference on reliability in two-parameter exponential stress strength model
Metrika DOI 10.1007/s00184-006-0074-7 Inference on reliability in two-parameter exponential stress strength model K. Krishnamoorthy Shubhabrata Mukherjee Huizhen Guo Received: 19 January 2005 Springer-Verlag
More informationJournal of Statistical Research 2007, Vol. 41, No. 1, pp Bangladesh
Journal of Statistical Research 007, Vol. 4, No., pp. 5 Bangladesh ISSN 056-4 X ESTIMATION OF AUTOREGRESSIVE COEFFICIENT IN AN ARMA(, ) MODEL WITH VAGUE INFORMATION ON THE MA COMPONENT M. Ould Haye School
More informationApproximate Distributions of the Likelihood Ratio Statistic in a Structural Equation with Many Instruments
CIRJE-F-466 Approximate Distributions of the Likelihood Ratio Statistic in a Structural Equation with Many Instruments Yukitoshi Matsushita CIRJE, Faculty of Economics, University of Tokyo February 2007
More informationHotelling s One- Sample T2
Chapter 405 Hotelling s One- Sample T2 Introduction The one-sample Hotelling s T2 is the multivariate extension of the common one-sample or paired Student s t-test. In a one-sample t-test, the mean response
More informationHypothesis Testing. 1 Definitions of test statistics. CB: chapter 8; section 10.3
Hypothesis Testing CB: chapter 8; section 0.3 Hypothesis: statement about an unknown population parameter Examples: The average age of males in Sweden is 7. (statement about population mean) The lowest
More informationLoglikelihood and Confidence Intervals
Stat 504, Lecture 2 1 Loglikelihood and Confidence Intervals The loglikelihood function is defined to be the natural logarithm of the likelihood function, l(θ ; x) = log L(θ ; x). For a variety of reasons,
More informationUniversity of Lisbon, Portugal
Development and comparative study of two near-exact approximations to the distribution of the product of an odd number of independent Beta random variables Luís M. Grilo a,, Carlos A. Coelho b, a Dep.
More informationAnalysis of variance, multivariate (MANOVA)
Analysis of variance, multivariate (MANOVA) Abstract: A designed experiment is set up in which the system studied is under the control of an investigator. The individuals, the treatments, the variables
More informationSample size determination for logistic regression: A simulation study
Sample size determination for logistic regression: A simulation study Stephen Bush School of Mathematical Sciences, University of Technology Sydney, PO Box 123 Broadway NSW 2007, Australia Abstract This
More informationLikelihood Ratio tests
Likelihood Ratio tests For general composite hypotheses optimality theory is not usually successful in producing an optimal test. instead we look for heuristics to guide our choices. The simplest approach
More informationMaster s Written Examination
Master s Written Examination Option: Statistics and Probability Spring 05 Full points may be obtained for correct answers to eight questions Each numbered question (which may have several parts) is worth
More informationLecture 5: Hypothesis tests for more than one sample
1/23 Lecture 5: Hypothesis tests for more than one sample Måns Thulin Department of Mathematics, Uppsala University thulin@math.uu.se Multivariate Methods 8/4 2011 2/23 Outline Paired comparisons Repeated
More informationInferences about a Mean Vector
Inferences about a Mean Vector Edps/Soc 584, Psych 594 Carolyn J. Anderson Department of Educational Psychology I L L I N O I S university of illinois at urbana-champaign c Board of Trustees, University
More informationSTAT 501 EXAM I NAME Spring 1999
STAT 501 EXAM I NAME Spring 1999 Instructions: You may use only your calculator and the attached tables and formula sheet. You can detach the tables and formula sheet from the rest of this exam. Show your
More informationStatistical Inference with Monotone Incomplete Multivariate Normal Data
Statistical Inference with Monotone Incomplete Multivariate Normal Data p. 1/4 Statistical Inference with Monotone Incomplete Multivariate Normal Data This talk is based on joint work with my wonderful
More informationAsymptotic Analysis of the Generalized Coherence Estimate
IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 49, NO. 1, JANUARY 2001 45 Asymptotic Analysis of the Generalized Coherence Estimate Axel Clausen, Member, IEEE, and Douglas Cochran, Senior Member, IEEE Abstract
More informationST495: Survival Analysis: Hypothesis testing and confidence intervals
ST495: Survival Analysis: Hypothesis testing and confidence intervals Eric B. Laber Department of Statistics, North Carolina State University April 3, 2014 I remember that one fateful day when Coach took
More informationABOUT PRINCIPAL COMPONENTS UNDER SINGULARITY
ABOUT PRINCIPAL COMPONENTS UNDER SINGULARITY José A. Díaz-García and Raúl Alberto Pérez-Agamez Comunicación Técnica No I-05-11/08-09-005 (PE/CIMAT) About principal components under singularity José A.
More information(a) (3 points) Construct a 95% confidence interval for β 2 in Equation 1.
Problem 1 (21 points) An economist runs the regression y i = β 0 + x 1i β 1 + x 2i β 2 + x 3i β 3 + ε i (1) The results are summarized in the following table: Equation 1. Variable Coefficient Std. Error
More informationEMPIRICAL ENVELOPE MLE AND LR TESTS. Mai Zhou University of Kentucky
EMPIRICAL ENVELOPE MLE AND LR TESTS Mai Zhou University of Kentucky Summary We study in this paper some nonparametric inference problems where the nonparametric maximum likelihood estimator (NPMLE) are
More informationStatement: With my signature I confirm that the solutions are the product of my own work. Name: Signature:.
MATHEMATICAL STATISTICS Homework assignment Instructions Please turn in the homework with this cover page. You do not need to edit the solutions. Just make sure the handwriting is legible. You may discuss
More informationGoodness-of-Fit Testing with. Discrete Right-Censored Data
Goodness-of-Fit Testing with Discrete Right-Censored Data Edsel A. Pe~na Department of Statistics University of South Carolina June 2002 Research support from NIH and NSF grants. The Problem T1, T2,...,
More informationHypothesis Testing Based on the Maximum of Two Statistics from Weighted and Unweighted Estimating Equations
Hypothesis Testing Based on the Maximum of Two Statistics from Weighted and Unweighted Estimating Equations Takeshi Emura and Hisayuki Tsukuma Abstract For testing the regression parameter in multivariate
More informationISyE 6416: Computational Statistics Spring Lecture 5: Discriminant analysis and classification
ISyE 6416: Computational Statistics Spring 2017 Lecture 5: Discriminant analysis and classification Prof. Yao Xie H. Milton Stewart School of Industrial and Systems Engineering Georgia Institute of Technology
More informationSTAT 450: Final Examination Version 1. Richard Lockhart 16 December 2002
Name: Last Name 1, First Name 1 Stdnt # StudentNumber1 STAT 450: Final Examination Version 1 Richard Lockhart 16 December 2002 Instructions: This is an open book exam. You may use notes, books and a calculator.
More informationFall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.
1. Let P be a probability measure on a collection of sets A. (a) For each n N, let H n be a set in A such that H n H n+1. Show that P (H n ) monotonically converges to P ( k=1 H k) as n. (b) For each n
More informationLecture 28: Asymptotic confidence sets
Lecture 28: Asymptotic confidence sets 1 α asymptotic confidence sets Similar to testing hypotheses, in many situations it is difficult to find a confidence set with a given confidence coefficient or level
More informationTHE STATISTICAL ANALYSIS OF MONOTONE INCOMPLETE MULTIVARIATE NORMAL DATA
The Pennsylvania State University The Graduate School Department of Statistics THE STATISTICAL ANALYSIS OF MONOTONE INCOMPLETE MULTIVARIATE NORMAL DATA A Dissertation in Statistics by Megan M. Romer c
More informationVariations. ECE 6540, Lecture 10 Maximum Likelihood Estimation
Variations ECE 6540, Lecture 10 Last Time BLUE (Best Linear Unbiased Estimator) Formulation Advantages Disadvantages 2 The BLUE A simplification Assume the estimator is a linear system For a single parameter
More informationTesting Hypothesis. Maura Mezzetti. Department of Economics and Finance Università Tor Vergata
Maura Department of Economics and Finance Università Tor Vergata Hypothesis Testing Outline It is a mistake to confound strangeness with mystery Sherlock Holmes A Study in Scarlet Outline 1 The Power Function
More informationBootstrap Procedures for Testing Homogeneity Hypotheses
Journal of Statistical Theory and Applications Volume 11, Number 2, 2012, pp. 183-195 ISSN 1538-7887 Bootstrap Procedures for Testing Homogeneity Hypotheses Bimal Sinha 1, Arvind Shah 2, Dihua Xu 1, Jianxin
More informationRatio of Vector Lengths as an Indicator of Sample Representativeness
Abstract Ratio of Vector Lengths as an Indicator of Sample Representativeness Hee-Choon Shin National Center for Health Statistics, 3311 Toledo Rd., Hyattsville, MD 20782 The main objective of sampling
More informationAnalysis of Classification in Interval-Valued Information Systems
Analysis of Classification in Interval-Valued Information Systems AMAURY CABALLERO, KANG YEN, EDUARDO CABALLERO Department of Electrical & Computer Engineering Florida International University 10555 W.
More informationRobustness of location estimators under t- distributions: a literature review
IOP Conference Series: Earth and Environmental Science PAPER OPEN ACCESS Robustness of location estimators under t- distributions: a literature review o cite this article: C Sumarni et al 07 IOP Conf.
More informationA Likelihood Ratio Test
A Likelihood Ratio Test David Allen University of Kentucky February 23, 2012 1 Introduction Earlier presentations gave a procedure for finding an estimate and its standard error of a single linear combination
More informationLecture 5: LDA and Logistic Regression
Lecture 5: and Logistic Regression Hao Helen Zhang Hao Helen Zhang Lecture 5: and Logistic Regression 1 / 39 Outline Linear Classification Methods Two Popular Linear Models for Classification Linear Discriminant
More informationIntroduction to Statistical Inference Lecture 10: ANOVA, Kruskal-Wallis Test
Introduction to Statistical Inference Lecture 10: ANOVA, Kruskal-Wallis Test la Contents The two sample t-test generalizes into Analysis of Variance. In analysis of variance ANOVA the population consists
More informationLecture 3. Inference about multivariate normal distribution
Lecture 3. Inference about multivariate normal distribution 3.1 Point and Interval Estimation Let X 1,..., X n be i.i.d. N p (µ, Σ). We are interested in evaluation of the maximum likelihood estimates
More informationData Mining and Analysis: Fundamental Concepts and Algorithms
Data Mining and Analysis: Fundamental Concepts and Algorithms dataminingbook.info Mohammed J. Zaki 1 Wagner Meira Jr. 1 Department of Computer Science Rensselaer Polytechnic Institute, Troy, NY, USA Department
More informationMultivariate analysis of variance and covariance
Introduction Multivariate analysis of variance and covariance Univariate ANOVA: have observations from several groups, numerical dependent variable. Ask whether dependent variable has same mean for each
More informationINFLUENCE OF USING ALTERNATIVE MEANS ON TYPE-I ERROR RATE IN THE COMPARISON OF INDEPENDENT GROUPS ABSTRACT
Mirtagioğlu et al., The Journal of Animal & Plant Sciences, 4(): 04, Page: J. 344-349 Anim. Plant Sci. 4():04 ISSN: 08-708 INFLUENCE OF USING ALTERNATIVE MEANS ON TYPE-I ERROR RATE IN THE COMPARISON OF
More informationClassification Methods II: Linear and Quadratic Discrimminant Analysis
Classification Methods II: Linear and Quadratic Discrimminant Analysis Rebecca C. Steorts, Duke University STA 325, Chapter 4 ISL Agenda Linear Discrimminant Analysis (LDA) Classification Recall that linear
More information4 Statistics of Normally Distributed Data
4 Statistics of Normally Distributed Data 4.1 One Sample a The Three Basic Questions of Inferential Statistics. Inferential statistics form the bridge between the probability models that structure our
More informationHYPOTHESIS TESTING: FREQUENTIST APPROACH.
HYPOTHESIS TESTING: FREQUENTIST APPROACH. These notes summarize the lectures on (the frequentist approach to) hypothesis testing. You should be familiar with the standard hypothesis testing from previous
More informationBias Correction of Cross-Validation Criterion Based on Kullback-Leibler Information under a General Condition
Bias Correction of Cross-Validation Criterion Based on Kullback-Leibler Information under a General Condition Hirokazu Yanagihara 1, Tetsuji Tonda 2 and Chieko Matsumoto 3 1 Department of Social Systems
More informationLecture 15. Hypothesis testing in the linear model
14. Lecture 15. Hypothesis testing in the linear model Lecture 15. Hypothesis testing in the linear model 1 (1 1) Preliminary lemma 15. Hypothesis testing in the linear model 15.1. Preliminary lemma Lemma
More informationGoodness of Fit Goodness of fit - 2 classes
Goodness of Fit Goodness of fit - 2 classes A B 78 22 Do these data correspond reasonably to the proportions 3:1? We previously discussed options for testing p A = 0.75! Exact p-value Exact confidence
More information