Testing Non-Linear Ordinal Responses in L2 K Tables

Size: px
Start display at page:

Download "Testing Non-Linear Ordinal Responses in L2 K Tables"

Transcription

1 RUHUNA JOURNA OF SCIENCE Vol. 2, September 2007, pp ISSN X 2007 Faculty of Science University of Ruhuna. Testing Non-inear Ordinal Responses in 2 Tables eslie Jayasekara Department of Mathematics, Univerrsity of Ruhuna, Matara 81000, Sri anka. leslie@maths.ruh.ac.lk Takashi Yanagawa Center for Biostatistics, urume University, Japan. yanagawa.takashi@kurume-u.ac.jp Abstract. In comparative studies responses of two populations are often summarized in stratified 2 tables with ordinal categories. A test, called Q Et test, is proposed for testing the homogenuity of the populations against non-linear alternatives in such tables. The asymptotic distributions of proposed test are obtained both under the null and alternative hypothesis. The powers of the Q Et test and extended Mantal test are compared by simulation. ey words: ordinal data, 2 k tables, homogeatiy, nonlinear response, asymptotic distribution, Chi-square tests, confounding variables 1. Introduction Data are often summarized in ordinal 2 tables in comparative medical studies, and statistical tests such as Pearson s chi-squared test (Pearson, 1900), Wilcoxon test(wilcoxon, 1945), Nair s test (Nair, 1986), cumulative chi-squared test (Takeuchi and Hirotsu,1982), and max χ 2 test (Hirotsu,1983) are applied to those data for detecting the difference of two distributions. It is well known that Pearson s chi-squared test has no good powers against ordered alternatives. The Wilcoxon test is specifically designed for testing location difference of two samples; also the tests are asymptotically uniformly most powerful unbiased tests for logistic linear alternatives. Whereas Nair s test is designed for detecting dispersion alternatives. The cumulative chi-squared test and max χ 2 test are ominibus tests developed for a wider class of alternatives including linear and non-linear responses. Here we call the response patterns like A, B and C in Table 1 the linear and the other patterns the non-linear; more specifically, the pattern D, E,, and I respectively called the pattern, pattern,, and pattern. We developed the Q t test (Jayasekara and Yanagawa, 1995; Jayasekara, Nishiyama and Yanagawa, 1999) for non-linear responses in 2 tables. The Q t test is shown to have higher powers than those tests just described when the control and treatment groups show the combination of the patterns of non-linear responses. Now confounding variables such as sex, age, blood pressure and others are involved in medical data and it is important to block their effects on testing. The above statistical tests lack this function and logistic models are conventionally employed. However, as is well known, the result of logistic models depend on the goodness of fit of the models to the data, 18

2 Ruhuna Journal of Science 2, pp , (2007) 19 Table 1 Response probabilities and patterns. Ordered categories Pattern A B C D E F G H I and yet it is not easy to establish the models, in particular, when responses are non-linear and the size of the data is not large. Here we may see the raison d etre of nonparametric tests. As far as we are aware the extended Mantel test (Mantel 1963, indis, Heyman, and och, 1978, Yanagawa 1986)(called EMT test in the sequel) is the only test that has been developed in the sprit. The EMT test adjusts for the effect of the confounding variables by stratification. In this paper we consider the same framework as the EMT test and develop a test for testing the homogenuity against non-linear alternatives. More specifically, considering 2 tables such as those given in Table 2 which have been constructed in the l-th stratum, l = 1, 2,,, to block the effect of confounding variables, we extended the Q t test. It is shown that the extended Q t test has higher power in most cases than EMT test when the alternatives are non-linear. 2. The Test Statistics We suppose in Table 2 that Y l1 = (Y l11,y l12,,y l1k ) and Y l2 = (Y l21,y l22,,y l2k ) are multinomial random vectors independently distributed with parameters n l1,(p l11, p l12,, p l1k ) and n l2,(p l21, p l22,, p l2k ) respectively (l = 1, 2,, ). Suppose that categories B 1, B 2,, B are ordinal (B 1 < B 2 < < B ), and define the odds-ratio of category B k relative to category B 1 by ψ lk = p l11 p l2k /p l21 p l1k (k = 1, 2,, ). The homogenuity of the distributions of the control and treatment groups in the table may be represented by ψ lk = 1 for all k = 1, 2,, and l = 1, 2,,, which we simply denote by ψ 1. Thus the problemma is testing H 0 : ψ 1 against H 1 : ψ lk 1 for some k = 2, and l = 1, 2,,. In particular, considered under the alternatives are the odds ratios derived from the combinations of those linear and non-linear response patterns presented in Table 1. We extend the Q t test(jayasekara and Yanagawa (1995), Jayasekara and Nishiyama(1996) ) for testing H 0 vs. H 1. et c lk be the Wilcoxon scorollarye in the l-th table defined by c l1 = (τ l1 N l )/2 and c lk = Σ k 1 j=1τ l j + (τ li N l )/2 for k = 2, 3,,, where τ lk is the marginal total in Table 2. Note that it is normalized to satisfy Σ k=1τ lk c lk = 0 for l = 1,,. Now for two dimensional vectors a l and b l in l-th stratum we define the inner product of a l and b l by (a l, b l ) = Σ k=1 τ lka lk b lk and the norm of a l by a l = (a l, a l ) 1/2.

3 20 Ruhuna Journal of Science 2, pp , (2007) Table 2 2 table in stratum l, l = 1,,. Ordered Categories Stratum l B 1 B 2 B Total Control Y l11 Y l12 Y l1 n l1 Treatment Y l21 Y l22 Y l2 n l2 Total τ l1 τ l2 τ l N l et c r lk be the r-th power of c lk, and put c lr = (c r l1, c r l2,, c r l), r = 0, 1,, 1. Furthermore let a l0 = c l0 / c lo and a lr = d lr / d lr, where d lr = c lr Σ r 1 j=0(c lr, a l j )a l j, r = 1, 2,, 1. Note that (a lr, a lr ) = { 1 if r = r, 0 if r r, for r, r = 0, 1,, 1. (1) Putting for given tε{1, 2,, 1} A = (a lr ),,2,,;r=1,,t ( t matrix), Y 2 = (Y 12 2 ) ( dimensional vector), and S 2 = n l1n l2 /N l (N l 1), U Et = A Y 2 /S we propose the following Q Et as a test statistic for testing H 0 vs. H 1 : Q Et = U EtU Et for each tε{1, 2,, 1}. et U r be the r-th elemmaent of U Et, then we have U r = and the Q Et may represented as follows: a lry l2 /S, (2) Q Et = U 2 1 +U U2 t. Remark: Q Et is identical to the test statistic of EMT test when t = 1, and to the Wilcoxon test statistic (Wilcoxon, 1945) when t = 1 and = 1. Now under H 0, the conditional distribution of Y l2 given C l = {n l1, n l2, τ l1,, τ l } is multiple hypergeometric with E[Y l2k C l ] = n l2 τ lk /N l Cov[Y l2k,y l2k C l ] = n l1n l2 Nl 2(N l 1) τ lk(δ jk N l τ lk ), for k, k = 1,,, where δ kk = 1 if k = k and 0 otherwise. THEOREM 1. Under H 0, the elemmaents of U Et, i.e., U r, r = 1, 2,,t, are uncorollaryrelated with zero mean and unit variance when conditioned on C = {C l, l = 1,, }. Proof. We first show E[U Et C] = 0. Putting τ l = (τ l1,, τ l ), we have fro a lr τ l = 0. (3)

4 Ruhuna Journal of Science 2, pp , (2007) 21 Thus E[U Et C] = A te[y 2 C]/S = A t(n 12 τ 1/N 1,, n 2 τ /N ) /S = = 0. ( n l2 a l1 τ l/n l,, n l2 a lt τ l/n l ) /S We next compute the conditional covariance matrix of U Et. Since Y l2, l = 1,,, are independent, the conditional covariance matrix V(U Et C) can be expressed as, Since V(U Et C) = A tv(y 2 C)A t /S 2 V(Y l2 C) = A t V(U Et C) = a lr V(Y l2 C)a lr = it follows from (3) that Thus fro Therefore from (4), we have V(Y 2 C) A t/s 2 ( a lr V(Y l2 C)a lr ) /S 2 for r, r = 1,,t. (4) τ l1 n l1 n l2 Nl 2 (N l 1) [N la. 0 lr 0. a lrv(y l2 C)a lr = τ lk n l1 n l2 N l (N l 1) (a lr, a lr ). a lr a lr τ lτ l a lr ], { 1 if r = a lr V(Y l2 C)a r, lr /S2 = 0 if r r, r, r = 0, 1,, 1. V(U Et C) = I t. 3. Asymptotic Distributions Theorem 1 shows that the elemmaents of Q Et are uncorollaryrelated and furthermore from (2) they are linear combinations of Y l2 = (Y l21,,y l2 ). However, their weight vectors, a lr s, depends on N l, which makes the asymptotic theory not straightforward. We assume that when N l the marginal totals n li and τ lk for l = 1,, satisfy: (A1) n li /N l γ li, 0 < γ li < 1, for i = 1, 2, and τ lk /N l ρ lk, 0 < ρ lk < 1, for k = 1, 2,,. To begin with we review the normal approximation of a multiple hypergeometric distribution.

5 22 Ruhuna Journal of Science 2, pp , (2007) 3.1. Normal Approximation of a Multiple Hypergeometric Distribution Plackett (1981) showed that when assumption (A1) is satisfied the asymptotic conditional distribution of X l = (Y l22,,y l2 ) given C l = {n l1, n l2, τ l1,, τ l }, is a 1 dimensional normal with mean m l2 and covariance matrix V l, where m l2 = (m l22,, m l2k ) and V 1 (σ l jk ) with σ lkk = m 1 l11 + m 1 l21 + (m 1 l1k + m 1 l = l2k )δ kk, for k, k = 2,, and l = 1,,. Here the sequence {m lik }, i = 1, 2; k = 1, 2,,, is determined uniquely by equations k=1 m lik = n li, 2 i=1 m lik = τ lk, and m l11 m l2k /m l21 m l1k = ψ lk, for i = 1, 2; k = 1, 2,, and l = 1,,. It is known (Sinkhorn, 1967) that the sequence may be obtained by the following iterative scaling procedure: l1k = n l1, k = 1, 2,, l21 = n l2 [1 + j=2 (ψ l j 1)/] l2k = m (2) lik m (3) lik m (2h) lik m (2h+1) lik n l2 ψ lk [1 + j=2 (ψ l j 1)/], k = 2,, = m(1) lik τ lk l.k = m(2) lik n li m (2) li....,, τ lk = m(2h 1) lik m (2h 1) l.k n li = m(2h) lik m (2h) li.,, h = 1, 2,, and l = 1,, Asymptotic Distributions Under H 0 We first evaluate the weight, a lrk. We write N 1/2 l a lrk = O(1) if and only if N 1/2 l a lrk tends to a constant as N. EMMA 1. If (A1) is satisfied, then (i) N 1 l c lrk = O(1), where c lrk = c r lk, is the r-th power of the k-th Wilcoxon scorollarye in the l-th table, for r = 1, 2,, 1, k = 1, 2,, and l = 1,,. (ii) et a l0k be the k-th elemmaent of a l0. Then N r l (c lr, a l0 )a l0k = O(1), for r = 1, 2,, 1, k = 1, 2,, and l = 1,,. (iii) et d lvk be the k-th component of d lv. If N v l d lvk = O(1), k = 1, 2,,, then for any v = 1, 2,, we have (a) N 2v 1 l d lv 2 = O(1), (b) N r l (c lr, d lv )d lvk / d lv 2 = O(1), l = 1,,. (iv) N r l d lrk = O(1) for r = 1, 2,, 1, k = 1, 2,, and l = 1,,. (v) N 1/2 l a lrk = O(1) for r = 1, 2,, 1, k = 1, 2,, and l = 1,,. Proof. (i) By the definition of c lk, and from (A1), we may get N 1 l c lk = O(1) for l = 1,,. Thus it is obvious that N r l c r lk = O(1). (ii) By the definition of a l0 we have a l0k =

6 Ruhuna Journal of Science 2, pp , (2007) 23 1/N 1/2 l for all k. So from (i) we obtain N (r+1/2) l (c lr, a l0 ) = O(1). Thus we have (ii). (iii) (a) The result may be obtained by the definition of d lv. (b) Expanding the inner product (c lr, d lv ) and applying (i) we may show N (r+l+1) l (c lr, d lv ) = O(1). Now using (a), the result follows. (iv) To prove this result we use induction on r. In case of r = 1, d l1k = c l1k (c l1, a l0 )a l0k, for k = 1, 2,,. Applying (i) and (ii), it follows that N 1 l d l1k = O(1) for k = 1, 2,,. Suppose that the result is true for r = 1, 2,, m 1. Since m 1 d lm = c lm j=0 (c lm, a l j )a l j, = c lm (c lm, a l0 )a l0 m 1 j=1 d l j (c lm, d l j ) d l j 2, it follows that N m l d lmk = O(1) from (i), (ii) and (iii). So the result is true for r = m. Thus by the induction the result follows. (v) From the definition of a lr and also by (iv) the result is straightforward. Next, we consider the asymptotic distribution of the test statistics under H 0. To apply the normal approximation in section 3.1 we represent the t dimensional vector U Et by: U Et = B W/S, (5) where B = (b lr ), b lr = (a lr2 a lr1,, a lr a lr1 ) N 1/2 l, l = 1,, ; r = 1, 2,,t, W = (W 1,W 2,,W ), W l = N 1/2 l (X l n l2 τ l /N l ). THEOREM 2. Under H 0, Q Et is asymptotically distributed as a chi-squared distribution with t degrees of freedom as N, l = 1, 2,,. Proof. From section 3.1 we have m lik = n li τ lk /N l, under H 0. Thus the conditional distribution of W i given C l = {n l1, n l2, τ l1,, τ l } converges in distribution to N 1 (0, l0 ) as N l, where 1 l0 = (σ l jk0), j, k = 2,,, with σ l jk0 = [ρ 1 l1 + δ jkρ 1 lk ]/(γ l1γ l2 ). Furthermore, since N 1/2 l a lrk = O(1) from emmama 1(v), we have b lr = O(1). Thus as N l, l = 1,,, it will be easy to show that U Et = B W/S converges in distribution to a t dimensional normal distribution with mean zero and the covariance matrix 10 V[U Et ] = B B/S2 (6) Now putting M l = 0 ρ l2 (N l ρ l2 ) ρ l2 ρ l3 ρ l2 ρ l ρ l3 ρ l2 ρ l3 (N l ρ l3 ) ρ l3 ρ l... ρ l ρ l2 ρ l ρ l3 ρ l (N l ρ l ) γ l1γ l2,

7 24 Ruhuna Journal of Science 2, pp , (2007) we may show Furthermore, fro B 1 M l l0 M = I 1. M B/S2 I t, where means that the ratio of the both hands side tends to one as N l, l = 1, 2,,. Thus from (6) V[U Et ] I t, and Q Et = U EtU Et follows asymptotically a chi-squared distribution with t degrees of freedom Asymptotic Distribution Under Contiguous Alternatives In this section we obtain the asymptotic distribution of Q Et under alternative hypothesis H 1 : ψ lk = 1 + A lk /N 1/2 l, for k = 2, 3,,, where A lk is a constant. EMMA 2. Under H 1, we may represent m lik = m 0 lik + N1/2 l η lik + O(N 1/2 l ) for i = 1, 2, k = 1, 2,, and l = 1,,, where m 0 lik = n liτ k /N l is the asymptotic mean under H 0, and k = 2, 3,,. η i1 = ( 1) i+1 N 1/2 l γ l1 γ l2 ρ l1 η ik = ( 1) i N 1/2 l j=2 γ l1 γ l2 ρ lk [ψ lk 1 (ψ l j 1)ψ l j j=2 (ψ l j 1)ψ l j ] Proof. Adopting the iterative scaling algoritheorem in section 3.1, we have the following expressions for l1k, l21 l2k, m (2) li1 and m (2) lik, under H 1. l1k = n l1 l21 = n l2 [1 j=2 l2k = n l2 m (2) li1 m (2) lik [ψ lk j=2 (ψ l j 1) (ψ l j 1) = m 0 li1 +( 1) i+1 N l γ l1 γ l2 ρ l1 + o(n 1/2 l )] + o(n 1/2 l )], k = 2, 3,,, j=2 = m 0 lik +( 1) i N l γ l1 γ l2 ρ lk [ψ lk 1 (ψ l j 1) j=2 + o(n 1/2 l ), (ψ l j 1) ]+o(n 1/2 l ).

8 Ruhuna Journal of Science 2, pp , (2007) 25 Using mathematical induction on v, we may show m (v) lik = m0 lik + N1/2 l η lik + o(n 1/2 l ), k = 1, 2,,, and v = 3, 4,. Thus we have the desired results. THEOREM 3. Under H 1, Q Et is asymptotically distributed as a non-central chi-squared distribution with t degrees of freedom. The noncentrality parameter is given by λ = t r=1 δ2 r, where δ r = N lγ l1 γ l2 k=2 a lrkρ lk (ψ lk 1)/S. Proof. From Section 3.1 and emmama 2 it follows that under H 1, the conditional distribution of W l given C l = {n l1, n l2, τ l1,, τ l } converges in distribution to N 1 (η l2, l0 ), where η l2 = (η l22,, η l2 ), and l0 is that given in the proof of Theorem 2. Thus under H 1, U Et = B W/S converges in distribution to t dimentional normal distribution with mean δ Et = B (η 12,, η 2) /S and covariance matrix V[U Et ], which is shown to be I t in the proof of Theorem 2. The r-th elemmaent of δ Et, say δ r, is obtained as: From (7) and ψ l1 = 1, we have δ r = δ r = k=2 N l γ l1 γ l2 k=2 N 1/2 l (a lrk a lr1 )η l2k /S. a lrk ρ lk (ψ lk 1)/S. The theorem is immediately obtained from these results. COROARY 1. The power of U 2 r is approximately maximized when ln ψ lk = β l a lrk, k = 1, 2,,, for some constant β l, l = 1, 2,,. Proof. From the proof of Theorem 3 it follows that Ur 2 follows asymptotically a noncentral chi-squared distribution with one degree of freedom with noncentral parameter δ 2 r. Thus the asymptotic power of Ur 2 for testing H 0 vs. H 1 may be approximated by P(U 2 r χ 2 1(α) H 0 ) Φ(δ r χ(α)), where Φ is the cdf of a standard normal distribution. Since δ r may be represented by δ r = γ l1 γ l2 (a lr, ψ l 1)/S, this power is maximized when ψ 1 = β l a lr, that is when ln ψ lk β l a lrk for some constant β l. From the corollaryollary the statistic Q Et = U1 2 + U U t 2 is viewed as a sum of the statistics that are asymptotically optimum against the alternatives which are expressed as log linearities of the odds ratios with scorollarye a lrk, the standardized r-th power of the Wilcoxon scorollarye.

9 26 Ruhuna Journal of Science 2, pp , (2007) 4. Simulation Studies Simulation was conducted to compare the Q Et, t = 1, 2, 3, 4, test with the EMT test (Mantel 1963, andis, Heyman and och, 1978, Yanagawa 1986). Because the EMT test with Wilcoxon scorollarye is equivalent to the Q E1 test, we herein considered the EMT test with scorollaryes 0, 1, 2,, and 1 assigned to categories B 1, B 2,, and B, respectively. First we assessed Type I error of the Q Et, t = 1, 2, 3, 4 and EMT tests at the significance level α = The response probabilities employed are those listed in Table 1. We considered four strata and combinations of response patterns shown in the first column of Table 3. For example, (,,, ) in the table means that the response probabilities in the 1st stratum are p 111 = p 121 = 0.1, p 112 = p 122 = 0.15, p 113 = p 123 = 0.2, p 114 = p 124 = 0.25, p 115 = p 125 = 0.3; 2nd stratum are p 211 = p 221 = 0.1, p 212 = p 222 = 0.15, p 213 = p 223 = 0.2, p 214 = p 224 = 0.25, p 215 = p 225 = 0.3; and so on. We generated 10,000, four 2 5 tables for each combination of patterns and computed empirical significance levels when n l1 = n l2 = 60, 80, and 100. The results are listed in Table 3. The table shows that Type I error of the Q Et and EMT tests are close to the nominal level for all combinations of patterns. Second we assessed the powers of the Q Et, t = 1, 2, 3, 4, and EMT tests. We conducted similar simulation as above by using again the response probabilities listed in Table 1. Considering the combinations of pattern of distribution of Y 1 from {(,,, ), (,,, ), (,,, ),,(,,, ),(,,, ),(,,, ),(,,, ), (,,, )} we computed the powers of the tests for all combinations of patterns of each distribution, 48 all together, when n l1 = n l2 = 100, l = 1, 2, 3 and 4. The tests which give the largest and second largest powers are listed in Table 4a, 4b, and 4c. For example, the entry of the 2nd row and 3rd column in Table 4a means that when the pattern of Y 1 is and that of Y 2 is the test with the largest power is Q E4 followed by Q E3 ; and the entry of the 2nd row and 4th column in Table 4c means that when the pattern of Y 1 is and that of Y 2 then the test with the largest power is Q E4 followed by Q E3. The tests in the tables show that those tests have equal powers. The tables show that in most combinations, 45 among 48, the powers of the class of the Q Et test are larger or equal to than those of the EMT test. Table 5 lists the maximum, mean and minimum values of the powers of each test for 48 combinations of response patterns considered in Table 4. Inspection of the table shows that the mean and minimum powers of the Q Et test dominates the corollaryresponding values of the other tests, and that the maximum powers of the tests are almost equal. 5. Discussion The Q Et test is proposed for testing the homogeneity against non-linear responses in 2 tables. We took into account the combinations of patterns of linear and non-linear responses summarized in Table 1, and shown that the class of Q Et test is superior to the extended Mantel test (Mantel 1963, andis, Heyman and och 1978, Yanagawa 1986). Those non-linear patterns we considered often appear, for example, in Phase III randomized clinical trials for proving the efficacy of a new drug against the active control,in which the efficacy is sometimes categorized as excellent, effective slightly effective, not effective and aggravation. We emphasize that in such example, the response probabilities like 0.15, 0.25, 0.1, 0.3 and

10 Ruhuna Journal of Science 2, pp , (2007) 27 Table 3 Estimated Type I errors of the Q Et, t = 1, 2, 3, 4, and extended Mantel test (EMT). Pattern Sample size Estimated Type I error levels n l1 = n l2, l = 1, 2, 3, 4 Q E1 Q E2 Q E3 Q E4 EMT (,,, ) (,,, ) (,,, ) (,,, ) (,,, ) (,,, ) (,,, ) (,,, ) (,,, ) (,,, ) (,,, ) (,,, ) (,,, )}

11 28 Ruhuna Journal of Science 2, pp , (2007) Table 4 Tests which give the largest and second largest powers: Y 1 (,,, ) (,,, ) (,,, ) (,,, ) (,,, ) Q E4, Q E3 Q E4, Q E3 Q E3, Q E4 Q E4, Q E2 (,,, ) Q E4, Q E1 (Q E1, Q E2, Q E3, (Q E1, Q E2, Q E3, EMT, Q E2 Q E4, EMT) Q E4, EMT) (,,, ) (Q E1, Q E2, Q E3, (Q E1, Q E2, Q E3, (Q E1, Q E2, Q E3, (Q E1, Q E2, Q E3, Q E4, EMT) Q E4, EMT) Q E4, EMT) Q E4, EMT) (,,, ) Q E2, Q E4 (Q E2, Q E3, Q E2, Q E3 (Q E2, Q E3, Q E4, EMT) Q E2, Q E1 ) (,,, ) (Q E2, Q E3, Q E2, Q E3 (Q E2, Q E3, Q E3, Q E2 Q E4, EMT) Q E4, Q E1 ) (,,, ) Q E3, Q E4 (Q E3, Q E4, Q E2 ) (Q E3, Q E4, Q E2 ) Q E4, Q E3 (,,, (Q E3, Q E4, Q E2 ) Q E4, Q E3 (Q E3, Q E4, EMT) Q E3, Q E4 (,,, ) Q E4, Q E3 Q E4, Q E3 Q E4, Q E3 Q E4, Q E3 (,,, ) Q E4, Q E3 Q E4, Q E3 Q E4, Q E3 Q E4, Q E2 (,,, ) - Q E4, Q E3 Q E4, Q E1 Q E4, Q E3 (,,, ) Q E4, Q E3 - Q E4, Q E3 EMT, Q E3 (,,, ) Q E4, Q E1 Q E1, Q E2 - Q E4, Q E3 (,,, ) Q E4, Q E3 EMT, Q E1 Q E4, Q E3 - Y 2 Table 5 The maximum, mean and the minimum powers of the tests for 48 combinations of the patterns in Table 4. Q E1 Q E2 Q E3 Q E4 EMT Max Mean Min ,i.e. pattern is not unreasonable. It is suggested in the simulation that when all combinations of those response patterns are taken into account the Q E4 test is good choice. The Q Et is shown to be the sum of U 2 r, r = 1, 2,,t, that are asymptotically optimum against the alternatives which are expressed as log linearities of the odds ratios with scorollarye a lrk, the standardized r-th power of the Wilcoxon scorollarye. References Pearson,. (1900), On criterion that a given system of deviations from the probable in the case of a correlated system of variables is such that it can be reasonably supposed to have arisen from random sampling, Philos. Mag., series 5, 50, (Reprinted 1948 in arl Pearsons Early statistical papers, ed. By E. S. Pearson, Cambridge University Press.) Nair, V. N. (1986), On Testing in Industrial Experiments with Ordered Categorical Data, Journal of American Statistical Association, 58,

12 Ruhuna Journal of Science 2, pp , (2007) 29 Takeuchi,. and Hirotsu, C. (1982), The Cumulative Chi Square Method Against Ordered Alternative in Two-way Contingency Tables, Reports of Statistical Application Research, Japanese Union of Scientists and Engineers, 29, Hirotsu, C. (1983), Defining the pattern of association in two-way contingency tables, Biometrika, 70, Jayasekara,. and Yanagawa, T. (1995) Testing ordered categorical data in 2 k Tables by the statistics with orthonormal scorollarye vectors, Bulletin of Informatics and Cybernetics, 27, Jayasekara,., Nishiyama, H. and Yanagawa, T. (1999) Versatile Test for Ordinal Data in 2 k Tables, indis, J.R., Heyman, E.R. and och, G.G. (1978), Average partial association in three way contingency tables: a review and discussion of alternative tests, International Statistical Review, 46, Mantel, N. (1963), Chi-square tests with one degree of freedom: extensions of the Mantel- Haenszel procedure, Journal of American Statistical Association, 58, Plackett, R.. (1981). The Analysis of Categorical Data, ondon: Griffin. Sinkhorn, R. (1967), Diagonal equivalence to matrices with prescribed row and column sums, Amer. Math. Month., 74, Wilcoxon, Frank (1945), Individual Comparisons by Ranking Methods, Biometrics 1, Ynagawa, T. (1986), Analysis of Discrete Multivariate Data, Tokyo: yoritu Shuppan.

Repeated ordinal measurements: a generalised estimating equation approach

Repeated ordinal measurements: a generalised estimating equation approach Repeated ordinal measurements: a generalised estimating equation approach David Clayton MRC Biostatistics Unit 5, Shaftesbury Road Cambridge CB2 2BW April 7, 1992 Abstract Cumulative logit and related

More information

Rank-sum Test Based on Order Restricted Randomized Design

Rank-sum Test Based on Order Restricted Randomized Design Rank-sum Test Based on Order Restricted Randomized Design Omer Ozturk and Yiping Sun Abstract One of the main principles in a design of experiment is to use blocking factors whenever it is possible. On

More information

CHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007)

CHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007) FROM: PAGANO, R. R. (007) I. INTRODUCTION: DISTINCTION BETWEEN PARAMETRIC AND NON-PARAMETRIC TESTS Statistical inference tests are often classified as to whether they are parametric or nonparametric Parameter

More information

Discrete Multivariate Statistics

Discrete Multivariate Statistics Discrete Multivariate Statistics Univariate Discrete Random variables Let X be a discrete random variable which, in this module, will be assumed to take a finite number of t different values which are

More information

Topic 21 Goodness of Fit

Topic 21 Goodness of Fit Topic 21 Goodness of Fit Contingency Tables 1 / 11 Introduction Two-way Table Smoking Habits The Hypothesis The Test Statistic Degrees of Freedom Outline 2 / 11 Introduction Contingency tables, also known

More information

MARGINAL HOMOGENEITY MODEL FOR ORDERED CATEGORIES WITH OPEN ENDS IN SQUARE CONTINGENCY TABLES

MARGINAL HOMOGENEITY MODEL FOR ORDERED CATEGORIES WITH OPEN ENDS IN SQUARE CONTINGENCY TABLES REVSTAT Statistical Journal Volume 13, Number 3, November 2015, 233 243 MARGINAL HOMOGENEITY MODEL FOR ORDERED CATEGORIES WITH OPEN ENDS IN SQUARE CONTINGENCY TABLES Authors: Serpil Aktas Department of

More information

CDA Chapter 3 part II

CDA Chapter 3 part II CDA Chapter 3 part II Two-way tables with ordered classfications Let u 1 u 2... u I denote scores for the row variable X, and let ν 1 ν 2... ν J denote column Y scores. Consider the hypothesis H 0 : X

More information

Lecture 23. November 15, Department of Biostatistics Johns Hopkins Bloomberg School of Public Health Johns Hopkins University.

Lecture 23. November 15, Department of Biostatistics Johns Hopkins Bloomberg School of Public Health Johns Hopkins University. This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this

More information

Small n, σ known or unknown, underlying nongaussian

Small n, σ known or unknown, underlying nongaussian READY GUIDE Summary Tables SUMMARY-1: Methods to compute some confidence intervals Parameter of Interest Conditions 95% CI Proportion (π) Large n, p 0 and p 1 Equation 12.11 Small n, any p Figure 12-4

More information

Textbook Examples of. SPSS Procedure

Textbook Examples of. SPSS Procedure Textbook s of IBM SPSS Procedures Each SPSS procedure listed below has its own section in the textbook. These sections include a purpose statement that describes the statistical test, identification of

More information

Bivariate Relationships Between Variables

Bivariate Relationships Between Variables Bivariate Relationships Between Variables BUS 735: Business Decision Making and Research 1 Goals Specific goals: Detect relationships between variables. Be able to prescribe appropriate statistical methods

More information

Two-way contingency tables for complex sampling schemes

Two-way contingency tables for complex sampling schemes Biomctrika (1976), 63, 2, p. 271-6 271 Printed in Oreat Britain Two-way contingency tables for complex sampling schemes BT J. J. SHUSTER Department of Statistics, University of Florida, Gainesville AND

More information

Nonparametric analysis of blocked ordered categories data: some examples revisited

Nonparametric analysis of blocked ordered categories data: some examples revisited University of Wollongong Research Online Faculty of Informatics - Papers (Archive) Faculty of Engineering and Information Sciences 2006 Nonparametric analysis of blocked ordered categories data: some examples

More information

Testing Independence

Testing Independence Testing Independence Dipankar Bandyopadhyay Department of Biostatistics, Virginia Commonwealth University BIOS 625: Categorical Data & GLM 1/50 Testing Independence Previously, we looked at RR = OR = 1

More information

2.3 Analysis of Categorical Data

2.3 Analysis of Categorical Data 90 CHAPTER 2. ESTIMATION AND HYPOTHESIS TESTING 2.3 Analysis of Categorical Data 2.3.1 The Multinomial Probability Distribution A mulinomial random variable is a generalization of the binomial rv. It results

More information

Measure for No Three-Factor Interaction Model in Three-Way Contingency Tables

Measure for No Three-Factor Interaction Model in Three-Way Contingency Tables American Journal of Biostatistics (): 7-, 00 ISSN 948-9889 00 Science Publications Measure for No Three-Factor Interaction Model in Three-Way Contingency Tables Kouji Yamamoto, Kyoji Hori and Sadao Tomizawa

More information

DIAGNOSTICS FOR STRATIFIED CLINICAL TRIALS IN PROPORTIONAL ODDS MODELS

DIAGNOSTICS FOR STRATIFIED CLINICAL TRIALS IN PROPORTIONAL ODDS MODELS DIAGNOSTICS FOR STRATIFIED CLINICAL TRIALS IN PROPORTIONAL ODDS MODELS Ivy Liu and Dong Q. Wang School of Mathematics, Statistics and Computer Science Victoria University of Wellington New Zealand Corresponding

More information

INFORMATION THEORY AND STATISTICS

INFORMATION THEORY AND STATISTICS INFORMATION THEORY AND STATISTICS Solomon Kullback DOVER PUBLICATIONS, INC. Mineola, New York Contents 1 DEFINITION OF INFORMATION 1 Introduction 1 2 Definition 3 3 Divergence 6 4 Examples 7 5 Problems...''.

More information

An Approximate Test for Homogeneity of Correlated Correlation Coefficients

An Approximate Test for Homogeneity of Correlated Correlation Coefficients Quality & Quantity 37: 99 110, 2003. 2003 Kluwer Academic Publishers. Printed in the Netherlands. 99 Research Note An Approximate Test for Homogeneity of Correlated Correlation Coefficients TRIVELLORE

More information

Chapter 2: Describing Contingency Tables - II

Chapter 2: Describing Contingency Tables - II : Describing Contingency Tables - II Dipankar Bandyopadhyay Department of Biostatistics, Virginia Commonwealth University BIOS 625: Categorical Data & GLM [Acknowledgements to Tim Hanson and Haitao Chu]

More information

Sections 2.3, 2.4. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 770: Categorical Data Analysis 1 / 21

Sections 2.3, 2.4. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 770: Categorical Data Analysis 1 / 21 Sections 2.3, 2.4 Timothy Hanson Department of Statistics, University of South Carolina Stat 770: Categorical Data Analysis 1 / 21 2.3 Partial association in stratified 2 2 tables In describing a relationship

More information

Decomposition of Parsimonious Independence Model Using Pearson, Kendall and Spearman s Correlations for Two-Way Contingency Tables

Decomposition of Parsimonious Independence Model Using Pearson, Kendall and Spearman s Correlations for Two-Way Contingency Tables International Journal of Statistics and Probability; Vol. 7 No. 3; May 208 ISSN 927-7032 E-ISSN 927-7040 Published by Canadian Center of Science and Education Decomposition of Parsimonious Independence

More information

Negative Multinomial Model and Cancer. Incidence

Negative Multinomial Model and Cancer. Incidence Generalized Linear Model under the Extended Negative Multinomial Model and Cancer Incidence S. Lahiri & Sunil K. Dhar Department of Mathematical Sciences, CAMS New Jersey Institute of Technology, Newar,

More information

Chapter 10. Chapter 10. Multinomial Experiments and. Multinomial Experiments and Contingency Tables. Contingency Tables.

Chapter 10. Chapter 10. Multinomial Experiments and. Multinomial Experiments and Contingency Tables. Contingency Tables. Chapter 10 Multinomial Experiments and Contingency Tables 1 Chapter 10 Multinomial Experiments and Contingency Tables 10-1 1 Overview 10-2 2 Multinomial Experiments: of-fitfit 10-3 3 Contingency Tables:

More information

APPENDIX B Sample-Size Calculation Methods: Classical Design

APPENDIX B Sample-Size Calculation Methods: Classical Design APPENDIX B Sample-Size Calculation Methods: Classical Design One/Paired - Sample Hypothesis Test for the Mean Sign test for median difference for a paired sample Wilcoxon signed - rank test for one or

More information

Statistics 3858 : Contingency Tables

Statistics 3858 : Contingency Tables Statistics 3858 : Contingency Tables 1 Introduction Before proceeding with this topic the student should review generalized likelihood ratios ΛX) for multinomial distributions, its relation to Pearson

More information

Regression models for multivariate ordered responses via the Plackett distribution

Regression models for multivariate ordered responses via the Plackett distribution Journal of Multivariate Analysis 99 (2008) 2472 2478 www.elsevier.com/locate/jmva Regression models for multivariate ordered responses via the Plackett distribution A. Forcina a,, V. Dardanoni b a Dipartimento

More information

Preface Introduction to Statistics and Data Analysis Overview: Statistical Inference, Samples, Populations, and Experimental Design The Role of

Preface Introduction to Statistics and Data Analysis Overview: Statistical Inference, Samples, Populations, and Experimental Design The Role of Preface Introduction to Statistics and Data Analysis Overview: Statistical Inference, Samples, Populations, and Experimental Design The Role of Probability Sampling Procedures Collection of Data Measures

More information

Asymptotic equivalence of paired Hotelling test and conditional logistic regression

Asymptotic equivalence of paired Hotelling test and conditional logistic regression Asymptotic equivalence of paired Hotelling test and conditional logistic regression Félix Balazard 1,2 arxiv:1610.06774v1 [math.st] 21 Oct 2016 Abstract 1 Sorbonne Universités, UPMC Univ Paris 06, CNRS

More information

Generalized Linear Model under the Extended Negative Multinomial Model and Cancer Incidence

Generalized Linear Model under the Extended Negative Multinomial Model and Cancer Incidence Generalized Linear Model under the Extended Negative Multinomial Model and Cancer Incidence Sunil Kumar Dhar Center for Applied Mathematics and Statistics, Department of Mathematical Sciences, New Jersey

More information

Robust covariance estimator for small-sample adjustment in the generalized estimating equations: A simulation study

Robust covariance estimator for small-sample adjustment in the generalized estimating equations: A simulation study Science Journal of Applied Mathematics and Statistics 2014; 2(1): 20-25 Published online February 20, 2014 (http://www.sciencepublishinggroup.com/j/sjams) doi: 10.11648/j.sjams.20140201.13 Robust covariance

More information

The purpose of this section is to derive the asymptotic distribution of the Pearson chi-square statistic. k (n j np j ) 2. np j.

The purpose of this section is to derive the asymptotic distribution of the Pearson chi-square statistic. k (n j np j ) 2. np j. Chapter 9 Pearson s chi-square test 9. Null hypothesis asymptotics Let X, X 2, be independent from a multinomial(, p) distribution, where p is a k-vector with nonnegative entries that sum to one. That

More information

Review. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 770: Categorical Data Analysis

Review. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 770: Categorical Data Analysis Review Timothy Hanson Department of Statistics, University of South Carolina Stat 770: Categorical Data Analysis 1 / 22 Chapter 1: background Nominal, ordinal, interval data. Distributions: Poisson, binomial,

More information

Assessing GEE Models with Longitudinal Ordinal Data by Global Odds Ratio

Assessing GEE Models with Longitudinal Ordinal Data by Global Odds Ratio Int. Statistical Inst.: Proc. 58th World Statistical Congress, 2011, Dublin (Session CPS074) p.5763 Assessing GEE Models wh Longudinal Ordinal Data by Global Odds Ratio LIN, KUO-CHIN Graduate Instute of

More information

Multivariate Extensions of McNemar s Test

Multivariate Extensions of McNemar s Test Multivariate Extensions of McNemar s Test Bernhard Klingenberg Department of Mathematics and Statistics, Williams College Williamstown, MA 01267, U.S.A. e-mail: bklingen@williams.edu and Alan Agresti Department

More information

Estimating the Marginal Odds Ratio in Observational Studies

Estimating the Marginal Odds Ratio in Observational Studies Estimating the Marginal Odds Ratio in Observational Studies Travis Loux Christiana Drake Department of Statistics University of California, Davis June 20, 2011 Outline The Counterfactual Model Odds Ratios

More information

Computational Systems Biology: Biology X

Computational Systems Biology: Biology X Bud Mishra Room 1002, 715 Broadway, Courant Institute, NYU, New York, USA L#7:(Mar-23-2010) Genome Wide Association Studies 1 The law of causality... is a relic of a bygone age, surviving, like the monarchy,

More information

Good Confidence Intervals for Categorical Data Analyses. Alan Agresti

Good Confidence Intervals for Categorical Data Analyses. Alan Agresti Good Confidence Intervals for Categorical Data Analyses Alan Agresti Department of Statistics, University of Florida visiting Statistics Department, Harvard University LSHTM, July 22, 2011 p. 1/36 Outline

More information

Minimum Phi-Divergence Estimators and Phi-Divergence Test Statistics in Contingency Tables with Symmetry Structure: An Overview

Minimum Phi-Divergence Estimators and Phi-Divergence Test Statistics in Contingency Tables with Symmetry Structure: An Overview Symmetry 010,, 1108-110; doi:10.3390/sym01108 OPEN ACCESS symmetry ISSN 073-8994 www.mdpi.com/journal/symmetry Review Minimum Phi-Divergence Estimators and Phi-Divergence Test Statistics in Contingency

More information

3 Joint Distributions 71

3 Joint Distributions 71 2.2.3 The Normal Distribution 54 2.2.4 The Beta Density 58 2.3 Functions of a Random Variable 58 2.4 Concluding Remarks 64 2.5 Problems 64 3 Joint Distributions 71 3.1 Introduction 71 3.2 Discrete Random

More information

Multi-Level Test of Independence for 2 X 2 Contingency Table using Cochran and Mantel Haenszel Statistics

Multi-Level Test of Independence for 2 X 2 Contingency Table using Cochran and Mantel Haenszel Statistics IJISET - International Journal of Innovative Science, Engineering & Technology, Vol. Issue 8, August 015. ISSN 348 7968 Multi-Level Test of Independence for X Contingency Table using Cochran and Mantel

More information

A Generalized Global Rank Test for Multiple, Possibly Censored, Outcomes

A Generalized Global Rank Test for Multiple, Possibly Censored, Outcomes A Generalized Global Rank Test for Multiple, Possibly Censored, Outcomes Ritesh Ramchandani Harvard School of Public Health August 5, 2014 Ritesh Ramchandani (HSPH) Global Rank Test for Multiple Outcomes

More information

Parametric Modelling of Over-dispersed Count Data. Part III / MMath (Applied Statistics) 1

Parametric Modelling of Over-dispersed Count Data. Part III / MMath (Applied Statistics) 1 Parametric Modelling of Over-dispersed Count Data Part III / MMath (Applied Statistics) 1 Introduction Poisson regression is the de facto approach for handling count data What happens then when Poisson

More information

Three-Way Contingency Tables

Three-Way Contingency Tables Newsom PSY 50/60 Categorical Data Analysis, Fall 06 Three-Way Contingency Tables Three-way contingency tables involve three binary or categorical variables. I will stick mostly to the binary case to keep

More information

Suppose that we are concerned about the effects of smoking. How could we deal with this?

Suppose that we are concerned about the effects of smoking. How could we deal with this? Suppose that we want to study the relationship between coffee drinking and heart attacks in adult males under 55. In particular, we want to know if there is an association between coffee drinking and heart

More information

Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami

Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami Parametric Assumptions The observations must be independent. Dependent variable should be continuous

More information

Sample Size and Power Considerations for Longitudinal Studies

Sample Size and Power Considerations for Longitudinal Studies Sample Size and Power Considerations for Longitudinal Studies Outline Quantities required to determine the sample size in longitudinal studies Review of type I error, type II error, and power For continuous

More information

Statistics in medicine

Statistics in medicine Statistics in medicine Lecture 4: and multivariable regression Fatma Shebl, MD, MS, MPH, PhD Assistant Professor Chronic Disease Epidemiology Department Yale School of Public Health Fatma.shebl@yale.edu

More information

Mantel-Haenszel Test Statistics. for Correlated Binary Data. Department of Statistics, North Carolina State University. Raleigh, NC

Mantel-Haenszel Test Statistics. for Correlated Binary Data. Department of Statistics, North Carolina State University. Raleigh, NC Mantel-Haenszel Test Statistics for Correlated Binary Data by Jie Zhang and Dennis D. Boos Department of Statistics, North Carolina State University Raleigh, NC 27695-8203 tel: (919) 515-1918 fax: (919)

More information

Goodness-of-Fit Tests for the Ordinal Response Models with Misspecified Links

Goodness-of-Fit Tests for the Ordinal Response Models with Misspecified Links Communications of the Korean Statistical Society 2009, Vol 16, No 4, 697 705 Goodness-of-Fit Tests for the Ordinal Response Models with Misspecified Links Kwang Mo Jeong a, Hyun Yung Lee 1, a a Department

More information

Marginal Screening and Post-Selection Inference

Marginal Screening and Post-Selection Inference Marginal Screening and Post-Selection Inference Ian McKeague August 13, 2017 Ian McKeague (Columbia University) Marginal Screening August 13, 2017 1 / 29 Outline 1 Background on Marginal Screening 2 2

More information

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages:

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages: Glossary The ISI glossary of statistical terms provides definitions in a number of different languages: http://isi.cbs.nl/glossary/index.htm Adjusted r 2 Adjusted R squared measures the proportion of the

More information

Describing Stratified Multiple Responses for Sparse Data

Describing Stratified Multiple Responses for Sparse Data Describing Stratified Multiple Responses for Sparse Data Ivy Liu School of Mathematical and Computing Sciences Victoria University Wellington, New Zealand June 28, 2004 SUMMARY Surveys often contain qualitative

More information

Exam details. Final Review Session. Things to Review

Exam details. Final Review Session. Things to Review Exam details Final Review Session Short answer, similar to book problems Formulae and tables will be given You CAN use a calculator Date and Time: Dec. 7, 006, 1-1:30 pm Location: Osborne Centre, Unit

More information

Multivariate Distributions

Multivariate Distributions IEOR E4602: Quantitative Risk Management Spring 2016 c 2016 by Martin Haugh Multivariate Distributions We will study multivariate distributions in these notes, focusing 1 in particular on multivariate

More information

On the conservative multivariate multiple comparison procedure of correlated mean vectors with a control

On the conservative multivariate multiple comparison procedure of correlated mean vectors with a control On the conservative multivariate multiple comparison procedure of correlated mean vectors with a control Takahiro Nishiyama a a Department of Mathematical Information Science, Tokyo University of Science,

More information

11-2 Multinomial Experiment

11-2 Multinomial Experiment Chapter 11 Multinomial Experiments and Contingency Tables 1 Chapter 11 Multinomial Experiments and Contingency Tables 11-11 Overview 11-2 Multinomial Experiments: Goodness-of-fitfit 11-3 Contingency Tables:

More information

Statistical Methods in Clinical Trials Categorical Data

Statistical Methods in Clinical Trials Categorical Data Statistical Methods in Clinical Trials Categorical Data Types of Data quantitative Continuous Blood pressure Time to event Categorical sex qualitative Discrete No of relapses Ordered Categorical Pain level

More information

Optimal rejection regions for testing multiple binary endpoints in small samples

Optimal rejection regions for testing multiple binary endpoints in small samples Optimal rejection regions for testing multiple binary endpoints in small samples Robin Ristl and Martin Posch Section for Medical Statistics, Center of Medical Statistics, Informatics and Intelligent Systems,

More information

Chapter 7 Fall Chapter 7 Hypothesis testing Hypotheses of interest: (A) 1-sample

Chapter 7 Fall Chapter 7 Hypothesis testing Hypotheses of interest: (A) 1-sample Bios 323: Applied Survival Analysis Qingxia (Cindy) Chen Chapter 7 Fall 2012 Chapter 7 Hypothesis testing Hypotheses of interest: (A) 1-sample H 0 : S(t) = S 0 (t), where S 0 ( ) is known survival function,

More information

2 Describing Contingency Tables

2 Describing Contingency Tables 2 Describing Contingency Tables I. Probability structure of a 2-way contingency table I.1 Contingency Tables X, Y : cat. var. Y usually random (except in a case-control study), response; X can be random

More information

Binary choice 3.3 Maximum likelihood estimation

Binary choice 3.3 Maximum likelihood estimation Binary choice 3.3 Maximum likelihood estimation Michel Bierlaire Output of the estimation We explain here the various outputs from the maximum likelihood estimation procedure. Solution of the maximum likelihood

More information

HANDBOOK OF APPLICABLE MATHEMATICS

HANDBOOK OF APPLICABLE MATHEMATICS HANDBOOK OF APPLICABLE MATHEMATICS Chief Editor: Walter Ledermann Volume VI: Statistics PART A Edited by Emlyn Lloyd University of Lancaster A Wiley-Interscience Publication JOHN WILEY & SONS Chichester

More information

Analysis of the AIC Statistic for Optimal Detection of Small Changes in Dynamic Systems

Analysis of the AIC Statistic for Optimal Detection of Small Changes in Dynamic Systems Analysis of the AIC Statistic for Optimal Detection of Small Changes in Dynamic Systems Jeremy S. Conner and Dale E. Seborg Department of Chemical Engineering University of California, Santa Barbara, CA

More information

MS-E2112 Multivariate Statistical Analysis (5cr) Lecture 6: Bivariate Correspondence Analysis - part II

MS-E2112 Multivariate Statistical Analysis (5cr) Lecture 6: Bivariate Correspondence Analysis - part II MS-E2112 Multivariate Statistical Analysis (5cr) Lecture 6: Bivariate Correspondence Analysis - part II the Contents the the the Independence The independence between variables x and y can be tested using.

More information

Log-linear multidimensional Rasch model for capture-recapture

Log-linear multidimensional Rasch model for capture-recapture Log-linear multidimensional Rasch model for capture-recapture Elvira Pelle, University of Milano-Bicocca, e.pelle@campus.unimib.it David J. Hessen, Utrecht University, D.J.Hessen@uu.nl Peter G.M. Van der

More information

Glossary for the Triola Statistics Series

Glossary for the Triola Statistics Series Glossary for the Triola Statistics Series Absolute deviation The measure of variation equal to the sum of the deviations of each value from the mean, divided by the number of values Acceptance sampling

More information

Sample size determination for logistic regression: A simulation study

Sample size determination for logistic regression: A simulation study Sample size determination for logistic regression: A simulation study Stephen Bush School of Mathematical Sciences, University of Technology Sydney, PO Box 123 Broadway NSW 2007, Australia Abstract This

More information

Introduction to Statistical Analysis

Introduction to Statistical Analysis Introduction to Statistical Analysis Changyu Shen Richard A. and Susan F. Smith Center for Outcomes Research in Cardiology Beth Israel Deaconess Medical Center Harvard Medical School Objectives Descriptive

More information

High-dimensional asymptotic expansions for the distributions of canonical correlations

High-dimensional asymptotic expansions for the distributions of canonical correlations Journal of Multivariate Analysis 100 2009) 231 242 Contents lists available at ScienceDirect Journal of Multivariate Analysis journal homepage: www.elsevier.com/locate/jmva High-dimensional asymptotic

More information

Analysis of Categorical Data Three-Way Contingency Table

Analysis of Categorical Data Three-Way Contingency Table Yu Lecture 4 p. 1/17 Analysis of Categorical Data Three-Way Contingency Table Yu Lecture 4 p. 2/17 Outline Three way contingency tables Simpson s paradox Marginal vs. conditional independence Homogeneous

More information

COMPARISON OF FIVE TESTS FOR THE COMMON MEAN OF SEVERAL MULTIVARIATE NORMAL POPULATIONS

COMPARISON OF FIVE TESTS FOR THE COMMON MEAN OF SEVERAL MULTIVARIATE NORMAL POPULATIONS Communications in Statistics - Simulation and Computation 33 (2004) 431-446 COMPARISON OF FIVE TESTS FOR THE COMMON MEAN OF SEVERAL MULTIVARIATE NORMAL POPULATIONS K. Krishnamoorthy and Yong Lu Department

More information

Central Limit Theorem ( 5.3)

Central Limit Theorem ( 5.3) Central Limit Theorem ( 5.3) Let X 1, X 2,... be a sequence of independent random variables, each having n mean µ and variance σ 2. Then the distribution of the partial sum S n = X i i=1 becomes approximately

More information

Test Volume 11, Number 1. June 2002

Test Volume 11, Number 1. June 2002 Sociedad Española de Estadística e Investigación Operativa Test Volume 11, Number 1. June 2002 Optimal confidence sets for testing average bioequivalence Yu-Ling Tseng Department of Applied Math Dong Hwa

More information

STATISTICS ( CODE NO. 08 ) PAPER I PART - I

STATISTICS ( CODE NO. 08 ) PAPER I PART - I STATISTICS ( CODE NO. 08 ) PAPER I PART - I 1. Descriptive Statistics Types of data - Concepts of a Statistical population and sample from a population ; qualitative and quantitative data ; nominal and

More information

Nemours Biomedical Research Biostatistics Core Statistics Course Session 4. Li Xie March 4, 2015

Nemours Biomedical Research Biostatistics Core Statistics Course Session 4. Li Xie March 4, 2015 Nemours Biomedical Research Biostatistics Core Statistics Course Session 4 Li Xie March 4, 2015 Outline Recap: Pairwise analysis with example of twosample unpaired t-test Today: More on t-tests; Introduction

More information

Modeling the scale parameter ϕ A note on modeling correlation of binary responses Using marginal odds ratios to model association for binary responses

Modeling the scale parameter ϕ A note on modeling correlation of binary responses Using marginal odds ratios to model association for binary responses Outline Marginal model Examples of marginal model GEE1 Augmented GEE GEE1.5 GEE2 Modeling the scale parameter ϕ A note on modeling correlation of binary responses Using marginal odds ratios to model association

More information

Analysis of data in square contingency tables

Analysis of data in square contingency tables Analysis of data in square contingency tables Iva Pecáková Let s suppose two dependent samples: the response of the nth subject in the second sample relates to the response of the nth subject in the first

More information

Chapte The McGraw-Hill Companies, Inc. All rights reserved.

Chapte The McGraw-Hill Companies, Inc. All rights reserved. er15 Chapte Chi-Square Tests d Chi-Square Tests for -Fit Uniform Goodness- Poisson Goodness- Goodness- ECDF Tests (Optional) Contingency Tables A contingency table is a cross-tabulation of n paired observations

More information

Person-Time Data. Incidence. Cumulative Incidence: Example. Cumulative Incidence. Person-Time Data. Person-Time Data

Person-Time Data. Incidence. Cumulative Incidence: Example. Cumulative Incidence. Person-Time Data. Person-Time Data Person-Time Data CF Jeff Lin, MD., PhD. Incidence 1. Cumulative incidence (incidence proportion) 2. Incidence density (incidence rate) December 14, 2005 c Jeff Lin, MD., PhD. c Jeff Lin, MD., PhD. Person-Time

More information

Introduction to Statistical Data Analysis Lecture 7: The Chi-Square Distribution

Introduction to Statistical Data Analysis Lecture 7: The Chi-Square Distribution Introduction to Statistical Data Analysis Lecture 7: The Chi-Square Distribution James V. Lambers Department of Mathematics The University of Southern Mississippi James V. Lambers Statistical Data Analysis

More information

Testing equality of two mean vectors with unequal sample sizes for populations with correlation

Testing equality of two mean vectors with unequal sample sizes for populations with correlation Testing equality of two mean vectors with unequal sample sizes for populations with correlation Aya Shinozaki Naoya Okamoto 2 and Takashi Seo Department of Mathematical Information Science Tokyo University

More information

TESTING FOR NORMALITY IN THE LINEAR REGRESSION MODEL: AN EMPIRICAL LIKELIHOOD RATIO TEST

TESTING FOR NORMALITY IN THE LINEAR REGRESSION MODEL: AN EMPIRICAL LIKELIHOOD RATIO TEST Econometrics Working Paper EWP0402 ISSN 1485-6441 Department of Economics TESTING FOR NORMALITY IN THE LINEAR REGRESSION MODEL: AN EMPIRICAL LIKELIHOOD RATIO TEST Lauren Bin Dong & David E. A. Giles Department

More information

Session 3 The proportional odds model and the Mann-Whitney test

Session 3 The proportional odds model and the Mann-Whitney test Session 3 The proportional odds model and the Mann-Whitney test 3.1 A unified approach to inference 3.2 Analysis via dichotomisation 3.3 Proportional odds 3.4 Relationship with the Mann-Whitney test Session

More information

Formulas and Tables by Mario F. Triola

Formulas and Tables by Mario F. Triola Copyright 010 Pearson Education, Inc. Ch. 3: Descriptive Statistics x f # x x f Mean 1x - x s - 1 n 1 x - 1 x s 1n - 1 s B variance s Ch. 4: Probability Mean (frequency table) Standard deviation P1A or

More information

Minimal basis for connected Markov chain over 3 3 K contingency tables with fixed two-dimensional marginals. Satoshi AOKI and Akimichi TAKEMURA

Minimal basis for connected Markov chain over 3 3 K contingency tables with fixed two-dimensional marginals. Satoshi AOKI and Akimichi TAKEMURA Minimal basis for connected Markov chain over 3 3 K contingency tables with fixed two-dimensional marginals Satoshi AOKI and Akimichi TAKEMURA Graduate School of Information Science and Technology University

More information

Modeling and Measuring Association for Ordinal Data

Modeling and Measuring Association for Ordinal Data Modeling and Measuring Association for Ordinal Data A Thesis Submitted to the Faculty of Graduate Studies and Research In Partial Fulfillment of the Requirements for the Degree of Master of Science in

More information

Statistical. Psychology

Statistical. Psychology SEVENTH у *i km m it* & П SB Й EDITION Statistical M e t h o d s for Psychology D a v i d C. Howell University of Vermont ; \ WADSWORTH f% CENGAGE Learning* Australia Biaall apan Korea Меяко Singapore

More information

Kneib, Fahrmeir: Supplement to "Structured additive regression for categorical space-time data: A mixed model approach"

Kneib, Fahrmeir: Supplement to Structured additive regression for categorical space-time data: A mixed model approach Kneib, Fahrmeir: Supplement to "Structured additive regression for categorical space-time data: A mixed model approach" Sonderforschungsbereich 386, Paper 43 (25) Online unter: http://epub.ub.uni-muenchen.de/

More information

ON SMALL SAMPLE PROPERTIES OF PERMUTATION TESTS: INDEPENDENCE BETWEEN TWO SAMPLES

ON SMALL SAMPLE PROPERTIES OF PERMUTATION TESTS: INDEPENDENCE BETWEEN TWO SAMPLES ON SMALL SAMPLE PROPERTIES OF PERMUTATION TESTS: INDEPENDENCE BETWEEN TWO SAMPLES Hisashi Tanizaki Graduate School of Economics, Kobe University, Kobe 657-8501, Japan e-mail: tanizaki@kobe-u.ac.jp Abstract:

More information

Correspondence Analysis

Correspondence Analysis Correspondence Analysis Q: when independence of a 2-way contingency table is rejected, how to know where the dependence is coming from? The interaction terms in a GLM contain dependence information; however,

More information

Testing Statistical Hypotheses

Testing Statistical Hypotheses E.L. Lehmann Joseph P. Romano Testing Statistical Hypotheses Third Edition 4y Springer Preface vii I Small-Sample Theory 1 1 The General Decision Problem 3 1.1 Statistical Inference and Statistical Decisions

More information

On the Asymptotic Power of Tests for Independence in Contingency Tables from Stratified Samples

On the Asymptotic Power of Tests for Independence in Contingency Tables from Stratified Samples On the Asymptotic Power of Tests for Independence in Contingency Tables from Stratified Samples Gad Nathan Journal of the American Statistical Association, Vol. 67, No. 340. (Dec., 1972), pp. 917-920.

More information

LISA Short Course Series Generalized Linear Models (GLMs) & Categorical Data Analysis (CDA) in R. Liang (Sally) Shan Nov. 4, 2014

LISA Short Course Series Generalized Linear Models (GLMs) & Categorical Data Analysis (CDA) in R. Liang (Sally) Shan Nov. 4, 2014 LISA Short Course Series Generalized Linear Models (GLMs) & Categorical Data Analysis (CDA) in R Liang (Sally) Shan Nov. 4, 2014 L Laboratory for Interdisciplinary Statistical Analysis LISA helps VT researchers

More information

Testing Homogeneity Of A Large Data Set By Bootstrapping

Testing Homogeneity Of A Large Data Set By Bootstrapping Testing Homogeneity Of A Large Data Set By Bootstrapping 1 Morimune, K and 2 Hoshino, Y 1 Graduate School of Economics, Kyoto University Yoshida Honcho Sakyo Kyoto 606-8501, Japan. E-Mail: morimune@econ.kyoto-u.ac.jp

More information

Marginal, crude and conditional odds ratios

Marginal, crude and conditional odds ratios Marginal, crude and conditional odds ratios Denitions and estimation Travis Loux Gradute student, UC Davis Department of Statistics March 31, 2010 Parameter Denitions When measuring the eect of a binary

More information

Describing Contingency tables

Describing Contingency tables Today s topics: Describing Contingency tables 1. Probability structure for contingency tables (distributions, sensitivity/specificity, sampling schemes). 2. Comparing two proportions (relative risk, odds

More information

Journal of Statistical Research 2007, Vol. 41, No. 1, pp Bangladesh

Journal of Statistical Research 2007, Vol. 41, No. 1, pp Bangladesh Journal of Statistical Research 007, Vol. 4, No., pp. 5 Bangladesh ISSN 056-4 X ESTIMATION OF AUTOREGRESSIVE COEFFICIENT IN AN ARMA(, ) MODEL WITH VAGUE INFORMATION ON THE MA COMPONENT M. Ould Haye School

More information

Contents Kruskal-Wallis Test Friedman s Two-way Analysis of Variance by Ranks... 47

Contents Kruskal-Wallis Test Friedman s Two-way Analysis of Variance by Ranks... 47 Contents 1 Non-parametric Tests 3 1.1 Introduction....................................... 3 1.2 Advantages of Non-parametric Tests......................... 4 1.3 Disadvantages of Non-parametric Tests........................

More information