Minimum Phi-Divergence Estimators and Phi-Divergence Test Statistics in Contingency Tables with Symmetry Structure: An Overview

Size: px
Start display at page:

Download "Minimum Phi-Divergence Estimators and Phi-Divergence Test Statistics in Contingency Tables with Symmetry Structure: An Overview"

Transcription

1 Symmetry 010,, ; doi: /sym01108 OPEN ACCESS symmetry ISSN Review Minimum Phi-Divergence Estimators and Phi-Divergence Test Statistics in Contingency Tables with Symmetry Structure: An Overview Leandro Pardo 1, and Nirian Martín 1 Department of Statistics and O.R., Complutense University of Madrid, 8040 Madrid, Spain Department of Statistics, Carlos III University of Madrid, 8903 Getafe Madrid, Spain Author to whom correspondence should be addressed; lpardo@mat.ucm.es. Received: 1 March 010; in revised form: 7 May 010 / Accepted: 10 June 010 / Published: 11 June 010 Abstract: In the last years minimum phi-divergence estimators MϕE and phi-divergence test statistics ϕts have been introduced as a very good alternative to classical likelihood ratio test and maximum likelihood estimator for different statistical problems. The main purpose of this paper is to present an overview of the main results presented until now in contingency tables with symmetry structure on the basis of MϕE and ϕts. Keywords: minimum phi-divergence estimator; phi-divergence statistics; symmetry model Classification: MSC 6B10, 6H15 1. Introduction An interesting problem in a two-way contingency table is to investigate whether there are symmetric patterns in the data: Cell probabilities on one side of the main diagonal are a mirror image of those on the other side. This problem was first discussed by Bowker [1] who gave the maximum likelihood estimator as well as a large sample chi-square type test for the null hypothesis of symmetry. The minimum discrimination information estimator was proposed in [] and the minimum chi-squared estimator in [3]. In [4 7] new families of test statistics, based on ϕ-divergence measures, were introduced. These families contain as a particular case the test statistic given by [1] as well as the likelihood ratio test.

2 Symmetry 010, 1109 Let X and Y denote two ordinal response variables, X and Y having I levels. When we classify subjects on both variables, there are I possible combinations of classifications. The responses X, Y of a subject randomly chosen from some population have a probability distribution. Let p ij = PrX = i, Y = j, with p ij > 0, i, j = 1,..., I. We display this distribution in a rectangular table having I rows for the categories of X and I columns for the categories of Y. Consider a random sample of size n on X, Y and we denote by n ij the observed frequency in the i, jth cell for i, j I I with I I n ij = n. The classical problem of testing for symmetry is given by H 0 : p ij = p ji, i, j I I 1 versus H 1 : p ij p ji, for at least one i, j pair This problem was considered for the first time by Bowker [1] using the Pearson test statistic X = i<j n ij n ji n ij + n ji 3 for which established that X χ k for large n, where k = 1 II 1. In some real problems i.e., medicine, psychology, sociology, etc. the categorical response variables X, Y represent the measure after or before a treatment. In such situations our interest is to determine the treatment effect, i.e., if X Y we assume that X represents the measure after the treatment and Y before the treatment. In the following we understand that X is preferred or indifferent to Y, according to joint likelihood ratio ordering, if and only if iff p ij p ji i j. In this situation the alternative hypothesis is H 1 : p ij p ji, for all i j 4 This problem was first considered by El Barmi and Kochar [8], who presented the likelihood ratio test for the problem of testing H 0 : p ij = p ji against H 1 : p ij p ji, i j and considered the application of it to a real life problem: They tested if the vision of both the eyes, for 7477 women, is the same against the alternative that the right eye has better vision than the left eye. In [5] these results were extended using ϕ-divergence measures. In this paper we present an overview on contingency tables with symmetry structure on the basis of divergence measures. We pay especial attention to the family of ϕ-divergence test statistics for testing H 0 versus H 1, H 0 against H 1 and also for testing H 1 against the alternative H of no restrictions over p ij s, i.e., H 1 : p ij p ji, i j, against H : p ij p ji, i j 5 It is interesting to observe that not only we consider ϕ-divergence test statistics but also we consider minimum ϕ-divergence estimators in order to estimate of the parameters of the model.

3 Symmetry 010, Phi-divergences Measures We consider the set Θ = {θ : θ = p ij ; 1 i, j I, i, j I, I with p ij > 0 and I I,i,j I,I p ij < 1} 6 and we denote pθ = p = p 11,..., p II T, p II = 1 I I,i,j I,I p ij or equivalently by the I I matrix pθ = [p ij ]. The ϕ-divergence between two probability distributions p = [p ij ], q = [q ij ] was introduced independently by [9] and [10]. It is defined as follows: D ϕ p, q = q ij ϕ pij q ij, ϕ Φ 7 where Φ is the class of all convex functions ϕ : [0, R { }, such that ϕ 1 = 0, ϕ 1 > 0; and we define 0ϕ 0/0 = 0 and 0ϕ p/0 = lim u ϕ u /u. For every ϕ Φ that is differentiable at x = 1, the function ψ given by ψ x = ϕ x ϕ 1 x 1 also belongs to Φ. Then we have D ψ p, q = D ϕ p, q, and ψ has the additional property that ψ 1 = 0. Because the two divergence measures are equivalent, we can consider the set Φ to be equivalent to the set Φ Φ {ϕ : ϕ 1 = 0} An important family of ϕ-divergences in statistical problems, is the power divergence family ϕ λ x = x x + λ 1 x ; λ 0, λ 1 8 λ λ + 1 ϕ 0 x = lim λ 0 ϕ λ x = x ln x x + 1 ϕ 1 x = lim λ 1 ϕ λ x = ln x + x 1 which was introduced and studied by [11]. Notice that ϕ λ Φ. In the following we shall denote the power-divergence measures by D ϕλ p, q, λ R. For more details about ϕ-divergence measures see [1]. 3. Hypothesis Testing: H 0 versus H 1 We define B = {a 11,..., a 1I, a,..., a I,..., a I 1I 1, a I 1I T R II : i j a ij < 1, i, j = 1,.., I} the hypothesis 1 can be written as H 0 : θ = gβ, β= p 11,..., p 1I, p,..., p I,..., p I 1I 1, p I 1I T B 9

4 Symmetry 010, 1111 where the function β is defined by β = g ij ; i, j = 1,..., I, i, j I, I with { p ij, i j g ij β =, i, j = 1,..., I 1 p ji, i > j and g Ij β = p ji, j = 1,..., I 1 g ii β = p ii, i = 1,..., I 1 Note that pgβ = g ij β; i, j = 1,..., I T, where g II β = 1 I i,,i,j I,I g ijβ The maximum likelihood estimator MLE of β can be defined as β = arg min β B D KL p, pgβ a.s. where D KL p, pgβ is the Kullback Leibler divergence measure see [13,14] defined by D KL p, pgβ = p ij log p ij g ij β We denote by θ = g θ and by p θ = p 11 θ,..., p II θ T. It is well known that p ij θ = p ij+ p ji, i = 1,..., I, j = 1,..., I. Using the ideas developed in [15], we can consider the minimum ϕ -divergence estimator Mϕ E replacing the Kullback Leibler divergence by a ϕ -divergence measure in the following way θ ϕ = arg min D ϕ p, pgβ; ϕ Φ 10 β B where D ϕ p, pgβ = g ij βϕ pij g ij β We denote θ S,ϕ = g β ϕ and we have see [7,16] ng βϕ β L N 0, I S F β 1 n where I S F θ = Σ θ Σ θ Bθ T BθΣ T θ Bθ 1 BθΣ θ being Σ θ = diagθ θθ T hij θ and Bθ = θ ij. The functions h II 1 ij are given by I 1 h ij θ = p ij p ji, i < j, i = 1,..., I 1, j = 1,..., I It is not difficult to establish that the matrix I S F θ can be written as I S F θ = M T βi F β 1 M β

5 Symmetry 010, 111 where I F β is the Fisher information matrix corresponding to β B. If we consider the family of power divergences we get the minimum power-divergence estimator, θ S,λ of θ, under the hypothesis of symmetry, whose expression is given by For λ = 0 we get θ S,λ ij = p ij + p ji θ S,0 ij 1 p ij + p ji 1, i, j = 1,..., I 11 = p ij + p ji, i, j = 1,..., I hence, we obtain the maximum likelihood estimator for symmetry introduced by [1]. For λ = 1, we obtain as a limit case θ S, 1 p ij p ji 1 ij =, i, j = 1,..., I p ij p ji 1 i.e., the minimum discrimination estimator for symmetry introduced and studied in []. For λ = 1 we get the minimum chi-squared estimator for symmetry introduced in [3], p ij + p 1/ ji θ S,1 ij = p ij + p ji 1/ We denote θ ϕ = g β ϕ and by p θ ϕ = p 11 θ ϕ,..., p II θ ϕ T 1 the Mϕ E of the probability vector that characterizes the symmetry model. Based on p θ ϕ it is possible to define a new family of statistics for testing 1 that contains as a particular case Pearson test statistic as well as likelihood ratio test. This family of statistics is given by T ϕ 1 n θ ϕ n ϕ 1 1 D ϕ 1 p, p θ ϕ = n ϕ 1 1 p ij θ ϕ ϕ 1 p ij p ij θ ϕ We can observe that the family 13 involves two functions ϕ 1 and ϕ, both belonging to Φ. We use the function ϕ to obtain the Mϕ E and ϕ 1 to obtain the family of statistics. If we consider ϕ 1 x = 1 x 1 and ϕ x = x log x x + 1 we get Pearson test statistic whose expression was given in 3 and for ϕ 1 x = ϕ x = x log x x + 1 we get the likelihood ratio test given by G = n ij log In the following theorem the asymptotic distribution of T ϕ 1 n θ ϕ is obtained. i<j 13 n ij n ji + n ij 14

6 Symmetry 010, 1113 Theorem 1 The asymptotic distribution of T ϕ 1 n θ ϕ is chi-squared with m = II 1/ degrees of freedom. Proof. See Chapter 8 in [1]. Thus, for a given significance level α 0, 1, the critical value of T ϕ 1 n θ ϕ may be approximated by χ m,α, the upper 100α% of the chi-square distribution with m degrees of freedom, i.e., reject the hypothesis of symmetry iff T ϕ 1 n θ ϕ χ m,α 15 Now we are going to analyze the power of the test. Let q = q 11,..., q II T be a point at the alternative hypothesis, i.e., there exist at least two indexes i and j for which q ij q ji. We denote by θ ϕ a the point on Θ verifying where Θ 0 is given by and with It is clear that θ ϕ a = arg mind ϕ q, pθ θ Θ 0 Θ 0 = {θ Θ : θ = gβ for some β B} θ ϕ a = f ij q; i, j = 1,..., I, i, j I, I T p θ ϕ a = f ij q; i, j = 1,..., I, T fq f II q = 1 i,j I,I f ij q The notation f ij q indicates that the elements of the vector θ ϕ a power-divergence family ϕ λ x we have depend on q. For instance, for the We also denote and then f ij q = q ij + q 1 ji, i, j = 1,..., I q ij + q 1 ji θ S,ϕ = p S,ϕ ij ; i, j = 1,..., I, i, j I, I T p θ S,ϕ = p S,ϕ ij ; i, j = 1,..., I T f p where f = f ij ; i, j = 1,..., I T. If the alternative q is true we have that p tends to q and p θ S,ϕ to pθ ϕ a in probability. If we define the function Ψ ϕ1 q = D ϕ1 q,fq we have Ψ ϕ1 p = Ψ ϕ1 q + J D ϕ1 q, fq p ij q ij + o p q q ij

7 Symmetry 010, 1114 Then the random variables n Dϕ1 p, f p D ϕ1 q, fq and n J have the same asymptotic distribution. If we define D ϕ1 q,gq q ij p ij q ij l ij = D ϕ 1 q, fq q ij 16 and l = l ij ; i, j = 1,..., I T, we have ndϕ1 p,f p L D ϕ1 q, fq N n 0,lT Σ q l 17 where Σ q = diagq qq T. If we consider the maximum likelihood estimator instead of minimum ϕ-divergence estimator, we get l T Σ q l = q ij m ϕ 1 ij q ij m ϕ 1 ij where m ϕ 1 ij = 1 ϕ q ij + q ji ϕ 1 qij qji q ij + q ji + q ij ϕ qij 1 ϕ qji 1 q ij + q ji q ij + q ji q ij + q ji It is also interesting to observe, if we consider the power divergence measure, that m λ 1 pij pji ij = + λλ + 1 p ij + p ji p ij + p ji + 1 λ λ p ji pij pji λ p ij + p ji p ij + p ji p ij + p ji For λ 0 and λ = 1 we get m 0 ij = log p ij and m 1 ij = p ij 3p ji + p ij p ji p ij + p ji p ij + p ji respectively. Therefore, the corresponding asymptotic variances are given by σ0 p ij = p ij log p ij p ij + p ji p ij log p ij + p ji and σ 1 = p ij 3p ji + p ij p ji p ij p ij + p ji Based on the previous result we can formulate the following theorem. p ij p ij 3p ji + p ij p ji p ij + p ji

8 Symmetry 010, 1115 Theorem The asymptotic power for the test given in 15, at the alternative q, is given by 1 ϕ 1 1 β n,ϕ1,ϕ q = 1 Φ n l T Σ q l n χ m,α nd ϕ1 q, fθ ϕ a where Φ n x is a sequence of distributions functions tending uniformly to the standard normal distribution function Φ x. We consider a contiguous sequence of alternative hypotheses that approaches the null hypothesis H 0 : θ = pgβ, for some unknown β B, at the rate O n 1/. Consider the multinomial probability vector p n,ij = p ij gβ + d ij n 1/, i = 1,..., I, j = 1,..., I where d = d 11,..., d II T is a fixed I 1 vector such that I I d ij = 0, recall that n is the total count parameter of the multinomial distribution and β B. As n, the sequence of multinomial probabilities {p n } n N with p n = p n,ij, i = 1,..., I, j = 1,..., I T, converges to a multinomial probability in H 0 at the rate of O n 1/. Let H 1,n : p n = pgβ + dn 1/, β B 18 In the next theorem we present the asymptotic distribution of the family of test statistics T ϕ 1 n θ ϕ defined in 13, under the contiguous alternative hypotheses given in 18. Theorem 3 Under H 1,n, given in 18, the family of test statistics T ϕ 1 n θ ϕ is asymptotically noncentrality chi-squared distributed with I I 1 / degrees of freedom and noncentrality parameter δ = 1 d ij p ij i<j d ij d ji p ij Proof. See Chapter 8 in [1]. An interesting a simulation study can be seen in [7]. In that study some interesting alternative test statistics appear to the classical Pearson test statistics and likelihood ratio test. 4. Hypothesis Testing: H 0 versus H 1 and H 1 versus H In this section we consider the three hypotheses, H 0, H 1, H given in 1, 4, 5 respectively and some test statistics based on ϕ-divergence D ϕ p, p 0 and D ϕ p, p 1 19 for testing H 0 against H 1 and H 1 against H. In the expression 19, p is the maximum likelihood estimator MLE of p given by p = [ p ij ], where p ij = n ij /n; and p 0 and p 1 denote the MLEs of p under H 0 and H 1 respectively. These MLEs were obtained by [8]. Let θ ij = p ij p ij + p ji, for i > j

9 Symmetry 010, 1116 then H 0 : θ ij = 1/ for i > j and H 1 : θ ij 1/ for i > j, and θ ij = n ij n ij + n ji, θ 0 ij = 1/ and θ 1 ij = max nij, 1 n ij + n ji for i > j It follows that p 0 and p 1 are given by Then we have and p 0 ij D ϕ p, p 1 = = n ij + n ji n D ϕ p, p 0 = i>j and p 1 ij i>j n ij + n ji n = n ij + n ji n n ij + n ji n nij max, 1 n ij + n ji ϕ θ ij + ϕ1 θ ij θij θ 1 ij ϕ θ θ 1 ij ϕ 1 θ ij 1 ij 1 θ ij To solve the problem of testing H 1 against H, [8] consider the likelihood ratio test statistic T 1 = i>j n ij ln θ ij + n ji ln1 θ ij n ij ln θ 1 ij n 1 ji ln1 θ ij This statistic is such that T 1 = nd KL p, p 1 0 where D KL p, p 1 is the Kullback Leibler divergence given by 0 with ϕx = ϕ 0 x defined above. Then the likelihood ratio test statistic is based on the closeness, in terms of the Kullback Leibler divergence measure, between the probability distributions p and p 1. Thus, one could measure the closeness between the two probability distributions using a more general divergence measure if we are able to obtain its asymptotic distribution. One appropriate family of divergence measures for that purpose is the ϕ-divergence measure. As a generalization of the test statistic given in 0 for testing H 1 against H we introduce the family of test statistics T ϕ 1 = n ϕ 1 D ϕ p, p 1 1 To test H 0 against H 1, El Barmi and Kochar [8] consider the likelihood ratio test statistic T 01 = i>j 1 1 n ij ln θ ij + n ji ln1 θ ij n ij ln 1 n ji ln 1 It is clear that T 01 = n D KL p, p 0 D KL p, p 1 As a generalization of this test statistic we consider in this paper the family of test statistics T ϕ 01 = n D ϕ ϕ p, p 0 D ϕ p, p 1 1

10 Symmetry 010, 1117 If ϕ = ϕ 0 then T ϕ 1 = T 1 and T ϕ 01 = T 01, and hence the families of test statistics T ϕ 1 and T ϕ 01 can be considered as generalizations of the test statistics T 1 and T 01 respectively. In order to get the asymptotic distribution of the test statistics given in 1 and, we first define the so-called chi-bar squared distribution with n degrees of freedom, denoted by χ n. Definition 4 Let U = max0, Z where Z N 0, 1, so that the c.d.f. of U is given by { Φu, u > 0 F U u = 0, u < 0 where Φ denotes the standard normal cumulative distribution function. Let V = n k=1 U k, where U 1,..., U n are independent and distributed like U, then V χ n. It is readily shown that EV = 1 n and VarV = 5 4 n This distribution is related to χ distribution. It can be readily shown that PrV > v = n l=0 n 1 l n Prχ l > v by conditioning on L, the number of non-zero U i s. Furthermore, like the χ n distribution, the χ n distribution is stochastically increasing with n. If V χ n and V χ n, where n < n, then V is stochastically smaller than V : PrV > t PrV > t. This follows since V V + W where W χ n n with V and W independent. For more details about the chi-bar squared distribution see [17]. The following theorem present the asymptotic distribution of T01. ϕ Theorem 5 Under H 0, T ϕ 01 L χ K as n where K = II 1/. Proof. See [7]. If we consider the family of power divergences given in 8, we have the power divergence family of test statistics defined as T λ 01 = T ϕ λ 01 which can be used for testing H 0 against H 1. Therefore some important statistics can now be expressed as members of the power divergence family of test statistics T01, λ that is, T01 1 is the Pearson test statistic X01, T 1/ 01 is the Freeman Tukey test statistic F01, T01 is the Neyman-modified test statistic NM01, T01 1 is the modified loglikelihood ratio test statistic NG 01, T01 0 is the loglikelihood ratio test statistic G 01 introduced by [8] and T /3 01 is Cressie Read test statistic see [11]. Theorem 6 Under H 1, T ϕ L 1 χ M as n, where M is the number of elements in the set {i, j : i > j, p ij = p ji } K = 1 II 1 and lim n PrT ϕ 1 t K l=0 K 1 l K Prχ l t

11 Symmetry 010, 1118 If we consider the family of power divergences given in 8, we have the power divergence family of test statistics defined as T λ 1 = T ϕ λ 1 which we can use for testing H 1 against H. Remark 7 In the same way as previously we can obtain the test statistics T1 0 = G 1, T1 1 = NG 1, T1 1 = X1, T 1/ 1 = F1, T1 = NM1 and T /3 1. We will refer here to the example of [18, Section 9.5], where the test proposed by Bowker [1] is applied. The proposed tests in this paper may be used in the situation such that it is hoped that a new formulation of a drug will reduce some side-effects. Example We consider 158 patients who have been treated with the old formulation and records are available of any side-effects. We might now treat each patient with the new formulation and note incidence of side-effects. Table 1 shows a possible outcome for such an experiment. Do the data in Table 1 provide any evidence regarding a less severity of side-effects with the new formulation of the drug? BA The two test statistics given in 1 and are appropriate for this problem. For Table 1. Side-effect levels for old and new formulation. Side-effect levels-new formulation None Slight Severe Total Side-effect None levels-old Slight formulation Severe Total the test statistic T ϕ 01 given in the null hypothesis is that for all off-diagonal counts in the table the associated probabilities are such that all p ij = p ji. The alternative is that p ij p ji for all i j. We have computed the members of the family {T λ 01} given in Remark 7 and the corresponding asymptotic p-values P λ 01 = Pr χ 3 > T λ 01 which are given in the following table: Table. Asymptotic p-values for T λ 01. λ 1 1/ 0 /3 1 T01 λ P01 λ On the other hand, if we consider the usual Pearson test statistic X, we have that the value of this statistic is In this case using the chi-squared distribution with 3 degrees of freedom, the corresponding asymptotic distribution found by Bowker [1], Prχ 3 > X = Then for all

12 Symmetry 010, 1119 the considered statistics there is evidence of a differing incidence rate for side-effects under the two formulations, moreover this difference is towards less severe side effects under the new formulation. Therefore, the two considered tests lead to the same conclusion: There is a strong evidence of a bigger incidence rate for side-effects under the old formulation. The conclusion obtained in [18] is in accordance with our conclusion. Acknowledgements This work was supported by Grants MTM and BSCH-UCM References 1. Bowker, A. A test for symmetry in contingency tables. J. Am. Statist. Assoc. 1948, 43, Ireland, C.T.; Ku, H.H.; Koch, G.G. Symmetry and marginal homogeneity of an r r contingency table. J. Am. Statist. Assoc. 1969, 64, Quade, D.; Salama, I.A. A note on minimum chi-square statistics in contingency tables. Biometrics 1975, 31, Menéndez, M.L.; Pardo, J.A.; Pardo, L. Tests based on ϕ-divergences for bivariate symmetry. Metrika 001, 53, Menéndez, M.L.; Pardo, J.A.; Pardo, L. Tests for bivariate symmetry against ordered alternatives in square contingency tables. Aust. N. Z. J. Stat. 003, 45, Menéndez, M.L.; Pardo, J.A.; Pardo, L. Tests of symmetry in three-dimensional contingency tables based on phi-divergence statistics. J. Appl. Stat. 004, 31, Menéndez, M.L.; Pardo, J.A.; Pardo, L.; Zografos, K. On tests of symmetry, marginal homogeneity and quasi-symmetry in two contingency tables based on minimum ϕ-divergence estimator with constraints. J. Stat. Comput. Sim. 005, 75, El Barmi, H.; Kochar, S.C. Likelihood ratio tests for symmetry against ordered alternatives in a square contingency table. Stat. Probab. Lett. 1995,, Csiszàr, I. Eine Informationstheorestiche Ungleichung und ihre Anwendung auf den Beweis der Ergodizität von Markoffschen Ketten. The Mathematical Institute of Hungarian Academy of Sciences: Budapest, Hungary, 1963; Volume 8, pp Ali, S.M.; Silvey, S.D. A general class of coefficients of divergence of one distribution from another. J. Roy. Stat. Soc. Ser. B Stat. Met. 1966, 8, Cressie, N.; Read, T.R.C. Multinomial goodness-of-fit tests. J. Roy. Stat. Soc. Ser. B Stat. Met. 1984, 46, Pardo, L. Statistical Inference Based on Divergence Measures; Chapman & Hall/CRC: New York, NY, USA, Kullback, S. Information Theory and Statistics; John Wiley: New York, NY, USA, Kullback, S. Marginal homogeneity of multidimensional contingency tables. Ann. Math. Stat. 1971, 4, Morales, D.; Pardo, L.; Vajda, I. Asymptotic divergences of estimates of discrete distributions. J. Stat. Plan. Infer. 1995, 48,

13 Symmetry 010, Pardo, J.A.; Pardo, L.; Zografos, K. Minimum ϕ-divergence estimators with constraints in multinomial populations. J. Stat. Plan. Infer. 00, 104, Robertson, T.; Wrigt, F.T.; Dykstra, R.L. Order Restricted Statistical Inference; Wiley: New York, NY, USA, Sprent, P. Applied Nonparametric Statistical Methods; Chapman & Hall: London, UK, c 010 by the authors; licensee MDPI, Basel, Switzerland. This article is an Open Access article distributed under the terms and conditions of the Creative Commons Attribution license

MARGINAL HOMOGENEITY MODEL FOR ORDERED CATEGORIES WITH OPEN ENDS IN SQUARE CONTINGENCY TABLES

MARGINAL HOMOGENEITY MODEL FOR ORDERED CATEGORIES WITH OPEN ENDS IN SQUARE CONTINGENCY TABLES REVSTAT Statistical Journal Volume 13, Number 3, November 2015, 233 243 MARGINAL HOMOGENEITY MODEL FOR ORDERED CATEGORIES WITH OPEN ENDS IN SQUARE CONTINGENCY TABLES Authors: Serpil Aktas Department of

More information

Comparison of Estimators in GLM with Binary Data

Comparison of Estimators in GLM with Binary Data Journal of Modern Applied Statistical Methods Volume 13 Issue 2 Article 10 11-2014 Comparison of Estimators in GLM with Binary Data D. M. Sakate Shivaji University, Kolhapur, India, dms.stats@gmail.com

More information

φ-divergence in Contingency Table Analysis Institute of Statistics, RWTH Aachen University, Aachen, Germany;

φ-divergence in Contingency Table Analysis Institute of Statistics, RWTH Aachen University, Aachen, Germany; entropy Review φ-divergence in Contingency Table Analysis Maria Kateri Institute of Statistics, RWTH Aachen University, 52062 Aachen, Germany; maria.kateri@rwth-aachen.de Received: 6 April 2018; Accepted:

More information

Goodness of Fit Test and Test of Independence by Entropy

Goodness of Fit Test and Test of Independence by Entropy Journal of Mathematical Extension Vol. 3, No. 2 (2009), 43-59 Goodness of Fit Test and Test of Independence by Entropy M. Sharifdoost Islamic Azad University Science & Research Branch, Tehran N. Nematollahi

More information

Measure for No Three-Factor Interaction Model in Three-Way Contingency Tables

Measure for No Three-Factor Interaction Model in Three-Way Contingency Tables American Journal of Biostatistics (): 7-, 00 ISSN 948-9889 00 Science Publications Measure for No Three-Factor Interaction Model in Three-Way Contingency Tables Kouji Yamamoto, Kyoji Hori and Sadao Tomizawa

More information

Decomposition of Parsimonious Independence Model Using Pearson, Kendall and Spearman s Correlations for Two-Way Contingency Tables

Decomposition of Parsimonious Independence Model Using Pearson, Kendall and Spearman s Correlations for Two-Way Contingency Tables International Journal of Statistics and Probability; Vol. 7 No. 3; May 208 ISSN 927-7032 E-ISSN 927-7040 Published by Canadian Center of Science and Education Decomposition of Parsimonious Independence

More information

New families of estimators and test statistics in log-linear models

New families of estimators and test statistics in log-linear models Journal of Multivariate Analysis 99 008 1590 1609 www.elsevier.com/locate/jmva ew families of estimators and test statistics in log-linear models irian Martín a,, Leandro Pardo b a Department of Statistics

More information

Generalized Linear Model under the Extended Negative Multinomial Model and Cancer Incidence

Generalized Linear Model under the Extended Negative Multinomial Model and Cancer Incidence Generalized Linear Model under the Extended Negative Multinomial Model and Cancer Incidence Sunil Kumar Dhar Center for Applied Mathematics and Statistics, Department of Mathematical Sciences, New Jersey

More information

Summary of Chapters 7-9

Summary of Chapters 7-9 Summary of Chapters 7-9 Chapter 7. Interval Estimation 7.2. Confidence Intervals for Difference of Two Means Let X 1,, X n and Y 1, Y 2,, Y m be two independent random samples of sizes n and m from two

More information

Statistics 3858 : Contingency Tables

Statistics 3858 : Contingency Tables Statistics 3858 : Contingency Tables 1 Introduction Before proceeding with this topic the student should review generalized likelihood ratios ΛX) for multinomial distributions, its relation to Pearson

More information

Spring 2012 Math 541B Exam 1

Spring 2012 Math 541B Exam 1 Spring 2012 Math 541B Exam 1 1. A sample of size n is drawn without replacement from an urn containing N balls, m of which are red and N m are black; the balls are otherwise indistinguishable. Let X denote

More information

On The Asymptotics of Minimum Disparity Estimation

On The Asymptotics of Minimum Disparity Estimation Noname manuscript No. (will be inserted by the editor) On The Asymptotics of Minimum Disparity Estimation Arun Kumar Kuchibhotla Ayanendranath Basu Received: date / Accepted: date Abstract Inference procedures

More information

Testing Independence

Testing Independence Testing Independence Dipankar Bandyopadhyay Department of Biostatistics, Virginia Commonwealth University BIOS 625: Categorical Data & GLM 1/50 Testing Independence Previously, we looked at RR = OR = 1

More information

Chapter 10. Chapter 10. Multinomial Experiments and. Multinomial Experiments and Contingency Tables. Contingency Tables.

Chapter 10. Chapter 10. Multinomial Experiments and. Multinomial Experiments and Contingency Tables. Contingency Tables. Chapter 10 Multinomial Experiments and Contingency Tables 1 Chapter 10 Multinomial Experiments and Contingency Tables 10-1 1 Overview 10-2 2 Multinomial Experiments: of-fitfit 10-3 3 Contingency Tables:

More information

More Powerful Tests for Homogeneity of Multivariate Normal Mean Vectors under an Order Restriction

More Powerful Tests for Homogeneity of Multivariate Normal Mean Vectors under an Order Restriction Sankhyā : The Indian Journal of Statistics 2007, Volume 69, Part 4, pp. 700-716 c 2007, Indian Statistical Institute More Powerful Tests for Homogeneity of Multivariate Normal Mean Vectors under an Order

More information

CONVEX FUNCTIONS AND MATRICES. Silvestru Sever Dragomir

CONVEX FUNCTIONS AND MATRICES. Silvestru Sever Dragomir Korean J. Math. 6 (018), No. 3, pp. 349 371 https://doi.org/10.11568/kjm.018.6.3.349 INEQUALITIES FOR QUANTUM f-divergence OF CONVEX FUNCTIONS AND MATRICES Silvestru Sever Dragomir Abstract. Some inequalities

More information

2.6.3 Generalized likelihood ratio tests

2.6.3 Generalized likelihood ratio tests 26 HYPOTHESIS TESTING 113 263 Generalized likelihood ratio tests When a UMP test does not exist, we usually use a generalized likelihood ratio test to verify H 0 : θ Θ against H 1 : θ Θ\Θ It can be used

More information

Topic 21 Goodness of Fit

Topic 21 Goodness of Fit Topic 21 Goodness of Fit Contingency Tables 1 / 11 Introduction Two-way Table Smoking Habits The Hypothesis The Test Statistic Degrees of Freedom Outline 2 / 11 Introduction Contingency tables, also known

More information

Literature on Bregman divergences

Literature on Bregman divergences Literature on Bregman divergences Lecture series at Univ. Hawai i at Mānoa Peter Harremoës February 26, 2016 Information divergence was introduced by Kullback and Leibler [25] and later Kullback started

More information

Unit 9: Inferences for Proportions and Count Data

Unit 9: Inferences for Proportions and Count Data Unit 9: Inferences for Proportions and Count Data Statistics 571: Statistical Methods Ramón V. León 12/15/2008 Unit 9 - Stat 571 - Ramón V. León 1 Large Sample Confidence Interval for Proportion ( pˆ p)

More information

PROPERTIES. Ferdinand Österreicher Institute of Mathematics, University of Salzburg, Austria

PROPERTIES. Ferdinand Österreicher Institute of Mathematics, University of Salzburg, Austria CSISZÁR S f-divergences - BASIC PROPERTIES Ferdinand Österreicher Institute of Mathematics, University of Salzburg, Austria Abstract In this talk basic general properties of f-divergences, including their

More information

INFORMATION THEORY AND STATISTICS

INFORMATION THEORY AND STATISTICS INFORMATION THEORY AND STATISTICS Solomon Kullback DOVER PUBLICATIONS, INC. Mineola, New York Contents 1 DEFINITION OF INFORMATION 1 Introduction 1 2 Definition 3 3 Divergence 6 4 Examples 7 5 Problems...''.

More information

A New Quantum f-divergence for Trace Class Operators in Hilbert Spaces

A New Quantum f-divergence for Trace Class Operators in Hilbert Spaces Entropy 04, 6, 5853-5875; doi:0.3390/e65853 OPEN ACCESS entropy ISSN 099-4300 www.mdpi.com/journal/entropy Article A New Quantum f-divergence for Trace Class Operators in Hilbert Spaces Silvestru Sever

More information

Central Limit Theorem ( 5.3)

Central Limit Theorem ( 5.3) Central Limit Theorem ( 5.3) Let X 1, X 2,... be a sequence of independent random variables, each having n mean µ and variance σ 2. Then the distribution of the partial sum S n = X i i=1 becomes approximately

More information

Discrete Multivariate Statistics

Discrete Multivariate Statistics Discrete Multivariate Statistics Univariate Discrete Random variables Let X be a discrete random variable which, in this module, will be assumed to take a finite number of t different values which are

More information

Unit 9: Inferences for Proportions and Count Data

Unit 9: Inferences for Proportions and Count Data Unit 9: Inferences for Proportions and Count Data Statistics 571: Statistical Methods Ramón V. León 1/15/008 Unit 9 - Stat 571 - Ramón V. León 1 Large Sample Confidence Interval for Proportion ( pˆ p)

More information

KRUSKAL-WALLIS ONE-WAY ANALYSIS OF VARIANCE BASED ON LINEAR PLACEMENTS

KRUSKAL-WALLIS ONE-WAY ANALYSIS OF VARIANCE BASED ON LINEAR PLACEMENTS Bull. Korean Math. Soc. 5 (24), No. 3, pp. 7 76 http://dx.doi.org/34/bkms.24.5.3.7 KRUSKAL-WALLIS ONE-WAY ANALYSIS OF VARIANCE BASED ON LINEAR PLACEMENTS Yicheng Hong and Sungchul Lee Abstract. The limiting

More information

STAT Chapter 13: Categorical Data. Recall we have studied binomial data, in which each trial falls into one of 2 categories (success/failure).

STAT Chapter 13: Categorical Data. Recall we have studied binomial data, in which each trial falls into one of 2 categories (success/failure). STAT 515 -- Chapter 13: Categorical Data Recall we have studied binomial data, in which each trial falls into one of 2 categories (success/failure). Many studies allow for more than 2 categories. Example

More information

CHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007)

CHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007) FROM: PAGANO, R. R. (007) I. INTRODUCTION: DISTINCTION BETWEEN PARAMETRIC AND NON-PARAMETRIC TESTS Statistical inference tests are often classified as to whether they are parametric or nonparametric Parameter

More information

Chi-Square. Heibatollah Baghi, and Mastee Badii

Chi-Square. Heibatollah Baghi, and Mastee Badii 1 Chi-Square Heibatollah Baghi, and Mastee Badii Different Scales, Different Measures of Association Scale of Both Variables Nominal Scale Measures of Association Pearson Chi-Square: χ 2 Ordinal Scale

More information

TUTORIAL 8 SOLUTIONS #

TUTORIAL 8 SOLUTIONS # TUTORIAL 8 SOLUTIONS #9.11.21 Suppose that a single observation X is taken from a uniform density on [0,θ], and consider testing H 0 : θ = 1 versus H 1 : θ =2. (a) Find a test that has significance level

More information

Ridit Score Type Quasi-Symmetry and Decomposition of Symmetry for Square Contingency Tables with Ordered Categories

Ridit Score Type Quasi-Symmetry and Decomposition of Symmetry for Square Contingency Tables with Ordered Categories AUSTRIAN JOURNAL OF STATISTICS Volume 38 (009), Number 3, 183 19 Ridit Score Type Quasi-Symmetry and Decomposition of Symmetry for Square Contingency Tables with Ordered Categories Kiyotaka Iki, Kouji

More information

STAT 135 Lab 11 Tests for Categorical Data (Fisher s Exact test, χ 2 tests for Homogeneity and Independence) and Linear Regression

STAT 135 Lab 11 Tests for Categorical Data (Fisher s Exact test, χ 2 tests for Homogeneity and Independence) and Linear Regression STAT 135 Lab 11 Tests for Categorical Data (Fisher s Exact test, χ 2 tests for Homogeneity and Independence) and Linear Regression Rebecca Barter April 20, 2015 Fisher s Exact Test Fisher s Exact Test

More information

ON A CLASS OF PERIMETER-TYPE DISTANCES OF PROBABILITY DISTRIBUTIONS

ON A CLASS OF PERIMETER-TYPE DISTANCES OF PROBABILITY DISTRIBUTIONS KYBERNETIKA VOLUME 32 (1996), NUMBER 4, PAGES 389-393 ON A CLASS OF PERIMETER-TYPE DISTANCES OF PROBABILITY DISTRIBUTIONS FERDINAND OSTERREICHER The class If, p G (l,oo], of /-divergences investigated

More information

11-2 Multinomial Experiment

11-2 Multinomial Experiment Chapter 11 Multinomial Experiments and Contingency Tables 1 Chapter 11 Multinomial Experiments and Contingency Tables 11-11 Overview 11-2 Multinomial Experiments: Goodness-of-fitfit 11-3 Contingency Tables:

More information

2.3 Analysis of Categorical Data

2.3 Analysis of Categorical Data 90 CHAPTER 2. ESTIMATION AND HYPOTHESIS TESTING 2.3 Analysis of Categorical Data 2.3.1 The Multinomial Probability Distribution A mulinomial random variable is a generalization of the binomial rv. It results

More information

A Very Brief Summary of Statistical Inference, and Examples

A Very Brief Summary of Statistical Inference, and Examples A Very Brief Summary of Statistical Inference, and Examples Trinity Term 2009 Prof. Gesine Reinert Our standard situation is that we have data x = x 1, x 2,..., x n, which we view as realisations of random

More information

Testing Goodness-of-Fit for Exponential Distribution Based on Cumulative Residual Entropy

Testing Goodness-of-Fit for Exponential Distribution Based on Cumulative Residual Entropy This article was downloaded by: [Ferdowsi University] On: 16 April 212, At: 4:53 Publisher: Taylor & Francis Informa Ltd Registered in England and Wales Registered Number: 172954 Registered office: Mortimer

More information

Inverse Sampling for McNemar s Test

Inverse Sampling for McNemar s Test International Journal of Statistics and Probability; Vol. 6, No. 1; January 27 ISSN 1927-7032 E-ISSN 1927-7040 Published by Canadian Center of Science and Education Inverse Sampling for McNemar s Test

More information

Institute of Actuaries of India

Institute of Actuaries of India Institute of Actuaries of India Subject CT3 Probability & Mathematical Statistics May 2011 Examinations INDICATIVE SOLUTION Introduction The indicative solution has been written by the Examiners with the

More information

Chi-Squared Tests. Semester 1. Chi-Squared Tests

Chi-Squared Tests. Semester 1. Chi-Squared Tests Semester 1 Goodness of Fit Up to now, we have tested hypotheses concerning the values of population parameters such as the population mean or proportion. We have not considered testing hypotheses about

More information

Received: 20 December 2011; in revised form: 4 February 2012 / Accepted: 7 February 2012 / Published: 2 March 2012

Received: 20 December 2011; in revised form: 4 February 2012 / Accepted: 7 February 2012 / Published: 2 March 2012 Entropy 2012, 14, 480-490; doi:10.3390/e14030480 Article OPEN ACCESS entropy ISSN 1099-4300 www.mdpi.com/journal/entropy Interval Entropy and Informative Distance Fakhroddin Misagh 1, * and Gholamhossein

More information

ij i j m ij n ij m ij n i j Suppose we denote the row variable by X and the column variable by Y ; We can then re-write the above expression as

ij i j m ij n ij m ij n i j Suppose we denote the row variable by X and the column variable by Y ; We can then re-write the above expression as page1 Loglinear Models Loglinear models are a way to describe association and interaction patterns among categorical variables. They are commonly used to model cell counts in contingency tables. These

More information

Sample size determination for logistic regression: A simulation study

Sample size determination for logistic regression: A simulation study Sample size determination for logistic regression: A simulation study Stephen Bush School of Mathematical Sciences, University of Technology Sydney, PO Box 123 Broadway NSW 2007, Australia Abstract This

More information

On the GLR and UMP tests in the family with support dependent on the parameter

On the GLR and UMP tests in the family with support dependent on the parameter STATISTICS, OPTIMIZATION AND INFORMATION COMPUTING Stat., Optim. Inf. Comput., Vol. 3, September 2015, pp 221 228. Published online in International Academic Press (www.iapress.org On the GLR and UMP tests

More information

Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami

Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami Parametric Assumptions The observations must be independent. Dependent variable should be continuous

More information

Unit 14: Nonparametric Statistical Methods

Unit 14: Nonparametric Statistical Methods Unit 14: Nonparametric Statistical Methods Statistics 571: Statistical Methods Ramón V. León 8/8/2003 Unit 14 - Stat 571 - Ramón V. León 1 Introductory Remarks Most methods studied so far have been based

More information

Chapter 11. Hypothesis Testing (II)

Chapter 11. Hypothesis Testing (II) Chapter 11. Hypothesis Testing (II) 11.1 Likelihood Ratio Tests one of the most popular ways of constructing tests when both null and alternative hypotheses are composite (i.e. not a single point). Let

More information

14.30 Introduction to Statistical Methods in Economics Spring 2009

14.30 Introduction to Statistical Methods in Economics Spring 2009 MIT OpenCourseWare http://ocw.mit.edu 4.0 Introduction to Statistical Methods in Economics Spring 009 For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms.

More information

A homogeneity test for spatial point patterns

A homogeneity test for spatial point patterns A homogeneity test for spatial point patterns M.V. Alba-Fernández University of Jaén Paraje las lagunillas, s/n B3-053, 23071, Jaén, Spain mvalba@ujaen.es F. J. Ariza-López University of Jaén Paraje las

More information

Negative Multinomial Model and Cancer. Incidence

Negative Multinomial Model and Cancer. Incidence Generalized Linear Model under the Extended Negative Multinomial Model and Cancer Incidence S. Lahiri & Sunil K. Dhar Department of Mathematical Sciences, CAMS New Jersey Institute of Technology, Newar,

More information

Chi-square (χ 2 ) Tests

Chi-square (χ 2 ) Tests Math 442 - Mathematical Statistics II April 30, 2018 Chi-square (χ 2 ) Tests Common Uses of the χ 2 test. 1. Testing Goodness-of-fit. 2. Testing Equality of Several Proportions. 3. Homogeneity Test. 4.

More information

Analysis of data in square contingency tables

Analysis of data in square contingency tables Analysis of data in square contingency tables Iva Pecáková Let s suppose two dependent samples: the response of the nth subject in the second sample relates to the response of the nth subject in the first

More information

Psychology 282 Lecture #4 Outline Inferences in SLR

Psychology 282 Lecture #4 Outline Inferences in SLR Psychology 282 Lecture #4 Outline Inferences in SLR Assumptions To this point we have not had to make any distributional assumptions. Principle of least squares requires no assumptions. Can use correlations

More information

Exercises Chapter 4 Statistical Hypothesis Testing

Exercises Chapter 4 Statistical Hypothesis Testing Exercises Chapter 4 Statistical Hypothesis Testing Advanced Econometrics - HEC Lausanne Christophe Hurlin University of Orléans December 5, 013 Christophe Hurlin (University of Orléans) Advanced Econometrics

More information

ROBUST TESTS BASED ON MINIMUM DENSITY POWER DIVERGENCE ESTIMATORS AND SADDLEPOINT APPROXIMATIONS

ROBUST TESTS BASED ON MINIMUM DENSITY POWER DIVERGENCE ESTIMATORS AND SADDLEPOINT APPROXIMATIONS ROBUST TESTS BASED ON MINIMUM DENSITY POWER DIVERGENCE ESTIMATORS AND SADDLEPOINT APPROXIMATIONS AIDA TOMA The nonrobustness of classical tests for parametric models is a well known problem and various

More information

Statistical Tests for Parameterized Multinomial Models: Power Approximation and Optimization. Edgar Erdfelder University of Mannheim Germany

Statistical Tests for Parameterized Multinomial Models: Power Approximation and Optimization. Edgar Erdfelder University of Mannheim Germany Statistical Tests for Parameterized Multinomial Models: Power Approximation and Optimization Edgar Erdfelder University of Mannheim Germany The Problem Typically, model applications start with a global

More information

TMA 4275 Lifetime Analysis June 2004 Solution

TMA 4275 Lifetime Analysis June 2004 Solution TMA 4275 Lifetime Analysis June 2004 Solution Problem 1 a) Observation of the outcome is censored, if the time of the outcome is not known exactly and only the last time when it was observed being intact,

More information

arxiv: v4 [math.st] 16 Nov 2015

arxiv: v4 [math.st] 16 Nov 2015 Influence Analysis of Robust Wald-type Tests Abhik Ghosh 1, Abhijit Mandal 2, Nirian Martín 3, Leandro Pardo 3 1 Indian Statistical Institute, Kolkata, India 2 Univesity of Minnesota, Minneapolis, USA

More information

STAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots. March 8, 2015

STAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots. March 8, 2015 STAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots March 8, 2015 The duality between CI and hypothesis testing The duality between CI and hypothesis

More information

Economics 520. Lecture Note 19: Hypothesis Testing via the Neyman-Pearson Lemma CB 8.1,

Economics 520. Lecture Note 19: Hypothesis Testing via the Neyman-Pearson Lemma CB 8.1, Economics 520 Lecture Note 9: Hypothesis Testing via the Neyman-Pearson Lemma CB 8., 8.3.-8.3.3 Uniformly Most Powerful Tests and the Neyman-Pearson Lemma Let s return to the hypothesis testing problem

More information

HYPOTHESIS TESTING: THE CHI-SQUARE STATISTIC

HYPOTHESIS TESTING: THE CHI-SQUARE STATISTIC 1 HYPOTHESIS TESTING: THE CHI-SQUARE STATISTIC 7 steps of Hypothesis Testing 1. State the hypotheses 2. Identify level of significant 3. Identify the critical values 4. Calculate test statistics 5. Compare

More information

We know from STAT.1030 that the relevant test statistic for equality of proportions is:

We know from STAT.1030 that the relevant test statistic for equality of proportions is: 2. Chi 2 -tests for equality of proportions Introduction: Two Samples Consider comparing the sample proportions p 1 and p 2 in independent random samples of size n 1 and n 2 out of two populations which

More information

Lecture 7 Introduction to Statistical Decision Theory

Lecture 7 Introduction to Statistical Decision Theory Lecture 7 Introduction to Statistical Decision Theory I-Hsiang Wang Department of Electrical Engineering National Taiwan University ihwang@ntu.edu.tw December 20, 2016 1 / 55 I-Hsiang Wang IT Lecture 7

More information

Testing Non-Linear Ordinal Responses in L2 K Tables

Testing Non-Linear Ordinal Responses in L2 K Tables RUHUNA JOURNA OF SCIENCE Vol. 2, September 2007, pp. 18 29 http://www.ruh.ac.lk/rjs/ ISSN 1800-279X 2007 Faculty of Science University of Ruhuna. Testing Non-inear Ordinal Responses in 2 Tables eslie Jayasekara

More information

A COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky

A COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky A COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky Empirical likelihood with right censored data were studied by Thomas and Grunkmier (1975), Li (1995),

More information

Bias Correction of Cross-Validation Criterion Based on Kullback-Leibler Information under a General Condition

Bias Correction of Cross-Validation Criterion Based on Kullback-Leibler Information under a General Condition Bias Correction of Cross-Validation Criterion Based on Kullback-Leibler Information under a General Condition Hirokazu Yanagihara 1, Tetsuji Tonda 2 and Chieko Matsumoto 3 1 Department of Social Systems

More information

HANDBOOK OF APPLICABLE MATHEMATICS

HANDBOOK OF APPLICABLE MATHEMATICS HANDBOOK OF APPLICABLE MATHEMATICS Chief Editor: Walter Ledermann Volume VI: Statistics PART A Edited by Emlyn Lloyd University of Lancaster A Wiley-Interscience Publication JOHN WILEY & SONS Chichester

More information

Categorical Data Analysis Chapter 3

Categorical Data Analysis Chapter 3 Categorical Data Analysis Chapter 3 The actual coverage probability is usually a bit higher than the nominal level. Confidence intervals for association parameteres Consider the odds ratio in the 2x2 table,

More information

The goodness-of-fit test Having discussed how to make comparisons between two proportions, we now consider comparisons of multiple proportions.

The goodness-of-fit test Having discussed how to make comparisons between two proportions, we now consider comparisons of multiple proportions. The goodness-of-fit test Having discussed how to make comparisons between two proportions, we now consider comparisons of multiple proportions. A common problem of this type is concerned with determining

More information

Frequency Distribution Cross-Tabulation

Frequency Distribution Cross-Tabulation Frequency Distribution Cross-Tabulation 1) Overview 2) Frequency Distribution 3) Statistics Associated with Frequency Distribution i. Measures of Location ii. Measures of Variability iii. Measures of Shape

More information

Contingency Tables Part One 1

Contingency Tables Part One 1 Contingency Tables Part One 1 STA 312: Fall 2012 1 See last slide for copyright information. 1 / 32 Suggested Reading: Chapter 2 Read Sections 2.1-2.4 You are not responsible for Section 2.5 2 / 32 Overview

More information

Example. χ 2 = Continued on the next page. All cells

Example. χ 2 = Continued on the next page. All cells Section 11.1 Chi Square Statistic k Categories 1 st 2 nd 3 rd k th Total Observed Frequencies O 1 O 2 O 3 O k n Expected Frequencies E 1 E 2 E 3 E k n O 1 + O 2 + O 3 + + O k = n E 1 + E 2 + E 3 + + E

More information

Ling 289 Contingency Table Statistics

Ling 289 Contingency Table Statistics Ling 289 Contingency Table Statistics Roger Levy and Christopher Manning This is a summary of the material that we ve covered on contingency tables. Contingency tables: introduction Odds ratios Counting,

More information

Module 10: Analysis of Categorical Data Statistics (OA3102)

Module 10: Analysis of Categorical Data Statistics (OA3102) Module 10: Analysis of Categorical Data Statistics (OA3102) Professor Ron Fricker Naval Postgraduate School Monterey, California Reading assignment: WM&S chapter 14.1-14.7 Revision: 3-12 1 Goals for this

More information

Multiple Sample Categorical Data

Multiple Sample Categorical Data Multiple Sample Categorical Data paired and unpaired data, goodness-of-fit testing, testing for independence University of California, San Diego Instructor: Ery Arias-Castro http://math.ucsd.edu/~eariasca/teaching.html

More information

Hypothesis Testing - Frequentist

Hypothesis Testing - Frequentist Frequentist Hypothesis Testing - Frequentist Compare two hypotheses to see which one better explains the data. Or, alternatively, what is the best way to separate events into two classes, those originating

More information

Chi-square (χ 2 ) Tests

Chi-square (χ 2 ) Tests Math 145 - Elementary Statistics April 17, 2007 Common Uses of the χ 2 test. 1. Testing Goodness-of-fit. Chi-square (χ 2 ) Tests 2. Testing Equality of Several Proportions. 3. Homogeneity Test. 4. Testing

More information

2.1.3 The Testing Problem and Neave s Step Method

2.1.3 The Testing Problem and Neave s Step Method we can guarantee (1) that the (unknown) true parameter vector θ t Θ is an interior point of Θ, and (2) that ρ θt (R) > 0 for any R 2 Q. These are two of Birch s regularity conditions that were critical

More information

Modeling and inference for an ordinal effect size measure

Modeling and inference for an ordinal effect size measure STATISTICS IN MEDICINE Statist Med 2007; 00:1 15 Modeling and inference for an ordinal effect size measure Euijung Ryu, and Alan Agresti Department of Statistics, University of Florida, Gainesville, FL

More information

Inference for Categorical Data. Chi-Square Tests for Goodness of Fit and Independence

Inference for Categorical Data. Chi-Square Tests for Goodness of Fit and Independence Chi-Square Tests for Goodness of Fit and Independence Chi-Square Tests In this course, we use chi-square tests in two different ways The chi-square test for goodness-of-fit is used to determine whether

More information

Adejumo, Heumann, Toutenburg: Modelling Negative Binomial as a substitute model to Poisson for raters agreement on ordinal scales with sparse data

Adejumo, Heumann, Toutenburg: Modelling Negative Binomial as a substitute model to Poisson for raters agreement on ordinal scales with sparse data Adejumo, Heumann, Toutenburg: Modelling Negative Binomial as a substitute model to Poisson for raters agreement on ordinal scales with sparse data Sonderforschungsbereich 386, Paper 387 (2004) Online unter:

More information

The purpose of this section is to derive the asymptotic distribution of the Pearson chi-square statistic. k (n j np j ) 2. np j.

The purpose of this section is to derive the asymptotic distribution of the Pearson chi-square statistic. k (n j np j ) 2. np j. Chapter 9 Pearson s chi-square test 9. Null hypothesis asymptotics Let X, X 2, be independent from a multinomial(, p) distribution, where p is a k-vector with nonnegative entries that sum to one. That

More information

Mutual information of Contingency Tables and Related Inequalities

Mutual information of Contingency Tables and Related Inequalities Mutual information of Contingency Tables and Related Inequalities Peter Harremoës Copenhagen Business College Copenhagen, Denmark Email: harremoes@ieeeorg arxiv:4020092v [mathst] Feb 204 Abstract For testing

More information

Review. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 770: Categorical Data Analysis

Review. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 770: Categorical Data Analysis Review Timothy Hanson Department of Statistics, University of South Carolina Stat 770: Categorical Data Analysis 1 / 22 Chapter 1: background Nominal, ordinal, interval data. Distributions: Poisson, binomial,

More information

13.1 Categorical Data and the Multinomial Experiment

13.1 Categorical Data and the Multinomial Experiment Chapter 13 Categorical Data Analysis 13.1 Categorical Data and the Multinomial Experiment Recall Variable: (numerical) variable (i.e. # of students, temperature, height,). (non-numerical, categorical)

More information

Sample size calculations for logistic and Poisson regression models

Sample size calculations for logistic and Poisson regression models Biometrika (2), 88, 4, pp. 93 99 2 Biometrika Trust Printed in Great Britain Sample size calculations for logistic and Poisson regression models BY GWOWEN SHIEH Department of Management Science, National

More information

Two-way contingency tables for complex sampling schemes

Two-way contingency tables for complex sampling schemes Biomctrika (1976), 63, 2, p. 271-6 271 Printed in Oreat Britain Two-way contingency tables for complex sampling schemes BT J. J. SHUSTER Department of Statistics, University of Florida, Gainesville AND

More information

A SYMMETRIC INFORMATION DIVERGENCE MEASURE OF CSISZAR'S F DIVERGENCE CLASS

A SYMMETRIC INFORMATION DIVERGENCE MEASURE OF CSISZAR'S F DIVERGENCE CLASS Journal of the Applied Mathematics, Statistics Informatics (JAMSI), 7 (2011), No. 1 A SYMMETRIC INFORMATION DIVERGENCE MEASURE OF CSISZAR'S F DIVERGENCE CLASS K.C. JAIN AND R. MATHUR Abstract Information

More information

The Multinomial Model

The Multinomial Model The Multinomial Model STA 312: Fall 2012 Contents 1 Multinomial Coefficients 1 2 Multinomial Distribution 2 3 Estimation 4 4 Hypothesis tests 8 5 Power 17 1 Multinomial Coefficients Multinomial coefficient

More information

The Number of Symmetric Colorings of the Quaternion Group

The Number of Symmetric Colorings of the Quaternion Group Symmetry 010,, 69-75; doi:10.3390/sym010069 Article OPEN ACCESS symmetry ISSN 073-8994 www.mdpi.com/journal/symmetry The Number of Symmetric Colorings of the Quaternion Group Yuliya Zelenyuk School of

More information

Statistics in medicine

Statistics in medicine Statistics in medicine Lecture 3: Bivariate association : Categorical variables Proportion in one group One group is measured one time: z test Use the z distribution as an approximation to the binomial

More information

Statistics for Managers Using Microsoft Excel

Statistics for Managers Using Microsoft Excel Statistics for Managers Using Microsoft Excel 7 th Edition Chapter 1 Chi-Square Tests and Nonparametric Tests Statistics for Managers Using Microsoft Excel 7e Copyright 014 Pearson Education, Inc. Chap

More information

On the Comparison of Fisher Information of the Weibull and GE Distributions

On the Comparison of Fisher Information of the Weibull and GE Distributions On the Comparison of Fisher Information of the Weibull and GE Distributions Rameshwar D. Gupta Debasis Kundu Abstract In this paper we consider the Fisher information matrices of the generalized exponential

More information

Statistics 135 Fall 2008 Final Exam

Statistics 135 Fall 2008 Final Exam Name: SID: Statistics 135 Fall 2008 Final Exam Show your work. The number of points each question is worth is shown at the beginning of the question. There are 10 problems. 1. [2] The normal equations

More information

DIAGNOSTICS FOR STRATIFIED CLINICAL TRIALS IN PROPORTIONAL ODDS MODELS

DIAGNOSTICS FOR STRATIFIED CLINICAL TRIALS IN PROPORTIONAL ODDS MODELS DIAGNOSTICS FOR STRATIFIED CLINICAL TRIALS IN PROPORTIONAL ODDS MODELS Ivy Liu and Dong Q. Wang School of Mathematics, Statistics and Computer Science Victoria University of Wellington New Zealand Corresponding

More information

Pseudo-score confidence intervals for parameters in discrete statistical models

Pseudo-score confidence intervals for parameters in discrete statistical models Biometrika Advance Access published January 14, 2010 Biometrika (2009), pp. 1 8 C 2009 Biometrika Trust Printed in Great Britain doi: 10.1093/biomet/asp074 Pseudo-score confidence intervals for parameters

More information

Review of One-way Tables and SAS

Review of One-way Tables and SAS Stat 504, Lecture 7 1 Review of One-way Tables and SAS In-class exercises: Ex1, Ex2, and Ex3 from http://v8doc.sas.com/sashtml/proc/z0146708.htm To calculate p-value for a X 2 or G 2 in SAS: http://v8doc.sas.com/sashtml/lgref/z0245929.htmz0845409

More information

Lecture 9. Selected material from: Ch. 12 The analysis of categorical data and goodness of fit tests

Lecture 9. Selected material from: Ch. 12 The analysis of categorical data and goodness of fit tests Lecture 9 Selected material from: Ch. 12 The analysis of categorical data and goodness of fit tests Univariate categorical data Univariate categorical data are best summarized in a one way frequency table.

More information

ROI ANALYSIS OF PHARMAFMRI DATA:

ROI ANALYSIS OF PHARMAFMRI DATA: ROI ANALYSIS OF PHARMAFMRI DATA: AN ADAPTIVE APPROACH FOR GLOBAL TESTING Giorgos Minas, John A.D. Aston, Thomas E. Nichols and Nigel Stallard Department of Statistics and Warwick Centre of Analytical Sciences,

More information