Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at

Size: px
Start display at page:

Download "Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at"

Transcription

1 Biometrika Trust Analysis of Variability with Large Numbers of Small Samples Author(s): D. R. Cox and P. J. Solomon Source: Biometrika, Vol. 73, No. 3 (Dec., 1986), pp Published by: Oxford University Press on behalf of Biometrika Trust Stable URL: Accessed: :35 UTC REFERENCES Linked references are available on JSTOR for this article: You may need to log in to JSTOR to access the linked references. JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide range of content in a trusted digital archive. We use information technology and tools to increase productivity and facilitate new forms of scholarship. For more information about JSTOR, please contact support@jstor.org. Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at Biometrika Trust, Oxford University Press are collaborating with JSTOR to digitize, preserve and extend access to Biometrika

2 Biometrika (1986), 73, 3, pp Printed in Great Britain Analysis of variability with large numbers of small samples By D. R. COX Department of Mathematics, Imperial College, London SW7 2BZ, U.K. AND P. J. SOLOMON Department of Statistics, University of Adelaide, Adelaide, S.A., Australia 5001 SUMMARY Procedures are discussed for the detailed analysis of distributional form, based on many samples of size r, where especially r = 2, 3, 4. The possibility of discriminating between different kinds of departure from the standard normal assumptions is discussed. Both graphical and more formal procedures are developed and iltustrated by some data on pulse rates. Some key words: Graphical methods; Kurtosis; Nonnormality; Order statistics; Overdispersion; Pulse rate; Skewness. 1. INTRODUCTION It is common both in some kinds of balanced experimental design and in schemes of routine testing to have quite large numbers of small groups of observations, each group obtained under the same conditions. For example, in a large study, blood pressure measurements might be taken in duplicate on each patient visit, or in certain routine chemical analyses triplicate samples from the same source might be dealt with independently. There can be major problems in ensuring the independence of such replicate observations necessary to ensure that the relevant source of variability is not underestimated. Assuming that this independence is achieved, preferably by appropriate 'blinding', we use the replicate observations primarily to improve the precision of the mean, or other location estimate, but also to estimate a component of variance within samples, usually measuring sampling or measurement error. A further use is to check for gross errors affecting a single observation. In the present paper, however, we examine what further information can be extracted from such data. Throughout we suppose that, possibly after transformation, the 'standard' assumption of a normal distribution with constant variance is at least a reasonable starting point for an analysis. Of course more detailed analysis is likely to be worthwhile only if the variation within samples is of intrinsic interest. We assume throughout that rounding errors, digit preferences and the like are relatively unimportant and that the variability studied is not an artefact. 2. SOME GENERAL IDEAS 2 1. Formulation of models Suppose that we have m independent samples each of size r, the observations in the ith sample being ydl,...., Yir (i = 1..., inm). As a basis for the analysis we consider a

3 544 D. R. COX AND P. J. SOLOMON number of possible models for the corresponding simplest is the standard, normal theory one. Normal theory model, MN. The Yip are independently normally distributed with const variance o.2 and with E ( Yip) = ui. As explained in? 1 the focus of interest in the present paper is not the {Ui}, which may indeed have some additional structure specified, for instance, by a regression or factorial model. We shall typically suppose that there are substantial differences in mean present and that no useful information about the internal variability can be recovered from the between-sample variation. There are numerous ways in which interesting departures from MN may occur and we shall consider just three. Systematic changes in variance, MSY. Here the assumptions of MN hold except that var (YYi,) = CJ2, which is not constant but is a function either of an explanatory variable zi characterizing the ith population or of g,i. Such variation can, if necessary, be represent in various ways (Cook & Weisberg, 1983), e.g. OJ2= e/zi 0-2 2= etgi O02 o2 = = epg9i.2 P - H 20 where o-2o is a 'baseline' variance and / captures the systematic dependency present. An important extension allows for more complex multidimensional dependence: for example, if the samples are arranged in a row x column array, there may be systematic differences in variance between rows, between columns, or both. Complementary to MSY is the possibility that, while each population has a different variance, the changes in variance are random, unrelated to any observed feature. This is a model of overdispersion relative to MN. Overdispersion model, MOD. Again the normal theory assumptions of MN are modifie only by allowing each population to have a different variance, var ( Y1) = (J2, but now Ti = C-2 (i = 1,..., m) are independent unobserved values of a random variable T having a probability density function h(t). In the type of application we have in mind, it will often not be feasible to estimate the form of h(t) with any precision, and it may then be adequate to assume that T has an inverse gamma distribution, with density (Tfo)o tt-'fo exp (-'fot'/t){f('fo)f1, (1) where fo plays the role of an 'effective degrees of freedom' and E(T)= Tfo (fo-2)-1= O, say. For our final model, we suppose that, except for location, all populations ha same distribution, which is, however, nonnormal. Nonnormal model, MNN. All { Y1} are independent and the density of Yi, is where g(x) is a nonnormal density of zero mean. We write Kr for the rth cumulant of Yis and pr=kr/(j (r > 3). Of course, there are other possibilities and, in particular, we could combine M MOD and MNN in various ways, and also introduce models involving serial correlation.

4 Variability with large numbers of small samples 545 We are interested in methods for detecting departures from MN, in the estimation of relevant parameters in MSY, MOD and MNN, ana in studying the feasibility of discriminat ing between these three kinds of departure, in particular between MOD and MNN. While for very small values of r, separation of MOD and MNN may often not be possible, note that if there is substantial underdispersion in the sample estimates of variance, MOD can at once be eliminated from consideration. No special model for gross errors has been included. Both MOD and MNN can simulate such errors, in the first case via an occasional very large 'true' variance and in the second via a very long-tailed error distribution. Note that with r > 2 if gross errors were detected to be predominantly in one direction, MNN rather than MOD would be required. To some extent further analysis and interpretation may be quite strongly influenced by what kind of departure from MN is most appropriate. Thus, as between MOD and MNN, the former prompts the question 'why are some groups more variable than others?', whereas under the latter the errors may more reasonably be presumed to have a totally homogeneous structure. For the ith sample, we write 2 2. Some simple properties of MOD and MNN -i =, Yip/r Si = I ( Yip _ yi )2 for the sample mean and sum of squares. These are sufficient under both MN and MOD. It is convenient also to write ={Si/ (r -1 )}2 and 'i= 5i for the usual estimates of standard deviation and variance. For the overdispersion model MOD with the inverse gamma compounding density (1), the likelihood of the ith sample is (fo T?)!oF(1fO + 2r) F(2fo)irir{foro+ r(-zi)2+ Si}fo+lrf (2) which is of the Student's t form. If the {,i} are regarded as nuisance parameters, we examine the marginal likelihood based on Si which is Sir-3/2{1 + Sr/(f0to)}-foAr+i (foto r bb2f 2 r2) 3 For the general compounding density h(t), (3) is replaced by 00 J r1qr-1(sit'v)h(t) dt, (4) where qr1l(x) is the probability density of th of freedom. Estimation under the model MOD can be based on (3). Alternatively, and also for discriminating between MOD and MNN, it is useful to record the cumulants {Kj(i zi in terms of the cumulants {Aj} of the compounding density h(t). These are, wit 2= A1 = E(T), E()= var ()=+-1- (5)

5 546 D. R. COX AND P. J. SOLOMON say, and - 6S 8 _12r+1 A2 +(r+l1)(r +3) A31 K3(Ti)=)2+ (r-1)2 4 (r-1)(r+)aj Because A2/ U4, A3! O6,... are the cumulants of a nonnegative random variable of unit mean, A3/oJ6 > (A2/! o4)(a 2/4-1), with equality attained by a two-point distribution with one atom at zero. Therefore (T-) >.6{- + 8 (r+l)(9-r) A2 (r+1)(r+3) (A2)2} {(r- 1)2 (r- l) +2 r+1) +2 Note that negative skewness in the distribution of -, is possible, although for s of r, very large values of if2 would be necessary to bring this about. By comparison we have under the nonnormal model MNN (Kendall & Stuart, 1969, Ch. 12) E(T-)=cr2, var ( (~~\6 8+ 4(r -2) 1? 2 _(7 03( E) =C{(r - I1)2 +r(r - l1)2 p23 +r( r - l ) r From (5) and (7) we can compute fairly directly via the first two moments of the {T} estimates of either A2/ C in MOD or p4 in MNN. Both are dimensionless measures of the departure from the standard conditions MN. For r> 2 we can compute for each sample scale and location invariant measures of 'shape', e.g. the standardized third cumulant. Under MN and MOD these are distributed independently of yi. and Si, but this independence is in general lost under MNN. We shall in the subsequent discussion suppose that the population means {fig} are such that no useful information about error properties can be gleaned from the sample means 9'. Note, however, that under MNN cov (i., Si) = (r-1)k3/r. If the {ui} are distributed with variance -2,, this covering both systematic and rando variation, we have so that var (Yi.) = (J2/r + &.2, var(si) = 2(r - 1)(J4+ (r -)2K41r, corr (Yi., Si) = P I (8) Under MNN there is thus some possibility tha which could be misinterpreted as evidence The correlation (8) is, however, likely to be assume.

6 Variability with large numbers of small samples Procedures based on the distribution of Si In? 2*2 some of the properties of the sample sums of squares ha especially under MOD and MNN. We now consider some corresponding statistical procedures. For preliminary analysis a probability plot of the ordered {Si} against the expected order statistics of the chi-squared distribution with r- 1 degrees of freedom (Pearson & Hartley, 1972, Table 20) is natural. When r is not too small, say r > 5, and a number of Si are available, a powerful genera method of analysis (Bartlett & Kendall, 1946) under MN is to employ linear methods for log Si using the fact that under sampling a normal distribution var (log Si) = qf(2r-2), (9) where +fr(z) is the digamma function and the values for r = 2, 3, 4 are 4 93, 1-64, We shall not make extensive use of this in the present work, partly because of the severe loss of efficiency when r = 2, 3 and partly because of the undue sensitivity to sma values of Si and the failure of the method without ad hoc modification if Si = 0; su values could quite easily arise from rounding errors. As one example of its use, however, one could test for departure from MN in the direction of MOD or MNN by comparing with (9). (m - 1)-1{ (log Si)2 - (X log Si)2/ m} For MOD expansion of (3) or (4) for small 1/fo or small dispersion of h(x) shows that the locally most powerful test is based on the distribution of l S2 given L Si, or equivalently the marginal distribution of the dispersion index _{ S2 _ (X Si)2/ m}m-.1 (1 Si/ m)2{2/(r-1)} The divisor 2/ (r -1) ensures that, under MN, I -> 1 in probability as m -> cx. Note that detailed specification of h(x) is unnecessary for the local optimality property. For large m, I is asymptotically normal: because of the skewness of I, a rather better approximation is obtained by taking log I to be normal with mean - (r + 1)/{m(r - 1)} and variance 2(r+ 1)/{m(r- 1)} Detection of systematic changes We now consider the examination of possible systematic relations between variance and an explanatory variable or between variance and mean. Again graphical analysis will usually be a natural first step supplemented where appropriate by a test statistic which we take in the form T =I aisilz; Si. 11 where we can without loss of generality scale the explanatory variable so that l ai = 0, I ai = m. For instance if the sample means yi. are taken as the explan-atory variable ai=(yi.-y){ (yj.)2/m} 2 where g = Y./ m. There are two ways of obtaining a null hypothesis distribution of T The first is to note that under MN all permutations of {S1,...., Sm} are equally likely; hence, with S. = l Si/ m, E(T)=0O, var (T) = {z(si _S)2/ m}/3.2. (12)

7 548 D. R. COX AND P. J. SOLOMON Under weak assumptions, asymptotic normality will hold as n -> cc: for higher permutation moments, see Cox & Hinkley (1974, p. 185). An 'exact' permutation test is in principle possible. Under MN a more sensitive analysis is possible, by using the normal theory distribution of the ratio of quadratic forms rather than the permutation distribution. A reasonably simple approximate test is derived by writing pr (Ttx) = pr { (ai - x/m) Si }- [{2 - )( x2/)} Thus for a two-sided equi-tailed test at level 2a we need critical limits x* defined by where ((-k*) a. That is, {2(r- 1)(1 +x*2/m)}12 a, ~~ 2 (13) x* ={2(r - 1)1'k*f 1-2(r - 1) k*2/m}2 (13) For a more refined calculation it is possible to introduce a correction based on the standardized skewness and kurtosis of the random variable I (ai - x/lvm)si, nam l) (r- 1)/m I r+/ (m)} = (rii )rn{p4avrnmp3a+o(n)} where P3a = a3/m, P4a = a4/m-3. Numerical work in the special case r = 2 suggests that for the purpose of the present paper, where m is likely to be quite large, use of (13) is entirely adequate. Note that the simpler approximation (13) with the 1/rm term omitted is equivalent to the permutation test (12) with a normal theory value for var (T), as is clear on general grounds. 3. SAMPLES OF SIZE TWO We now consider in more detail what is probably the most common case in applications, namely r = 2, when a large number of duplicate observations are available. Provided that the numbering of observations within a pair is uninformative, we may replace the ith pair of observations (Yil, Yi2) by Yi. = 2( Yi1 + YiA), Si = 2( Yi-Yi2)2. An initial analysis for detecting systematic dependencies is to plot Si against Yi. or some other suitable explanatory variable zi. For a formal test we use? 2-4. If there is no systematic relation, attention is focused on the marginal distribution of the {Si}. Under the standard model MN, Sil/r2 has the chi-squared distribution with one degree of freedom. Probably the simplest graphical analysis is a seminormal plot of the ordered V/Sj; see Pearson & Hartley (1972, Table 21). Departure in the direction of underdispersion would indicate MNN whereas overdispersion could indicate either MOD or MNN.

8 Variability with large numbers of small samples 549 Under MNN, it is clear that even with a very large value of m only even-order cumulants of the distribution of the y's can be determined. Thus there is no way of studying the skewness of y, at least so long as there is no relevant information in the variation between the {yi.} To supplement the plot of Si versus yi. the index of dispersion can be calculated, and by (5) and (7) regarded as estimating 1 +2(A2/J-4) under MOD, and 1 +4p4 under MNN, 4. SAMPLES OF SIZE THREE 4 1. General discussion When we have samples of size three rather than samples of size two, there is appreciable extra sensitivity in having estimates of variance with two degrees of freedom rather than one, but in some ways the more interesting difference lies in the richer possibilities for examining distributional shape and in particular of detecting skewness under the nonnormal model, MNN. Thus an initial analysis will often consist of: (i) a plot of Si versus Yi. or a similar explanatory variable zi, supplemented where necessary by the test statistic (11); (ii) a plot of the ordered Si versus the expected order statistics from an exponential distribution, supplemented where necessary by the test statistic (10). Note that epm, the expected value of the pth smallest observation in samples of size m from the unit exponential distribution, is 1 1 log{(m+2)/(m-p+2)} (ptm), pmrmn * * * m-p+1 log (m+)+y (p= m), where -y is Euler's constant. We now concentrate on the more detailed analysis of the variation and this is most naturally done in terms of the order statistics Order statistics in samples of three It is convenient to drop temporarily the suffix indicating the particular sample and to write Y(1) < Y(2) Y(3) for the order statistics. The central results were given by Fisher (1930) in connection with a study of the exact distribution of sample estimates of skewne in which induction on sample size was used, samples of three being the 'starting point'. If a2 = ]S is the usual estimate of variance, the one 'degree of freedom' that describes the skewness of the sample can be taken in dimensionless form as W = (Y(3)-2Y(2) + Y(1))/J Under the standard normal model, MN, and also under MOD, (w, S) are independent, so that clear dependence between (w, S) is an indication that MNN should be considered. To obtain the joint distribution of (5, w) under MN and MNN it is convenient, following Fisher (1930), to write Y(3) = Y(2) + 0 ( Cos COS v t ext/3rem +1 si v Y=Y sin v, - Y(1) Cos = v Y(2) +13 c - p sin to ss v, wt +2 =3 sin or

9 550 D. R. Cox AND P. J. SOLOMON Y(1)* Note that w = 213 sin v and that t - sin1 takes values in (-1, 1). Now the joint density of {Y(1), Y(2), y(3)} is 6g(y(l))g(y(2))g(y(3)) and, on transforming to new variables (ai, Y2, v), the density of (5, v) is _ U f0 g(x - -Cos v + N3a5 sin v)g(x)g(x +J-Cos v +ln35-sin v) dx. (14) It follows immediately that for MN the statistics 5- and v are indeed independent and that the marginal density of v is uniform; i.e. that of t is uniform on (-1, 1). Incidentally the usual unbiased estimate of the third cumulant 2 =(y3- y)3=35 sin (3 v), showing explicitly that consideration of w or v is essentially equivalent to that of the standardized third cumulant ratio. Note that the individually standardized third cumulant lies in (-13, /3). There are the following open questions connected with (14). (i) Does knowledge of the joint distribution of (5, v) determine g(x) uniquely, except for a translation? (ii) Does independence of (r and v imply that g(x) is normal? (iii) Is there a simple necessary and sufficient condition for a given distribution of (O, v) to be representable in the form (14)? (iv) What statistic or statistics are theoretically most sensitive for detecting departures from MN? Note that under MOD, ( and v are independent, v is uniform and 5 or S has the overdispersed distribution discussed in? 2-2. Also if it is required to estimate skewness under the model MNN averaging of individually standardized estimates is inappropriate, in particular in the light of the constraint mentioned above: a consistent estimate of K3/1 ( is A reasonable analysis in practice is: 2m -Yi)3/(m )3/2. (a) the examination of the marginal distribution of the Si via the probability plot and test statistic mentioned in? 4 1; (b) inspection of a scatter plot of ti versus oi, clear dependence showing evidence against MN and MOD; (c) inspection of the marginal distribution of ti supplemented by calculation of the mean of ti as a statistic nonzero values of which would indicate skewness in the model MNN- 5. SAMPLES OF SIZE MORE THAN THREE We discuss only briefly corresponding procedures for samples of size four and more. The natural extension of the statitic w of? 4 is provided via linear functions of order statistics with a simple interpretation and zero expectation under normality. Thus for samples of size four we start with measures of skewness and kurtosis ( y(4 - y(3)) - ( Y(2) -Y(1)), (y j"4 - (3) + ( Y(2) -Y(1)) - k( y(3) -Y(2)),

10 Variability with large numbers of small samples 551 choosing k so that the second of these has zero expectation under MN: in fact k = This leads us to define W = {Y(4)-Y(3 -Y(2)+Y(1)}/, W"= {Y(4) y(3)+3f465Y(2)-Y(1)1'7. It is easily shown that, under MN and MOD, W' and wi are independent of 5J, var(w')=0-771, var (w")= They are also uncorrelated but far from independent and far from normally distributed. It is likely, therefore, that the most effective procedure is to compute w' and w" from each set of data and to examine the sample means and variances for consistency with the normal theory values. Departures would have a fairly clear diagnostic value. The sample estimate of cov (w', w") could also be calculated, although it is unclear what interpretation is to be put on a nonzero value. 6. SOME EXAMPLES Samples of size three. As an illustration, we analyse some pulse rates from the International Prospective Primary Prevention Study in Hypertension, a large scale clinical trial. Before entry and randomization to treatments, patients typically attended three qualifying visits at least one day apart. Pulse rate, the number of beats per minute, was one of several variables measured at each pre-entry visit. The data analysed here consist of three pulse rates, y, for a sample of one hundred men and one hundred women. The analysis goes in three broad steps. First, a plot, not given here, of the within-patient sum of squares Si versus the mean pulse rate yi. shows clear evidence that Si increases with Yi., thus indicating either MSY or, just conceivably, sampling correlation between Si and Yi. under MNN. Clearly MN is inappropriate. However, use of the reciprocal pulse rate removes the systematic relation between sum of squares and mean. The reciprocal pulse rate has a direct interpretation as the mean time between beats. The changes induced by reciprocal transformation are supported by the statistic T; for women, T = 0897 on the original scale, but this changes to for reciprocals; correspondingly for men, T changes from to 0094 under the reciprocal transformation. The second step is a probability plot of the ordered Si versus the expected exponential order statistics. Both the pulse rates and reciprocal pulse rates show overdispersion, but less for reciprocals; see Fig. 1. The very large values of Si correspond to women with large variation in pulse rate which variation, however, may be clinically meaningful. Therefore it is on the whole sensible to regard such values as part of the population under study rather than as aberrant values to be rejected. Supplementing the plot by the index of overdispersion I of (10) verifies the improved distributional properties of the reciprocals. Under MNN, from (7), I leads to an estimate of the dimensionless fourth cumulant, namely and for pulse rates for women and men, and on the reciprocal scale, and 4-602, respectively. The third step in the analysis plots Si versus the angular measure of skewness ti. We found it useful to examine for comparison simulated data from other distributions such as the normal, exponential and Student's 1 distributions. Normal data produced a random scatter as expected whereas exponential data ga-ve a plot which was clearly positively skew.

11 552 D. R. COX AND P. J. SOLOMON Si ~~~~~~~~ e 5 Fig. 1. Entry pulse rate for 100 women. Triples. Crosses, Si versus exponential score, ei; circles, losi versus exponential score, ei, after reciprocal transformation. (a) 1000 Si O ~ ~~~~ - Si 600- o 0 (23> 800- (b) (3 0 ~~~~~~~~~~~~ ~~~0 0 O ~~~~200 e O~~~~~~~~~~~~~~ t ~~ O ~ ~ CD? _ 0 _ 0-1*0-0O5 0 O 5 PG ti 1.0 Fig. 2. Entry pulse rate for 100 women. Triples. (a) Si versus angular statistic, ti. (b) losi versus angular statistic, ti, after reciprocal transformation _ ]-st ~~~xxxx < XA 0 I I Seminormal expected order statistic Fig. 3. Entry pulse rate for 100 women. Pairs. Crosses, seminormal plot of ordered V/Si; circles, seminormal plot of ordered V/S, after reciprocal transformation.

12 Variability with large numbers of small samples 553 Figure 2(a) shows that Si and ti for pulse rates are not independent. Reciprocals improve the symmetry; see Fig. 2(b). The skewness measure ti is characterized by concentrations of points at -1, 1 and 0, the first two values a consequence of grouping; its distribution is certainly nonuniform, both on the original and reciprocal scales, so that MOD is not appropriate. The women exhibit slightly negatively skew pulse rates, giving mean ti and standard deviation On the reciprocal scale, ti has mean and standard deviation 0 705: the mean is exaggerated by one very large Si. By contrast, the men exhibit slight positive skewness in pulse rate with mean ti and standard deviation 0 739; for reciprocals, mean ti is with standard deviation The assumption of symmetry in the distribution of the reciprocals is further supported by the usual standardized measure of skewness which takes values of and for women and for men. Exactly unit values of ti are accounted for by rounding. Simulations suggest, however, that rounding cannot entirely explain the anomalous distribution of ti. The grouping could have been aggravated by some temporal correlation. In particular, when two out of three rates agree, do they occur together as consecutive values? Pooling men and women, of 81 individuals with two rates the same, 19 had a different rate at the second visit, whereas 34 had a different rate at the first, and 28 at the third visit. This tends to suggest that some doctors may recall the previous pulse rate at the subsequent visit. In summary, the reciprocal pulse rates appear to have a symmetric, nonnormal, long-tailed distribution, suggesting that Student's t distribution with a small number of degrees of freedom may be appropriate. This provides a suitably flexible family of symmetrical distributions if parametric representation is required. Samples of size two. Partly in order to compare results, we have analysed a series of pairs of observations formed by taking just the first and last pulse rate from the data discussed above. Reciprocals certainly reduce the size of the kurtosis, being typically positive. However, reciprocals for women exhibit slight negative kurtosis as shown in Fig. 3, although testing with I, which under MNN leads to an estimate of p4 as for reciprocals, this is by no means statistically significant. Also, the other values of I vary little. In particular, for women, pulse rates give p as 3 335, and for men 6 054, which changes to for reciprocals. Samples of size four. Data on strengths of yarn (Cox & Snell, 1981, p. 131) in the form of 12 samples each of size four show excellent agreement with the normal theory values for w' and w". Namely, the observed mean w' for bobbins is -0,0178 with observed standard deviation 0-710, and for w", the observed mean is -0A466 with standard dev 1 561; the theoretical standard deviations are respectively and and the standard errors of the means thus and 0;461. ACKNOWLEDGEMENTS We are grateful to the Science and Engineering Research Council and to Ciba-Geigy Ltd (Basle) for support of this work. REFERENCES BARTLETT, M. S. & KENDALL, D. G. (1946). The statistical analysis of variance-heterogeneity and the logarithmic transformation. SuppL J. R. Statist. Soc. 8,

13 554 D. R. COX AND P. J. SOLOMON COOK, R. D. & WEISBERG, S. (1983). Diagnostics for heteroscedasticity in regression. Biometrika 70, Cox, D. R. & HINKLEY, D. V. (1974). Theoretical Statistics. London: Chapman & Hall. Cox, D. R. & SNELL, E. J. (1981). Applied Stajistics. London: Chapman & Hall. FISHER, R. A. (1930). The moments of the distribution for normal samples of measures of departure from normality. Proc. R. Soc. A 130, KENDALL, M. G. & STUART A. (1969). Advanced Theory of Statistics, 1, 3rd ed. High Wycombe: Griffin. PEARSON, E. S. & HARTLEY, H. 0. (1972). Biometrika Tables for Statisticians, 2. High Wycombe: Griffin. [Received December Revised April 1986]

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at A Note on the Efficiency of Least-Squares Estimates Author(s): D. R. Cox and D. V. Hinkley Source: Journal of the Royal Statistical Society. Series B (Methodological), Vol. 30, No. 2 (1968), pp. 284-289

More information

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at American Society for Quality A Note on the Graphical Analysis of Multidimensional Contingency Tables Author(s): D. R. Cox and Elizabeth Lauh Source: Technometrics, Vol. 9, No. 3 (Aug., 1967), pp. 481-488

More information

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at Biometrika Trust Robust Regression via Discriminant Analysis Author(s): A. C. Atkinson and D. R. Cox Source: Biometrika, Vol. 64, No. 1 (Apr., 1977), pp. 15-19 Published by: Oxford University Press on

More information

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at Biometrika Trust Some Remarks on Overdispersion Author(s): D. R. Cox Source: Biometrika, Vol. 70, No. 1 (Apr., 1983), pp. 269-274 Published by: Oxford University Press on behalf of Biometrika Trust Stable

More information

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at Some Applications of Exponential Ordered Scores Author(s): D. R. Cox Source: Journal of the Royal Statistical Society. Series B (Methodological), Vol. 26, No. 1 (1964), pp. 103-110 Published by: Wiley

More information

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at Biometrika Trust Some Simple Approximate Tests for Poisson Variates Author(s): D. R. Cox Source: Biometrika, Vol. 40, No. 3/4 (Dec., 1953), pp. 354-360 Published by: Oxford University Press on behalf of

More information

Biometrika Trust. Biometrika Trust is collaborating with JSTOR to digitize, preserve and extend access to Biometrika.

Biometrika Trust. Biometrika Trust is collaborating with JSTOR to digitize, preserve and extend access to Biometrika. Biometrika Trust An Improved Bonferroni Procedure for Multiple Tests of Significance Author(s): R. J. Simes Source: Biometrika, Vol. 73, No. 3 (Dec., 1986), pp. 751-754 Published by: Biometrika Trust Stable

More information

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at On the Estimation of the Intensity Function of a Stationary Point Process Author(s): D. R. Cox Source: Journal of the Royal Statistical Society. Series B (Methodological), Vol. 27, No. 2 (1965), pp. 332-337

More information

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at Regression Analysis when there is Prior Information about Supplementary Variables Author(s): D. R. Cox Source: Journal of the Royal Statistical Society. Series B (Methodological), Vol. 22, No. 1 (1960),

More information

International Biometric Society is collaborating with JSTOR to digitize, preserve and extend access to Biometrics.

International Biometric Society is collaborating with JSTOR to digitize, preserve and extend access to Biometrics. 400: A Method for Combining Non-Independent, One-Sided Tests of Significance Author(s): Morton B. Brown Reviewed work(s): Source: Biometrics, Vol. 31, No. 4 (Dec., 1975), pp. 987-992 Published by: International

More information

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at The Analysis of Multivariate Binary Data Author(s): D. R. Cox Source: Journal of the Royal Statistical Society. Series C (Applied Statistics), Vol. 21, No. 2 (1972), pp. 113-120 Published by: Wiley for

More information

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at Biometrika Trust Nonlinear Component of Variance Models Author(s): P. J. Solomon and D. R. Cox Source: Biometrika, Vol. 79, No. 1 (Mar., 1992), pp. 1-11 Published by: Oxford University Press on behalf

More information

Application of Parametric Homogeneity of Variances Tests under Violation of Classical Assumption

Application of Parametric Homogeneity of Variances Tests under Violation of Classical Assumption Application of Parametric Homogeneity of Variances Tests under Violation of Classical Assumption Alisa A. Gorbunova and Boris Yu. Lemeshko Novosibirsk State Technical University Department of Applied Mathematics,

More information

Biometrika Trust. Biometrika Trust is collaborating with JSTOR to digitize, preserve and extend access to Biometrika.

Biometrika Trust. Biometrika Trust is collaborating with JSTOR to digitize, preserve and extend access to Biometrika. Biometrika Trust Discrete Sequential Boundaries for Clinical Trials Author(s): K. K. Gordon Lan and David L. DeMets Reviewed work(s): Source: Biometrika, Vol. 70, No. 3 (Dec., 1983), pp. 659-663 Published

More information

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at A Look at Some Data on the Old Faithful Geyser Author(s): A. Azzalini and A. W. Bowman Source: Journal of the Royal Statistical Society. Series C (Applied Statistics), Vol. 39, No. 3 (1990), pp. 357-365

More information

Application of Variance Homogeneity Tests Under Violation of Normality Assumption

Application of Variance Homogeneity Tests Under Violation of Normality Assumption Application of Variance Homogeneity Tests Under Violation of Normality Assumption Alisa A. Gorbunova, Boris Yu. Lemeshko Novosibirsk State Technical University Novosibirsk, Russia e-mail: gorbunova.alisa@gmail.com

More information

Hypothesis Testing for Var-Cov Components

Hypothesis Testing for Var-Cov Components Hypothesis Testing for Var-Cov Components When the specification of coefficients as fixed, random or non-randomly varying is considered, a null hypothesis of the form is considered, where Additional output

More information

The Periodogram and its Optical Analogy.

The Periodogram and its Optical Analogy. The Periodogram and Its Optical Analogy Author(s): Arthur Schuster Reviewed work(s): Source: Proceedings of the Royal Society of London. Series A, Containing Papers of a Mathematical and Physical Character,

More information

Math 423/533: The Main Theoretical Topics

Math 423/533: The Main Theoretical Topics Math 423/533: The Main Theoretical Topics Notation sample size n, data index i number of predictors, p (p = 2 for simple linear regression) y i : response for individual i x i = (x i1,..., x ip ) (1 p)

More information

FAILURE-TIME WITH DELAYED ONSET

FAILURE-TIME WITH DELAYED ONSET REVSTAT Statistical Journal Volume 13 Number 3 November 2015 227 231 FAILURE-TIME WITH DELAYED ONSET Authors: Man Yu Wong Department of Mathematics Hong Kong University of Science and Technology Hong Kong

More information

Testing the homogeneity of variances in a two-way classification

Testing the homogeneity of variances in a two-way classification Biomelrika (1982), 69, 2, pp. 411-6 411 Printed in Ortal Britain Testing the homogeneity of variances in a two-way classification BY G. K. SHUKLA Department of Mathematics, Indian Institute of Technology,

More information

A nonparametric two-sample wald test of equality of variances

A nonparametric two-sample wald test of equality of variances University of Wollongong Research Online Faculty of Informatics - Papers (Archive) Faculty of Engineering and Information Sciences 211 A nonparametric two-sample wald test of equality of variances David

More information

Comparison of Two Samples

Comparison of Two Samples 2 Comparison of Two Samples 2.1 Introduction Problems of comparing two samples arise frequently in medicine, sociology, agriculture, engineering, and marketing. The data may have been generated by observation

More information

Research Article A Nonparametric Two-Sample Wald Test of Equality of Variances

Research Article A Nonparametric Two-Sample Wald Test of Equality of Variances Advances in Decision Sciences Volume 211, Article ID 74858, 8 pages doi:1.1155/211/74858 Research Article A Nonparametric Two-Sample Wald Test of Equality of Variances David Allingham 1 andj.c.w.rayner

More information

Calculus first semester exam information and practice problems

Calculus first semester exam information and practice problems Calculus first semester exam information and practice problems As I ve been promising for the past year, the first semester exam in this course encompasses all three semesters of Math SL thus far. It is

More information

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at Biometrika Trust Testing Multivariate Normality Author(s): D. R. Cox and N. J. H. Small Source: Biometrika, Vol. 65, No. 2 (Aug., 1978), pp. 263-272 Published by: Oxford University Press on behalf of Biometrika

More information

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at The Interpretation of Interaction in Contingency Tables Author(s): E. H. Simpson Source: Journal of the Royal Statistical Society. Series B (Methodological), Vol. 13, No. 2 (1951), pp. 238-241 Published

More information

A correlation coefficient for circular data

A correlation coefficient for circular data BiomelriL-a (1983). 70. 2, pp. 327-32 327 Prinltd in Great Britain A correlation coefficient for circular data BY N. I. FISHER CSIRO Division of Mathematics and Statistics, Lindfield, N.S.W., Australia

More information

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission. The Variance of the Product of Random Variables Author(s): Leo A. Goodman Source: Journal of the American Statistical Association, Vol. 57, No. 297 (Mar., 1962), pp. 54-60 Published by: American Statistical

More information

A Monte-Carlo study of asymptotically robust tests for correlation coefficients

A Monte-Carlo study of asymptotically robust tests for correlation coefficients Biometrika (1973), 6, 3, p. 661 551 Printed in Great Britain A Monte-Carlo study of asymptotically robust tests for correlation coefficients BY G. T. DUNCAN AND M. W. J. LAYAKD University of California,

More information

SEQUENTIAL TESTS FOR COMPOSITE HYPOTHESES

SEQUENTIAL TESTS FOR COMPOSITE HYPOTHESES [ 290 ] SEQUENTIAL TESTS FOR COMPOSITE HYPOTHESES BYD. R. COX Communicated by F. J. ANSCOMBE Beceived 14 August 1951 ABSTRACT. A method is given for obtaining sequential tests in the presence of nuisance

More information

Diagnostics can identify two possible areas of failure of assumptions when fitting linear models.

Diagnostics can identify two possible areas of failure of assumptions when fitting linear models. 1 Transformations 1.1 Introduction Diagnostics can identify two possible areas of failure of assumptions when fitting linear models. (i) lack of Normality (ii) heterogeneity of variances It is important

More information

On prediction and density estimation Peter McCullagh University of Chicago December 2004

On prediction and density estimation Peter McCullagh University of Chicago December 2004 On prediction and density estimation Peter McCullagh University of Chicago December 2004 Summary Having observed the initial segment of a random sequence, subsequent values may be predicted by calculating

More information

Topic 23: Diagnostics and Remedies

Topic 23: Diagnostics and Remedies Topic 23: Diagnostics and Remedies Outline Diagnostics residual checks ANOVA remedial measures Diagnostics Overview We will take the diagnostics and remedial measures that we learned for regression and

More information

a 11 x 1 + a 12 x a 1n x n = b 1 a 21 x 1 + a 22 x a 2n x n = b 2.

a 11 x 1 + a 12 x a 1n x n = b 1 a 21 x 1 + a 22 x a 2n x n = b 2. Chapter 1 LINEAR EQUATIONS 11 Introduction to linear equations A linear equation in n unknowns x 1, x,, x n is an equation of the form a 1 x 1 + a x + + a n x n = b, where a 1, a,, a n, b are given real

More information

STATISTICS SYLLABUS UNIT I

STATISTICS SYLLABUS UNIT I STATISTICS SYLLABUS UNIT I (Probability Theory) Definition Classical and axiomatic approaches.laws of total and compound probability, conditional probability, Bayes Theorem. Random variable and its distribution

More information

y response variable x 1, x 2,, x k -- a set of explanatory variables

y response variable x 1, x 2,, x k -- a set of explanatory variables 11. Multiple Regression and Correlation y response variable x 1, x 2,, x k -- a set of explanatory variables In this chapter, all variables are assumed to be quantitative. Chapters 12-14 show how to incorporate

More information

Chapter 8 Heteroskedasticity

Chapter 8 Heteroskedasticity Chapter 8 Walter R. Paczkowski Rutgers University Page 1 Chapter Contents 8.1 The Nature of 8. Detecting 8.3 -Consistent Standard Errors 8.4 Generalized Least Squares: Known Form of Variance 8.5 Generalized

More information

Biometrika Trust. Biometrika Trust is collaborating with JSTOR to digitize, preserve and extend access to Biometrika.

Biometrika Trust. Biometrika Trust is collaborating with JSTOR to digitize, preserve and extend access to Biometrika. Biometrika Trust A Stagewise Rejective Multiple Test Procedure Based on a Modified Bonferroni Test Author(s): G. Hommel Source: Biometrika, Vol. 75, No. 2 (Jun., 1988), pp. 383-386 Published by: Biometrika

More information

Testing Goodness Of Fit Of The Geometric Distribution: An Application To Human Fecundability Data

Testing Goodness Of Fit Of The Geometric Distribution: An Application To Human Fecundability Data Journal of Modern Applied Statistical Methods Volume 4 Issue Article 8 --5 Testing Goodness Of Fit Of The Geometric Distribution: An Application To Human Fecundability Data Sudhir R. Paul University of

More information

Kumaun University Nainital

Kumaun University Nainital Kumaun University Nainital Department of Statistics B. Sc. Semester system course structure: 1. The course work shall be divided into six semesters with three papers in each semester. 2. Each paper in

More information

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at Biometrika Trust The Use of a Concomitant Variable in Selecting an Experimental Design Author(s): D. R. Cox Source: Biometrika, Vol. 44, No. 1/2 (Jun., 1957), pp. 150-158 Published by: Oxford University

More information

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission. On the Bound for a Pair of Consecutive Quartic Residues of a Prime Author(s): R. G. Bierstedt and W. H. Mills Source: Proceedings of the American Mathematical Society, Vol. 14, No. 4 (Aug., 1963), pp.

More information

ELEMENTARY LINEAR ALGEBRA

ELEMENTARY LINEAR ALGEBRA ELEMENTARY LINEAR ALGEBRA K R MATTHEWS DEPARTMENT OF MATHEMATICS UNIVERSITY OF QUEENSLAND First Printing, 99 Chapter LINEAR EQUATIONS Introduction to linear equations A linear equation in n unknowns x,

More information

Conditional Inference by Estimation of a Marginal Distribution

Conditional Inference by Estimation of a Marginal Distribution Conditional Inference by Estimation of a Marginal Distribution Thomas J. DiCiccio and G. Alastair Young 1 Introduction Conditional inference has been, since the seminal work of Fisher (1934), a fundamental

More information

Likelihood and p-value functions in the composite likelihood context

Likelihood and p-value functions in the composite likelihood context Likelihood and p-value functions in the composite likelihood context D.A.S. Fraser and N. Reid Department of Statistical Sciences University of Toronto November 19, 2016 Abstract The need for combining

More information

You can compute the maximum likelihood estimate for the correlation

You can compute the maximum likelihood estimate for the correlation Stat 50 Solutions Comments on Assignment Spring 005. (a) _ 37.6 X = 6.5 5.8 97.84 Σ = 9.70 4.9 9.70 75.05 7.80 4.9 7.80 4.96 (b) 08.7 0 S = Σ = 03 9 6.58 03 305.6 30.89 6.58 30.89 5.5 (c) You can compute

More information

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at A Quincuncial Projection of the Sphere Author(s): C. S. Peirce Source: American Journal of Mathematics, Vol. 2, No. 4 (Dec., 1879), pp. 394-396 Published by: The Johns Hopkins University Press Stable URL:

More information

TESTING FOR NORMALITY IN THE LINEAR REGRESSION MODEL: AN EMPIRICAL LIKELIHOOD RATIO TEST

TESTING FOR NORMALITY IN THE LINEAR REGRESSION MODEL: AN EMPIRICAL LIKELIHOOD RATIO TEST Econometrics Working Paper EWP0402 ISSN 1485-6441 Department of Economics TESTING FOR NORMALITY IN THE LINEAR REGRESSION MODEL: AN EMPIRICAL LIKELIHOOD RATIO TEST Lauren Bin Dong & David E. A. Giles Department

More information

Curriculum Map for Mathematics SL (DP1)

Curriculum Map for Mathematics SL (DP1) Unit Title (Time frame) Topic 1 Algebra (8 teaching hours or 2 weeks) Curriculum Map for Mathematics SL (DP1) Standards IB Objectives Knowledge/Content Skills Assessments Key resources Aero_Std_1: Make

More information

Group Sequential Designs: Theory, Computation and Optimisation

Group Sequential Designs: Theory, Computation and Optimisation Group Sequential Designs: Theory, Computation and Optimisation Christopher Jennison Department of Mathematical Sciences, University of Bath, UK http://people.bath.ac.uk/mascj 8th International Conference

More information

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission. On Runs of Residues Author(s): D. H. Lehmer and Emma Lehmer Source: Proceedings of the American Mathematical Society, Vol. 13, No. 1 (Feb., 1962), pp. 102-106 Published by: American Mathematical Society

More information

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages:

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages: Glossary The ISI glossary of statistical terms provides definitions in a number of different languages: http://isi.cbs.nl/glossary/index.htm Adjusted r 2 Adjusted R squared measures the proportion of the

More information

The Model Building Process Part I: Checking Model Assumptions Best Practice

The Model Building Process Part I: Checking Model Assumptions Best Practice The Model Building Process Part I: Checking Model Assumptions Best Practice Authored by: Sarah Burke, PhD 31 July 2017 The goal of the STAT T&E COE is to assist in developing rigorous, defensible test

More information

UNIFORMLY MOST POWERFUL CYCLIC PERMUTATION INVARIANT DETECTION FOR DISCRETE-TIME SIGNALS

UNIFORMLY MOST POWERFUL CYCLIC PERMUTATION INVARIANT DETECTION FOR DISCRETE-TIME SIGNALS UNIFORMLY MOST POWERFUL CYCLIC PERMUTATION INVARIANT DETECTION FOR DISCRETE-TIME SIGNALS F. C. Nicolls and G. de Jager Department of Electrical Engineering, University of Cape Town Rondebosch 77, South

More information

Table of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z).

Table of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z). Table of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z). For example P(X.04) =.8508. For z < 0 subtract the value from,

More information

STAT Section 5.8: Block Designs

STAT Section 5.8: Block Designs STAT 518 --- Section 5.8: Block Designs Recall that in paired-data studies, we match up pairs of subjects so that the two subjects in a pair are alike in some sense. Then we randomly assign, say, treatment

More information

Survival Analysis Math 434 Fall 2011

Survival Analysis Math 434 Fall 2011 Survival Analysis Math 434 Fall 2011 Part IV: Chap. 8,9.2,9.3,11: Semiparametric Proportional Hazards Regression Jimin Ding Math Dept. www.math.wustl.edu/ jmding/math434/fall09/index.html Basic Model Setup

More information

Estimation of Effect Size From a Series of Experiments Involving Paired Comparisons

Estimation of Effect Size From a Series of Experiments Involving Paired Comparisons Journal of Educational Statistics Fall 1993, Vol 18, No. 3, pp. 271-279 Estimation of Effect Size From a Series of Experiments Involving Paired Comparisons Robert D. Gibbons Donald R. Hedeker John M. Davis

More information

The Model Building Process Part I: Checking Model Assumptions Best Practice (Version 1.1)

The Model Building Process Part I: Checking Model Assumptions Best Practice (Version 1.1) The Model Building Process Part I: Checking Model Assumptions Best Practice (Version 1.1) Authored by: Sarah Burke, PhD Version 1: 31 July 2017 Version 1.1: 24 October 2017 The goal of the STAT T&E COE

More information

Repeated ordinal measurements: a generalised estimating equation approach

Repeated ordinal measurements: a generalised estimating equation approach Repeated ordinal measurements: a generalised estimating equation approach David Clayton MRC Biostatistics Unit 5, Shaftesbury Road Cambridge CB2 2BW April 7, 1992 Abstract Cumulative logit and related

More information

Sample size calculations for logistic and Poisson regression models

Sample size calculations for logistic and Poisson regression models Biometrika (2), 88, 4, pp. 93 99 2 Biometrika Trust Printed in Great Britain Sample size calculations for logistic and Poisson regression models BY GWOWEN SHIEH Department of Management Science, National

More information

Subject CS1 Actuarial Statistics 1 Core Principles

Subject CS1 Actuarial Statistics 1 Core Principles Institute of Actuaries of India Subject CS1 Actuarial Statistics 1 Core Principles For 2019 Examinations Aim The aim of the Actuarial Statistics 1 subject is to provide a grounding in mathematical and

More information

CHOOSING AMONG GENERALIZED LINEAR MODELS APPLIED TO MEDICAL DATA

CHOOSING AMONG GENERALIZED LINEAR MODELS APPLIED TO MEDICAL DATA STATISTICS IN MEDICINE, VOL. 17, 59 68 (1998) CHOOSING AMONG GENERALIZED LINEAR MODELS APPLIED TO MEDICAL DATA J. K. LINDSEY AND B. JONES* Department of Medical Statistics, School of Computing Sciences,

More information

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at Biometrika Trust A note on the sensitivity to assumptions of a generalized linear mixed model Author(s): D. R. COX and M. Y. WONG Source: Biometrika, Vol. 97, No. 1 (MARCH 2010), pp. 209-214 Published

More information

Mind Association. Oxford University Press and Mind Association are collaborating with JSTOR to digitize, preserve and extend access to Mind.

Mind Association. Oxford University Press and Mind Association are collaborating with JSTOR to digitize, preserve and extend access to Mind. Mind Association Response to Colyvan Author(s): Joseph Melia Source: Mind, New Series, Vol. 111, No. 441 (Jan., 2002), pp. 75-79 Published by: Oxford University Press on behalf of the Mind Association

More information

Conditional confidence interval procedures for the location and scale parameters of the Cauchy and logistic distributions

Conditional confidence interval procedures for the location and scale parameters of the Cauchy and logistic distributions Biometrika (92), 9, 2, p. Printed in Great Britain Conditional confidence interval procedures for the location and scale parameters of the Cauchy and logistic distributions BY J. F. LAWLESS* University

More information

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at Queues with Time-Dependent Arrival Rates: II. The Maximum Queue and the Return to Equilibrium Author(s): G. F. Newell Source: Journal of Applied Probability, Vol. 5, No. 3 (Dec., 1968), pp. 579-590 Published

More information

Chapter 1 Statistical Inference

Chapter 1 Statistical Inference Chapter 1 Statistical Inference causal inference To infer causality, you need a randomized experiment (or a huge observational study and lots of outside information). inference to populations Generalizations

More information

ELEMENTARY LINEAR ALGEBRA

ELEMENTARY LINEAR ALGEBRA ELEMENTARY LINEAR ALGEBRA K. R. MATTHEWS DEPARTMENT OF MATHEMATICS UNIVERSITY OF QUEENSLAND Corrected Version, 7th April 013 Comments to the author at keithmatt@gmail.com Chapter 1 LINEAR EQUATIONS 1.1

More information

Review. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 770: Categorical Data Analysis

Review. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 770: Categorical Data Analysis Review Timothy Hanson Department of Statistics, University of South Carolina Stat 770: Categorical Data Analysis 1 / 22 Chapter 1: background Nominal, ordinal, interval data. Distributions: Poisson, binomial,

More information

Exercises Chapter 4 Statistical Hypothesis Testing

Exercises Chapter 4 Statistical Hypothesis Testing Exercises Chapter 4 Statistical Hypothesis Testing Advanced Econometrics - HEC Lausanne Christophe Hurlin University of Orléans December 5, 013 Christophe Hurlin (University of Orléans) Advanced Econometrics

More information

1 One-way Analysis of Variance

1 One-way Analysis of Variance 1 One-way Analysis of Variance Suppose that a random sample of q individuals receives treatment T i, i = 1,,... p. Let Y ij be the response from the jth individual to be treated with the ith treatment

More information

FULL LIKELIHOOD INFERENCES IN THE COX MODEL

FULL LIKELIHOOD INFERENCES IN THE COX MODEL October 20, 2007 FULL LIKELIHOOD INFERENCES IN THE COX MODEL BY JIAN-JIAN REN 1 AND MAI ZHOU 2 University of Central Florida and University of Kentucky Abstract We use the empirical likelihood approach

More information

1/24/2008. Review of Statistical Inference. C.1 A Sample of Data. C.2 An Econometric Model. C.4 Estimating the Population Variance and Other Moments

1/24/2008. Review of Statistical Inference. C.1 A Sample of Data. C.2 An Econometric Model. C.4 Estimating the Population Variance and Other Moments /4/008 Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University C. A Sample of Data C. An Econometric Model C.3 Estimating the Mean of a Population C.4 Estimating the Population

More information

A test for improved forecasting performance at higher lead times

A test for improved forecasting performance at higher lead times A test for improved forecasting performance at higher lead times John Haywood and Granville Tunnicliffe Wilson September 3 Abstract Tiao and Xu (1993) proposed a test of whether a time series model, estimated

More information

Analysing data: regression and correlation S6 and S7

Analysing data: regression and correlation S6 and S7 Basic medical statistics for clinical and experimental research Analysing data: regression and correlation S6 and S7 K. Jozwiak k.jozwiak@nki.nl 2 / 49 Correlation So far we have looked at the association

More information

STAT331. Cox s Proportional Hazards Model

STAT331. Cox s Proportional Hazards Model STAT331 Cox s Proportional Hazards Model In this unit we introduce Cox s proportional hazards (Cox s PH) model, give a heuristic development of the partial likelihood function, and discuss adaptations

More information

Introduction to General and Generalized Linear Models

Introduction to General and Generalized Linear Models Introduction to General and Generalized Linear Models Generalized Linear Models - part II Henrik Madsen Poul Thyregod Informatics and Mathematical Modelling Technical University of Denmark DK-2800 Kgs.

More information

Modeling Overdispersion

Modeling Overdispersion James H. Steiger Department of Psychology and Human Development Vanderbilt University Regression Modeling, 2009 1 Introduction 2 Introduction In this lecture we discuss the problem of overdispersion in

More information

Institute of Actuaries of India

Institute of Actuaries of India Institute of Actuaries of India Subject CT3 Probability & Mathematical Statistics May 2011 Examinations INDICATIVE SOLUTION Introduction The indicative solution has been written by the Examiners with the

More information

Practice Problems Section Problems

Practice Problems Section Problems Practice Problems Section 4-4-3 4-4 4-5 4-6 4-7 4-8 4-10 Supplemental Problems 4-1 to 4-9 4-13, 14, 15, 17, 19, 0 4-3, 34, 36, 38 4-47, 49, 5, 54, 55 4-59, 60, 63 4-66, 68, 69, 70, 74 4-79, 81, 84 4-85,

More information

Today. HW 1: due February 4, pm. Aspects of Design CD Chapter 2. Continue with Chapter 2 of ELM. In the News:

Today. HW 1: due February 4, pm. Aspects of Design CD Chapter 2. Continue with Chapter 2 of ELM. In the News: Today HW 1: due February 4, 11.59 pm. Aspects of Design CD Chapter 2 Continue with Chapter 2 of ELM In the News: STA 2201: Applied Statistics II January 14, 2015 1/35 Recap: data on proportions data: y

More information

9 Correlation and Regression

9 Correlation and Regression 9 Correlation and Regression SW, Chapter 12. Suppose we select n = 10 persons from the population of college seniors who plan to take the MCAT exam. Each takes the test, is coached, and then retakes the

More information

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission. Measuring the Speed and Altitude of an Aircraft Using Similar Triangles Author(s): Hassan Sedaghat Source: SIAM Review, Vol. 33, No. 4 (Dec., 1991), pp. 650-654 Published by: Society for Industrial and

More information

Chapter 6. Order Statistics and Quantiles. 6.1 Extreme Order Statistics

Chapter 6. Order Statistics and Quantiles. 6.1 Extreme Order Statistics Chapter 6 Order Statistics and Quantiles 61 Extreme Order Statistics Suppose we have a finite sample X 1,, X n Conditional on this sample, we define the values X 1),, X n) to be a permutation of X 1,,

More information

The Econometric Society is collaborating with JSTOR to digitize, preserve and extend access to Econometrica.

The Econometric Society is collaborating with JSTOR to digitize, preserve and extend access to Econometrica. A Set of Independent Necessary and Sufficient Conditions for Simple Majority Decision Author(s): Kenneth O. May Source: Econometrica, Vol. 20, No. 4 (Oct., 1952), pp. 680-684 Published by: The Econometric

More information

An Analysis of College Algebra Exam Scores December 14, James D Jones Math Section 01

An Analysis of College Algebra Exam Scores December 14, James D Jones Math Section 01 An Analysis of College Algebra Exam s December, 000 James D Jones Math - Section 0 An Analysis of College Algebra Exam s Introduction Students often complain about a test being too difficult. Are there

More information

Introduction to Statistical Analysis

Introduction to Statistical Analysis Introduction to Statistical Analysis Changyu Shen Richard A. and Susan F. Smith Center for Outcomes Research in Cardiology Beth Israel Deaconess Medical Center Harvard Medical School Objectives Descriptive

More information

6 Single Sample Methods for a Location Parameter

6 Single Sample Methods for a Location Parameter 6 Single Sample Methods for a Location Parameter If there are serious departures from parametric test assumptions (e.g., normality or symmetry), nonparametric tests on a measure of central tendency (usually

More information

B.N.Bandodkar College of Science, Thane. Random-Number Generation. Mrs M.J.Gholba

B.N.Bandodkar College of Science, Thane. Random-Number Generation. Mrs M.J.Gholba B.N.Bandodkar College of Science, Thane Random-Number Generation Mrs M.J.Gholba Properties of Random Numbers A sequence of random numbers, R, R,., must have two important statistical properties, uniformity

More information

Adjusting the Tests for Skewness and Kurtosis for Distributional Misspecifications. Abstract

Adjusting the Tests for Skewness and Kurtosis for Distributional Misspecifications. Abstract Adjusting the Tests for Skewness and Kurtosis for Distributional Misspecifications Anil K. Bera University of Illinois at Urbana Champaign Gamini Premaratne National University of Singapore Abstract The

More information

Group Sequential Tests for Delayed Responses. Christopher Jennison. Lisa Hampson. Workshop on Special Topics on Sequential Methodology

Group Sequential Tests for Delayed Responses. Christopher Jennison. Lisa Hampson. Workshop on Special Topics on Sequential Methodology Group Sequential Tests for Delayed Responses Christopher Jennison Department of Mathematical Sciences, University of Bath, UK http://people.bath.ac.uk/mascj Lisa Hampson Department of Mathematics and Statistics,

More information

Analysis of 2x2 Cross-Over Designs using T-Tests

Analysis of 2x2 Cross-Over Designs using T-Tests Chapter 234 Analysis of 2x2 Cross-Over Designs using T-Tests Introduction This procedure analyzes data from a two-treatment, two-period (2x2) cross-over design. The response is assumed to be a continuous

More information

K. Model Diagnostics. residuals ˆɛ ij = Y ij ˆµ i N = Y ij Ȳ i semi-studentized residuals ω ij = ˆɛ ij. studentized deleted residuals ɛ ij =

K. Model Diagnostics. residuals ˆɛ ij = Y ij ˆµ i N = Y ij Ȳ i semi-studentized residuals ω ij = ˆɛ ij. studentized deleted residuals ɛ ij = K. Model Diagnostics We ve already seen how to check model assumptions prior to fitting a one-way ANOVA. Diagnostics carried out after model fitting by using residuals are more informative for assessing

More information

Economics 520. Lecture Note 19: Hypothesis Testing via the Neyman-Pearson Lemma CB 8.1,

Economics 520. Lecture Note 19: Hypothesis Testing via the Neyman-Pearson Lemma CB 8.1, Economics 520 Lecture Note 9: Hypothesis Testing via the Neyman-Pearson Lemma CB 8., 8.3.-8.3.3 Uniformly Most Powerful Tests and the Neyman-Pearson Lemma Let s return to the hypothesis testing problem

More information

GARCH Models Estimation and Inference

GARCH Models Estimation and Inference GARCH Models Estimation and Inference Eduardo Rossi University of Pavia December 013 Rossi GARCH Financial Econometrics - 013 1 / 1 Likelihood function The procedure most often used in estimating θ 0 in

More information

Mathematical Association of America is collaborating with JSTOR to digitize, preserve and extend access to The American Mathematical Monthly.

Mathematical Association of America is collaborating with JSTOR to digitize, preserve and extend access to The American Mathematical Monthly. Recounting the Rationals Author(s): Neil Calkin and Herbert S. Wilf Source: The American Mathematical Monthly, Vol. 107, No. 4 (Apr., 2000), pp. 360-363 Published by: Mathematical Association of America

More information

Analysis of variance, multivariate (MANOVA)

Analysis of variance, multivariate (MANOVA) Analysis of variance, multivariate (MANOVA) Abstract: A designed experiment is set up in which the system studied is under the control of an investigator. The individuals, the treatments, the variables

More information

Limited Dependent Variables and Panel Data

Limited Dependent Variables and Panel Data and Panel Data June 24 th, 2009 Structure 1 2 Many economic questions involve the explanation of binary variables, e.g.: explaining the participation of women in the labor market explaining retirement

More information