Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at
|
|
- Sharlene Montgomery
- 5 years ago
- Views:
Transcription
1 Biometrika Trust Analysis of Variability with Large Numbers of Small Samples Author(s): D. R. Cox and P. J. Solomon Source: Biometrika, Vol. 73, No. 3 (Dec., 1986), pp Published by: Oxford University Press on behalf of Biometrika Trust Stable URL: Accessed: :35 UTC REFERENCES Linked references are available on JSTOR for this article: You may need to log in to JSTOR to access the linked references. JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide range of content in a trusted digital archive. We use information technology and tools to increase productivity and facilitate new forms of scholarship. For more information about JSTOR, please contact support@jstor.org. Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at Biometrika Trust, Oxford University Press are collaborating with JSTOR to digitize, preserve and extend access to Biometrika
2 Biometrika (1986), 73, 3, pp Printed in Great Britain Analysis of variability with large numbers of small samples By D. R. COX Department of Mathematics, Imperial College, London SW7 2BZ, U.K. AND P. J. SOLOMON Department of Statistics, University of Adelaide, Adelaide, S.A., Australia 5001 SUMMARY Procedures are discussed for the detailed analysis of distributional form, based on many samples of size r, where especially r = 2, 3, 4. The possibility of discriminating between different kinds of departure from the standard normal assumptions is discussed. Both graphical and more formal procedures are developed and iltustrated by some data on pulse rates. Some key words: Graphical methods; Kurtosis; Nonnormality; Order statistics; Overdispersion; Pulse rate; Skewness. 1. INTRODUCTION It is common both in some kinds of balanced experimental design and in schemes of routine testing to have quite large numbers of small groups of observations, each group obtained under the same conditions. For example, in a large study, blood pressure measurements might be taken in duplicate on each patient visit, or in certain routine chemical analyses triplicate samples from the same source might be dealt with independently. There can be major problems in ensuring the independence of such replicate observations necessary to ensure that the relevant source of variability is not underestimated. Assuming that this independence is achieved, preferably by appropriate 'blinding', we use the replicate observations primarily to improve the precision of the mean, or other location estimate, but also to estimate a component of variance within samples, usually measuring sampling or measurement error. A further use is to check for gross errors affecting a single observation. In the present paper, however, we examine what further information can be extracted from such data. Throughout we suppose that, possibly after transformation, the 'standard' assumption of a normal distribution with constant variance is at least a reasonable starting point for an analysis. Of course more detailed analysis is likely to be worthwhile only if the variation within samples is of intrinsic interest. We assume throughout that rounding errors, digit preferences and the like are relatively unimportant and that the variability studied is not an artefact. 2. SOME GENERAL IDEAS 2 1. Formulation of models Suppose that we have m independent samples each of size r, the observations in the ith sample being ydl,...., Yir (i = 1..., inm). As a basis for the analysis we consider a
3 544 D. R. COX AND P. J. SOLOMON number of possible models for the corresponding simplest is the standard, normal theory one. Normal theory model, MN. The Yip are independently normally distributed with const variance o.2 and with E ( Yip) = ui. As explained in? 1 the focus of interest in the present paper is not the {Ui}, which may indeed have some additional structure specified, for instance, by a regression or factorial model. We shall typically suppose that there are substantial differences in mean present and that no useful information about the internal variability can be recovered from the between-sample variation. There are numerous ways in which interesting departures from MN may occur and we shall consider just three. Systematic changes in variance, MSY. Here the assumptions of MN hold except that var (YYi,) = CJ2, which is not constant but is a function either of an explanatory variable zi characterizing the ith population or of g,i. Such variation can, if necessary, be represent in various ways (Cook & Weisberg, 1983), e.g. OJ2= e/zi 0-2 2= etgi O02 o2 = = epg9i.2 P - H 20 where o-2o is a 'baseline' variance and / captures the systematic dependency present. An important extension allows for more complex multidimensional dependence: for example, if the samples are arranged in a row x column array, there may be systematic differences in variance between rows, between columns, or both. Complementary to MSY is the possibility that, while each population has a different variance, the changes in variance are random, unrelated to any observed feature. This is a model of overdispersion relative to MN. Overdispersion model, MOD. Again the normal theory assumptions of MN are modifie only by allowing each population to have a different variance, var ( Y1) = (J2, but now Ti = C-2 (i = 1,..., m) are independent unobserved values of a random variable T having a probability density function h(t). In the type of application we have in mind, it will often not be feasible to estimate the form of h(t) with any precision, and it may then be adequate to assume that T has an inverse gamma distribution, with density (Tfo)o tt-'fo exp (-'fot'/t){f('fo)f1, (1) where fo plays the role of an 'effective degrees of freedom' and E(T)= Tfo (fo-2)-1= O, say. For our final model, we suppose that, except for location, all populations ha same distribution, which is, however, nonnormal. Nonnormal model, MNN. All { Y1} are independent and the density of Yi, is where g(x) is a nonnormal density of zero mean. We write Kr for the rth cumulant of Yis and pr=kr/(j (r > 3). Of course, there are other possibilities and, in particular, we could combine M MOD and MNN in various ways, and also introduce models involving serial correlation.
4 Variability with large numbers of small samples 545 We are interested in methods for detecting departures from MN, in the estimation of relevant parameters in MSY, MOD and MNN, ana in studying the feasibility of discriminat ing between these three kinds of departure, in particular between MOD and MNN. While for very small values of r, separation of MOD and MNN may often not be possible, note that if there is substantial underdispersion in the sample estimates of variance, MOD can at once be eliminated from consideration. No special model for gross errors has been included. Both MOD and MNN can simulate such errors, in the first case via an occasional very large 'true' variance and in the second via a very long-tailed error distribution. Note that with r > 2 if gross errors were detected to be predominantly in one direction, MNN rather than MOD would be required. To some extent further analysis and interpretation may be quite strongly influenced by what kind of departure from MN is most appropriate. Thus, as between MOD and MNN, the former prompts the question 'why are some groups more variable than others?', whereas under the latter the errors may more reasonably be presumed to have a totally homogeneous structure. For the ith sample, we write 2 2. Some simple properties of MOD and MNN -i =, Yip/r Si = I ( Yip _ yi )2 for the sample mean and sum of squares. These are sufficient under both MN and MOD. It is convenient also to write ={Si/ (r -1 )}2 and 'i= 5i for the usual estimates of standard deviation and variance. For the overdispersion model MOD with the inverse gamma compounding density (1), the likelihood of the ith sample is (fo T?)!oF(1fO + 2r) F(2fo)irir{foro+ r(-zi)2+ Si}fo+lrf (2) which is of the Student's t form. If the {,i} are regarded as nuisance parameters, we examine the marginal likelihood based on Si which is Sir-3/2{1 + Sr/(f0to)}-foAr+i (foto r bb2f 2 r2) 3 For the general compounding density h(t), (3) is replaced by 00 J r1qr-1(sit'v)h(t) dt, (4) where qr1l(x) is the probability density of th of freedom. Estimation under the model MOD can be based on (3). Alternatively, and also for discriminating between MOD and MNN, it is useful to record the cumulants {Kj(i zi in terms of the cumulants {Aj} of the compounding density h(t). These are, wit 2= A1 = E(T), E()= var ()=+-1- (5)
5 546 D. R. COX AND P. J. SOLOMON say, and - 6S 8 _12r+1 A2 +(r+l1)(r +3) A31 K3(Ti)=)2+ (r-1)2 4 (r-1)(r+)aj Because A2/ U4, A3! O6,... are the cumulants of a nonnegative random variable of unit mean, A3/oJ6 > (A2/! o4)(a 2/4-1), with equality attained by a two-point distribution with one atom at zero. Therefore (T-) >.6{- + 8 (r+l)(9-r) A2 (r+1)(r+3) (A2)2} {(r- 1)2 (r- l) +2 r+1) +2 Note that negative skewness in the distribution of -, is possible, although for s of r, very large values of if2 would be necessary to bring this about. By comparison we have under the nonnormal model MNN (Kendall & Stuart, 1969, Ch. 12) E(T-)=cr2, var ( (~~\6 8+ 4(r -2) 1? 2 _(7 03( E) =C{(r - I1)2 +r(r - l1)2 p23 +r( r - l ) r From (5) and (7) we can compute fairly directly via the first two moments of the {T} estimates of either A2/ C in MOD or p4 in MNN. Both are dimensionless measures of the departure from the standard conditions MN. For r> 2 we can compute for each sample scale and location invariant measures of 'shape', e.g. the standardized third cumulant. Under MN and MOD these are distributed independently of yi. and Si, but this independence is in general lost under MNN. We shall in the subsequent discussion suppose that the population means {fig} are such that no useful information about error properties can be gleaned from the sample means 9'. Note, however, that under MNN cov (i., Si) = (r-1)k3/r. If the {ui} are distributed with variance -2,, this covering both systematic and rando variation, we have so that var (Yi.) = (J2/r + &.2, var(si) = 2(r - 1)(J4+ (r -)2K41r, corr (Yi., Si) = P I (8) Under MNN there is thus some possibility tha which could be misinterpreted as evidence The correlation (8) is, however, likely to be assume.
6 Variability with large numbers of small samples Procedures based on the distribution of Si In? 2*2 some of the properties of the sample sums of squares ha especially under MOD and MNN. We now consider some corresponding statistical procedures. For preliminary analysis a probability plot of the ordered {Si} against the expected order statistics of the chi-squared distribution with r- 1 degrees of freedom (Pearson & Hartley, 1972, Table 20) is natural. When r is not too small, say r > 5, and a number of Si are available, a powerful genera method of analysis (Bartlett & Kendall, 1946) under MN is to employ linear methods for log Si using the fact that under sampling a normal distribution var (log Si) = qf(2r-2), (9) where +fr(z) is the digamma function and the values for r = 2, 3, 4 are 4 93, 1-64, We shall not make extensive use of this in the present work, partly because of the severe loss of efficiency when r = 2, 3 and partly because of the undue sensitivity to sma values of Si and the failure of the method without ad hoc modification if Si = 0; su values could quite easily arise from rounding errors. As one example of its use, however, one could test for departure from MN in the direction of MOD or MNN by comparing with (9). (m - 1)-1{ (log Si)2 - (X log Si)2/ m} For MOD expansion of (3) or (4) for small 1/fo or small dispersion of h(x) shows that the locally most powerful test is based on the distribution of l S2 given L Si, or equivalently the marginal distribution of the dispersion index _{ S2 _ (X Si)2/ m}m-.1 (1 Si/ m)2{2/(r-1)} The divisor 2/ (r -1) ensures that, under MN, I -> 1 in probability as m -> cx. Note that detailed specification of h(x) is unnecessary for the local optimality property. For large m, I is asymptotically normal: because of the skewness of I, a rather better approximation is obtained by taking log I to be normal with mean - (r + 1)/{m(r - 1)} and variance 2(r+ 1)/{m(r- 1)} Detection of systematic changes We now consider the examination of possible systematic relations between variance and an explanatory variable or between variance and mean. Again graphical analysis will usually be a natural first step supplemented where appropriate by a test statistic which we take in the form T =I aisilz; Si. 11 where we can without loss of generality scale the explanatory variable so that l ai = 0, I ai = m. For instance if the sample means yi. are taken as the explan-atory variable ai=(yi.-y){ (yj.)2/m} 2 where g = Y./ m. There are two ways of obtaining a null hypothesis distribution of T The first is to note that under MN all permutations of {S1,...., Sm} are equally likely; hence, with S. = l Si/ m, E(T)=0O, var (T) = {z(si _S)2/ m}/3.2. (12)
7 548 D. R. COX AND P. J. SOLOMON Under weak assumptions, asymptotic normality will hold as n -> cc: for higher permutation moments, see Cox & Hinkley (1974, p. 185). An 'exact' permutation test is in principle possible. Under MN a more sensitive analysis is possible, by using the normal theory distribution of the ratio of quadratic forms rather than the permutation distribution. A reasonably simple approximate test is derived by writing pr (Ttx) = pr { (ai - x/m) Si }- [{2 - )( x2/)} Thus for a two-sided equi-tailed test at level 2a we need critical limits x* defined by where ((-k*) a. That is, {2(r- 1)(1 +x*2/m)}12 a, ~~ 2 (13) x* ={2(r - 1)1'k*f 1-2(r - 1) k*2/m}2 (13) For a more refined calculation it is possible to introduce a correction based on the standardized skewness and kurtosis of the random variable I (ai - x/lvm)si, nam l) (r- 1)/m I r+/ (m)} = (rii )rn{p4avrnmp3a+o(n)} where P3a = a3/m, P4a = a4/m-3. Numerical work in the special case r = 2 suggests that for the purpose of the present paper, where m is likely to be quite large, use of (13) is entirely adequate. Note that the simpler approximation (13) with the 1/rm term omitted is equivalent to the permutation test (12) with a normal theory value for var (T), as is clear on general grounds. 3. SAMPLES OF SIZE TWO We now consider in more detail what is probably the most common case in applications, namely r = 2, when a large number of duplicate observations are available. Provided that the numbering of observations within a pair is uninformative, we may replace the ith pair of observations (Yil, Yi2) by Yi. = 2( Yi1 + YiA), Si = 2( Yi-Yi2)2. An initial analysis for detecting systematic dependencies is to plot Si against Yi. or some other suitable explanatory variable zi. For a formal test we use? 2-4. If there is no systematic relation, attention is focused on the marginal distribution of the {Si}. Under the standard model MN, Sil/r2 has the chi-squared distribution with one degree of freedom. Probably the simplest graphical analysis is a seminormal plot of the ordered V/Sj; see Pearson & Hartley (1972, Table 21). Departure in the direction of underdispersion would indicate MNN whereas overdispersion could indicate either MOD or MNN.
8 Variability with large numbers of small samples 549 Under MNN, it is clear that even with a very large value of m only even-order cumulants of the distribution of the y's can be determined. Thus there is no way of studying the skewness of y, at least so long as there is no relevant information in the variation between the {yi.} To supplement the plot of Si versus yi. the index of dispersion can be calculated, and by (5) and (7) regarded as estimating 1 +2(A2/J-4) under MOD, and 1 +4p4 under MNN, 4. SAMPLES OF SIZE THREE 4 1. General discussion When we have samples of size three rather than samples of size two, there is appreciable extra sensitivity in having estimates of variance with two degrees of freedom rather than one, but in some ways the more interesting difference lies in the richer possibilities for examining distributional shape and in particular of detecting skewness under the nonnormal model, MNN. Thus an initial analysis will often consist of: (i) a plot of Si versus Yi. or a similar explanatory variable zi, supplemented where necessary by the test statistic (11); (ii) a plot of the ordered Si versus the expected order statistics from an exponential distribution, supplemented where necessary by the test statistic (10). Note that epm, the expected value of the pth smallest observation in samples of size m from the unit exponential distribution, is 1 1 log{(m+2)/(m-p+2)} (ptm), pmrmn * * * m-p+1 log (m+)+y (p= m), where -y is Euler's constant. We now concentrate on the more detailed analysis of the variation and this is most naturally done in terms of the order statistics Order statistics in samples of three It is convenient to drop temporarily the suffix indicating the particular sample and to write Y(1) < Y(2) Y(3) for the order statistics. The central results were given by Fisher (1930) in connection with a study of the exact distribution of sample estimates of skewne in which induction on sample size was used, samples of three being the 'starting point'. If a2 = ]S is the usual estimate of variance, the one 'degree of freedom' that describes the skewness of the sample can be taken in dimensionless form as W = (Y(3)-2Y(2) + Y(1))/J Under the standard normal model, MN, and also under MOD, (w, S) are independent, so that clear dependence between (w, S) is an indication that MNN should be considered. To obtain the joint distribution of (5, w) under MN and MNN it is convenient, following Fisher (1930), to write Y(3) = Y(2) + 0 ( Cos COS v t ext/3rem +1 si v Y=Y sin v, - Y(1) Cos = v Y(2) +13 c - p sin to ss v, wt +2 =3 sin or
9 550 D. R. Cox AND P. J. SOLOMON Y(1)* Note that w = 213 sin v and that t - sin1 takes values in (-1, 1). Now the joint density of {Y(1), Y(2), y(3)} is 6g(y(l))g(y(2))g(y(3)) and, on transforming to new variables (ai, Y2, v), the density of (5, v) is _ U f0 g(x - -Cos v + N3a5 sin v)g(x)g(x +J-Cos v +ln35-sin v) dx. (14) It follows immediately that for MN the statistics 5- and v are indeed independent and that the marginal density of v is uniform; i.e. that of t is uniform on (-1, 1). Incidentally the usual unbiased estimate of the third cumulant 2 =(y3- y)3=35 sin (3 v), showing explicitly that consideration of w or v is essentially equivalent to that of the standardized third cumulant ratio. Note that the individually standardized third cumulant lies in (-13, /3). There are the following open questions connected with (14). (i) Does knowledge of the joint distribution of (5, v) determine g(x) uniquely, except for a translation? (ii) Does independence of (r and v imply that g(x) is normal? (iii) Is there a simple necessary and sufficient condition for a given distribution of (O, v) to be representable in the form (14)? (iv) What statistic or statistics are theoretically most sensitive for detecting departures from MN? Note that under MOD, ( and v are independent, v is uniform and 5 or S has the overdispersed distribution discussed in? 2-2. Also if it is required to estimate skewness under the model MNN averaging of individually standardized estimates is inappropriate, in particular in the light of the constraint mentioned above: a consistent estimate of K3/1 ( is A reasonable analysis in practice is: 2m -Yi)3/(m )3/2. (a) the examination of the marginal distribution of the Si via the probability plot and test statistic mentioned in? 4 1; (b) inspection of a scatter plot of ti versus oi, clear dependence showing evidence against MN and MOD; (c) inspection of the marginal distribution of ti supplemented by calculation of the mean of ti as a statistic nonzero values of which would indicate skewness in the model MNN- 5. SAMPLES OF SIZE MORE THAN THREE We discuss only briefly corresponding procedures for samples of size four and more. The natural extension of the statitic w of? 4 is provided via linear functions of order statistics with a simple interpretation and zero expectation under normality. Thus for samples of size four we start with measures of skewness and kurtosis ( y(4 - y(3)) - ( Y(2) -Y(1)), (y j"4 - (3) + ( Y(2) -Y(1)) - k( y(3) -Y(2)),
10 Variability with large numbers of small samples 551 choosing k so that the second of these has zero expectation under MN: in fact k = This leads us to define W = {Y(4)-Y(3 -Y(2)+Y(1)}/, W"= {Y(4) y(3)+3f465Y(2)-Y(1)1'7. It is easily shown that, under MN and MOD, W' and wi are independent of 5J, var(w')=0-771, var (w")= They are also uncorrelated but far from independent and far from normally distributed. It is likely, therefore, that the most effective procedure is to compute w' and w" from each set of data and to examine the sample means and variances for consistency with the normal theory values. Departures would have a fairly clear diagnostic value. The sample estimate of cov (w', w") could also be calculated, although it is unclear what interpretation is to be put on a nonzero value. 6. SOME EXAMPLES Samples of size three. As an illustration, we analyse some pulse rates from the International Prospective Primary Prevention Study in Hypertension, a large scale clinical trial. Before entry and randomization to treatments, patients typically attended three qualifying visits at least one day apart. Pulse rate, the number of beats per minute, was one of several variables measured at each pre-entry visit. The data analysed here consist of three pulse rates, y, for a sample of one hundred men and one hundred women. The analysis goes in three broad steps. First, a plot, not given here, of the within-patient sum of squares Si versus the mean pulse rate yi. shows clear evidence that Si increases with Yi., thus indicating either MSY or, just conceivably, sampling correlation between Si and Yi. under MNN. Clearly MN is inappropriate. However, use of the reciprocal pulse rate removes the systematic relation between sum of squares and mean. The reciprocal pulse rate has a direct interpretation as the mean time between beats. The changes induced by reciprocal transformation are supported by the statistic T; for women, T = 0897 on the original scale, but this changes to for reciprocals; correspondingly for men, T changes from to 0094 under the reciprocal transformation. The second step is a probability plot of the ordered Si versus the expected exponential order statistics. Both the pulse rates and reciprocal pulse rates show overdispersion, but less for reciprocals; see Fig. 1. The very large values of Si correspond to women with large variation in pulse rate which variation, however, may be clinically meaningful. Therefore it is on the whole sensible to regard such values as part of the population under study rather than as aberrant values to be rejected. Supplementing the plot by the index of overdispersion I of (10) verifies the improved distributional properties of the reciprocals. Under MNN, from (7), I leads to an estimate of the dimensionless fourth cumulant, namely and for pulse rates for women and men, and on the reciprocal scale, and 4-602, respectively. The third step in the analysis plots Si versus the angular measure of skewness ti. We found it useful to examine for comparison simulated data from other distributions such as the normal, exponential and Student's 1 distributions. Normal data produced a random scatter as expected whereas exponential data ga-ve a plot which was clearly positively skew.
11 552 D. R. COX AND P. J. SOLOMON Si ~~~~~~~~ e 5 Fig. 1. Entry pulse rate for 100 women. Triples. Crosses, Si versus exponential score, ei; circles, losi versus exponential score, ei, after reciprocal transformation. (a) 1000 Si O ~ ~~~~ - Si 600- o 0 (23> 800- (b) (3 0 ~~~~~~~~~~~~ ~~~0 0 O ~~~~200 e O~~~~~~~~~~~~~~ t ~~ O ~ ~ CD? _ 0 _ 0-1*0-0O5 0 O 5 PG ti 1.0 Fig. 2. Entry pulse rate for 100 women. Triples. (a) Si versus angular statistic, ti. (b) losi versus angular statistic, ti, after reciprocal transformation _ ]-st ~~~xxxx < XA 0 I I Seminormal expected order statistic Fig. 3. Entry pulse rate for 100 women. Pairs. Crosses, seminormal plot of ordered V/Si; circles, seminormal plot of ordered V/S, after reciprocal transformation.
12 Variability with large numbers of small samples 553 Figure 2(a) shows that Si and ti for pulse rates are not independent. Reciprocals improve the symmetry; see Fig. 2(b). The skewness measure ti is characterized by concentrations of points at -1, 1 and 0, the first two values a consequence of grouping; its distribution is certainly nonuniform, both on the original and reciprocal scales, so that MOD is not appropriate. The women exhibit slightly negatively skew pulse rates, giving mean ti and standard deviation On the reciprocal scale, ti has mean and standard deviation 0 705: the mean is exaggerated by one very large Si. By contrast, the men exhibit slight positive skewness in pulse rate with mean ti and standard deviation 0 739; for reciprocals, mean ti is with standard deviation The assumption of symmetry in the distribution of the reciprocals is further supported by the usual standardized measure of skewness which takes values of and for women and for men. Exactly unit values of ti are accounted for by rounding. Simulations suggest, however, that rounding cannot entirely explain the anomalous distribution of ti. The grouping could have been aggravated by some temporal correlation. In particular, when two out of three rates agree, do they occur together as consecutive values? Pooling men and women, of 81 individuals with two rates the same, 19 had a different rate at the second visit, whereas 34 had a different rate at the first, and 28 at the third visit. This tends to suggest that some doctors may recall the previous pulse rate at the subsequent visit. In summary, the reciprocal pulse rates appear to have a symmetric, nonnormal, long-tailed distribution, suggesting that Student's t distribution with a small number of degrees of freedom may be appropriate. This provides a suitably flexible family of symmetrical distributions if parametric representation is required. Samples of size two. Partly in order to compare results, we have analysed a series of pairs of observations formed by taking just the first and last pulse rate from the data discussed above. Reciprocals certainly reduce the size of the kurtosis, being typically positive. However, reciprocals for women exhibit slight negative kurtosis as shown in Fig. 3, although testing with I, which under MNN leads to an estimate of p4 as for reciprocals, this is by no means statistically significant. Also, the other values of I vary little. In particular, for women, pulse rates give p as 3 335, and for men 6 054, which changes to for reciprocals. Samples of size four. Data on strengths of yarn (Cox & Snell, 1981, p. 131) in the form of 12 samples each of size four show excellent agreement with the normal theory values for w' and w". Namely, the observed mean w' for bobbins is -0,0178 with observed standard deviation 0-710, and for w", the observed mean is -0A466 with standard dev 1 561; the theoretical standard deviations are respectively and and the standard errors of the means thus and 0;461. ACKNOWLEDGEMENTS We are grateful to the Science and Engineering Research Council and to Ciba-Geigy Ltd (Basle) for support of this work. REFERENCES BARTLETT, M. S. & KENDALL, D. G. (1946). The statistical analysis of variance-heterogeneity and the logarithmic transformation. SuppL J. R. Statist. Soc. 8,
13 554 D. R. COX AND P. J. SOLOMON COOK, R. D. & WEISBERG, S. (1983). Diagnostics for heteroscedasticity in regression. Biometrika 70, Cox, D. R. & HINKLEY, D. V. (1974). Theoretical Statistics. London: Chapman & Hall. Cox, D. R. & SNELL, E. J. (1981). Applied Stajistics. London: Chapman & Hall. FISHER, R. A. (1930). The moments of the distribution for normal samples of measures of departure from normality. Proc. R. Soc. A 130, KENDALL, M. G. & STUART A. (1969). Advanced Theory of Statistics, 1, 3rd ed. High Wycombe: Griffin. PEARSON, E. S. & HARTLEY, H. 0. (1972). Biometrika Tables for Statisticians, 2. High Wycombe: Griffin. [Received December Revised April 1986]
Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at
A Note on the Efficiency of Least-Squares Estimates Author(s): D. R. Cox and D. V. Hinkley Source: Journal of the Royal Statistical Society. Series B (Methodological), Vol. 30, No. 2 (1968), pp. 284-289
More informationYour use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at
American Society for Quality A Note on the Graphical Analysis of Multidimensional Contingency Tables Author(s): D. R. Cox and Elizabeth Lauh Source: Technometrics, Vol. 9, No. 3 (Aug., 1967), pp. 481-488
More informationYour use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at
Biometrika Trust Robust Regression via Discriminant Analysis Author(s): A. C. Atkinson and D. R. Cox Source: Biometrika, Vol. 64, No. 1 (Apr., 1977), pp. 15-19 Published by: Oxford University Press on
More informationYour use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at
Biometrika Trust Some Remarks on Overdispersion Author(s): D. R. Cox Source: Biometrika, Vol. 70, No. 1 (Apr., 1983), pp. 269-274 Published by: Oxford University Press on behalf of Biometrika Trust Stable
More informationYour use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at
Some Applications of Exponential Ordered Scores Author(s): D. R. Cox Source: Journal of the Royal Statistical Society. Series B (Methodological), Vol. 26, No. 1 (1964), pp. 103-110 Published by: Wiley
More informationYour use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at
Biometrika Trust Some Simple Approximate Tests for Poisson Variates Author(s): D. R. Cox Source: Biometrika, Vol. 40, No. 3/4 (Dec., 1953), pp. 354-360 Published by: Oxford University Press on behalf of
More informationBiometrika Trust. Biometrika Trust is collaborating with JSTOR to digitize, preserve and extend access to Biometrika.
Biometrika Trust An Improved Bonferroni Procedure for Multiple Tests of Significance Author(s): R. J. Simes Source: Biometrika, Vol. 73, No. 3 (Dec., 1986), pp. 751-754 Published by: Biometrika Trust Stable
More informationYour use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at
On the Estimation of the Intensity Function of a Stationary Point Process Author(s): D. R. Cox Source: Journal of the Royal Statistical Society. Series B (Methodological), Vol. 27, No. 2 (1965), pp. 332-337
More informationYour use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at
Regression Analysis when there is Prior Information about Supplementary Variables Author(s): D. R. Cox Source: Journal of the Royal Statistical Society. Series B (Methodological), Vol. 22, No. 1 (1960),
More informationInternational Biometric Society is collaborating with JSTOR to digitize, preserve and extend access to Biometrics.
400: A Method for Combining Non-Independent, One-Sided Tests of Significance Author(s): Morton B. Brown Reviewed work(s): Source: Biometrics, Vol. 31, No. 4 (Dec., 1975), pp. 987-992 Published by: International
More informationYour use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at
The Analysis of Multivariate Binary Data Author(s): D. R. Cox Source: Journal of the Royal Statistical Society. Series C (Applied Statistics), Vol. 21, No. 2 (1972), pp. 113-120 Published by: Wiley for
More informationYour use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at
Biometrika Trust Nonlinear Component of Variance Models Author(s): P. J. Solomon and D. R. Cox Source: Biometrika, Vol. 79, No. 1 (Mar., 1992), pp. 1-11 Published by: Oxford University Press on behalf
More informationApplication of Parametric Homogeneity of Variances Tests under Violation of Classical Assumption
Application of Parametric Homogeneity of Variances Tests under Violation of Classical Assumption Alisa A. Gorbunova and Boris Yu. Lemeshko Novosibirsk State Technical University Department of Applied Mathematics,
More informationBiometrika Trust. Biometrika Trust is collaborating with JSTOR to digitize, preserve and extend access to Biometrika.
Biometrika Trust Discrete Sequential Boundaries for Clinical Trials Author(s): K. K. Gordon Lan and David L. DeMets Reviewed work(s): Source: Biometrika, Vol. 70, No. 3 (Dec., 1983), pp. 659-663 Published
More informationYour use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at
A Look at Some Data on the Old Faithful Geyser Author(s): A. Azzalini and A. W. Bowman Source: Journal of the Royal Statistical Society. Series C (Applied Statistics), Vol. 39, No. 3 (1990), pp. 357-365
More informationApplication of Variance Homogeneity Tests Under Violation of Normality Assumption
Application of Variance Homogeneity Tests Under Violation of Normality Assumption Alisa A. Gorbunova, Boris Yu. Lemeshko Novosibirsk State Technical University Novosibirsk, Russia e-mail: gorbunova.alisa@gmail.com
More informationHypothesis Testing for Var-Cov Components
Hypothesis Testing for Var-Cov Components When the specification of coefficients as fixed, random or non-randomly varying is considered, a null hypothesis of the form is considered, where Additional output
More informationThe Periodogram and its Optical Analogy.
The Periodogram and Its Optical Analogy Author(s): Arthur Schuster Reviewed work(s): Source: Proceedings of the Royal Society of London. Series A, Containing Papers of a Mathematical and Physical Character,
More informationMath 423/533: The Main Theoretical Topics
Math 423/533: The Main Theoretical Topics Notation sample size n, data index i number of predictors, p (p = 2 for simple linear regression) y i : response for individual i x i = (x i1,..., x ip ) (1 p)
More informationFAILURE-TIME WITH DELAYED ONSET
REVSTAT Statistical Journal Volume 13 Number 3 November 2015 227 231 FAILURE-TIME WITH DELAYED ONSET Authors: Man Yu Wong Department of Mathematics Hong Kong University of Science and Technology Hong Kong
More informationTesting the homogeneity of variances in a two-way classification
Biomelrika (1982), 69, 2, pp. 411-6 411 Printed in Ortal Britain Testing the homogeneity of variances in a two-way classification BY G. K. SHUKLA Department of Mathematics, Indian Institute of Technology,
More informationA nonparametric two-sample wald test of equality of variances
University of Wollongong Research Online Faculty of Informatics - Papers (Archive) Faculty of Engineering and Information Sciences 211 A nonparametric two-sample wald test of equality of variances David
More informationComparison of Two Samples
2 Comparison of Two Samples 2.1 Introduction Problems of comparing two samples arise frequently in medicine, sociology, agriculture, engineering, and marketing. The data may have been generated by observation
More informationResearch Article A Nonparametric Two-Sample Wald Test of Equality of Variances
Advances in Decision Sciences Volume 211, Article ID 74858, 8 pages doi:1.1155/211/74858 Research Article A Nonparametric Two-Sample Wald Test of Equality of Variances David Allingham 1 andj.c.w.rayner
More informationCalculus first semester exam information and practice problems
Calculus first semester exam information and practice problems As I ve been promising for the past year, the first semester exam in this course encompasses all three semesters of Math SL thus far. It is
More informationYour use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at
Biometrika Trust Testing Multivariate Normality Author(s): D. R. Cox and N. J. H. Small Source: Biometrika, Vol. 65, No. 2 (Aug., 1978), pp. 263-272 Published by: Oxford University Press on behalf of Biometrika
More informationYour use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at
The Interpretation of Interaction in Contingency Tables Author(s): E. H. Simpson Source: Journal of the Royal Statistical Society. Series B (Methodological), Vol. 13, No. 2 (1951), pp. 238-241 Published
More informationA correlation coefficient for circular data
BiomelriL-a (1983). 70. 2, pp. 327-32 327 Prinltd in Great Britain A correlation coefficient for circular data BY N. I. FISHER CSIRO Division of Mathematics and Statistics, Lindfield, N.S.W., Australia
More informationEach copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.
The Variance of the Product of Random Variables Author(s): Leo A. Goodman Source: Journal of the American Statistical Association, Vol. 57, No. 297 (Mar., 1962), pp. 54-60 Published by: American Statistical
More informationA Monte-Carlo study of asymptotically robust tests for correlation coefficients
Biometrika (1973), 6, 3, p. 661 551 Printed in Great Britain A Monte-Carlo study of asymptotically robust tests for correlation coefficients BY G. T. DUNCAN AND M. W. J. LAYAKD University of California,
More informationSEQUENTIAL TESTS FOR COMPOSITE HYPOTHESES
[ 290 ] SEQUENTIAL TESTS FOR COMPOSITE HYPOTHESES BYD. R. COX Communicated by F. J. ANSCOMBE Beceived 14 August 1951 ABSTRACT. A method is given for obtaining sequential tests in the presence of nuisance
More informationDiagnostics can identify two possible areas of failure of assumptions when fitting linear models.
1 Transformations 1.1 Introduction Diagnostics can identify two possible areas of failure of assumptions when fitting linear models. (i) lack of Normality (ii) heterogeneity of variances It is important
More informationOn prediction and density estimation Peter McCullagh University of Chicago December 2004
On prediction and density estimation Peter McCullagh University of Chicago December 2004 Summary Having observed the initial segment of a random sequence, subsequent values may be predicted by calculating
More informationTopic 23: Diagnostics and Remedies
Topic 23: Diagnostics and Remedies Outline Diagnostics residual checks ANOVA remedial measures Diagnostics Overview We will take the diagnostics and remedial measures that we learned for regression and
More informationa 11 x 1 + a 12 x a 1n x n = b 1 a 21 x 1 + a 22 x a 2n x n = b 2.
Chapter 1 LINEAR EQUATIONS 11 Introduction to linear equations A linear equation in n unknowns x 1, x,, x n is an equation of the form a 1 x 1 + a x + + a n x n = b, where a 1, a,, a n, b are given real
More informationSTATISTICS SYLLABUS UNIT I
STATISTICS SYLLABUS UNIT I (Probability Theory) Definition Classical and axiomatic approaches.laws of total and compound probability, conditional probability, Bayes Theorem. Random variable and its distribution
More informationy response variable x 1, x 2,, x k -- a set of explanatory variables
11. Multiple Regression and Correlation y response variable x 1, x 2,, x k -- a set of explanatory variables In this chapter, all variables are assumed to be quantitative. Chapters 12-14 show how to incorporate
More informationChapter 8 Heteroskedasticity
Chapter 8 Walter R. Paczkowski Rutgers University Page 1 Chapter Contents 8.1 The Nature of 8. Detecting 8.3 -Consistent Standard Errors 8.4 Generalized Least Squares: Known Form of Variance 8.5 Generalized
More informationBiometrika Trust. Biometrika Trust is collaborating with JSTOR to digitize, preserve and extend access to Biometrika.
Biometrika Trust A Stagewise Rejective Multiple Test Procedure Based on a Modified Bonferroni Test Author(s): G. Hommel Source: Biometrika, Vol. 75, No. 2 (Jun., 1988), pp. 383-386 Published by: Biometrika
More informationTesting Goodness Of Fit Of The Geometric Distribution: An Application To Human Fecundability Data
Journal of Modern Applied Statistical Methods Volume 4 Issue Article 8 --5 Testing Goodness Of Fit Of The Geometric Distribution: An Application To Human Fecundability Data Sudhir R. Paul University of
More informationKumaun University Nainital
Kumaun University Nainital Department of Statistics B. Sc. Semester system course structure: 1. The course work shall be divided into six semesters with three papers in each semester. 2. Each paper in
More informationYour use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at
Biometrika Trust The Use of a Concomitant Variable in Selecting an Experimental Design Author(s): D. R. Cox Source: Biometrika, Vol. 44, No. 1/2 (Jun., 1957), pp. 150-158 Published by: Oxford University
More informationEach copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.
On the Bound for a Pair of Consecutive Quartic Residues of a Prime Author(s): R. G. Bierstedt and W. H. Mills Source: Proceedings of the American Mathematical Society, Vol. 14, No. 4 (Aug., 1963), pp.
More informationELEMENTARY LINEAR ALGEBRA
ELEMENTARY LINEAR ALGEBRA K R MATTHEWS DEPARTMENT OF MATHEMATICS UNIVERSITY OF QUEENSLAND First Printing, 99 Chapter LINEAR EQUATIONS Introduction to linear equations A linear equation in n unknowns x,
More informationConditional Inference by Estimation of a Marginal Distribution
Conditional Inference by Estimation of a Marginal Distribution Thomas J. DiCiccio and G. Alastair Young 1 Introduction Conditional inference has been, since the seminal work of Fisher (1934), a fundamental
More informationLikelihood and p-value functions in the composite likelihood context
Likelihood and p-value functions in the composite likelihood context D.A.S. Fraser and N. Reid Department of Statistical Sciences University of Toronto November 19, 2016 Abstract The need for combining
More informationYou can compute the maximum likelihood estimate for the correlation
Stat 50 Solutions Comments on Assignment Spring 005. (a) _ 37.6 X = 6.5 5.8 97.84 Σ = 9.70 4.9 9.70 75.05 7.80 4.9 7.80 4.96 (b) 08.7 0 S = Σ = 03 9 6.58 03 305.6 30.89 6.58 30.89 5.5 (c) You can compute
More informationYour use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at
A Quincuncial Projection of the Sphere Author(s): C. S. Peirce Source: American Journal of Mathematics, Vol. 2, No. 4 (Dec., 1879), pp. 394-396 Published by: The Johns Hopkins University Press Stable URL:
More informationTESTING FOR NORMALITY IN THE LINEAR REGRESSION MODEL: AN EMPIRICAL LIKELIHOOD RATIO TEST
Econometrics Working Paper EWP0402 ISSN 1485-6441 Department of Economics TESTING FOR NORMALITY IN THE LINEAR REGRESSION MODEL: AN EMPIRICAL LIKELIHOOD RATIO TEST Lauren Bin Dong & David E. A. Giles Department
More informationCurriculum Map for Mathematics SL (DP1)
Unit Title (Time frame) Topic 1 Algebra (8 teaching hours or 2 weeks) Curriculum Map for Mathematics SL (DP1) Standards IB Objectives Knowledge/Content Skills Assessments Key resources Aero_Std_1: Make
More informationGroup Sequential Designs: Theory, Computation and Optimisation
Group Sequential Designs: Theory, Computation and Optimisation Christopher Jennison Department of Mathematical Sciences, University of Bath, UK http://people.bath.ac.uk/mascj 8th International Conference
More informationEach copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.
On Runs of Residues Author(s): D. H. Lehmer and Emma Lehmer Source: Proceedings of the American Mathematical Society, Vol. 13, No. 1 (Feb., 1962), pp. 102-106 Published by: American Mathematical Society
More informationGlossary. The ISI glossary of statistical terms provides definitions in a number of different languages:
Glossary The ISI glossary of statistical terms provides definitions in a number of different languages: http://isi.cbs.nl/glossary/index.htm Adjusted r 2 Adjusted R squared measures the proportion of the
More informationThe Model Building Process Part I: Checking Model Assumptions Best Practice
The Model Building Process Part I: Checking Model Assumptions Best Practice Authored by: Sarah Burke, PhD 31 July 2017 The goal of the STAT T&E COE is to assist in developing rigorous, defensible test
More informationUNIFORMLY MOST POWERFUL CYCLIC PERMUTATION INVARIANT DETECTION FOR DISCRETE-TIME SIGNALS
UNIFORMLY MOST POWERFUL CYCLIC PERMUTATION INVARIANT DETECTION FOR DISCRETE-TIME SIGNALS F. C. Nicolls and G. de Jager Department of Electrical Engineering, University of Cape Town Rondebosch 77, South
More informationTable of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z).
Table of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z). For example P(X.04) =.8508. For z < 0 subtract the value from,
More informationSTAT Section 5.8: Block Designs
STAT 518 --- Section 5.8: Block Designs Recall that in paired-data studies, we match up pairs of subjects so that the two subjects in a pair are alike in some sense. Then we randomly assign, say, treatment
More informationSurvival Analysis Math 434 Fall 2011
Survival Analysis Math 434 Fall 2011 Part IV: Chap. 8,9.2,9.3,11: Semiparametric Proportional Hazards Regression Jimin Ding Math Dept. www.math.wustl.edu/ jmding/math434/fall09/index.html Basic Model Setup
More informationEstimation of Effect Size From a Series of Experiments Involving Paired Comparisons
Journal of Educational Statistics Fall 1993, Vol 18, No. 3, pp. 271-279 Estimation of Effect Size From a Series of Experiments Involving Paired Comparisons Robert D. Gibbons Donald R. Hedeker John M. Davis
More informationThe Model Building Process Part I: Checking Model Assumptions Best Practice (Version 1.1)
The Model Building Process Part I: Checking Model Assumptions Best Practice (Version 1.1) Authored by: Sarah Burke, PhD Version 1: 31 July 2017 Version 1.1: 24 October 2017 The goal of the STAT T&E COE
More informationRepeated ordinal measurements: a generalised estimating equation approach
Repeated ordinal measurements: a generalised estimating equation approach David Clayton MRC Biostatistics Unit 5, Shaftesbury Road Cambridge CB2 2BW April 7, 1992 Abstract Cumulative logit and related
More informationSample size calculations for logistic and Poisson regression models
Biometrika (2), 88, 4, pp. 93 99 2 Biometrika Trust Printed in Great Britain Sample size calculations for logistic and Poisson regression models BY GWOWEN SHIEH Department of Management Science, National
More informationSubject CS1 Actuarial Statistics 1 Core Principles
Institute of Actuaries of India Subject CS1 Actuarial Statistics 1 Core Principles For 2019 Examinations Aim The aim of the Actuarial Statistics 1 subject is to provide a grounding in mathematical and
More informationCHOOSING AMONG GENERALIZED LINEAR MODELS APPLIED TO MEDICAL DATA
STATISTICS IN MEDICINE, VOL. 17, 59 68 (1998) CHOOSING AMONG GENERALIZED LINEAR MODELS APPLIED TO MEDICAL DATA J. K. LINDSEY AND B. JONES* Department of Medical Statistics, School of Computing Sciences,
More informationYour use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at
Biometrika Trust A note on the sensitivity to assumptions of a generalized linear mixed model Author(s): D. R. COX and M. Y. WONG Source: Biometrika, Vol. 97, No. 1 (MARCH 2010), pp. 209-214 Published
More informationMind Association. Oxford University Press and Mind Association are collaborating with JSTOR to digitize, preserve and extend access to Mind.
Mind Association Response to Colyvan Author(s): Joseph Melia Source: Mind, New Series, Vol. 111, No. 441 (Jan., 2002), pp. 75-79 Published by: Oxford University Press on behalf of the Mind Association
More informationConditional confidence interval procedures for the location and scale parameters of the Cauchy and logistic distributions
Biometrika (92), 9, 2, p. Printed in Great Britain Conditional confidence interval procedures for the location and scale parameters of the Cauchy and logistic distributions BY J. F. LAWLESS* University
More informationYour use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at
Queues with Time-Dependent Arrival Rates: II. The Maximum Queue and the Return to Equilibrium Author(s): G. F. Newell Source: Journal of Applied Probability, Vol. 5, No. 3 (Dec., 1968), pp. 579-590 Published
More informationChapter 1 Statistical Inference
Chapter 1 Statistical Inference causal inference To infer causality, you need a randomized experiment (or a huge observational study and lots of outside information). inference to populations Generalizations
More informationELEMENTARY LINEAR ALGEBRA
ELEMENTARY LINEAR ALGEBRA K. R. MATTHEWS DEPARTMENT OF MATHEMATICS UNIVERSITY OF QUEENSLAND Corrected Version, 7th April 013 Comments to the author at keithmatt@gmail.com Chapter 1 LINEAR EQUATIONS 1.1
More informationReview. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 770: Categorical Data Analysis
Review Timothy Hanson Department of Statistics, University of South Carolina Stat 770: Categorical Data Analysis 1 / 22 Chapter 1: background Nominal, ordinal, interval data. Distributions: Poisson, binomial,
More informationExercises Chapter 4 Statistical Hypothesis Testing
Exercises Chapter 4 Statistical Hypothesis Testing Advanced Econometrics - HEC Lausanne Christophe Hurlin University of Orléans December 5, 013 Christophe Hurlin (University of Orléans) Advanced Econometrics
More information1 One-way Analysis of Variance
1 One-way Analysis of Variance Suppose that a random sample of q individuals receives treatment T i, i = 1,,... p. Let Y ij be the response from the jth individual to be treated with the ith treatment
More informationFULL LIKELIHOOD INFERENCES IN THE COX MODEL
October 20, 2007 FULL LIKELIHOOD INFERENCES IN THE COX MODEL BY JIAN-JIAN REN 1 AND MAI ZHOU 2 University of Central Florida and University of Kentucky Abstract We use the empirical likelihood approach
More information1/24/2008. Review of Statistical Inference. C.1 A Sample of Data. C.2 An Econometric Model. C.4 Estimating the Population Variance and Other Moments
/4/008 Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University C. A Sample of Data C. An Econometric Model C.3 Estimating the Mean of a Population C.4 Estimating the Population
More informationA test for improved forecasting performance at higher lead times
A test for improved forecasting performance at higher lead times John Haywood and Granville Tunnicliffe Wilson September 3 Abstract Tiao and Xu (1993) proposed a test of whether a time series model, estimated
More informationAnalysing data: regression and correlation S6 and S7
Basic medical statistics for clinical and experimental research Analysing data: regression and correlation S6 and S7 K. Jozwiak k.jozwiak@nki.nl 2 / 49 Correlation So far we have looked at the association
More informationSTAT331. Cox s Proportional Hazards Model
STAT331 Cox s Proportional Hazards Model In this unit we introduce Cox s proportional hazards (Cox s PH) model, give a heuristic development of the partial likelihood function, and discuss adaptations
More informationIntroduction to General and Generalized Linear Models
Introduction to General and Generalized Linear Models Generalized Linear Models - part II Henrik Madsen Poul Thyregod Informatics and Mathematical Modelling Technical University of Denmark DK-2800 Kgs.
More informationModeling Overdispersion
James H. Steiger Department of Psychology and Human Development Vanderbilt University Regression Modeling, 2009 1 Introduction 2 Introduction In this lecture we discuss the problem of overdispersion in
More informationInstitute of Actuaries of India
Institute of Actuaries of India Subject CT3 Probability & Mathematical Statistics May 2011 Examinations INDICATIVE SOLUTION Introduction The indicative solution has been written by the Examiners with the
More informationPractice Problems Section Problems
Practice Problems Section 4-4-3 4-4 4-5 4-6 4-7 4-8 4-10 Supplemental Problems 4-1 to 4-9 4-13, 14, 15, 17, 19, 0 4-3, 34, 36, 38 4-47, 49, 5, 54, 55 4-59, 60, 63 4-66, 68, 69, 70, 74 4-79, 81, 84 4-85,
More informationToday. HW 1: due February 4, pm. Aspects of Design CD Chapter 2. Continue with Chapter 2 of ELM. In the News:
Today HW 1: due February 4, 11.59 pm. Aspects of Design CD Chapter 2 Continue with Chapter 2 of ELM In the News: STA 2201: Applied Statistics II January 14, 2015 1/35 Recap: data on proportions data: y
More information9 Correlation and Regression
9 Correlation and Regression SW, Chapter 12. Suppose we select n = 10 persons from the population of college seniors who plan to take the MCAT exam. Each takes the test, is coached, and then retakes the
More informationEach copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.
Measuring the Speed and Altitude of an Aircraft Using Similar Triangles Author(s): Hassan Sedaghat Source: SIAM Review, Vol. 33, No. 4 (Dec., 1991), pp. 650-654 Published by: Society for Industrial and
More informationChapter 6. Order Statistics and Quantiles. 6.1 Extreme Order Statistics
Chapter 6 Order Statistics and Quantiles 61 Extreme Order Statistics Suppose we have a finite sample X 1,, X n Conditional on this sample, we define the values X 1),, X n) to be a permutation of X 1,,
More informationThe Econometric Society is collaborating with JSTOR to digitize, preserve and extend access to Econometrica.
A Set of Independent Necessary and Sufficient Conditions for Simple Majority Decision Author(s): Kenneth O. May Source: Econometrica, Vol. 20, No. 4 (Oct., 1952), pp. 680-684 Published by: The Econometric
More informationAn Analysis of College Algebra Exam Scores December 14, James D Jones Math Section 01
An Analysis of College Algebra Exam s December, 000 James D Jones Math - Section 0 An Analysis of College Algebra Exam s Introduction Students often complain about a test being too difficult. Are there
More informationIntroduction to Statistical Analysis
Introduction to Statistical Analysis Changyu Shen Richard A. and Susan F. Smith Center for Outcomes Research in Cardiology Beth Israel Deaconess Medical Center Harvard Medical School Objectives Descriptive
More information6 Single Sample Methods for a Location Parameter
6 Single Sample Methods for a Location Parameter If there are serious departures from parametric test assumptions (e.g., normality or symmetry), nonparametric tests on a measure of central tendency (usually
More informationB.N.Bandodkar College of Science, Thane. Random-Number Generation. Mrs M.J.Gholba
B.N.Bandodkar College of Science, Thane Random-Number Generation Mrs M.J.Gholba Properties of Random Numbers A sequence of random numbers, R, R,., must have two important statistical properties, uniformity
More informationAdjusting the Tests for Skewness and Kurtosis for Distributional Misspecifications. Abstract
Adjusting the Tests for Skewness and Kurtosis for Distributional Misspecifications Anil K. Bera University of Illinois at Urbana Champaign Gamini Premaratne National University of Singapore Abstract The
More informationGroup Sequential Tests for Delayed Responses. Christopher Jennison. Lisa Hampson. Workshop on Special Topics on Sequential Methodology
Group Sequential Tests for Delayed Responses Christopher Jennison Department of Mathematical Sciences, University of Bath, UK http://people.bath.ac.uk/mascj Lisa Hampson Department of Mathematics and Statistics,
More informationAnalysis of 2x2 Cross-Over Designs using T-Tests
Chapter 234 Analysis of 2x2 Cross-Over Designs using T-Tests Introduction This procedure analyzes data from a two-treatment, two-period (2x2) cross-over design. The response is assumed to be a continuous
More informationK. Model Diagnostics. residuals ˆɛ ij = Y ij ˆµ i N = Y ij Ȳ i semi-studentized residuals ω ij = ˆɛ ij. studentized deleted residuals ɛ ij =
K. Model Diagnostics We ve already seen how to check model assumptions prior to fitting a one-way ANOVA. Diagnostics carried out after model fitting by using residuals are more informative for assessing
More informationEconomics 520. Lecture Note 19: Hypothesis Testing via the Neyman-Pearson Lemma CB 8.1,
Economics 520 Lecture Note 9: Hypothesis Testing via the Neyman-Pearson Lemma CB 8., 8.3.-8.3.3 Uniformly Most Powerful Tests and the Neyman-Pearson Lemma Let s return to the hypothesis testing problem
More informationGARCH Models Estimation and Inference
GARCH Models Estimation and Inference Eduardo Rossi University of Pavia December 013 Rossi GARCH Financial Econometrics - 013 1 / 1 Likelihood function The procedure most often used in estimating θ 0 in
More informationMathematical Association of America is collaborating with JSTOR to digitize, preserve and extend access to The American Mathematical Monthly.
Recounting the Rationals Author(s): Neil Calkin and Herbert S. Wilf Source: The American Mathematical Monthly, Vol. 107, No. 4 (Apr., 2000), pp. 360-363 Published by: Mathematical Association of America
More informationAnalysis of variance, multivariate (MANOVA)
Analysis of variance, multivariate (MANOVA) Abstract: A designed experiment is set up in which the system studied is under the control of an investigator. The individuals, the treatments, the variables
More informationLimited Dependent Variables and Panel Data
and Panel Data June 24 th, 2009 Structure 1 2 Many economic questions involve the explanation of binary variables, e.g.: explaining the participation of women in the labor market explaining retirement
More information