Supplement to: Guidelines for constructing a confidence interval for the intra-class correlation coefficient (ICC)

Similar documents
Bayesian Inference. Chapter 1. Introduction and basic concepts

1. Define the following terms (1 point each): alternative hypothesis

Scale Parameter Estimation of the Laplace Model Using Different Asymmetric Loss Functions

Each element of this set is assigned a probability. There are three basic rules for probabilities:

Simple Examples. Let s look at a few simple examples of OI analysis.

Stat 5101 Lecture Notes

Bayesian inference with reliability methods without knowing the maximum of the likelihood function

Bayesian Inference for the Multivariate Normal

Theory and Methods of Statistical Inference. PART I Frequentist theory and methods

ANOVA 3/12/2012. Two reasons for using ANOVA. Type I Error and Multiple Tests. Review Independent Samples t test

FIDUCIAL INFERENCE: AN APPROACH BASED ON BOOTSTRAP TECHNIQUES

Theory and Methods of Statistical Inference. PART I Frequentist likelihood methods

Comment about AR spectral estimation Usually an estimate is produced by computing the AR theoretical spectrum at (ˆφ, ˆσ 2 ). With our Monte Carlo

The Bayesian Approach to Multi-equation Econometric Model Estimation

A Solution to Separation for Clustered Binary Data

Essential Maths 1. Macquarie University MAFC_Essential_Maths Page 1 of These notes were prepared by Anne Cooper and Catriona March.

Two-Stage Improved Group Plans for Burr Type XII Distributions

Theory and Methods of Statistical Inference

eqr094: Hierarchical MCMC for Bayesian System Reliability

Using prior knowledge in frequentist tests

Luis Manuel Santana Gallego 100 Investigation and simulation of the clock skew in modern integrated circuits. Clock Skew Model

Evidence, eminence and extrapolation

A732: Exercise #7 Maximum Likelihood

Quantile POD for Hit-Miss Data

Estimating a Finite Population Mean under Random Non-Response in Two Stage Cluster Sampling with Replacement

Bureau International des Poids et Mesures

Part III. A Decision-Theoretic Approach and Bayesian testing

DIFFUSION NETWORKS, PRODUCT OF EXPERTS, AND FACTOR ANALYSIS. Tim K. Marks & Javier R. Movellan

GS Analysis of Microarray Data

GS Analysis of Microarray Data

HarperCollinsPublishers Ltd

PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 2: PROBABILITY DISTRIBUTIONS

Concentration of magnetic transitions in dilute magnetic materials

Parametric Empirical Bayes Methods for Microarrays

Solving Systems of Linear Equations Symbolically

The Bayesian Choice. Christian P. Robert. From Decision-Theoretic Foundations to Computational Implementation. Second Edition.

GS Analysis of Microarray Data

Minimizing a convex separable exponential function subject to linear equality constraint and bounded variables

FinQuiz Notes

Multilevel Statistical Models: 3 rd edition, 2003 Contents

INVERTED KUMARASWAMY DISTRIBUTION: PROPERTIES AND ESTIMATION

Inference on reliability in two-parameter exponential stress strength model

Probabilistic Modelling of Duration of Load Effects in Timber Structures

Non-Linear Regression Samuel L. Baker

Application of Pareto Distribution in Wage Models

DEPARTMENT OF ECONOMICS

A note on multiple imputation for general purpose estimation

Probabilistic Modelling and Reasoning: Assignment Modelling the skills of Go players. Instructor: Dr. Amos Storkey Teaching Assistant: Matt Graham

Ronald Christensen. University of New Mexico. Albuquerque, New Mexico. Wesley Johnson. University of California, Irvine. Irvine, California

Supplement to A Hierarchical Approach for Fitting Curves to Response Time Measurements

Bayesian Analysis of Latent Variable Models using Mplus

ON THE CONSEQUENCES OF MISSPECIFING ASSUMPTIONS CONCERNING RESIDUALS DISTRIBUTION IN A REPEATED MEASURES AND NONLINEAR MIXED MODELLING CONTEXT

MCS 115 Exam 2 Solutions Apr 26, 2018

Bayesian Inference: Concept and Practice

Nonparametric Estimation of Smooth Conditional Distributions

A Very Brief Summary of Statistical Inference, and Examples

Generalized Linear. Mixed Models. Methods and Applications. Modern Concepts, Walter W. Stroup. Texts in Statistical Science.

Statistical techniques for data analysis in Cosmology

A noninformative Bayesian approach to domain estimation

Distributed control of "ball on beam" system

Heriot-Watt University

PARAMETER ESTIMATION: BAYESIAN APPROACH. These notes summarize the lectures on Bayesian parameter estimation.

Bayesian Inference: Principles and Practice 3. Sparse Bayesian Models and the Relevance Vector Machine

arxiv: v1 [hep-ph] 20 Dec 2012

Parameter estimation and forecasting. Cristiano Porciani AIfA, Uni-Bonn

Markov Chain Monte Carlo

Distribution Theory. Comparison Between Two Quantiles: The Normal and Exponential Cases

Measurement error as missing data: the case of epidemiologic assays. Roderick J. Little

Practical Bayesian Quantile Regression. Keming Yu University of Plymouth, UK

STAT 499/962 Topics in Statistics Bayesian Inference and Decision Theory Jan 2018, Handout 01

A Brief Review of Probability, Bayesian Statistics, and Information Theory

McGill University. Department of Epidemiology and Biostatistics. Bayesian Analysis for the Health Sciences. Course EPIB-675.

#A50 INTEGERS 14 (2014) ON RATS SEQUENCES IN GENERAL BASES

arxiv: v1 [astro-ph] 2 Apr 2008

3 Forces and pressure Answer all questions and show your working out for maximum credit Time allowed : 30 mins Total points available : 32

The Mean Version One way to write the One True Regression Line is: Equation 1 - The One True Line

Bayesian Methods for Machine Learning

A Bayesian approach for estimating length-weight relationships in fishes

Uncertainty Quantification for Inverse Problems. November 7, 2011

Sampling Methods (11/30/04)

Testing Simple Hypotheses R.L. Wolpert Institute of Statistics and Decision Sciences Duke University, Box Durham, NC 27708, USA

Noninformative Priors for the Ratio of the Scale Parameters in the Inverted Exponential Distributions

On Bayesian Inference with Conjugate Priors for Scale Mixtures of Normal Distributions

Charles E. McCulloch Biometrics Unit and Statistics Center Cornell University

ON SMALL SAMPLE PROPERTIES OF PERMUTATION TESTS: INDEPENDENCE BETWEEN TWO SAMPLES

Fixed-b Inference for Testing Structural Change in a Time Series Regression

Intraclass Correlations in One-Factor Studies

Improving variational data assimilation through background and observation error adjustments

1 Caveats of Parallel Algorithms

A BAYESIAN MATHEMATICAL STATISTICS PRIMER. José M. Bernardo Universitat de València, Spain

Estimators for the binomial distribution that dominate the MLE in terms of Kullback Leibler risk

Stat260: Bayesian Modeling and Inference Lecture Date: February 10th, Jeffreys priors. exp 1 ) p 2

Online tests of Kalman filter consistency

Subject CS1 Actuarial Statistics 1 Core Principles

MATH 225: Foundations of Higher Matheamatics. Dr. Morton. 3.4: Proof by Cases

Spectrum Opportunity Detection with Weak and Correlated Signals

Notes to accompany Continuatio argumenti de mensura sortis ad fortuitam successionem rerum naturaliter contingentium applicata

Bayesian data analysis in practice: Three simple examples

Fiducial Inference and Generalizations

The Use of COMSOL for Integrated Hydrological Modeling

Transcription:

Supplement to: Guidelines for constructing a confidence interval for the intra-class correlation coefficient (ICC) Authors: Alexei C. Ionan, Mei-Yin C. Polley, Lisa M. McShane, Kevin K. Doin Section Page S1: A Review of Frequentist Methods 1 S2: A Review of Bayesian Methods 3 S3: Investigation of the Bayesian Interval Methods 4 S4: Samaniego Criteria 11 S5: Description of Excel File 12 S6: Marginal distriutions for simulation settings 13 S7: Tale of simulation settings 13 S8: Supplementary Figures 14 S9: Data analysis details 15 Section S1: A REVIEW OF FREQUENTIST METHODS There is no general formula for the exact distriution of the ICC which could e used for interval construction. The asymptotic distriution of the ICC (or a variance stailized transformation of the ICC) can e derived from pulished results (e.g., Searle et al., 2006). But simplistic intervals constructed from these asymptotics can e sustantially anticonservative (e.g., Zou and McDermott, 1999) ; this means, for example, that the proaility that a nominal 95% confidence interval for the ICC contains the true ICC can e well elow 95%. The rate of convergence of the ICC to these asymptotics is too slow for most practical situations. Two frequentist CI methods to address this prolem are discussed in the next two paragraphs. Uniform minimum variance uniased (UMVU) estimators of each variance component in the model have een derived (e.g., Lehmann and Casella, 1998). For linear cominations of variance components, Grayill and Wang (1980) showed that exact confidence intervals can e uilt from linear cominations of UMVU estimates in special cases where some of the mean squares are exactly zero; specifically, when some mean squares are zero, then coefficients on the other mean squares that provide the exact coverage can e derived mathematically. Cycling through and setting different variance components to zero results in coefficient estimates for all mean squares. Putting these together results in a modification of the usual interval limits ased on standard asymptotics. This approach is called the modified large sample (MLS) method. An MLS procedure for the ICC in the random two-way layout without interaction was developed in Cappelleri and Ting (2003). Supplement: Ionan et al. Page 1

Pivotal quantities are often used to construct confidence intervals. A pivotal quantity is a function of the parameter and the data with a known distriution (e.g., Lehmann, 1999) free of nuisance parameters. No pivot exists for the ICC in the two-way layout, ut a Generalized Pivotal Quantity (GPQ) does exist. A GPQ is a function of the parameter, the data, and oservale values of statistics; like the pivot, the GPQ has a distriution that can e descried in terms of known distriutions. The GPQ can e used to construct a Generalized Confidence Interval (GCI). The inclusion of oservale values means that there is a sutle difference etween the definition of the CI and the GCI (Weerahandi, 1993). More discussion appears in the Methods elow and in the 1993 reference. Frequentist methods have een compared in past papers. Cappelleri and Ting (2003) compared the MLS approach in a two-way layout without interaction to a Satterthwaite two moment approach, standard approach, and three moment approach. The MLS was found the est. Gilder et al. (2007) compared MLS and GCI in two-way layouts with an interaction term, and recommended the MLS for overall performance. GCI APPROACH EQUATIONS MLS APPROACH EQUATIONS Supplement: Ionan et al. Page 2

Section S2: A REVIEW OF BAYESIAN METHODS Bayesian intervals are defined within the Bayesian paradigm. In that paradigm, there is a prior distriution on variance components efore the experiment, and a posterior distriution after the experiment that reflects how the oserved data modified the prior. A 95% Bayesian (credile) interval contains 95% of the posterior proaility, that is, the proaility under the posterior density curve. The posterior distriution is proportional to the prior times the likelihood, so that a change in the prior also changes the posterior distriution. When scientists are reporting results, this dependence of a reported interval on the specified prior can distract from the findings; readers may wonder how much the interval depends on the prior used. This leads to non-informative priors which are appropriate when no prior data exists or when inference ased solely on the data is desired (Carlin and Louis, 2009, p. 36). Non-informative priors have een used Supplement: Ionan et al. Page 3

extensively in random effects models and shown to have good frequentist performance in some of these settings (Browne and Draper, 2006). Due to the wide availaility of Bayesian computing software, Bayesian credile intervals for the ICC are increasingly eing reported in the iomedical literature (Barzman et al., 2012). Bayesian approaches to linear models are discussed in Box and Tiao (1992) and Carlin and Louis (2009). Bayesian methods for fitting variance components are reviewed in Browne and Draper (2006). They report frequentist properties of the Bayesian methods for nested designs. In contrast to that work, we examine a crossed, rather than nested, model. Bayesian credile interval methods for parameters in one-way random effect linear models with priors on variance components are reviewed extensively in Gelman (2006). That paper considers a variety of non-informative prior distriutions. Gelman showed that the commonly used non-informative inverse gamma priors result in credile intervals for variance components which can perform poorly ecause they are very sensitive to the choice of the prior parameters. After examining a wide range of alternative distriutions, Gelman (2006) recommended improper uniform priors on standard deviations for this one-way ANOVA setting. In a susequent application, Gelman used a mildly informative prior. Gelman did not in this paper study the crossed, random effects model. Section S3: INVESTIGATION OF THE BAYESIAN INTERVAL METHODS (a) () (c) Figure S1: Marginal distriution of the responses for the non-normal simulations. Uniform (a) and mixture normal () densities ased on mathematical formulas presented in text. For the (shifted, scaled) gamma (rightmost plot) ICC =0.70 and ICC w=0.90. For the gamma (c), density estimates from 10 million Monte Carlo simulations. Priors for the ICC and ICC w that are induced y the inverse gamma, IG(0.001,0.001), distriution on each variance component are derived in the supplement and plotted in Figure 1(a). The priors are oth nearly Supplement: Ionan et al. Page 4

degenerate, with point masses at 0 and 1. As discussed in Carlin and Louis (2009), the noninformative inverse gamma distriution has a very heavy tail. This means that one variance parameter is likely to e much larger than one or oth of the others. The ICC prior is zero or one when one or more of the three 2 2 2 parameters, is infinite relative to the other(s). For example, on a typical modern personal, l e computer (PC), the largest machine numer is aout 2 10 308. But, for X~IG(0.001,0.001), P(X>2 10 308 ) 0.49. So, aout 49% of the random variales will e larger than the largest numer on a 2 2 typical PC, i.e., effectively infinite. Suppose. Then ICC / 1. On the other hand, if 2, ut 2 e 2 2 2 2, then ICC / 0 extreme finding in Figure 1(a). See Supplement Section S7 for more discussion. l l e. This explains the (a) () Figure S2: Prior distriutions on the intraclass correlation coefficients, etween-la (ICC ) and within-la (ICC w). If the priors on each standard deviation component are mutually independent and uniform over the positive real numers, then the prior distriution of the etween-la intra-class correlation (ICC =ρ ) and the withinla intra-class correlation (ICC w=ρ w) are derived in Supplement Section S1. The priors are Supplement: Ionan et al. Page 5

f ( ) wmin(1/,2 /(1 )) w0 wmin(1/,2 /(1 )) w0 w 1/ 2 w 1/ 2 k( w(1 )) 1 8 k( w(1 )) 1 8 2w(1 ) 1 dw 2w(1 ) 1 dw f 1 w) 4 1/ 2 3/ 2 2 2 3/ 1/ w (1 w) 1 1/ 2 w (1 w) 1 1/ 2 ( w w 2 2 where k( x) u 1/ (1 u) 1/ du. x x 1 These prior densities are plotted in Figure 1. As can e seen, the ICC w prior places most of its weight near 0 and 1. But the density is spread out enough to e non-degenerate across the range of possile values. For example, under this prior P(0.001 < ICC w <0.999)>0.99. The ICC density places more prior weight near zero compared to the ICC w prior, ut is also non-degenerate. For this prior, P(0.001< ICC <0.999)>0.99. 2.3 Choosing Bayesian priors We investigated the performance of inference using the classical noninformative inverted gamma distriution. A range of inverse gamma were considered, with focus on the popular IG(0.001,0.001) prior 0.001 1.001 defined y the proaility density function f ( x) Exp[ 1/(0.001x)]/ (0.001)0.001 x same as an inverted Wishart distriution. Our research found a numer of prolems with this prior:. This is the 1. When data were generated from the model of Equation (1), the coverage proailities of these intervals varied depending on true values of the population parameters. But, in real applications, these population parameters are unknown. For example, a 95% credile interval achieved 0.945 coverage when ICC w=0.80, ut this dropped to 0.880 when ICC w=0.99, where ICC =0.70 in oth cases. The coverage depends in complicated ways on the underlying truth. 2. The confidence intervals were not invariant under scale transformations (Lehmann, 1986). In other words, when the true data are transformed y a multiplicative factor c>0, resulting in 2 2 2 2 2 2 (,, ) ( c, c, c ), then the confidence interval changed as well. The ICC l e l e 2 2 2 2 2 2 2 2 itself is scale invariant, since ICC c c c c /. / l e l e For example, the mean width of the confidence interval for ICC changed from 0.42 to 0.54 when we changed from c=1 to c=0.01 (with 2 e =1, ICC w=0.9, ICC =0.70, ased on 10,000 simulations). In practical terms, this means that converting units from parts-per-million to partsper-ten-thousand, say, would result in much different IG-prior Bayesian intervals. Supplement: Ionan et al. Page 6

These two prolems remained under a wide variety of noninformative inverse gamma prior parameter settings examined. Therefore, the inverse gamma results are not presented, and only Bayesian results for the uniform distriution on standard deviations are studied. The uniform distriution displayed neither of these prolems, has een proposed as a gold standard in similar settings (Gelman, 2006), and can e implemented with existing widely-availale software. DETAILS OF THE INVERSE GAMMA PRIOR SOME CHECKING OF THE BAYESIAN PRIOR DERIVATIONS Supplement: Ionan et al. Page 7

Supplement: Ionan et al. Page 8

Supplement: Ionan et al. Page 9

Supplement: Ionan et al. Page 10

Section S4: SAMANIEGO CRITERIA Supplement: Ionan et al. Page 11

Section S5: DESCRIPTION OF THE EXCEL FILES FILENAME: Summary 201212-update.xls FILENAME: supp_summary 20140712-194908.xls presents results ased on 10,000 simulations per tale row. Design: Two numers with first eing 100 times the within-la ICC, and the second eing the numer of laoratories in the design (denoted l 0 ). The etween-la ICC is fixed at 0.70. Numer of iological replicates is fixed at 24 (denoted 0 ). Method: The confidence interval construction method used, either GCI, MLS or Bayes with uniform prior on standard deviations. Center_mean: The average of the midpoints of the confidence intervals. Center_sd: The standard deviation of the midpoints of the confidence intervals. Width_mean: The average width of the confidence intervals. Width_sd: The standard deviation of the widths of the confidence intervals. Coverage: The empirical coverage oserved. Supplement: Ionan et al. Page 12

Coverage>0.945: Indicator of whether column to the left is greater than 0.945. As ias in coverage: Bias of empirical coverage relative to nominal 95%. Mean ias in coverage Bayes: The average ias in the Bayesian coverage across all simulations. FILENAME: supp_coverage for Las 4 to 7.xls presents results ased on 10,000 simulations per tale row. Design: Two numers with first eing 100 times the within-la ICC, and the second eing the numer of laoratories in the design (denoted l 0 ). The etween-la ICC is fixed at 0.70. The numer of iological replicates is fixed at 24 (denoted 0 ). Method: The confidence interval construction method used, either GCI, MLS or Bayes with uniform prior on standard deviations. Center_mean: The average of the midpoints of the confidence intervals. Center_sd: The standard deviation of the midpoints of the confidence intervals. Width_mean: The average width of the confidence intervals. Width_sd: The standard deviation of the widths of the confidence intervals. Coverage: The empirical coverage oserved. FILENAME: supp_icc Estimates.xls presents results ased on 10,000 simulations per tale row. Design: Two numers with first eing 100 times the within-la ICC, and the second eing the numer of laoratories in the design (denoted l 0 ). The etween-la ICC is fixed at 0.70. The numer of iological replicates is fixed at 24 (denoted 0 ). ICChatMean: The mean value of the estimated etween-la ICC, using the formula in Saito et al. (2006). Supplement: Ionan et al. Page 13

ICChatSD: The standard deviation of the estimated etween-la ICC. Model: The model used to generate the data; for the gamma, given is the alpha parameter. Section S6: MARGINAL DISTRIBUTIONS FOR SIMULATION SETTINGS Supplement: Ionan et al. Page 14

Section S7: TABLE OF SIMULATION SETTINGS Section S8: SUPPLEMENTARY FIGURES Supplement: Ionan et al. Page 15

Figure aove: Simulation study with 48 iological replicates and 3 las for a total of 144 oservations. Comparison of MLS, GPQ and Bayes method performance. Nominal 95% confidence intervals. Coverages and average widths calculated from 10,000 simulations. (a) Mixture normal model data. 48 iological replicates and 3 las for a total of 144 oservations. Section S9: DATA ANALYSIS DETAILS Raw data CEL files and covariate sample data were downloaded from the National Cancer Institute s Center for Bioinformatics (NCICB) caarray 2.0 wesite (https://array.nci.nih.gov). The Experiment Identifier is doi00100. Data on the tumor samples were normalized using the Bioconductor affy suite (Gautier et al., 2004). The package s mas5 function was applied under R version 2.15.2. Further normalization consisted of identifying the suset of features with mas5 present calls for over half the arrays, then multiplying each array value y a constant to make the median of this feature set 500. Tumors with signal values elow 5.0 were truncated to 5.0, and the ase 2 logarithm transformation applied. REFERENCES Supplement: Ionan et al. Page 16

Barzman, D., Mossman, D., Sonnier, L., and Sorter, M. (2012). Brief rating of aggression y children and adolescents (BRACHA): A reliaility study. Journal of the American Academy of Psychiatry and the Law 40, 374-382. Box GEP and Tiao GC (1973) Bayesian Inference in Statistical Analysis. Wiley, New York. Browne WJ and Draper D (2006) A comparison of Bayesian and likelihood-ased methods for fitting multilevel models. Bayesian Analysis, 3, 473-514. Cappelleri JC and Ting N (2003) A modified large-sample approach to approximate interval estimation for a particular intraclass correlation coefficient. Statistics in Medicine, 22, 1861-1877. Carlin BP and Louis TA (2008) Bayesian Methods for Data Analysis, Third Edition. Chapman and Hall, Boca Raton. Gelman, A. (2006). Prior distriutions for variance parameters in hierarchical models. Bayesian Analysis 1(3), 515-533. Gautier L, Cope L, Bolstad BM, Irizarry RA (2004) Affy analysis of Affymetrix GeneChip data at the proe level. Bioinformatics, 20, 307-315. Gilder K, Ting N, Tian L., Cappelleri JC, Hanumara RC (2007) Confidence intervals on intraclass correlation coefficients in a alanced two-factor random design. Journal of Statistical Planning and Inference, 137, 1199-1212. Grayill FA and Wang C (1980) Confidence intervals on nonnegative linear cominations of variances. Journal of the American Statistical Association. 75: 869-873. Lehmann EL (1986) Testing Statistical Hypotheses, Second Edition. Springer-Verlag, New York. Lehmann EL and Casella G (1998) Theory of Point Estimation. Springer-Verlag, New York. Lehmann EL (1999) Elements of Large-Sample Theory. Springer-Verlag, New York. Searle SR, Casella G, McCulloch CE (2006) Variance Components. Wiley, New York. Weerahandi S (1993) Generalized confidence intervals. Journal of the American Statistical Association, 88, 899-905. Supplement: Ionan et al. Page 17

Zou KH and McDermott MP (1999) Higher-moment approaches to approximate interval estimation for a certain intraclass correlation coefficient. Statistics in Medicine, 18, 2051-2061. Supplement: Ionan et al. Page 18