Three-Level Modeling for Factorial Experiments With Experimentally Induced Clustering

Size: px
Start display at page:

Download "Three-Level Modeling for Factorial Experiments With Experimentally Induced Clustering"

Transcription

1 Three-Level Modeling for Factorial Experiments With Experimentally Induced Clustering John J. Dziak The Pennsylvania State University Inbal Nahum-Shani The University of Michigan Copyright 016, Penn State. All rights reserved. Technical Report Number Please send questions and comments to John Dziak, The suggested citation for this technical report is Dziak, J. J., & Nahum-Shani, I. (016). Three-level modeling for factorial experiments with experimentally induced clustering (Technical Report No ). University Park, PA: The Methodology Center, Penn State. This work was supported by Awards P50 DA010075, P50 DA039838, P01 CA180945, R01 DK097364, R01 AA0931, R01 DA039901, and K05 DA01806 from the National Institutes of Health. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.

2 Methodology Center Tech Report Three-Level Modeling With EIC Introduction In some experiments in the social sciences, participants enter the study as independent, but are then assigned into clusters (such as teams or therapy groups) by the investigators during the course of the study. This can be described as experimentally induced clustering (EIC). A recent manuscript by Nahum-Shani, Dziak, and Collins (016) provides specially constructed multilevel models for analyzing data from factorial experiments with EIC, and also provides power and sample size resources for planning them. They consider two kinds of EIC: full EIC, in which each participants are assigned to a cluster, and partial EIC, in which only some participants are so assigned, and one of the factors in the design tells whether a participant is assigned to belong to a cluster or not. They also provide analysis and power planning resources for EIC studies with or without a pretest measurement. Nahum-Shani, Dziak, and Collins (016) focused on two-level models, with Level 1 representing the participant and Level representing the cluster. These models assume that if the participant s pretest response (e.g., the participant s weight or body mass index at baseline) is available, it will be used by the investigator as a covariate in an ANCOVA-like model for predicting the posttest (e.g., the participant s weight or body mass index at a 6-month follow-up). However, another possible approach is to include the pretest as a repeated measure in a threelevel model in which time (pretest versus posttest) is used as an additional level. The covariateadjusted approach, as in Nahum-Shani, Dziak, and Collins (016), and the repeated measures approach, as in this technical report, require different assumptions. There are advantages and disadvantages to each approach, and neither is necessarily superior (see, e.g., Allison, 1990; Murray, 1998; Janega et al., 004).

3 Methodology Center Tech Report Three-Level Modeling With EIC 3 In Nahum-Shani, Dziak, and Collins (016) we focused on covariate-adjusted models for several reasons. First, they require fewer levels of nesting and hence are computationally much simpler and less costly than the repeated measures models. Second, the covariate-adjusted approach also offers more statistical power than the repeated measures approach under some scenarios (Vickers, 001; Vickers & Altman, 001; Dimitrov & Rumrill 003) although not all (Oakes & Feldman, 001). Third, the covariate-adjusted approach does not require that the pretest and posttest be measured on the same scale. In fact, the covariate-adjusted approach requires practically no assumptions about the distribution of the pretest, except those required of any regression predictor (i.e., that it have some variability in the sample, and that its relationship with posttest, if any, be linear). The three-level approach, on the other hand, requires that the variance structure of the pretest be jointly specified together with that of the posttest-- a much stronger requirement. However, it is possible that the three-level approach may still be of substantive or theoretical interest in specific situations. Therefore, as a supplement to Nahum-Shani, Dziak, and Collins (016), this technical report extends the models and power formulas provided in the manuscript to include another level for observation time, so as to model the response between pretest and posttest. Thus, this report provides models and power planning resources for the three-level approach. As in the manuscript, we begin for simplicity with a model for a single factor experiment with full EIC, and a similar model for partial EIC. We then generalize both to a model which allows multiple factors and either full or partial EIC. Last, we provide power formulas for factorial experiments with full and partial EIC using the repeated measures approach.

4 Methodology Center Tech Report Three-Level Modeling With EIC 4 Models for Repeated Measures Experiments with EIC Full EIC with a Single Factor Let us begin by considering the case of a randomized controlled trial (RCT), involving a single dichotomous treatment factor X, in which there is full EIC (i.e., clusters are generated for all participants). Nahum-Shani, Dziak, and Collins (016) point out that the pretest in an experiment with EIC must be included in the analysis model in a special way. Specifically, such a model must specify that the cluster-level random effect exists only at posttest and not at pretest, because clusters are generated after treatment assignment. Let T denote the observation time, coded as T = 0 for pretest and T = 1 for posttest. Let j be the index representing cluster membership, and i be the index of individuals within a cluster. For example, the dataset might appear as follows: Cluster (j) Individual (i) Time (t) Within Cluster (0=Pre, 1=Post) X Y

5 Methodology Center Tech Report Three-Level Modeling With EIC 5 With Levels 1, and 3 representing observations time, participant and cluster respectively, the response Y tij at observation time t, for participant i, in cluster j can be modeled as follows: Level 1: Level : Y tij = π 0ij + π 1ij T tij + e tij π 0ij = β 00j + r ij π 1ij = β 10j Level 3: β 00j = γ 000 β 10j = γ γ 101 X j + u j Combined: Y tij = (γ r ij ) + (γ u j )T tij + γ 101 X j T tij + e tij, (1) where e tij ~N(0, σ e ), r ij ~N(0, τ r ), and u j ~N(0, τ u ). The r ij and u j represent the effects of participant and cluster, respectively, and the e tij represent a combination of measurement error and any random subject-by-time interaction (random slope) that may exist. The fixed effects are represented by the γ coefficients. More specifically, the pretest mean is represented by γ 000 ; the posttest mean is represented by γ γ 100 for the control condition and by γ γ γ 101 for the experimental condition. Thus γ 101 is the fixed treatment effect of interest. In this model the cluster-level random effect u j is multiplied by the dummy-coded time indicator (T tij ) in order to allow cluster-level variation in the posttest response but not in the pretest response (i.e., only when T tij = 1). The pretest response is not subject to cluster-level variation, so the random effect for cluster is multiplied by zero when entering the model for the pretest. The posttest response will have cluster-level variation for all subjects, regardless of their experimental condition (i.e., regardless of X j ), because the design is assumed to involve full EIC.

6 Methodology Center Tech Report Three-Level Modeling With EIC 6 Partial EIC with a Single Factor Where there is partial EIC, it is necessary to take cluster-level variation into account for participants in the experimental condition, but not for those in the control condition, because the latter are independent of other participants. However, standard multilevel model notation involves using the same number of subscripts (representing levels within the nesting structure) for each participant. To resolve this dilemma, Nahum-Shani, Dziak, and Collins (016) use a notational approach strategy based on that of Roberts and Roberts (005) and Bauer et al. (008). Specifically, if an individual is not assigned to a cluster (e.g., therapy group) for treatment administration purposes, then the individual is considered to consist of a trivial cluster of size one for analysis purposes. The response can still be written as Y tij, with t representing time, i representing participant, and j representing cluster. However, trivial clusters will contain only i=1. For example, the dataset might appear as follows: Cluster (j) Individual (i) Time (t) Within Cluster (0=Pre, 1=Post) X Y

7 Methodology Center Tech Report Three-Level Modeling With EIC 7 With Levels 1,, and 3 representing observation time, participant and cluster respectively, let Level 1: Level : Y tij = π 0ij + π 1ij T tij + e tij π 0ij = β 00j + r ij π 1ij = β 10j Level 3: β 00j = γ 000 β 10j = γ (γ u j )X j Combined: Y tij = (γ r ij ) + γ 100 T tij + (γ u j )X j T tij + e tij () where e tij ~N(0, σ e ), r ij ~N(0, τ r ), and u j ~N(0, τ u ). Here, the pretest mean is represented by γ 000 ; the posttest mean is represented by γ γ 100 for the control condition, and by γ γ γ 101 for the experimental condition. Thus γ 101 is the overall treatment effect. Model differs from Model 1 in that the cluster-level random effect u j is now multiplied by the condition and time dummy codes (X j T tij ), rather than by the time dummy code only (T tij ). Thus, Model allows a cluster-level variation only in the posttest responses of the clustered condition (i.e., only if both X j = 1 and T tij = 1) since these are the only responses for which meaningful clusters exist in the design under consideration. Model 1, on the other hand, allows cluster-level variation in the posttest response of both conditions. Optionally, the error variance σ e can be allowed to differ between pretest and posttest, and/or between the clustered and the unclustered conditions at posttest. The error variance parameter can be written as σ e,t,s where t is 0 for pretest and 1 for posttest, and s is 0 for clustered and 1 for unclustered. This notation may be unnecessarily complex. In particular, the

8 Methodology Center Tech Report Three-Level Modeling With EIC 8 clustered and unclustered conditions should not differ at pretest, although they might differ at posttest. Thus, it is reasonable to assume that the pretest error variances for the unclustered participants (denoted σ e,0,0 ) and the pretest error variance for the clustered participants (denoted σ e,0,1 ) are equal; hence we refer to both as σ e,0,0 for convenience. The posttest error variance for unclustered participants (σ e,1,0 ), and the posttest error variance for clustered participants (σ e,1,1 ) can each still differ from σ e,0,0 and from each other. Note that Models 1 and above assume that there is zero intraclass correlation (ICC) at pretest, as well as no systematic difference between control and experimental group responses at pretest. Thus, these models are appropriate only when participants are not only randomly assigned to conditions but also randomly assigned to clusters. If this is not the case, then the experiment might be better conceptualized as a between-clusters design with pre-existing clustering, as discussed in Dziak, Nahum-Shani, and Collins (01). Extension to Multiple Factors In the manuscript, we employ an approach that combines the advantages of both dummy coding and effect coding (see the discussion of Models 5 and 6 in Nahum-Shani, Dziak, and Collins, 016). This approach enables scientists to both conveniently model the random cluster effect for only clustered participants (by using dummy coding so that random effects could be multiplied by zero and hence cancel out where appropriate), but yet interpret the effects of multiple factors in a straightforward manner (by using effect coding for the fixed effects model). We continue to use this approach here. As in the manuscript, assuming three dichotomous factors for simplicity, let X 1, X and X 3 be the effect-coded representations (-1 for Off and +1 for On) of the first, second, and third factors. Also, let C be an indicator of whether the participant has been assigned to a nontrivial

9 Methodology Center Tech Report Three-Level Modeling With EIC 9 cluster (0 for no and 1 for yes). In the case of partial EIC, C is a dummy-coded version of the assumed cluster-generating factor X 1 ; that is, C = 1 if X 1 = +1, and C = 0 if X 1 = 1. In the case of full EIC, C = 1 for everyone since all participants are assigned to nontrivial clusters. As before, let Y tij be the response at time t for individual i in cluster j. With Levels 1,, and 3 representing observation time, participant and cluster respectively, we can model Y tij as follows: Level 1: Level : Y tij = π 0ij + π 1ij T tij + e tij π 0ij = β 00j + r ij π 1ij = β 10j Level 3: β 00j = γ 000 β 10j = γ u j C j + γ 101j X 1j + γ 10j X j + γ 103j X 3j Combined: Y tij = γ r ij + γ 100 T tij + u j C j T tij + γ 101 X 1j T tij (3) + γ 10 X j T tij + γ 103 X 3j T tij + e tij where e tij ~ N(0, σ e ), r ij ~N(0, τ r ), and u j ~N(0, τ u ); C and T are dummy coded (i.e., 0 or 1); and X 1, X, and X 3 are effect-coded (i.e., -1 or +1). As before, the error variance can differ between pretest (denote this as σ e,0,0 ), unclustered participants posttest (σ e,1,0 ), and clustered participants posttest (σ e,1,1 ). τ r represents the person-level variance in pretest to posttest change, and τ u represents the cluster-level variance in pretest to posttest change for the clustered conditions. Notice that if C j is a dummy-coded version of X 1j (i.e., the partial EIC setting), then Model 3 instead becomes a generalization of Model to multiple factors; if C j is 1 for all participants (i.e., the full EIC setting), then Model 3 becomes a multiple-factor generalization of Model 1. In other words, both Models 1 and are actually special cases of Model 3. Model 3 can

10 Methodology Center Tech Report Three-Level Modeling With EIC 10 be extended to include interactions, as in Model 6 in the Nahum-Shani, Dziak, and Collins (016) manuscript. Power Planning for Repeated Measures Experiments with EIC As discussed in the manuscript, one can estimate the power for a test of a main effect or an interaction in a linear mixed model using the noncentral F distribution. That is, we assume that the power can be approximated by the probability that a noncentral F 1,ν variate, having noncentrality parameter λ, exceeds the critical value κ of the test to be performed. Here, κ is the value such that a central F 1,ν variate has only a probability α of exceeding κ under H 0, and γ λ = Var(γ ) (4) where γ is the regression parameter in question. As in the manuscript, we recommend that in the full EIC design the degrees of freedom ν should be calculated as the number of clusters minus the number of regression coefficients to be estimated. In planning experiments with partial EIC, the number of nontrivial clusters minus the number of regression coefficients would be appropriate as a conservative initial estimate of ν. However, for analysis purposes in partial EIC designs it is better that ν be empirically estimated using Satterthwaite s approximation (see Roberts & Roberts, 005; Nahum-Shani, Dziak, & Collins, 016). The formulas for Var(γ ) in Model 3, and informal derivations thereof, are provided below for full and partial factorial designs. Sampling Variance for Full EIC with Repeated Measures (Model 3; C j 1) The three-level repeated measures or "time condition" (Murray, 1998) approach of Model 3 is equivalent, assuming balance, to an analysis of gain scores (see Dimitrov & Rumrill, 003). That is, the treatment effect of interest is the difference between conditions in the mean

11 Methodology Center Tech Report Three-Level Modeling With EIC 11 change score (posttest minus pretest; see Murray, 1998, p. 181). Therefore, let G ij = Y 1ij Y 0ij denote the difference in response between pretest and posttest for an individual i in cluster j. Then, define μ (X k =L) to equal the sample estimate of E(G X k = L). Given this new notation, and following the logic discussed in Appendix B of the manuscript, we use the identities γ k = 1 (μ (X k =+1) of μ (X k =+1) μ (X k = 1) ) and Var(γ k) = 1 Var (μ (X k =+1) ). Asymptotically, the sampling variance should be the same as that of the sampling variance of G ij:x k =+1, that is, the average of the G ij over the participants with X k = +1. The random-effects part of G ij is u j + e 1ij e 0ij, each term of which is independent of the others. Assuming balance, in G ij:xk =+1, there are J/ independent subject-level u j terms being averaged together, nj/ independent observation-level e 0ij terms being averaged together, and nj/ independent observation-level e 1ij terms being averaged together, where J represents the total number of clusters summed across all conditions, and n is the number of members per cluster. Thus, assuming that pretest and posttest have the same error variance σ e, it then follows that Var(γ k) = Var ( μ (X k =+1) μ (Xk = 1) ) (+1) ( 1) = 1 Var 4 (μ (X k =+1) = 1 Var (μ (X k =+1) ) ) + 1 Var 4 (μ (X k = 1) ) = 1 ( τ u J = τ u + σ e J + σ e nj nj + σ e nj ) (5) where τ u is the cluster-level variance in posttest. Comparing expression 5 superficially to the corresponding formula without pretest (see Table 4 in the manuscript), it seems as though including the pretest has increased sampling error.

12 Methodology Center Tech Report Three-Level Modeling With EIC 1 Specifically, the formula for Var(γ k) in the two-level model without pretest was τ u + σ e, but J nj with pretest as a repeated measure the formula is now τ u + σ e. However, this would be a J misleading conclusion. In fact, σ e as defined in the three-level model is likely to be much smaller than σ e as defined in the posttest-only model. This is because the subject-level variance in the three-level model is partitioned into τ r, which is the person-level variance in pretest to posttest change (random but stable subject-level differences) and σ e (random subject-by-time interaction plus random measurement error). That is, the posttest variance after adjusting for any cluster or treatment effects but not adjusting for pretest is σ Y = σ e + τ r. The second term, τ r, is removed in the three-level model by effectively subtracting away the pretest. In contrast, in the posttest-only two-level model, all of the subject-level random variability was assigned to σ e and none was removed. Thus, σ e = σ Y in the posttest-only model, but σ e = (1 ρ pre,post )σ Y in the repeated measures model, where nj ρ pre,post = = Corr(Y 0, Y 1 X, u) Cov(Y 0, Y 1 X, u) Var(Y 0 X, u) Var(Y 1 X, u) = τ r σ Y and where σ Y in both cases represents within-cluster posttest variance, that is, posttest variance after adjusting for any cluster or treatment effects but not adjusting for pretest. Thus, the pretest is worthwhile if ρ pre,post > 1 (just as in classical unclustered experiments; see Frison & Pocock, 199; Vickers, 001). Following the logic presented in Appendix B of the manuscript, Var(γ k) can be re-expressed in terms of the pretest-posttest correlation correlation ρ pre,post, the posttest intraclass correlation ρ Y1 and the within-cluster marginal variance σ Y. Specifically, we already saw that σ e = (1 ρ pre,post )σ Y. Also, the

13 Methodology Center Tech Report Three-Level Modeling With EIC 13 posttest covariance between two members i and i within the same cluster is τ r, and the total marginal variance of the random effects at pretest, including random cluster effects, is τ r + τ u + σ e = τ u + σ Y. Therefore ρ Y1 = τ u τ u +σ and we can write τ ρ Y 1 u = σ Y Y (1 ρ Y 1 ). Then expression 5 is equivalent to Var(γ k) = σ Y ( ρ Y1 + (1 ρ pre,post) (1 ρ Y 1 )J Jn ). (6) In (4), if γ k is specified as a multiple of σ e, then σ Y cancels out from the numerator and denominator, and only the correlations still need to be specified. Past literature may help researchers find realistic candidate values for ρ Y1 and ρ pre,post. Sampling Variance for Partial EIC with Repeated Measures (Model 3; C j is 0 or 1 depending on X 1 ). We now consider the partial EIC case. We assume that participants are assigned to clusters if they receive a treatment condition with X 1 = +1, but left unclustered if they receive a treatment condition with X 1 = 1. Analogously to the full EIC case, we replace the analysis with an analysis of change scores G ij to argue that Var(γ k) = 1 4 Var (μ (X 1 =+1) ) Var (μ (X 1 = 1) ). The random effects part of G ij is C j u j + e 1ij e 0ij. Thus, Var (μ (X 1 =+1) ) = τ u + σ e,0,1 J 1 nj 1 + σ e,1,1 nj 1 and

14 Methodology Center Tech Report Three-Level Modeling With EIC 14 Var (μ (X 1 = 1) ) = σ e,0,0 J 0 + σ e,1,0 J 0 where σ e,0,0, σ e,0,1, σ e,1,0 and σ e,1,1 are the error variances for unclustered pretests, clustered pretests, unclustered posttests and clustered posttests respectively, and τ u is the cluster-level variance in pretest to posttest change for the clustered conditions. Therefore, Var(γ k) = τ u + σ e,0,0 4J 1 4J 0 + σ e,1,0 4J 0 + σ e,0,1 + σ e,1,1. 4nJ 1 4nJ 1 J 1, J 0 and n above represent the number of clusters, number of unclustered individuals, and number of members per nontrivial cluster respectively, so that J 1 n is the number of clustered individuals and J 0 + J 1 n is the total number of participants. σ e,0,0, σ e,0,1, σ e,1,0 and σ e,1,1 denote the error variances for unclustered pretests, clustered pretests, unclustered posttests and clustered posttests respectively; and τ u is the cluster-level variance in pretest to posttest change for the clustered conditions. Assuming that the pretest error variances σ e,0,0 (for unclustered individuals) and σ e,0,1 (for individuals to be clustered) are equal, we can refer to both as σ e,0,0, and combine their terms. Further simplification is possible if we assume that pretest and posttest error variances are equal, such that σ e,0,0 = σ e,0,1 = σ e,1,0 = σ e,1,1 = σ e. In this case, following the same reasoning as in the previous section, Var(γ k) can be re-expressed in terms of correlations and marginal variances ρ Y Var(γ k) = σ Y ( + 1 ρ pre,post 4(1 ρ Y )J 1 J 1 n + 1 ρ pre,post J 0 ), where ρ Y is the posttest ICC for clustered individuals, adjusting for treatment; and ρ pre,post represents the pretest-posttest correlation after adjusting for treatment for unclustered

15 Methodology Center Tech Report Three-Level Modeling With EIC 15 individuals, and the pretest-posttest correlation after adjusting for treatment and cluster for clustered individuals.

16 Methodology Center Tech Report Three-Level Modeling With EIC 16 References Allison, P. D. (1990). Change scores as dependent variables in regression analysis. Sociological Methodology, 0, Bauer, D. J., Sterba, S. K., & Hallfors, D. D. (008). Evaluating group-based interventions when control participants are ungrouped. Multivariate Behavioral Research, 43, Dimitrov, D. M., & Rumrill, P. D. (003). Pretest-posttest designs and measurement of change. Work, 0, Dziak, J. J., Nahum-Shani, I., & Collins, L. M. (01). Multilevel factorial experiments for developing behavioral interventions: power, sample size, and resource considerations. Psychological Methods, 17: Frison, L., & Pocock, S. J. (199). Repeated measures in clinical trials: analysis using mean summary statistics and its implications for design. Statistics in Medicine, 11, Janega, J. B., Murray, D. M., Varnell, S. P., Blitstein, J. L., Birnbaum, A. S., & Lytle, L. A. (004). Assessing intervention effects in a school-based nutrition intervention trial: Which analytic model is most powerful? Health Education & Behavior, 31, Murray, D. M. (1998). Design and analysis of group-randomized trials (Vol. 9). Oxford University Press. Nahum-Shani, I., Dziak, J. J., and Collins, L. M. (016). Multilevel Factorial Designs With Experiment-Induced Clustering. Submitted to Psychological Methods. Oakes, J. M., & Feldman, H. A. (001). Statistical power for nonequivalent pretest-posttest designs. The impact of change-score versus ANCOVA models. Evaluation Review, 5, 3-8.

17 Methodology Center Tech Report Three-Level Modeling With EIC 17 Roberts, C., & Roberts, S. A. (005). Design and analysis of clinical trials with clustering effects due to treatment. Clinical Trials,, Vickers, A. J. (001). The use of percentage change from baseline as an outcome in a controlled trial is statistically inefficient: a simulation study. BMC Medical Research Methodology, 1:6. Accessed at Vickers, A. J., & Altman, D. G. (001). Analysing controlled trials with baseline and follow up measurements. BMJ, 33,

Formula for the t-test

Formula for the t-test Formula for the t-test: How the t-test Relates to the Distribution of the Data for the Groups Formula for the t-test: Formula for the Standard Error of the Difference Between the Means Formula for the

More information

ANCOVA. Lecture 9 Andrew Ainsworth

ANCOVA. Lecture 9 Andrew Ainsworth ANCOVA Lecture 9 Andrew Ainsworth What is ANCOVA? Analysis of covariance an extension of ANOVA in which main effects and interactions are assessed on DV scores after the DV has been adjusted for by the

More information

Incorporating Cost in Power Analysis for Three-Level Cluster Randomized Designs

Incorporating Cost in Power Analysis for Three-Level Cluster Randomized Designs DISCUSSION PAPER SERIES IZA DP No. 75 Incorporating Cost in Power Analysis for Three-Level Cluster Randomized Designs Spyros Konstantopoulos October 008 Forschungsinstitut zur Zukunft der Arbeit Institute

More information

Accepted Manuscript. Comparing different ways of calculating sample size for two independent means: A worked example

Accepted Manuscript. Comparing different ways of calculating sample size for two independent means: A worked example Accepted Manuscript Comparing different ways of calculating sample size for two independent means: A worked example Lei Clifton, Jacqueline Birks, David A. Clifton PII: S2451-8654(18)30128-5 DOI: https://doi.org/10.1016/j.conctc.2018.100309

More information

Longitudinal Data Analysis of Health Outcomes

Longitudinal Data Analysis of Health Outcomes Longitudinal Data Analysis of Health Outcomes Longitudinal Data Analysis Workshop Running Example: Days 2 and 3 University of Georgia: Institute for Interdisciplinary Research in Education and Human Development

More information

Sample Size and Power Considerations for Longitudinal Studies

Sample Size and Power Considerations for Longitudinal Studies Sample Size and Power Considerations for Longitudinal Studies Outline Quantities required to determine the sample size in longitudinal studies Review of type I error, type II error, and power For continuous

More information

Comparing Change Scores with Lagged Dependent Variables in Models of the Effects of Parents Actions to Modify Children's Problem Behavior

Comparing Change Scores with Lagged Dependent Variables in Models of the Effects of Parents Actions to Modify Children's Problem Behavior Comparing Change Scores with Lagged Dependent Variables in Models of the Effects of Parents Actions to Modify Children's Problem Behavior David R. Johnson Department of Sociology and Haskell Sie Department

More information

BIOS 6649: Handout Exercise Solution

BIOS 6649: Handout Exercise Solution BIOS 6649: Handout Exercise Solution NOTE: I encourage you to work together, but the work you submit must be your own. Any plagiarism will result in loss of all marks. This assignment is based on weight-loss

More information

Using Power Tables to Compute Statistical Power in Multilevel Experimental Designs

Using Power Tables to Compute Statistical Power in Multilevel Experimental Designs A peer-reviewed electronic journal. Copyright is retained by the first or sole author, who grants right of first publication to the Practical Assessment, Research & Evaluation. Permission is granted to

More information

8/04/2011. last lecture: correlation and regression next lecture: standard MR & hierarchical MR (MR = multiple regression)

8/04/2011. last lecture: correlation and regression next lecture: standard MR & hierarchical MR (MR = multiple regression) psyc3010 lecture 7 analysis of covariance (ANCOVA) last lecture: correlation and regression next lecture: standard MR & hierarchical MR (MR = multiple regression) 1 announcements quiz 2 correlation and

More information

ANALYSIS OF CORRELATED DATA SAMPLING FROM CLUSTERS CLUSTER-RANDOMIZED TRIALS

ANALYSIS OF CORRELATED DATA SAMPLING FROM CLUSTERS CLUSTER-RANDOMIZED TRIALS ANALYSIS OF CORRELATED DATA SAMPLING FROM CLUSTERS CLUSTER-RANDOMIZED TRIALS Background Independent observations: Short review of well-known facts Comparison of two groups continuous response Control group:

More information

Time-Invariant Predictors in Longitudinal Models

Time-Invariant Predictors in Longitudinal Models Time-Invariant Predictors in Longitudinal Models Today s Class (or 3): Summary of steps in building unconditional models for time What happens to missing predictors Effects of time-invariant predictors

More information

UNIVERSITY OF TORONTO Faculty of Arts and Science

UNIVERSITY OF TORONTO Faculty of Arts and Science UNIVERSITY OF TORONTO Faculty of Arts and Science December 2013 Final Examination STA442H1F/2101HF Methods of Applied Statistics Jerry Brunner Duration - 3 hours Aids: Calculator Model(s): Any calculator

More information

The Application and Promise of Hierarchical Linear Modeling (HLM) in Studying First-Year Student Programs

The Application and Promise of Hierarchical Linear Modeling (HLM) in Studying First-Year Student Programs The Application and Promise of Hierarchical Linear Modeling (HLM) in Studying First-Year Student Programs Chad S. Briggs, Kathie Lorentz & Eric Davis Education & Outreach University Housing Southern Illinois

More information

Describing Change over Time: Adding Linear Trends

Describing Change over Time: Adding Linear Trends Describing Change over Time: Adding Linear Trends Longitudinal Data Analysis Workshop Section 7 University of Georgia: Institute for Interdisciplinary Research in Education and Human Development Section

More information

Estimation and Centering

Estimation and Centering Estimation and Centering PSYED 3486 Feifei Ye University of Pittsburgh Main Topics Estimating the level-1 coefficients for a particular unit Reading: R&B, Chapter 3 (p85-94) Centering-Location of X Reading

More information

Multilevel Modeling: A Second Course

Multilevel Modeling: A Second Course Multilevel Modeling: A Second Course Kristopher Preacher, Ph.D. Upcoming Seminar: February 2-3, 2017, Ft. Myers, Florida What this workshop will accomplish I will review the basics of multilevel modeling

More information

Workshop on Statistical Applications in Meta-Analysis

Workshop on Statistical Applications in Meta-Analysis Workshop on Statistical Applications in Meta-Analysis Robert M. Bernard & Phil C. Abrami Centre for the Study of Learning and Performance and CanKnow Concordia University May 16, 2007 Two Main Purposes

More information

Designing Multilevel Models Using SPSS 11.5 Mixed Model. John Painter, Ph.D.

Designing Multilevel Models Using SPSS 11.5 Mixed Model. John Painter, Ph.D. Designing Multilevel Models Using SPSS 11.5 Mixed Model John Painter, Ph.D. Jordan Institute for Families School of Social Work University of North Carolina at Chapel Hill 1 Creating Multilevel Models

More information

Power in Randomized Group Comparisons: The Value of Adding a Single Intermediate Time Point to a Traditional Pretest Posttest Design

Power in Randomized Group Comparisons: The Value of Adding a Single Intermediate Time Point to a Traditional Pretest Posttest Design Psychological Methods Copyright 00 by the American Psychological Association, Inc 00, Vol 7, No, 194 09 108-989X/0/$500 DOI: 101037//108-989X7194 Power in Randomized Group Comparisons: The Value of Adding

More information

Specifying Latent Curve and Other Growth Models Using Mplus. (Revised )

Specifying Latent Curve and Other Growth Models Using Mplus. (Revised ) Ronald H. Heck 1 University of Hawai i at Mānoa Handout #20 Specifying Latent Curve and Other Growth Models Using Mplus (Revised 12-1-2014) The SEM approach offers a contrasting framework for use in analyzing

More information

Time Invariant Predictors in Longitudinal Models

Time Invariant Predictors in Longitudinal Models Time Invariant Predictors in Longitudinal Models Longitudinal Data Analysis Workshop Section 9 University of Georgia: Institute for Interdisciplinary Research in Education and Human Development Section

More information

International Journal of Statistics: Advances in Theory and Applications

International Journal of Statistics: Advances in Theory and Applications International Journal of Statistics: Advances in Theory and Applications Vol. 1, Issue 1, 2017, Pages 1-19 Published Online on April 7, 2017 2017 Jyoti Academic Press http://jyotiacademicpress.org COMPARING

More information

Tutorial 6: Tutorial on Translating between GLIMMPSE Power Analysis and Data Analysis. Acknowledgements:

Tutorial 6: Tutorial on Translating between GLIMMPSE Power Analysis and Data Analysis. Acknowledgements: Tutorial 6: Tutorial on Translating between GLIMMPSE Power Analysis and Data Analysis Anna E. Barón, Keith E. Muller, Sarah M. Kreidler, and Deborah H. Glueck Acknowledgements: The project was supported

More information

Least Squares Analyses of Variance and Covariance

Least Squares Analyses of Variance and Covariance Least Squares Analyses of Variance and Covariance One-Way ANOVA Read Sections 1 and 2 in Chapter 16 of Howell. Run the program ANOVA1- LS.sas, which can be found on my SAS programs page. The data here

More information

Abstract Title Page Title: A method for improving power in cluster randomized experiments by using prior information about the covariance structure.

Abstract Title Page Title: A method for improving power in cluster randomized experiments by using prior information about the covariance structure. Abstract Title Page Title: A method for improving power in cluster randomized experiments by using prior information about the covariance structure. Author(s): Chris Rhoads, University of Connecticut.

More information

Modelling heterogeneous variance-covariance components in two-level multilevel models with application to school effects educational research

Modelling heterogeneous variance-covariance components in two-level multilevel models with application to school effects educational research Modelling heterogeneous variance-covariance components in two-level multilevel models with application to school effects educational research Research Methods Festival Oxford 9 th July 014 George Leckie

More information

DESIGNING EXPERIMENTS AND ANALYZING DATA A Model Comparison Perspective

DESIGNING EXPERIMENTS AND ANALYZING DATA A Model Comparison Perspective DESIGNING EXPERIMENTS AND ANALYZING DATA A Model Comparison Perspective Second Edition Scott E. Maxwell Uniuersity of Notre Dame Harold D. Delaney Uniuersity of New Mexico J,t{,.?; LAWRENCE ERLBAUM ASSOCIATES,

More information

Measurement Invariance (MI) in CFA and Differential Item Functioning (DIF) in IRT/IFA

Measurement Invariance (MI) in CFA and Differential Item Functioning (DIF) in IRT/IFA Topics: Measurement Invariance (MI) in CFA and Differential Item Functioning (DIF) in IRT/IFA What are MI and DIF? Testing measurement invariance in CFA Testing differential item functioning in IRT/IFA

More information

Time-Invariant Predictors in Longitudinal Models

Time-Invariant Predictors in Longitudinal Models Time-Invariant Predictors in Longitudinal Models Today s Topics: What happens to missing predictors Effects of time-invariant predictors Fixed vs. systematically varying vs. random effects Model building

More information

One-Way ANOVA. Some examples of when ANOVA would be appropriate include:

One-Way ANOVA. Some examples of when ANOVA would be appropriate include: One-Way ANOVA 1. Purpose Analysis of variance (ANOVA) is used when one wishes to determine whether two or more groups (e.g., classes A, B, and C) differ on some outcome of interest (e.g., an achievement

More information

Ron Heck, Fall Week 8: Introducing Generalized Linear Models: Logistic Regression 1 (Replaces prior revision dated October 20, 2011)

Ron Heck, Fall Week 8: Introducing Generalized Linear Models: Logistic Regression 1 (Replaces prior revision dated October 20, 2011) Ron Heck, Fall 2011 1 EDEP 768E: Seminar in Multilevel Modeling rev. January 3, 2012 (see footnote) Week 8: Introducing Generalized Linear Models: Logistic Regression 1 (Replaces prior revision dated October

More information

r(equivalent): A Simple Effect Size Indicator

r(equivalent): A Simple Effect Size Indicator r(equivalent): A Simple Effect Size Indicator The Harvard community has made this article openly available. Please share how this access benefits you. Your story matters Citation Rosenthal, Robert, and

More information

Step 2: Select Analyze, Mixed Models, and Linear.

Step 2: Select Analyze, Mixed Models, and Linear. Example 1a. 20 employees were given a mood questionnaire on Monday, Wednesday and again on Friday. The data will be first be analyzed using a Covariance Pattern model. Step 1: Copy Example1.sav data file

More information

Ron Heck, Fall Week 3: Notes Building a Two-Level Model

Ron Heck, Fall Week 3: Notes Building a Two-Level Model Ron Heck, Fall 2011 1 EDEP 768E: Seminar on Multilevel Modeling rev. 9/6/2011@11:27pm Week 3: Notes Building a Two-Level Model We will build a model to explain student math achievement using student-level

More information

1. (Rao example 11.15) A study measures oxygen demand (y) (on a log scale) and five explanatory variables (see below). Data are available as

1. (Rao example 11.15) A study measures oxygen demand (y) (on a log scale) and five explanatory variables (see below). Data are available as ST 51, Summer, Dr. Jason A. Osborne Homework assignment # - Solutions 1. (Rao example 11.15) A study measures oxygen demand (y) (on a log scale) and five explanatory variables (see below). Data are available

More information

Pearson s Product Moment Correlation: Sample Analysis. Jennifer Chee. University of Hawaii at Mānoa School of Nursing

Pearson s Product Moment Correlation: Sample Analysis. Jennifer Chee. University of Hawaii at Mānoa School of Nursing Pearson s 1 Running head: PEARSON S PRODUCT MOMENT CORRELATION Pearson s Product Moment Correlation: Sample Analysis Jennifer Chee University of Hawaii at Mānoa School of Nursing Pearson s 2 Introduction

More information

Analysis of repeated measurements (KLMED8008)

Analysis of repeated measurements (KLMED8008) Analysis of repeated measurements (KLMED8008) Eirik Skogvoll, MD PhD Professor and Consultant Institute of Circulation and Medical Imaging Dept. of Anaesthesiology and Emergency Medicine 1 Day 2 Practical

More information

BIOSTATISTICAL METHODS

BIOSTATISTICAL METHODS BIOSTATISTICAL METHODS FOR TRANSLATIONAL & CLINICAL RESEARCH Cross-over Designs #: DESIGNING CLINICAL RESEARCH The subtraction of measurements from the same subject will mostly cancel or minimize effects

More information

Review of Multiple Regression

Review of Multiple Regression Ronald H. Heck 1 Let s begin with a little review of multiple regression this week. Linear models [e.g., correlation, t-tests, analysis of variance (ANOVA), multiple regression, path analysis, multivariate

More information

Class Notes: Week 8. Probit versus Logit Link Functions and Count Data

Class Notes: Week 8. Probit versus Logit Link Functions and Count Data Ronald Heck Class Notes: Week 8 1 Class Notes: Week 8 Probit versus Logit Link Functions and Count Data This week we ll take up a couple of issues. The first is working with a probit link function. While

More information

Abstract Title Page. Title: Degenerate Power in Multilevel Mediation: The Non-monotonic Relationship Between Power & Effect Size

Abstract Title Page. Title: Degenerate Power in Multilevel Mediation: The Non-monotonic Relationship Between Power & Effect Size Abstract Title Page Title: Degenerate Power in Multilevel Mediation: The Non-monotonic Relationship Between Power & Effect Size Authors and Affiliations: Ben Kelcey University of Cincinnati SREE Spring

More information

Logistic Regression: Regression with a Binary Dependent Variable

Logistic Regression: Regression with a Binary Dependent Variable Logistic Regression: Regression with a Binary Dependent Variable LEARNING OBJECTIVES Upon completing this chapter, you should be able to do the following: State the circumstances under which logistic regression

More information

WU Weiterbildung. Linear Mixed Models

WU Weiterbildung. Linear Mixed Models Linear Mixed Effects Models WU Weiterbildung SLIDE 1 Outline 1 Estimation: ML vs. REML 2 Special Models On Two Levels Mixed ANOVA Or Random ANOVA Random Intercept Model Random Coefficients Model Intercept-and-Slopes-as-Outcomes

More information

Introducing Generalized Linear Models: Logistic Regression

Introducing Generalized Linear Models: Logistic Regression Ron Heck, Summer 2012 Seminars 1 Multilevel Regression Models and Their Applications Seminar Introducing Generalized Linear Models: Logistic Regression The generalized linear model (GLM) represents and

More information

Multilevel Modeling of Single-Case Data: An Introduction and Tutorial for the Applied Researcher

Multilevel Modeling of Single-Case Data: An Introduction and Tutorial for the Applied Researcher University of Connecticut OpenCommons@UConn NERA Conference Proceedings 2017 Northeastern Educational Research Association (NERA) Annual Conference 12-2017 Multilevel Modeling of Single-Case Data: An Introduction

More information

Moving Beyond the Average Treatment Effect: A Look at Power Analyses for Moderator Effects in Cluster Randomized Trials

Moving Beyond the Average Treatment Effect: A Look at Power Analyses for Moderator Effects in Cluster Randomized Trials Moving Beyond the Average Treatment Effect: A Look at Power Analyses for Moderator Effects in Cluster Randomized Trials Jessaca Spybrook Western Michigan University December 3, 015 (Joint work with Ben

More information

Power and Sample Size for the Most Common Hypotheses in Mixed Models

Power and Sample Size for the Most Common Hypotheses in Mixed Models Power and Sample Size for the Most Common Hypotheses in Mixed Models Anna E. Barón Sarah M. Kreidler Deborah H. Glueck, University of Colorado and Keith E. Muller, University of Florida UNIVERSITY OF COLORADO

More information

Difference scores or statistical control? What should I use to predict change over two time points? Jason T. Newsom

Difference scores or statistical control? What should I use to predict change over two time points? Jason T. Newsom Difference scores or statistical control? What should I use to predict change over two time points? Jason T. Newsom Overview Purpose is to introduce a few basic concepts that may help guide researchers

More information

LCA_Distal_LTB Stata function users guide (Version 1.1)

LCA_Distal_LTB Stata function users guide (Version 1.1) LCA_Distal_LTB Stata function users guide (Version 1.1) Liying Huang John J. Dziak Bethany C. Bray Aaron T. Wagner Stephanie T. Lanza Penn State Copyright 2017, Penn State. All rights reserved. NOTE: the

More information

Assessing the Precision of Multisite Trials for Estimating the Parameters Of Cross-site Distributions of Program Effects. Howard S.

Assessing the Precision of Multisite Trials for Estimating the Parameters Of Cross-site Distributions of Program Effects. Howard S. Assessing the Precision of Multisite Trials for Estimating the Parameters Of Cross-site Distributions of Program Effects Howard S. Bloom MDRC Jessaca Spybrook Western Michigan University May 10, 016 Manuscript

More information

This is a revised version of the Online Supplement. that accompanies Lai and Kelley (2012) (Revised February, 2016)

This is a revised version of the Online Supplement. that accompanies Lai and Kelley (2012) (Revised February, 2016) This is a revised version of the Online Supplement that accompanies Lai and Kelley (2012) (Revised February, 2016) Online Supplement to "Accuracy in parameter estimation for ANCOVA and ANOVA contrasts:

More information

BIAS OF MAXIMUM-LIKELIHOOD ESTIMATES IN LOGISTIC AND COX REGRESSION MODELS: A COMPARATIVE SIMULATION STUDY

BIAS OF MAXIMUM-LIKELIHOOD ESTIMATES IN LOGISTIC AND COX REGRESSION MODELS: A COMPARATIVE SIMULATION STUDY BIAS OF MAXIMUM-LIKELIHOOD ESTIMATES IN LOGISTIC AND COX REGRESSION MODELS: A COMPARATIVE SIMULATION STUDY Ingo Langner 1, Ralf Bender 2, Rebecca Lenz-Tönjes 1, Helmut Küchenhoff 2, Maria Blettner 2 1

More information

RCB - Example. STA305 week 10 1

RCB - Example. STA305 week 10 1 RCB - Example An accounting firm wants to select training program for its auditors who conduct statistical sampling as part of their job. Three training methods are under consideration: home study, presentations

More information

Calculating Effect-Sizes. David B. Wilson, PhD George Mason University

Calculating Effect-Sizes. David B. Wilson, PhD George Mason University Calculating Effect-Sizes David B. Wilson, PhD George Mason University The Heart and Soul of Meta-analysis: The Effect Size Meta-analysis shifts focus from statistical significance to the direction and

More information

Marginal versus conditional effects: does it make a difference? Mireille Schnitzer, PhD Université de Montréal

Marginal versus conditional effects: does it make a difference? Mireille Schnitzer, PhD Université de Montréal Marginal versus conditional effects: does it make a difference? Mireille Schnitzer, PhD Université de Montréal Overview In observational and experimental studies, the goal may be to estimate the effect

More information

Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model

Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 1: August 22, 2012

More information

LCA Distal Stata function users guide (Version 1.0)

LCA Distal Stata function users guide (Version 1.0) LCA Distal Stata function users guide (Version 1.0) Liying Huang John J. Dziak Bethany C. Bray Aaron T. Wagner Stephanie T. Lanza Penn State Copyright 2016, Penn State. All rights reserved. Please send

More information

Tutorial 5: Power and Sample Size for One-way Analysis of Variance (ANOVA) with Equal Variances Across Groups. Acknowledgements:

Tutorial 5: Power and Sample Size for One-way Analysis of Variance (ANOVA) with Equal Variances Across Groups. Acknowledgements: Tutorial 5: Power and Sample Size for One-way Analysis of Variance (ANOVA) with Equal Variances Across Groups Anna E. Barón, Keith E. Muller, Sarah M. Kreidler, and Deborah H. Glueck Acknowledgements:

More information

Repeated-Measures ANOVA in SPSS Correct data formatting for a repeated-measures ANOVA in SPSS involves having a single line of data for each

Repeated-Measures ANOVA in SPSS Correct data formatting for a repeated-measures ANOVA in SPSS involves having a single line of data for each Repeated-Measures ANOVA in SPSS Correct data formatting for a repeated-measures ANOVA in SPSS involves having a single line of data for each participant, with the repeated measures entered as separate

More information

Data Analysis as a Decision Making Process

Data Analysis as a Decision Making Process Data Analysis as a Decision Making Process I. Levels of Measurement A. NOIR - Nominal Categories with names - Ordinal Categories with names and a logical order - Intervals Numerical Scale with logically

More information

One-Way ANOVA Cohen Chapter 12 EDUC/PSY 6600

One-Way ANOVA Cohen Chapter 12 EDUC/PSY 6600 One-Way ANOVA Cohen Chapter 1 EDUC/PSY 6600 1 It is easy to lie with statistics. It is hard to tell the truth without statistics. -Andrejs Dunkels Motivating examples Dr. Vito randomly assigns 30 individuals

More information

Power and Sample Size Calculations with the Additive Hazards Model

Power and Sample Size Calculations with the Additive Hazards Model Journal of Data Science 10(2012), 143-155 Power and Sample Size Calculations with the Additive Hazards Model Ling Chen, Chengjie Xiong, J. Philip Miller and Feng Gao Washington University School of Medicine

More information

1 Introduction. 2 Example

1 Introduction. 2 Example Statistics: Multilevel modelling Richard Buxton. 2008. Introduction Multilevel modelling is an approach that can be used to handle clustered or grouped data. Suppose we are trying to discover some of the

More information

INTRODUCTION TO ANALYSIS OF VARIANCE

INTRODUCTION TO ANALYSIS OF VARIANCE CHAPTER 22 INTRODUCTION TO ANALYSIS OF VARIANCE Chapter 18 on inferences about population means illustrated two hypothesis testing situations: for one population mean and for the difference between two

More information

Time-Invariant Predictors in Longitudinal Models

Time-Invariant Predictors in Longitudinal Models Time-Invariant Predictors in Longitudinal Models Topics: What happens to missing predictors Effects of time-invariant predictors Fixed vs. systematically varying vs. random effects Model building strategies

More information

Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model

Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model EPSY 905: Multivariate Analysis Lecture 1 20 January 2016 EPSY 905: Lecture 1 -

More information

Sample Size Calculations for Group Randomized Trials with Unequal Sample Sizes through Monte Carlo Simulations

Sample Size Calculations for Group Randomized Trials with Unequal Sample Sizes through Monte Carlo Simulations Sample Size Calculations for Group Randomized Trials with Unequal Sample Sizes through Monte Carlo Simulations Ben Brewer Duke University March 10, 2017 Introduction Group randomized trials (GRTs) are

More information

Determining Sample Sizes for Surveys with Data Analyzed by Hierarchical Linear Models

Determining Sample Sizes for Surveys with Data Analyzed by Hierarchical Linear Models Journal of Of cial Statistics, Vol. 14, No. 3, 1998, pp. 267±275 Determining Sample Sizes for Surveys with Data Analyzed by Hierarchical Linear Models Michael P. ohen 1 Behavioral and social data commonly

More information

From Gain Score t to ANCOVA F (and vice versa)

From Gain Score t to ANCOVA F (and vice versa) A peer-reviewed electronic journal. Copyright is retained by the first or sole author, who grants right of first publication to the Practical Assessment, Research & Evaluation. Permission is granted to

More information

9 Correlation and Regression

9 Correlation and Regression 9 Correlation and Regression SW, Chapter 12. Suppose we select n = 10 persons from the population of college seniors who plan to take the MCAT exam. Each takes the test, is coached, and then retakes the

More information

Multinomial Logistic Regression Models

Multinomial Logistic Regression Models Stat 544, Lecture 19 1 Multinomial Logistic Regression Models Polytomous responses. Logistic regression can be extended to handle responses that are polytomous, i.e. taking r>2 categories. (Note: The word

More information

psyc3010 lecture 2 factorial between-ps ANOVA I: omnibus tests

psyc3010 lecture 2 factorial between-ps ANOVA I: omnibus tests psyc3010 lecture 2 factorial between-ps ANOVA I: omnibus tests last lecture: introduction to factorial designs next lecture: factorial between-ps ANOVA II: (effect sizes and follow-up tests) 1 general

More information

Eco517 Fall 2014 C. Sims FINAL EXAM

Eco517 Fall 2014 C. Sims FINAL EXAM Eco517 Fall 2014 C. Sims FINAL EXAM This is a three hour exam. You may refer to books, notes, or computer equipment during the exam. You may not communicate, either electronically or in any other way,

More information

Downloaded from:

Downloaded from: Hossain, A; DiazOrdaz, K; Bartlett, JW (2017) Missing binary outcomes under covariate-dependent missingness in cluster randomised trials. Statistics in medicine. ISSN 0277-6715 DOI: https://doi.org/10.1002/sim.7334

More information

THE EFFECTS OF MULTICOLLINEARITY IN MULTILEVEL MODELS

THE EFFECTS OF MULTICOLLINEARITY IN MULTILEVEL MODELS THE EFFECTS OF MULTICOLLINEARITY IN MULTILEVEL MODELS A dissertation submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy By PATRICK C. CLARK B.A., Indiana University

More information

An Approximate Test for Homogeneity of Correlated Correlation Coefficients

An Approximate Test for Homogeneity of Correlated Correlation Coefficients Quality & Quantity 37: 99 110, 2003. 2003 Kluwer Academic Publishers. Printed in the Netherlands. 99 Research Note An Approximate Test for Homogeneity of Correlated Correlation Coefficients TRIVELLORE

More information

multilevel modeling: concepts, applications and interpretations

multilevel modeling: concepts, applications and interpretations multilevel modeling: concepts, applications and interpretations lynne c. messer 27 october 2010 warning social and reproductive / perinatal epidemiologist concepts why context matters multilevel models

More information

Intraclass Correlations in One-Factor Studies

Intraclass Correlations in One-Factor Studies CHAPTER Intraclass Correlations in One-Factor Studies OBJECTIVE The objective of this chapter is to present methods and techniques for calculating the intraclass correlation coefficient and associated

More information

Development. ECON 8830 Anant Nyshadham

Development. ECON 8830 Anant Nyshadham Development ECON 8830 Anant Nyshadham Projections & Regressions Linear Projections If we have many potentially related (jointly distributed) variables Outcome of interest Y Explanatory variable of interest

More information

An Introduction to Multilevel Models. PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 25: December 7, 2012

An Introduction to Multilevel Models. PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 25: December 7, 2012 An Introduction to Multilevel Models PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 25: December 7, 2012 Today s Class Concepts in Longitudinal Modeling Between-Person vs. +Within-Person

More information

Estimating Optimal Dynamic Treatment Regimes from Clustered Data

Estimating Optimal Dynamic Treatment Regimes from Clustered Data Estimating Optimal Dynamic Treatment Regimes from Clustered Data Bibhas Chakraborty Department of Biostatistics, Columbia University bc2425@columbia.edu Society for Clinical Trials Annual Meetings Boston,

More information

Sampling. February 24, Multilevel analysis and multistage samples

Sampling. February 24, Multilevel analysis and multistage samples Sampling Tom A.B. Snijders ICS Department of Statistics and Measurement Theory University of Groningen Grote Kruisstraat 2/1 9712 TS Groningen, The Netherlands email t.a.b.snijders@ppsw.rug.nl February

More information

I used college textbooks because they were the only resource available to evaluate measurement uncertainty calculations.

I used college textbooks because they were the only resource available to evaluate measurement uncertainty calculations. Introduction to Statistics By Rick Hogan Estimating uncertainty in measurement requires a good understanding of Statistics and statistical analysis. While there are many free statistics resources online,

More information

Lecture Outline. Biost 518 Applied Biostatistics II. Choice of Model for Analysis. Choice of Model. Choice of Model. Lecture 10: Multiple Regression:

Lecture Outline. Biost 518 Applied Biostatistics II. Choice of Model for Analysis. Choice of Model. Choice of Model. Lecture 10: Multiple Regression: Biost 518 Applied Biostatistics II Scott S. Emerson, M.D., Ph.D. Professor of Biostatistics University of Washington Lecture utline Choice of Model Alternative Models Effect of data driven selection of

More information

Chapter 11. Correlation and Regression

Chapter 11. Correlation and Regression Chapter 11. Correlation and Regression The word correlation is used in everyday life to denote some form of association. We might say that we have noticed a correlation between foggy days and attacks of

More information

CORELATION - Pearson-r - Spearman-rho

CORELATION - Pearson-r - Spearman-rho CORELATION - Pearson-r - Spearman-rho Scatter Diagram A scatter diagram is a graph that shows that the relationship between two variables measured on the same individual. Each individual in the set is

More information

Conditional Standard Errors of Measurement for Performance Ratings from Ordinary Least Squares Regression

Conditional Standard Errors of Measurement for Performance Ratings from Ordinary Least Squares Regression Conditional SEMs from OLS, 1 Conditional Standard Errors of Measurement for Performance Ratings from Ordinary Least Squares Regression Mark R. Raymond and Irina Grabovsky National Board of Medical Examiners

More information

Non-Inferiority Tests for the Ratio of Two Proportions in a Cluster- Randomized Design

Non-Inferiority Tests for the Ratio of Two Proportions in a Cluster- Randomized Design Chapter 236 Non-Inferiority Tests for the Ratio of Two Proportions in a Cluster- Randomized Design Introduction This module provides power analysis and sample size calculation for non-inferiority tests

More information

Impact of serial correlation structures on random effect misspecification with the linear mixed model.

Impact of serial correlation structures on random effect misspecification with the linear mixed model. Impact of serial correlation structures on random effect misspecification with the linear mixed model. Brandon LeBeau University of Iowa file:///c:/users/bleb/onedrive%20 %20University%20of%20Iowa%201/JournalArticlesInProgress/Diss/Study2/Pres/pres.html#(2)

More information

Multiple Linear Regression II. Lecture 8. Overview. Readings

Multiple Linear Regression II. Lecture 8. Overview. Readings Multiple Linear Regression II Lecture 8 Image source:https://commons.wikimedia.org/wiki/file:autobunnskr%c3%a4iz-ro-a201.jpg Survey Research & Design in Psychology James Neill, 2016 Creative Commons Attribution

More information

Multiple Linear Regression II. Lecture 8. Overview. Readings. Summary of MLR I. Summary of MLR I. Summary of MLR I

Multiple Linear Regression II. Lecture 8. Overview. Readings. Summary of MLR I. Summary of MLR I. Summary of MLR I Multiple Linear Regression II Lecture 8 Image source:https://commons.wikimedia.org/wiki/file:autobunnskr%c3%a4iz-ro-a201.jpg Survey Research & Design in Psychology James Neill, 2016 Creative Commons Attribution

More information

Robust covariance estimator for small-sample adjustment in the generalized estimating equations: A simulation study

Robust covariance estimator for small-sample adjustment in the generalized estimating equations: A simulation study Science Journal of Applied Mathematics and Statistics 2014; 2(1): 20-25 Published online February 20, 2014 (http://www.sciencepublishinggroup.com/j/sjams) doi: 10.11648/j.sjams.20140201.13 Robust covariance

More information

Introduction to Within-Person Analysis and RM ANOVA

Introduction to Within-Person Analysis and RM ANOVA Introduction to Within-Person Analysis and RM ANOVA Today s Class: From between-person to within-person ANOVAs for longitudinal data Variance model comparisons using 2 LL CLP 944: Lecture 3 1 The Two Sides

More information

Regression in R. Seth Margolis GradQuant May 31,

Regression in R. Seth Margolis GradQuant May 31, Regression in R Seth Margolis GradQuant May 31, 2018 1 GPA What is Regression Good For? Assessing relationships between variables This probably covers most of what you do 4 3.8 3.6 3.4 Person Intelligence

More information

Estimating the Mean Response of Treatment Duration Regimes in an Observational Study. Anastasios A. Tsiatis.

Estimating the Mean Response of Treatment Duration Regimes in an Observational Study. Anastasios A. Tsiatis. Estimating the Mean Response of Treatment Duration Regimes in an Observational Study Anastasios A. Tsiatis http://www.stat.ncsu.edu/ tsiatis/ Introduction to Dynamic Treatment Regimes 1 Outline Description

More information

Sampling and Sample Size. Shawn Cole Harvard Business School

Sampling and Sample Size. Shawn Cole Harvard Business School Sampling and Sample Size Shawn Cole Harvard Business School Calculating Sample Size Effect Size Power Significance Level Variance ICC EffectSize 2 ( ) 1 σ = t( 1 κ ) + tα * * 1+ ρ( m 1) P N ( 1 P) Proportion

More information

Chapter 3 ANALYSIS OF RESPONSE PROFILES

Chapter 3 ANALYSIS OF RESPONSE PROFILES Chapter 3 ANALYSIS OF RESPONSE PROFILES 78 31 Introduction In this chapter we present a method for analysing longitudinal data that imposes minimal structure or restrictions on the mean responses over

More information

Tutorial 3: Power and Sample Size for the Two-sample t-test with Equal Variances. Acknowledgements:

Tutorial 3: Power and Sample Size for the Two-sample t-test with Equal Variances. Acknowledgements: Tutorial 3: Power and Sample Size for the Two-sample t-test with Equal Variances Anna E. Barón, Keith E. Muller, Sarah M. Kreidler, and Deborah H. Glueck Acknowledgements: The project was supported in

More information

For more information about how to cite these materials visit

For more information about how to cite these materials visit Author(s): Kerby Shedden, Ph.D., 2010 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution Share Alike 3.0 License: http://creativecommons.org/licenses/by-sa/3.0/

More information