Bios 6649: Clinical Trials - Statistical Design and Monitoring
|
|
- Rodney Harvey
- 5 years ago
- Views:
Transcription
1 Bios 6649: Clinical Trials - Statistical Design and Monitoring Spring Semester 2015 John M. Kittelson Department of Biostatistics & Informatics Colorado School of Public Health University of Colorado Denver c 2015 John M. Kittelson, PhD Bios pg 1
2 6. Other topics Other topics in design of biomedical studies than 2 treatment groups (a) > 2 unrelated treatment (one-way ANOVA) designs (d) Designs for regression endpoints endpoints 6.2 Design of non-inferiority trials 6.3 Prevention of bias (randomization and blinding) 6.4 Prevention and treatment of missing data Bios pg 2
3 6.1(a) Trials with more than two unrelated Evaluating more than two unrelated (1-way ANOVA) Recall inferential objective with 2-group RCT: Parameterization of treatment effect θ (true mean treatment effect) Magnitude of ˆθ used to decide whether new treatment is suited for use in practice Parameter space divided into regions of clinically important effects (in both directions) What is the question with more than 2 treatment groups? Which treatment (of many) is best? Is there a dose-response relationship? Is this several studies with a shared control group? Is a combination treatment better than each of its components? Bios pg 3
4 6.1(a) Trials with more than two unrelated Evaluating more than two unrelated (1-way ANOVA) Inferential objective Is there a difference between K treatment groups or are they all the same? (My observation: investigators are more often interested in identifying the particular that differ from the rest, so the ANOVA F-test does not usually satisfy the scientific/clinical objectives.) Statistical setting Hypotheses: Null hypothesis: the mean outcome is the same in all groups (θ k = θ for k = 1,..., K ) Alternative hypothesis: At least one mean differs from the others. Bios pg 4
5 6.1(a) Trials with more than two unrelated Evaluating more than two unrelated (1-way ANOVA) Statistical inference focuses on the F-test in a 1-way ANOVA: F = SS b (K 1) SS w K (N 1) where N is the number of subjects in treatment group, and SS b and SS w denote between-group and within-group sums of squares: SS b = SS w = N i=1 N i=1 K (Y k Y ) 2 k=1 K (Y ik Y k ) 2 k=1 The statistic follows an F-distribution with numerator and denominator degrees of freedom of K 1 and K (N 1), respectively. (Recall that the F -distribution comes from the ratio of two chi-square random variables.) Bios pg 5
6 Evaluating more than two unrelated Sample size evaluation (one-way ANOVA) The ANOVA setting does not fit into the usual sample size formulation (The F-statistic is not asymptotically normal) For the purposes of sample size evaluation: Assume SSw is fixed and known, and let = K (N 1) σ2 w. This is analogous to assuming σ 2 in the usual sample size evaluation. SS w With known σ 2 w known: F = SS b (K 1) σ 2 w, and F χ 2 K 1 Bios pg 6
7 Evaluating more than two unrelated Sample size evaluation (one-way ANOVA) Under the null hypothesis, the critical value is cv = qchisq(0.95,k-1) Finding power = Pr(F > cv): F follows a non-central chi-square distribution. Non-centrality parameter is KNδ 2 where: δ 2 = σ2 b σ 2 w. Bios pg 7
8 Evaluating more than two unrelated Sample size evaluation (one-way ANOVA) F χ 2 K 1, so power is given by: 1 Pr(F f ) = 1 pchisq(f,k-1,ncp = KNδ 2 ) To find the sample size that gives power β for σ 2 b/σ 2 w, we find N such that 1 Pr(F f ) = β. For sample size evaluation, we specify the alternative hypothesis in terms of σ 2 b. Suppose we want adequate power for a particular vector of θ k s: (θ 1,..., θ K ). This alternative determines σ 2 b as follows: σ 2 b = K (θ k θ) 2 k=1. K * NOTICE: That this is not the standard variance estimator. The standard estimator would have K 1 in the denominator. Bios pg 8
9 Evaluating more than two unrelated Sample size evaluation (one-way ANOVA) Example: Suppose that a trial is planned with 3 treatment groups. Suppose that the primary outcome is LDL-C reduction (measured as percent). Assume σw = 10%. Want power to detect a difference between treatment groups of 8%. Finding σb 2: (µ 1, µ 2, µ 3 ) = (0, 0, 8) σb 2 = (µ 1, µ 2, µ 3 ) = (0, 2, 8) σb 2 = (µ 1, µ 2, µ 3 ) = (0, 4, 8) σb 2 = (µ 1, µ 2, µ 3 ) = (0, 6, 8) σb 2 = (µ 1, µ 2, µ 3 ) = (0, 8, 8) σb 2 = The critical value is: cv <- qchisq(0.95,2) = Set δ 2 = σ 2 b/σ 2 w = 10.67/100 = For a particular choice of N, the power is given by: 1 - pchisq(cv,2,3*n*0.1067). There is just over 90% power when N = 40. Bios pg 9
10 Evaluating more than two unrelated Sample size evaluation (one-way ANOVA) Example in seqdesign. > dsgn <- seqdesign(prob.model="mean",arms=3,variance=sgma2w, + alt.hypo=c(0,4,8), alpha=0.05,power = ) > dsgn Call: seqdesign(prob.model = "mean", arms = 3, alt.hypothesis = c(0, 4, 8), variance = sgma2w, power = , alpha = 0.05) PROBABILITY MODEL and HYPOTHESES: Theta is between group variance of means One-sided hypothesis test of a greater alternative: Null hypothesis : Theta <= 0.00 (size = ) Alternative hypothesis : Theta >= (power = ) (Fixed sample test) STOPPING BOUNDARIES: Sample Mean scale Futility Efficacy Time 1 (N= ) > dsgn$std.bounds STOPPING BOUNDARIES: Standardized Cumulative Sum scale Futility Efficacy Time 1 (N= 1) Bios pg 10
11 Evaluating more than two unrelated Sample size evaluation (one-way ANOVA) Comments on ANOVA designs seqdesign: There are some quirks: * Not possible to specify sample size and then calculate power * By default power = 1- alpha which can be confusing ANOVA designs are uncommon in RCTs. Interim decision-making is not implemented in RCTdesign * It is difficult to specify reasons for early termination with more. * See above comments regarding inferential objective. Bios pg 11
12 Evaluating more than two unrelated Definition: The term factorial design" refers to an experiment that evaluates several factors simultaneously. The most common application in the health sciences setting is a 2 2 factorial design of two different (A vs B) and 4 treatment groups: * no A; no B * no A; B * A; no B * A; B I have also seen 2-factor designs with more than 2 levels (e.g., a 3 4 factorial design) or with more than 2 (e.g., a factorial design). Inferential objective: There are 3 common reasons for using a factorial design (Classical) An efficient way to study 2 separately. Study the interaction between two drugs. Selecting the best treatment from 4 different options. Bios pg 12
13 Evaluating more than two unrelated (statistical setting in a 2 2 factorial design) Data: Y ijk = outcome for: ith subject (i = 1,..., N) jth level of treatment A (j = 0, 1) kthe level of treatment B (k = 0, 1) Distribution: Y jk N (θ jk, σ 2 /N). Tests: Potential tests include (see next page): Effect of treatment A (marginal) Effect of treatment B (marginal) Interaction between treatment A and B. Bios pg 13
14 Evaluating more than two unrelated (statistical setting in a 2 2 factorial design) Effect of treatment A: Y 10 + Y 11 2 Y 00 + Y 01 2 N (θ 1 θ 0, σ2 N ) where θ j = (θ j0 + θ j1 )/2. Effect of treatment B: Y 01 + Y 11 2 Y 00 + Y 10 2 N (θ 1 θ 0, σ2 N ) where θ k = (θ 0k + θ 1k )/2. Test interaction: Is the effect of A if you receive B the same as its effect if you do not receive B? (Y 11 Y 01 ) (Y 10 Y 00 ) N ((θ 11 θ 01 ) (θ 10 θ 00 ), 4σ2 N Selecting best of the 4 : test for any evidence of group differences in a 1-way ANOVA. ) Bios pg 14
15 Evaluating more than two unrelated (statistical setting in a 2 2 factorial design) Sample size evaluation: Let θ + denote the smallest important difference. Test effects of treatment A and (separately) the effects of treatment B: ( ) 2 zα + z β N main = σ 2 Test for interaction: θ + N Xact = ( zα + z β θ + ) 2 4σ 2 Notes on designing for interaction: * N Xact = 4N main * Interactions are usually smaller than main effects. When designing to test interactions design alternative (µ + ) could be smaller in which case N Xact > 4N main. Bios pg 15
16 Evaluating more than two unrelated (statistical setting in a 2 2 factorial design) One-way ANOVA approach: δ = σ2 b σ 2 where σ 2 b is calculated from θ jk. If you want to select the sample size to detect when at least one treatment has effect θ +, then: * choosing (µ 00, µ 01, µ 10, µ 11 ) = (0, 0, µ +, µ + ) gives smallest samples size. * choosing (µ 00, µ 01, µ 10, µ 11 ) = (0, 1 3 µ +, 2 3 µ +, µ + ) gives largest samples size. Bios pg 16
17 Evaluating more than two unrelated (inferential objective) Inferential objective: Treatment groups represent different doses of a drug. The FDA often requires evidence of a dose-response relationship. Some approaches to inference: Identify two dose levels as being of primary interest. Ask if any group differs from the others (ANOVA design). Compare all dose levels to 0-dose (placebo) to determine the minimally effective dose. Ask if there is evidence that response increases with dose. I will discuss the last setting. In my experience this is a common FDA motivation for dose-response studies. Bios pg 17
18 Evaluating more than two unrelated (Statistical setting) Treatment groups are ordered by dose. Let µ d, d = 1,..., D denote the mean outcome in the ordered dose groups. The dose-response hypothesis is that response increases with dose. Usually monotonicity is also hypothesized; that is, µ 0 < µ D with µ 0 µ 1... µ D 1 µ D Notice that this definition divides the parameter space into null and alternative regions: Null region: any non-monotonic or decreasing response. Alternative region: monotonic increasing response. As with ANOVA, these regions are multi-dimensional, and so it is not possible to express the null and alternative hypotheses by one-dimensional inequalities. In contrast to ANOVA, the null region is bigger than µ 0 = µ 1 =,..., = µ D. Bios pg 18
19 Evaluating more than two unrelated (Statistical setting) The multi-dimensional regions can be reduced to a single parameter using an appropriate contrast, θ = d w dµ d. A linear-trend contrast is interpreted as the first-order approximation to the (potentially non-linear) dose-response function. If the true dose-response function is non-linear, then a linear-trend contrast will still generalize to populations treated with the same dose levels. Linear-trend contrast (equally-spaced dose-levels): w d = d d. The contrast θ = d w dµ d is interpretable as the change in response for a 1-level change in dose. Bios pg 19
20 Evaluating more than two unrelated (Statistical setting) Notes: There are other linear" contrasts (e.g., orthogonal polynomials), but the above contrast is the linear contrast for a linear trend. The contrast can be rescaled to change the interpretation (e.g., change units). Unless you know that the dose-response relationship is exactly linear, then it is better to fit a linear contrast across dose levels than to treat dose as continuous and estimate the slope parameter in a regression model. If dose is treated as continuous, then systematic departures from linearity are counted as residual variation thereby decreasing power. Bios pg 20
21 Evaluating more than two unrelated (sample size evaluation) Suppose that you have D + 1 dose groups (d = 0,..., D), and that you want adequate power for dose-response functions with µ D µ 0 µ +. As above, consider a linear-trend contrast: With θ = d w d µ d where w d = (d d) Estimate θ by (ˆθ) = d w d Y d. If the variance is constant across dose groups, then with N patients per group, ˆθ N (θ, d w d 2 σ 2 N ). Assume µ0 = 0. If µ d is in fact linear (i.e., µ d = d D µ D), then the sample size (in each dose group) require for power β when µ D = µ + is given by: N = z α + z β D d w d µ D + d=0 2 D wd 2 σ 2 d=0 You can also evaluate power under non-linear dose-response relationships; for example if µ d = µ D (d/d) P for several choices of P 1. Bios pg 21
22 Evaluating more than two unrelated (example) Example: Consider a trial in which children are randomized to different doses of inhaled steroid for asthma control (Busse WW (1999) J. Allergy Clin Immunol; ). Suppose that doses will be 100µg/d, 400µg/d, and 800µg/d. Response will be measured by FEV(%predicted) at week 6 and standard deviation is about 10(%predicted). Suppose that you want to detect a linear trend in dose-response of 1(% predicted) for every 100µg/d increase in steroid dose. This is similar to the magnitude of the dose-response relationship for FEV 1 in the paper. Bios pg 22
23 Evaluating more than two unrelated (example) Approach Linear-trend contrast: With 3 equally-spaced dose-groups the contrast is w d = ( 1, 0, 1). This study does not have equally-spaced doses, so we use wd = ( 1, 0.1, 1.1). To get the correct interpretation we use w d = wd /7.4 If response increases by 1(%pred) for every 100µg/d increase in steroid then if µ 100 = 20, µ 400 = 23, and µ 800 = 27 so that w d µ d = 1 Sample size for linear trend (90% power): * variance = d w 2 d σ2 = * Design alternative = θ + = 1: d ( ) zα + z 2 β N = ( ) = = Bios pg 23
24 Evaluating more than two unrelated (example) Evaluate sensitivity of power to non-linear dose-response relationships: Suppose µ d = (d/D) P. A study with 43 subjects per group has the following power for non-linear relationships: P θ + Power Bios pg 24
25 Evaluating more than two unrelated (example): Implementation in RCTdesign R code for above calculations: > # Linear contrast: > d <- c(100,400,800)/100 > wd <- d - mean(d) > wd <- wd/sum(wd^2) > # With this contrast thetaa is 1.0: > thetaa <- sum(wd*c(20,23,27)) > thetaa [1] 1 Equivalent seqdesign command: > fxd <- seqdesign(arms=0,variance=100,ratio=c(1,4,8), + alt.hypo=1,power="calculate",sample.size=42.56*3) PROBABILITY MODEL and HYPOTHESES: Theta is difference in means per unit difference in treatment level One-sided hypothesis test of a greater alternative: Null hypothesis : Theta <= 0 (size = ) Alternative hypothesis : Theta >= 1 (power = ) (Fixed sample test) STOPPING BOUNDARIES: Sample Mean scale Futility Efficacy Time 1 (N= ) Bios pg 25
26 Evaluating more than two unrelated Notes on normed contrasts: A normed contrast has length = 1; i.e., divide by d w d 2 A normed contrast may not preserve the units of interest (i.e., θ A may not have desired interpretation) Dose response example with normed contrast: > d <- c(100,400,800)/100 > wd <- d - mean(d) > wdn <- wd/sqrt(sum(wd^2)) > sum(wdn^2) [1] 1 > > thetaa <- sum(wdn*c(20,23,27)) > thetaa [1] Bios pg 26
27 Consider a study for estimating the linear trend (θ 1 ) in outcome Y with a continuous explanatory (X) variable: Least squares estimator: ˆθ 1 = E(Y i ) = θ 0 + θ 1 X i N i (X i X)(Y i Y ) i (X i X) 2 ( θ 1, V ) N where: V N = σ 2 Y X i (X i X) 2 σ2 Y X Nσ 2 X Bios pg 27
28 Sample size evaluation for regression designs Usual sample size formula can be used with the variance of ˆθ 1 /4: N = ( zα + z β θ 1+ ) 2 V = ( zα + z β θ 1+ ) 2 σ 2 Y X σ 2 X For example we require about 100 patients if we want 97.5% power for θ + = 0.25 when σ 2 Y X = 2 and σ 2 X = 5: N = ( ) = 98.5 The corresponding seqdesign command, assuming X i N ((0,ratio) is: dsgn <- seqdesign(arms=0,variance=2,ratio=5,alt.hypo=0.25) Logistic regression designs can be specified using the odds probability model (see users guide). Bios pg 28
29 endpoints (e) Designs using change from baseline endpoints In many trial the primary endpoint is the change from baseline in a continuous outcome measure Examples: * Change in FEV1 * Change in 6-minute walk distance * Change in weight (or BMI) Motivation for analyzing change: * Looking at change within a subject removes between-subject variability * May reduce variation (increase power) Which is the most precise estimate of treatment effect? * T-test of difference in mean at follow-up? * T-test of difference in change from baseline? * Regression (ANCOVA) of difference in followup given baseline? Bios pg 29
30 endpoints (e) Designs using change from baseline endpoints Setting: Let Yi0k and Y i1k denote a measurement on the ith subject in treatment group k (k = 0, 1) at baseline and follow-up. Let Dik = Y i1k Y i0k Consider the following statistical models for testing treatment effect (Tx = 1 [k=1] ) (a) Test difference at follow-up time: E(Y i1k ) = α 0 + α 1 Tx (b) Test change from baseline: E(D ik ) = β 0 + β 1 Tx (c) Test follow-up meausure given baseline measure (ANCOVA): E(Y i1k ) = γ 0 + γ 1 Tx + γ 2 E(Y i0k ) Bios pg 30
31 endpoints Designs using change from baseline endpoints Probability model Suppose that ( ) [( ) ] Y 0ik µ 00 N, Σ Y 1ik µ 1k where Σ = [ σ 2 ] ρσ 2 ρσ 2 σ 2 notice that µ 0k = µ 01 = µ 00 by randomization Bios pg 31
32 endpoints Designs using change from baseline endpoints Distribution of estimated treatment effects from the 3 statistical models with i = 1,...N subjects per group: (a) E(Y i1k ) = α 0 + α 1 Tx (b) E(D ik ) = β 0 + β 1 Tx ˆα 1 N ˆβ 1 N (µ 1 µ 0, 2σ2 N ) ( ) µ 1 µ 0, 4σ2 (1 ρ) N (c) E(Y i1k ) = γ 0 + γ 1 Tx + γ 2 E(Y i0k ) ( ) ˆγ 1 N µ 1 µ 0, 2σ2 (1 ρ 2 ) N where µ 1 = µ 11 µ 10 ; also notice: α 1 = µ 1 β 1 = µ 1 = (µ 11 µ 00 ) (µ 10 µ 00 ) γ 1 = µ 1 Bios pg 32
33 endpoints Designs using change from baseline endpoints Relative efficiency of the designs using the 3 statistical models: (a) vs (b) When is var( ˆβ 1 ) < var(ˆα 1 )? var( ˆβ 1 ) < var(ˆα 1 ) 4σ 2 (1 ρ) < 2σ < ρ < 1.0 (a) vs (c) When is var(ˆγ 1 ) < var(ˆα 1 )? var(ˆγ 1 ) < var(ˆα 1 ) 2σ 2 (1 ρ 2 ) < 2σ < ρ < 1.0 (b) vs (c) When is var(ˆγ 1 ) < var( ˆβ 1 )? var(ˆγ 1 ) < var( ˆβ 1 ) 2σ 2 (1 ρ 2 ) < 4σ 2 (1 ρ) 0.0 < ρ < 1.0 So, conditioning on baseline (ANCOVA) is more efficient (see also examples/proof in handout) Note: ˆγ1 is the same regardless of whether Y i1k or D ik is used as the response variable (try it). Bios pg 33
Bios 6649: Clinical Trials - Statistical Design and Monitoring
Bios 6649: Clinical Trials - Statistical Design and Monitoring Spring Semester 2015 John M. Kittelson Department of Biostatistics & Informatics Colorado School of Public Health University of Colorado Denver
More informationBios 6648: Design & conduct of clinical research
Bios 6648: Design & conduct of clinical research Section 2 - Formulating the scientific and statistical design designs 2.5(b) Binary 2.5(c) Skewed baseline (a) Time-to-event (revisited) (b) Binary (revisited)
More informationSequential Monitoring of Clinical Trials Session 4 - Bayesian Evaluation of Group Sequential Designs
Sequential Monitoring of Clinical Trials Session 4 - Bayesian Evaluation of Group Sequential Designs Presented August 8-10, 2012 Daniel L. Gillen Department of Statistics University of California, Irvine
More informationBIOS 6649: Handout Exercise Solution
BIOS 6649: Handout Exercise Solution NOTE: I encourage you to work together, but the work you submit must be your own. Any plagiarism will result in loss of all marks. This assignment is based on weight-loss
More informationSociology 6Z03 Review II
Sociology 6Z03 Review II John Fox McMaster University Fall 2016 John Fox (McMaster University) Sociology 6Z03 Review II Fall 2016 1 / 35 Outline: Review II Probability Part I Sampling Distributions Probability
More informationAccounting for Baseline Observations in Randomized Clinical Trials
Accounting for Baseline Observations in Randomized Clinical Trials Scott S Emerson, MD, PhD Department of Biostatistics, University of Washington, Seattle, WA 9895, USA October 6, 0 Abstract In clinical
More informationAccounting for Baseline Observations in Randomized Clinical Trials
Accounting for Baseline Observations in Randomized Clinical Trials Scott S Emerson, MD, PhD Department of Biostatistics, University of Washington, Seattle, WA 9895, USA August 5, 0 Abstract In clinical
More informationPoisson regression: Further topics
Poisson regression: Further topics April 21 Overdispersion One of the defining characteristics of Poisson regression is its lack of a scale parameter: E(Y ) = Var(Y ), and no parameter is available to
More informationBios 6649: Clinical Trials - Statistical Design and Monitoring
Bios 6649: Clinical Trials - Statistical Design and Monitoring Spring Semester 2015 John M. Kittelson Department of Biostatistics & nformatics Colorado School of Public Health University of Colorado Denver
More informationBios 6648: Design & conduct of clinical research
Bios 6648: Design & conduct of clinical research Section 2 - Formulating the scientific and statistical design designs 2.5(b) Binary (a) Time-to-event (revisited) (b) Binary (revisited) (c) Skewed (d)
More informationSTA441: Spring Multiple Regression. This slide show is a free open source document. See the last slide for copyright information.
STA441: Spring 2018 Multiple Regression This slide show is a free open source document. See the last slide for copyright information. 1 Least Squares Plane 2 Statistical MODEL There are p-1 explanatory
More informationPubH 7470: STATISTICS FOR TRANSLATIONAL & CLINICAL RESEARCH
PubH 7470: STATISTICS FOR TRANSLATIONAL & CLINICAL RESEARCH The First Step: SAMPLE SIZE DETERMINATION THE ULTIMATE GOAL The most important, ultimate step of any of clinical research is to do draw inferences;
More informationPubh 8482: Sequential Analysis
Pubh 8482: Sequential Analysis Joseph S. Koopmeiners Division of Biostatistics University of Minnesota Week 10 Class Summary Last time... We began our discussion of adaptive clinical trials Specifically,
More informationSample Size and Power I: Binary Outcomes. James Ware, PhD Harvard School of Public Health Boston, MA
Sample Size and Power I: Binary Outcomes James Ware, PhD Harvard School of Public Health Boston, MA Sample Size and Power Principles: Sample size calculations are an essential part of study design Consider
More informationCHL 5225H Advanced Statistical Methods for Clinical Trials: Multiplicity
CHL 5225H Advanced Statistical Methods for Clinical Trials: Multiplicity Prof. Kevin E. Thorpe Dept. of Public Health Sciences University of Toronto Objectives 1. Be able to distinguish among the various
More informationy ˆ i = ˆ " T u i ( i th fitted value or i th fit)
1 2 INFERENCE FOR MULTIPLE LINEAR REGRESSION Recall Terminology: p predictors x 1, x 2,, x p Some might be indicator variables for categorical variables) k-1 non-constant terms u 1, u 2,, u k-1 Each u
More informationEconometrics. 4) Statistical inference
30C00200 Econometrics 4) Statistical inference Timo Kuosmanen Professor, Ph.D. http://nomepre.net/index.php/timokuosmanen Today s topics Confidence intervals of parameter estimates Student s t-distribution
More informationInference for the Regression Coefficient
Inference for the Regression Coefficient Recall, b 0 and b 1 are the estimates of the slope β 1 and intercept β 0 of population regression line. We can shows that b 0 and b 1 are the unbiased estimates
More informationAnalysis of Longitudinal Data. Patrick J. Heagerty PhD Department of Biostatistics University of Washington
Analsis of Longitudinal Data Patrick J. Heagert PhD Department of Biostatistics Universit of Washington 1 Auckland 2008 Session Three Outline Role of correlation Impact proper standard errors Used to weight
More informationBIOL 51A - Biostatistics 1 1. Lecture 1: Intro to Biostatistics. Smoking: hazardous? FEV (l) Smoke
BIOL 51A - Biostatistics 1 1 Lecture 1: Intro to Biostatistics Smoking: hazardous? FEV (l) 1 2 3 4 5 No Yes Smoke BIOL 51A - Biostatistics 1 2 Box Plot a.k.a box-and-whisker diagram or candlestick chart
More informationPower and Sample Size Calculations with the Additive Hazards Model
Journal of Data Science 10(2012), 143-155 Power and Sample Size Calculations with the Additive Hazards Model Ling Chen, Chengjie Xiong, J. Philip Miller and Feng Gao Washington University School of Medicine
More informationGroup Sequential Designs: Theory, Computation and Optimisation
Group Sequential Designs: Theory, Computation and Optimisation Christopher Jennison Department of Mathematical Sciences, University of Bath, UK http://people.bath.ac.uk/mascj 8th International Conference
More information4. Issues in Trial Monitoring
4. Issues in Trial Monitoring 4.1 Elements of Trial Monitoring Monitoring trial process and quality Infrastructure requirements and DSMBs 4.2 Interim analyses: group sequential trial design 4.3 Group sequential
More informationOverrunning in Clinical Trials: a Methodological Review
Overrunning in Clinical Trials: a Methodological Review Dario Gregori Unit of Biostatistics, Epidemiology and Public Health Department of Cardiac, Thoracic and Vascular Sciences dario.gregori@unipd.it
More informationCategorical data analysis Chapter 5
Categorical data analysis Chapter 5 Interpreting parameters in logistic regression The sign of β determines whether π(x) is increasing or decreasing as x increases. The rate of climb or descent increases
More informationProblems. Suppose both models are fitted to the same data. Show that SS Res, A SS Res, B
Simple Linear Regression 35 Problems 1 Consider a set of data (x i, y i ), i =1, 2,,n, and the following two regression models: y i = β 0 + β 1 x i + ε, (i =1, 2,,n), Model A y i = γ 0 + γ 1 x i + γ 2
More informationST 732, Midterm Solutions Spring 2019
ST 732, Midterm Solutions Spring 2019 Please sign the following pledge certifying that the work on this test is your own: I have neither given nor received aid on this test. Signature: Printed Name: There
More informationBIOS 2083 Linear Models c Abdus S. Wahed
Chapter 5 206 Chapter 6 General Linear Model: Statistical Inference 6.1 Introduction So far we have discussed formulation of linear models (Chapter 1), estimability of parameters in a linear model (Chapter
More informationSample size re-estimation in clinical trials. Dealing with those unknowns. Chris Jennison. University of Kyoto, January 2018
Sample Size Re-estimation in Clinical Trials: Dealing with those unknowns Christopher Jennison Department of Mathematical Sciences, University of Bath, UK http://people.bath.ac.uk/mascj University of Kyoto,
More informationCentral Limit Theorem ( 5.3)
Central Limit Theorem ( 5.3) Let X 1, X 2,... be a sequence of independent random variables, each having n mean µ and variance σ 2. Then the distribution of the partial sum S n = X i i=1 becomes approximately
More informationBIOS 312: Precision of Statistical Inference
and Power/Sample Size and Standard Errors BIOS 312: of Statistical Inference Chris Slaughter Department of Biostatistics, Vanderbilt University School of Medicine January 3, 2013 Outline Overview and Power/Sample
More informationScatter plot of data from the study. Linear Regression
1 2 Linear Regression Scatter plot of data from the study. Consider a study to relate birthweight to the estriol level of pregnant women. The data is below. i Weight (g / 100) i Weight (g / 100) 1 7 25
More informationChapter 12 - Lecture 2 Inferences about regression coefficient
Chapter 12 - Lecture 2 Inferences about regression coefficient April 19th, 2010 Facts about slope Test Statistic Confidence interval Hypothesis testing Test using ANOVA Table Facts about slope In previous
More informationDiagnostics can identify two possible areas of failure of assumptions when fitting linear models.
1 Transformations 1.1 Introduction Diagnostics can identify two possible areas of failure of assumptions when fitting linear models. (i) lack of Normality (ii) heterogeneity of variances It is important
More informationThe SEQDESIGN Procedure
SAS/STAT 9.2 User s Guide, Second Edition The SEQDESIGN Procedure (Book Excerpt) This document is an individual chapter from the SAS/STAT 9.2 User s Guide, Second Edition. The correct bibliographic citation
More informationChapter 8 - Statistical intervals for a single sample
Chapter 8 - Statistical intervals for a single sample 8-1 Introduction In statistics, no quantity estimated from data is known for certain. All estimated quantities have probability distributions of their
More informationLecture 5: ANOVA and Correlation
Lecture 5: ANOVA and Correlation Ani Manichaikul amanicha@jhsph.edu 23 April 2007 1 / 62 Comparing Multiple Groups Continous data: comparing means Analysis of variance Binary data: comparing proportions
More informationLecture 3: Inference in SLR
Lecture 3: Inference in SLR STAT 51 Spring 011 Background Reading KNNL:.1.6 3-1 Topic Overview This topic will cover: Review of hypothesis testing Inference about 1 Inference about 0 Confidence Intervals
More informationMeasuring the fit of the model - SSR
Measuring the fit of the model - SSR Once we ve determined our estimated regression line, we d like to know how well the model fits. How far/close are the observations to the fitted line? One way to do
More informationAdaptive designs beyond p-value combination methods. Ekkehard Glimm, Novartis Pharma EAST user group meeting Basel, 31 May 2013
Adaptive designs beyond p-value combination methods Ekkehard Glimm, Novartis Pharma EAST user group meeting Basel, 31 May 2013 Outline Introduction Combination-p-value method and conditional error function
More informationComparing Adaptive Designs and the. Classical Group Sequential Approach. to Clinical Trial Design
Comparing Adaptive Designs and the Classical Group Sequential Approach to Clinical Trial Design Christopher Jennison Department of Mathematical Sciences, University of Bath, UK http://people.bath.ac.uk/mascj
More informationScatter plot of data from the study. Linear Regression
1 2 Linear Regression Scatter plot of data from the study. Consider a study to relate birthweight to the estriol level of pregnant women. The data is below. i Weight (g / 100) i Weight (g / 100) 1 7 25
More informationOptimising Group Sequential Designs. Decision Theory, Dynamic Programming. and Optimal Stopping
: Decision Theory, Dynamic Programming and Optimal Stopping Christopher Jennison Department of Mathematical Sciences, University of Bath, UK http://people.bath.ac.uk/mascj InSPiRe Conference on Methodology
More informationAdaptive Designs: Why, How and When?
Adaptive Designs: Why, How and When? Christopher Jennison Department of Mathematical Sciences, University of Bath, UK http://people.bath.ac.uk/mascj ISBS Conference Shanghai, July 2008 1 Adaptive designs:
More informationDose-response modeling with bivariate binary data under model uncertainty
Dose-response modeling with bivariate binary data under model uncertainty Bernhard Klingenberg 1 1 Department of Mathematics and Statistics, Williams College, Williamstown, MA, 01267 and Institute of Statistics,
More informationInterim Monitoring of Clinical Trials: Decision Theory, Dynamic Programming. and Optimal Stopping
Interim Monitoring of Clinical Trials: Decision Theory, Dynamic Programming and Optimal Stopping Christopher Jennison Department of Mathematical Sciences, University of Bath, UK http://people.bath.ac.uk/mascj
More informationNext is material on matrix rank. Please see the handout
B90.330 / C.005 NOTES for Wednesday 0.APR.7 Suppose that the model is β + ε, but ε does not have the desired variance matrix. Say that ε is normal, but Var(ε) σ W. The form of W is W w 0 0 0 0 0 0 w 0
More informationReview of Statistics 101
Review of Statistics 101 We review some important themes from the course 1. Introduction Statistics- Set of methods for collecting/analyzing data (the art and science of learning from data). Provides methods
More informationBIOS 2083: Linear Models
BIOS 2083: Linear Models Abdus S Wahed September 2, 2009 Chapter 0 2 Chapter 1 Introduction to linear models 1.1 Linear Models: Definition and Examples Example 1.1.1. Estimating the mean of a N(μ, σ 2
More informationPersonalized Treatment Selection Based on Randomized Clinical Trials. Tianxi Cai Department of Biostatistics Harvard School of Public Health
Personalized Treatment Selection Based on Randomized Clinical Trials Tianxi Cai Department of Biostatistics Harvard School of Public Health Outline Motivation A systematic approach to separating subpopulations
More informationBias Variance Trade-off
Bias Variance Trade-off The mean squared error of an estimator MSE(ˆθ) = E([ˆθ θ] 2 ) Can be re-expressed MSE(ˆθ) = Var(ˆθ) + (B(ˆθ) 2 ) MSE = VAR + BIAS 2 Proof MSE(ˆθ) = E((ˆθ θ) 2 ) = E(([ˆθ E(ˆθ)]
More information[y i α βx i ] 2 (2) Q = i=1
Least squares fits This section has no probability in it. There are no random variables. We are given n points (x i, y i ) and want to find the equation of the line that best fits them. We take the equation
More informationChapter Seven: Multi-Sample Methods 1/52
Chapter Seven: Multi-Sample Methods 1/52 7.1 Introduction 2/52 Introduction The independent samples t test and the independent samples Z test for a difference between proportions are designed to analyze
More informationSTAT 135 Lab 10 Two-Way ANOVA, Randomized Block Design and Friedman s Test
STAT 135 Lab 10 Two-Way ANOVA, Randomized Block Design and Friedman s Test Rebecca Barter April 13, 2015 Let s now imagine a dataset for which our response variable, Y, may be influenced by two factors,
More informationMicro-Randomized Trials & mhealth. S.A. Murphy NRC 8/2014
Micro-Randomized Trials & mhealth S.A. Murphy NRC 8/2014 mhealth Goal: Design a Continually Learning Mobile Health Intervention: HeartSteps + Designing a Micro-Randomized Trial 2 Data from wearable devices
More informationClinical Trials. Olli Saarela. September 18, Dalla Lana School of Public Health University of Toronto.
Introduction to Dalla Lana School of Public Health University of Toronto olli.saarela@utoronto.ca September 18, 2014 38-1 : a review 38-2 Evidence Ideal: to advance the knowledge-base of clinical medicine,
More informationReview of Statistics
Review of Statistics Topics Descriptive Statistics Mean, Variance Probability Union event, joint event Random Variables Discrete and Continuous Distributions, Moments Two Random Variables Covariance and
More informationThree-Way Contingency Tables
Newsom PSY 50/60 Categorical Data Analysis, Fall 06 Three-Way Contingency Tables Three-way contingency tables involve three binary or categorical variables. I will stick mostly to the binary case to keep
More informationSample Size Determination
Sample Size Determination 018 The number of subjects in a clinical study should always be large enough to provide a reliable answer to the question(s addressed. The sample size is usually determined by
More informationSTA441: Spring Multiple Regression. More than one explanatory variable at the same time
STA441: Spring 2016 Multiple Regression More than one explanatory variable at the same time This slide show is a free open source document. See the last slide for copyright information. One Explanatory
More informationCh 2: Simple Linear Regression
Ch 2: Simple Linear Regression 1. Simple Linear Regression Model A simple regression model with a single regressor x is y = β 0 + β 1 x + ɛ, where we assume that the error ɛ is independent random component
More informationCOMPLETELY RANDOM DESIGN (CRD) -Design can be used when experimental units are essentially homogeneous.
COMPLETELY RANDOM DESIGN (CRD) Description of the Design -Simplest design to use. -Design can be used when experimental units are essentially homogeneous. -Because of the homogeneity requirement, it may
More informationSleep data, two drugs Ch13.xls
Model Based Statistics in Biology. Part IV. The General Linear Mixed Model.. Chapter 13.3 Fixed*Random Effects (Paired t-test) ReCap. Part I (Chapters 1,2,3,4), Part II (Ch 5, 6, 7) ReCap Part III (Ch
More informationUnbalanced Designs & Quasi F-Ratios
Unbalanced Designs & Quasi F-Ratios ANOVA for unequal n s, pooled variances, & other useful tools Unequal nʼs Focus (so far) on Balanced Designs Equal n s in groups (CR-p and CRF-pq) Observation in every
More informationSample Size and Power Considerations for Longitudinal Studies
Sample Size and Power Considerations for Longitudinal Studies Outline Quantities required to determine the sample size in longitudinal studies Review of type I error, type II error, and power For continuous
More informationCOMPARING GROUPS PART 1CONTINUOUS DATA
COMPARING GROUPS PART 1CONTINUOUS DATA Min Chen, Ph.D. Assistant Professor Quantitative Biomedical Research Center Department of Clinical Sciences Bioinformatics Shared Resource Simmons Comprehensive Cancer
More informationEC212: Introduction to Econometrics Review Materials (Wooldridge, Appendix)
1 EC212: Introduction to Econometrics Review Materials (Wooldridge, Appendix) Taisuke Otsu London School of Economics Summer 2018 A.1. Summation operator (Wooldridge, App. A.1) 2 3 Summation operator For
More informationCOMPARING SEVERAL MEANS: ANOVA
LAST UPDATED: November 15, 2012 COMPARING SEVERAL MEANS: ANOVA Objectives 2 Basic principles of ANOVA Equations underlying one-way ANOVA Doing a one-way ANOVA in R Following up an ANOVA: Planned contrasts/comparisons
More informationStatistical Hypothesis Testing
Statistical Hypothesis Testing Dr. Phillip YAM 2012/2013 Spring Semester Reference: Chapter 7 of Tests of Statistical Hypotheses by Hogg and Tanis. Section 7.1 Tests about Proportions A statistical hypothesis
More informationTests for the Odds Ratio of Two Proportions in a 2x2 Cross-Over Design
Chapter 170 Tests for the Odds Ratio of Two Proportions in a 2x2 Cross-Over Design Introduction Senn (2002) defines a cross-over design as one in which each subject receives all treatments and the objective
More informationChapter 1 Statistical Inference
Chapter 1 Statistical Inference causal inference To infer causality, you need a randomized experiment (or a huge observational study and lots of outside information). inference to populations Generalizations
More informationLECTURE 5. Introduction to Econometrics. Hypothesis testing
LECTURE 5 Introduction to Econometrics Hypothesis testing October 18, 2016 1 / 26 ON TODAY S LECTURE We are going to discuss how hypotheses about coefficients can be tested in regression models We will
More information6 Sample Size Calculations
6 Sample Size Calculations A major responsibility of a statistician: sample size calculation. Hypothesis Testing: compare treatment 1 (new treatment) to treatment 2 (standard treatment); Assume continuous
More informationRerandomization to Balance Covariates
Rerandomization to Balance Covariates Kari Lock Morgan Department of Statistics Penn State University Joint work with Don Rubin University of Minnesota Biostatistics 4/27/16 The Gold Standard Randomized
More informationBattery Life. Factory
Statistics 354 (Fall 2018) Analysis of Variance: Comparing Several Means Remark. These notes are from an elementary statistics class and introduce the Analysis of Variance technique for comparing several
More informationSAS/STAT 15.1 User s Guide The SEQDESIGN Procedure
SAS/STAT 15.1 User s Guide The SEQDESIGN Procedure This document is an individual chapter from SAS/STAT 15.1 User s Guide. The correct bibliographic citation for this manual is as follows: SAS Institute
More informationCh 3: Multiple Linear Regression
Ch 3: Multiple Linear Regression 1. Multiple Linear Regression Model Multiple regression model has more than one regressor. For example, we have one response variable and two regressor variables: 1. delivery
More informationGeneral Linear Model: Statistical Inference
Chapter 6 General Linear Model: Statistical Inference 6.1 Introduction So far we have discussed formulation of linear models (Chapter 1), estimability of parameters in a linear model (Chapter 4), least
More informationPubh 8482: Sequential Analysis
Pubh 8482: Sequential Analysis Joseph S. Koopmeiners Division of Biostatistics University of Minnesota Week 8 P-values When reporting results, we usually report p-values in place of reporting whether or
More informationSample Size/Power Calculation by Software/Online Calculators
Sample Size/Power Calculation by Software/Online Calculators May 24, 2018 Li Zhang, Ph.D. li.zhang@ucsf.edu Associate Professor Department of Epidemiology and Biostatistics Division of Hematology and Oncology
More information401 Review. 6. Power analysis for one/two-sample hypothesis tests and for correlation analysis.
401 Review Major topics of the course 1. Univariate analysis 2. Bivariate analysis 3. Simple linear regression 4. Linear algebra 5. Multiple regression analysis Major analysis methods 1. Graphical analysis
More informationStat 579: Generalized Linear Models and Extensions
Stat 579: Generalized Linear Models and Extensions Linear Mixed Models for Longitudinal Data Yan Lu April, 2018, week 15 1 / 38 Data structure t1 t2 tn i 1st subject y 11 y 12 y 1n1 Experimental 2nd subject
More informationSTAT 285: Fall Semester Final Examination Solutions
Name: Student Number: STAT 285: Fall Semester 2014 Final Examination Solutions 5 December 2014 Instructor: Richard Lockhart Instructions: This is an open book test. As such you may use formulas such as
More informationTopic 12 Overview of Estimation
Topic 12 Overview of Estimation Classical Statistics 1 / 9 Outline Introduction Parameter Estimation Classical Statistics Densities and Likelihoods 2 / 9 Introduction In the simplest possible terms, the
More informationThe legacy of Sir Ronald A. Fisher. Fisher s three fundamental principles: local control, replication, and randomization.
1 Chapter 1: Research Design Principles The legacy of Sir Ronald A. Fisher. Fisher s three fundamental principles: local control, replication, and randomization. 2 Chapter 2: Completely Randomized Design
More informationWelcome! Webinar Biostatistics: sample size & power. Thursday, April 26, 12:30 1:30 pm (NDT)
. Welcome! Webinar Biostatistics: sample size & power Thursday, April 26, 12:30 1:30 pm (NDT) Get started now: Please check if your speakers are working and mute your audio. Please use the chat box to
More informationStatistical Aspects of Futility Analyses. Kevin J Carroll. nd 2013
Statistical Aspects of Futility Analyses Kevin J Carroll March Spring 222013 nd 2013 1 Contents Introduction The Problem in Statistical Terms Defining Futility Three Common Futility Rules The Maths An
More informationINTERVAL ESTIMATION AND HYPOTHESES TESTING
INTERVAL ESTIMATION AND HYPOTHESES TESTING 1. IDEA An interval rather than a point estimate is often of interest. Confidence intervals are thus important in empirical work. To construct interval estimates,
More informationLecture 9 Two-Sample Test. Fall 2013 Prof. Yao Xie, H. Milton Stewart School of Industrial Systems & Engineering Georgia Tech
Lecture 9 Two-Sample Test Fall 2013 Prof. Yao Xie, yao.xie@isye.gatech.edu H. Milton Stewart School of Industrial Systems & Engineering Georgia Tech Computer exam 1 18 Histogram 14 Frequency 9 5 0 75 83.33333333
More informationTutorial 5: Power and Sample Size for One-way Analysis of Variance (ANOVA) with Equal Variances Across Groups. Acknowledgements:
Tutorial 5: Power and Sample Size for One-way Analysis of Variance (ANOVA) with Equal Variances Across Groups Anna E. Barón, Keith E. Muller, Sarah M. Kreidler, and Deborah H. Glueck Acknowledgements:
More informationCourse Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model
Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 1: August 22, 2012
More informationComparison of Different Methods of Sample Size Re-estimation for Therapeutic Equivalence (TE) Studies Protecting the Overall Type 1 Error
Comparison of Different Methods of Sample Size Re-estimation for Therapeutic Equivalence (TE) Studies Protecting the Overall Type 1 Error by Diane Potvin Outline 1. Therapeutic Equivalence Designs 2. Objectives
More informationSTAT 525 Fall Final exam. Tuesday December 14, 2010
STAT 525 Fall 2010 Final exam Tuesday December 14, 2010 Time: 2 hours Name (please print): Show all your work and calculations. Partial credit will be given for work that is partially correct. Points will
More informationInference for Regression
Inference for Regression Section 9.4 Cathy Poliak, Ph.D. cathy@math.uh.edu Office in Fleming 11c Department of Mathematics University of Houston Lecture 13b - 3339 Cathy Poliak, Ph.D. cathy@math.uh.edu
More informationAcknowledgements. Outline. Marie Diener-West. ICTR Leadership / Team INTRODUCTION TO CLINICAL RESEARCH. Introduction to Linear Regression
INTRODUCTION TO CLINICAL RESEARCH Introduction to Linear Regression Karen Bandeen-Roche, Ph.D. July 17, 2012 Acknowledgements Marie Diener-West Rick Thompson ICTR Leadership / Team JHU Intro to Clinical
More informationSimple logistic regression
Simple logistic regression Biometry 755 Spring 2009 Simple logistic regression p. 1/47 Model assumptions 1. The observed data are independent realizations of a binary response variable Y that follows a
More informationBayes Factor Single Arm Time-to-event User s Guide (Version 1.0.0)
Bayes Factor Single Arm Time-to-event User s Guide (Version 1.0.0) Department of Biostatistics P. O. Box 301402, Unit 1409 The University of Texas, M. D. Anderson Cancer Center Houston, Texas 77230-1402,
More informationUNIVERSITY OF TORONTO Faculty of Arts and Science
UNIVERSITY OF TORONTO Faculty of Arts and Science December 2013 Final Examination STA442H1F/2101HF Methods of Applied Statistics Jerry Brunner Duration - 3 hours Aids: Calculator Model(s): Any calculator
More informationTwo-stage Adaptive Randomization for Delayed Response in Clinical Trials
Two-stage Adaptive Randomization for Delayed Response in Clinical Trials Guosheng Yin Department of Statistics and Actuarial Science The University of Hong Kong Joint work with J. Xu PSI and RSS Journal
More informationBiost 518 Applied Biostatistics II. Purpose of Statistics. First Stage of Scientific Investigation. Further Stages of Scientific Investigation
Biost 58 Applied Biostatistics II Scott S. Emerson, M.D., Ph.D. Professor of Biostatistics University of Washington Lecture 5: Review Purpose of Statistics Statistics is about science (Science in the broadest
More information