Introduction to Crossover Trials

Similar documents
T-test: means of Spock's judge versus all other judges 1 12:10 Wednesday, January 5, judge1 N Mean Std Dev Std Err Minimum Maximum

General Linear Model (Chapter 4)

Outline. PubH 5450 Biostatistics I Prof. Carlin. Confidence Interval for the Mean. Part I. Reviews

In many situations, there is a non-parametric test that corresponds to the standard test, as described below:

Repeated Measures Part 2: Cartoon data

df=degrees of freedom = n - 1

ANOVA Situation The F Statistic Multiple Comparisons. 1-Way ANOVA MATH 143. Department of Mathematics and Statistics Calvin College

Data are sometimes not compatible with the assumptions of parametric statistical tests (i.e. t-test, regression, ANOVA)

Statistical tables are attached Two Hours UNIVERSITY OF MANCHESTER. May 2007 Final Draft

COMPLETELY RANDOM DESIGN (CRD) -Design can be used when experimental units are essentially homogeneous.

T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS

Week 7.1--IES 612-STA STA doc

Topic 23: Diagnostics and Remedies

One-way ANOVA Model Assumptions

Overview. Prerequisites

Statistics for exp. medical researchers Comparison of groups, T-tests and ANOVA

Correlated data. Overview. Cross-over study. Repetition. Faculty of Health Sciences. Variance component models, II. More on variance component models

Analysis of repeated measurements (KLMED8008)

E509A: Principle of Biostatistics. (Week 11(2): Introduction to non-parametric. methods ) GY Zou.

Outline. Topic 20 - Diagnostics and Remedies. Residuals. Overview. Diagnostics Plots Residual checks Formal Tests. STAT Fall 2013

Statistics in Stata Introduction to Stata

Linear Modelling in Stata Session 6: Further Topics in Linear Modelling

Module 9: Nonparametric Statistics Statistics (OA3102)

1-Way ANOVA MATH 143. Spring Department of Mathematics and Statistics Calvin College

Formula for the t-test

Non-parametric (Distribution-free) approaches p188 CN

Frequency table: Var2 (Spreadsheet1) Count Cumulative Percent Cumulative From To. Percent <x<=

HYPOTHESIS TESTING II TESTS ON MEANS. Sorana D. Bolboacă

Relating Graph to Matlab

ECON Introductory Econometrics. Lecture 17: Experiments

Name: Biostatistics 1 st year Comprehensive Examination: Applied in-class exam. June 8 th, 2016: 9am to 1pm

Example: Four levels of herbicide strength in an experiment on dry weight of treated plants.

Chapter 11. Analysis of Variance (One-Way)

22s:152 Applied Linear Regression. Take random samples from each of m populations.

PLS205 Lab 2 January 15, Laboratory Topic 3

Two Sample Problems. Two sample problems

Lecture 3: Inference in SLR

22s:152 Applied Linear Regression. There are a couple commonly used models for a one-way ANOVA with m groups. Chapter 8: ANOVA

SEVERAL μs AND MEDIANS: MORE ISSUES. Business Statistics

Analysis of variance and regression. April 17, Contents Comparison of several groups One-way ANOVA. Two-way ANOVA Interaction Model checking

APPENDIX B Sample-Size Calculation Methods: Classical Design

STAT 135 Lab 9 Multiple Testing, One-Way ANOVA and Kruskal-Wallis

22s:152 Applied Linear Regression. Chapter 8: 1-Way Analysis of Variance (ANOVA) 2-Way Analysis of Variance (ANOVA)

ECON Introductory Econometrics. Lecture 5: OLS with One Regressor: Hypothesis Tests

6. Multiple regression - PROC GLM

Analysis of Variance

Statistics for Managers Using Microsoft Excel Chapter 9 Two Sample Tests With Numerical Data

Aquatic Toxicology Lab 10 Pimephales promelas Larval Survival and Growth Test Data Analysis 1. Complete test initiated last week 1.

ST4241 Design and Analysis of Clinical Trials Lecture 9: N. Lecture 9: Non-parametric procedures for CRBD

4.1. Introduction: Comparing Means

Topic 17 - Single Factor Analysis of Variance. Outline. One-way ANOVA. The Data / Notation. One way ANOVA Cell means model Factor effects model

Sociology 6Z03 Review II

Hypothesis Testing. Hypothesis: conjecture, proposition or statement based on published literature, data, or a theory that may or may not be true

Biostatistics 270 Kruskal-Wallis Test 1. Kruskal-Wallis Test

STA 303H1F: Two-way Analysis of Variance Practice Problems

N J SS W /df W N - 1

Binomial Model. Lecture 10: Introduction to Logistic Regression. Logistic Regression. Binomial Distribution. n independent trials

Lecture 10: Introduction to Logistic Regression

Nonparametric tests. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 704: Data Analysis I

16.400/453J Human Factors Engineering. Design of Experiments II

Rerandomization to Balance Covariates

Introduction and Descriptive Statistics p. 1 Introduction to Statistics p. 3 Statistics, Science, and Observations p. 5 Populations and Samples p.

Sample Size Determination

Comparison of two samples

WITHIN-PARTICIPANT EXPERIMENTAL DESIGNS

Assessing Model Adequacy

Review of Statistics 101

CHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007)

ST4241 Design and Analysis of Clinical Trials Lecture 7: N. Lecture 7: Non-parametric tests for PDG data

Answer to exercise: Blood pressure lowering drugs

Outline. Analysis of Variance. Comparison of 2 or more groups. Acknowledgements. Comparison of serveral groups

Factorial designs. Experiments

ANOVA: Analysis of Variation

Analysis of Variance

Chapter 9. Inferences from Two Samples. Objective. Notation. Section 9.2. Definition. Notation. q = 1 p. Inferences About Two Proportions

Probabilistic Index Models

Business Statistics (BK/IBA) Tutorial 4 Full solutions

PSY 307 Statistics for the Behavioral Sciences. Chapter 20 Tests for Ranked Data, Choosing Statistical Tests

Models for Clustered Data

Analysis of Variance

Analysis of variance. April 16, Contents Comparison of several groups

Models for Clustered Data

Analysis of variance. April 16, 2009

Analysis of 2x2 Cross-Over Designs using T-Tests

Outline. Analysis of Variance. Acknowledgements. Comparison of 2 or more groups. Comparison of serveral groups

SMA 6304 / MIT / MIT Manufacturing Systems. Lecture 10: Data and Regression Analysis. Lecturer: Prof. Duane S. Boning

Tutorial 4: Power and Sample Size for the Two-sample t-test with Unequal Variances

Topic 28: Unequal Replication in Two-Way ANOVA

Applied Statistics Qualifier Examination (Part II of the STAT AREA EXAM) January 25, 2017; 11:00AM-1:00PM

Biostatistics Quantitative Data

What is a Hypothesis?

ST505/S697R: Fall Homework 2 Solution.

E509A: Principle of Biostatistics. GY Zou

Linear models Analysis of Covariance

Statistics 5100 Spring 2018 Exam 1

Correlation and Simple Linear Regression

Linear models Analysis of Covariance

Mixed Models Lecture Notes By Dr. Hanford page 199 More Statistics& SAS Tutorial at

BIOSTATISTICAL METHODS

What Is ANOVA? Comparing Groups. One-way ANOVA. One way ANOVA (the F ratio test)

Transcription:

Introduction to Crossover Trials Stat 6500 Tutorial Project Isaac Blackhurst

A crossover trial is a type of randomized control trial. It has advantages over other designed experiments because, under certain conditions, it can more precisely measuring the treatment effect. These types of trials are particularly well suited for comparing treatments for those with chronic conditions. Often, a patient only has one chance to be given a treatment for their condition. For example, after a cancer has gone into remission, there is only opportunity to assign one anti-relapse treatment to that patient. However, if we are not expecting to cure a patient, but instead improve their condition or some other measured outcome, we can give the patient a series of treatments in different time periods, and measure their response to each treatment. If we do this for multiple patients, and randomize the order of assigned treatments, we are conducting a crossover trial. A crossover trial is a type of designed experiment called a Latin square design. For the simplest example comparing only two treatments, group 1 will receive one treatment, call it treatment A, in period 1, and a second treatment, treatment B, in period 2. Group 2 will receive treatment B in period 1 and treatment A in period 2. This is a Latin square because every group sees every treatment over the course of the trial, and in every period, there is a group receiving each treatment. An example of this simple type of Latin square crossover trial design deals with patients suffering from enuresis (bedwetting). These data come from the Matthews text referenced below. Group 1 received treatment A, in this case the treatment of interest, in period 1, and a placebo in period 2, and group 2 received the treatments in the reverse order. The measured outcome for this trial was the number of dry nights observed over the treatment period. In this case the treatment period was two weeks. To see why this type of trial has the potential to more precisely measure the treatment effect, some theory will be presented. The notation comes from the Matthews text. (As does most everything I know about these types of trials.)

Before presenting the theory, we should note the problems with assigning every patient to treatment A in the first period, and treatment B in the second period. If we were to do this and we observed that treatment A improve the measured outcome statistically more than treatment B, we cannot conclude that treatment A is better than treatment B. There are two reasons for this. One is that the first treatment, whichever it is, might always do better than the second treatment. This could be considered an order effect, or a trend effect. By randomizing the assignment of each patient to a different order of treatments, we minimize this source of bias in our measurement by having some patients receive the treatments in the reverse order. Another source of bias has to do with what can be called period effects. For example, in one period, all the measurements might be higher, or lower, than another period. Therefore, if treatment A had a statistically different (suppose higher) measured outcome than treatment B, this could be caused by all the measurements in period 1 being higher because of this period effect. To see why randomizing the order gets rid of the period effect, we need to look at the theory below. Also, as one final point before presenting the theory, it should be noted that the periods need to be separated by sufficient time so that in the second time period, we are not measuring residual or left over effects of the treatment given in the previous period. Theory and Notation The notation for the theory is as follows: n 1 and n 2 are the numbers in each group x ij is measured effect in patient i in period j μ is the overall mean

π j is the period effect in period j τ A and τ B are the effects of each treatment ξ i is the effect of the ith patient, distributed N(0, σ ξ 2 ) ε ij is the normally distributed N(0, σ ε 2 ) error for patient i in period j τ is τ A τ B or how much different treatment A is than B π is π 1 π 2 or the period effect of period 1 minus period 2 With this notation, a person assigned to the first group has the following equations for the outcomes in each period: x i1 = μ + π 1 + τ A + ξ i + ε i1 for period 1 x i2 = μ + π 2 + τ B + ξ i + ε i2 for period 2 And for the second group, we see very similar equations, with the exception of the treatment effects being reversed: x i1 = μ + π 1 + τ B + ξ i + ε i1 for period 1 x i2 = μ + π 2 + τ A + ξ i + ε i2 for period 2 And the difference for someone in group 1 simplifies into the following equation: d i = x i1 x i2 = π 1 π 2 + τ A τ B + ε i1 ε i2 We have been able to eliminate the overall mean μ this way, and we have also been able to eliminate ξ i. This elimination of ξ i is perhaps one of the biggest advantages of crossover trials, and the reasons why these types of trials have the potential to be more precise in their measurement of the outcome of

interest. ξ i can be thought of as all the person specific variables that could be confounding factors in the experiment. For other types of situations, to eliminate the effect we might try to match similar patients with each other, or we could do a regression including possible confounding variables for each individual. Or if it's another designed experiment, we hope that all those confounding factors are eliminated in the randomization to assigned treatments, which they should be. We might, however, be unlucky in our randomization and not eliminate these effects. This approach has the potential to be better, because we are able to control for all those factors and eliminate them by how the experiment was designed. If these were a large source of variation, their elimination could lead to better estimates of the treatment effect. Continuing with the notation, the expectation of the last equation is the following, since the epsilons have mean zero: π + τ The difference for someone in group 2 is, which is the exact same, except for the reverse order of the treatment effects: d i = x i1 x i2 = π 1 π 2 + τ B τ A + ε i1 ε i2 or d i = x i1 x i2 = π 1 π 2 (τ A τ B) + ε i1 ε i2 Which in expectation is:

π τ So when you take the expectation of the difference you have: E d 1 d 2 = 2τ This means, for this example, that the average of the difference for group 1 (period 1 outcome minus the period 2 outcome) minus the average of the difference for group 2 (again period 1 outcome minus the period 2 outcome) ends up being twice the difference of the treatment A effect, minus the treatment B effect. This is essentially a two-sample t-test, except that the difference is twice what we would expect. The confidence intervals for a given alpha level are also twice what would be found from a simple two-sample t-test, and so the p-value ends up being the exact same. Continuing with our example about bed wetting, we find that tao equals 2.0368, and it is significant. We conclude that treatment A improves bedwetting by eliminating on average of two wet nights over a two week period. The code and SAS output for this example are in an appendix. Finally, it makes sense that because this is essentially a two sample t-test, if the assumptions of normality are violated, we should be able to use a non-parametric two sample equivalent, like a Wilcoxon Rank Sum test, making adjustments for the measure of location being twice the treatment difference we are interested in. Also, for more than two groups, we should be able to do something like a one-way ANOVA or Kruskal Wallis non-parametric equivalent. However, I wasn't able to find the necessary time to confirm my guess, and so I can't be sure that I was right in that hypothesis.

References Matthews, John, N.S. Introduction to Randomized Controlled Clinical Trials, Second Edition.

Appendix of SAS code and output data enuresis; input patient $ period1 period2 difference group $; datalines; 1 8 5 3 a 2 14 10 4 a 3 8 0 8 a 4 9 7 2 a 5 11 6 5 a... 25 2 4-2 b 26 8 13-5 b 27 9 7 2 b 28 7 10-3 b 29 7 6 1 b ; data enuresis; set enuresis; length group2 $ 8; group2 = "a"; if group = "a" then group2 = "b"; run; proc ttest data = enuresis; var period1 period2 / crossover = (group group2); run;

SAS output Crossover Variable Information Period Response Treatment 1 period1 group 2 period2 group2 Sequence Treatment Period N Mean Std Dev Std Err Minimum Maximum 1 a 1 17 8.1176 3.8387 0.9310 0 14.0000 2 a 2 12 8.9167 2.8110 0.8115 4.0000 14.0000 2 b 1 12 7.6667 2.9949 0.8646 2.0000 13.0000 1 b 2 17 5.2941 4.2539 1.0317 0 13.0000 1 Diff (1-2) 17 2.8235 3.4683 0.8412-2.0000 9.0000 2 Diff (1-2) 12 1.2500 2.9886 0.8627-4.0000 6.0000 Both Diff (1-2) 2.0368 1.6407 0.6186 Both Diff (1-2) 0.7868 1.6407 0.6186 Sequence Treatment Period Method Mean 95% CL Mean Std Dev 95% CL Std Dev 1 a 1 8.1176 6.1440 10.0913 3.8387 2.8589 5.8422 2 a 2 8.9167 7.1307 10.7027 2.8110 1.9913 4.7727 2 b 1 7.6667 5.7638 9.5696 2.9949 2.1216 5.0851 1 b 2 5.2941 3.1070 7.4813 4.2539 3.1682 6.4741 1 Diff (1-2) 2.8235 1.0403 4.6068 3.4683 2.5831 5.2786 2 Diff (1-2) 1.2500-0.6489 3.1489 2.9886 2.1171 5.0743 Both Diff (1-2) Pooled 2.0368 0.7675 3.3060 1.6407 1.2972 2.2332 Both Diff (1-2) Satterthwaite 2.0368 0.7979 3.2756 Both Diff (1-2) Pooled 0.7868-0.4825 2.0560 1.6407 1.2972 2.2332 Both Diff (1-2) Satterthwaite 0.7868-0.4521 2.0256

Treatment Period Method Variances DF t Value Pr > t Diff (1-2) Pooled Equal 27 3.29 0.0028 Diff (1-2) Satterthwaite Unequal 25.816 3.38 0.0023 Diff (1-2) Pooled Equal 27 1.27 0.2143 Diff (1-2) Satterthwaite Unequal 25.816 1.31 0.2031 Equality of Variances Method Num DF Den DF F Value Pr > F Folded F 16 11 1.35 0.6260