Bios 6648: Design & conduct of clinical research

Size: px
Start display at page:

Download "Bios 6648: Design & conduct of clinical research"

Transcription

1 Bios 6648: Design & conduct of clinical research Section 2 - Formulating the scientific and statistical design designs 2.5(b) Binary 2.5(c) Skewed baseline (a) Time-to-event (revisited) (b) Binary (revisited) (c) Skewed (d) Change-from-baseline Bios pg 1

2 designs (d) Change from baseline Outline: 1. Motivation and data structure 2. Approaches to defining when an endpoint is measured at baseline and follow-up 3. Other applications 4. Statistical design (sample size and CI evaluations) 2.5(b) Binary 2.5(c) Skewed baseline Bios pg 2

3 (d) Change from baseline 1. Motivation and data structure Why measure change? Usually: within-individual change is less variable. For example, consider the CHEST data: Subj Trt 6 Minute walk distance ID Grp Baseline 12-weeks Change (b) Binary 2.5(c) Skewed baseline Bios pg 3

4 (d) Change from baseline 1. Motivation and data structure CHEST Trial summary statistics Study Placebo Riociguat Visit Mean (sd) Mean (sd) Baseline ( 74.7) ( 81.9) 16-weeks (122.2) (119.2) Change -5.5 ( 84.3) 38.9 ( 79.3) 2.5(b) Binary 2.5(c) Skewed baseline Approaches to analysis: Compare 6MWD after 16-weeks Compare 16-week improvement in 6MWD Linear regression: 16-week 6MWD conditional on baseline walk distance. Bios pg 4

5 (d) Change from baseline 2. Approaches to outcome definition Evaluate difference at last time measurement time: Outcome: Final measurement on each subject. Functional: θ1f and θ 0F denote the mean outcome with active and control therapy at the final measurement time. Contrast: θ = θ1f θ 0F. Example (CHEST) T-test of 16-week walk distance: 2.5(b) Binary 2.5(c) Skewed baseline ˆθ = ˆθ 1 ˆθ 0 = = se(ˆθ) = = %CI = ± se = ( , 62.08) p value = Result: At 16-week mean walk distance with Riociguat is meters farther than placebo (95% CI: to meters; p = 0.055). Notice inconclusive result. Bios pg 5

6 (d) Change from baseline 2. Approaches to outcome definition Evaluate change over follow-up period: Outcome: Change in outcome measurement over the time (final minus baseline) Functional: θ 1 = θ 1F θ 1B (mean change with active treatment) θ 0 = θ 0F θ 0B (mean change with control treatment) Contrast: θ = θ1 θ 0. CHEST: T-test of change in walk distance over 16 weeks: 2.5(b) Binary 2.5(c) Skewed baseline ˆθ = ˆθ 1 ˆθ 0 = 38.9 ( 5.5) = se(ˆθ) = = %CI = ± se = (23.05, 65.78) p value = Result: Over 16-weeks Riociguat treatment improves mean walk distance by meters (95% CI: to meters; p < ) more than the improvement with placebo. Bios pg 6

7 (d) Change from baseline 2. Approaches to outcome definition Last measurement time adjusting for baseline: Outcome: Outcome at the last measurement time (adjusting for baseline value) Functional: outcome at last measurement time Contrast: Difference in mean adjusted for baseline levels (θ in the following regression model): 2.5(b) Binary 2.5(c) Skewed baseline θ kf = β 0 + θtx + β 1 θ kb where Tx is the indicator for active treatment. CHEST: Fit the linear regression model: Y if = β 0 + θtx i + β 1 Y ib Estimate Std. Error t value Pr(> t ) (Intercept) e-02 exmpl[, "RioTx"] e-05 exmpl[, "base6"] e-47 That is: ˆβ0 = ˆθ = ˆβ 1 = Bios pg 7

8 (d) Change from baseline 2. Approaches to outcome definition Last measurement time adjusting for baseline: CHEST Result (con t): Point estimate: Among two populations with the same baseline walk distance, after 16 weeks a population taking riociguat will end up walking 46.1 meters farther than a population taking placebo. 95% CI: (25.35, 66.84) P-value: Comparison of approaches: 2.5(b) Binary 2.5(c) Skewed baseline Approach Estimate 95% CI p-value 16-week difference (-0.55, 62.08) Change from baseline (23.05, 65.78) Adjusting for baseline 46.1 (25.35, 66.84) Bios pg 8

9 (d) Change from baseline Graphical depiction of the CHEST results 16 Week WD P R Placebo Riociguat 2.5(b) Binary 2.5(c) Skewed baseline P R Baseline WD Bios pg 9

10 (d) Change from baseline Notes on previous graph: Boxplots on left represent the data used for a t-test of 16-week differences. Regression lines are the result of the above linear regression analysis. Vertical distance between regression lines is 46.1 meters (i.e., the effect of riociguat adjusted for baseline walk distance). Bloxplots on bottom show no confounding (similar distribution of baseline WD). This example shows how we can increase power by adjusting for a precision variable. I now illustrate the general behavior in a series of graphs 2.5(b) Binary 2.5(c) Skewed baseline Bios pg 10

11 Precision variables (in linear regression) A precision variable reduces noise" (extraneous variation) so that the relationship between outcome and the primary explanatory variable is more precise. A precision variables must be: unrelated to the primary explanatory variable. an independent predictor of outcome. Adjusting for a precision variable increases precision for the comparison of interest. 2.5(b) Binary 2.5(c) Skewed baseline Bios pg 11

12 Example (Precision variable) The nature of a precision variable can be illustrated using scatterplots. Let: Y denotes outcome X denotes primary explanatory variable (2-categories: H and L) Z denotes a covariate (precision variable) We are interested in the relationship between X and Y. In the following plots: The relationship between X and Y is fixed. There is no relationship between X and Z (the precision variable is unrelated to the explanatory variable). The relationship (correlation) between Y and Z is increasing. 2.5(b) Binary 2.5(c) Skewed baseline Bios pg 12

13 Example (Figure 1a) Response (Y) L H (b) Binary 2.5(c) Skewed baseline L H Covariate (Z) Bios pg 13

14 Example (Figure 1b) Response (Y) L H (b) Binary 2.5(c) Skewed baseline L H Covariate (Z) Bios pg 14

15 Example (Figure 1c) Response (Y) L H (b) Binary 2.5(c) Skewed baseline L H Covariate (Z) Bios pg 15

16 Example (Figure 1d) Response (Y) L H (b) Binary 2.5(c) Skewed baseline L H Covariate (Z) Bios pg 16

17 Example (Figure 1e) Response (Y) L H (b) Binary 2.5(c) Skewed baseline L H Covariate (Z) Bios pg 17

18 Example (Figure 1f) Response (Y) L H (b) Binary 2.5(c) Skewed baseline L H Covariate (Z) Bios pg 18

19 Example (Figure 1g) Response (Y) L H (b) Binary 2.5(c) Skewed baseline L H Covariate (Z) Bios pg 19

20 Example (Figure 1h) Response (Y) L H (b) Binary 2.5(c) Skewed baseline L H Covariate (Z) Bios pg 20

21 Example (Figure 1i) Response (Y) L H (b) Binary 2.5(c) Skewed baseline L H Covariate (Z) Bios pg 21

22 Example (Figure 1j) Response (Y) L H (b) Binary 2.5(c) Skewed baseline L H Covariate (Z) Bios pg 22

23 Example (Figure 1k) Response (Y) L H (b) Binary 2.5(c) Skewed baseline L H Covariate (Z) Bios pg 23

24 Example (Precision variable) We would like to compare the results of an analysis if we ignore the precision variable (the crude" difference) with the results after we adjust for the precision variable (the adjusted difference"). (Z,Y) Crude Difference Adjusted Difference Correlation Estimate SE Estimate SE Fig 1a Fig 1b Fig 1c Fig 1d Fig 1e Fig 1f Fig 1g Fig 1h Fig 1i Fig 1j Fig 1k (b) Binary 2.5(c) Skewed baseline Bios pg 24

25 Example (Precision variable) Notice: Crude and adjustments are identical Standard error (SE) of the adjusted estimate is smaller than the standard error of the crude estimate (Note: smaller SE gives more power) The precision of the adjusted estimate increases with the correlation between Y and Z The precision variable is explaining" some of the variation (reducing the noise) in the primary comparison 2.5(b) Binary 2.5(c) Skewed baseline Bios pg 25

26 (d) Change from baseline 3. Other applications Other situations in which the primary analysis is adjusted for baseline values: Common to adjust for stratification variables May adjust for scientific interpretability May adjust for comparability to previous studies It is important to pre-specify all adjustments as part of your primary analysis. 2.5(b) Binary 2.5(c) Skewed baseline Bios pg 26

27 (d) Change from baseline 4. Implications for power and information The above examples and graphical illustration illustrate general principles: Variance (precision) of various approaches to defining with change from baseline data. It is particularly clear when σ0 = σ 1 (= σ) and N 1 = N 0 (= N): 2.5(b) Binary 2.5(c) Skewed baseline Variance when comparing only the last time point: var(ˆθ) = 2σ2 N Variance when comparing change from baseline: var(ˆθ) = 4σ2 (1 ρ) N Variance when comparisons are adjusted for baseline: var(ˆθ) = 2σ2 (1 ρ 2 ) N In all of the above, ρ is the correlation between baseline and follow-up measures. (1) (2) (3) Bios pg 27

28 (d) Change from baseline 4. Implications for power and information The above relationships can be used to prove: If ρ < 0.5, then it is more powerful to analyze follow-up differences (i.e., DO NOT compare change from baseline). If ρ > 0.5 then it is more powerful to analyze change from baseline than follow-up differences. it is always more powerful to us regression to adjust for baseline: This is also known as Analysis of covariance" (ANCOVA). The CHEST paper refers to it as the least-squares" estimate. 2.5(b) Binary 2.5(c) Skewed baseline Bios pg 28

29 (d) Change from baseline 4. Implications for power and information Relative sample size of the analytic approaches: Change from baseline relative to follow-up only: 2.5(b) Binary 2.5(c) Skewed baseline 4σ 2 (1 ρ) 2σ 2 = 2(1 ρ) ANCOVA relative to follow-up only: 2σ 2 (1 ρ 2 ) 2σ 2 = (1 ρ 2 ) Bios pg 29

30 (d) Change from baseline 4. Implications for power and information Relative sample size of analytic approaches as function of correlation: Relative sample size Sample size relative to follow up only analysis Follow up only Change from baseline ANCOVA 2.5(b) Binary 2.5(c) Skewed baseline Correlation Bios pg 30

31 (d) Change from baseline 4. Implications for power and information How to evaluate information and sample size with change from baseline : Follow the same approaches described for other * Potential inference * Power and hypotheses discriminated Evaluate sensitivity to assumptions about correlation * As with other design parameters it may be necessary to evaluate sensitivity to design parameters. * See example below (b) Binary 2.5(c) Skewed baseline Bios pg 31

32 Evaluating information in a change from baseline design Use the form for V N given above (equations (1)-(3)) as we have previously done: Potential inference given V * Critical value = 1.96 N : V N * 95% CI at critical value = ( ) V 0, 3.92 N 2.5(b) Binary 2.5(c) Skewed baseline Find the power given V N : z β = θ + θ V N z α Sample size needed for power β: N = [ ] 2 zα + z β V θ + θ Bios pg 32

33 Evaluating information in a change from baseline design Power as a function of sample size, variance, and correlation Total sample size σ Correlation = Correlation = Correlation = Correlation = For example: With 350 subjects (175 per group) an ANCOVA analysis has 97.6% power if σ = 100 and and correlation is moderately high (ρ = 0.7). 2.5(b) Binary 2.5(c) Skewed baseline Bios pg 33

34 (d) Change from baseline 4. Implications for power and information Frequent asked questions (FAQ) about the ANCOVA analysis: The ANCOVA model described above fits parallel lines. What happens if the lines are not parallel? * Non-parallel lines represents interaction; treatment works better (or worse) for low baseline values. * Interactions are explored in subsequent trials. What happens if the relationships are not linear? * Not a problem as long as baseline distribution is the same in both treatment groups (assured by randomization). * The line represents the first order approximation to the curve (i.e., is it treading up, down, or flat?). 2.5(b) Binary 2.5(c) Skewed baseline Bios pg 34

35 (d) Change from baseline 4. Implications for power and information Example: ANCOVA FAQ s in the CHEST trial No evidence for substantial non-linearity: 16 Week WD Relationship between baseline and 16 week walk distance Placebo Riociguat 2.5(b) Binary 2.5(c) Skewed baseline Baseline WD Bios pg 35

36 (d) Change from baseline 4. Implications for power and information Example: ANCOVA FAQ s in the CHEST trial Separate lines in each treatment group are nearly parallel: 16 Week WD Relationship between baseline and 16 week walk distance Placebo Riociguat 2.5(b) Binary 2.5(c) Skewed baseline Baseline WD No problem with interaction. Bios pg 36

37 (d) Change from baseline 4. Implications for power and information Example: ANCOVA FAQ s in the CHEST trial Separate lines in each treatment group are nearly parallel: 16 Week WD Relationship between baseline and 16 week walk distance Placebo Riociguat 2.5(b) Binary 2.5(c) Skewed baseline Baseline WD No problem with interaction. Bios pg 37

Bios 6649: Clinical Trials - Statistical Design and Monitoring

Bios 6649: Clinical Trials - Statistical Design and Monitoring Bios 6649: Clinical Trials - Statistical Design and Monitoring Spring Semester 2015 John M. Kittelson Department of Biostatistics & Informatics Colorado School of Public Health University of Colorado Denver

More information

BIOS 6649: Handout Exercise Solution

BIOS 6649: Handout Exercise Solution BIOS 6649: Handout Exercise Solution NOTE: I encourage you to work together, but the work you submit must be your own. Any plagiarism will result in loss of all marks. This assignment is based on weight-loss

More information

1. (Rao example 11.15) A study measures oxygen demand (y) (on a log scale) and five explanatory variables (see below). Data are available as

1. (Rao example 11.15) A study measures oxygen demand (y) (on a log scale) and five explanatory variables (see below). Data are available as ST 51, Summer, Dr. Jason A. Osborne Homework assignment # - Solutions 1. (Rao example 11.15) A study measures oxygen demand (y) (on a log scale) and five explanatory variables (see below). Data are available

More information

Bios 6648: Design & conduct of clinical research

Bios 6648: Design & conduct of clinical research Bios 6648: Design & conduct of clinical research Section 2 - Formulating the scientific and statistical design designs 2.5(b) Binary (a) Time-to-event (revisited) (b) Binary (revisited) (c) Skewed (d)

More information

Acknowledgements. Outline. Marie Diener-West. ICTR Leadership / Team INTRODUCTION TO CLINICAL RESEARCH. Introduction to Linear Regression

Acknowledgements. Outline. Marie Diener-West. ICTR Leadership / Team INTRODUCTION TO CLINICAL RESEARCH. Introduction to Linear Regression INTRODUCTION TO CLINICAL RESEARCH Introduction to Linear Regression Karen Bandeen-Roche, Ph.D. July 17, 2012 Acknowledgements Marie Diener-West Rick Thompson ICTR Leadership / Team JHU Intro to Clinical

More information

BIOS 312: Precision of Statistical Inference

BIOS 312: Precision of Statistical Inference and Power/Sample Size and Standard Errors BIOS 312: of Statistical Inference Chris Slaughter Department of Biostatistics, Vanderbilt University School of Medicine January 3, 2013 Outline Overview and Power/Sample

More information

Linear Modelling in Stata Session 6: Further Topics in Linear Modelling

Linear Modelling in Stata Session 6: Further Topics in Linear Modelling Linear Modelling in Stata Session 6: Further Topics in Linear Modelling Mark Lunt Arthritis Research UK Epidemiology Unit University of Manchester 14/11/2017 This Week Categorical Variables Categorical

More information

Accounting for Baseline Observations in Randomized Clinical Trials

Accounting for Baseline Observations in Randomized Clinical Trials Accounting for Baseline Observations in Randomized Clinical Trials Scott S Emerson, MD, PhD Department of Biostatistics, University of Washington, Seattle, WA 9895, USA October 6, 0 Abstract In clinical

More information

De-mystifying random effects models

De-mystifying random effects models De-mystifying random effects models Peter J Diggle Lecture 4, Leahurst, October 2012 Linear regression input variable x factor, covariate, explanatory variable,... output variable y response, end-point,

More information

Accounting for Baseline Observations in Randomized Clinical Trials

Accounting for Baseline Observations in Randomized Clinical Trials Accounting for Baseline Observations in Randomized Clinical Trials Scott S Emerson, MD, PhD Department of Biostatistics, University of Washington, Seattle, WA 9895, USA August 5, 0 Abstract In clinical

More information

Lecture 4 Multiple linear regression

Lecture 4 Multiple linear regression Lecture 4 Multiple linear regression BIOST 515 January 15, 2004 Outline 1 Motivation for the multiple regression model Multiple regression in matrix notation Least squares estimation of model parameters

More information

Extensions of One-Way ANOVA.

Extensions of One-Way ANOVA. Extensions of One-Way ANOVA http://www.pelagicos.net/classes_biometry_fa17.htm What do I want You to Know What are two main limitations of ANOVA? What two approaches can follow a significant ANOVA? How

More information

Topic 12 Overview of Estimation

Topic 12 Overview of Estimation Topic 12 Overview of Estimation Classical Statistics 1 / 9 Outline Introduction Parameter Estimation Classical Statistics Densities and Likelihoods 2 / 9 Introduction In the simplest possible terms, the

More information

Categorical Predictor Variables

Categorical Predictor Variables Categorical Predictor Variables We often wish to use categorical (or qualitative) variables as covariates in a regression model. For binary variables (taking on only 2 values, e.g. sex), it is relatively

More information

STA441: Spring Multiple Regression. More than one explanatory variable at the same time

STA441: Spring Multiple Regression. More than one explanatory variable at the same time STA441: Spring 2016 Multiple Regression More than one explanatory variable at the same time This slide show is a free open source document. See the last slide for copyright information. One Explanatory

More information

Lecture 10: Introduction to Logistic Regression

Lecture 10: Introduction to Logistic Regression Lecture 10: Introduction to Logistic Regression Ani Manichaikul amanicha@jhsph.edu 2 May 2007 Logistic Regression Regression for a response variable that follows a binomial distribution Recall the binomial

More information

Applied Regression. Applied Regression. Chapter 2 Simple Linear Regression. Hongcheng Li. April, 6, 2013

Applied Regression. Applied Regression. Chapter 2 Simple Linear Regression. Hongcheng Li. April, 6, 2013 Applied Regression Chapter 2 Simple Linear Regression Hongcheng Li April, 6, 2013 Outline 1 Introduction of simple linear regression 2 Scatter plot 3 Simple linear regression model 4 Test of Hypothesis

More information

Extensions of One-Way ANOVA.

Extensions of One-Way ANOVA. Extensions of One-Way ANOVA http://www.pelagicos.net/classes_biometry_fa18.htm What do I want You to Know What are two main limitations of ANOVA? What two approaches can follow a significant ANOVA? How

More information

Scatter plot of data from the study. Linear Regression

Scatter plot of data from the study. Linear Regression 1 2 Linear Regression Scatter plot of data from the study. Consider a study to relate birthweight to the estriol level of pregnant women. The data is below. i Weight (g / 100) i Weight (g / 100) 1 7 25

More information

Lecture Outline. Biost 518 Applied Biostatistics II. Choice of Model for Analysis. Choice of Model. Choice of Model. Lecture 10: Multiple Regression:

Lecture Outline. Biost 518 Applied Biostatistics II. Choice of Model for Analysis. Choice of Model. Choice of Model. Lecture 10: Multiple Regression: Biost 518 Applied Biostatistics II Scott S. Emerson, M.D., Ph.D. Professor of Biostatistics University of Washington Lecture utline Choice of Model Alternative Models Effect of data driven selection of

More information

Regression #4: Properties of OLS Estimator (Part 2)

Regression #4: Properties of OLS Estimator (Part 2) Regression #4: Properties of OLS Estimator (Part 2) Econ 671 Purdue University Justin L. Tobias (Purdue) Regression #4 1 / 24 Introduction In this lecture, we continue investigating properties associated

More information

Lecture 14: Introduction to Poisson Regression

Lecture 14: Introduction to Poisson Regression Lecture 14: Introduction to Poisson Regression Ani Manichaikul amanicha@jhsph.edu 8 May 2007 1 / 52 Overview Modelling counts Contingency tables Poisson regression models 2 / 52 Modelling counts I Why

More information

Modelling counts. Lecture 14: Introduction to Poisson Regression. Overview

Modelling counts. Lecture 14: Introduction to Poisson Regression. Overview Modelling counts I Lecture 14: Introduction to Poisson Regression Ani Manichaikul amanicha@jhsph.edu Why count data? Number of traffic accidents per day Mortality counts in a given neighborhood, per week

More information

Basics of Experimental Design. Review of Statistics. Basic Study. Experimental Design. When an Experiment is Not Possible. Studying Relations

Basics of Experimental Design. Review of Statistics. Basic Study. Experimental Design. When an Experiment is Not Possible. Studying Relations Basics of Experimental Design Review of Statistics And Experimental Design Scientists study relation between variables In the context of experiments these variables are called independent and dependent

More information

Scatter plot of data from the study. Linear Regression

Scatter plot of data from the study. Linear Regression 1 2 Linear Regression Scatter plot of data from the study. Consider a study to relate birthweight to the estriol level of pregnant women. The data is below. i Weight (g / 100) i Weight (g / 100) 1 7 25

More information

Dr. Allen Back. Sep. 23, 2016

Dr. Allen Back. Sep. 23, 2016 Dr. Allen Back Sep. 23, 2016 Look at All the Data Graphically A Famous Example: The Challenger Tragedy Look at All the Data Graphically A Famous Example: The Challenger Tragedy Type of Data Looked at the

More information

Business Statistics. Lecture 10: Course Review

Business Statistics. Lecture 10: Course Review Business Statistics Lecture 10: Course Review 1 Descriptive Statistics for Continuous Data Numerical Summaries Location: mean, median Spread or variability: variance, standard deviation, range, percentiles,

More information

AP Statistics Unit 6 Note Packet Linear Regression. Scatterplots and Correlation

AP Statistics Unit 6 Note Packet Linear Regression. Scatterplots and Correlation Scatterplots and Correlation Name Hr A scatterplot shows the relationship between two quantitative variables measured on the same individuals. variable (y) measures an outcome of a study variable (x) may

More information

Statistics Boot Camp. Dr. Stephanie Lane Institute for Defense Analyses DATAWorks 2018

Statistics Boot Camp. Dr. Stephanie Lane Institute for Defense Analyses DATAWorks 2018 Statistics Boot Camp Dr. Stephanie Lane Institute for Defense Analyses DATAWorks 2018 March 21, 2018 Outline of boot camp Summarizing and simplifying data Point and interval estimation Foundations of statistical

More information

Personalized Treatment Selection Based on Randomized Clinical Trials. Tianxi Cai Department of Biostatistics Harvard School of Public Health

Personalized Treatment Selection Based on Randomized Clinical Trials. Tianxi Cai Department of Biostatistics Harvard School of Public Health Personalized Treatment Selection Based on Randomized Clinical Trials Tianxi Cai Department of Biostatistics Harvard School of Public Health Outline Motivation A systematic approach to separating subpopulations

More information

Review of Multiple Regression

Review of Multiple Regression Ronald H. Heck 1 Let s begin with a little review of multiple regression this week. Linear models [e.g., correlation, t-tests, analysis of variance (ANOVA), multiple regression, path analysis, multivariate

More information

Analysis of Longitudinal Data. Patrick J. Heagerty PhD Department of Biostatistics University of Washington

Analysis of Longitudinal Data. Patrick J. Heagerty PhD Department of Biostatistics University of Washington Analsis of Longitudinal Data Patrick J. Heagert PhD Department of Biostatistics Universit of Washington 1 Auckland 2008 Session Three Outline Role of correlation Impact proper standard errors Used to weight

More information

UNIVERSITY OF MASSACHUSETTS. Department of Mathematics and Statistics. Basic Exam - Applied Statistics. Tuesday, January 17, 2017

UNIVERSITY OF MASSACHUSETTS. Department of Mathematics and Statistics. Basic Exam - Applied Statistics. Tuesday, January 17, 2017 UNIVERSITY OF MASSACHUSETTS Department of Mathematics and Statistics Basic Exam - Applied Statistics Tuesday, January 17, 2017 Work all problems 60 points are needed to pass at the Masters Level and 75

More information

Binomial Model. Lecture 10: Introduction to Logistic Regression. Logistic Regression. Binomial Distribution. n independent trials

Binomial Model. Lecture 10: Introduction to Logistic Regression. Logistic Regression. Binomial Distribution. n independent trials Lecture : Introduction to Logistic Regression Ani Manichaikul amanicha@jhsph.edu 2 May 27 Binomial Model n independent trials (e.g., coin tosses) p = probability of success on each trial (e.g., p =! =

More information

Inference for Regression Inference about the Regression Model and Using the Regression Line, with Details. Section 10.1, 2, 3

Inference for Regression Inference about the Regression Model and Using the Regression Line, with Details. Section 10.1, 2, 3 Inference for Regression Inference about the Regression Model and Using the Regression Line, with Details Section 10.1, 2, 3 Basic components of regression setup Target of inference: linear dependency

More information

Beyond GLM and likelihood

Beyond GLM and likelihood Stat 6620: Applied Linear Models Department of Statistics Western Michigan University Statistics curriculum Core knowledge (modeling and estimation) Math stat 1 (probability, distributions, convergence

More information

General Linear Model (Chapter 4)

General Linear Model (Chapter 4) General Linear Model (Chapter 4) Outcome variable is considered continuous Simple linear regression Scatterplots OLS is BLUE under basic assumptions MSE estimates residual variance testing regression coefficients

More information

Estimating the Mean Response of Treatment Duration Regimes in an Observational Study. Anastasios A. Tsiatis.

Estimating the Mean Response of Treatment Duration Regimes in an Observational Study. Anastasios A. Tsiatis. Estimating the Mean Response of Treatment Duration Regimes in an Observational Study Anastasios A. Tsiatis http://www.stat.ncsu.edu/ tsiatis/ Introduction to Dynamic Treatment Regimes 1 Outline Description

More information

Chapter 12 - Lecture 2 Inferences about regression coefficient

Chapter 12 - Lecture 2 Inferences about regression coefficient Chapter 12 - Lecture 2 Inferences about regression coefficient April 19th, 2010 Facts about slope Test Statistic Confidence interval Hypothesis testing Test using ANOVA Table Facts about slope In previous

More information

Statistical View of Least Squares

Statistical View of Least Squares May 23, 2006 Purpose of Regression Some Examples Least Squares Purpose of Regression Purpose of Regression Some Examples Least Squares Suppose we have two variables x and y Purpose of Regression Some Examples

More information

Practical Considerations Surrounding Normality

Practical Considerations Surrounding Normality Practical Considerations Surrounding Normality Prof. Kevin E. Thorpe Dalla Lana School of Public Health University of Toronto KE Thorpe (U of T) Normality 1 / 16 Objectives Objectives 1. Understand the

More information

401 Review. 6. Power analysis for one/two-sample hypothesis tests and for correlation analysis.

401 Review. 6. Power analysis for one/two-sample hypothesis tests and for correlation analysis. 401 Review Major topics of the course 1. Univariate analysis 2. Bivariate analysis 3. Simple linear regression 4. Linear algebra 5. Multiple regression analysis Major analysis methods 1. Graphical analysis

More information

Density Temp vs Ratio. temp

Density Temp vs Ratio. temp Temp Ratio Density 0.00 0.02 0.04 0.06 0.08 0.10 0.12 Density 0.0 0.2 0.4 0.6 0.8 1.0 1. (a) 170 175 180 185 temp 1.0 1.5 2.0 2.5 3.0 ratio The histogram shows that the temperature measures have two peaks,

More information

PubH 7470: STATISTICS FOR TRANSLATIONAL & CLINICAL RESEARCH

PubH 7470: STATISTICS FOR TRANSLATIONAL & CLINICAL RESEARCH PubH 7470: STATISTICS FOR TRANSLATIONAL & CLINICAL RESEARCH The First Step: SAMPLE SIZE DETERMINATION THE ULTIMATE GOAL The most important, ultimate step of any of clinical research is to do draw inferences;

More information

Review of Statistics 101

Review of Statistics 101 Review of Statistics 101 We review some important themes from the course 1. Introduction Statistics- Set of methods for collecting/analyzing data (the art and science of learning from data). Provides methods

More information

Structural Nested Mean Models for Assessing Time-Varying Effect Moderation. Daniel Almirall

Structural Nested Mean Models for Assessing Time-Varying Effect Moderation. Daniel Almirall 1 Structural Nested Mean Models for Assessing Time-Varying Effect Moderation Daniel Almirall Center for Health Services Research, Durham VAMC & Dept. of Biostatistics, Duke University Medical Joint work

More information

Chapter 3: Examining Relationships

Chapter 3: Examining Relationships Chapter 3: Examining Relationships Most statistical studies involve more than one variable. Often in the AP Statistics exam, you will be asked to compare two data sets by using side by side boxplots or

More information

Quoting from the document I suggested you read (http://courses.ttu.edu/isqs5349 westfall/images/5349/practiceproblems_discussion.

Quoting from the document I suggested you read (http://courses.ttu.edu/isqs5349 westfall/images/5349/practiceproblems_discussion. Spring 14, ISQS 5349 Midterm 1. Instructions: Closed book, notes and no electronic devices. Put all answers on scratch paper provided. Points (out of 100) are in parentheses. 1. (20) Define regression

More information

Example. Multiple Regression. Review of ANOVA & Simple Regression /749 Experimental Design for Behavioral and Social Sciences

Example. Multiple Regression. Review of ANOVA & Simple Regression /749 Experimental Design for Behavioral and Social Sciences 36-309/749 Experimental Design for Behavioral and Social Sciences Sep. 29, 2015 Lecture 5: Multiple Regression Review of ANOVA & Simple Regression Both Quantitative outcome Independent, Gaussian errors

More information

Group Sequential Tests for Delayed Responses

Group Sequential Tests for Delayed Responses Group Sequential Tests for Delayed Responses Lisa Hampson Department of Mathematics and Statistics, Lancaster University, UK Chris Jennison Department of Mathematical Sciences, University of Bath, UK Read

More information

Biost 518 Applied Biostatistics II. Purpose of Statistics. First Stage of Scientific Investigation. Further Stages of Scientific Investigation

Biost 518 Applied Biostatistics II. Purpose of Statistics. First Stage of Scientific Investigation. Further Stages of Scientific Investigation Biost 58 Applied Biostatistics II Scott S. Emerson, M.D., Ph.D. Professor of Biostatistics University of Washington Lecture 5: Review Purpose of Statistics Statistics is about science (Science in the broadest

More information

Hypothesis Testing The basic ingredients of a hypothesis test are

Hypothesis Testing The basic ingredients of a hypothesis test are Hypothesis Testing The basic ingredients of a hypothesis test are 1 the null hypothesis, denoted as H o 2 the alternative hypothesis, denoted as H a 3 the test statistic 4 the data 5 the conclusion. The

More information

Statistics in medicine

Statistics in medicine Statistics in medicine Lecture 4: and multivariable regression Fatma Shebl, MD, MS, MPH, PhD Assistant Professor Chronic Disease Epidemiology Department Yale School of Public Health Fatma.shebl@yale.edu

More information

Case Study in the Use of Bayesian Hierarchical Modeling and Simulation for Design and Analysis of a Clinical Trial

Case Study in the Use of Bayesian Hierarchical Modeling and Simulation for Design and Analysis of a Clinical Trial Case Study in the Use of Bayesian Hierarchical Modeling and Simulation for Design and Analysis of a Clinical Trial William R. Gillespie Pharsight Corporation Cary, North Carolina, USA PAGE 2003 Verona,

More information

Logistic regression model for survival time analysis using time-varying coefficients

Logistic regression model for survival time analysis using time-varying coefficients Logistic regression model for survival time analysis using time-varying coefficients Accepted in American Journal of Mathematical and Management Sciences, 2016 Kenichi SATOH ksatoh@hiroshima-u.ac.jp Research

More information

Business Statistics. Lecture 10: Correlation and Linear Regression

Business Statistics. Lecture 10: Correlation and Linear Regression Business Statistics Lecture 10: Correlation and Linear Regression Scatterplot A scatterplot shows the relationship between two quantitative variables measured on the same individuals. It displays the Form

More information

Lab 8. Matched Case Control Studies

Lab 8. Matched Case Control Studies Lab 8 Matched Case Control Studies Control of Confounding Technique for the control of confounding: At the design stage: Matching During the analysis of the results: Post-stratification analysis Advantage

More information

Practical Biostatistics

Practical Biostatistics Practical Biostatistics Clinical Epidemiology, Biostatistics and Bioinformatics AMC Multivariable regression Day 5 Recap Describing association: Correlation Parametric technique: Pearson (PMCC) Non-parametric:

More information

Final Exam. Economics 835: Econometrics. Fall 2010

Final Exam. Economics 835: Econometrics. Fall 2010 Final Exam Economics 835: Econometrics Fall 2010 Please answer the question I ask - no more and no less - and remember that the correct answer is often short and simple. 1 Some short questions a) For each

More information

Describing Contingency tables

Describing Contingency tables Today s topics: Describing Contingency tables 1. Probability structure for contingency tables (distributions, sensitivity/specificity, sampling schemes). 2. Comparing two proportions (relative risk, odds

More information

BIOS 2083: Linear Models

BIOS 2083: Linear Models BIOS 2083: Linear Models Abdus S Wahed September 2, 2009 Chapter 0 2 Chapter 1 Introduction to linear models 1.1 Linear Models: Definition and Examples Example 1.1.1. Estimating the mean of a N(μ, σ 2

More information

Faculty of Health Sciences. Regression models. Counts, Poisson regression, Lene Theil Skovgaard. Dept. of Biostatistics

Faculty of Health Sciences. Regression models. Counts, Poisson regression, Lene Theil Skovgaard. Dept. of Biostatistics Faculty of Health Sciences Regression models Counts, Poisson regression, 27-5-2013 Lene Theil Skovgaard Dept. of Biostatistics 1 / 36 Count outcome PKA & LTS, Sect. 7.2 Poisson regression The Binomial

More information

Chapter 10: Inferences based on two samples

Chapter 10: Inferences based on two samples November 16 th, 2017 Overview Week 1 Week 2 Week 4 Week 7 Week 10 Week 12 Chapter 1: Descriptive statistics Chapter 6: Statistics and Sampling Distributions Chapter 7: Point Estimation Chapter 8: Confidence

More information

Meta-Analysis for Diagnostic Test Data: a Bayesian Approach

Meta-Analysis for Diagnostic Test Data: a Bayesian Approach Meta-Analysis for Diagnostic Test Data: a Bayesian Approach Pablo E. Verde Coordination Centre for Clinical Trials Heinrich Heine Universität Düsseldorf Preliminaries: motivations for systematic reviews

More information

Categorical data analysis Chapter 5

Categorical data analysis Chapter 5 Categorical data analysis Chapter 5 Interpreting parameters in logistic regression The sign of β determines whether π(x) is increasing or decreasing as x increases. The rate of climb or descent increases

More information

Psychology 282 Lecture #4 Outline Inferences in SLR

Psychology 282 Lecture #4 Outline Inferences in SLR Psychology 282 Lecture #4 Outline Inferences in SLR Assumptions To this point we have not had to make any distributional assumptions. Principle of least squares requires no assumptions. Can use correlations

More information

One-sample categorical data: approximate inference

One-sample categorical data: approximate inference One-sample categorical data: approximate inference Patrick Breheny October 6 Patrick Breheny Biostatistical Methods I (BIOS 5710) 1/25 Introduction It is relatively easy to think about the distribution

More information

9. Linear Regression and Correlation

9. Linear Regression and Correlation 9. Linear Regression and Correlation Data: y a quantitative response variable x a quantitative explanatory variable (Chap. 8: Recall that both variables were categorical) For example, y = annual income,

More information

Poisson regression: Further topics

Poisson regression: Further topics Poisson regression: Further topics April 21 Overdispersion One of the defining characteristics of Poisson regression is its lack of a scale parameter: E(Y ) = Var(Y ), and no parameter is available to

More information

MS&E 226: Small Data

MS&E 226: Small Data MS&E 226: Small Data Lecture 15: Examples of hypothesis tests (v5) Ramesh Johari ramesh.johari@stanford.edu 1 / 32 The recipe 2 / 32 The hypothesis testing recipe In this lecture we repeatedly apply the

More information

BIOS 2083 Linear Models c Abdus S. Wahed

BIOS 2083 Linear Models c Abdus S. Wahed Chapter 5 206 Chapter 6 General Linear Model: Statistical Inference 6.1 Introduction So far we have discussed formulation of linear models (Chapter 1), estimability of parameters in a linear model (Chapter

More information

Exam Applied Statistical Regression. Good Luck!

Exam Applied Statistical Regression. Good Luck! Dr. M. Dettling Summer 2011 Exam Applied Statistical Regression Approved: Tables: Note: Any written material, calculator (without communication facility). Attached. All tests have to be done at the 5%-level.

More information

Regression and Models with Multiple Factors. Ch. 17, 18

Regression and Models with Multiple Factors. Ch. 17, 18 Regression and Models with Multiple Factors Ch. 17, 18 Mass 15 20 25 Scatter Plot 70 75 80 Snout-Vent Length Mass 15 20 25 Linear Regression 70 75 80 Snout-Vent Length Least-squares The method of least

More information

Section 4.6 Simple Linear Regression

Section 4.6 Simple Linear Regression Section 4.6 Simple Linear Regression Objectives ˆ Basic philosophy of SLR and the regression assumptions ˆ Point & interval estimation of the model parameters, and how to make predictions ˆ Point and interval

More information

Nemours Biomedical Research Statistics Course. Li Xie Nemours Biostatistics Core October 14, 2014

Nemours Biomedical Research Statistics Course. Li Xie Nemours Biostatistics Core October 14, 2014 Nemours Biomedical Research Statistics Course Li Xie Nemours Biostatistics Core October 14, 2014 Outline Recap Introduction to Logistic Regression Recap Descriptive statistics Variable type Example of

More information

A Significance Test for the Lasso

A Significance Test for the Lasso A Significance Test for the Lasso Lockhart R, Taylor J, Tibshirani R, and Tibshirani R Ashley Petersen May 14, 2013 1 Last time Problem: Many clinical covariates which are important to a certain medical

More information

Single and multiple linear regression analysis

Single and multiple linear regression analysis Single and multiple linear regression analysis Marike Cockeran 2017 Introduction Outline of the session Simple linear regression analysis SPSS example of simple linear regression analysis Additional topics

More information

LDA Midterm Due: 02/21/2005

LDA Midterm Due: 02/21/2005 LDA.665 Midterm Due: //5 Question : The randomized intervention trial is designed to answer the scientific questions: whether social network method is effective in retaining drug users in treatment programs,

More information

Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model

Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model EPSY 905: Multivariate Analysis Lecture 1 20 January 2016 EPSY 905: Lecture 1 -

More information

Data Set 1A: Algal Photosynthesis vs. Salinity and Temperature

Data Set 1A: Algal Photosynthesis vs. Salinity and Temperature Data Set A: Algal Photosynthesis vs. Salinity and Temperature Statistical setting These data are from a controlled experiment in which two quantitative variables were manipulated, to determine their effects

More information

Marginal versus conditional effects: does it make a difference? Mireille Schnitzer, PhD Université de Montréal

Marginal versus conditional effects: does it make a difference? Mireille Schnitzer, PhD Université de Montréal Marginal versus conditional effects: does it make a difference? Mireille Schnitzer, PhD Université de Montréal Overview In observational and experimental studies, the goal may be to estimate the effect

More information

Sociology 6Z03 Review I

Sociology 6Z03 Review I Sociology 6Z03 Review I John Fox McMaster University Fall 2016 John Fox (McMaster University) Sociology 6Z03 Review I Fall 2016 1 / 19 Outline: Review I Introduction Displaying Distributions Describing

More information

Chapter 10. Regression. Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania

Chapter 10. Regression. Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania Chapter 10 Regression Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania Scatter Diagrams A graph in which pairs of points, (x, y), are

More information

Class 24. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

Class 24. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700 Class 4 Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science Copyright 013 by D.B. Rowe 1 Agenda: Recap Chapter 9. and 9.3 Lecture Chapter 10.1-10.3 Review Exam 6 Problem Solving

More information

STAT 5500/6500 Conditional Logistic Regression for Matched Pairs

STAT 5500/6500 Conditional Logistic Regression for Matched Pairs STAT 5500/6500 Conditional Logistic Regression for Matched Pairs Motivating Example: The data we will be using comes from a subset of data taken from the Los Angeles Study of the Endometrial Cancer Data

More information

Mixed Models for Longitudinal Ordinal and Nominal Outcomes

Mixed Models for Longitudinal Ordinal and Nominal Outcomes Mixed Models for Longitudinal Ordinal and Nominal Outcomes Don Hedeker Department of Public Health Sciences Biological Sciences Division University of Chicago hedeker@uchicago.edu Hedeker, D. (2008). Multilevel

More information

A Bayesian Nonparametric Approach to Monotone Missing Data in Longitudinal Studies with Informative Missingness

A Bayesian Nonparametric Approach to Monotone Missing Data in Longitudinal Studies with Informative Missingness A Bayesian Nonparametric Approach to Monotone Missing Data in Longitudinal Studies with Informative Missingness A. Linero and M. Daniels UF, UT-Austin SRC 2014, Galveston, TX 1 Background 2 Working model

More information

Lecture 2. The Simple Linear Regression Model: Matrix Approach

Lecture 2. The Simple Linear Regression Model: Matrix Approach Lecture 2 The Simple Linear Regression Model: Matrix Approach Matrix algebra Matrix representation of simple linear regression model 1 Vectors and Matrices Where it is necessary to consider a distribution

More information

BIOL 51A - Biostatistics 1 1. Lecture 1: Intro to Biostatistics. Smoking: hazardous? FEV (l) Smoke

BIOL 51A - Biostatistics 1 1. Lecture 1: Intro to Biostatistics. Smoking: hazardous? FEV (l) Smoke BIOL 51A - Biostatistics 1 1 Lecture 1: Intro to Biostatistics Smoking: hazardous? FEV (l) 1 2 3 4 5 No Yes Smoke BIOL 51A - Biostatistics 1 2 Box Plot a.k.a box-and-whisker diagram or candlestick chart

More information

Estimating the Marginal Odds Ratio in Observational Studies

Estimating the Marginal Odds Ratio in Observational Studies Estimating the Marginal Odds Ratio in Observational Studies Travis Loux Christiana Drake Department of Statistics University of California, Davis June 20, 2011 Outline The Counterfactual Model Odds Ratios

More information

Self-Assessment Weeks 8: Multiple Regression with Qualitative Predictors; Multiple Comparisons

Self-Assessment Weeks 8: Multiple Regression with Qualitative Predictors; Multiple Comparisons Self-Assessment Weeks 8: Multiple Regression with Qualitative Predictors; Multiple Comparisons 1. Suppose we wish to assess the impact of five treatments while blocking for study participant race (Black,

More information

Chapter 1 Statistical Inference

Chapter 1 Statistical Inference Chapter 1 Statistical Inference causal inference To infer causality, you need a randomized experiment (or a huge observational study and lots of outside information). inference to populations Generalizations

More information

Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model

Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 1: August 22, 2012

More information

Formula for the t-test

Formula for the t-test Formula for the t-test: How the t-test Relates to the Distribution of the Data for the Groups Formula for the t-test: Formula for the Standard Error of the Difference Between the Means Formula for the

More information

Comparison of Different Methods of Sample Size Re-estimation for Therapeutic Equivalence (TE) Studies Protecting the Overall Type 1 Error

Comparison of Different Methods of Sample Size Re-estimation for Therapeutic Equivalence (TE) Studies Protecting the Overall Type 1 Error Comparison of Different Methods of Sample Size Re-estimation for Therapeutic Equivalence (TE) Studies Protecting the Overall Type 1 Error by Diane Potvin Outline 1. Therapeutic Equivalence Designs 2. Objectives

More information

SUPPLEMENTARY SIMULATIONS & FIGURES

SUPPLEMENTARY SIMULATIONS & FIGURES Supplementary Material: Supplementary Material for Mixed Effects Models for Resampled Network Statistics Improve Statistical Power to Find Differences in Multi-Subject Functional Connectivity Manjari Narayan,

More information

Individualized Treatment Effects with Censored Data via Nonparametric Accelerated Failure Time Models

Individualized Treatment Effects with Censored Data via Nonparametric Accelerated Failure Time Models Individualized Treatment Effects with Censored Data via Nonparametric Accelerated Failure Time Models Nicholas C. Henderson Thomas A. Louis Gary Rosner Ravi Varadhan Johns Hopkins University July 31, 2018

More information

Stationary and nonstationary variables

Stationary and nonstationary variables Stationary and nonstationary variables Stationary variable: 1. Finite and constant in time expected value: E (y t ) = µ < 2. Finite and constant in time variance: Var (y t ) = σ 2 < 3. Covariance dependent

More information

Bayesian Inference on Joint Mixture Models for Survival-Longitudinal Data with Multiple Features. Yangxin Huang

Bayesian Inference on Joint Mixture Models for Survival-Longitudinal Data with Multiple Features. Yangxin Huang Bayesian Inference on Joint Mixture Models for Survival-Longitudinal Data with Multiple Features Yangxin Huang Department of Epidemiology and Biostatistics, COPH, USF, Tampa, FL yhuang@health.usf.edu January

More information

What is Experimental Design?

What is Experimental Design? One Factor ANOVA What is Experimental Design? A designed experiment is a test in which purposeful changes are made to the input variables (x) so that we may observe and identify the reasons for change

More information