Ph.D. course: Regression models. Regression models. Explanatory variables. Example 1.1: Body mass index and vitamin D status
|
|
- Matthew Heath
- 5 years ago
- Views:
Transcription
1 Ph.D. course: Regression models Introduction PKA & LTS Sect. 1.1, 1.2, April Per Kragh Andersen Regression models The distribution of one outcome variable is modelled in relation to one or, more often several explanatory variables (or covariates). Well-known (perhaps?) examples of regression models include: linear regression logistic regression (Cox) proportional hazards regression The type of regression model to use in a given situation depends on the type of the outcome variable: linear regression: quantitative outcome logistic regression: binary outcome (Cox) proportional hazards regression: survival time outcome 1 2 Explanatory variables All types of regression models may include two types of explanatory variables: categorical explanatory variables quantitative explanatory variables This means that many features are common for linear, logistic, and Cox regression. The book Regression with linear predictors highlights such similarities by focussing on the type of explanatory variables and the way in which these are combined into the linear predictor (examples to follow). Example 1.1: Body mass index and vitamin D status European study in: Ireland, Poland, Finland, and Denmark, Data on vitamin D status (25OHD in serum, nmol/l), Data on (among other factors) age and body mass index: BMI = weight in kg (height in m) 2. Purpose: assess whether vitamin D status depends on BMI and age and how it varies among countries. Outcome variable, vit D: quantitative. Quantitative explanatory variables: age and BMI; Categorical explanatory variables: country and categorized BMI, e.g. Normal (<25) vs. Overweight ( 25) (a binary explanatory variable), Overweight women could be further divided into Slight overweight and Obese ( 30) 3 4
2 Table 1: Average 25OHD vitamin D values for 41 adult Irish women in subgroups given by body mass index. BMI Group n Vitamin D Normal Overweight Slight overweight Obese Example 1.2: Fever in early pregnancy and risk of fetal death The Danish National Birth Cohort Study recruited pregnant women for telephone interviews scheduled to take place in weeks Here: data on women recruited before 31 March 1999, interviewed before week 17, and who were still pregnant at week 17. Study relation between risk of fetal death and episodes of fever in early pregnancy. Outcome variable, fetal death: binary, Both categorical and quantitative explanatory variables relevant. Also Example 1.4 on surgery complications has a binary outcome. Vit D seems to decrease with increasing BMI. 5 6 Table 2: Distribution of fetal death by number of fever episodes before pregnancy week 17 in 11,778 women recruited to the Danish National Birth Cohort Study. Number of Fever Episodes Fetal Death Total No Yes Total /9693=1.0% of women without fever episodes experienced fetal death, roughly the same percentage for women with fever episodes: (20+1+0)/( )=1.0%. Confounding Note: In both examples there may be confounding, i.e. simple comparisons between Normal weight and Overweight women Women with or without reported episodes of fever in early pregnancy may not be fair because other factors associated with the outcome may be unevenly distributed in the groups to be compared. This calls for suitable adjustment when the groups are to be compared. 7 8
3 Example 1.3: The PBC-3 trial in liver cirrhosis PBC-3: multi-centre randomized trial in patients with primary biliary cirrhosis. Patients recruited from six European hospitals and randomized to CyA or placebo. Followed until death or liver transplantation (no longer than 1989); 4 patients were lost to follow-up before that date. Outcome variable: time to treatment failure (death or transplantation) Main explanatory variable, treatment, is binary. Other risk factors (serum bilirubin, age, gender etc.) may in spite of the randomization not be quite balanced between the two treatment groups. Both categorical and quantitative explanatory variables are relevant. Table 3: Average observation times in years (and numbers of patients) by treatment group and failure status in the PBC3 trial in liver cirrhosis. Treatment Failure Treatment No Yes Total Placebo (127) (46) (173) CyA (132) (44) (176) Total (259) (90) (349) What about the outcome variable? 9 10 Table 4: Number (%) of observation times less than two years by treatment group and failure status in the PBC3 trial in liver cirrhosis. Treatment Failure Treatment No Yes Observation Times Patients All Because of the incomplete information on the outcome variable (censoring) neither averages nor percentages are reasonable descriptions of the distribution. Instead, survival ( Kaplan-Meier ) curves are used for estimating survival probabilities as a function of time, t. Placebo (23%) (16%) (39%) (100%) CyA (23%) (14%) (37%) (100%) Total (23%) (15%) (38%) (100%) 11 12
4 Survival Years Describing the relation between outcome and one covariate Notation: outcome variable: y or (y i, i = 1,..., n). Covariate: x or (x i, i = 1,..., n). For a quantitative outcome (vitamin D) averages were used for estimating the expected value, m = E(y). For a binary outcome (fetal death, y = 0 or 1) relative frequencies were used for estimating the failure probability, p = pr(y = 1). For a survival time outcome (time to treatment failure) the Kaplan-Meier curve was used for estimating the survival function, S(t) = pr(y > t) as a function of time t. Figure 1: Comparison of estimated survival curves for CyA (dashed) and placebo (solid) treated patients with PBC One binary covariate Binary outcome m 0 : mean vit D for Normal weight women (x i = 0), p 0 : pr(fetal death) for women without fever episodes (x i = 0), m 1 : mean vit D for Overweight women (x i = 1) p 1 : pr(fetal death) for women with fever episodes (x i = 1). m 0 if x i = 0 E(y i ) = m 1 if x i = 1, (1) p 0 if x i = 0 pr(y i = 1) = p 1 if x i = 1. (2) That is, We could follow the lines from above and define some measure of discrepancy between p 1 and p 0. E(y i ) = m 0 + (m 1 m 0 )x i = a + bx i However, that will wait for link functions to be introduced later. a = m 0 : intercept; This leads to studying the log(odds ratio) as effect measure. b = m 1 m 0 : slope (the effect of x on y). Interpretation of a and b. Effect? 15 16
5 Survival time outcome S 0 (t): survival function for patients treated with placebo (x i = 0), S 1 (t): survival function for patients treated with CyA (x i = 1). S 0 (t) if x i = 0 pr(y i > t) = S 1 (t) if x i = 1. Again, we could follow the lines from above and define some measure of discrepancy between S 1 (t) and S 0 (t) and again that will wait for link functions to be introduced. This leads to studying the log(hazard ratio) as effect measure. (3) One categorical covariate For a covariate with k + 1 values, g 0, g 1,..., g k : m 0 if x i = g 0 m 1 if x i = g 1 E(y i ) = m k if x i = g k. Example: mean vitamin D status in 3 BMI categories. (4) One categorical covariate: dummy variables Introducing k indicator or dummy variables I(x i = g j ), j = 1,..., k where I(x i = g j ) = 1 if x i = g j, I(x i = g j ) = 0 otherwise. One quantitative covariate The BMI groups: Normal weight, Slight overweight, Obese are ordered. Monotonic relationship? Figure next slide. The straight line in the scatterplot corresponds to the simple linear regression model: E(y i ) = m 0 + (m 1 m 0 )I(x i = g 1 ) + (m 2 m 0 )I(x i = g 2 ) + + (m k m 0 )I(x i = g k ) E(y i ) = a + b 1 I(x i = g 1 ) + b 2 I(x i = g 2 ) + + b k I(x i = g k ), (5) where a = m 0 and b j = m j m 0, j = 1,..., k. (Details later). Covariate: x i = BMI for woman i. E(y i ) = a + bx i Slope b: difference in mean response between women differing 1 unit in x. Intercept a: the expected vit D level for women with BMI=0. Model often re-parametrized into, e.g. E(y i ) = a + b(x i 25) where now a is the expected vit D level for women with BMI=
6 Vitamin D Vitamin D BMI score BMI Figure 2: Average 25OHD-values plotted against the BMI scores 21.75, 27.5, and Figure 3: Scatterplot: values of the quantitative outcome y (25OHD) plotted against the quantitative covariate x (BMI) Several covariates For a single categorical (binary) or quantitative explanatory variable, building blocks b 1 I(x i = g 1 ) + b 2 I(x i = g 2 ) + + b k I(x i = g k ) (for a binary x: bi(x i = g 1 ) bx i were added to the intercept a. Multiple regression models are obtained by adding such building blocks for the different covariates to obtain the linear predictor. Vitamin D example, women from Ireland or Poland: x i,1 = BMI for woman i, x i,2 = I(woman i is from Ireland). Linear predictor: E(y i ) = a + b 1 I(x i,1 25) + b 2 x i,2 or E(y i ) = a + b 1 x i,1 + b 2 x i,
7 First model leads to expected values: Table 5: Expected values in four groups according to BMI and country. Normal Weight Overweight Poland a a + b 1 Ireland a + b 2 a + b 1 + b 2 Second model leads to parallel lines (vit D vs. BMI) for women from Ireland or Poland. Figure. b 1, the common slope is the common effect of BMI for both countries b 2, the (constant) distance between the two lines is the effect of country for any given value of BMI. Again: No interaction between country and BMI. Effects of BMI for women from Ireland or Poland are the same: (a + b 1 + b 2 ) (a + b 2 ) = b 1 and (a + b 1 ) a = b 1. Effects of country for Overweight and Normal weight women are the same: (a + b 1 + b 2 ) (a + b 1 ) = b 2 and (a + b 2 ) a = b 2. No interaction between country and BMI Vitamin D BMI Figure 4: Expected values from the second model: two parallel lines with slope b 1 and vertical distance b 2. Dashed curve is for Ireland, solid for Poland. Summary Multiple regression models are obtained by adding building blocks for the different covariates to obtain the linear predictor. For multiple covariates: (x i,1, x i,2,..., x i,nc ; i = 1,..., n) the linear predictor is LP i = a + b 1 x i,1 + b 2 x i, b nc x i,nc. For quantitative covariates an assumption of linearity is imposed. A consequence of adding the terms corresponding to each building block is assuming no interaction. These modelling assumptions need careful consideration
Ph.D. course: Regression models. Introduction. 19 April 2012
Ph.D. course: Regression models Introduction PKA & LTS Sect. 1.1, 1.2, 1.4 19 April 2012 www.biostat.ku.dk/~pka/regrmodels12 Per Kragh Andersen 1 Regression models The distribution of one outcome variable
More informationPh.D. course: Regression models
Ph.D. course: Regression models Non-linear effect of a quantitative covariate PKA & LTS Sect. 4.2.1, 4.2.2 8 May 2017 www.biostat.ku.dk/~pka/regrmodels17 Per Kragh Andersen 1 Linear effects We have studied
More informationFaculty of Health Sciences. Regression models. Counts, Poisson regression, Lene Theil Skovgaard. Dept. of Biostatistics
Faculty of Health Sciences Regression models Counts, Poisson regression, 27-5-2013 Lene Theil Skovgaard Dept. of Biostatistics 1 / 36 Count outcome PKA & LTS, Sect. 7.2 Poisson regression The Binomial
More informationSubgroup analysis using regression modeling multiple regression. Aeilko H Zwinderman
Subgroup analysis using regression modeling multiple regression Aeilko H Zwinderman who has unusual large response? Is such occurrence associated with subgroups of patients? such question is hypothesis-generating:
More informationLecture 7 Time-dependent Covariates in Cox Regression
Lecture 7 Time-dependent Covariates in Cox Regression So far, we ve been considering the following Cox PH model: λ(t Z) = λ 0 (t) exp(β Z) = λ 0 (t) exp( β j Z j ) where β j is the parameter for the the
More informationStatistics in medicine
Statistics in medicine Lecture 4: and multivariable regression Fatma Shebl, MD, MS, MPH, PhD Assistant Professor Chronic Disease Epidemiology Department Yale School of Public Health Fatma.shebl@yale.edu
More informationRegression models. Categorical covariate, Quantitative outcome. Examples of categorical covariates. Group characteristics. Faculty of Health Sciences
Faculty of Health Sciences Categorical covariate, Quantitative outcome Regression models Categorical covariate, Quantitative outcome Lene Theil Skovgaard April 29, 2013 PKA & LTS, Sect. 3.2, 3.2.1 ANOVA
More informationDefinitions and examples Simple estimation and testing Regression models Goodness of fit for the Cox model. Recap of Part 1. Per Kragh Andersen
Recap of Part 1 Per Kragh Andersen Section of Biostatistics, University of Copenhagen DSBS Course Survival Analysis in Clinical Trials January 2018 1 / 65 Overview Definitions and examples Simple estimation
More informationTurning a research question into a statistical question.
Turning a research question into a statistical question. IGINAL QUESTION: Concept Concept Concept ABOUT ONE CONCEPT ABOUT RELATIONSHIPS BETWEEN CONCEPTS TYPE OF QUESTION: DESCRIBE what s going on? DECIDE
More informationADVANCED STATISTICAL ANALYSIS OF EPIDEMIOLOGICAL STUDIES. Cox s regression analysis Time dependent explanatory variables
ADVANCED STATISTICAL ANALYSIS OF EPIDEMIOLOGICAL STUDIES Cox s regression analysis Time dependent explanatory variables Henrik Ravn Bandim Health Project, Statens Serum Institut 4 November 2011 1 / 53
More informationExtensions of Cox Model for Non-Proportional Hazards Purpose
PhUSE Annual Conference 2013 Paper SP07 Extensions of Cox Model for Non-Proportional Hazards Purpose Author: Jadwiga Borucka PAREXEL, Warsaw, Poland Brussels 13 th - 16 th October 2013 Presentation Plan
More informationSurvival Analysis Math 434 Fall 2011
Survival Analysis Math 434 Fall 2011 Part IV: Chap. 8,9.2,9.3,11: Semiparametric Proportional Hazards Regression Jimin Ding Math Dept. www.math.wustl.edu/ jmding/math434/fall09/index.html Basic Model Setup
More informationIntroduction To Logistic Regression
Introduction To Lecture 22 April 28, 2005 Applied Regression Analysis Lecture #22-4/28/2005 Slide 1 of 28 Today s Lecture Logistic regression. Today s Lecture Lecture #22-4/28/2005 Slide 2 of 28 Background
More informationSTAT 5500/6500 Conditional Logistic Regression for Matched Pairs
STAT 5500/6500 Conditional Logistic Regression for Matched Pairs Motivating Example: The data we will be using comes from a subset of data taken from the Los Angeles Study of the Endometrial Cancer Data
More informationResiduals and model diagnostics
Residuals and model diagnostics Patrick Breheny November 10 Patrick Breheny Survival Data Analysis (BIOS 7210) 1/42 Introduction Residuals Many assumptions go into regression models, and the Cox proportional
More informationSTA441: Spring Multiple Regression. More than one explanatory variable at the same time
STA441: Spring 2016 Multiple Regression More than one explanatory variable at the same time This slide show is a free open source document. See the last slide for copyright information. One Explanatory
More information1 The problem of survival analysis
1 The problem of survival analysis Survival analysis concerns analyzing the time to the occurrence of an event. For instance, we have a dataset in which the times are 1, 5, 9, 20, and 22. Perhaps those
More informationClassification & Regression. Multicollinearity Intro to Nominal Data
Multicollinearity Intro to Nominal Let s Start With A Question y = β 0 + β 1 x 1 +β 2 x 2 y = Anxiety Level x 1 = heart rate x 2 = recorded pulse Since we can all agree heart rate and pulse are related,
More informationAnalysis of Longitudinal Data. Patrick J. Heagerty PhD Department of Biostatistics University of Washington
Analysis of Longitudinal Data Patrick J Heagerty PhD Department of Biostatistics University of Washington Auckland 8 Session One Outline Examples of longitudinal data Scientific motivation Opportunities
More informationGroup Sequential Tests for Delayed Responses. Christopher Jennison. Lisa Hampson. Workshop on Special Topics on Sequential Methodology
Group Sequential Tests for Delayed Responses Christopher Jennison Department of Mathematical Sciences, University of Bath, UK http://people.bath.ac.uk/mascj Lisa Hampson Department of Mathematics and Statistics,
More informationSTAT 526 Spring Final Exam. Thursday May 5, 2011
STAT 526 Spring 2011 Final Exam Thursday May 5, 2011 Time: 2 hours Name (please print): Show all your work and calculations. Partial credit will be given for work that is partially correct. Points will
More informationLog-linearity for Cox s regression model. Thesis for the Degree Master of Science
Log-linearity for Cox s regression model Thesis for the Degree Master of Science Zaki Amini Master s Thesis, Spring 2015 i Abstract Cox s regression model is one of the most applied methods in medical
More informationMore Statistics tutorial at Logistic Regression and the new:
Logistic Regression and the new: Residual Logistic Regression 1 Outline 1. Logistic Regression 2. Confounding Variables 3. Controlling for Confounding Variables 4. Residual Linear Regression 5. Residual
More informationProportional hazards regression
Proportional hazards regression Patrick Breheny October 8 Patrick Breheny Survival Data Analysis (BIOS 7210) 1/28 Introduction The model Solving for the MLE Inference Today we will begin discussing regression
More informationThe Weibull Distribution
The Weibull Distribution Patrick Breheny October 10 Patrick Breheny University of Iowa Survival Data Analysis (BIOS 7210) 1 / 19 Introduction Today we will introduce an important generalization of the
More informationMachine Learning. Module 3-4: Regression and Survival Analysis Day 2, Asst. Prof. Dr. Santitham Prom-on
Machine Learning Module 3-4: Regression and Survival Analysis Day 2, 9.00 16.00 Asst. Prof. Dr. Santitham Prom-on Department of Computer Engineering, Faculty of Engineering King Mongkut s University of
More informationA multi-state model for the prognosis of non-mild acute pancreatitis
A multi-state model for the prognosis of non-mild acute pancreatitis Lore Zumeta Olaskoaga 1, Felix Zubia Olaskoaga 2, Guadalupe Gómez Melis 1 1 Universitat Politècnica de Catalunya 2 Intensive Care Unit,
More informationCausal Hazard Ratio Estimation By Instrumental Variables or Principal Stratification. Todd MacKenzie, PhD
Causal Hazard Ratio Estimation By Instrumental Variables or Principal Stratification Todd MacKenzie, PhD Collaborators A. James O Malley Tor Tosteson Therese Stukel 2 Overview 1. Instrumental variable
More informationYou know I m not goin diss you on the internet Cause my mama taught me better than that I m a survivor (What?) I m not goin give up (What?
You know I m not goin diss you on the internet Cause my mama taught me better than that I m a survivor (What?) I m not goin give up (What?) I m not goin stop (What?) I m goin work harder (What?) Sir David
More informationIndividualized Treatment Effects with Censored Data via Nonparametric Accelerated Failure Time Models
Individualized Treatment Effects with Censored Data via Nonparametric Accelerated Failure Time Models Nicholas C. Henderson Thomas A. Louis Gary Rosner Ravi Varadhan Johns Hopkins University July 31, 2018
More informationTMA 4275 Lifetime Analysis June 2004 Solution
TMA 4275 Lifetime Analysis June 2004 Solution Problem 1 a) Observation of the outcome is censored, if the time of the outcome is not known exactly and only the last time when it was observed being intact,
More informationAnalysing data: regression and correlation S6 and S7
Basic medical statistics for clinical and experimental research Analysing data: regression and correlation S6 and S7 K. Jozwiak k.jozwiak@nki.nl 2 / 49 Correlation So far we have looked at the association
More informationMultiple linear regression S6
Basic medical statistics for clinical and experimental research Multiple linear regression S6 Katarzyna Jóźwiak k.jozwiak@nki.nl November 15, 2017 1/42 Introduction Two main motivations for doing multiple
More informationCategorical Predictor Variables
Categorical Predictor Variables We often wish to use categorical (or qualitative) variables as covariates in a regression model. For binary variables (taking on only 2 values, e.g. sex), it is relatively
More informationSurvival Analysis. 732G34 Statistisk analys av komplexa data. Krzysztof Bartoszek
Survival Analysis 732G34 Statistisk analys av komplexa data Krzysztof Bartoszek (krzysztof.bartoszek@liu.se) 10, 11 I 2018 Department of Computer and Information Science Linköping University Survival analysis
More informationStatistical Methods for Alzheimer s Disease Studies
Statistical Methods for Alzheimer s Disease Studies Rebecca A. Betensky, Ph.D. Department of Biostatistics, Harvard T.H. Chan School of Public Health July 19, 2016 1/37 OUTLINE 1 Statistical collaborations
More informationObjectives. 2.3 Least-squares regression. Regression lines. Prediction and Extrapolation. Correlation and r 2. Transforming relationships
Objectives 2.3 Least-squares regression Regression lines Prediction and Extrapolation Correlation and r 2 Transforming relationships Adapted from authors slides 2012 W.H. Freeman and Company Straight Line
More informationAccelerated Failure Time Models
Accelerated Failure Time Models Patrick Breheny October 12 Patrick Breheny University of Iowa Survival Data Analysis (BIOS 7210) 1 / 29 The AFT model framework Last time, we introduced the Weibull distribution
More informationREGRESSION ANALYSIS FOR TIME-TO-EVENT DATA THE PROPORTIONAL HAZARDS (COX) MODEL ST520
REGRESSION ANALYSIS FOR TIME-TO-EVENT DATA THE PROPORTIONAL HAZARDS (COX) MODEL ST520 Department of Statistics North Carolina State University Presented by: Butch Tsiatis, Department of Statistics, NCSU
More informationLecture 3. Truncation, length-bias and prevalence sampling
Lecture 3. Truncation, length-bias and prevalence sampling 3.1 Prevalent sampling Statistical techniques for truncated data have been integrated into survival analysis in last two decades. Truncation in
More informationSurvival Analysis. Stat 526. April 13, 2018
Survival Analysis Stat 526 April 13, 2018 1 Functions of Survival Time Let T be the survival time for a subject Then P [T < 0] = 0 and T is a continuous random variable The Survival function is defined
More information7/28/15. Review Homework. Overview. Lecture 6: Logistic Regression Analysis
Lecture 6: Logistic Regression Analysis Christopher S. Hollenbeak, PhD Jane R. Schubart, PhD The Outcomes Research Toolbox Review Homework 2 Overview Logistic regression model conceptually Logistic regression
More informationLecture 1. Introduction Statistics Statistical Methods II. Presented January 8, 2018
Introduction Statistics 211 - Statistical Methods II Presented January 8, 2018 linear models Dan Gillen Department of Statistics University of California, Irvine 1.1 Logistics and Contact Information Lectures:
More informationBIOSTATISTICAL METHODS
BIOSTATISTICAL METHODS FOR TRANSLATIONAL & CLINICAL RESEARCH Cross-over Designs #: DESIGNING CLINICAL RESEARCH The subtraction of measurements from the same subject will mostly cancel or minimize effects
More informationSTAT 5500/6500 Conditional Logistic Regression for Matched Pairs
STAT 5500/6500 Conditional Logistic Regression for Matched Pairs The data for the tutorial came from support.sas.com, The LOGISTIC Procedure: Conditional Logistic Regression for Matched Pairs Data :: SAS/STAT(R)
More information7.1 The Hazard and Survival Functions
Chapter 7 Survival Models Our final chapter concerns models for the analysis of data which have three main characteristics: (1) the dependent variable or response is the waiting time until the occurrence
More informationExercises. (a) Prove that m(t) =
Exercises 1. Lack of memory. Verify that the exponential distribution has the lack of memory property, that is, if T is exponentially distributed with parameter λ > then so is T t given that T > t for
More informationResearch Methods in Political Science I
Research Methods in Political Science I 6. Linear Regression (1) Yuki Yanai School of Law and Graduate School of Law November 11, 2015 1 / 25 Today s Menu 1 Introduction What Is Linear Regression? Some
More informationBiost 518 Applied Biostatistics II. Purpose of Statistics. First Stage of Scientific Investigation. Further Stages of Scientific Investigation
Biost 58 Applied Biostatistics II Scott S. Emerson, M.D., Ph.D. Professor of Biostatistics University of Washington Lecture 5: Review Purpose of Statistics Statistics is about science (Science in the broadest
More informationST745: Survival Analysis: Cox-PH!
ST745: Survival Analysis: Cox-PH! Eric B. Laber Department of Statistics, North Carolina State University April 20, 2015 Rien n est plus dangereux qu une idee, quand on n a qu une idee. (Nothing is more
More informationRobust estimates of state occupancy and transition probabilities for Non-Markov multi-state models
Robust estimates of state occupancy and transition probabilities for Non-Markov multi-state models 26 March 2014 Overview Continuously observed data Three-state illness-death General robust estimator Interval
More informationMixture modelling of recurrent event times with long-term survivors: Analysis of Hutterite birth intervals. John W. Mac McDonald & Alessandro Rosina
Mixture modelling of recurrent event times with long-term survivors: Analysis of Hutterite birth intervals John W. Mac McDonald & Alessandro Rosina Quantitative Methods in the Social Sciences Seminar -
More informationPart III Measures of Classification Accuracy for the Prediction of Survival Times
Part III Measures of Classification Accuracy for the Prediction of Survival Times Patrick J Heagerty PhD Department of Biostatistics University of Washington 102 ISCB 2010 Session Three Outline Examples
More informationIf the roles of the variable are not clear, then which variable is placed on which axis is not important.
Chapter 6 - Scatterplots, Association, and Correlation February 6, 2015 In chapter 6-8, we look at ways to compare the relationship of 2 quantitative variables. First we will look at a graphical representation,
More informationPart [1.0] Measures of Classification Accuracy for the Prediction of Survival Times
Part [1.0] Measures of Classification Accuracy for the Prediction of Survival Times Patrick J. Heagerty PhD Department of Biostatistics University of Washington 1 Biomarkers Review: Cox Regression Model
More informationSTATISTICS Relationships between variables: Correlation
STATISTICS 16 Relationships between variables: Correlation The gentleman pictured above is Sir Francis Galton. Galton invented the statistical concept of correlation and the use of the regression line.
More informationSurvival analysis in R
Survival analysis in R Niels Richard Hansen This note describes a few elementary aspects of practical analysis of survival data in R. For further information we refer to the book Introductory Statistics
More informationSurvival analysis in R
Survival analysis in R Niels Richard Hansen This note describes a few elementary aspects of practical analysis of survival data in R. For further information we refer to the book Introductory Statistics
More informationLecture 8 Stat D. Gillen
Statistics 255 - Survival Analysis Presented February 23, 2016 Dan Gillen Department of Statistics University of California, Irvine 8.1 Example of two ways to stratify Suppose a confounder C has 3 levels
More informationLab 8. Matched Case Control Studies
Lab 8 Matched Case Control Studies Control of Confounding Technique for the control of confounding: At the design stage: Matching During the analysis of the results: Post-stratification analysis Advantage
More informationUnbiased estimation of exposure odds ratios in complete records logistic regression
Unbiased estimation of exposure odds ratios in complete records logistic regression Jonathan Bartlett London School of Hygiene and Tropical Medicine www.missingdata.org.uk Centre for Statistical Methodology
More informationMATH 1150 Chapter 2 Notation and Terminology
MATH 1150 Chapter 2 Notation and Terminology Categorical Data The following is a dataset for 30 randomly selected adults in the U.S., showing the values of two categorical variables: whether or not the
More informationTied survival times; estimation of survival probabilities
Tied survival times; estimation of survival probabilities Patrick Breheny November 5 Patrick Breheny Survival Data Analysis (BIOS 7210) 1/22 Introduction Tied survival times Introduction Breslow approximation
More informationMarginal versus conditional effects: does it make a difference? Mireille Schnitzer, PhD Université de Montréal
Marginal versus conditional effects: does it make a difference? Mireille Schnitzer, PhD Université de Montréal Overview In observational and experimental studies, the goal may be to estimate the effect
More informationGeneral Regression Model
Scott S. Emerson, M.D., Ph.D. Department of Biostatistics, University of Washington, Seattle, WA 98195, USA January 5, 2015 Abstract Regression analysis can be viewed as an extension of two sample statistical
More informationPubH 7405: REGRESSION ANALYSIS MLR: BIOMEDICAL APPLICATIONS
PubH 7405: REGRESSION ANALYSIS MLR: BIOMEDICAL APPLICATIONS Multiple Regression allows us to get into two new areas that were not possible with Simple Linear Regression: (i) Interaction or Effect Modification,
More informationEstimating the Mean Response of Treatment Duration Regimes in an Observational Study. Anastasios A. Tsiatis.
Estimating the Mean Response of Treatment Duration Regimes in an Observational Study Anastasios A. Tsiatis http://www.stat.ncsu.edu/ tsiatis/ Introduction to Dynamic Treatment Regimes 1 Outline Description
More informationInstrumental variables estimation in the Cox Proportional Hazard regression model
Instrumental variables estimation in the Cox Proportional Hazard regression model James O Malley, Ph.D. Department of Biomedical Data Science The Dartmouth Institute for Health Policy and Clinical Practice
More informationMAS3301 / MAS8311 Biostatistics Part II: Survival
MAS3301 / MAS8311 Biostatistics Part II: Survival M. Farrow School of Mathematics and Statistics Newcastle University Semester 2, 2009-10 1 13 The Cox proportional hazards model 13.1 Introduction In the
More informationespecially with continuous
Handling interactions in Stata, especially with continuous predictors Patrick Royston & Willi Sauerbrei UK Stata Users meeting, London, 13-14 September 2012 Interactions general concepts General idea of
More informationMonitoring clinical trial outcomes with delayed response: incorporating pipeline data in group sequential designs. Christopher Jennison
Monitoring clinical trial outcomes with delayed response: incorporating pipeline data in group sequential designs Christopher Jennison Department of Mathematical Sciences, University of Bath http://people.bath.ac.uk/mascj
More informationOptimal Treatment Regimes for Survival Endpoints from a Classification Perspective. Anastasios (Butch) Tsiatis and Xiaofei Bai
Optimal Treatment Regimes for Survival Endpoints from a Classification Perspective Anastasios (Butch) Tsiatis and Xiaofei Bai Department of Statistics North Carolina State University 1/35 Optimal Treatment
More informationIntroduction to Statistical Analysis
Introduction to Statistical Analysis Changyu Shen Richard A. and Susan F. Smith Center for Outcomes Research in Cardiology Beth Israel Deaconess Medical Center Harvard Medical School Objectives Descriptive
More informationPractical Biostatistics
Practical Biostatistics Clinical Epidemiology, Biostatistics and Bioinformatics AMC Multivariable regression Day 5 Recap Describing association: Correlation Parametric technique: Pearson (PMCC) Non-parametric:
More informationSTAT 6350 Analysis of Lifetime Data. Failure-time Regression Analysis
STAT 6350 Analysis of Lifetime Data Failure-time Regression Analysis Explanatory Variables for Failure Times Usually explanatory variables explain/predict why some units fail quickly and some units survive
More informationBasic Medical Statistics Course
Basic Medical Statistics Course S7 Logistic Regression November 2015 Wilma Heemsbergen w.heemsbergen@nki.nl Logistic Regression The concept of a relationship between the distribution of a dependent variable
More informationA new strategy for meta-analysis of continuous covariates in observational studies with IPD. Willi Sauerbrei & Patrick Royston
A new strategy for meta-analysis of continuous covariates in observational studies with IPD Willi Sauerbrei & Patrick Royston Overview Motivation Continuous variables functional form Fractional polynomials
More informationLecture Discussion. Confounding, Non-Collapsibility, Precision, and Power Statistics Statistical Methods II. Presented February 27, 2018
, Non-, Precision, and Power Statistics 211 - Statistical Methods II Presented February 27, 2018 Dan Gillen Department of Statistics University of California, Irvine Discussion.1 Various definitions of
More informationLecture 11. Interval Censored and. Discrete-Time Data. Statistics Survival Analysis. Presented March 3, 2016
Statistics 255 - Survival Analysis Presented March 3, 2016 Motivating Dan Gillen Department of Statistics University of California, Irvine 11.1 First question: Are the data truly discrete? : Number of
More informationSurvival models and health sequences
Survival models and health sequences Walter Dempsey University of Michigan July 27, 2015 Survival Data Problem Description Survival data is commonplace in medical studies, consisting of failure time information
More informationEvaluating Predictive Accuracy of Survival Models with PROC PHREG
Paper SAS462-2017 Evaluating Predictive Accuracy of Survival Models with PROC PHREG Changbin Guo, Ying So, and Woosung Jang, SAS Institute Inc. Abstract Model validation is an important step in the model
More informationPackage threg. August 10, 2015
Package threg August 10, 2015 Title Threshold Regression Version 1.0.3 Date 2015-08-10 Author Tao Xiao Maintainer Tao Xiao Depends R (>= 2.10), survival, Formula Fit a threshold regression
More informationBinomial Model. Lecture 10: Introduction to Logistic Regression. Logistic Regression. Binomial Distribution. n independent trials
Lecture : Introduction to Logistic Regression Ani Manichaikul amanicha@jhsph.edu 2 May 27 Binomial Model n independent trials (e.g., coin tosses) p = probability of success on each trial (e.g., p =! =
More informationAcknowledgements. Outline. Marie Diener-West. ICTR Leadership / Team INTRODUCTION TO CLINICAL RESEARCH. Introduction to Linear Regression
INTRODUCTION TO CLINICAL RESEARCH Introduction to Linear Regression Karen Bandeen-Roche, Ph.D. July 17, 2012 Acknowledgements Marie Diener-West Rick Thompson ICTR Leadership / Team JHU Intro to Clinical
More informationSurvival Analysis. Lu Tian and Richard Olshen Stanford University
1 Survival Analysis Lu Tian and Richard Olshen Stanford University 2 Survival Time/ Failure Time/Event Time We will introduce various statistical methods for analyzing survival outcomes What is the survival
More informationLecture 10: Introduction to Logistic Regression
Lecture 10: Introduction to Logistic Regression Ani Manichaikul amanicha@jhsph.edu 2 May 2007 Logistic Regression Regression for a response variable that follows a binomial distribution Recall the binomial
More informationAnalysis of Time-to-Event Data: Chapter 6 - Regression diagnostics
Analysis of Time-to-Event Data: Chapter 6 - Regression diagnostics Steffen Unkel Department of Medical Statistics University Medical Center Göttingen, Germany Winter term 2018/19 1/25 Residuals for the
More informationPart IV Extensions: Competing Risks Endpoints and Non-Parametric AUC(t) Estimation
Part IV Extensions: Competing Risks Endpoints and Non-Parametric AUC(t) Estimation Patrick J. Heagerty PhD Department of Biostatistics University of Washington 166 ISCB 2010 Session Four Outline Examples
More informationCIMAT Taller de Modelos de Capture y Recaptura Known Fate Survival Analysis
CIMAT Taller de Modelos de Capture y Recaptura 2010 Known Fate urvival Analysis B D BALANCE MODEL implest population model N = λ t+ 1 N t Deeper understanding of dynamics can be gained by identifying variation
More informationCategorical data analysis Chapter 5
Categorical data analysis Chapter 5 Interpreting parameters in logistic regression The sign of β determines whether π(x) is increasing or decreasing as x increases. The rate of climb or descent increases
More informationChapter 6. Logistic Regression. 6.1 A linear model for the log odds
Chapter 6 Logistic Regression In logistic regression, there is a categorical response variables, often coded 1=Yes and 0=No. Many important phenomena fit this framework. The patient survives the operation,
More informationStat 642, Lecture notes for 04/12/05 96
Stat 642, Lecture notes for 04/12/05 96 Hosmer-Lemeshow Statistic The Hosmer-Lemeshow Statistic is another measure of lack of fit. Hosmer and Lemeshow recommend partitioning the observations into 10 equal
More informationNonparametric Model Construction
Nonparametric Model Construction Chapters 4 and 12 Stat 477 - Loss Models Chapters 4 and 12 (Stat 477) Nonparametric Model Construction Brian Hartman - BYU 1 / 28 Types of data Types of data For non-life
More informationSTA6938-Logistic Regression Model
Dr. Ying Zhang STA6938-Logistic Regression Model Topic 2-Multiple Logistic Regression Model Outlines:. Model Fitting 2. Statistical Inference for Multiple Logistic Regression Model 3. Interpretation of
More informationScatter plot of data from the study. Linear Regression
1 2 Linear Regression Scatter plot of data from the study. Consider a study to relate birthweight to the estriol level of pregnant women. The data is below. i Weight (g / 100) i Weight (g / 100) 1 7 25
More informationLecture 5 Models and methods for recurrent event data
Lecture 5 Models and methods for recurrent event data Recurrent and multiple events are commonly encountered in longitudinal studies. In this chapter we consider ordered recurrent and multiple events.
More informationCorrelation and Simple Linear Regression
Correlation and Simple Linear Regression Sasivimol Rattanasiri, Ph.D Section for Clinical Epidemiology and Biostatistics Ramathibodi Hospital, Mahidol University E-mail: sasivimol.rat@mahidol.ac.th 1 Outline
More informationCan a Pseudo Panel be a Substitute for a Genuine Panel?
Can a Pseudo Panel be a Substitute for a Genuine Panel? Min Hee Seo Washington University in St. Louis minheeseo@wustl.edu February 16th 1 / 20 Outline Motivation: gauging mechanism of changes Introduce
More informationIntroduction to logistic regression
Introduction to logistic regression Tuan V. Nguyen Professor and NHMRC Senior Research Fellow Garvan Institute of Medical Research University of New South Wales Sydney, Australia What we are going to learn
More information