Public Health and Statistics In India IISA-Harvard-SAMSI May Supported by NIH R01 ES Donna Spiegelman, Sc.D.
|
|
- Ethelbert Lester
- 5 years ago
- Views:
Transcription
1 Public Health and Statistics In India IISA-Harvard-SAMSI May 2016 Supported by NIH R01 ES Donna Spiegelman, Sc.D. Professor of Epidemiologic Methods Departments of Epidemiology, Biostatistics, Nutrition and Global Health www.
2 2 2
3 Introduction Over the past 10 years, our group has developed methods that adjust for exposure measurement error in point and interval estimates of relative risk and other measures of association: Regression calibration for main study/external validation study designs Regression calibration for multiple surrogates for the same exposure Regression calibration with heteroscedastic error Regression calibration for main study/internal validation study designs Regression calibration for survival data analysis with baseline exposures, time-varying point exposures, and exposure metrics that are functions of the exposure history Methods have been motivated by studies in environmental and occupational epidemiology conducted at the Harvard School of Public Health 3
4 Time permits a brief overview of a few of these: Regression calibration for main study/external validation study designs Regression calibration for main study/internal validation study designs Regression calibration for multiple surrogates for the same exposure Regression calibration with heteroscedastic error In the future, I can discuss: Regression calibration for survival data analysis with baseline exposures, time-varying point exposures, and exposure metrics that are functions of the exposure history 4
5 Notation n 1 : Number of participants in main study n 2 : Number of participants in validation study D : Binary health outcome X : True exposure Z : Surrogate exposure U : t perfectly measured covariates (e.g. age, race, smoking status) Measured on all participants in the main and validation studies D i, Z i, U i, i = 1,, n 1 Main study X i, Z i, U i, i = n 1 + 1,, n 1 + n 2 External validation study D i, X i, Z i, U i, i = n 1 + 1,, n 1 + n 2 Internal validation study 5
6 Assumptions True exposure (X) and the t-vector of covariates (U) are related to the probability of binary outcome (D) by the logistic function: logit Pr D = 1 = β 0 + Xβ 1 + U β 2 where β 2 = (β 21, β 22,, β 2t ). Linear regression model is appropriate to relate the surrogates (Z) and the t covariates (U) to the true exposure: X = γ 0 + Zγ 1 + U 2 γ 2 + ε where E ε = 0, Var ε = σ X Z,U Z is a surrogate if Pr D X, U, Z = Pr D X, U), that is, knowledge of the surrogates provides no additional information if the true exposure is known. 2 ε~n(0, σ X Z,U ) and Pr (D) is small, or β σ X Z,U small. 6
7 Rosner et al. regression calibration method for MS/EVS The (Rosner, Willettt, Spiegelman,1989; Rosner, Spiegelman, Willett, 1990; Rosner, Spiegelman, Willett, 1992) version of regression calibration for MS/EVS design: 3-step algorithm: 1. In the main study, regress Y on Z and U to obtain β 0, β 1, β 2 where now Z is a s 1 vector of mis-measured continuous covariates and U is a t 1 vector of perfectly measured covariates. 7
8 Rosner et al. regression calibration method for MS/EVS 2. In the validation study, regress X on Z and U to obtain γ 0, Γ 1, Γ 2 where γ 0 is a s x 1 vector of regression intercepts, Γ 1 is a s s matrix of slopes for the regression of X on Z, adjusted for U, and Γ 1 is a s t matrix of slopes for the regression of X on U, adjusted for Z. 8 8
9 Rosner et al. regression calibration method for MS/EVS 3. Correct estimates of effect for measurement error, by β 1 = β 1 γ 1, β 0 = β 0 β 1 γ 0, β 2 = β 2 β 1 γ 2 or T Γ 1 0 T Γ β 1 T β 2 T = β 1 where 0 is a s t matrix of 0 s and I is a t t identity matrix, I t t = T β 2 T 9 9
10 Rosner et al. regression calibration method for MS/EVS 4. Use multivariate delta method to derive variance, e.g., Var β 1 = Var β (β 1 ) 2 Var γ 1 4 γ 1 γ 1 See Appendices 2 and 3 of Rosner et al., 1990 for a derivation of the variance of β 1 T β 2 T, again using the multivariate delta method. 10
11 Regression calibration (Carroll et al.) Given validation or reliability data, the Carroll et al. version of the regression calibration estimator follows (when n ri = n RI = 2): Sketch of Algorithm (univariate case) 1. Estimate γ 0 and γ 1 in the validation study from the regression of X i on Z i, i = 1,, n V or in the reliability study from the regression of Z i1 on Z i2, i = 1,,, n R, where n ri = n RI = 2 2. Estimate X i = γ 0 + γ 1 Z i + e i, i = 1,, n M in the main study. 11
12 Regression calibration (Carroll et al.) 3. Run usual regression model for Y on X in the main study to obtain estimates of effect adjusted for measurement error, i.e., fit model g E Y i X i = β 0 + β 1 X 1 in the main study, where g[ ] is a link function, e.g., identity for linear regression, log for Poisson and log-binomial regression, logit for logistic regression, probit for probit regression to obtain estimates of β 1 and β 0 that are corrected for measurement error, at least approximately. 4. Variance must be adjusted as well and cannot be obtained from the standard regression software. RSW and Carroll et al. versions are identical in GLMs (Thurston SW, Spiegelman D, Ruppert D. Equivalence of regression calibration methods for main study/external validation study designs. Journal of Statistical Planning and Inference, 2003; 113: ) 12
13 An example Home Endotoxin Exposure and Wheeze in Infants: Correction for Bias Due to Exposure Measurement Error Nora Horick, Edie Weller, Donald K. Milton, Diane R. Gold, Ruifeng Li, and Donna Spiegelman Department of Biostatistics and Department of Environmental Health, Harvard School of Public Health, Boston, Massachusetts, USA; Channing Laboratory, Harvard Medical School, Boston, Massachusetts, USA; Department of Epidemiology, Harvard School of Public Health, Boston, Massachusetts, USA Environmental Health Perspectives Volume 114, Number 1, January
14 14
15 15
16 Download %blinplus SAS macro at 16
17 Regression calibration for logistic regression with multiple surrogates for one exposure Edie A. Weller, Donna Spiegelman, Don Milton, Ellen Eisen Departments of Biostatistics, Epidemiology, and Environmental Health Harvard School of Public Health and Dana Farber Cancer Institute Journal of Statistical Planning and Inference, 2007; 137: Occupational exposures often characterized by numerous factors of the workplace and work duration in a particular area ==> multiple surrogates describe one exposure. Validation study: Personal exposure is commonly measured on a subset of the subjects and these values are then used to estimate average exposure by job or exposure zone. No adjustment for bias or uncertainty in the exposure estimates. Standard methods typically assume that there is one surrogate for each exposure (for example, Rosner et al, 1989, 1990). Propose adjustment method which allows for multiple surrogates for one exposure using a regression calibration approach. 17
18 Main Study To assess the relationship between exposure to metal working fluids (MWF) and respiratory function (United Automobile Workers Union and General Motors Corporation sponsored study, Greaves et al, 1997). Outcome here is prevalence of wheeze Job characteristics include metal working fluid (MWF) type, plant and machine operation (grinding or not). Assembly workers are considered the non-exposed group. Possible confounders include age, smoking status and race. 18
19 Exposure Assessment Study (generically, the validation study) Exposure was measured in various job zones (Woskie et al., 1994). Intensity of exposure to MWF aerosol measured by the thoracic aerosol fraction (i.e. the sum of the two smallest size fractions measured with the personal monitors). Full shift (8 hour) personal samples of aerosol exposure in breathing zone of automobile workers were collected in various job zones. 19
20 Assumptions True exposure (X) and the t-vector of covariates (U) are related to the probability of binary outcome (D) by the logistic function: logit Pr D = 1 = β 0 + Xβ 1 + Uβ 2 where β 2 = (β 21, β 22,, β 2t ). Linear regression model is appropriate to relate the r surrogates (W) and the s covariates (Z) to the true exposure: X = γ 0 + W γ 1 + U γ 2 + ε 2 where E ε = 0, Var(ε) = σ X U,W W is a surrogate if Pr D X, W, U = Pr D X, U, that is, knowledge of the surrogates provides no additional information if the true exposure is known. 2 ε~n(0, σ X W,U ) and Pr (D) small, or β σ X W,U small 20
21 Goal: to obtain point and interval estimates of β and e β relating exposure (X) to outcome (D) adjusting for the covariates (U) Problem Quantitative measure of exposure (X) is not measured on all subjects W is measured on all n 1 of the subjects X and W measured on n 2 subjects Multiple surrogates, W, describe exposure Solution: An extension to two closely related approaches Rosner, Spiegelman and Willett (RSW, 1989, 1990) Carroll, Ruppert and Stefanski (CRS, 1995) 21
22 Procedure Propose the following approach which follows RSW and assumes normality of ε and rare disease, or that β σ X W,U is small 1. Estimate α from a logistic regression model of D on W and in n 1 subjects in main study logit Pr D = 1 = α 0 + W α 1 +U α 2 2. Estimate γ from a measurement error model among the n 2 validation study subjects using ordinary least squares regression. X = γ 0 + W γ 1 +U γ 2 SAS PROC GENMOD or PROC LOGISTIC for step 1, PROC REG for step 2 22
23 3. Optimally combine the adjusted estimates for each surrogate β W, ˆ ˆ τβ ˆ W where β W = Γ 1 1 α 1, Γ 1 = diag(γ 1 ) τ = (1 Σ 1 βw 1) 1 1 Σ 1 βw. 1 = 1,1,, 1 Σ βw is the estimated variance-covariance matrix of β W βw Σ α1 0 βw Σ βw = α 1, γ 1 0 Σ α 1,γ γ1 α 1, γ 1 1 α 1,γ 1 SAS macro downloadable from my website to accomplish step 3; input to the macro is the output from PROC LOGISTIC and PROC REG 23
24 Results from logistic regression model for wheeze. GM/UAW main study (n 1 = 1040). True Exposure (X) is thoracic aerosol fraction (mg/m 3 ) measures on n 2 = 83 workers Variable Uncorrected P-value Corrected P-value Exposure 1 (mg/m 3 ) (1.353, 6.108) Surrogates (W) Plant 2 Grinding Straight Synthetic (1.391, 3.198) (0.374, 1.332) (1.119, 2.407) (1.200, 2.854) < Covariates (Z) Age Age Age 50+ Race Current Smoker (0.615, 1.307) (0.512, 1.358) (0.544, 1.528) (0.796, 1.728) (2.210, 4.188) < (0.648, 1.437) (0.513, 1.418) (0.535, 1.561) (0.782, 1.740) (2.144, 4.137) < Estimated GLS weights are for straight, for synthetic, 0.15 for grinding, and for plant 24
25 Regression Calibration With Heteroscedastic Variance Donna Spiegelman, Roger Logan, Douglas Grove International Journal of Biostatistics: 2011 Vol. 7, Issue 1, Article 4. PMCID: PMC Conclusion: For all practical purposes, no need to worry about heteroscedasticity, that is, if Var(X i Z i, U i ) varies with i, little impact on bias or efficiency of RC method 25
26 A comparison of regression calibration methods for designs with internal validation data Sally W. Thurston, Paige L. Williams, Russ Hauser, Howard Hu, Mauricio Hernandez-Avila, and Donna Spiegelman Department of Biostatistics and Computational Biology, University of Rochester Medical Center, 601 Elmwood Avenue, P.O. Box 630, Rochester, NY 14642, USA Department of Biostatistics, Harvard School of Public Health, USA Department of Environmental Health, Harvard School of Public Health, USA Centro de Investigaciones en Salud Poblacional, Instituto Nacional de Salud Publica, Cuernavaca, Morelos, Mexico Department of Epidemiology, Harvard School of Public Health, USA Channing Laboratory, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, US Journal of Statistical Planning and Inference, 2005; 131:
27 ARE of optimal method compared to Carroll method 27
28 Conclusions I We can accommodate the following situations: multiple surrogates for a single mis-measured exposure heteroscedastic measurement error internal and hybrid validation study designs cumulative exposure variables and other functions of the exposure history in cohort studies User-friendly SAS macros are available to implement many of these procedures (for optimal main study / validation study design) 28
29 Conclusions II Bias due to exposure measurement error is a major limitation to the validity of occupational and environmental studies Methods have been developed which accommodate the features of study design and data distributions found in such studies These methods implement explicit adjustments for this source of bias, using the exposure validation study to characterize the magnitude and other features of the measurement error Point and interval estimates of effect are adjusted Papers have been published applying these methods to the analysis of occupational and environmental studies: you won t be the first! Just as we routinely adjust for confounding, we can routinely adjust for measurement error 29
30 Acknowledgements NIEHS Edie Weller, Ruifeng Li, Don Milton, Ellen Eisen, Barbara Valanis, Sally Thurston, Jon Samet, Paige Williams, Russ Hauser, Roger Logan, Jon Samet, Doug Grove, Doug Dockery, Lucas Neas, Nora Horrick, Diane Gold, Mauricio Hernandez, Howard Hu, Aparna Keshaviah Xiaomei Liao, Molin Wang, Biling Hong Francine Laden, Helen Suh, Jaime E. Hart, Joel Kaufman, Adam Szpiro, Lianne Sheppard, Ronald Williams, Robin C. Puett, Marianthi-Anna Kioumourtzoglou Alan Berkeley, Emily Long Thank you! 30
Statistical and epidemiological considerations in using remote sensing data for exposure estimation
Statistical and epidemiological considerations in using remote sensing data for exposure estimation Chris Paciorek Department of Biostatistics Harvard School of Public Health Collaborators: Yang Liu, Doug
More informationEffects of Exposure Measurement Error When an Exposure Variable Is Constrained by a Lower Limit
American Journal of Epidemiology Copyright 003 by the Johns Hopkins Bloomberg School of Public Health All rights reserved Vol. 157, No. 4 Printed in U.S.A. DOI: 10.1093/aje/kwf17 Effects of Exposure Measurement
More informationStatistical integration of disparate information for spatially-resolved PM exposure estimation
Statistical integration of disparate information for spatially-resolved PM exposure estimation Chris Paciorek Department of Biostatistics May 4, 2006 www.biostat.harvard.edu/~paciorek L Y X - FoilT E X
More informationIP WEIGHTING AND MARGINAL STRUCTURAL MODELS (CHAPTER 12) BIOS IPW and MSM
IP WEIGHTING AND MARGINAL STRUCTURAL MODELS (CHAPTER 12) BIOS 776 1 12 IPW and MSM IP weighting and marginal structural models ( 12) Outline 12.1 The causal question 12.2 Estimating IP weights via modeling
More informationMeasurement error effects on bias and variance in two-stage regression, with application to air pollution epidemiology
Measurement error effects on bias and variance in two-stage regression, with application to air pollution epidemiology Chris Paciorek Department of Statistics, University of California, Berkeley and Adam
More informationHarvard University. A Note on the Control Function Approach with an Instrumental Variable and a Binary Outcome. Eric Tchetgen Tchetgen
Harvard University Harvard University Biostatistics Working Paper Series Year 2014 Paper 175 A Note on the Control Function Approach with an Instrumental Variable and a Binary Outcome Eric Tchetgen Tchetgen
More informationSTA6938-Logistic Regression Model
Dr. Ying Zhang STA6938-Logistic Regression Model Topic 2-Multiple Logistic Regression Model Outlines:. Model Fitting 2. Statistical Inference for Multiple Logistic Regression Model 3. Interpretation of
More informationThe Use of Spatial Exposure Predictions in Health Effects Models: An Application to PM Epidemiology
The Use of Spatial Exposure Predictions in Health Effects Models: An Application to PM Epidemiology Chris Paciorek and Brent Coull Department of Biostatistics Harvard School of Public Health wwwbiostatharvardedu/
More informationBiostatistics Advanced Methods in Biostatistics IV
Biostatistics 140.754 Advanced Methods in Biostatistics IV Jeffrey Leek Assistant Professor Department of Biostatistics jleek@jhsph.edu 1 / 35 Tip + Paper Tip Meet with seminar speakers. When you go on
More informationConstrained Maximum Likelihood Estimation for Model Calibration Using Summary-level Information from External Big Data Sources
Constrained Maximum Likelihood Estimation for Model Calibration Using Summary-level Information from External Big Data Sources Yi-Hau Chen Institute of Statistical Science, Academia Sinica Joint with Nilanjan
More informationBIOL 51A - Biostatistics 1 1. Lecture 1: Intro to Biostatistics. Smoking: hazardous? FEV (l) Smoke
BIOL 51A - Biostatistics 1 1 Lecture 1: Intro to Biostatistics Smoking: hazardous? FEV (l) 1 2 3 4 5 No Yes Smoke BIOL 51A - Biostatistics 1 2 Box Plot a.k.a box-and-whisker diagram or candlestick chart
More informationMeasurement Error in Spatial Modeling of Environmental Exposures
Measurement Error in Spatial Modeling of Environmental Exposures Chris Paciorek, Alexandros Gryparis, and Brent Coull August 9, 2005 Department of Biostatistics Harvard School of Public Health www.biostat.harvard.edu/~paciorek
More informationMore Statistics tutorial at Logistic Regression and the new:
Logistic Regression and the new: Residual Logistic Regression 1 Outline 1. Logistic Regression 2. Confounding Variables 3. Controlling for Confounding Variables 4. Residual Linear Regression 5. Residual
More informationStatistics in medicine
Statistics in medicine Lecture 4: and multivariable regression Fatma Shebl, MD, MS, MPH, PhD Assistant Professor Chronic Disease Epidemiology Department Yale School of Public Health Fatma.shebl@yale.edu
More informationMeasurement error as missing data: the case of epidemiologic assays. Roderick J. Little
Measurement error as missing data: the case of epidemiologic assays Roderick J. Little Outline Discuss two related calibration topics where classical methods are deficient (A) Limit of quantification methods
More informationLongitudinal Data Analysis Using SAS Paul D. Allison, Ph.D. Upcoming Seminar: October 13-14, 2017, Boston, Massachusetts
Longitudinal Data Analysis Using SAS Paul D. Allison, Ph.D. Upcoming Seminar: October 13-14, 217, Boston, Massachusetts Outline 1. Opportunities and challenges of panel data. a. Data requirements b. Control
More informationOn the Use of the Bross Formula for Prioritizing Covariates in the High-Dimensional Propensity Score Algorithm
On the Use of the Bross Formula for Prioritizing Covariates in the High-Dimensional Propensity Score Algorithm Richard Wyss 1, Bruce Fireman 2, Jeremy A. Rassen 3, Sebastian Schneeweiss 1 Author Affiliations:
More informationRegression with a Binary Dependent Variable (SW Ch. 9)
Regression with a Binary Dependent Variable (SW Ch. 9) So far the dependent variable (Y) has been continuous: district-wide average test score traffic fatality rate But we might want to understand the
More informationIntroduction to Statistical Analysis
Introduction to Statistical Analysis Changyu Shen Richard A. and Susan F. Smith Center for Outcomes Research in Cardiology Beth Israel Deaconess Medical Center Harvard Medical School Objectives Descriptive
More informationCausal Modeling in Environmental Epidemiology. Joel Schwartz Harvard University
Causal Modeling in Environmental Epidemiology Joel Schwartz Harvard University When I was Young What do I mean by Causal Modeling? What would have happened if the population had been exposed to a instead
More informationGeneral Regression Model
Scott S. Emerson, M.D., Ph.D. Department of Biostatistics, University of Washington, Seattle, WA 98195, USA January 5, 2015 Abstract Regression analysis can be viewed as an extension of two sample statistical
More informationDoes low participation in cohort studies induce bias? Additional material
Does low participation in cohort studies induce bias? Additional material Content: Page 1: A heuristic proof of the formula for the asymptotic standard error Page 2-3: A description of the simulation study
More informationChapter 1. Modeling Basics
Chapter 1. Modeling Basics What is a model? Model equation and probability distribution Types of model effects Writing models in matrix form Summary 1 What is a statistical model? A model is a mathematical
More informationDynamic Models Part 1
Dynamic Models Part 1 Christopher Taber University of Wisconsin December 5, 2016 Survival analysis This is especially useful for variables of interest measured in lengths of time: Length of life after
More informationMeta-analysis of epidemiological dose-response studies
Meta-analysis of epidemiological dose-response studies Nicola Orsini 2nd Italian Stata Users Group meeting October 10-11, 2005 Institute Environmental Medicine, Karolinska Institutet Rino Bellocco Dept.
More informationChapter 11. Regression with a Binary Dependent Variable
Chapter 11 Regression with a Binary Dependent Variable 2 Regression with a Binary Dependent Variable (SW Chapter 11) So far the dependent variable (Y) has been continuous: district-wide average test score
More informationStandardization methods have been used in epidemiology. Marginal Structural Models as a Tool for Standardization ORIGINAL ARTICLE
ORIGINAL ARTICLE Marginal Structural Models as a Tool for Standardization Tosiya Sato and Yutaka Matsuyama Abstract: In this article, we show the general relation between standardization methods and marginal
More informationPropensity Score Analysis with Hierarchical Data
Propensity Score Analysis with Hierarchical Data Fan Li Alan Zaslavsky Mary Beth Landrum Department of Health Care Policy Harvard Medical School May 19, 2008 Introduction Population-based observational
More informationLecture 4 Multiple linear regression
Lecture 4 Multiple linear regression BIOST 515 January 15, 2004 Outline 1 Motivation for the multiple regression model Multiple regression in matrix notation Least squares estimation of model parameters
More informationNon-Gaussian Berkson Errors in Bioassay
Non-Gaussian Berkson Errors in Bioassay Alaa Althubaiti & Alexander Donev First version: 1 May 011 Research Report No., 011, Probability and Statistics Group School of Mathematics, The University of Manchester
More informationGOODNESS-OF-FIT FOR GEE: AN EXAMPLE WITH MENTAL HEALTH SERVICE UTILIZATION
STATISTICS IN MEDICINE GOODNESS-OF-FIT FOR GEE: AN EXAMPLE WITH MENTAL HEALTH SERVICE UTILIZATION NICHOLAS J. HORTON*, JUDITH D. BEBCHUK, CHERYL L. JONES, STUART R. LIPSITZ, PAUL J. CATALANO, GWENDOLYN
More information1. Introduction This paper focuses on two applications that are closely related mathematically, matched-pair studies and studies with errors-in-covari
Orthogonal Locally Ancillary Estimating Functions for Matched-Pair Studies and Errors-in-Covariates Molin Wang Harvard School of Public Health and Dana-Farber Cancer Institute, Boston, USA and John J.
More informationPerson-Time Data. Incidence. Cumulative Incidence: Example. Cumulative Incidence. Person-Time Data. Person-Time Data
Person-Time Data CF Jeff Lin, MD., PhD. Incidence 1. Cumulative incidence (incidence proportion) 2. Incidence density (incidence rate) December 14, 2005 c Jeff Lin, MD., PhD. c Jeff Lin, MD., PhD. Person-Time
More informationDepartment of Biostatistics University of Copenhagen
Comparison of five software solutions to mediation analysis Liis Starkopf Mikkel Porsborg Andersen Thomas Alexander Gerds Christian Torp-Pedersen Theis Lange Research Report 17/01 Department of Biostatistics
More informationGENERALIZED LINEAR MIXED MODELS AND MEASUREMENT ERROR. Raymond J. Carroll: Texas A&M University
GENERALIZED LINEAR MIXED MODELS AND MEASUREMENT ERROR Raymond J. Carroll: Texas A&M University Naisyin Wang: Xihong Lin: Roberto Gutierrez: Texas A&M University University of Michigan Southern Methodist
More informationObtaining the Maximum Likelihood Estimates in Incomplete R C Contingency Tables Using a Poisson Generalized Linear Model
Obtaining the Maximum Likelihood Estimates in Incomplete R C Contingency Tables Using a Poisson Generalized Linear Model Stuart R. LIPSITZ, Michael PARZEN, and Geert MOLENBERGHS This article describes
More informationOnline supplement. Absolute Value of Lung Function (FEV 1 or FVC) Explains the Sex Difference in. Breathlessness in the General Population
Online supplement Absolute Value of Lung Function (FEV 1 or FVC) Explains the Sex Difference in Breathlessness in the General Population Table S1. Comparison between patients who were excluded or included
More informationDistributed analysis in multi-center studies
Distributed analysis in multi-center studies Sharing of individual-level data across health plans or healthcare delivery systems continues to be challenging due to concerns about loss of patient privacy,
More informationLongitudinal Modeling with Logistic Regression
Newsom 1 Longitudinal Modeling with Logistic Regression Longitudinal designs involve repeated measurements of the same individuals over time There are two general classes of analyses that correspond to
More informationHarvard University. Harvard University Biostatistics Working Paper Series
Harvard University Harvard University Biostatistics Working Paper Series Year 2015 Paper 192 Negative Outcome Control for Unobserved Confounding Under a Cox Proportional Hazards Model Eric J. Tchetgen
More informationPrevious lecture. P-value based combination. Fixed vs random effects models. Meta vs. pooled- analysis. New random effects testing.
Previous lecture P-value based combination. Fixed vs random effects models. Meta vs. pooled- analysis. New random effects testing. Interaction Outline: Definition of interaction Additive versus multiplicative
More informationGMM Logistic Regression with Time-Dependent Covariates and Feedback Processes in SAS TM
Paper 1025-2017 GMM Logistic Regression with Time-Dependent Covariates and Feedback Processes in SAS TM Kyle M. Irimata, Arizona State University; Jeffrey R. Wilson, Arizona State University ABSTRACT The
More informationBIAS OF MAXIMUM-LIKELIHOOD ESTIMATES IN LOGISTIC AND COX REGRESSION MODELS: A COMPARATIVE SIMULATION STUDY
BIAS OF MAXIMUM-LIKELIHOOD ESTIMATES IN LOGISTIC AND COX REGRESSION MODELS: A COMPARATIVE SIMULATION STUDY Ingo Langner 1, Ralf Bender 2, Rebecca Lenz-Tönjes 1, Helmut Küchenhoff 2, Maria Blettner 2 1
More informationLISA Short Course Series Generalized Linear Models (GLMs) & Categorical Data Analysis (CDA) in R. Liang (Sally) Shan Nov. 4, 2014
LISA Short Course Series Generalized Linear Models (GLMs) & Categorical Data Analysis (CDA) in R Liang (Sally) Shan Nov. 4, 2014 L Laboratory for Interdisciplinary Statistical Analysis LISA helps VT researchers
More informationCase-control studies C&H 16
Case-control studies C&H 6 Bendix Carstensen Steno Diabetes Center & Department of Biostatistics, University of Copenhagen bxc@steno.dk http://bendixcarstensen.com PhD-course in Epidemiology, Department
More informationMisclassification in Logistic Regression with Discrete Covariates
Biometrical Journal 45 (2003) 5, 541 553 Misclassification in Logistic Regression with Discrete Covariates Ori Davidov*, David Faraggi and Benjamin Reiser Department of Statistics, University of Haifa,
More informationNemours Biomedical Research Statistics Course. Li Xie Nemours Biostatistics Core October 14, 2014
Nemours Biomedical Research Statistics Course Li Xie Nemours Biostatistics Core October 14, 2014 Outline Recap Introduction to Logistic Regression Recap Descriptive statistics Variable type Example of
More informationIntroduction to GSEM in Stata
Introduction to GSEM in Stata Christopher F Baum ECON 8823: Applied Econometrics Boston College, Spring 2016 Christopher F Baum (BC / DIW) Introduction to GSEM in Stata Boston College, Spring 2016 1 /
More informationProbability and Probability Distributions. Dr. Mohammed Alahmed
Probability and Probability Distributions 1 Probability and Probability Distributions Usually we want to do more with data than just describing them! We might want to test certain specific inferences about
More informationLOCAL LINEAR REGRESSION FOR GENERALIZED LINEAR MODELS WITH MISSING DATA
The Annals of Statistics 1998, Vol. 26, No. 3, 1028 1050 LOCAL LINEAR REGRESSION FOR GENERALIZED LINEAR MODELS WITH MISSING DATA By C. Y. Wang, 1 Suojin Wang, 2 Roberto G. Gutierrez and R. J. Carroll 3
More informationKnown unknowns : using multiple imputation to fill in the blanks for missing data
Known unknowns : using multiple imputation to fill in the blanks for missing data James Stanley Department of Public Health University of Otago, Wellington james.stanley@otago.ac.nz Acknowledgments Cancer
More informationTutorial 6: Tutorial on Translating between GLIMMPSE Power Analysis and Data Analysis. Acknowledgements:
Tutorial 6: Tutorial on Translating between GLIMMPSE Power Analysis and Data Analysis Anna E. Barón, Keith E. Muller, Sarah M. Kreidler, and Deborah H. Glueck Acknowledgements: The project was supported
More informationLecture 3.1 Basic Logistic LDA
y Lecture.1 Basic Logistic LDA 0.2.4.6.8 1 Outline Quick Refresher on Ordinary Logistic Regression and Stata Women s employment example Cross-Over Trial LDA Example -100-50 0 50 100 -- Longitudinal Data
More informationPersonalized Treatment Selection Based on Randomized Clinical Trials. Tianxi Cai Department of Biostatistics Harvard School of Public Health
Personalized Treatment Selection Based on Randomized Clinical Trials Tianxi Cai Department of Biostatistics Harvard School of Public Health Outline Motivation A systematic approach to separating subpopulations
More informationEstimating direct effects in cohort and case-control studies
Estimating direct effects in cohort and case-control studies, Ghent University Direct effects Introduction Motivation The problem of standard approaches Controlled direct effect models In many research
More informationSTAT 5500/6500 Conditional Logistic Regression for Matched Pairs
STAT 5500/6500 Conditional Logistic Regression for Matched Pairs Motivating Example: The data we will be using comes from a subset of data taken from the Los Angeles Study of the Endometrial Cancer Data
More informationBIOSTATISTICAL METHODS
BIOSTATISTICAL METHODS FOR TRANSLATIONAL & CLINICAL RESEARCH Cross-over Designs #: DESIGNING CLINICAL RESEARCH The subtraction of measurements from the same subject will mostly cancel or minimize effects
More informationUnbiased estimation of exposure odds ratios in complete records logistic regression
Unbiased estimation of exposure odds ratios in complete records logistic regression Jonathan Bartlett London School of Hygiene and Tropical Medicine www.missingdata.org.uk Centre for Statistical Methodology
More informationComparing IRT with Other Models
Comparing IRT with Other Models Lecture #14 ICPSR Item Response Theory Workshop Lecture #14: 1of 45 Lecture Overview The final set of slides will describe a parallel between IRT and another commonly used
More informationWU Weiterbildung. Linear Mixed Models
Linear Mixed Effects Models WU Weiterbildung SLIDE 1 Outline 1 Estimation: ML vs. REML 2 Special Models On Two Levels Mixed ANOVA Or Random ANOVA Random Intercept Model Random Coefficients Model Intercept-and-Slopes-as-Outcomes
More informationDISCRETE PROBABILITY DISTRIBUTIONS
DISCRETE PROBABILITY DISTRIBUTIONS REVIEW OF KEY CONCEPTS SECTION 41 Random Variable A random variable X is a numerically valued quantity that takes on specific values with different probabilities The
More informationRonald Christensen. University of New Mexico. Albuquerque, New Mexico. Wesley Johnson. University of California, Irvine. Irvine, California
Texts in Statistical Science Bayesian Ideas and Data Analysis An Introduction for Scientists and Statisticians Ronald Christensen University of New Mexico Albuquerque, New Mexico Wesley Johnson University
More informationExam Applied Statistical Regression. Good Luck!
Dr. M. Dettling Summer 2011 Exam Applied Statistical Regression Approved: Tables: Note: Any written material, calculator (without communication facility). Attached. All tests have to be done at the 5%-level.
More informationMarginal Structural Cox Model for Survival Data with Treatment-Confounder Feedback
University of South Carolina Scholar Commons Theses and Dissertations 2017 Marginal Structural Cox Model for Survival Data with Treatment-Confounder Feedback Yanan Zhang University of South Carolina Follow
More informationUsing the Delta Method to Construct Confidence Intervals for Predicted Probabilities, Rates, and Discrete Changes 1
Using the Delta Method to Construct Confidence Intervals for Predicted Probabilities, Rates, Discrete Changes 1 JunXuJ.ScottLong Indiana University 2005-02-03 1 General Formula The delta method is a general
More informationGrowth Mixture Model
Growth Mixture Model Latent Variable Modeling and Measurement Biostatistics Program Harvard Catalyst The Harvard Clinical & Translational Science Center Short course, October 28, 2016 Slides contributed
More informationTutorial 2: Power and Sample Size for the Paired Sample t-test
Tutorial 2: Power and Sample Size for the Paired Sample t-test Preface Power is the probability that a study will reject the null hypothesis. The estimated probability is a function of sample size, variability,
More informationModel Assumptions; Predicting Heterogeneity of Variance
Model Assumptions; Predicting Heterogeneity of Variance Today s topics: Model assumptions Normality Constant variance Predicting heterogeneity of variance CLP 945: Lecture 6 1 Checking for Violations of
More informationMediation analyses. Advanced Psychometrics Methods in Cognitive Aging Research Workshop. June 6, 2016
Mediation analyses Advanced Psychometrics Methods in Cognitive Aging Research Workshop June 6, 2016 1 / 40 1 2 3 4 5 2 / 40 Goals for today Motivate mediation analysis Survey rapidly developing field in
More informationSOME ASPECTS OF MEASUREMENT ERROR IN EXPLANATORY VARIABLES FOR CONTINUOUS AND BINARY REGRESSION MODELS
STATISTICS IN MEDICINE Statist. Med. 17, 2157 2177 (1998) SOME ASPECTS OF MEASUREMENT ERROR IN EXPLANATORY VARIABLES FOR CONTINUOUS AND BINARY REGRESSION MODELS G. K. REEVES*, D.R.COX, S. C. DARBY AND
More informationSTAT331. Cox s Proportional Hazards Model
STAT331 Cox s Proportional Hazards Model In this unit we introduce Cox s proportional hazards (Cox s PH) model, give a heuristic development of the partial likelihood function, and discuss adaptations
More informationUniversity of California, Berkeley
University of California, Berkeley U.C. Berkeley Division of Biostatistics Working Paper Series Year 2008 Paper 241 A Note on Risk Prediction for Case-Control Studies Sherri Rose Mark J. van der Laan Division
More informationA Practitioner s Guide to Generalized Linear Models
A Practitioners Guide to Generalized Linear Models Background The classical linear models and most of the minimum bias procedures are special cases of generalized linear models (GLMs). GLMs are more technically
More informationTutorial 4: Power and Sample Size for the Two-sample t-test with Unequal Variances
Tutorial 4: Power and Sample Size for the Two-sample t-test with Unequal Variances Preface Power is the probability that a study will reject the null hypothesis. The estimated probability is a function
More informationMeasurement error, GLMs, and notational conventions
The Stata Journal (2003) 3, Number 4, pp. 329 341 Measurement error, GLMs, and notational conventions James W. Hardin Arnold School of Public Health University of South Carolina Columbia, SC 29208 Raymond
More informationAttributable Risk Function in the Proportional Hazards Model
UW Biostatistics Working Paper Series 5-31-2005 Attributable Risk Function in the Proportional Hazards Model Ying Qing Chen Fred Hutchinson Cancer Research Center, yqchen@u.washington.edu Chengcheng Hu
More informationNon-linear mixed models in the analysis of mediated longitudinal data with binary outcomes
RESEARCH ARTICLE Open Access Non-linear mixed models in the analysis of mediated longitudinal data with binary outcomes Emily A Blood 1,2* and Debbie M Cheng 1 Abstract Background: Structural equation
More informationTHE DESIGN (VERSUS THE ANALYSIS) OF EVALUATIONS FROM OBSERVATIONAL STUDIES: PARALLELS WITH THE DESIGN OF RANDOMIZED EXPERIMENTS DONALD B.
THE DESIGN (VERSUS THE ANALYSIS) OF EVALUATIONS FROM OBSERVATIONAL STUDIES: PARALLELS WITH THE DESIGN OF RANDOMIZED EXPERIMENTS DONALD B. RUBIN My perspective on inference for causal effects: In randomized
More informationCorrelation and regression
1 Correlation and regression Yongjua Laosiritaworn Introductory on Field Epidemiology 6 July 2015, Thailand Data 2 Illustrative data (Doll, 1955) 3 Scatter plot 4 Doll, 1955 5 6 Correlation coefficient,
More informationDistribution-free ROC Analysis Using Binary Regression Techniques
Distribution-free Analysis Using Binary Techniques Todd A. Alonzo and Margaret S. Pepe As interpreted by: Andrew J. Spieker University of Washington Dept. of Biostatistics Introductory Talk No, not that!
More informationLogistic regression analysis. Birthe Lykke Thomsen H. Lundbeck A/S
Logistic regression analysis Birthe Lykke Thomsen H. Lundbeck A/S 1 Response with only two categories Example Odds ratio and risk ratio Quantitative explanatory variable More than one variable Logistic
More informationANALYSIS OF CORRELATED DATA SAMPLING FROM CLUSTERS CLUSTER-RANDOMIZED TRIALS
ANALYSIS OF CORRELATED DATA SAMPLING FROM CLUSTERS CLUSTER-RANDOMIZED TRIALS Background Independent observations: Short review of well-known facts Comparison of two groups continuous response Control group:
More informationIntroduction to logistic regression
Introduction to logistic regression Tuan V. Nguyen Professor and NHMRC Senior Research Fellow Garvan Institute of Medical Research University of New South Wales Sydney, Australia What we are going to learn
More informationFaculty of Health Sciences. Regression models. Counts, Poisson regression, Lene Theil Skovgaard. Dept. of Biostatistics
Faculty of Health Sciences Regression models Counts, Poisson regression, 27-5-2013 Lene Theil Skovgaard Dept. of Biostatistics 1 / 36 Count outcome PKA & LTS, Sect. 7.2 Poisson regression The Binomial
More informationInstructions: Closed book, notes, and no electronic devices. Points (out of 200) in parentheses
ISQS 5349 Final Spring 2011 Instructions: Closed book, notes, and no electronic devices. Points (out of 200) in parentheses 1. (10) What is the definition of a regression model that we have used throughout
More informationApproximate analysis of covariance in trials in rare diseases, in particular rare cancers
Approximate analysis of covariance in trials in rare diseases, in particular rare cancers Stephen Senn (c) Stephen Senn 1 Acknowledgements This work is partly supported by the European Union s 7th Framework
More information1 Introduction A common problem in categorical data analysis is to determine the effect of explanatory variables V on a binary outcome D of interest.
Conditional and Unconditional Categorical Regression Models with Missing Covariates Glen A. Satten and Raymond J. Carroll Λ December 4, 1999 Abstract We consider methods for analyzing categorical regression
More informationA note on R 2 measures for Poisson and logistic regression models when both models are applicable
Journal of Clinical Epidemiology 54 (001) 99 103 A note on R measures for oisson and logistic regression models when both models are applicable Martina Mittlböck, Harald Heinzl* Department of Medical Computer
More informationGeneralized Linear Models. Last time: Background & motivation for moving beyond linear
Generalized Linear Models Last time: Background & motivation for moving beyond linear regression - non-normal/non-linear cases, binary, categorical data Today s class: 1. Examples of count and ordered
More informationBayesian Hierarchical Models
Bayesian Hierarchical Models Gavin Shaddick, Millie Green, Matthew Thomas University of Bath 6 th - 9 th December 2016 1/ 34 APPLICATIONS OF BAYESIAN HIERARCHICAL MODELS 2/ 34 OUTLINE Spatial epidemiology
More informationSTA6938-Logistic Regression Model
Dr. Ying Zhang STA6938-Logistic Regression Model Topic 6-Logistic Regression for Case-Control Studies Outlines: 1. Biomedical Designs 2. Logistic Regression Models for Case-Control Studies 3. Logistic
More informationUNIVERSITY OF MASSACHUSETTS Department of Mathematics and Statistics Applied Statistics Friday, January 15, 2016
UNIVERSITY OF MASSACHUSETTS Department of Mathematics and Statistics Applied Statistics Friday, January 15, 2016 Work all problems. 60 points are needed to pass at the Masters Level and 75 to pass at the
More informationThe impact of covariance misspecification in multivariate Gaussian mixtures on estimation and inference
The impact of covariance misspecification in multivariate Gaussian mixtures on estimation and inference An application to longitudinal modeling Brianna Heggeseth with Nicholas Jewell Department of Statistics
More informatione author and the promoter give permission to consult this master dissertation and to copy it or parts of it for personal use. Each other use falls
e author and the promoter give permission to consult this master dissertation and to copy it or parts of it for personal use. Each other use falls under the restrictions of the copyright, in particular
More information2 Describing Contingency Tables
2 Describing Contingency Tables I. Probability structure of a 2-way contingency table I.1 Contingency Tables X, Y : cat. var. Y usually random (except in a case-control study), response; X can be random
More informationConfidence Intervals. Contents. Technical Guide
Technical Guide Confidence Intervals Contents Introduction Software options Directory of methods 3 Appendix 1 Byar s method 6 Appendix χ exact method 7 Appendix 3 Wilson Score method 8 Appendix 4 Dobson
More informationIntroduction to Econometrics
Introduction to Econometrics T H I R D E D I T I O N Global Edition James H. Stock Harvard University Mark W. Watson Princeton University Boston Columbus Indianapolis New York San Francisco Upper Saddle
More informationBinary Choice Models Probit & Logit. = 0 with Pr = 0 = 1. decision-making purchase of durable consumer products unemployment
BINARY CHOICE MODELS Y ( Y ) ( Y ) 1 with Pr = 1 = P = 0 with Pr = 0 = 1 P Examples: decision-making purchase of durable consumer products unemployment Estimation with OLS? Yi = Xiβ + εi Problems: nonsense
More informationDescribing Change over Time: Adding Linear Trends
Describing Change over Time: Adding Linear Trends Longitudinal Data Analysis Workshop Section 7 University of Georgia: Institute for Interdisciplinary Research in Education and Human Development Section
More informationCausal Inference with a Continuous Treatment and Outcome: Alternative Estimators for Parametric Dose-Response Functions
Causal Inference with a Continuous Treatment and Outcome: Alternative Estimators for Parametric Dose-Response Functions Joe Schafer Office of the Associate Director for Research and Methodology U.S. Census
More information