Marginal, crude and conditional odds ratios

Size: px
Start display at page:

Download "Marginal, crude and conditional odds ratios"

Transcription

1 Marginal, crude and conditional odds ratios Denitions and estimation Travis Loux Gradute student, UC Davis Department of Statistics March 31, 2010

2 Parameter Denitions When measuring the eect of a binary exposure(z ) on a binary response (Y ), there are a number of probabilities of interest with regard to the population: marginal: P(Y = 1) crude: P(Y = 1 Z = 1) conditional: P(Y = 1 Z = 1, X = x) These probabilities are not necessarily equal and have dierent interpretations which are each useful in dierent settings. We need to nd ways to estimate each value, being clear about which of these quantities our statistics estimate.

3 The Counterfactual Model Denitions We will use the following notation: Z - the binary exposure (or treatment) of interest X - covariates associated with exposure and response Y 1 - the potential binary reponse if the unit is exposed Y 0 - the potential binary reponse if the unit is not exposed Though each unit in the population may potentially have either Y 1 and Y 0, we are only able to observe one. We call the observed response Y and use the relation Y = Z Y 1 + (1 Z) Y 0

4 The Counterfactual Model A population A hypothetical population would then look like the following: Unit X 1 X 2 X 3 Z Y 1 Y where the observed Y for each unit is colored blue (This population will be used in a simulation example later...)

5 Marginal Odds Ratio The marginal odds ratio can be obtained by comparing the odds of response in the population if everyone is exposed Odds exp = P (Y 1 = 1) P (Y 1 = 0) to the odds of response if everyone in not exposed Looking at the odds ratio as Odds unexp = P (Y 0 = 1) P (Y 0 = 0) Oddsexp Odds unexp, we see the marginal odds ratio ψ marg = P (Y 1 = 1) P (Y 0 = 0) P (Y 1 = 0) P (Y 0 = 1)

6 Crude Odds Ratio The marginal odds ratio is often approximated by estimating the crude odds ratio, obtained by taking the cross product P (Y = 1 Z = 1) P (Y = 0 Z = 0) ψ crude = P (Y = 0 Z = 1) P (Y = 1 Z = 0) = P (Y 1 = 1 Z = 1) P (Y 0 = 0 Z = 0) P (Y 1 = 0 Z = 1) P (Y 0 = 1 Z = 0) However, when confounding is present P (Y 1 = 1 Z = 1) = P (Y 1 = 1) P (Y 0 = 1 Z = 0) = P (Y 0 = 1) suggesting that the crude and marginal odds ratio are dierent values

7 Conditional Odds Ratio The conditional odds ratio is dened as ψ cond (x) = P (Y = 1 Z = 1, X = x) P (Y = 0 Z = 0, X = x) P (Y = 0 Z = 1, X = x) P (Y = 1 Z = 0, X = x) With the assumption of strongly ignorable treatment assignment, i.e. (Y 1, Y 0 ) Z X, we can remove Z from the conditional probabilities: ψ cond (x) = P (Y 1 = 1 X = x) P (Y 0 = 0 X = x) P (Y 1 = 0 X = x) P (Y 0 = 1 X = x)

8 Conditional Odds Ratio Under the logistic model If the data follow a logistic model with { exp β 0 + } p β j=1 j x j + γz P (Y = 1 X = x, Z = z) = { 1 + exp β 0 + } p β j=1 j x j + γz logistic regression will lead to an unbiased and asymptotically ecient estimator of ψ cond (x) In fact, the conditional odds ratio is constant across X and ψ cond (x) = ψ cond = e γ In order for the estimate to be unbiased, X must contain all predictors of Y, not just the confounders (Gail, Wieand, Piantadosi [1984])

9 Contingency Tables Counterfactual values For potential response Y 1, we have the contingency table: So we see that where N is the population size Z = 1 Z = 0 Y 1 = 1 A (1) B (1) Y 1 = 0 C (1) D (1) P (Y 1 = 1) = A(1) + B (1) N P (Y 1 = 0) = C (1) + D (1) N

10 Contingency Tables Counterfactual values For potential response Y 0, we have the contingency table: So we see that Z = 1 Z = 0 Y 0 = 1 A (0) B (0) Y 0 = 0 C (0) D (0) P (Y 0 = 1) = A(0) + B (0) N P (Y 0 = 0) = C (0) + D (0) N

11 Contingency Tables Counterfactual values Then the marginal odds ratio is equal to ψ marg = P (Y 1 = 1) P (Y 0 = 0) P (Y 1 = 0) P (Y 0 = 1) ψ marg = (A (1) + B (1)) ( C (0) + D (0)) (C (1) + D (1)) ( A (0) + B (0)) but this involves the unobservable quantities B (1), D (1), A (0), and C (0).

12 Contingency Tables Observed values The crude odds ratio is computed by classifying the observed data into the contingency table then taking the cross products Z = 1 Z = 0 Y = 1 A (1) B (0) Y = 0 C (1) D (0) ψ crude = A(1) D (0) C (1) B (0) Compare this to the marginal odds ratio: (A (1) + B (1)) ( C (0) + D (0)) ψ marg = (C (1) + D (1)) ( A (0) + B (0))

13 Estimation Crude odds ratio Given a sample of n observations, the crude odds ratio is estimated by classifying the sample into the contingency table Z = 1 Z = 0 Y = 1 a b Y = 0 c d then taking the ratio of cross products ˆψ crude = a d c b This will lead to unbiased estimation of ψ crude, but is typically biased for ψ marg.

14 Estimation Conditional odds ratio If the logistic model is correct, then ˆψ cond = e ˆγ will be a consistent and asymptotically ecient estimate of ψ cond. Other (non-parametric) possibilities include subclassifying the observations on the covariate X then using a non-parametric estimator such as the Woolf estimate or the Mantel-Haenzsel estimate. We will focus on the Mantel-Hanzsel estimate.

15 Estimation Mantel-Haenszel: Denition The Mantel-Haenszel estimator is computed by stratifying/sublcassifying observations into subtables based on the covariate X : Z = 1 Z = 0 Y = 1 a k b k Y = 0 c k d k n k for stratum k, k {1,..., K}, then using the estimate ˆψ MH = K k=1 K k=1 a k d k n k b k c k n k = K w k ˆψk k=1 where w k = b k c k n k K b l c l l=1 n l and ˆψ k = a k d k b k c k

16 Estimation Mantel-Haenszel: Stratication on the covariates Assuming that the conditional odds ratio is constant across values of X, as in logistic regression, we have the following results Categorical covariates: If the strata correspond to levels of X, then from the formula ˆψ MH = K w k ˆψ k k=1 we see that ˆψ MH will estimate to the conditional odds ratio.

17 Estimation Mantel-Haenszel: Stratication on the covariates Continuous covariates: X will vary within each stratum, so ˆψ MH will be biased in estimating the conditional odds ratio: ˆψ MH ψ cond w k bias k k ˆψ MH will be consistent for ψ cond only if k w k bias k = 1 When matching on X, we assure that x is constant within each stratum, so bias k = 1. This means that k w k bias k = k w k = 1 and ˆψ MH is consistent for ψ cond

18 The Propensity Score Denition and consequences Proposed by Rosenbaum and Rubin (1983), the propensity score is dened as e(x) = P (Z = 1 X = x) It is known that X is distributed similarly in the subpopulations dened by Z when conditioning in the propensity score: X Z e(x) Also, if treatment assignment is strongly ignorable, i.e. (Y 1, Y 0 ) Z X, we also know that (Y 1, Y 0 ) Z e(x) Together, these mean that e(x ) is a univariate summary of X which will account for any confounding of X with Z, giving hope that an estimate for the marginal odds ratio is possible

19 Using the Propensity Score Stratication and Matching As is the case with stratication on covariates, stratication on the propensity score will lead to an estimator which is biased for the marginal odds ratio due to residual confounding within each stratum. In fact, simulations have shown that perfect 1-to-1 matching on the popensity score also leads to a quantity which is biased for the marginal odds ratio. While the magnitude of the bias varies, the bias tends to be in the direction of the conditional odds ratio.

20 Using the Propensity Score Weighting There are a number of estimators of P (Y 1 = 1) and P (Y 0 = 1) which weight observations by the inverse of the propensity score. We will discuss the simplest: ˆP (Y 1 = 1) = 1 n n Z i Y i e i i=1 and ˆP (Y0 = 1) = 1 n n i=1 (1 Z i ) Y i (1 e i ) Lunceford and Davidian (2004) show that these quanities are unbiased for P (Y 1 = 1) and P (Y 0 = 1), respectively. Since both estimators are unbiased for the respective parameters, substituting these values into the formula for ψ marg = P(Y 1=1)P(Y 0 =0) P(Y 1 =0)P(Y 0 =1) will lead to an asymptotically unbiased estimator.

21 Using the Propensity Score Weighting Numerical problem: ê(x) very close to 0 or 1 may lead to 1 n n i=1 Z i Y i ê i / (0, 1) or 1 n n i=1 (1 Z i ) Y i 1 ê i / (0, 1) In simulations, < ê(x) < showed little reason for concern. This problem will occur when the domain of X in the exposed and unexposed groups are not equal

22 Simulation Example Creating the population For the following simulation I created a population of 2 million units, starting with three mutually independent standard normal covariates X 1 N (0, 1) X 2 N (0, 1) X 3 N (0, 1) From these covariates, I simulated Z through the model logit [P (Z = 1 X)] = 1X X X 3 Finally, I simulated Y 1 and Y 0 for each unit using the model logit [P (Y 1 = 1 X)] = 2X 1 X 2 + X 3 + log(3) logit [P (Y 0 = 1 X)] = 2X 1 X 2 + X 3

23 Simulation Example Revisiting the population A few units in the simulated population: Unit X 1 X 2 X 3 Z Y 1 Y where the observed Y for each unit is colored blue

24 Simulation Example Parameters of interest From the population, I calculated the following parameters: ψ marg = P (Y 1 = 1) P (Y 0 = 0) P (Y 1 = 0) P (Y 0 = 1) = ψ crude = P (Y 1 = 1 Z = 1) P (Y 0 = 0 Z = 0) P (Y 1 = 0 Z = 1) P (Y 0 = 1 Z = 0) = ψ cond = P (Y 1 = 1 X = x) P (Y 0 = 0 X = x) P (Y 1 = 0 X = x) P (Y 0 = 1 X = x) = where the conditional odds ratio was calculated via logistic regression.

25 Simulation Example Mechanics 10,000 simulations Random sample of 2,100 observations from my population Removed counterfactual knowledge Only observed Y, not Y 1 or Y 0 Remember, Y = Z Y1 + (1 Z) Y0 Stratication was done by quantiles of the relevant variables

26 Simulation Example Estimation: Stratication on the covariates Crude Logistic Number of strata per variable Conditional: Crude: Marginal:

27 Simulation Example Estimation: Matching on the propensity score Crude Logistic Number of propensity score strata Conditional: Crude: Marginal:

28 Simulation Example Estimation: Weighting by the propensity score Crude PS weighted Logistic Conditional: Crude: Marginal:

Estimating the Marginal Odds Ratio in Observational Studies

Estimating the Marginal Odds Ratio in Observational Studies Estimating the Marginal Odds Ratio in Observational Studies Travis Loux Christiana Drake Department of Statistics University of California, Davis June 20, 2011 Outline The Counterfactual Model Odds Ratios

More information

Propensity Score Methods for Causal Inference

Propensity Score Methods for Causal Inference John Pura BIOS790 October 2, 2015 Causal inference Philosophical problem, statistical solution Important in various disciplines (e.g. Koch s postulates, Bradford Hill criteria, Granger causality) Good

More information

Marginal versus conditional effects: does it make a difference? Mireille Schnitzer, PhD Université de Montréal

Marginal versus conditional effects: does it make a difference? Mireille Schnitzer, PhD Université de Montréal Marginal versus conditional effects: does it make a difference? Mireille Schnitzer, PhD Université de Montréal Overview In observational and experimental studies, the goal may be to estimate the effect

More information

Lecture Discussion. Confounding, Non-Collapsibility, Precision, and Power Statistics Statistical Methods II. Presented February 27, 2018

Lecture Discussion. Confounding, Non-Collapsibility, Precision, and Power Statistics Statistical Methods II. Presented February 27, 2018 , Non-, Precision, and Power Statistics 211 - Statistical Methods II Presented February 27, 2018 Dan Gillen Department of Statistics University of California, Irvine Discussion.1 Various definitions of

More information

Asymptotic equivalence of paired Hotelling test and conditional logistic regression

Asymptotic equivalence of paired Hotelling test and conditional logistic regression Asymptotic equivalence of paired Hotelling test and conditional logistic regression Félix Balazard 1,2 arxiv:1610.06774v1 [math.st] 21 Oct 2016 Abstract 1 Sorbonne Universités, UPMC Univ Paris 06, CNRS

More information

Propensity Score Analysis with Hierarchical Data

Propensity Score Analysis with Hierarchical Data Propensity Score Analysis with Hierarchical Data Fan Li Alan Zaslavsky Mary Beth Landrum Department of Health Care Policy Harvard Medical School May 19, 2008 Introduction Population-based observational

More information

Causal Inference with General Treatment Regimes: Generalizing the Propensity Score

Causal Inference with General Treatment Regimes: Generalizing the Propensity Score Causal Inference with General Treatment Regimes: Generalizing the Propensity Score David van Dyk Department of Statistics, University of California, Irvine vandyk@stat.harvard.edu Joint work with Kosuke

More information

Summary and discussion of The central role of the propensity score in observational studies for causal effects

Summary and discussion of The central role of the propensity score in observational studies for causal effects Summary and discussion of The central role of the propensity score in observational studies for causal effects Statistics Journal Club, 36-825 Jessica Chemali and Michael Vespe 1 Summary 1.1 Background

More information

Causal Inference in Observational Studies with Non-Binary Treatments. David A. van Dyk

Causal Inference in Observational Studies with Non-Binary Treatments. David A. van Dyk Causal Inference in Observational Studies with Non-Binary reatments Statistics Section, Imperial College London Joint work with Shandong Zhao and Kosuke Imai Cass Business School, October 2013 Outline

More information

Strategy of Bayesian Propensity. Score Estimation Approach. in Observational Study

Strategy of Bayesian Propensity. Score Estimation Approach. in Observational Study Theoretical Mathematics & Applications, vol.2, no.3, 2012, 75-86 ISSN: 1792-9687 (print), 1792-9709 (online) Scienpress Ltd, 2012 Strategy of Bayesian Propensity Score Estimation Approach in Observational

More information

OUTCOME REGRESSION AND PROPENSITY SCORES (CHAPTER 15) BIOS Outcome regressions and propensity scores

OUTCOME REGRESSION AND PROPENSITY SCORES (CHAPTER 15) BIOS Outcome regressions and propensity scores OUTCOME REGRESSION AND PROPENSITY SCORES (CHAPTER 15) BIOS 776 1 15 Outcome regressions and propensity scores Outcome Regression and Propensity Scores ( 15) Outline 15.1 Outcome regression 15.2 Propensity

More information

Journal of Biostatistics and Epidemiology

Journal of Biostatistics and Epidemiology Journal of Biostatistics and Epidemiology Methodology Marginal versus conditional causal effects Kazem Mohammad 1, Seyed Saeed Hashemi-Nazari 2, Nasrin Mansournia 3, Mohammad Ali Mansournia 1* 1 Department

More information

CDA Chapter 3 part II

CDA Chapter 3 part II CDA Chapter 3 part II Two-way tables with ordered classfications Let u 1 u 2... u I denote scores for the row variable X, and let ν 1 ν 2... ν J denote column Y scores. Consider the hypothesis H 0 : X

More information

Ignoring the matching variables in cohort studies - when is it valid, and why?

Ignoring the matching variables in cohort studies - when is it valid, and why? Ignoring the matching variables in cohort studies - when is it valid, and why? Arvid Sjölander Abstract In observational studies of the effect of an exposure on an outcome, the exposure-outcome association

More information

Double Robustness. Bang and Robins (2005) Kang and Schafer (2007)

Double Robustness. Bang and Robins (2005) Kang and Schafer (2007) Double Robustness Bang and Robins (2005) Kang and Schafer (2007) Set-Up Assume throughout that treatment assignment is ignorable given covariates (similar to assumption that data are missing at random

More information

University of California, Berkeley

University of California, Berkeley University of California, Berkeley U.C. Berkeley Division of Biostatistics Working Paper Series Year 2011 Paper 288 Targeted Maximum Likelihood Estimation of Natural Direct Effect Wenjing Zheng Mark J.

More information

Assess Assumptions and Sensitivity Analysis. Fan Li March 26, 2014

Assess Assumptions and Sensitivity Analysis. Fan Li March 26, 2014 Assess Assumptions and Sensitivity Analysis Fan Li March 26, 2014 Two Key Assumptions 1. Overlap: 0

More information

An Introduction to Causal Analysis on Observational Data using Propensity Scores

An Introduction to Causal Analysis on Observational Data using Propensity Scores An Introduction to Causal Analysis on Observational Data using Propensity Scores Margie Rosenberg*, PhD, FSA Brian Hartman**, PhD, ASA Shannon Lane* *University of Wisconsin Madison **University of Connecticut

More information

Logistic regression: Why we often can do what we think we can do. Maarten Buis 19 th UK Stata Users Group meeting, 10 Sept. 2015

Logistic regression: Why we often can do what we think we can do. Maarten Buis 19 th UK Stata Users Group meeting, 10 Sept. 2015 Logistic regression: Why we often can do what we think we can do Maarten Buis 19 th UK Stata Users Group meeting, 10 Sept. 2015 1 Introduction Introduction - In 2010 Carina Mood published an overview article

More information

Lecture 15 (Part 2): Logistic Regression & Common Odds Ratio, (With Simulations)

Lecture 15 (Part 2): Logistic Regression & Common Odds Ratio, (With Simulations) Lecture 15 (Part 2): Logistic Regression & Common Odds Ratio, (With Simulations) Dipankar Bandyopadhyay, Ph.D. BMTRY 711: Analysis of Categorical Data Spring 2011 Division of Biostatistics and Epidemiology

More information

Propensity Score Weighting with Multilevel Data

Propensity Score Weighting with Multilevel Data Propensity Score Weighting with Multilevel Data Fan Li Department of Statistical Science Duke University October 25, 2012 Joint work with Alan Zaslavsky and Mary Beth Landrum Introduction In comparative

More information

Combining multiple observational data sources to estimate causal eects

Combining multiple observational data sources to estimate causal eects Department of Statistics, North Carolina State University Combining multiple observational data sources to estimate causal eects Shu Yang* syang24@ncsuedu Joint work with Peng Ding UC Berkeley May 23,

More information

Flexible Estimation of Treatment Effect Parameters

Flexible Estimation of Treatment Effect Parameters Flexible Estimation of Treatment Effect Parameters Thomas MaCurdy a and Xiaohong Chen b and Han Hong c Introduction Many empirical studies of program evaluations are complicated by the presence of both

More information

More Statistics tutorial at Logistic Regression and the new:

More Statistics tutorial at  Logistic Regression and the new: Logistic Regression and the new: Residual Logistic Regression 1 Outline 1. Logistic Regression 2. Confounding Variables 3. Controlling for Confounding Variables 4. Residual Linear Regression 5. Residual

More information

Causal Inference Basics

Causal Inference Basics Causal Inference Basics Sam Lendle October 09, 2013 Observed data, question, counterfactuals Observed data: n i.i.d copies of baseline covariates W, treatment A {0, 1}, and outcome Y. O i = (W i, A i,

More information

Propensity Score Methods for Estimating Causal Effects from Complex Survey Data

Propensity Score Methods for Estimating Causal Effects from Complex Survey Data Propensity Score Methods for Estimating Causal Effects from Complex Survey Data Dissertation Presented in Partial Fulfillment of the Requirements for the Degree Doctor of Philosophy in the Graduate School

More information

Standardization methods have been used in epidemiology. Marginal Structural Models as a Tool for Standardization ORIGINAL ARTICLE

Standardization methods have been used in epidemiology. Marginal Structural Models as a Tool for Standardization ORIGINAL ARTICLE ORIGINAL ARTICLE Marginal Structural Models as a Tool for Standardization Tosiya Sato and Yutaka Matsuyama Abstract: In this article, we show the general relation between standardization methods and marginal

More information

General Regression Model

General Regression Model Scott S. Emerson, M.D., Ph.D. Department of Biostatistics, University of Washington, Seattle, WA 98195, USA January 5, 2015 Abstract Regression analysis can be viewed as an extension of two sample statistical

More information

Estimating direct effects in cohort and case-control studies

Estimating direct effects in cohort and case-control studies Estimating direct effects in cohort and case-control studies, Ghent University Direct effects Introduction Motivation The problem of standard approaches Controlled direct effect models In many research

More information

Machine Learning Linear Classification. Prof. Matteo Matteucci

Machine Learning Linear Classification. Prof. Matteo Matteucci Machine Learning Linear Classification Prof. Matteo Matteucci Recall from the first lecture 2 X R p Regression Y R Continuous Output X R p Y {Ω 0, Ω 1,, Ω K } Classification Discrete Output X R p Y (X)

More information

Biostat 2065 Analysis of Incomplete Data

Biostat 2065 Analysis of Incomplete Data Biostat 2065 Analysis of Incomplete Data Gong Tang Dept of Biostatistics University of Pittsburgh September 13 & 15, 2005 1. Complete-case analysis (I) Complete-case analysis refers to analysis based on

More information

SIMPLE EXAMPLES OF ESTIMATING CAUSAL EFFECTS USING TARGETED MAXIMUM LIKELIHOOD ESTIMATION

SIMPLE EXAMPLES OF ESTIMATING CAUSAL EFFECTS USING TARGETED MAXIMUM LIKELIHOOD ESTIMATION Johns Hopkins University, Dept. of Biostatistics Working Papers 3-3-2011 SIMPLE EXAMPLES OF ESTIMATING CAUSAL EFFECTS USING TARGETED MAXIMUM LIKELIHOOD ESTIMATION Michael Rosenblum Johns Hopkins Bloomberg

More information

A tool to demystify regression modelling behaviour

A tool to demystify regression modelling behaviour A tool to demystify regression modelling behaviour Thomas Alexander Gerds 1 / 38 Appetizer Every child knows how regression analysis works. The essentials of regression modelling strategy, such as which

More information

An Efficient Estimation Method for Longitudinal Surveys with Monotone Missing Data

An Efficient Estimation Method for Longitudinal Surveys with Monotone Missing Data An Efficient Estimation Method for Longitudinal Surveys with Monotone Missing Data Jae-Kwang Kim 1 Iowa State University June 28, 2012 1 Joint work with Dr. Ming Zhou (when he was a PhD student at ISU)

More information

arxiv: v1 [stat.me] 15 May 2011

arxiv: v1 [stat.me] 15 May 2011 Working Paper Propensity Score Analysis with Matching Weights Liang Li, Ph.D. arxiv:1105.2917v1 [stat.me] 15 May 2011 Associate Staff of Biostatistics Department of Quantitative Health Sciences, Cleveland

More information

Logistic regression: Miscellaneous topics

Logistic regression: Miscellaneous topics Logistic regression: Miscellaneous topics April 11 Introduction We have covered two approaches to inference for GLMs: the Wald approach and the likelihood ratio approach I claimed that the likelihood ratio

More information

Selection on Observables: Propensity Score Matching.

Selection on Observables: Propensity Score Matching. Selection on Observables: Propensity Score Matching. Department of Economics and Management Irene Brunetti ireneb@ec.unipi.it 24/10/2017 I. Brunetti Labour Economics in an European Perspective 24/10/2017

More information

Sensitivity analysis and distributional assumptions

Sensitivity analysis and distributional assumptions Sensitivity analysis and distributional assumptions Tyler J. VanderWeele Department of Health Studies, University of Chicago 5841 South Maryland Avenue, MC 2007, Chicago, IL 60637, USA vanderweele@uchicago.edu

More information

Stratification and Weighting Via the Propensity Score in Estimation of Causal Treatment Effects: A Comparative Study

Stratification and Weighting Via the Propensity Score in Estimation of Causal Treatment Effects: A Comparative Study Stratification and Weighting Via the Propensity Score in Estimation of Causal Treatment Effects: A Comparative Study Jared K. Lunceford 1 and Marie Davidian 2 1 Merck Research Laboratories, RY34-A316,

More information

Fractional Imputation in Survey Sampling: A Comparative Review

Fractional Imputation in Survey Sampling: A Comparative Review Fractional Imputation in Survey Sampling: A Comparative Review Shu Yang Jae-Kwang Kim Iowa State University Joint Statistical Meetings, August 2015 Outline Introduction Fractional imputation Features Numerical

More information

JOINTLY MODELING CONTINUOUS AND BINARY OUTCOMES FOR BOOLEAN OUTCOMES: AN APPLICATION TO MODELING HYPERTENSION

JOINTLY MODELING CONTINUOUS AND BINARY OUTCOMES FOR BOOLEAN OUTCOMES: AN APPLICATION TO MODELING HYPERTENSION Johns Hopkins University, Dept. of Biostatistics Working Papers 2-19-2008 JOINTLY MODELING CONTINUOUS AND BINARY OUTCOMES FOR BOOLEAN OUTCOMES: AN APPLICATION TO MODELING HYPERTENSION Xianbin Li Department

More information

Gov 2002: 4. Observational Studies and Confounding

Gov 2002: 4. Observational Studies and Confounding Gov 2002: 4. Observational Studies and Confounding Matthew Blackwell September 10, 2015 Where are we? Where are we going? Last two weeks: randomized experiments. From here on: observational studies. What

More information

Review. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 770: Categorical Data Analysis

Review. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 770: Categorical Data Analysis Review Timothy Hanson Department of Statistics, University of South Carolina Stat 770: Categorical Data Analysis 1 / 22 Chapter 1: background Nominal, ordinal, interval data. Distributions: Poisson, binomial,

More information

Describing Stratified Multiple Responses for Sparse Data

Describing Stratified Multiple Responses for Sparse Data Describing Stratified Multiple Responses for Sparse Data Ivy Liu School of Mathematical and Computing Sciences Victoria University Wellington, New Zealand June 28, 2004 SUMMARY Surveys often contain qualitative

More information

Analytic Methods for Applied Epidemiology: Framework and Contingency Table Analysis

Analytic Methods for Applied Epidemiology: Framework and Contingency Table Analysis Analytic Methods for Applied Epidemiology: Framework and Contingency Table Analysis 2014 Maternal and Child Health Epidemiology Training Pre-Training Webinar: Friday, May 16 2-4pm Eastern Kristin Rankin,

More information

The effect of nonzero second-order interaction on combined estimators of the odds ratio

The effect of nonzero second-order interaction on combined estimators of the odds ratio Biometrika (1978), 65, 1, pp. 191-0 Printed in Great Britain The effect of nonzero second-order interaction on combined estimators of the odds ratio BY SONJA M. MCKINLAY Department of Mathematics, Boston

More information

Stat 5101 Lecture Notes

Stat 5101 Lecture Notes Stat 5101 Lecture Notes Charles J. Geyer Copyright 1998, 1999, 2000, 2001 by Charles J. Geyer May 7, 2001 ii Stat 5101 (Geyer) Course Notes Contents 1 Random Variables and Change of Variables 1 1.1 Random

More information

Mantel-Haenszel Test Statistics. for Correlated Binary Data. Department of Statistics, North Carolina State University. Raleigh, NC

Mantel-Haenszel Test Statistics. for Correlated Binary Data. Department of Statistics, North Carolina State University. Raleigh, NC Mantel-Haenszel Test Statistics for Correlated Binary Data by Jie Zhang and Dennis D. Boos Department of Statistics, North Carolina State University Raleigh, NC 27695-8203 tel: (919) 515-1918 fax: (919)

More information

Matching. Quiz 2. Matching. Quiz 2. Exact Matching. Estimand 2/25/14

Matching. Quiz 2. Matching. Quiz 2. Exact Matching. Estimand 2/25/14 STA 320 Design and Analysis of Causal Studies Dr. Kari Lock Morgan and Dr. Fan Li Department of Statistical Science Duke University Frequency 0 2 4 6 8 Quiz 2 Histogram of Quiz2 10 12 14 16 18 20 Quiz2

More information

Classification. Chapter Introduction. 6.2 The Bayes classifier

Classification. Chapter Introduction. 6.2 The Bayes classifier Chapter 6 Classification 6.1 Introduction Often encountered in applications is the situation where the response variable Y takes values in a finite set of labels. For example, the response Y could encode

More information

Geoffrey T. Wodtke. University of Toronto. Daniel Almirall. University of Michigan. Population Studies Center Research Report July 2015

Geoffrey T. Wodtke. University of Toronto. Daniel Almirall. University of Michigan. Population Studies Center Research Report July 2015 Estimating Heterogeneous Causal Effects with Time-Varying Treatments and Time-Varying Effect Moderators: Structural Nested Mean Models and Regression-with-Residuals Geoffrey T. Wodtke University of Toronto

More information

Data Integration for Big Data Analysis for finite population inference

Data Integration for Big Data Analysis for finite population inference for Big Data Analysis for finite population inference Jae-kwang Kim ISU January 23, 2018 1 / 36 What is big data? 2 / 36 Data do not speak for themselves Knowledge Reproducibility Information Intepretation

More information

G-ESTIMATION OF STRUCTURAL NESTED MODELS (CHAPTER 14) BIOS G-Estimation

G-ESTIMATION OF STRUCTURAL NESTED MODELS (CHAPTER 14) BIOS G-Estimation G-ESTIMATION OF STRUCTURAL NESTED MODELS (CHAPTER 14) BIOS 776 1 14 G-Estimation ( G-Estimation of Structural Nested Models 14) Outline 14.1 The causal question revisited 14.2 Exchangeability revisited

More information

Propensity Score Methods, Models and Adjustment

Propensity Score Methods, Models and Adjustment Propensity Score Methods, Models and Adjustment Dr David A. Stephens Department of Mathematics & Statistics McGill University Montreal, QC, Canada. d.stephens@math.mcgill.ca www.math.mcgill.ca/dstephens/siscr2016/

More information

Section 10: Inverse propensity score weighting (IPSW)

Section 10: Inverse propensity score weighting (IPSW) Section 10: Inverse propensity score weighting (IPSW) Fall 2014 1/23 Inverse Propensity Score Weighting (IPSW) Until now we discussed matching on the P-score, a different approach is to re-weight the observations

More information

STA102 Class Notes Chapter Logistic Regression

STA102 Class Notes Chapter Logistic Regression STA0 Class Notes Chapter 0 0. Logistic Regression We continue to study the relationship between a response variable and one or more eplanatory variables. For SLR and MLR (Chapters 8 and 9), our response

More information

Plausible Values for Latent Variables Using Mplus

Plausible Values for Latent Variables Using Mplus Plausible Values for Latent Variables Using Mplus Tihomir Asparouhov and Bengt Muthén August 21, 2010 1 1 Introduction Plausible values are imputed values for latent variables. All latent variables can

More information

Propensity Score Matching

Propensity Score Matching Methods James H. Steiger Department of Psychology and Human Development Vanderbilt University Regression Modeling, 2009 Methods 1 Introduction 2 3 4 Introduction Why Match? 5 Definition Methods and In

More information

For more information about how to cite these materials visit

For more information about how to cite these materials visit Author(s): Kerby Shedden, Ph.D., 2010 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution Share Alike 3.0 License: http://creativecommons.org/licenses/by-sa/3.0/

More information

University of California, Berkeley

University of California, Berkeley University of California, Berkeley U.C. Berkeley Division of Biostatistics Working Paper Series Year 2015 Paper 334 Targeted Estimation and Inference for the Sample Average Treatment Effect Laura B. Balzer

More information

Lab 8. Matched Case Control Studies

Lab 8. Matched Case Control Studies Lab 8 Matched Case Control Studies Control of Confounding Technique for the control of confounding: At the design stage: Matching During the analysis of the results: Post-stratification analysis Advantage

More information

Part IV Statistics in Epidemiology

Part IV Statistics in Epidemiology Part IV Statistics in Epidemiology There are many good statistical textbooks on the market, and we refer readers to some of these textbooks when they need statistical techniques to analyze data or to interpret

More information

Confounding and effect modification: Mantel-Haenszel estimation, testing effect homogeneity. Dankmar Böhning

Confounding and effect modification: Mantel-Haenszel estimation, testing effect homogeneity. Dankmar Böhning Confounding and effect modification: Mantel-Haenszel estimation, testing effect homogeneity Dankmar Böhning Southampton Statistical Sciences Research Institute University of Southampton, UK Advanced Statistical

More information

Stat 642, Lecture notes for 04/12/05 96

Stat 642, Lecture notes for 04/12/05 96 Stat 642, Lecture notes for 04/12/05 96 Hosmer-Lemeshow Statistic The Hosmer-Lemeshow Statistic is another measure of lack of fit. Hosmer and Lemeshow recommend partitioning the observations into 10 equal

More information

Investigating mediation when counterfactuals are not metaphysical: Does sunlight exposure mediate the effect of eye-glasses on cataracts?

Investigating mediation when counterfactuals are not metaphysical: Does sunlight exposure mediate the effect of eye-glasses on cataracts? Investigating mediation when counterfactuals are not metaphysical: Does sunlight exposure mediate the effect of eye-glasses on cataracts? Brian Egleston Fox Chase Cancer Center Collaborators: Daniel Scharfstein,

More information

Weighting Methods. Harvard University STAT186/GOV2002 CAUSAL INFERENCE. Fall Kosuke Imai

Weighting Methods. Harvard University STAT186/GOV2002 CAUSAL INFERENCE. Fall Kosuke Imai Weighting Methods Kosuke Imai Harvard University STAT186/GOV2002 CAUSAL INFERENCE Fall 2018 Kosuke Imai (Harvard) Weighting Methods Stat186/Gov2002 Fall 2018 1 / 13 Motivation Matching methods for improving

More information

University of California, Berkeley

University of California, Berkeley University of California, Berkeley U.C. Berkeley Division of Biostatistics Working Paper Series Year 2010 Paper 269 Diagnosing and Responding to Violations in the Positivity Assumption Maya L. Petersen

More information

Bootstrapping Sensitivity Analysis

Bootstrapping Sensitivity Analysis Bootstrapping Sensitivity Analysis Qingyuan Zhao Department of Statistics, The Wharton School University of Pennsylvania May 23, 2018 @ ACIC Based on: Qingyuan Zhao, Dylan S. Small, and Bhaswar B. Bhattacharya.

More information

Causal Inference with a Continuous Treatment and Outcome: Alternative Estimators for Parametric Dose-Response Functions

Causal Inference with a Continuous Treatment and Outcome: Alternative Estimators for Parametric Dose-Response Functions Causal Inference with a Continuous Treatment and Outcome: Alternative Estimators for Parametric Dose-Response Functions Joe Schafer Office of the Associate Director for Research and Methodology U.S. Census

More information

Propensity-Score Based Methods for Causal Inference in Observational Studies with Fixed Non-Binary Treatments

Propensity-Score Based Methods for Causal Inference in Observational Studies with Fixed Non-Binary Treatments Propensity-Score Based Methods for Causal Inference in Observational Studies with Fixed Non-Binary reatments Shandong Zhao David A. van Dyk Kosuke Imai July 3, 2018 Abstract Propensity score methods are

More information

Statistics in medicine

Statistics in medicine Statistics in medicine Lecture 4: and multivariable regression Fatma Shebl, MD, MS, MPH, PhD Assistant Professor Chronic Disease Epidemiology Department Yale School of Public Health Fatma.shebl@yale.edu

More information

For more information about how to cite these materials visit

For more information about how to cite these materials visit Author(s): Kerby Shedden, Ph.D., 2010 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution Share Alike 3.0 License: http://creativecommons.org/licenses/by-sa/3.0/

More information

Behavioral Data Mining. Lecture 19 Regression and Causal Effects

Behavioral Data Mining. Lecture 19 Regression and Causal Effects Behavioral Data Mining Lecture 19 Regression and Causal Effects Outline Counterfactuals and Potential Outcomes Regression Models Causal Effects from Matching and Regression Weighted regression Counterfactuals

More information

Matching to estimate the causal effects from multiple treatments

Matching to estimate the causal effects from multiple treatments Matching to estimate the causal effects from multiple treatments Michael J. Lopez, and Roee Gutman Department of Biostatistics, Box G-S121-7, 121 S. Main Street, Providence, RI 02912 e-mail: michael lopez@brown.edu;

More information

University of California, Berkeley

University of California, Berkeley University of California, Berkeley U.C. Berkeley Division of Biostatistics Working Paper Series Year 2010 Paper 260 Collaborative Targeted Maximum Likelihood For Time To Event Data Ori M. Stitelman Mark

More information

Primal-dual Covariate Balance and Minimal Double Robustness via Entropy Balancing

Primal-dual Covariate Balance and Minimal Double Robustness via Entropy Balancing Primal-dual Covariate Balance and Minimal Double Robustness via (Joint work with Daniel Percival) Department of Statistics, Stanford University JSM, August 9, 2015 Outline 1 2 3 1/18 Setting Rubin s causal

More information

Bayesian Linear Models

Bayesian Linear Models Bayesian Linear Models Sudipto Banerjee 1 and Andrew O. Finley 2 1 Department of Forestry & Department of Geography, Michigan State University, Lansing Michigan, U.S.A. 2 Biostatistics, School of Public

More information

Propensity Score for Causal Inference of Multiple and Multivalued Treatments

Propensity Score for Causal Inference of Multiple and Multivalued Treatments Virginia Commonwealth University VCU Scholars Compass Theses and Dissertations Graduate School 2016 Propensity Score for Causal Inference of Multiple and Multivalued Treatments Zirui Gu Virginia Commonwealth

More information

Marginal Structural Models and Causal Inference in Epidemiology

Marginal Structural Models and Causal Inference in Epidemiology Marginal Structural Models and Causal Inference in Epidemiology James M. Robins, 1,2 Miguel Ángel Hernán, 1 and Babette Brumback 2 In observational studies with exposures or treatments that vary over time,

More information

Methods for inferring short- and long-term effects of exposures on outcomes, using longitudinal data on both measures

Methods for inferring short- and long-term effects of exposures on outcomes, using longitudinal data on both measures Methods for inferring short- and long-term effects of exposures on outcomes, using longitudinal data on both measures Ruth Keogh, Stijn Vansteelandt, Rhian Daniel Department of Medical Statistics London

More information

Discussion of Papers on the Extensions of Propensity Score

Discussion of Papers on the Extensions of Propensity Score Discussion of Papers on the Extensions of Propensity Score Kosuke Imai Princeton University August 3, 2010 Kosuke Imai (Princeton) Generalized Propensity Score 2010 JSM (Vancouver) 1 / 11 The Theme and

More information

Causal Hazard Ratio Estimation By Instrumental Variables or Principal Stratification. Todd MacKenzie, PhD

Causal Hazard Ratio Estimation By Instrumental Variables or Principal Stratification. Todd MacKenzie, PhD Causal Hazard Ratio Estimation By Instrumental Variables or Principal Stratification Todd MacKenzie, PhD Collaborators A. James O Malley Tor Tosteson Therese Stukel 2 Overview 1. Instrumental variable

More information

Statistical Modelling with Stata: Binary Outcomes

Statistical Modelling with Stata: Binary Outcomes Statistical Modelling with Stata: Binary Outcomes Mark Lunt Arthritis Research UK Epidemiology Unit University of Manchester 21/11/2017 Cross-tabulation Exposed Unexposed Total Cases a b a + b Controls

More information

The linear model is the most fundamental of all serious statistical models encompassing:

The linear model is the most fundamental of all serious statistical models encompassing: Linear Regression Models: A Bayesian perspective Ingredients of a linear model include an n 1 response vector y = (y 1,..., y n ) T and an n p design matrix (e.g. including regressors) X = [x 1,..., x

More information

A class of latent marginal models for capture-recapture data with continuous covariates

A class of latent marginal models for capture-recapture data with continuous covariates A class of latent marginal models for capture-recapture data with continuous covariates F Bartolucci A Forcina Università di Urbino Università di Perugia FrancescoBartolucci@uniurbit forcina@statunipgit

More information

Improved estimation of controlled direct eects in the presence of unmeasured confounding of intermediate variables

Improved estimation of controlled direct eects in the presence of unmeasured confounding of intermediate variables STATISTICS IN MEDICINE Statist. Med. 2005; 24:1683 1702 Published online 1 March 2005 in Wiley InterScience (www.interscience.wiley.com). DOI: 10.1002/sim.2057 Improved estimation of controlled direct

More information

Covariate Balancing Propensity Score for General Treatment Regimes

Covariate Balancing Propensity Score for General Treatment Regimes Covariate Balancing Propensity Score for General Treatment Regimes Kosuke Imai Princeton University October 14, 2014 Talk at the Department of Psychiatry, Columbia University Joint work with Christian

More information

(Mis)use of matching techniques

(Mis)use of matching techniques University of Warsaw 5th Polish Stata Users Meeting, Warsaw, 27th November 2017 Research financed under National Science Center, Poland grant 2015/19/B/HS4/03231 Outline Introduction and motivation 1 Introduction

More information

Person-Time Data. Incidence. Cumulative Incidence: Example. Cumulative Incidence. Person-Time Data. Person-Time Data

Person-Time Data. Incidence. Cumulative Incidence: Example. Cumulative Incidence. Person-Time Data. Person-Time Data Person-Time Data CF Jeff Lin, MD., PhD. Incidence 1. Cumulative incidence (incidence proportion) 2. Incidence density (incidence rate) December 14, 2005 c Jeff Lin, MD., PhD. c Jeff Lin, MD., PhD. Person-Time

More information

Propensity-Score Based Methods for Causal Inference in Observational Studies with Fixed Non-Binary Treatments

Propensity-Score Based Methods for Causal Inference in Observational Studies with Fixed Non-Binary Treatments Propensity-Score Based Methods for Causal Inference in Observational Studies with Fixed Non-Binary reatments Shandong Zhao Department of Statistics, University of California, Irvine, CA 92697 shandonm@uci.edu

More information

Robust Tree-based Causal Inference for Complex Ad Effectiveness Analysis

Robust Tree-based Causal Inference for Complex Ad Effectiveness Analysis Robust Tree-based Causal Inference for Complex Ad Effectiveness Analysis Pengyuan Wang Wei Sun Dawei Yin Jian Yang Yi Chang Yahoo Labs, Sunnyvale, CA, USA Purdue University, West Lafayette, IN,USA {pengyuan,

More information

Estimating and contextualizing the attenuation of odds ratios due to non-collapsibility

Estimating and contextualizing the attenuation of odds ratios due to non-collapsibility Estimating and contextualizing the attenuation of odds ratios due to non-collapsibility Stephen Burgess Department of Public Health & Primary Care, University of Cambridge September 6, 014 Short title:

More information

STAT 5500/6500 Conditional Logistic Regression for Matched Pairs

STAT 5500/6500 Conditional Logistic Regression for Matched Pairs STAT 5500/6500 Conditional Logistic Regression for Matched Pairs Motivating Example: The data we will be using comes from a subset of data taken from the Los Angeles Study of the Endometrial Cancer Data

More information

Section 9c. Propensity scores. Controlling for bias & confounding in observational studies

Section 9c. Propensity scores. Controlling for bias & confounding in observational studies Section 9c Propensity scores Controlling for bias & confounding in observational studies 1 Logistic regression and propensity scores Consider comparing an outcome in two treatment groups: A vs B. In a

More information

Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies

Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies Kosuke Imai Department of Politics Princeton University November 13, 2013 So far, we have essentially assumed

More information

Marginal and Nested Structural Models Using Instrumental Variables

Marginal and Nested Structural Models Using Instrumental Variables Marginal and Nested Structural Models Using Instrumental Variables Zhiqiang TAN The objective of many scientific studies is to evaluate the effect of a treatment on an outcome of interest ceteris paribus.

More information

ANALYSIS OF ORDINAL SURVEY RESPONSES WITH DON T KNOW

ANALYSIS OF ORDINAL SURVEY RESPONSES WITH DON T KNOW SSC Annual Meeting, June 2015 Proceedings of the Survey Methods Section ANALYSIS OF ORDINAL SURVEY RESPONSES WITH DON T KNOW Xichen She and Changbao Wu 1 ABSTRACT Ordinal responses are frequently involved

More information

Lecture 5: Poisson and logistic regression

Lecture 5: Poisson and logistic regression Dankmar Böhning Southampton Statistical Sciences Research Institute University of Southampton, UK S 3 RI, 3-5 March 2014 introduction to Poisson regression application to the BELCAP study introduction

More information

Overlap Propensity Score Weighting to Balance Covariates

Overlap Propensity Score Weighting to Balance Covariates Overlap Propensity Score Weighting to Balance Covariates Kari Lock Morgan Department of Statistics Penn State University klm47@psu.edu JSM 2016 Chicago, IL Joint work with Fan Li (Duke) and Alan Zaslavsky

More information

ANCOVA. ANCOVA allows the inclusion of a 3rd source of variation into the F-formula (called the covariate) and changes the F-formula

ANCOVA. ANCOVA allows the inclusion of a 3rd source of variation into the F-formula (called the covariate) and changes the F-formula ANCOVA Workings of ANOVA & ANCOVA ANCOVA, Semi-Partial correlations, statistical control Using model plotting to think about ANCOVA & Statistical control You know how ANOVA works the total variation among

More information