Marginal, crude and conditional odds ratios
|
|
- Dustin Parker
- 5 years ago
- Views:
Transcription
1 Marginal, crude and conditional odds ratios Denitions and estimation Travis Loux Gradute student, UC Davis Department of Statistics March 31, 2010
2 Parameter Denitions When measuring the eect of a binary exposure(z ) on a binary response (Y ), there are a number of probabilities of interest with regard to the population: marginal: P(Y = 1) crude: P(Y = 1 Z = 1) conditional: P(Y = 1 Z = 1, X = x) These probabilities are not necessarily equal and have dierent interpretations which are each useful in dierent settings. We need to nd ways to estimate each value, being clear about which of these quantities our statistics estimate.
3 The Counterfactual Model Denitions We will use the following notation: Z - the binary exposure (or treatment) of interest X - covariates associated with exposure and response Y 1 - the potential binary reponse if the unit is exposed Y 0 - the potential binary reponse if the unit is not exposed Though each unit in the population may potentially have either Y 1 and Y 0, we are only able to observe one. We call the observed response Y and use the relation Y = Z Y 1 + (1 Z) Y 0
4 The Counterfactual Model A population A hypothetical population would then look like the following: Unit X 1 X 2 X 3 Z Y 1 Y where the observed Y for each unit is colored blue (This population will be used in a simulation example later...)
5 Marginal Odds Ratio The marginal odds ratio can be obtained by comparing the odds of response in the population if everyone is exposed Odds exp = P (Y 1 = 1) P (Y 1 = 0) to the odds of response if everyone in not exposed Looking at the odds ratio as Odds unexp = P (Y 0 = 1) P (Y 0 = 0) Oddsexp Odds unexp, we see the marginal odds ratio ψ marg = P (Y 1 = 1) P (Y 0 = 0) P (Y 1 = 0) P (Y 0 = 1)
6 Crude Odds Ratio The marginal odds ratio is often approximated by estimating the crude odds ratio, obtained by taking the cross product P (Y = 1 Z = 1) P (Y = 0 Z = 0) ψ crude = P (Y = 0 Z = 1) P (Y = 1 Z = 0) = P (Y 1 = 1 Z = 1) P (Y 0 = 0 Z = 0) P (Y 1 = 0 Z = 1) P (Y 0 = 1 Z = 0) However, when confounding is present P (Y 1 = 1 Z = 1) = P (Y 1 = 1) P (Y 0 = 1 Z = 0) = P (Y 0 = 1) suggesting that the crude and marginal odds ratio are dierent values
7 Conditional Odds Ratio The conditional odds ratio is dened as ψ cond (x) = P (Y = 1 Z = 1, X = x) P (Y = 0 Z = 0, X = x) P (Y = 0 Z = 1, X = x) P (Y = 1 Z = 0, X = x) With the assumption of strongly ignorable treatment assignment, i.e. (Y 1, Y 0 ) Z X, we can remove Z from the conditional probabilities: ψ cond (x) = P (Y 1 = 1 X = x) P (Y 0 = 0 X = x) P (Y 1 = 0 X = x) P (Y 0 = 1 X = x)
8 Conditional Odds Ratio Under the logistic model If the data follow a logistic model with { exp β 0 + } p β j=1 j x j + γz P (Y = 1 X = x, Z = z) = { 1 + exp β 0 + } p β j=1 j x j + γz logistic regression will lead to an unbiased and asymptotically ecient estimator of ψ cond (x) In fact, the conditional odds ratio is constant across X and ψ cond (x) = ψ cond = e γ In order for the estimate to be unbiased, X must contain all predictors of Y, not just the confounders (Gail, Wieand, Piantadosi [1984])
9 Contingency Tables Counterfactual values For potential response Y 1, we have the contingency table: So we see that where N is the population size Z = 1 Z = 0 Y 1 = 1 A (1) B (1) Y 1 = 0 C (1) D (1) P (Y 1 = 1) = A(1) + B (1) N P (Y 1 = 0) = C (1) + D (1) N
10 Contingency Tables Counterfactual values For potential response Y 0, we have the contingency table: So we see that Z = 1 Z = 0 Y 0 = 1 A (0) B (0) Y 0 = 0 C (0) D (0) P (Y 0 = 1) = A(0) + B (0) N P (Y 0 = 0) = C (0) + D (0) N
11 Contingency Tables Counterfactual values Then the marginal odds ratio is equal to ψ marg = P (Y 1 = 1) P (Y 0 = 0) P (Y 1 = 0) P (Y 0 = 1) ψ marg = (A (1) + B (1)) ( C (0) + D (0)) (C (1) + D (1)) ( A (0) + B (0)) but this involves the unobservable quantities B (1), D (1), A (0), and C (0).
12 Contingency Tables Observed values The crude odds ratio is computed by classifying the observed data into the contingency table then taking the cross products Z = 1 Z = 0 Y = 1 A (1) B (0) Y = 0 C (1) D (0) ψ crude = A(1) D (0) C (1) B (0) Compare this to the marginal odds ratio: (A (1) + B (1)) ( C (0) + D (0)) ψ marg = (C (1) + D (1)) ( A (0) + B (0))
13 Estimation Crude odds ratio Given a sample of n observations, the crude odds ratio is estimated by classifying the sample into the contingency table Z = 1 Z = 0 Y = 1 a b Y = 0 c d then taking the ratio of cross products ˆψ crude = a d c b This will lead to unbiased estimation of ψ crude, but is typically biased for ψ marg.
14 Estimation Conditional odds ratio If the logistic model is correct, then ˆψ cond = e ˆγ will be a consistent and asymptotically ecient estimate of ψ cond. Other (non-parametric) possibilities include subclassifying the observations on the covariate X then using a non-parametric estimator such as the Woolf estimate or the Mantel-Haenzsel estimate. We will focus on the Mantel-Hanzsel estimate.
15 Estimation Mantel-Haenszel: Denition The Mantel-Haenszel estimator is computed by stratifying/sublcassifying observations into subtables based on the covariate X : Z = 1 Z = 0 Y = 1 a k b k Y = 0 c k d k n k for stratum k, k {1,..., K}, then using the estimate ˆψ MH = K k=1 K k=1 a k d k n k b k c k n k = K w k ˆψk k=1 where w k = b k c k n k K b l c l l=1 n l and ˆψ k = a k d k b k c k
16 Estimation Mantel-Haenszel: Stratication on the covariates Assuming that the conditional odds ratio is constant across values of X, as in logistic regression, we have the following results Categorical covariates: If the strata correspond to levels of X, then from the formula ˆψ MH = K w k ˆψ k k=1 we see that ˆψ MH will estimate to the conditional odds ratio.
17 Estimation Mantel-Haenszel: Stratication on the covariates Continuous covariates: X will vary within each stratum, so ˆψ MH will be biased in estimating the conditional odds ratio: ˆψ MH ψ cond w k bias k k ˆψ MH will be consistent for ψ cond only if k w k bias k = 1 When matching on X, we assure that x is constant within each stratum, so bias k = 1. This means that k w k bias k = k w k = 1 and ˆψ MH is consistent for ψ cond
18 The Propensity Score Denition and consequences Proposed by Rosenbaum and Rubin (1983), the propensity score is dened as e(x) = P (Z = 1 X = x) It is known that X is distributed similarly in the subpopulations dened by Z when conditioning in the propensity score: X Z e(x) Also, if treatment assignment is strongly ignorable, i.e. (Y 1, Y 0 ) Z X, we also know that (Y 1, Y 0 ) Z e(x) Together, these mean that e(x ) is a univariate summary of X which will account for any confounding of X with Z, giving hope that an estimate for the marginal odds ratio is possible
19 Using the Propensity Score Stratication and Matching As is the case with stratication on covariates, stratication on the propensity score will lead to an estimator which is biased for the marginal odds ratio due to residual confounding within each stratum. In fact, simulations have shown that perfect 1-to-1 matching on the popensity score also leads to a quantity which is biased for the marginal odds ratio. While the magnitude of the bias varies, the bias tends to be in the direction of the conditional odds ratio.
20 Using the Propensity Score Weighting There are a number of estimators of P (Y 1 = 1) and P (Y 0 = 1) which weight observations by the inverse of the propensity score. We will discuss the simplest: ˆP (Y 1 = 1) = 1 n n Z i Y i e i i=1 and ˆP (Y0 = 1) = 1 n n i=1 (1 Z i ) Y i (1 e i ) Lunceford and Davidian (2004) show that these quanities are unbiased for P (Y 1 = 1) and P (Y 0 = 1), respectively. Since both estimators are unbiased for the respective parameters, substituting these values into the formula for ψ marg = P(Y 1=1)P(Y 0 =0) P(Y 1 =0)P(Y 0 =1) will lead to an asymptotically unbiased estimator.
21 Using the Propensity Score Weighting Numerical problem: ê(x) very close to 0 or 1 may lead to 1 n n i=1 Z i Y i ê i / (0, 1) or 1 n n i=1 (1 Z i ) Y i 1 ê i / (0, 1) In simulations, < ê(x) < showed little reason for concern. This problem will occur when the domain of X in the exposed and unexposed groups are not equal
22 Simulation Example Creating the population For the following simulation I created a population of 2 million units, starting with three mutually independent standard normal covariates X 1 N (0, 1) X 2 N (0, 1) X 3 N (0, 1) From these covariates, I simulated Z through the model logit [P (Z = 1 X)] = 1X X X 3 Finally, I simulated Y 1 and Y 0 for each unit using the model logit [P (Y 1 = 1 X)] = 2X 1 X 2 + X 3 + log(3) logit [P (Y 0 = 1 X)] = 2X 1 X 2 + X 3
23 Simulation Example Revisiting the population A few units in the simulated population: Unit X 1 X 2 X 3 Z Y 1 Y where the observed Y for each unit is colored blue
24 Simulation Example Parameters of interest From the population, I calculated the following parameters: ψ marg = P (Y 1 = 1) P (Y 0 = 0) P (Y 1 = 0) P (Y 0 = 1) = ψ crude = P (Y 1 = 1 Z = 1) P (Y 0 = 0 Z = 0) P (Y 1 = 0 Z = 1) P (Y 0 = 1 Z = 0) = ψ cond = P (Y 1 = 1 X = x) P (Y 0 = 0 X = x) P (Y 1 = 0 X = x) P (Y 0 = 1 X = x) = where the conditional odds ratio was calculated via logistic regression.
25 Simulation Example Mechanics 10,000 simulations Random sample of 2,100 observations from my population Removed counterfactual knowledge Only observed Y, not Y 1 or Y 0 Remember, Y = Z Y1 + (1 Z) Y0 Stratication was done by quantiles of the relevant variables
26 Simulation Example Estimation: Stratication on the covariates Crude Logistic Number of strata per variable Conditional: Crude: Marginal:
27 Simulation Example Estimation: Matching on the propensity score Crude Logistic Number of propensity score strata Conditional: Crude: Marginal:
28 Simulation Example Estimation: Weighting by the propensity score Crude PS weighted Logistic Conditional: Crude: Marginal:
Estimating the Marginal Odds Ratio in Observational Studies
Estimating the Marginal Odds Ratio in Observational Studies Travis Loux Christiana Drake Department of Statistics University of California, Davis June 20, 2011 Outline The Counterfactual Model Odds Ratios
More informationPropensity Score Methods for Causal Inference
John Pura BIOS790 October 2, 2015 Causal inference Philosophical problem, statistical solution Important in various disciplines (e.g. Koch s postulates, Bradford Hill criteria, Granger causality) Good
More informationMarginal versus conditional effects: does it make a difference? Mireille Schnitzer, PhD Université de Montréal
Marginal versus conditional effects: does it make a difference? Mireille Schnitzer, PhD Université de Montréal Overview In observational and experimental studies, the goal may be to estimate the effect
More informationLecture Discussion. Confounding, Non-Collapsibility, Precision, and Power Statistics Statistical Methods II. Presented February 27, 2018
, Non-, Precision, and Power Statistics 211 - Statistical Methods II Presented February 27, 2018 Dan Gillen Department of Statistics University of California, Irvine Discussion.1 Various definitions of
More informationAsymptotic equivalence of paired Hotelling test and conditional logistic regression
Asymptotic equivalence of paired Hotelling test and conditional logistic regression Félix Balazard 1,2 arxiv:1610.06774v1 [math.st] 21 Oct 2016 Abstract 1 Sorbonne Universités, UPMC Univ Paris 06, CNRS
More informationPropensity Score Analysis with Hierarchical Data
Propensity Score Analysis with Hierarchical Data Fan Li Alan Zaslavsky Mary Beth Landrum Department of Health Care Policy Harvard Medical School May 19, 2008 Introduction Population-based observational
More informationCausal Inference with General Treatment Regimes: Generalizing the Propensity Score
Causal Inference with General Treatment Regimes: Generalizing the Propensity Score David van Dyk Department of Statistics, University of California, Irvine vandyk@stat.harvard.edu Joint work with Kosuke
More informationSummary and discussion of The central role of the propensity score in observational studies for causal effects
Summary and discussion of The central role of the propensity score in observational studies for causal effects Statistics Journal Club, 36-825 Jessica Chemali and Michael Vespe 1 Summary 1.1 Background
More informationCausal Inference in Observational Studies with Non-Binary Treatments. David A. van Dyk
Causal Inference in Observational Studies with Non-Binary reatments Statistics Section, Imperial College London Joint work with Shandong Zhao and Kosuke Imai Cass Business School, October 2013 Outline
More informationStrategy of Bayesian Propensity. Score Estimation Approach. in Observational Study
Theoretical Mathematics & Applications, vol.2, no.3, 2012, 75-86 ISSN: 1792-9687 (print), 1792-9709 (online) Scienpress Ltd, 2012 Strategy of Bayesian Propensity Score Estimation Approach in Observational
More informationOUTCOME REGRESSION AND PROPENSITY SCORES (CHAPTER 15) BIOS Outcome regressions and propensity scores
OUTCOME REGRESSION AND PROPENSITY SCORES (CHAPTER 15) BIOS 776 1 15 Outcome regressions and propensity scores Outcome Regression and Propensity Scores ( 15) Outline 15.1 Outcome regression 15.2 Propensity
More informationJournal of Biostatistics and Epidemiology
Journal of Biostatistics and Epidemiology Methodology Marginal versus conditional causal effects Kazem Mohammad 1, Seyed Saeed Hashemi-Nazari 2, Nasrin Mansournia 3, Mohammad Ali Mansournia 1* 1 Department
More informationCDA Chapter 3 part II
CDA Chapter 3 part II Two-way tables with ordered classfications Let u 1 u 2... u I denote scores for the row variable X, and let ν 1 ν 2... ν J denote column Y scores. Consider the hypothesis H 0 : X
More informationIgnoring the matching variables in cohort studies - when is it valid, and why?
Ignoring the matching variables in cohort studies - when is it valid, and why? Arvid Sjölander Abstract In observational studies of the effect of an exposure on an outcome, the exposure-outcome association
More informationDouble Robustness. Bang and Robins (2005) Kang and Schafer (2007)
Double Robustness Bang and Robins (2005) Kang and Schafer (2007) Set-Up Assume throughout that treatment assignment is ignorable given covariates (similar to assumption that data are missing at random
More informationUniversity of California, Berkeley
University of California, Berkeley U.C. Berkeley Division of Biostatistics Working Paper Series Year 2011 Paper 288 Targeted Maximum Likelihood Estimation of Natural Direct Effect Wenjing Zheng Mark J.
More informationAssess Assumptions and Sensitivity Analysis. Fan Li March 26, 2014
Assess Assumptions and Sensitivity Analysis Fan Li March 26, 2014 Two Key Assumptions 1. Overlap: 0
More informationAn Introduction to Causal Analysis on Observational Data using Propensity Scores
An Introduction to Causal Analysis on Observational Data using Propensity Scores Margie Rosenberg*, PhD, FSA Brian Hartman**, PhD, ASA Shannon Lane* *University of Wisconsin Madison **University of Connecticut
More informationLogistic regression: Why we often can do what we think we can do. Maarten Buis 19 th UK Stata Users Group meeting, 10 Sept. 2015
Logistic regression: Why we often can do what we think we can do Maarten Buis 19 th UK Stata Users Group meeting, 10 Sept. 2015 1 Introduction Introduction - In 2010 Carina Mood published an overview article
More informationLecture 15 (Part 2): Logistic Regression & Common Odds Ratio, (With Simulations)
Lecture 15 (Part 2): Logistic Regression & Common Odds Ratio, (With Simulations) Dipankar Bandyopadhyay, Ph.D. BMTRY 711: Analysis of Categorical Data Spring 2011 Division of Biostatistics and Epidemiology
More informationPropensity Score Weighting with Multilevel Data
Propensity Score Weighting with Multilevel Data Fan Li Department of Statistical Science Duke University October 25, 2012 Joint work with Alan Zaslavsky and Mary Beth Landrum Introduction In comparative
More informationCombining multiple observational data sources to estimate causal eects
Department of Statistics, North Carolina State University Combining multiple observational data sources to estimate causal eects Shu Yang* syang24@ncsuedu Joint work with Peng Ding UC Berkeley May 23,
More informationFlexible Estimation of Treatment Effect Parameters
Flexible Estimation of Treatment Effect Parameters Thomas MaCurdy a and Xiaohong Chen b and Han Hong c Introduction Many empirical studies of program evaluations are complicated by the presence of both
More informationMore Statistics tutorial at Logistic Regression and the new:
Logistic Regression and the new: Residual Logistic Regression 1 Outline 1. Logistic Regression 2. Confounding Variables 3. Controlling for Confounding Variables 4. Residual Linear Regression 5. Residual
More informationCausal Inference Basics
Causal Inference Basics Sam Lendle October 09, 2013 Observed data, question, counterfactuals Observed data: n i.i.d copies of baseline covariates W, treatment A {0, 1}, and outcome Y. O i = (W i, A i,
More informationPropensity Score Methods for Estimating Causal Effects from Complex Survey Data
Propensity Score Methods for Estimating Causal Effects from Complex Survey Data Dissertation Presented in Partial Fulfillment of the Requirements for the Degree Doctor of Philosophy in the Graduate School
More informationStandardization methods have been used in epidemiology. Marginal Structural Models as a Tool for Standardization ORIGINAL ARTICLE
ORIGINAL ARTICLE Marginal Structural Models as a Tool for Standardization Tosiya Sato and Yutaka Matsuyama Abstract: In this article, we show the general relation between standardization methods and marginal
More informationGeneral Regression Model
Scott S. Emerson, M.D., Ph.D. Department of Biostatistics, University of Washington, Seattle, WA 98195, USA January 5, 2015 Abstract Regression analysis can be viewed as an extension of two sample statistical
More informationEstimating direct effects in cohort and case-control studies
Estimating direct effects in cohort and case-control studies, Ghent University Direct effects Introduction Motivation The problem of standard approaches Controlled direct effect models In many research
More informationMachine Learning Linear Classification. Prof. Matteo Matteucci
Machine Learning Linear Classification Prof. Matteo Matteucci Recall from the first lecture 2 X R p Regression Y R Continuous Output X R p Y {Ω 0, Ω 1,, Ω K } Classification Discrete Output X R p Y (X)
More informationBiostat 2065 Analysis of Incomplete Data
Biostat 2065 Analysis of Incomplete Data Gong Tang Dept of Biostatistics University of Pittsburgh September 13 & 15, 2005 1. Complete-case analysis (I) Complete-case analysis refers to analysis based on
More informationSIMPLE EXAMPLES OF ESTIMATING CAUSAL EFFECTS USING TARGETED MAXIMUM LIKELIHOOD ESTIMATION
Johns Hopkins University, Dept. of Biostatistics Working Papers 3-3-2011 SIMPLE EXAMPLES OF ESTIMATING CAUSAL EFFECTS USING TARGETED MAXIMUM LIKELIHOOD ESTIMATION Michael Rosenblum Johns Hopkins Bloomberg
More informationA tool to demystify regression modelling behaviour
A tool to demystify regression modelling behaviour Thomas Alexander Gerds 1 / 38 Appetizer Every child knows how regression analysis works. The essentials of regression modelling strategy, such as which
More informationAn Efficient Estimation Method for Longitudinal Surveys with Monotone Missing Data
An Efficient Estimation Method for Longitudinal Surveys with Monotone Missing Data Jae-Kwang Kim 1 Iowa State University June 28, 2012 1 Joint work with Dr. Ming Zhou (when he was a PhD student at ISU)
More informationarxiv: v1 [stat.me] 15 May 2011
Working Paper Propensity Score Analysis with Matching Weights Liang Li, Ph.D. arxiv:1105.2917v1 [stat.me] 15 May 2011 Associate Staff of Biostatistics Department of Quantitative Health Sciences, Cleveland
More informationLogistic regression: Miscellaneous topics
Logistic regression: Miscellaneous topics April 11 Introduction We have covered two approaches to inference for GLMs: the Wald approach and the likelihood ratio approach I claimed that the likelihood ratio
More informationSelection on Observables: Propensity Score Matching.
Selection on Observables: Propensity Score Matching. Department of Economics and Management Irene Brunetti ireneb@ec.unipi.it 24/10/2017 I. Brunetti Labour Economics in an European Perspective 24/10/2017
More informationSensitivity analysis and distributional assumptions
Sensitivity analysis and distributional assumptions Tyler J. VanderWeele Department of Health Studies, University of Chicago 5841 South Maryland Avenue, MC 2007, Chicago, IL 60637, USA vanderweele@uchicago.edu
More informationStratification and Weighting Via the Propensity Score in Estimation of Causal Treatment Effects: A Comparative Study
Stratification and Weighting Via the Propensity Score in Estimation of Causal Treatment Effects: A Comparative Study Jared K. Lunceford 1 and Marie Davidian 2 1 Merck Research Laboratories, RY34-A316,
More informationFractional Imputation in Survey Sampling: A Comparative Review
Fractional Imputation in Survey Sampling: A Comparative Review Shu Yang Jae-Kwang Kim Iowa State University Joint Statistical Meetings, August 2015 Outline Introduction Fractional imputation Features Numerical
More informationJOINTLY MODELING CONTINUOUS AND BINARY OUTCOMES FOR BOOLEAN OUTCOMES: AN APPLICATION TO MODELING HYPERTENSION
Johns Hopkins University, Dept. of Biostatistics Working Papers 2-19-2008 JOINTLY MODELING CONTINUOUS AND BINARY OUTCOMES FOR BOOLEAN OUTCOMES: AN APPLICATION TO MODELING HYPERTENSION Xianbin Li Department
More informationGov 2002: 4. Observational Studies and Confounding
Gov 2002: 4. Observational Studies and Confounding Matthew Blackwell September 10, 2015 Where are we? Where are we going? Last two weeks: randomized experiments. From here on: observational studies. What
More informationReview. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 770: Categorical Data Analysis
Review Timothy Hanson Department of Statistics, University of South Carolina Stat 770: Categorical Data Analysis 1 / 22 Chapter 1: background Nominal, ordinal, interval data. Distributions: Poisson, binomial,
More informationDescribing Stratified Multiple Responses for Sparse Data
Describing Stratified Multiple Responses for Sparse Data Ivy Liu School of Mathematical and Computing Sciences Victoria University Wellington, New Zealand June 28, 2004 SUMMARY Surveys often contain qualitative
More informationAnalytic Methods for Applied Epidemiology: Framework and Contingency Table Analysis
Analytic Methods for Applied Epidemiology: Framework and Contingency Table Analysis 2014 Maternal and Child Health Epidemiology Training Pre-Training Webinar: Friday, May 16 2-4pm Eastern Kristin Rankin,
More informationThe effect of nonzero second-order interaction on combined estimators of the odds ratio
Biometrika (1978), 65, 1, pp. 191-0 Printed in Great Britain The effect of nonzero second-order interaction on combined estimators of the odds ratio BY SONJA M. MCKINLAY Department of Mathematics, Boston
More informationStat 5101 Lecture Notes
Stat 5101 Lecture Notes Charles J. Geyer Copyright 1998, 1999, 2000, 2001 by Charles J. Geyer May 7, 2001 ii Stat 5101 (Geyer) Course Notes Contents 1 Random Variables and Change of Variables 1 1.1 Random
More informationMantel-Haenszel Test Statistics. for Correlated Binary Data. Department of Statistics, North Carolina State University. Raleigh, NC
Mantel-Haenszel Test Statistics for Correlated Binary Data by Jie Zhang and Dennis D. Boos Department of Statistics, North Carolina State University Raleigh, NC 27695-8203 tel: (919) 515-1918 fax: (919)
More informationMatching. Quiz 2. Matching. Quiz 2. Exact Matching. Estimand 2/25/14
STA 320 Design and Analysis of Causal Studies Dr. Kari Lock Morgan and Dr. Fan Li Department of Statistical Science Duke University Frequency 0 2 4 6 8 Quiz 2 Histogram of Quiz2 10 12 14 16 18 20 Quiz2
More informationClassification. Chapter Introduction. 6.2 The Bayes classifier
Chapter 6 Classification 6.1 Introduction Often encountered in applications is the situation where the response variable Y takes values in a finite set of labels. For example, the response Y could encode
More informationGeoffrey T. Wodtke. University of Toronto. Daniel Almirall. University of Michigan. Population Studies Center Research Report July 2015
Estimating Heterogeneous Causal Effects with Time-Varying Treatments and Time-Varying Effect Moderators: Structural Nested Mean Models and Regression-with-Residuals Geoffrey T. Wodtke University of Toronto
More informationData Integration for Big Data Analysis for finite population inference
for Big Data Analysis for finite population inference Jae-kwang Kim ISU January 23, 2018 1 / 36 What is big data? 2 / 36 Data do not speak for themselves Knowledge Reproducibility Information Intepretation
More informationG-ESTIMATION OF STRUCTURAL NESTED MODELS (CHAPTER 14) BIOS G-Estimation
G-ESTIMATION OF STRUCTURAL NESTED MODELS (CHAPTER 14) BIOS 776 1 14 G-Estimation ( G-Estimation of Structural Nested Models 14) Outline 14.1 The causal question revisited 14.2 Exchangeability revisited
More informationPropensity Score Methods, Models and Adjustment
Propensity Score Methods, Models and Adjustment Dr David A. Stephens Department of Mathematics & Statistics McGill University Montreal, QC, Canada. d.stephens@math.mcgill.ca www.math.mcgill.ca/dstephens/siscr2016/
More informationSection 10: Inverse propensity score weighting (IPSW)
Section 10: Inverse propensity score weighting (IPSW) Fall 2014 1/23 Inverse Propensity Score Weighting (IPSW) Until now we discussed matching on the P-score, a different approach is to re-weight the observations
More informationSTA102 Class Notes Chapter Logistic Regression
STA0 Class Notes Chapter 0 0. Logistic Regression We continue to study the relationship between a response variable and one or more eplanatory variables. For SLR and MLR (Chapters 8 and 9), our response
More informationPlausible Values for Latent Variables Using Mplus
Plausible Values for Latent Variables Using Mplus Tihomir Asparouhov and Bengt Muthén August 21, 2010 1 1 Introduction Plausible values are imputed values for latent variables. All latent variables can
More informationPropensity Score Matching
Methods James H. Steiger Department of Psychology and Human Development Vanderbilt University Regression Modeling, 2009 Methods 1 Introduction 2 3 4 Introduction Why Match? 5 Definition Methods and In
More informationFor more information about how to cite these materials visit
Author(s): Kerby Shedden, Ph.D., 2010 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution Share Alike 3.0 License: http://creativecommons.org/licenses/by-sa/3.0/
More informationUniversity of California, Berkeley
University of California, Berkeley U.C. Berkeley Division of Biostatistics Working Paper Series Year 2015 Paper 334 Targeted Estimation and Inference for the Sample Average Treatment Effect Laura B. Balzer
More informationLab 8. Matched Case Control Studies
Lab 8 Matched Case Control Studies Control of Confounding Technique for the control of confounding: At the design stage: Matching During the analysis of the results: Post-stratification analysis Advantage
More informationPart IV Statistics in Epidemiology
Part IV Statistics in Epidemiology There are many good statistical textbooks on the market, and we refer readers to some of these textbooks when they need statistical techniques to analyze data or to interpret
More informationConfounding and effect modification: Mantel-Haenszel estimation, testing effect homogeneity. Dankmar Böhning
Confounding and effect modification: Mantel-Haenszel estimation, testing effect homogeneity Dankmar Böhning Southampton Statistical Sciences Research Institute University of Southampton, UK Advanced Statistical
More informationStat 642, Lecture notes for 04/12/05 96
Stat 642, Lecture notes for 04/12/05 96 Hosmer-Lemeshow Statistic The Hosmer-Lemeshow Statistic is another measure of lack of fit. Hosmer and Lemeshow recommend partitioning the observations into 10 equal
More informationInvestigating mediation when counterfactuals are not metaphysical: Does sunlight exposure mediate the effect of eye-glasses on cataracts?
Investigating mediation when counterfactuals are not metaphysical: Does sunlight exposure mediate the effect of eye-glasses on cataracts? Brian Egleston Fox Chase Cancer Center Collaborators: Daniel Scharfstein,
More informationWeighting Methods. Harvard University STAT186/GOV2002 CAUSAL INFERENCE. Fall Kosuke Imai
Weighting Methods Kosuke Imai Harvard University STAT186/GOV2002 CAUSAL INFERENCE Fall 2018 Kosuke Imai (Harvard) Weighting Methods Stat186/Gov2002 Fall 2018 1 / 13 Motivation Matching methods for improving
More informationUniversity of California, Berkeley
University of California, Berkeley U.C. Berkeley Division of Biostatistics Working Paper Series Year 2010 Paper 269 Diagnosing and Responding to Violations in the Positivity Assumption Maya L. Petersen
More informationBootstrapping Sensitivity Analysis
Bootstrapping Sensitivity Analysis Qingyuan Zhao Department of Statistics, The Wharton School University of Pennsylvania May 23, 2018 @ ACIC Based on: Qingyuan Zhao, Dylan S. Small, and Bhaswar B. Bhattacharya.
More informationCausal Inference with a Continuous Treatment and Outcome: Alternative Estimators for Parametric Dose-Response Functions
Causal Inference with a Continuous Treatment and Outcome: Alternative Estimators for Parametric Dose-Response Functions Joe Schafer Office of the Associate Director for Research and Methodology U.S. Census
More informationPropensity-Score Based Methods for Causal Inference in Observational Studies with Fixed Non-Binary Treatments
Propensity-Score Based Methods for Causal Inference in Observational Studies with Fixed Non-Binary reatments Shandong Zhao David A. van Dyk Kosuke Imai July 3, 2018 Abstract Propensity score methods are
More informationStatistics in medicine
Statistics in medicine Lecture 4: and multivariable regression Fatma Shebl, MD, MS, MPH, PhD Assistant Professor Chronic Disease Epidemiology Department Yale School of Public Health Fatma.shebl@yale.edu
More informationFor more information about how to cite these materials visit
Author(s): Kerby Shedden, Ph.D., 2010 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution Share Alike 3.0 License: http://creativecommons.org/licenses/by-sa/3.0/
More informationBehavioral Data Mining. Lecture 19 Regression and Causal Effects
Behavioral Data Mining Lecture 19 Regression and Causal Effects Outline Counterfactuals and Potential Outcomes Regression Models Causal Effects from Matching and Regression Weighted regression Counterfactuals
More informationMatching to estimate the causal effects from multiple treatments
Matching to estimate the causal effects from multiple treatments Michael J. Lopez, and Roee Gutman Department of Biostatistics, Box G-S121-7, 121 S. Main Street, Providence, RI 02912 e-mail: michael lopez@brown.edu;
More informationUniversity of California, Berkeley
University of California, Berkeley U.C. Berkeley Division of Biostatistics Working Paper Series Year 2010 Paper 260 Collaborative Targeted Maximum Likelihood For Time To Event Data Ori M. Stitelman Mark
More informationPrimal-dual Covariate Balance and Minimal Double Robustness via Entropy Balancing
Primal-dual Covariate Balance and Minimal Double Robustness via (Joint work with Daniel Percival) Department of Statistics, Stanford University JSM, August 9, 2015 Outline 1 2 3 1/18 Setting Rubin s causal
More informationBayesian Linear Models
Bayesian Linear Models Sudipto Banerjee 1 and Andrew O. Finley 2 1 Department of Forestry & Department of Geography, Michigan State University, Lansing Michigan, U.S.A. 2 Biostatistics, School of Public
More informationPropensity Score for Causal Inference of Multiple and Multivalued Treatments
Virginia Commonwealth University VCU Scholars Compass Theses and Dissertations Graduate School 2016 Propensity Score for Causal Inference of Multiple and Multivalued Treatments Zirui Gu Virginia Commonwealth
More informationMarginal Structural Models and Causal Inference in Epidemiology
Marginal Structural Models and Causal Inference in Epidemiology James M. Robins, 1,2 Miguel Ángel Hernán, 1 and Babette Brumback 2 In observational studies with exposures or treatments that vary over time,
More informationMethods for inferring short- and long-term effects of exposures on outcomes, using longitudinal data on both measures
Methods for inferring short- and long-term effects of exposures on outcomes, using longitudinal data on both measures Ruth Keogh, Stijn Vansteelandt, Rhian Daniel Department of Medical Statistics London
More informationDiscussion of Papers on the Extensions of Propensity Score
Discussion of Papers on the Extensions of Propensity Score Kosuke Imai Princeton University August 3, 2010 Kosuke Imai (Princeton) Generalized Propensity Score 2010 JSM (Vancouver) 1 / 11 The Theme and
More informationCausal Hazard Ratio Estimation By Instrumental Variables or Principal Stratification. Todd MacKenzie, PhD
Causal Hazard Ratio Estimation By Instrumental Variables or Principal Stratification Todd MacKenzie, PhD Collaborators A. James O Malley Tor Tosteson Therese Stukel 2 Overview 1. Instrumental variable
More informationStatistical Modelling with Stata: Binary Outcomes
Statistical Modelling with Stata: Binary Outcomes Mark Lunt Arthritis Research UK Epidemiology Unit University of Manchester 21/11/2017 Cross-tabulation Exposed Unexposed Total Cases a b a + b Controls
More informationThe linear model is the most fundamental of all serious statistical models encompassing:
Linear Regression Models: A Bayesian perspective Ingredients of a linear model include an n 1 response vector y = (y 1,..., y n ) T and an n p design matrix (e.g. including regressors) X = [x 1,..., x
More informationA class of latent marginal models for capture-recapture data with continuous covariates
A class of latent marginal models for capture-recapture data with continuous covariates F Bartolucci A Forcina Università di Urbino Università di Perugia FrancescoBartolucci@uniurbit forcina@statunipgit
More informationImproved estimation of controlled direct eects in the presence of unmeasured confounding of intermediate variables
STATISTICS IN MEDICINE Statist. Med. 2005; 24:1683 1702 Published online 1 March 2005 in Wiley InterScience (www.interscience.wiley.com). DOI: 10.1002/sim.2057 Improved estimation of controlled direct
More informationCovariate Balancing Propensity Score for General Treatment Regimes
Covariate Balancing Propensity Score for General Treatment Regimes Kosuke Imai Princeton University October 14, 2014 Talk at the Department of Psychiatry, Columbia University Joint work with Christian
More information(Mis)use of matching techniques
University of Warsaw 5th Polish Stata Users Meeting, Warsaw, 27th November 2017 Research financed under National Science Center, Poland grant 2015/19/B/HS4/03231 Outline Introduction and motivation 1 Introduction
More informationPerson-Time Data. Incidence. Cumulative Incidence: Example. Cumulative Incidence. Person-Time Data. Person-Time Data
Person-Time Data CF Jeff Lin, MD., PhD. Incidence 1. Cumulative incidence (incidence proportion) 2. Incidence density (incidence rate) December 14, 2005 c Jeff Lin, MD., PhD. c Jeff Lin, MD., PhD. Person-Time
More informationPropensity-Score Based Methods for Causal Inference in Observational Studies with Fixed Non-Binary Treatments
Propensity-Score Based Methods for Causal Inference in Observational Studies with Fixed Non-Binary reatments Shandong Zhao Department of Statistics, University of California, Irvine, CA 92697 shandonm@uci.edu
More informationRobust Tree-based Causal Inference for Complex Ad Effectiveness Analysis
Robust Tree-based Causal Inference for Complex Ad Effectiveness Analysis Pengyuan Wang Wei Sun Dawei Yin Jian Yang Yi Chang Yahoo Labs, Sunnyvale, CA, USA Purdue University, West Lafayette, IN,USA {pengyuan,
More informationEstimating and contextualizing the attenuation of odds ratios due to non-collapsibility
Estimating and contextualizing the attenuation of odds ratios due to non-collapsibility Stephen Burgess Department of Public Health & Primary Care, University of Cambridge September 6, 014 Short title:
More informationSTAT 5500/6500 Conditional Logistic Regression for Matched Pairs
STAT 5500/6500 Conditional Logistic Regression for Matched Pairs Motivating Example: The data we will be using comes from a subset of data taken from the Los Angeles Study of the Endometrial Cancer Data
More informationSection 9c. Propensity scores. Controlling for bias & confounding in observational studies
Section 9c Propensity scores Controlling for bias & confounding in observational studies 1 Logistic regression and propensity scores Consider comparing an outcome in two treatment groups: A vs B. In a
More informationCausal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies
Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies Kosuke Imai Department of Politics Princeton University November 13, 2013 So far, we have essentially assumed
More informationMarginal and Nested Structural Models Using Instrumental Variables
Marginal and Nested Structural Models Using Instrumental Variables Zhiqiang TAN The objective of many scientific studies is to evaluate the effect of a treatment on an outcome of interest ceteris paribus.
More informationANALYSIS OF ORDINAL SURVEY RESPONSES WITH DON T KNOW
SSC Annual Meeting, June 2015 Proceedings of the Survey Methods Section ANALYSIS OF ORDINAL SURVEY RESPONSES WITH DON T KNOW Xichen She and Changbao Wu 1 ABSTRACT Ordinal responses are frequently involved
More informationLecture 5: Poisson and logistic regression
Dankmar Böhning Southampton Statistical Sciences Research Institute University of Southampton, UK S 3 RI, 3-5 March 2014 introduction to Poisson regression application to the BELCAP study introduction
More informationOverlap Propensity Score Weighting to Balance Covariates
Overlap Propensity Score Weighting to Balance Covariates Kari Lock Morgan Department of Statistics Penn State University klm47@psu.edu JSM 2016 Chicago, IL Joint work with Fan Li (Duke) and Alan Zaslavsky
More informationANCOVA. ANCOVA allows the inclusion of a 3rd source of variation into the F-formula (called the covariate) and changes the F-formula
ANCOVA Workings of ANOVA & ANCOVA ANCOVA, Semi-Partial correlations, statistical control Using model plotting to think about ANCOVA & Statistical control You know how ANOVA works the total variation among
More information