G-ESTIMATION OF STRUCTURAL NESTED MODELS (CHAPTER 14) BIOS G-Estimation
|
|
- Hannah Lang
- 6 years ago
- Views:
Transcription
1 G-ESTIMATION OF STRUCTURAL NESTED MODELS (CHAPTER 14) BIOS G-Estimation
2 ( G-Estimation of Structural Nested Models 14) Outline 14.1 The causal question revisited 14.2 Exchangeability revisited 14.3 Structural nested mean models 14.4 Rank preservation 14.5 G-estimation 14.6 Structural nested models with two or more parameters BIOS G-Estimation
3 14.2 Exchangeability revisited Recall conditional exchangeability defined to be For binary Y this is equivalent to Y a A L for a = 0,1 Pr[A = 1 Y a,l] = Pr[A = 1L] Consider the following parametric logistic regression model logit{pr[a = 1 Y a=0,l]} = α 0 + α 1 Y a=0 + α 2 L Fitting such a model to a real data set b/c Y a=0 not observed for all individuals. Thought experiment: Suppose Y a=0 observed for all individuals so that we can fit this model. If conditional exchangeability holds and the model is correctly specified, what would you expect ˆα 1 to equal? BIOS G-Estimation
4 Consider the model 14.3 Structural nested mean models E[Y a Y a=0 A = a,l] = β 1 a + β 2 al such that β 1 + β 2 l equals the average causal effect (RD) within stratum L = l Below we discuss using g-estimation to draw inference about β 1 and β 2 Note this model is semi-parametric in the sense that we are not specifying a model for E[Y a=0 L], i.e., there is no intercept β 0 or term β 3 L in the model. This is in contrast to the parametric g-formula from 13. Thus we expect g-estimation to be more robust to model mis-specification than the parametric g-formula. However, g-estimation can only be used to adjust for confounding, but not selection bias (eg, due to censoring) BIOS G-Estimation
5 14.4 Rank Preservation Suppose, contrary to fact, for the NHEFS data we knew Y a=1 and Y a=0, i.e., their potential weight gain if they quit smoking and if they did not quit smoking Imagine we sorted individuals according to Y a=1 from largest value to smallest value Imagine we sorted individuals according to Y a=0 from largest value to smallest value Suppose in either case individuals end up in the same order: rank preservation BIOS G-Estimation
6 14.4 Rank Preservation When the effect of treatment A on the outcome Y is exactly the same, on the additive scale, for all individuals in the study population, we say that additive rank preservation holds For example, if smoking cessation increases everybodys body weight by exactly 3 kg, then the ranking of individuals according to Y a=0 would be equal to the ranking according to Y a=1 A particular case of additive rank preservation occurs when the sharp null hypothesis is true ( 1), i.e., treatment has no effect on the outcomes of any individual For the purposes of structural nested mean models, we will care about additive rank preservation within levels of L. This conditional additive rank preservation holds if the effect of treatment A on the outcome Y is exactly the same for all individuals with the same values of L BIOS G-Estimation
7 14.4 Rank Preservation An example of an (additive conditional) rank-preserving structural model is Yi a Yi a=0 = ψ 1 a + ψ 2 al i for all subjects i where ψ 1 +ψ 2 l is the constant causal effect for all individuals with covariate values L = l For every individual i with L i = l Yi a=1 = Yi a=0 + ψ 1 + ψ 2 l Potential outcome under no treatment Yi a=0 is shifted by ψ 1 + ψ 2 l to obtain potential outcome under treatment Yi a=1 BIOS G-Estimation
8 Rank Preservation everybody s body weight by exactly 3 kg, then the rankin cording to =0 would be equal to the ranking accordi that in the latter list all individuals will be 3 kg heavier. additive rank preservation occurs when the sharp null hy Figs 14.1 and 14.2 show examples of additive rank preservation within two strata L = l and L = l Figure 14.1 Chapter 1), i.e., if treatment has no effect on the outcomes the study population. For the purposes of structural nest will care about additive rank preservation shifts from within =0 levels to of = additive rank preservation holds ifstratum. the effect Figure of treatment 14.2 s is exactly the same for all individuals stratum with the= same 0. The value d An example of an (additive conditional) from than rank-preservi in stratum is to the left of the mean =0 = 1 individuals + 2 for in all stratum subjec cessation than individu where is the constant causal effect for all individ values =. That is, for every individual for all individ with = is equal to =0 For most treatments A subject s counterfactual treatment =0 pected to be constant is shifted by to obtain the value o with the same covariate outcome under treatment. tion is scientifically impl Figure Figure shows an example of additive rank pres cessation affects equally stratum =. The bell-shaped curves represent the distr terfactual outcomes =0 ues of. Some (left curve) and =1 people a (right cu effects of smoking cessa in the upper part of the figure represent the values of the. The individual caus outcomes for subject, and the two dots in the lower par after quitting smoking ues of the two counterfactual outcomes for subject. The gain little, and others m the situation depicted i varies across individual not preserved since the when =0but not w Because of the impla use methods for causal we consider in this book structural mean models Figure 14.3 not for individual causa For most treatments and outcomes, the individual causal effect is not expected to be constant across individuals with the same covariate values, and thus (additive conditional) rank preservation is scientifically implausible BIOS G-Estimation tion. The estimated ave
9 14.4 Rank Preservation Eg, we do not expect that smoking Figure 14.2cessation affects equally the body weight of all individuals with the same values of L Reality is probably closer to Fig 14.3 Figure 14.3 A structural nested mean model is well definedintheabsenceofrank preservation. For example, one could propose a structural nested mean model for the setting depicted in Figure 14.3 to estimate the average causal effect within strata of. Such average causal effect will generally differ from the individuallevelcausaleffects. cessation than individuals in stratum = for all individuals with = 0, For most treatments and outcomes, t pected to be constant not even approx with the same covariate values, and thus tion is scientifically implausible. In our ex cessation affects equally the body weight ues of. Some people are genetically o effects of smoking cessation than others,. The individual causal effect of smoki after quitting smoking some individuals gain little, and others may even lose som the situation depicted in Figure 14.3, in varies across individuals with the same not preserved since the outcome for indiv when =0but not when =1. Because of the implausibility of rank p use methods for causal inference that rel we consider in this book require rank pre structural mean models from Chapter 12 not for individual causal effects, and thu Here not only are the shifts from Y a=0 to Y a=1 different between individuals, but also the ranks are not preserved tion. The estimated average causal effect was 3 5 kg (95% CI: 2 5, 4 5). This ave rank preservation of individual causal eff nested mean model in the previous sectio preservation. The additive rank-preserving model in assumption than non-rank-preserving m stant treatment effect for all individuals w reason why we would want to use such a in practice. And yet we use it in the ne because g-estimation is easier to underst because the g-estimation procedure is ac B/c of implausibility of rank preservation, causal methods that rely on it not recommended. Used in 14.5 to introduce g-est b/c g-est is easier to understand for rank-preserving models, and b/c g-est procedure is actually the same for rank-preserving and non-rankpreserving models. and non-rank-preserving models. Note t preserving structural model is a structura BIOS G-Estimation
10 14.5 G-Estimation Suppose the goal is estimating the parameters of the structural nested mean model E[Y a Y a=0 A = a,l] = β 1 a For simplicity only considering model with one parameter, effectively assuming average causal effect constant across strata of L Assume additive rank-preserving model Yi a Yi a=0 such that ψ 1 = β 1. Equivalently = ψ 1 a or by causal consistency Y a=0 i Y a=0 i = Y a i ψ 1 a = Y ψ 1 a BIOS G-Estimation
11 14.5 G-Estimation If model correct and we knew ψ 1, then could calculate Yi a=0 individuals for all Don t know ψ 1. Moreover, drawing inference of ψ 1 is our goal. Thought experiment: Your friend (an oracle) knows the value of ψ 1. She tells you it equals one of the following three values: ψ = 20, ψ = 0 or ψ = 10. She then challenges you to determine the true value based on the oberved data. You accept the challenge. For each individual compute H(ψ ) = Y ψ A for each of the three possible values of ψ The three newly created random variables H( 20), H(0) and H(10) are candidate potential outcomes. Only one of the three is the correct potential outcome Y a=0. How do you choose which one? BIOS G-Estimation
12 14.5 G-Estimation Remember from 14.2 that the assumption of conditional exchangeability can be expressed as a logistic model for treatment given the counterfactual outcome and the covariates L. When conditional exchangeability holds, the coefficient for the counterfactual outcome should be zero. This suggests we fit three separate logistic regression models logitpr[a = 1 H(ψ ),L] = α 0 + α 1 H(ψ ) + α 2 L The candidate H(ψ ) with α 1 = 0 is the counterfactual Y a=0 and the corresponding ψ equals the true ψ 1 Eg, suppose for H(ψ = 10) that ˆα 1 = 0. Then ˆψ 1 = 10. This is g-estimation. BIOS G-Estimation
13 14.5 G-Estimation Important note: G-est does not test whether conditional exchangeability holds; it assumes it holds in order to draw inference about the causal effect of interest In reality we do not have an oracle friend supplying a short list of possible values of ψ 1 Therefore need to search over all possible values of ψ 1 until we find one where the corresponding ˆα 1 = 0 Operationally this is done by a search over a fine grid (eg, -20 to 20 by 0.01) NHEFS example: consider 31 possible candidates H(2.0), H(2.1), H(2.2),..., H(4.9), H(5.0). Fit 31 separate logistic regression models of the probability of smoking cessation A = 1 just as in 12, but include H(ψ ) as an additional covariate BIOS G-Estimation
14 14.5 G-Estimation Coefficient estimate ˆα 1 for H(ψ ) was closest to zero for H(3.4) and H(3.5) Finer search reveals ˆα 1 essentially zero for ψ = Thus g-est of average causal effect of smoking cessation on weight gain is 3.4 kg Wald test of H 0 : α 1 = 0 at ψ = yields p-value p 1 To find a 95% confidence interval for ψ, find subset of ψ where p > 0.05 (this is the standard approach of constructing a CI by inverting a hypothesis test) For NHEFS data 31 logistic models, this yields 95% CI [2.5, 4.5] (essentially the same as IP weighting and parametric G-formula) BIOS G-Estimation
15 14.5 G-Estimation: Comments Other tests of H 0 : α 1 = 0 aside from Wald test, such as the score test or likelihood ratio test, could be used instead If we assume Y a {A,C} L no need to adjust for censoring O/w, if we make the weaker assumption Y C {A,L}, need to construct inverse probability of censoring weights W C = 1/Pr[C = 0 A = a,l] as in 12 With IP censoring weights and standard software, can (conservatively) use robust variance estimate to construct Wald tests of H 0 : α 1 = 0; expect 95% CIs to be wider than if non-conservative variance estimate or bootstrap used instead BIOS G-Estimation
16 14.5 G-Estimation: Comments Back to non-rank-preserving models g-est algo (ie the computer code implementing the procedure) for estimating ψ 1 produces consistent estimate of parameter β 1 of mean model, assuming mean model is correctly specified (ie, if average treatment effect is equal in all levels of L) This is true regardless of whether the individual treatment effect is constant Ie, it is not necessary that H(β 1 ) = Y a=0 for all subjects. Rather, it is sufficient for H(β 1 ) and Y a=0 to have the same conditional mean given L BIOS G-Estimation
17 14.6 SNM with 2 or more parameters One parameter structural nested model E[Y a Y a=0 L] = β 1 a assumes same average treatment effect If this model is mis-specified, i.e., there is effect modification by some components V of L, inferences will be wrong We expect effect modification to be the case in general [Note, in contrast, that effect modification does not invalidate MSM methods described in 12] Relax this assumption by considering instead two-parameter SNM E[Y a Y a=0 L] = β 1 a + β 2 av and, for g-estimation, the corresponding rank preserving model Y a i Y a=0 i = ψ 1 a + ψ 2 av BIOS G-Estimation
18 14.6 SNM with 2 or more parameters To estimate ψ 1 and ψ 2, fit logistic model logitpr[a = 1 H(ψ ),L] = α 0 + α 1 H(ψ ) + α 2 H(ψ )V + α 3 L Find combination of ψ 1 and ψ 2 that result in H(ψ ) A L Ie, search for combination of (ψ 1,ψ 2 ) that yields ˆα 1 = ˆα 2 = 0 In general, solution does not have a closed form and therefore numerical search algorithms (eg Nelder-Mead Simplex) must be used For linear mean model, like the ones discussed thus far, estimator does have a closed from (Tech Pt 14.2) BIOS G-Estimation
19 Tech Pt 14.2 Consider one parameter SNM E[Y a Y a=0 L] = β 1 a Suppose g-est based on score test of H 0 : α 1 = 0 Then equivalent (HW) to finding parameter value ψ that solves EE I[C i = 0]Ŵi C H i (ψ )(A i E[A i L i ]) = 0 i Using the fact H i (ψ ) = Y i ψ A i, closed form solution ˆψ 1 = i I[C i = 0]Wi C Y i (A i E[A i L i ]) i I[C i = 0]Wi C A i (A i E[A i L i ]) What if we fit a two parameter SNM? (HW) BIOS G-Estimation
G-ESTIMATION OF STRUCTURAL NESTED MODELS (CHAPTER 14) BIOS G-Estimation
G-ESTIMATION OF STRUCTURAL NESTED MODELS (CHAPTER 14) BIOS 776 1 14 G-Estimation G-Estimation of Structural Nested Models ( 14) Outline 14.1 The causal question revisited 14.2 Exchangeability revisited
More informationOUTCOME REGRESSION AND PROPENSITY SCORES (CHAPTER 15) BIOS Outcome regressions and propensity scores
OUTCOME REGRESSION AND PROPENSITY SCORES (CHAPTER 15) BIOS 776 1 15 Outcome regressions and propensity scores Outcome Regression and Propensity Scores ( 15) Outline 15.1 Outcome regression 15.2 Propensity
More informationIP WEIGHTING AND MARGINAL STRUCTURAL MODELS (CHAPTER 12) BIOS IPW and MSM
IP WEIGHTING AND MARGINAL STRUCTURAL MODELS (CHAPTER 12) BIOS 776 1 12 IPW and MSM IP weighting and marginal structural models ( 12) Outline 12.1 The causal question 12.2 Estimating IP weights via modeling
More informationCombining multiple observational data sources to estimate causal eects
Department of Statistics, North Carolina State University Combining multiple observational data sources to estimate causal eects Shu Yang* syang24@ncsuedu Joint work with Peng Ding UC Berkeley May 23,
More informationTargeted Maximum Likelihood Estimation in Safety Analysis
Targeted Maximum Likelihood Estimation in Safety Analysis Sam Lendle 1 Bruce Fireman 2 Mark van der Laan 1 1 UC Berkeley 2 Kaiser Permanente ISPE Advanced Topics Session, Barcelona, August 2012 1 / 35
More informationIntroduction to Statistical Analysis
Introduction to Statistical Analysis Changyu Shen Richard A. and Susan F. Smith Center for Outcomes Research in Cardiology Beth Israel Deaconess Medical Center Harvard Medical School Objectives Descriptive
More informationSemiparametric Regression
Semiparametric Regression Patrick Breheny October 22 Patrick Breheny Survival Data Analysis (BIOS 7210) 1/23 Introduction Over the past few weeks, we ve introduced a variety of regression models under
More informationPSC 504: Dynamic Causal Inference
PSC 504: Dynamic Causal Inference Matthew Blackwell 4/8/203 e problem Let s go back to a problem that we faced earlier, which is how to estimate causal effects with treatments that vary over time. We could
More informationMarginal versus conditional effects: does it make a difference? Mireille Schnitzer, PhD Université de Montréal
Marginal versus conditional effects: does it make a difference? Mireille Schnitzer, PhD Université de Montréal Overview In observational and experimental studies, the goal may be to estimate the effect
More informationRank preserving Structural Nested Distribution Model (RPSNDM) for Continuous
Rank preserving Structural Nested Distribution Model (RPSNDM) for Continuous Y : X M Y a=0 = Y a a m = Y a cum (a) : Y a = Y a=0 + cum (a) an unknown parameter. = 0, Y a = Y a=0 = Y for all subjects Rank
More informationBiost 518 Applied Biostatistics II. Purpose of Statistics. First Stage of Scientific Investigation. Further Stages of Scientific Investigation
Biost 58 Applied Biostatistics II Scott S. Emerson, M.D., Ph.D. Professor of Biostatistics University of Washington Lecture 5: Review Purpose of Statistics Statistics is about science (Science in the broadest
More informationThe International Journal of Biostatistics
The International Journal of Biostatistics Volume 2, Issue 1 2006 Article 2 Statistical Inference for Variable Importance Mark J. van der Laan, Division of Biostatistics, School of Public Health, University
More informationPropensity Score Methods for Causal Inference
John Pura BIOS790 October 2, 2015 Causal inference Philosophical problem, statistical solution Important in various disciplines (e.g. Koch s postulates, Bradford Hill criteria, Granger causality) Good
More informationGov 2002: 3. Randomization Inference
Gov 2002: 3. Randomization Inference Matthew Blackwell September 10, 2015 Where are we? Where are we going? Last week: This week: What can we identify using randomization? Estimators were justified via
More informationMS&E 226: Small Data
MS&E 226: Small Data Lecture 15: Examples of hypothesis tests (v5) Ramesh Johari ramesh.johari@stanford.edu 1 / 32 The recipe 2 / 32 The hypothesis testing recipe In this lecture we repeatedly apply the
More informationEstimating the Marginal Odds Ratio in Observational Studies
Estimating the Marginal Odds Ratio in Observational Studies Travis Loux Christiana Drake Department of Statistics University of California, Davis June 20, 2011 Outline The Counterfactual Model Odds Ratios
More informationQuestions 3.83, 6.11, 6.12, 6.17, 6.25, 6.29, 6.33, 6.35, 6.50, 6.51, 6.53, 6.55, 6.59, 6.60, 6.65, 6.69, 6.70, 6.77, 6.79, 6.89, 6.
Chapter 7 Reading 7.1, 7.2 Questions 3.83, 6.11, 6.12, 6.17, 6.25, 6.29, 6.33, 6.35, 6.50, 6.51, 6.53, 6.55, 6.59, 6.60, 6.65, 6.69, 6.70, 6.77, 6.79, 6.89, 6.112 Introduction In Chapter 5 and 6, we emphasized
More informationOverview. Overview. Overview. Specific Examples. General Examples. Bivariate Regression & Correlation
Bivariate Regression & Correlation Overview The Scatter Diagram Two Examples: Education & Prestige Correlation Coefficient Bivariate Linear Regression Line SPSS Output Interpretation Covariance ou already
More informationLinear Model Under General Variance
Linear Model Under General Variance We have a sample of T random variables y 1, y 2,, y T, satisfying the linear model Y = X β + e, where Y = (y 1,, y T )' is a (T 1) vector of random variables, X = (T
More informationOne-sample categorical data: approximate inference
One-sample categorical data: approximate inference Patrick Breheny October 6 Patrick Breheny Biostatistical Methods I (BIOS 5710) 1/25 Introduction It is relatively easy to think about the distribution
More informationSection 9c. Propensity scores. Controlling for bias & confounding in observational studies
Section 9c Propensity scores Controlling for bias & confounding in observational studies 1 Logistic regression and propensity scores Consider comparing an outcome in two treatment groups: A vs B. In a
More informationChapter 11. Correlation and Regression
Chapter 11. Correlation and Regression The word correlation is used in everyday life to denote some form of association. We might say that we have noticed a correlation between foggy days and attacks of
More informationApplication of Time-to-Event Methods in the Assessment of Safety in Clinical Trials
Application of Time-to-Event Methods in the Assessment of Safety in Clinical Trials Progress, Updates, Problems William Jen Hoe Koh May 9, 2013 Overview Marginal vs Conditional What is TMLE? Key Estimation
More informationChapter 1 Statistical Inference
Chapter 1 Statistical Inference causal inference To infer causality, you need a randomized experiment (or a huge observational study and lots of outside information). inference to populations Generalizations
More informationGov 2002: 13. Dynamic Causal Inference
Gov 2002: 13. Dynamic Causal Inference Matthew Blackwell December 19, 2015 1 / 33 1. Time-varying treatments 2. Marginal structural models 2 / 33 1/ Time-varying treatments 3 / 33 Time-varying treatments
More informationExtending causal inferences from a randomized trial to a target population
Extending causal inferences from a randomized trial to a target population Issa Dahabreh Center for Evidence Synthesis in Health, Brown University issa dahabreh@brown.edu January 16, 2019 Issa Dahabreh
More informationGov 2000: 6. Hypothesis Testing
Gov 2000: 6. Hypothesis Testing Matthew Blackwell October 11, 2016 1 / 55 1. Hypothesis Testing Examples 2. Hypothesis Test Nomenclature 3. Conducting Hypothesis Tests 4. p-values 5. Power Analyses 6.
More informationCox s proportional hazards model and Cox s partial likelihood
Cox s proportional hazards model and Cox s partial likelihood Rasmus Waagepetersen October 12, 2018 1 / 27 Non-parametric vs. parametric Suppose we want to estimate unknown function, e.g. survival function.
More informationBusiness Statistics. Lecture 10: Correlation and Linear Regression
Business Statistics Lecture 10: Correlation and Linear Regression Scatterplot A scatterplot shows the relationship between two quantitative variables measured on the same individuals. It displays the Form
More informationCausal Inference. Miguel A. Hernán, James M. Robins. May 19, 2017
Causal Inference Miguel A. Hernán, James M. Robins May 19, 2017 ii Causal Inference Part III Causal inference from complex longitudinal data Chapter 19 TIME-VARYING TREATMENTS So far this book has dealt
More informationCausal Inference Basics
Causal Inference Basics Sam Lendle October 09, 2013 Observed data, question, counterfactuals Observed data: n i.i.d copies of baseline covariates W, treatment A {0, 1}, and outcome Y. O i = (W i, A i,
More informationData Integration for Big Data Analysis for finite population inference
for Big Data Analysis for finite population inference Jae-kwang Kim ISU January 23, 2018 1 / 36 What is big data? 2 / 36 Data do not speak for themselves Knowledge Reproducibility Information Intepretation
More informationLecture 12: Effect modification, and confounding in logistic regression
Lecture 12: Effect modification, and confounding in logistic regression Ani Manichaikul amanicha@jhsph.edu 4 May 2007 Today Categorical predictor create dummy variables just like for linear regression
More informationWhat s New in Econometrics? Lecture 14 Quantile Methods
What s New in Econometrics? Lecture 14 Quantile Methods Jeff Wooldridge NBER Summer Institute, 2007 1. Reminders About Means, Medians, and Quantiles 2. Some Useful Asymptotic Results 3. Quantile Regression
More informationSimple Linear Regression
Simple Linear Regression ST 430/514 Recall: A regression model describes how a dependent variable (or response) Y is affected, on average, by one or more independent variables (or factors, or covariates)
More informationPropensity Score Analysis with Hierarchical Data
Propensity Score Analysis with Hierarchical Data Fan Li Alan Zaslavsky Mary Beth Landrum Department of Health Care Policy Harvard Medical School May 19, 2008 Introduction Population-based observational
More informationUniversity of California, Berkeley
University of California, Berkeley U.C. Berkeley Division of Biostatistics Working Paper Series Year 2010 Paper 269 Diagnosing and Responding to Violations in the Positivity Assumption Maya L. Petersen
More informationDr. Junchao Xia Center of Biophysics and Computational Biology. Fall /1/2016 1/46
BIO5312 Biostatistics Lecture 10:Regression and Correlation Methods Dr. Junchao Xia Center of Biophysics and Computational Biology Fall 2016 11/1/2016 1/46 Outline In this lecture, we will discuss topics
More informationReview of Statistics 101
Review of Statistics 101 We review some important themes from the course 1. Introduction Statistics- Set of methods for collecting/analyzing data (the art and science of learning from data). Provides methods
More informationLecture 18: Simple Linear Regression
Lecture 18: Simple Linear Regression BIOS 553 Department of Biostatistics University of Michigan Fall 2004 The Correlation Coefficient: r The correlation coefficient (r) is a number that measures the strength
More informationEXAMINATION: QUANTITATIVE EMPIRICAL METHODS. Yale University. Department of Political Science
EXAMINATION: QUANTITATIVE EMPIRICAL METHODS Yale University Department of Political Science January 2014 You have seven hours (and fifteen minutes) to complete the exam. You can use the points assigned
More informationLecture 4 Multiple linear regression
Lecture 4 Multiple linear regression BIOST 515 January 15, 2004 Outline 1 Motivation for the multiple regression model Multiple regression in matrix notation Least squares estimation of model parameters
More informationStructural Nested Mean Models for Assessing Time-Varying Effect Moderation. Daniel Almirall
1 Structural Nested Mean Models for Assessing Time-Varying Effect Moderation Daniel Almirall Center for Health Services Research, Durham VAMC & Dept. of Biostatistics, Duke University Medical Joint work
More informationIntroduction to Econometrics. Heteroskedasticity
Introduction to Econometrics Introduction Heteroskedasticity When the variance of the errors changes across segments of the population, where the segments are determined by different values for the explanatory
More informationEstimating the Mean Response of Treatment Duration Regimes in an Observational Study. Anastasios A. Tsiatis.
Estimating the Mean Response of Treatment Duration Regimes in an Observational Study Anastasios A. Tsiatis http://www.stat.ncsu.edu/ tsiatis/ Introduction to Dynamic Treatment Regimes 1 Outline Description
More informationStatistics Boot Camp. Dr. Stephanie Lane Institute for Defense Analyses DATAWorks 2018
Statistics Boot Camp Dr. Stephanie Lane Institute for Defense Analyses DATAWorks 2018 March 21, 2018 Outline of boot camp Summarizing and simplifying data Point and interval estimation Foundations of statistical
More informationImportant note: Transcripts are not substitutes for textbook assignments. 1
In this lesson we will cover correlation and regression, two really common statistical analyses for quantitative (or continuous) data. Specially we will review how to organize the data, the importance
More informationLinear Regression. In this lecture we will study a particular type of regression model: the linear regression model
1 Linear Regression 2 Linear Regression In this lecture we will study a particular type of regression model: the linear regression model We will first consider the case of the model with one predictor
More informationChapter 5: Logistic Regression-I
: Logistic Regression-I Dipankar Bandyopadhyay Department of Biostatistics, Virginia Commonwealth University BIOS 625: Categorical Data & GLM [Acknowledgements to Tim Hanson and Haitao Chu] D. Bandyopadhyay
More informationTelescope Matching: A Flexible Approach to Estimating Direct Effects
Telescope Matching: A Flexible Approach to Estimating Direct Effects Matthew Blackwell and Anton Strezhnev International Methods Colloquium October 12, 2018 direct effect direct effect effect of treatment
More informationDistribution-Free Procedures (Devore Chapter Fifteen)
Distribution-Free Procedures (Devore Chapter Fifteen) MATH-5-01: Probability and Statistics II Spring 018 Contents 1 Nonparametric Hypothesis Tests 1 1.1 The Wilcoxon Rank Sum Test........... 1 1. Normal
More informationPsychology 282 Lecture #4 Outline Inferences in SLR
Psychology 282 Lecture #4 Outline Inferences in SLR Assumptions To this point we have not had to make any distributional assumptions. Principle of least squares requires no assumptions. Can use correlations
More informationMarginal, crude and conditional odds ratios
Marginal, crude and conditional odds ratios Denitions and estimation Travis Loux Gradute student, UC Davis Department of Statistics March 31, 2010 Parameter Denitions When measuring the eect of a binary
More informationWhat s New in Econometrics. Lecture 1
What s New in Econometrics Lecture 1 Estimation of Average Treatment Effects Under Unconfoundedness Guido Imbens NBER Summer Institute, 2007 Outline 1. Introduction 2. Potential Outcomes 3. Estimands and
More informationA Practitioner s Guide to Cluster-Robust Inference
A Practitioner s Guide to Cluster-Robust Inference A. C. Cameron and D. L. Miller presented by Federico Curci March 4, 2015 Cameron Miller Cluster Clinic II March 4, 2015 1 / 20 In the previous episode
More informationNew Developments in Econometrics Lecture 16: Quantile Estimation
New Developments in Econometrics Lecture 16: Quantile Estimation Jeff Wooldridge Cemmap Lectures, UCL, June 2009 1. Review of Means, Medians, and Quantiles 2. Some Useful Asymptotic Results 3. Quantile
More informationRegression Models - Introduction
Regression Models - Introduction In regression models there are two types of variables that are studied: A dependent variable, Y, also called response variable. It is modeled as random. An independent
More informationCasual Mediation Analysis
Casual Mediation Analysis Tyler J. VanderWeele, Ph.D. Upcoming Seminar: April 21-22, 2017, Philadelphia, Pennsylvania OXFORD UNIVERSITY PRESS Explanation in Causal Inference Methods for Mediation and Interaction
More informationWeighting. Homework 2. Regression. Regression. Decisions Matching: Weighting (0) W i. (1) -å l i. )Y i. (1-W i 3/5/2014. (1) = Y i.
Weighting Unconfounded Homework 2 Describe imbalance direction matters STA 320 Design and Analysis of Causal Studies Dr. Kari Lock Morgan and Dr. Fan Li Department of Statistical Science Duke University
More informationUniversity of California, Berkeley
University of California, Berkeley U.C. Berkeley Division of Biostatistics Working Paper Series Year 2015 Paper 334 Targeted Estimation and Inference for the Sample Average Treatment Effect Laura B. Balzer
More informationSTAT331. Cox s Proportional Hazards Model
STAT331 Cox s Proportional Hazards Model In this unit we introduce Cox s proportional hazards (Cox s PH) model, give a heuristic development of the partial likelihood function, and discuss adaptations
More informationBehavioral Data Mining. Lecture 19 Regression and Causal Effects
Behavioral Data Mining Lecture 19 Regression and Causal Effects Outline Counterfactuals and Potential Outcomes Regression Models Causal Effects from Matching and Regression Weighted regression Counterfactuals
More informationMachine Learning Linear Classification. Prof. Matteo Matteucci
Machine Learning Linear Classification Prof. Matteo Matteucci Recall from the first lecture 2 X R p Regression Y R Continuous Output X R p Y {Ω 0, Ω 1,, Ω K } Classification Discrete Output X R p Y (X)
More informationMULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS
MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS Page 1 MSR = Mean Regression Sum of Squares MSE = Mean Squared Error RSS = Regression Sum of Squares SSE = Sum of Squared Errors/Residuals α = Level
More informationSummer School in Statistics for Astronomers V June 1 - June 6, Regression. Mosuk Chow Statistics Department Penn State University.
Summer School in Statistics for Astronomers V June 1 - June 6, 2009 Regression Mosuk Chow Statistics Department Penn State University. Adapted from notes prepared by RL Karandikar Mean and variance Recall
More informationSCHOOL OF MATHEMATICS AND STATISTICS. Linear and Generalised Linear Models
SCHOOL OF MATHEMATICS AND STATISTICS Linear and Generalised Linear Models Autumn Semester 2017 18 2 hours Attempt all the questions. The allocation of marks is shown in brackets. RESTRICTED OPEN BOOK EXAMINATION
More informationAcknowledgements. Outline. Marie Diener-West. ICTR Leadership / Team INTRODUCTION TO CLINICAL RESEARCH. Introduction to Linear Regression
INTRODUCTION TO CLINICAL RESEARCH Introduction to Linear Regression Karen Bandeen-Roche, Ph.D. July 17, 2012 Acknowledgements Marie Diener-West Rick Thompson ICTR Leadership / Team JHU Intro to Clinical
More informationST495: Survival Analysis: Hypothesis testing and confidence intervals
ST495: Survival Analysis: Hypothesis testing and confidence intervals Eric B. Laber Department of Statistics, North Carolina State University April 3, 2014 I remember that one fateful day when Coach took
More informationData Mining Chapter 4: Data Analysis and Uncertainty Fall 2011 Ming Li Department of Computer Science and Technology Nanjing University
Data Mining Chapter 4: Data Analysis and Uncertainty Fall 2011 Ming Li Department of Computer Science and Technology Nanjing University Why uncertainty? Why should data mining care about uncertainty? We
More informationSelection on Observables: Propensity Score Matching.
Selection on Observables: Propensity Score Matching. Department of Economics and Management Irene Brunetti ireneb@ec.unipi.it 24/10/2017 I. Brunetti Labour Economics in an European Perspective 24/10/2017
More informationGov 2002: 4. Observational Studies and Confounding
Gov 2002: 4. Observational Studies and Confounding Matthew Blackwell September 10, 2015 Where are we? Where are we going? Last two weeks: randomized experiments. From here on: observational studies. What
More informationIgnoring the matching variables in cohort studies - when is it valid, and why?
Ignoring the matching variables in cohort studies - when is it valid, and why? Arvid Sjölander Abstract In observational studies of the effect of an exposure on an outcome, the exposure-outcome association
More informationScatter plot of data from the study. Linear Regression
1 2 Linear Regression Scatter plot of data from the study. Consider a study to relate birthweight to the estriol level of pregnant women. The data is below. i Weight (g / 100) i Weight (g / 100) 1 7 25
More informationGeneralized Linear Models. Last time: Background & motivation for moving beyond linear
Generalized Linear Models Last time: Background & motivation for moving beyond linear regression - non-normal/non-linear cases, binary, categorical data Today s class: 1. Examples of count and ordered
More informationLecture 06. DSUR CH 05 Exploring Assumptions of parametric statistics Hypothesis Testing Power
Lecture 06 DSUR CH 05 Exploring Assumptions of parametric statistics Hypothesis Testing Power Introduction Assumptions When broken then we are not able to make inference or accurate descriptions about
More informationImproving Efficiency of Inferences in Randomized Clinical Trials Using Auxiliary Covariates
Improving Efficiency of Inferences in Randomized Clinical Trials Using Auxiliary Covariates Anastasios (Butch) Tsiatis Department of Statistics North Carolina State University http://www.stat.ncsu.edu/
More informationQuantitative Genomics and Genetics BTRY 4830/6830; PBSB
Quantitative Genomics and Genetics BTRY 4830/6830; PBSB.5201.01 Lecture 20: Epistasis and Alternative Tests in GWAS Jason Mezey jgm45@cornell.edu April 16, 2016 (Th) 8:40-9:55 None Announcements Summary
More informationmultilevel modeling: concepts, applications and interpretations
multilevel modeling: concepts, applications and interpretations lynne c. messer 27 october 2010 warning social and reproductive / perinatal epidemiologist concepts why context matters multilevel models
More informationCausal Inference with a Continuous Treatment and Outcome: Alternative Estimators for Parametric Dose-Response Functions
Causal Inference with a Continuous Treatment and Outcome: Alternative Estimators for Parametric Dose-Response Functions Joe Schafer Office of the Associate Director for Research and Methodology U.S. Census
More informationBusiness Statistics. Lecture 10: Course Review
Business Statistics Lecture 10: Course Review 1 Descriptive Statistics for Continuous Data Numerical Summaries Location: mean, median Spread or variability: variance, standard deviation, range, percentiles,
More information36-463/663: Multilevel & Hierarchical Models
36-463/663: Multilevel & Hierarchical Models (P)review: in-class midterm Brian Junker 132E Baker Hall brian@stat.cmu.edu 1 In-class midterm Closed book, closed notes, closed electronics (otherwise I have
More informationChapter 11. Regression with a Binary Dependent Variable
Chapter 11 Regression with a Binary Dependent Variable 2 Regression with a Binary Dependent Variable (SW Chapter 11) So far the dependent variable (Y) has been continuous: district-wide average test score
More informationData Analysis and Statistical Methods Statistics 651
y 1 2 3 4 5 6 7 x Data Analysis and Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasini/teaching.html Lecture 32 Suhasini Subba Rao Previous lecture We are interested in whether a dependent
More informationL6: Regression II. JJ Chen. July 2, 2015
L6: Regression II JJ Chen July 2, 2015 Today s Plan Review basic inference based on Sample average Difference in sample average Extrapolate the knowledge to sample regression coefficients Standard error,
More informationPart IV Statistics in Epidemiology
Part IV Statistics in Epidemiology There are many good statistical textbooks on the market, and we refer readers to some of these textbooks when they need statistical techniques to analyze data or to interpret
More information[y i α βx i ] 2 (2) Q = i=1
Least squares fits This section has no probability in it. There are no random variables. We are given n points (x i, y i ) and want to find the equation of the line that best fits them. We take the equation
More informationResearch Design: Causal inference and counterfactuals
Research Design: Causal inference and counterfactuals University College Dublin 8 March 2013 1 2 3 4 Outline 1 2 3 4 Inference In regression analysis we look at the relationship between (a set of) independent
More informationFinal Exam. Name: Solution:
Final Exam. Name: Instructions. Answer all questions on the exam. Open books, open notes, but no electronic devices. The first 13 problems are worth 5 points each. The rest are worth 1 point each. HW1.
More informationRegression with a Single Regressor: Hypothesis Tests and Confidence Intervals
Regression with a Single Regressor: Hypothesis Tests and Confidence Intervals (SW Chapter 5) Outline. The standard error of ˆ. Hypothesis tests concerning β 3. Confidence intervals for β 4. Regression
More informationStatistical Inference. Why Use Statistical Inference. Point Estimates. Point Estimates. Greg C Elvers
Statistical Inference Greg C Elvers 1 Why Use Statistical Inference Whenever we collect data, we want our results to be true for the entire population and not just the sample that we used But our sample
More informationScatter plot of data from the study. Linear Regression
1 2 Linear Regression Scatter plot of data from the study. Consider a study to relate birthweight to the estriol level of pregnant women. The data is below. i Weight (g / 100) i Weight (g / 100) 1 7 25
More informationA NOTE ON ROBUST ESTIMATION IN LOGISTIC REGRESSION MODEL
Discussiones Mathematicae Probability and Statistics 36 206 43 5 doi:0.75/dmps.80 A NOTE ON ROBUST ESTIMATION IN LOGISTIC REGRESSION MODEL Tadeusz Bednarski Wroclaw University e-mail: t.bednarski@prawo.uni.wroc.pl
More informationStructural Nested Mean Models for Assessing Time-Varying Effect Moderation. Daniel Almirall
1 Structural Nested Mean Models for Assessing Time-Varying Effect Moderation Daniel Almirall Center for Health Services Research, Durham VAMC & Duke University Medical, Dept. of Biostatistics Joint work
More informationFor more information about how to cite these materials visit
Author(s): Kerby Shedden, Ph.D., 2010 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution Share Alike 3.0 License: http://creativecommons.org/licenses/by-sa/3.0/
More informationLecture 2: Poisson and logistic regression
Dankmar Böhning Southampton Statistical Sciences Research Institute University of Southampton, UK S 3 RI, 11-12 December 2014 introduction to Poisson regression application to the BELCAP study introduction
More informationLogistic regression: Miscellaneous topics
Logistic regression: Miscellaneous topics April 11 Introduction We have covered two approaches to inference for GLMs: the Wald approach and the likelihood ratio approach I claimed that the likelihood ratio
More informationSTAT 135 Lab 5 Bootstrapping and Hypothesis Testing
STAT 135 Lab 5 Bootstrapping and Hypothesis Testing Rebecca Barter March 2, 2015 The Bootstrap Bootstrap Suppose that we are interested in estimating a parameter θ from some population with members x 1,...,
More informationCausal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies
Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies Kosuke Imai Department of Politics Princeton University November 13, 2013 So far, we have essentially assumed
More informationLast few slides from last time
Last few slides from last time Example 3: What is the probability that p will fall in a certain range, given p? Flip a coin 50 times. If the coin is fair (p=0.5), what is the probability of getting an
More informationSpecification Errors, Measurement Errors, Confounding
Specification Errors, Measurement Errors, Confounding Kerby Shedden Department of Statistics, University of Michigan October 10, 2018 1 / 32 An unobserved covariate Suppose we have a data generating model
More information