One-stage dose-response meta-analysis

Similar documents
Meta-analysis of epidemiological dose-response studies

Multivariate Dose-Response Meta-Analysis: the dosresmeta R Package

Journal of Statistical Software

Point-wise averaging approach in dose-response meta-analysis of aggregated data

Homework Solutions Applied Logistic Regression

Working with Stata Inference on the mean

Lecture 3.1 Basic Logistic LDA

Multilevel Modeling Day 2 Intermediate and Advanced Issues: Multilevel Models as Mixed Models. Jian Wang September 18, 2012

Correlation and Simple Linear Regression

Lecture 12: Effect modification, and confounding in logistic regression

Lecture 5: Poisson and logistic regression

7/28/15. Review Homework. Overview. Lecture 6: Logistic Regression Analysis

Lecture 2: Poisson and logistic regression

Binary Dependent Variables

Logit estimates Number of obs = 5054 Wald chi2(1) = 2.70 Prob > chi2 = Log pseudolikelihood = Pseudo R2 =

Consider Table 1 (Note connection to start-stop process).

Estimating compound expectation in a regression framework with the new cereg command

Lecture 3: Measures of effect: Risk Difference Attributable Fraction Risk Ratio and Odds Ratio

Lecture 10: Introduction to Logistic Regression

Statistical Modelling with Stata: Binary Outcomes

Simulating complex survival data

multilevel modeling: concepts, applications and interpretations

Outline. Mixed models in R using the lme4 package Part 3: Longitudinal data. Sleep deprivation data. Simple longitudinal data

How To Do Piecewise Exponential Survival Analysis in Stata 7 (Allison 1995:Output 4.20) revised

Lecture 4: Generalized Linear Mixed Models

Lecture 3 Linear random intercept models

A procedure to tabulate and plot results after flexible modeling of a quantitative covariate

Instantaneous geometric rates via Generalized Linear Models

Binomial Model. Lecture 10: Introduction to Logistic Regression. Logistic Regression. Binomial Distribution. n independent trials

Graduate Econometrics Lecture 4: Heteroskedasticity

Applied Statistics and Econometrics

Acknowledgements. Outline. Marie Diener-West. ICTR Leadership / Team INTRODUCTION TO CLINICAL RESEARCH. Introduction to Linear Regression

Applied Survival Analysis Lab 10: Analysis of multiple failures

Lab 11 - Heteroskedasticity

The Simulation Extrapolation Method for Fitting Generalized Linear Models with Additive Measurement Error

Section Least Squares Regression

Biostatistics Workshop Longitudinal Data Analysis. Session 4 GARRETT FITZMAURICE

Correlated Data: Linear Mixed Models with Random Intercepts

Instantaneous geometric rates via generalized linear models

STAT 7030: Categorical Data Analysis

Epidemiology Wonders of Biostatistics Chapter 13 - Effect Measures. John Koval

Classification. Chapter Introduction. 6.2 The Bayes classifier

Correlation and regression

Heteroskedasticity. Part VII. Heteroskedasticity

Lab 3: Two levels Poisson models (taken from Multilevel and Longitudinal Modeling Using Stata, p )

BIOS 312: Precision of Statistical Inference

Outline. Linear OLS Models vs: Linear Marginal Models Linear Conditional Models. Random Intercepts Random Intercepts & Slopes

Sociology 362 Data Exercise 6 Logistic Regression 2

Confidence intervals for the variance component of random-effects linear models

Lecture 7 Time-dependent Covariates in Cox Regression

Nonlinear Regression Functions

BIOS 2083 Linear Models c Abdus S. Wahed

2. We care about proportion for categorical variable, but average for numerical one.

Practical Econometrics. for. Finance and Economics. (Econometrics 2)

Group Comparisons: Differences in Composition Versus Differences in Models and Effects

Faculty of Health Sciences. Regression models. Counts, Poisson regression, Lene Theil Skovgaard. Dept. of Biostatistics

Inference. ME104: Linear Regression Analysis Kenneth Benoit. August 15, August 15, 2012 Lecture 3 Multiple linear regression 1 1 / 58

Introductory Econometrics. Lecture 13: Hypothesis testing in the multiple regression model, Part 1

Empirical Application of Simple Regression (Chapter 2)

Introduction to Statistical Analysis

University of California at Berkeley Fall Introductory Applied Econometrics Final examination. Scores add up to 125 points

Confounding and effect modification: Mantel-Haenszel estimation, testing effect homogeneity. Dankmar Böhning

A Journey to Latent Class Analysis (LCA)

Generalized, Linear, and Mixed Models

Economics 326 Methods of Empirical Research in Economics. Lecture 14: Hypothesis testing in the multiple regression model, Part 2

Chapter 22: Log-linear regression for Poisson counts

Problem set - Selection and Diff-in-Diff

Linear Regression Models P8111

BMI 541/699 Lecture 22

Stat/F&W Ecol/Hort 572 Review Points Ané, Spring 2010

Parametric multistate survival analysis: New developments

BIOSTATS Intermediate Biostatistics Spring 2017 Exam 2 (Units 3, 4 & 5) Practice Problems SOLUTIONS

Restricted Maximum Likelihood in Linear Regression and Linear Mixed-Effects Model

Mixed Models for Longitudinal Binary Outcomes. Don Hedeker Department of Public Health Sciences University of Chicago.

Model comparison and selection

4. MA(2) +drift: y t = µ + ɛ t + θ 1 ɛ t 1 + θ 2 ɛ t 2. Mean: where θ(l) = 1 + θ 1 L + θ 2 L 2. Therefore,

Modelling Rates. Mark Lunt. Arthritis Research UK Epidemiology Unit University of Manchester

Stat 579: Generalized Linear Models and Extensions

ECON2228 Notes 2. Christopher F Baum. Boston College Economics. cfb (BC Econ) ECON2228 Notes / 47

11 November 2011 Department of Biostatistics, University of Copengen. 9:15 10:00 Recap of case-control studies. Frequency-matched studies.

Jeffrey M. Wooldridge Michigan State University

Regression, Ridge Regression, Lasso

Statistics in medicine

especially with continuous

On the Behavior of Marginal and Conditional Akaike Information Criteria in Linear Mixed Models

Modeling the Mean: Response Profiles v. Parametric Curves

ST3241 Categorical Data Analysis I Multicategory Logit Models. Logit Models For Nominal Responses

Statistical Methods III Statistics 212. Problem Set 2 - Answer Key

Model Based Statistics in Biology. Part V. The Generalized Linear Model. Chapter 18.1 Logistic Regression (Dose - Response)

Statistical Modelling in Stata 5: Linear Models

Statistical Distribution Assumptions of General Linear Models

Appendix A. Numeric example of Dimick Staiger Estimator and comparison between Dimick-Staiger Estimator and Hierarchical Poisson Estimator

Estimation of discrete time (grouped duration data) proportional hazards models: pgmhaz

Quasi-likelihood Scan Statistics for Detection of

On the Behavior of Marginal and Conditional Akaike Information Criteria in Linear Mixed Models

Class Notes: Week 8. Probit versus Logit Link Functions and Count Data

ADVANCED STATISTICAL ANALYSIS OF EPIDEMIOLOGICAL STUDIES. Cox s regression analysis Time dependent explanatory variables

Biost 518 Applied Biostatistics II. Purpose of Statistics. First Stage of Scientific Investigation. Further Stages of Scientific Investigation

Outline. Statistical inference for linear mixed models. One-way ANOVA in matrix-vector form

ECLT 5810 Linear Regression and Logistic Regression for Classification. Prof. Wai Lam

Transcription:

One-stage dose-response meta-analysis Nicola Orsini, Alessio Crippa Biostatistics Team Department of Public Health Sciences Karolinska Institutet http://ki.se/en/phs/biostatistics-team 2017 Nordic and Baltic Stata Users Group meeting September 1, 2017 Orsini N (PHS, KI) Dose-response meta-analysis September 1, 2017 1 / 29

Outline Goal Data Model Estimation Examples Summary Orsini N (PHS, KI) Dose-response meta-analysis September 1, 2017 2 / 29

Goal A dose-response analysis describes the changes of a response across levels of a quantitative factor. The quantitative factor could be an administered drug or an exposure. A meta-analysis of dose-response (exposure-disease) relations aims at identifying the trend underlying multiple studies trying to answer the same research question. Orsini N (PHS, KI) Dose-response meta-analysis September 1, 2017 3 / 29

Increasing number of dose-response meta-analyses Number of citations 140 120 100 80 60 40 20 0 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 Data source: ISI Web of Knowledge 2002 2003 2004 2005 Publication year 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 Orsini N (PHS, KI) Dose-response meta-analysis September 1, 2017 4 / 29

The world s most comprehensive analysis of cancer prevention and survival research Orsini N (PHS, KI) Dose-response meta-analysis September 1, 2017 5 / 29

Common practice in statistical analysis Plot summarized data and connect the dots with a line First estimate a curve within each study and then average regression coefficients across studies (two-stage approach) Exclude studies with less than 3 exposure groups Linear vs non-linear relationships Find the best fitting dose-response model Orsini N (PHS, KI) Dose-response meta-analysis September 1, 2017 6 / 29

Data for a single study Table: Rate ratios of prostate cancer according to categories of body mass index (kg/m 2 ). Data from a cohort of 36,143 middle-age and elderly men followed for 446,699 person-years during which 2,037 were diagnosed with prostate cancer. Alcohol Median, No. of Person- Rate Ratio Intake grams/day cases years (95% CI) < 21.00 20.0 84 21,289 1.00 Ref. [21.00; 23.00) 22.2 323 61,895 1.32 (1.04, 1.68) [23.00; 25.00) 24.1 532 115,885 1.16 (0.92, 1.46) [25.00; 27.50) 26.2 651 136,917 1.21 (0.96, 1.51) [27.50; 30.00) 28.6 283 68,008 1.05 (0.83, 1.35) 30 32.3 164 42,704 0.97 (0.75, 1.27) Orsini N (PHS, KI) Dose-response meta-analysis September 1, 2017 7 / 29

Plot of the data for a single study 1.4 1.3 Rate Ratio 1.2 1.1 1.0 15 20 25 30 35 40 Body Mass Index, kg/m 2 0.9 Orsini N (PHS, KI) Dose-response meta-analysis September 1, 2017 8 / 29

Summarized vs Individual Data. glst logrr bmic, cov (py case ) se(se) ir Generalized least - squares regression Number of obs = 5 Goodness -of - fit chi2 (4) = 9.62 Model chi2 (1) = 6.35 Prob > chi2 = 0.0473 Prob > chi2 = 0.0117 ------------------------------------------------------------------------------ logrr Coef. Std. Err. z P > z [95% Conf. Interval ] -------------+---------------------------------------------------------------- bmir -.0189389.0075147-2.52 0.012 -.0336675 -.0042103 ------------------------------------------------------------------------------ Every 5 kg/m 2 increase in body mass index is associated with 9% (95% CI=0.85-0.98) lower prostate cancer risk.. streg bmi, dist ( exp ) nohr Exponential PH regression No. of subjects = 36,143 Number of obs = 36,143 No. of failures = 2,037 Time at risk = 446698.5243 LR chi2 (1) = 7.92 Log likelihood = -9249.8404 Prob > chi2 = 0.0049 ------------------------------------------------------------------------------ _t Coef. Std. Err. z P > z [95% Conf. Interval ] -------------+---------------------------------------------------------------- bmi -.0197478.0070674-2.79 0.005 -.0335997 -.0058959 _cons -4.88436.1817654-26.87 0.000-5.240614-4.528106 ------------------------------------------------------------------------------ Every 5 kg/m 2 increase in body mass index is associated with 9% (95% CI=0.85-0.97) lower prostate cancer rate. Orsini N (PHS, KI) Dose-response meta-analysis September 1, 2017 9 / 29

Challenge of comparing alternative parametrizations expressed in relative terms 1.4 1.3 1.2 1.1 1.0 Rate Ratio 0.9 0.8 0.7 15 20 25 30 35 40 Body Mass Index, kg/m 2 Orsini N (PHS, KI) Dose-response meta-analysis September 1, 2017 10 / 29

Features of the data The response variable is a vector of contrasts relative to a common referent Correlation among study-specific contrasts Graphical comparison of alternative models is not straightforward Number of contrasts is varying across studies Exclusion of studies with not enough contrasts to fit more complicated model Orsini N (PHS, KI) Dose-response meta-analysis September 1, 2017 11 / 29

Model A one-stage model for meta-analysis of aggregated dose-response data can be written in the general form of a linear mixed model y i = X i β + Z i b i + ɛ i (1) y i is the n i 1 outcome vector in the i-th study X i is the corresponding n i p design matrix for the fixed-effects β, consisting of the p transformations able to answer a variety of research questions. Orsini N (PHS, KI) Dose-response meta-analysis September 1, 2017 12 / 29

Splines according to the research question Rate ratio of colorectal cancer 2.0 1.5 1.2 1.0 0.8 a) Restricted cubic splines 2.0 1.5 1.2 1.0 0.8 b) Piecewise linear 0 10 20 30 40 50 60 70 0 10 20 30 40 50 60 70 Rate ratio of colorectal cancer 2.0 1.5 1.2 1.0 0.8 c) Piecewise constant 0 10 20 30 40 50 60 70 Alcohol intake, grams/day 2.0 1.5 1.2 1.0 0.8 d) Mix of splines 0 10 20 30 40 50 60 70 Alcohol intake, grams/day Orsini N (PHS, KI) Dose-response meta-analysis September 1, 2017 13 / 29

Design matrix Since the y i is a set of response contrasts relative to the baseline dose x 0i, X i needs to be constructed in a similar way by centering the p transformations of the dose levels to the corresponding values in x 0i. Let consider, for example, a transformation f ; the generic j-th row of X i would be defined as f (x ji ) f (x 0i ). As a consequence X does not contain the intercept term (y = 0 for x = x 0i ). Orsini N (PHS, KI) Dose-response meta-analysis September 1, 2017 14 / 29

Random effects and residual error term b i N (0, Ψ) The random-effects b i represent study-specific deviations from the population average dose-response coefficients β. Z i is the analogous n i q design matrix for the random-effects. The residual error term ɛ i N (0, S i ), whose variance matrix S i is assumed known. S i can be either given or approximated using available summarized data. Orsini N (PHS, KI) Dose-response meta-analysis September 1, 2017 15 / 29

Marginal and conditional model The marginal model of Equation 1 can be written as ) y i N (X i β, Z i ΨZ i + S i (2) with Z i ΨZ i + S i = Σ i. The marginal variance Σ i can be separated in two parts: the within-study component S i, that can be reconstructed from the available data and the between-study variability as a quadratic form of Ψ. Alternatively the conditional model can be written as y i b i N (X i β + Z i b i, S i ) (3) The dose-response model in Equation 1 can be extended to the case of meta-regression by including an interaction terms between the p dose transformations and the study-levels variables in the fixed-effect design matrix X i. Orsini N (PHS, KI) Dose-response meta-analysis September 1, 2017 16 / 29

Estimation We consider estimation methods based on maximum likelihood (ML) and restricted maximum likelihood (REML). The marginal likelihood for the model in Equation 2 is defined as l (β, ξ) = 1 2 n log(2π) 1 2 1 2 k i=1 k log Σ i (ξ) + i=1 [ ] (y i X i β) Σ i (ξ) 1 (y i X i β) where n = k i=1 n i and ξ is the vector of the variance components in Ψ to be estimated. Assuming ξ is known, ML estimates of β and V (β) are obtained by generalized least square estimators. Orsini N (PHS, KI) Dose-response meta-analysis September 1, 2017 17 / 29

Estimation An alternative is provided by REML estimation that maximizes the following likelihood l R (ξ) = 1 2 (n p) log(2π) 1 2 1 k 2 log X T i Σ i (ξ) 1 X i 1 2 i=1 k log Σ i (ξ) + i=1 k i=1 [ ( ) y i X i ˆβ ( Σi (ξ) 1 y i X i ˆβ) ] where ˆβ indicates the estimates obtained by generalized least squares. Both ML and REML estimation methods have been implemented in the new drmeta Stata package. The additional fixed-effects analysis constrains the variance components ξ in Ψ to be all equal to zero. (4) Orsini N (PHS, KI) Dose-response meta-analysis September 1, 2017 18 / 29

Hypothesis testing, goodness-of-fit, model comparison Hypothesis testing and confidence intervals for single coefficients can be constructed using standard inference from linear mixed models, based on the approximate multivariate distribution of β. Multivariate extensions of Wald-type or likelihood ratio tests can be adopted to test the hypothesis H 0 : β 1 = = β p = 0. An absolute measure of the fit of the model (Discacciati et al Res Synt Meth, 2015) is the deviance D = k i=1 (y i X i β) Σ 1 i (y i X i β) The coefficients of determination R 2 and a visual assessment of the decorrelated residuals may complement the previous measure. The fit of the separate analyses can be also compared using fit statistics such as the Akaike information criterion. Orsini N (PHS, KI) Dose-response meta-analysis September 1, 2017 19 / 29

Prediction The average dose-response curve can be presented pointwisely as predicted (log) relative responses for selected dose values x using one value x 0 as referent ŷ = (X X 0 ) ˆβ (5) where X and X 0 are the design matrices evaluated at x and x 0 respectively. An approximate 95% confidence interval for the predicted (log) relative measures can be constructed as (X X 0 ) ˆβ ± z 1 α 2 diag ( (ˆβ) (X X 0 ) V (X X 0 ) ) (6) Orsini N (PHS, KI) Dose-response meta-analysis September 1, 2017 20 / 29

Best Linear Unbiased Prediction (BLUP) The multivariate normal assumption for unobserved random-effects can be used for making inference on the study-specific curves. Henderson (Biometrics, 1950) showed that the (asymptotic) best linear unbiased prediction (BLUP) of b can be computed as ˆb i = ˆΨZ i ˆΣ 1 ( ) i y i X i ˆβ (7) The conditional study-specific curves are given by X i ˆβ + ˆbi, that is a combination of the study-specific and population-average associations. Interestingly, the study-specific curve can be predicted also for studies with p transformations of the dose with p > n i. Orsini N (PHS, KI) Dose-response meta-analysis September 1, 2017 21 / 29

Summarized data for 9 simulated case-control studies +-----------------------------------------------+ id rr lrr urr dose n cases ----------------------------------------------- 1 1.00 1.00 1.00 2.4 2260 42 1 0.89 0.62 1.28 5.2 6136 102 ----------------------------------------------- 2 1.00 1.00 1.00 1.7 651 39 2 0.68 0.47 0.97 5.1 3962 164 2 1.13 0.68 1.89 8.8 387 26 ----------------------------------------------- 3 1.00 1.00 1.00 0.8 224 11 3 0.75 0.40 1.43 3.9 2639 99 3 0.79 0.42 1.51 6.7 2031 80 3 2.02 0.83 4.91 9.8 106 10 ----------------------------------------------- 4 1.00 1.00 1.00 3.7 4306 89 4 0.69 0.50 0.96 6.2 4316 62 ----------------------------------------------- 5 1.00 1.00 1.00 2.0 849 22 5 1.03 0.64 1.65 5.2 3638 97 5 1.76 0.97 3.20 8.6 513 23 ----------------------------------------------- 6 1.00 1.00 1.00 0.1 112 7 6 0.42 0.19 0.95 3.5 2229 61 6 0.51 0.23 1.13 6.4 2515 83 6 1.12 0.41 3.04 9.6 144 10 ----------------------------------------------- 7 1.00 1.00 1.00 3.2 3807 117 7 0.64 0.49 0.83 6.0 5295 105 ----------------------------------------------- 8 1.00 1.00 1.00 1.5 442 14 8 1.20 0.69 2.11 4.7 3667 139 8 2.05 1.13 3.73 7.9 891 56 ----------------------------------------------- 9 1.00 1.00 1.00 0.0 87 5 9 0.38 0.15 0.98 3.2 1943 44 9 0.46 0.18 1.17 6.0 2697 74 9 0.62 0.21 1.88 9.0 273 10 +-----------------------------------------------+ Orsini N (PHS, KI) Dose-response meta-analysis September 1, 2017 22 / 29

One-stage vs Two-stage model The one-stage model can be written as y i = (β 1 + b 1i )(x i x 0i ) + (β 2 + b 2i )(x 2 i x 2 0i) + ɛ i where y i is the vector of log odds ratios for the non-referent exposure levels in the i-th studies. The alternative two-stage analysis estimates the same model separately for each study y i = β 1i (x i x 0i ) + β 2i (x 2 i x 2 0i) + ɛ i and obtaines the population-average dose-response coefficients by using multivariate meta-analysis on the study-specific ˆβ i estimated in the previous step. Note that 3 studies (ID 1, 4, and 7) cannot be included in the two-stage analysis, since the quadratic models are not identifiable (p = 2 > n i = 1, for i = 1, 4, 7). Orsini N (PHS, KI) Dose-response meta-analysis September 1, 2017 23 / 29

drmeta - One-stage model. gen dosesq = dose ^2. bysort id: gen dosec = dose - dose [1]. bysort id: gen dosesqc = dosesq - dosesq [1]. drmeta logrr dosec dosesqc, se(se) data (n cases ) set (id typen ) reml One - stage random - effect dose - response model Number of studies = 9 Number of obs = 18 Optimization = reml Model chi2 (2) = 36.67 Log likelihood = -8.6740999 Prob > chi2 = 0.0000 ------------------------------------------------------------------------------ logrr Coef. Std. Err. z P > z [95% Conf. Interval ] -------------+---------------------------------------------------------------- dose -.3294237.0733105-4.49 0.000 -.4731097 -.1857376 dosesq.0341118.0060608 5.63 0.000.0222329.0459907 ------------------------------------------------------------------------------ Orsini N (PHS, KI) Dose-response meta-analysis September 1, 2017 24 / 29

drmeta - Two-stage model A traditional two-stage approach is feasible by excluding the three studies with only one non-reference catergory.. drop if inlist (id, 1,4,7). drmeta logrr dosec dosesqc, se(se) data (n cases ) set (id type ) reml 2 stage Two - stage random - effect dose - response model Number of studies = 6 Number of obs = 12 Optimization = reml Model chi2 (2) = 41.31 Log likelihood = 23.134639 Prob > chi2 = 0.0000 ------------------------------------------------------------------------------ logrr Coef. Std. Err. z P > z [95% Conf. Interval ] -------------+---------------------------------------------------------------- dosec -.2771024.0642939-4.31 0.000 -.4031161 -.1510887 dosesqc.0321893.0059277 5.43 0.000.0205713.0438073 ------------------------------------------------------------------------------ Orsini N (PHS, KI) Dose-response meta-analysis September 1, 2017 25 / 29

Population average dose-response curves 2.00 1.70 True 1 stage (9 studies) 2 stage (6 studies) Odds Ratio 1.30 1.00 0.80 0 1.5 3 4.5 6 7.5 9 10.5 Dose Orsini N (PHS, KI) Dose-response meta-analysis September 1, 2017 26 / 29

Study-specific dose-response trends based on predicted random effects Odds Ratio 2.00 1.70 1.30 1.00 0.80 0 1.5 3 4.5 6 7.5 9 10.5 Dose Orsini N (PHS, KI) Dose-response meta-analysis September 1, 2017 27 / 29

What is the best model? 2.00 Best FP(3,3) 2 Stage Best FP(.5,3) 1 Stage True Quadratic 1.70 Odds Ratio 1.30 1.00 0.80 0 1.5 3 4.5 6 7.5 9 10.5 Dose Orsini N (PHS, KI) Dose-response meta-analysis September 1, 2017 28 / 29

Summary We introduced a one-stage approach for dose-response meta-analysis using linear mixed models for summarized data It includes all the available data in answering research questions It facilitates graphical comparison of study-specific and pooled dose-response relationship It seems to allow a better comparison of alternative models It is computationally more demanding than a classic two-stage approach Orsini N (PHS, KI) Dose-response meta-analysis September 1, 2017 29 / 29