Estimation of Optimally-Combined-Biomarker Accuracy in the Absence of a Gold-Standard Reference Test

Similar documents
Nonparametric Estimation of ROC Curves in the Absence of a Gold Standard (Biometrics 2005)

Lecture 16: Mixtures of Generalized Linear Models

Introduction to Machine Learning

Bayesian Learning. HT2015: SC4 Statistical Data Mining and Machine Learning. Maximum Likelihood Principle. The Bayesian Learning Framework

Comparison of multiple imputation methods for systematically and sporadically missing multilevel data

Reconstruction of individual patient data for meta analysis via Bayesian approach

Nonparametric predictive inference with parametric copulas for combining bivariate diagnostic tests

Introduction to Machine Learning

Minimum Message Length Inference and Mixture Modelling of Inverse Gaussian Distributions

Sample Size Calculations for ROC Studies: Parametric Robustness and Bayesian Nonparametrics

Bayesian SAE using Complex Survey Data Lecture 4A: Hierarchical Spatial Bayes Modeling

[Part 2] Model Development for the Prediction of Survival Times using Longitudinal Measurements

Naïve Bayes classification

ROC Curve Analysis for Randomly Selected Patients

High-dimensional regression modeling

A Bayesian Nonparametric Model for Predicting Disease Status Using Longitudinal Profiles

BAYESIAN METHODS FOR VARIABLE SELECTION WITH APPLICATIONS TO HIGH-DIMENSIONAL DATA

More on nuisance parameters

Bayesian Hierarchical Models

Bayesian Inference on Joint Mixture Models for Survival-Longitudinal Data with Multiple Features. Yangxin Huang

Chapter 3: Maximum-Likelihood & Bayesian Parameter Estimation (part 1)

Estimating Optimum Linear Combination of Multiple Correlated Diagnostic Tests at a Fixed Specificity with Receiver Operating Characteristic Curves

Bayesian Methods: Naïve Bayes

Statistics & Data Sciences: First Year Prelim Exam May 2018

Longitudinal breast density as a marker of breast cancer risk

Naïve Bayes classification. p ij 11/15/16. Probability theory. Probability theory. Probability theory. X P (X = x i )=1 i. Marginal Probability

Stat260: Bayesian Modeling and Inference Lecture Date: February 10th, Jeffreys priors. exp 1 ) p 2

Statistical Methods for Multivariate Meta-Analysis of Diagnostic Tests

Multinomial Data. f(y θ) θ y i. where θ i is the probability that a given trial results in category i, i = 1,..., k. The parameter space is

STAT 518 Intro Student Presentation

David Hughes. Flexible Discriminant Analysis Using. Multivariate Mixed Models. D. Hughes. Motivation MGLMM. Discriminant. Analysis.

The consequences of misspecifying the random effects distribution when fitting generalized linear mixed models

Analysis of longitudinal imaging data. & Sandwich Estimator standard errors

Equivalence of random-effects and conditional likelihoods for matched case-control studies

Default Priors and Effcient Posterior Computation in Bayesian

AMS-207: Bayesian Statistics

Hierarchical Modeling for non-gaussian Spatial Data

Represent processes and observations that span multiple levels (aka multi level models) R 2

Hierarchical Modelling for non-gaussian Spatial Data

1 EM algorithm: updating the mixing proportions {π k } ik are the posterior probabilities at the qth iteration of EM.

10. Exchangeability and hierarchical models Objective. Recommended reading

Plausible Values for Latent Variables Using Mplus

New Bayesian methods for model comparison

Introduction to Machine Learning CMU-10701

Modeling conditional distributions with mixture models: Theory and Inference

A semiparametric Bayesian model for examiner agreement in periodontal research

University of California, Berkeley

Modeling conditional dependence among multiple diagnostic tests

VCMC: Variational Consensus Monte Carlo

Bayesian Nonparametric Meta-Analysis Model George Karabatsos University of Illinois-Chicago (UIC)

Hierarchical Modelling for non-gaussian Spatial Data

Stat 216 Final Solutions

Introduction to Machine Learning. Lecture 2

Nonparametric Bayes tensor factorizations for big data

W-BASED VS LATENT VARIABLES SPATIAL AUTOREGRESSIVE MODELS: EVIDENCE FROM MONTE CARLO SIMULATIONS

Bayesian Statistics for Personalized Medicine. David Yang

Causal Inference with a Continuous Treatment and Outcome: Alternative Estimators for Parametric Dose-Response Functions

Application of Latent Class with Random Effects Models to Longitudinal Data. Ken Beath Macquarie University

Modeling continuous diagnostic test data using approximate Dirichlet process distributions

REPRODUCIBLE ANALYSIS OF HIGH-THROUGHPUT EXPERIMENTS

Lecture 1: Bayesian Framework Basics

Introduction to Machine Learning. Maximum Likelihood and Bayesian Inference. Lecturers: Eran Halperin, Yishay Mansour, Lior Wolf

University of Cambridge Engineering Part IIB Module 3F3: Signal and Pattern Processing Handout 2:. The Multivariate Gaussian & Decision Boundaries

Estimating Diagnostic Error without a Gold Standard: A Mixed Membership Approach

BIOS 312: Precision of Statistical Inference

Introduction: MLE, MAP, Bayesian reasoning (28/8/13)

Meta-Analysis for Diagnostic Test Data: a Bayesian Approach

Constructing Confidence Intervals of the Summary Statistics in the Least-Squares SROC Model

A Bayesian Nonparametric Approach to Monotone Missing Data in Longitudinal Studies with Informative Missingness

Local Likelihood Bayesian Cluster Modeling for small area health data. Andrew Lawson Arnold School of Public Health University of South Carolina

Assessment of uncertainty in computer experiments: from Universal Kriging to Bayesian Kriging. Céline Helbert, Delphine Dupuy and Laurent Carraro

Zhiguang Huo 1, Chi Song 2, George Tseng 3. July 30, 2018

spbayes: An R Package for Univariate and Multivariate Hierarchical Point-referenced Spatial Models

Based on slides by Richard Zemel

Introduction (Alex Dmitrienko, Lilly) Web-based training program

ECE521 week 3: 23/26 January 2017

ABTEKNILLINEN KORKEAKOULU

Combining diverse information sources with the II-CC-FF paradigm

Quasi-likelihood Scan Statistics for Detection of

10708 Graphical Models: Homework 2

LDA, QDA, Naive Bayes

The impact of covariance misspecification in multivariate Gaussian mixtures on estimation and inference

STA 216, GLM, Lecture 16. October 29, 2007

CLASS NOTES Models, Algorithms and Data: Introduction to computing 2018

Stat 542: Item Response Theory Modeling Using The Extended Rank Likelihood

Part 8: GLMs and Hierarchical LMs and GLMs

Prediction-based structured variable selection through the receiver operating characteristic curves. The Pennsylvania State University

CASE STUDY: Bayesian Incidence Analyses from Cross-Sectional Data with Multiple Markers of Disease Severity. Outline:

Machine Learning Linear Classification. Prof. Matteo Matteucci

Learning with Probabilities

Approaches to Modeling Menstrual Cycle Function

A Fully Nonparametric Modeling Approach to. BNP Binary Regression

Outline. Selection Bias in Multilevel Models. The selection problem. The selection problem. Scope of analysis. The selection problem

A simulation study of model fitting to high dimensional data using penalized logistic regression

Model Checking and Improvement

Genetics: Early Online, published on May 5, 2017 as /genetics

Using Bayesian Priors for More Flexible Latent Class Analysis

Probabilistic Index Models

Approximate analysis of covariance in trials in rare diseases, in particular rare cancers

Chapter 10. Semi-Supervised Learning

Transcription:

Estimation of Optimally-Combined-Biomarker Accuracy in the Absence of a Gold-Standard Reference Test L. García Barrado 1 E. Coart 2 T. Burzykowski 1,2 1 Interuniversity Institute for Biostatistics and Statistical Bioinformatics (I-Biostat) 2 International Drug Development Institute (IDDI) 1 / 26

Outline Problem setting Accuracy definition Optimal combination of biomarkers Absence of gold-standard reference Bayesian latent-class mixture model Naive prior definition Controlled prior definition Simulation study Data and model Results Conclusions 2 / 26

Problem setting Outline Problem setting Accuracy definition Optimal combination of biomarkers Absence of gold-standard reference Bayesian latent-class mixture model Naive prior definition Controlled prior definition Simulation study Data and model Results Conclusions 3 / 26

Problem setting Problem setting Establish accuracy of a combination of biomarkers in the absence of a gold-standard reference test Area under the Receiver Operating Characteristics (ROC) curve (AUC) as measure of accuracy Choose combination of biomarkers that maximizes AUC Imperfect reference test leads to biased estimates of accuracy => To this end a Bayesian latent-class mixture model will be proposed 4 / 26

Problem setting Accuracy definition Area under the Receiver Operating Characteristics curve Classification example Classification example ROC curve 0.0 0.1 0.2 0.3 0.4 Control Disease 0.0 0.1 0.2 0.3 0.4 Control Disease Se1 0.0 0.2 0.4 0.6 0.8 1.0 AUC=0.98 AUC=0.76 2 0 2 4 2 0 2 4 0.0 0.2 0.4 0.6 0.8 1.0 Biomarker value Biomarker value 1 Sp 5 / 26

Problem setting Optimal combination of biomarkers Data assumptions and notation Underlying true biomarker distribution Mixture of two K -variate normal distributions by true disease status (D) Y D=0 N K (µ 0, Σ 0 ) Y D=1 N K (µ 1, Σ 1 ) Se: Unknown sensitivity of the reference test (T) Sp: Unknown specificity of the reference test (T) θ: Unknown true prevalence of disease in the data set Reference test is imperfect Conditionally on true disease status, misclassification independent of biomarker value Ignoring will UNDERESTIMATE performance of biomarker 6 / 26

Problem setting Optimal combination of biomarkers AUC of optimal combination of biomarkers According to Siu and Liu (1993) the linear combination maximizing AUC is of the form: a Y D=0 N(a µ 0, a Σ 0a) a Y D=1 N(a µ 1, a Σ 1a) For which: a (Σ 0 + Σ 1) 1 (µ 1 µ 0 ) Area Under the ROC Curve: AUC OptComb = Φ { } ((µ 1 µ 0 ) (Σ 0 + Σ 1) 1 (µ 1 µ 0 )) 1 2 This is all under the assumption of a gold standard reference test. We propose to extend this to the imperfect reference test case. 7 / 26

Problem setting Optimal combination of biomarkers AUC of optimal combination of biomarkers According to Siu and Liu (1993) the linear combination maximizing AUC is of the form: a Y D=0 N(a µ 0, a Σ 0a) a Y D=1 N(a µ 1, a Σ 1a) For which: a (Σ 0 + Σ 1) 1 (µ 1 µ 0 ) Area Under the ROC Curve: AUC OptComb = Φ { } ((µ 1 µ 0 ) (Σ 0 + Σ 1) 1 (µ 1 µ 0 )) 1 2 This is all under the assumption of a gold standard reference test. We propose to extend this to the imperfect reference test case. 8 / 26

Problem setting Absence of gold-standard reference Underlying versus observed data Ignoring misclassification in imperfect reference test will lead to bias of estimated accuracy: True distributions VS observed data Observed Control (T=0) Observed Disease (T=1) True Control (D=0) True Disease (D=1) In example: conditionally independent misclassification Misclassification in reference test causes skewed observed distributions Goal: retrieve accuracy of true underlying biomarker by observed data 2 0 2 4 6 Biomarker value 9 / 26

Bayesian latent-class mixture model Outline Problem setting Accuracy definition Optimal combination of biomarkers Absence of gold-standard reference Bayesian latent-class mixture model Naive prior definition Controlled prior definition Simulation study Data and model Results Conclusions 10 / 26

Bayesian latent-class mixture model Full data likelihood L(µ 0, µ 1, Σ 0, Σ 1, θ, Se, Sp Y, T, D) ( N { = θse t i (1 Se) (1 t i ) 1 (2π)K Σ EXP 1 } ) d i 1 2 (Yi µ 1 ) Σ 1 1 (Y i µ 1 ) i=1 ( { (1 θ)(1 Sp) t i Sp (1 t i ) 1 (2π)K Σ EXP 1 } ) (1 d i ) 0 2 (Yi µ 0) Σ 1 0 (Y i µ 0 ) 11 / 26

Bayesian latent-class mixture model Naive prior definition Naive prior definition Hyperprior θ Uniform(0.1,0.9) Priors D i Bernoulli(θ) (Observation i: 1,...,N) µ kj N(0,10 6 ) (Disease indicator j: 0, 1; Biomarker k: 1,...,K) Σ 1 j Wish(S,K) (Disease indicator j: 0, 1) with S = VarCov-matrix of observed control group Se = Sp Beta(1,1)T(0.51, ) [Non-informative] OR Se = Sp Beta(10,1.764706)T(0.51, ) [Informative] 12 / 26

Bayesian latent-class mixture model Naive prior definition Se/Sp Beta(10,1.764706) Prior Mean = 0.85 Var = 0.009988479 Equal-tail 95%-probability interval: 0.6078-0.9834 Informative Se/Sp prior Density 0 1 2 3 4 0.5 0.6 0.7 0.8 0.9 1.0 Se or Sp 13 / 26

Bayesian latent-class mixture model Naive prior definition Implied prior AUC Density 0 10 20 30 40 50 Implied AUC prior 0.0 0.2 0.4 0.6 0.8 1.0 AUC Prior specification is used commonly (e.g. O Malley and Zou (2006)) Uninformative mixture component priors lead to prior point mass distribution centred at 1 for AUC Extremely informative prior for component of interest! 14 / 26

Bayesian latent-class mixture model Controlled prior definition Controlled prior definition (AUC) AUC OptComb { } = Φ ((µ 1 µ 0 ) (Σ 0 + Σ 1 ) 1 (µ 1 µ 0 )) 1 2 { } = Φ ((µ 1 µ 0 ) L L(µ 1 µ 0 )) 1 2 With L = the Cholesky factor of (Σ 0 + Σ 1 ) 1 Set = L(µ 1 µ 0 ) N K (κ, Ψ) µ 0k N(0, 10 6 ) (k: 1,...,K) µ 1 = L 1 + µ 0 15 / 26

Bayesian latent-class mixture model Controlled prior definition Implied priors AUC For κ = 0 0, σi = 0.7 and ρ ij = 0.6 0 [i,j: 1,...,K] AUC Kappa=(0.0,0.0,0.0),Sd=(0.7,0.7,0.7),Cor=(0.6,0.6,0.6) Simulated AUC fy 0.0 0.5 1.0 1.5 2.0 2.5 3.0 Expansion appr. Less informative prior distribution for AUC Prior on gives control over informativeness AUC prior Similar issue for Σ 0 and Σ 1 (Wei, Y. & Higgins, P.T., 2013) 0.5 0.6 0.7 0.8 0.9 1.0 Y 16 / 26

Simulation study Outline Problem setting Accuracy definition Optimal combination of biomarkers Absence of gold-standard reference Bayesian latent-class mixture model Naive prior definition Controlled prior definition Simulation study Data and model Results Conclusions 17 / 26

Simulation study Data and model 100 data sets for 3 biomarkers 4 data generating settings R = I K vs R I K Σ 0 = Σ 1 vs Σ 0 Σ 1 Parameter settings N = 100, 400 or 600 θ = 0.5 Se = Sp = 0.85 Mixture component parameters set such that: AUC 1 = AUC 2 = AUC 3 = 0.75 AUC OptimalCombination R=IK = 0.88 AUC OptimalCombination R IK = 0.78 Analysis model: General model fitting 2 unstructured covariance matrices 18 / 26

Simulation study Results AUC Results (Average of median posterior AUC) In cell: Average of median posterior AUC (Standard Error) [Number of converged fits] Data Gen. Sample Size Model Se/Sp True VarCov Corr N100 N400 N600 prior AUC Σ 0 = Σ 1 ρ 0 GS 0.784 0.689 (0.039) [100] 0.695 (0.022) [100] 0.694 (0.018) [100] Σ 0 Σ 1 ρ = 0 GS 0.879 0.756 (0.037) [100] 0.762 (0.019) [100] 0.760 (0.015) [100] Σ 0 Σ 1 ρ 0 GS 0.784 0.693 (0.040) [100] 0.697 (0.022) [100] 0.696 (0.018) [100] Gold Standard model fit leads to significant underestimation 19 / 26

Simulation study Results AUC Results (Average of median posterior AUC) Data Gen. Model Sample Size VarCov Corr Se/Sp True prior AUC N100 N400 N600 Σ 0 = Σ 1 ρ 0 GS 0.784 0.689 (0.039) [100] 0.695 (0.022) [100] 0.694 (0.018) [100] Σ 0 = Σ 1 ρ 0 NINF 0.784 0.812 (0.056) [18] 0.803 (0.034) [61] 0.796 (0.030) [60] Σ 0 Σ 1 ρ = 0 GS 0.879 0.756 (0.037) [100] 0.762 (0.019) [100] 0.760 (0.015) [100] Σ 0 Σ 1 ρ = 0 NINF 0.879 0.879 (0.041) [89] 0.878 (0.022) [100] 0.882 (0.019) [100] Σ 0 Σ 1 ρ 0 GS 0.784 0.693 (0.040) [100] 0.697 (0.022) [100] 0.696 (0.018) [100] Σ 0 Σ 1 ρ 0 NINF 0.784 0.797 (0.047) [72] 0.787 (0.030) [100] 0.785 (0.025) [100] Proposed model with controlled prior leads to unbiased estimates Non-convergence issues for low information data Small sample size Correlation Over-specified VarCov model Increased sample size increases precision 20 / 26

Simulation study Results AUC Results (Average of median posterior AUC) Data Gen. Model Sample Size VarCov Corr Se/Sp True prior AUC N100 N400 N600 Σ 0 = Σ 1 ρ 0 GS 0.784 0.689 (0.039) [100] 0.695 (0.022) [100] 0.694 (0.018) [100] Σ 0 = Σ 1 ρ 0 NINF 0.784 0.812 (0.056) [18] 0.803 (0.034) [61] 0.796 (0.030) [60] Σ 0 = Σ 1 ρ 0 INF 0.784 0.791 (0.053) [33] 0.790 (0.033) [83] 0.782 (0.027) [86] Σ 0 Σ 1 ρ = 0 GS 0.879 0.756 (0.037) [100] 0.762 (0.019) [100] 0.760 (0.015) [100] Σ 0 Σ 1 ρ = 0 NINF 0.879 0.879 (0.041) [89] 0.878 (0.022) [100] 0.882 (0.019) [100] Σ 0 Σ 1 ρ = 0 INF 0.879 0.870 (0.045) [96] 0.876 (0.022) [100] 0.879 (0.019) [100] Σ 0 Σ 1 ρ 0 GS 0.784 0.693 (0.040) [100] 0.697 (0.022) [100] 0.696 (0.018) [100] Σ 0 Σ 1 ρ 0 NINF 0.784 0.797 (0.047) [72] 0.787 (0.030) [100] 0.785 (0.025) [100] Σ 0 Σ 1 ρ 0 INF 0.784 0.786 (0.049) [83] 0.784 (0.029) [100] 0.783 (0.025) [100] Inclusion of prior Se and Sp information No effect on point estimate or precision Positive effect on number of converged data sets 21 / 26

Conclusions Outline Problem setting Accuracy definition Optimal combination of biomarkers Absence of gold-standard reference Bayesian latent-class mixture model Naive prior definition Controlled prior definition Simulation study Data and model Results Conclusions 22 / 26

Conclusions Conclusions Bayesian latent-class mixture model: Takes unknown true disease status into account Incorporates information from reference test while acknowledges imperfectness Provides estimates of accuracy of the reference test Simulation study Model is able to retrieve true AUC when: sample size is adequate sensible non-informative prior is considered Careful prior specification Complex function of uninformative prior distributions => informative prior => biased estimates Controlled prior specification is proposed 23 / 26

Conclusions Further considerations Sensitivity to misspecified Se/Sp prior distribution Extend to incorporate non-normally distributed biomarkers Evaluate impact of conditional independence assumption 24 / 26

Conclusions References O Malley, A.J., Zou, K.H.: Bayesian multivariate hierarchical transformation models for ROC analysis. Statistical Medicine. 25, 459 479 (2006) Su, J.Q., Liu, J.S.: Linear combinations of multiple diagnostic markers. Journal of the American Statistical Association. 88, 1350 1355 (1993) Wei, Y, Higgins, P.T.: Bayesian multivariate meta-analysis with multiple outcomes. Statistics in Medicine (2013) doi: 10.1002/sim.5745 25 / 26

Conclusions Thank you for your attention! 26 / 26