Incidence from Recent Infection Prevalence

Similar documents
TB incidence estimation

An Introduction to Tuberculosis Disease Modeling

Lecture 3. Truncation, length-bias and prevalence sampling

Bayesian Reliability Analysis: Statistical Challenges from Science-Based Stockpile Stewardship

Thursday. Threshold and Sensitivity Analysis

Missing covariate data in matched case-control studies: Do the usual paradigms apply?

TEMPORAL EXPONENTIAL- FAMILY RANDOM GRAPH MODELING (TERGMS) WITH STATNET

Lecture 7 Time-dependent Covariates in Cox Regression

[Part 2] Model Development for the Prediction of Survival Times using Longitudinal Measurements

Multilevel Statistical Models: 3 rd edition, 2003 Contents

Downloaded from:

Models of Infectious Disease Formal Demography Stanford Summer Short Course James Holland Jones, Instructor. August 15, 2005

Survival Analysis. Lu Tian and Richard Olshen Stanford University

Inference for partially observed stochastic dynamic systems: A new algorithm, its theory and applications

The SIR Disease Model Trajectories and MatLab

Person-Time Data. Incidence. Cumulative Incidence: Example. Cumulative Incidence. Person-Time Data. Person-Time Data

Group analysis. Jean Daunizeau Wellcome Trust Centre for Neuroimaging University College London. SPM Course Edinburgh, April 2010

Modelling spatio-temporal patterns of disease

Introduction to SEIR Models

MAS3301 / MAS8311 Biostatistics Part II: Survival

Challenges (& Some Solutions) and Making Connections

Diagnostics. Gad Kimmel

Module 9: Sampling IPDET. Sampling. Intro Concepts Types Confidence/ Precision? How Large? Intervention or Policy. Evaluation Questions

Estimation of Optimally-Combined-Biomarker Accuracy in the Absence of a Gold-Standard Reference Test

Group Sequential Tests for Delayed Responses. Christopher Jennison. Lisa Hampson. Workshop on Special Topics on Sequential Methodology

Pump failure data. Pump Failures Time

REGRESSION ANALYSIS FOR TIME-TO-EVENT DATA THE PROPORTIONAL HAZARDS (COX) MODEL ST520

Approximate Bayesian computation: methods and applications for complex systems

Stat 476 Life Contingencies II. Multiple Life and Multiple Decrement Models

Discussion of Tests of Equal Predictive Ability with Real-Time Data by T. E. Clark and M.W. McCracken

Survival Analysis I (CHL5209H)

Neutral Bayesian reference models for incidence rates of (rare) clinical events

Statistics 262: Intermediate Biostatistics Non-parametric Survival Analysis

Continuous case Discrete case General case. Hazard functions. Patrick Breheny. August 27. Patrick Breheny Survival Data Analysis (BIOS 7210) 1/21

Princeton University Press, all rights reserved. Chapter 10: Dynamics of Class-Structured Populations

Step-Stress Models and Associated Inference

Pubh 8482: Sequential Analysis

Statistical concepts in climate research

Supplement to TB in Canadian First Nations at the turn-of-the twentieth century

Tackling Statistical Uncertainty in Method Validation

Measurement Error in Spatial Modeling of Environmental Exposures

An Intuitive Introduction to Motivic Homotopy Theory Vladimir Voevodsky

Performance of INLA analysing bivariate meta-regression and age-period-cohort models

Mathematical Statistics

PubH 5450 Biostatistics I Prof. Carlin. Lecture 13

Factor Analytic Models of Clustered Multivariate Data with Informative Censoring (refer to Dunson and Perreault, 2001, Biometrics 57, )

Bias Correction in Classification Tree Construction ICML 2001

Cluster investigations using Disease mapping methods International workshop on Risk Factors for Childhood Leukemia Berlin May

Estimating the Exponential Growth Rate and R 0

Round and Round the Circle

OUTBREAK INVESTIGATION AND SPATIAL ANALYSIS OF SURVEILLANCE DATA: CLUSTER DATA ANALYSIS. Preliminary Programme / Draft_2. Serbia, Mai 2013

Ronald Christensen. University of New Mexico. Albuquerque, New Mexico. Wesley Johnson. University of California, Irvine. Irvine, California

A Generic Multivariate Distribution for Counting Data

Survival Analysis I (CHL5209H)

Models of Infectious Disease Formal Demography Stanford Spring Workshop in Formal Demography May 2008

Empirical Bayes Moderation of Asymptotically Linear Parameters

Frailty Modeling for clustered survival data: a simulation study

Part III. Hypothesis Testing. III.1. Log-rank Test for Right-censored Failure Time Data

Chapter 2: Describing Contingency Tables - I

A multi-state model for the prognosis of non-mild acute pancreatitis

Test Strategies for Experiments with a Binary Response and Single Stress Factor Best Practice

Uncertainty Propagation

Detailed Balance and Branching Processes Monday, October 20, :04 PM

The Spreading of Epidemics in Complex Networks

Algorithmisches Lernen/Machine Learning

Healthcare and Consumption with Aging

Introduction to Bayesian Inference

Constrained estimation for binary and survival data

Mediation: Background, Motivation, and Methodology

Mathematical Analysis of Epidemiological Models: Introduction

Time-dependent covariates

Bayesian Inference on Joint Mixture Models for Survival-Longitudinal Data with Multiple Features. Yangxin Huang

Chapter 2 - Survival Models

Introduction to risk assessment

Understanding the contribution of space on the spread of Influenza using an Individual-based model approach

A Bayesian Nonparametric Approach to Causal Inference for Semi-competing risks

Chapter Three. Hypothesis Testing

Naïve Bayes Classifiers and Logistic Regression. Doug Downey Northwestern EECS 349 Winter 2014

Stephen Scott.

Mathematical Analysis of Epidemiological Models III

Statistical Analysis of List Experiments

Dynamic pair formation models

CTDL-Positive Stable Frailty Model

Double-Sampling Designs for Dropouts

CROSS SECTIONAL STUDY & SAMPLING METHOD

A Note on the Spread of Infectious Diseases. in a Large Susceptible Population

Testing a secondary endpoint after a group sequential test. Chris Jennison. 9th Annual Adaptive Designs in Clinical Trials

Introduction to Statistical Analysis

Mathematical modelling and controlling the dynamics of infectious diseases

Cost and Preference in Recommender Systems Junhua Chen LESS IS MORE

DEVELOPMENT OF DATA SETS ON JOINT CHARACTERISTICS AND CONSIDERATION OF ASSOCIATED INSTABILITY FOR A TYPICAL SOUTH AFRICAN GOLD MINE

Climate Response Across Time Scales as Inferred From the Proxy Record

UNIVERSITY OF CALIFORNIA, SAN DIEGO

Topics in Current Status Data. Karen Michelle McKeown. A dissertation submitted in partial satisfaction of the. requirements for the degree of

Mixed effects and Group Modeling for fmri data

Approximation of Survival Function by Taylor Series for General Partly Interval Censored Data

GEOL212 Due 10/1/18 Homework V

Censoring mechanisms

Benchmarking the Hartree-Fock and Hartree-Fock-Bogoliubov approximations to level densities. G.F. Bertsch, Y. Alhassid, C.N. Gilbreth, and H.

ADVANCED STATISTICAL ANALYSIS OF EPIDEMIOLOGICAL STUDIES. Cox s regression analysis Time dependent explanatory variables

Transcription:

(with Norman Ives and Tom McWalter) School of Computational and Applied Mathematics, University of the Witwatersrand and DST/NRF Centre of Excellence in Epidemiological Modelling and Analysis (SACEMA) DIMACS/SACEMA Workshop 25 July 2007

Outline of Talk 1 Introduction 2 3 4 5 6

Motivation Challenges 1 Introduction Motivation Challenges 2 3 4 5 6

Motivation Introduction Motivation Challenges Determination of epidemiological trends requires knowledge of both incidence and prevalence. This is required for: Allocating resources Evaluating interventions Planning cohort based studies

Motivation Challenges Measuring Incidence is Problematic Gold Standard of cohort follow up is: Expensive Slow Not free from bias Intervention Loss to follow up

Alternatives to Gold Standard Motivation Challenges Incidence from snapshots of recent infection prevalence Many pasts can produce one present Problem of calibrating recent Forever recent? Sample sizes

Disease Progression and Survival Relating History to Present 1 Introduction 2 Disease Progression and Survival Relating History to Present 3 4 5 6

Stages of Infection Introduction Disease Progression and Survival Relating History to Present As observed through two assays of different sensitivity, infected individuals experience the following events: 0 The moment of infection, at t 1 Infection becomes detectable by assay one, at t + t 1 2 Infection becomes detectable by assay two, at t + t 2 3 Death, from whatever cause, at t + t 3

Counting Infectees Introduction Disease Progression and Survival Relating History to Present All historically infected individuals N i (0) = 0 λ(t) dt = 0 I(t)N s (t) dt After infection at time t < 0 the probability of a being alive and classified as recently infected at time zero is given by P { (t 1 < t) AND (t 2 > t) AND (t 3 > t) } = t 0 t t ρ(t 1, t 2, t 3 ) dt 3 dt 2 dt 1 The probability of a being alive and classified as not recently infected at time zero is given by P { (t 2 < t) AND (t 3 > t) }

Disease Progression and Survival Relating History to Present Recently and Long Infected Individuals Counts of recently and long infected individuals observed in a cross-sectional survey R = 0 I(t)N s (t)p { (t 1 < t) AND (t 2 > t) AND (t 3 > t) } dt L = 0 I(t)N s (t)p { (t 2 < t) AND (t 3 > t) } dt

What Can We Hope to Know? Modelling the Forever Recent Infection Equilibrium Population Dynamics Beyond Equilibrium Population Dynamics 1 Introduction 2 3 What Can We Hope to Know? Modelling the Forever Recent Infection Equilibrium Population Dynamics Beyond Equilibrium Population Dynamics 4 5 6

Simplifications Introduction What Can We Hope to Know? Modelling the Forever Recent Infection Equilibrium Population Dynamics Beyond Equilibrium Population Dynamics In order to infer incidence from observables, we need to make numerous simplifications of the general model. Redefine incidence to start at t 1. Factorise ρ(t 2, t 3 ) into ρ r (t 2 )ρ a (t 3 ) (independent events). Use only a few moments of ρ s (preferably just the means). Now define the survival times S r (t) = 1 S a (t) = 1 t 0 t 0 ρ r (s) ds ρ a (s) ds

What Can We Hope to Know? Modelling the Forever Recent Infection Equilibrium Population Dynamics Beyond Equilibrium Population Dynamics Which Incidence Can Possibly Be Measured? If I(t) is not constant then define a weighted average I w = 0 I(t)W (t) dt 0 W (t) dt where W (t) is a statistical weight implied by the sampling methodology and formal assumptions. It will be useful to consider in which case 0 W (t) = S a ( t)s r ( t) = S ar ( t) W (t) dt = 0 S ar ( t) dt = E [τ r ]

When Recent is Forever What Can We Hope to Know? Modelling the Forever Recent Infection Equilibrium Population Dynamics Beyond Equilibrium Population Dynamics BED assay may be best bet for a practical definition of recent. But: not everyone leaves this category. Note symmetry!

General Practical Model What Can We Hope to Know? Modelling the Forever Recent Infection Equilibrium Population Dynamics Beyond Equilibrium Population Dynamics Counts of infection categories R t = 0 L = (1 P ) R f = I(t)N s (t)s r ( t)s a ( t) dt 0 P (1 P ) L Unbiased estimation of R t is possible! R t = R o I(t)N s (t)(1 S r ( t))s a ( t) dt P (1 P ) L

Constant Number of Susceptibles What Can We Hope to Know? Modelling the Forever Recent Infection Equilibrium Population Dynamics Beyond Equilibrium Population Dynamics If N s (t) = n 0 then R t = n 0 0 I(t)S ar( t) dt E [τ r ] E [τ r ] = n 0 I ar E [τ r ] Hence I ar = R t n 0 E [τ r ] is an unbiased estimator.

When N s (t) is Non-Constant What Can We Hope to Know? Modelling the Forever Recent Infection Equilibrium Population Dynamics Beyond Equilibrium Population Dynamics It is not possible to obtain unbiased estimator for I w with non-constant I(t) and non-constant N s (t). Need to assume specific functional forms for N s (t) and I(t). Can infer up to two parameters in I(t), N s (t). In the case where I(t) = i 0 and N s (t) = n 0 + n 1 t we recover I w = i 0 = R t n 0 E [τ r ] n 1 2 E [τ 2 r ]

What Can We Hope to Know? Modelling the Forever Recent Infection Equilibrium Population Dynamics Beyond Equilibrium Population Dynamics When N s (t) and I(t) are Both Non-constant Even the simple case of N s (t) = n 0 + n 1 t and I(t) = i 0 + i 1 t requires calibration for E [ τr 2 ] [ ], E τ 3 r and E [τa ], which seems impossible. But: we can use a simple formula and then see how wrong it is when we violate the assumptions under which it would be correct.

Analytical Simulations 1 Introduction 2 3 4 Analytical Simulations 5 6

Closed-Form Calculation of Bias Analytical Simulations Declare I(t), N s (t), ρ a (t) and ρ r (t). Evaluate I ar from knowledge of declared inputs. Evaluate naive formula to estimate I: Compare I ar and I est. I est = R t E [τ r ] N s (0) Note bias as a function of parameters which break assumption in naive formula.

Analytical Simulations 0.35 I(t) = 0.1 + 0.04t 0.3 0.25 Relative error in Iar 0.2 0.15 0.1 0.05 0 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 n 1 /n 0, where Ns(t) = n 0 +n 1 t

Analytical Simulations 0.06 I(t) = I 0 2 t 0.05 Relative error in Iar 0.04 0.03 0.02 0.01 0 0 0.02 0.04 0.06 0.08 0.1 0.12 0.14 0.16 0.18 0.2 r, where Ns(t)=n 0 e rt

Matlab Simulations Introduction Analytical Simulations Produce infection events according to λ(t) = N s (t)i(t). Obtain samples after burn-in time. Confirms analytical results. Can explore sample-path specific I w. Can combine sample size effects with sample path effects. Future implementation of hierarchical Bayesian inference.

Analytical Simulations x Population 104 8 Long infected True recents 7 False recents 6 5 4 3 2 1 0 0 2 4 6 8 10 12 14 16 18 20 0.12 0.1 Naive incidence (total population) Naive incidence (sample population) Realised incidence Platonic incidence Incidence 0.08 0.06 0.04 0.02 0 0 2 4 6 8 10 12 14 16 18 20

Sample Sizes Calibration 1 Introduction 2 3 4 5 Sample Sizes Calibration 6

How Big Must Surveys Be? Sample Sizes Calibration Precision limiting step in a survey is the count of recent infections. Need to find tens of recent infections for reasonable precision. Example: suppose we wish to distinguish I = 0.05 confidently from I = 0.04, then: BED definition of recent leads to N 5000. RNA+/AB- definition of recent leads to N 100000.

Sample Sizes Calibration Determining S ar (t) and P Most important parameter is P. Need to observe this failure to progress several tens of times in a calibration study. Need to estimate the mean of S ar (t) in a cohort censored for progression. Non equilibrium formulas require estimation of long term survival times or short term standard deviations.

1 Introduction 2 3 4 5 6

Long Term Incidence Surveillance Can t run from the need for incidence surveillance. There is no magic bullet. BED type definition is perhaps the best short term option. Just discussed one-shot methods. Need to set up open ended calibration. Need to manage multi-survey meta analysis to: construct informative priors ; calibrate demographic/epidemiological parameters.