Sample size considerations for precision medicine
|
|
- Brittney Shepherd
- 5 years ago
- Views:
Transcription
1 Sample size considerations for precision medicine Eric B. Laber Department of Statistics, North Carolina State University November 28
2 Acknowledgments Thanks to Lan Wang NCSU/UNC A and Precision Medicine Lab PR Department of Natural Resources National Science Foundation National nstitutes of Health Joint work with Eric Rose Marie Davidian Butch Tsiatis Michael Kosorok Students in lab
3 Precision medicine The right treatment for the right patient at the right time. Mantra of personalized medicine advocates Widely recognized that best clinical care requires treatment decisions tailored to individual patient characteristics mprove patient outcomes, reduce cost and patient burden Treatment regimes Formalize clinical decision making as sequence of decision rules One rule per stage of clinical intervention Maps current patient info to recommended treatment Optimal regime maximizes the mean of some cumulative clinical outcome if applied to population of interest / 3
4 Ex. Treatment regime: mhealth for PTSD in cancer patients (P S. Smith) First stage decision rule f distress 3 then: Cancer Distress Coach (CDC) Else if PTSD symptom score 2 then: CDC Else: usual care Second stage decision rule f responder then: continue first stage treatment Else if using CDC and PSTD change 3 then: add mcoaching Else if using CDC and distress 4 then: add FaceTime CBT Else FaceTime CBT only 2 / 3
5 Ex. Trial design: mhealth for PTSD in cancer Continue patients (P S. Smith) Distress coach Yes Treatment AA Add mcoaching Treatment A Distress Coach Response? Treatment AB No Facetime CBT R R Continue Follow-up only Yes Treatment BA DC + mcoaching Treatment B Standard Care Response? Treatment BB No Facetime CBT R 3 / 3
6 Sample size calculations for precision medicine Sample size typically based on primary aim Power for simple hypotheses, e.g., compare mean outcome under two non-overlapping treatment sequences t-test Clear criterion for power calculations Test statistics are standard Estimation of treatment regime typically secondary analysis Composite hypotheses, high-dim nuisance parameters Unclear what criteria should be used Test statistics are non-standard 4 / 3
7 State-of-the-art: estimate and validate Estimate optimal regime Run SMART Run follow-up RCT Continue Est. opt. regime Distress coach Yes Add mcoaching Treatment A Distress Coach Response? Treatment AB No Facetime CBT R R Continue Follow-up only Yes Treatment BA DC + mcoaching Treatment B Standard Care π bn Treatment AA Response? Treatment BB No Facetime CBT π bn R Primary analyses Eval via CV R Control Control Standard hypothesis test 5 / 3
8 Estimate and validate: discussion Sound scientific premise but potentially inefficient Requires two distinct clinical trials Higher cost Delayed clinical impact π bn may be of low-quality Poor patient outcomes Obscure new clinical knowledge deal: run single trial that allows primary aims and estimation and evaluation of optimal treatment regime 6 / 3
9 Perception of power and precision medicine dentifying subgroups has negative connotations Data snooping Lack of reproducibility/ generalizability Lack of statistical rigor Widely held that sizing trial to estimate opt treatment regime Would require an enormous sample size Cannot guarantee frequentist operating characteristics ntractable without strong assumptions 7 / 3
10 Perception of power and precision medicine dentifying subgroups has negative connotations Data snooping Lack of reproducibility/ generalizability Lack of statistical rigor Widely held that sizing trial to estimate opt treatment regime Would require an enormous sample size Cannot guarantee frequentist operating characteristics ntractable without strong assumptions 8 / 3
11 Preview of remainder of this talk Goal: sample size procedures to ensure estimation and evaluation of opt regime with performance guarantees Develop sample size procedures for two extremes Strong distributional assumptions which (implicitly) exclude non-standard test statistics Non-parametric model and asymptotic approximations that allow for non-standard test statistics Many intermediate procedures between these extremes 9 / 3
12 Outline Estimation of optimal treatment regimes SMART sample size formulae via parametric models SMART sample size formulae under minimal assumptions Simulation experiments
13 Outline Estimation of optimal treatment regimes SMART sample size formulae via parametric models SMART sample size formulae under minimal assumptions Simulation experiments
14 Setup and notation Observe {(X,i, A,i, X 2,i, A2,i, Yi )}ni=, i.i.d. from P X Rp A {, } : first treatment : baseline subj. info. X 2 Rp A2 {, } : second treatment Y R : outcome, higher is better 2 : interim subj. info. during course of A H = X, H 2 = (X, A, X 2) Define history Treatment regime π = (π, π2 ) where πt : supp H t supp At, patient presenting with H t = ht assigned treatment πt (ht ) / 3
15 Characterizing optimal treatment regime π ) = Eπ Y Optimal regime maximizes value V (π Define Q-functions Q2 (h 2, a2 ) = E Y H 2 = h 2, A2 = a2 Q (h, a ) = E max Q2 (H 2, a2 ) H = h, A = a a2 Dynamic programming (Bellman, 957) πtopt (h t ) = arg maxat Qt (h t, at ) / 3
16 Q-learning Assume estimation via Q-learning with linear models Regression-based dynamic programming algorithm (Q) Postulate working models for Q-functions Qt (h t, at ; βt ) = h t, βt, + at h t, βt,, h t,, h t, features of ht 2 (Q) Compute βb2,n = arg minβ2 Pn {Y Q2 (H 2, A2 ; β2 )} (Q2) Compute n o2 βb,n = arg minβ Pn maxa2 Q2 (H 2, A2 ; βb2,n ) Q (H, A ; β ) (Q3) π bt,n (h t ) = arg maxat Qt (h t, at ; βbt,n ) 2 / 3
17 Sample size criteria Let B >, γ, α, η,, ζ (, ) be fixed Want to choose sample size n so that π opt ) B with power at (PWR) there exists an α-level test of H : V (π π opt ) B + η least ( γ) + o() provided V (π π opt ) } ζ + o() (OPT) P {V (b π n ) V (π 3 / 3
18 Outline Estimation of optimal treatment regimes SMART sample size formulae via parametric models SMART sample size formulae under minimal assumptions Simulation experiments
19 Approach one: impose structure Assume the generative model satisfies + a2 h 2, β2, (AN) Q2 (h 2, a2 ) = h 2, β2, (AN2) E H2, β2, H = h, A = a + a h, ξ, = h, ξ, β2, = H,2 $,2 + A H,3 $,3 + τ Z, where (AN3) H2, Z Normal(, ) ind error (AN4) H, ξ,, H, ξ,, H,2 $,2, H,3 $,3 Normal(ω, Ω ) 4 / 3
20 Approach one: impose structure Assume the generative model satisfies + a2 h 2, β2, (AN) Q2 (h 2, a2 ) = h 2, β2, (AN2) E H2, β2, H = h, A = a + a h, ξ, = h, ξ, β2, = H,2 $,2 + A H,3 $,3 + τ Z, where (AN3) H2, Z Normal(, ) ind error (AN4) H, ξ,, H, ξ,, H,2 $,2, H,3 $,3 Normal(ω, Ω ) Note* implicitly make technical assumptions including compactness of parameter space, existence of requisite moments etc. 4 / 3
21 Approach one: impose structure Assume the generative model satisfies + a2 h 2, β2, (AN) Q2 (h 2, a2 ) = h 2, β2, (AN2) E H2, β2, H = h, A = a + a h, ξ, = h, ξ, β2, = H,2 $,2 + A H,3 $,3 + τ Z, where (AN3) H2, Z Normal(, ) ind error (AN4) H, ξ,, H, ξ,, H,2 $,2, H,3 $,3 Normal(ω, Ω ) Note* implicitly make technical assumptions including compactness of parameter space, existence of requisite moments etc. 4 / 3
22 Characterizing the optimal regime Define g : R4 R as g (v ) = max ρ {,} (v + ρv2 )2 v3 + ρv4 + exp 2 2π + (v + ρv2 ) { 2Φ (v + ρv2 )} Lemma Assume (A)-(A4). Let ψ denote normal density, then Z π opt ) = ν {τ, ω, vech(ω )} = V (π τ g (v /τ ) ψ(v ; ω, Ω )d v. R4 5 / 3
23 π opt ) B Sizing for power to reject H : V (π Lemma π opt ). Assume (AN)-(AN4). Let Vbn be plug-in estimator of V (π Then, o n π opt ) n Vbn V (π Normal(, σ 2 ), where σ 2 is a smooth functional of generative model. 6 / 3
24 π opt ) B Sizing for power to reject H : V (π Lemma π opt ). Assume (AN)-(AN4). Let Vbn be plug-in estimator of V (π Then, o n π opt ) n Vbn V (π Normal(, σ 2 ), where σ 2 is a smooth functional of generative model. Corollary Assume (AN)-(AN4) and let σ bn2 be a consistent estimator of σ 2. b σn zα is an Then a test that rejects when n Vn B /b π opt ) B. Moreover, (asymptotic) α-level test of H : V (π choosing l m 2 n = (σ 2 /η 2 ) Φ ( γ) + zα satisfies (PWR). 6 / 3
25 Sizing to ensure high-quality estimation of π opt Lemma Assume (AN)-(AN4). Then under mild regularity conditions there exists K, δ > such that π opt ) Knδ Vbn V (π π opt ) + op (/ n). V (b π n ) V (π Subexpontential tail bounds and a bounded density for the treatment effect at each stage are sufficient. 7 / 3
26 Sizing to ensure high-quality estimation of π opt Lemma Assume (AN)-(AN4). Then under mild regularity conditions there exists K, δ > such that π opt ) Knδ Vbn V (π π opt ) + op (/ n). V (b π n ) V (π Corollary Assume (A)-(AN4). Then under mild regularity conditions & 2 ' Φ ( ζ)σ n= satisfies (OPT). Subexpontential tail bounds and a bounded density for the treatment effect at each stage are sufficient. 7 / 3
27 Sample size formulae via parametric models Reduce to simple formulae Linear mean models and normal error Not vacuous Can be checked empirically Can be relaxed/extended Requires elicitation and/or historical data to estimate σ 2 8 / 3
28 Outline Estimation of optimal treatment regimes SMART sample size formulae via parametric models SMART sample size formulae under minimal assumptions Simulation experiments
29 Q-learning (again) Assume estimation via Q-learning with linear models Regression-based dynamic programming algorithm (Q) Postulate working models for Q-functions Qt (h t, at ; βt ) = h t, βt, + at h t, βt,, h t,, h t, features of ht 2 (Q) Compute βb2,n = arg minβ2 Pn {Y Q2 (H 2, A2 ; β2 )} (Q2) Compute n o2 βb,n = arg minβ Pn maxa2 Q2 (H 2, A2 ; βb2,n ) Q (H, A ; β ) (Q3) π bt,n (h t ) = arg maxat Qt (h t, at ; βbt,n ) Population parameters βt obtained by replacing Pn with P; define π Q,opt (h t ) = arg maxat Qt (h t, at ; βt ) t 9 / 3
30 nference and Q-learning mpose less structure on generative model Only impose moment and tail conditions Do not assume postulated models are correct Both (PWR) and (OPT) are meaningful under misspecification π Q,opt ) nonsmooth functionals of Both β and V (π data-generating model No regular asymptotically unbiased estimators exist Standard normal approximations can perform poorly Base sample size on conservative interval estimators 2 / 3
31 (Nonpar) characterizing the optimal regime For each β = (β, β2 ) define (β) = 4Y A H, β, > A 2 H 2, β2, > + A (H 2, A2, β), where A(H 2, A2, β) mean zero aug term (Tsiatis 26) Let V Q (β) = P (β) and Vb Q (β) = Pn (β) π Q,opt ) VbnQ (βbn ) is APWE of V Q (β ) = V (π Estimators consistent even if Q-functions misspecified 2 / 3
32 Projection test Define ςn2 (β) = P { (β) P (β)}2 then o n bq n V (β) V Q (β) Normal(, ) Zn (β) = ςbn (β) Let Ξn,ϑ be a ( ϑ ) % confidence region for β 2 then a test that rejects when zϑ2 ςbn (β) Q b inf Vn (β) B β Ξn,ϑ n has type error no more than α = ϑ + ϑ2 + o() under π Q,opt ) B H : V (π 2 See L. et al. 24; Robins / 3
33 Power calculation Power of projection test P inf β Ξn,ϑ [Zn (β) + Nn (β)] zϑ2 nη P inf zϑ2 Zn (β) + min Nn (β), β Ξn,ϑ ςbn (β) where Nn (β) = Q n V (β) B /b ςn (β) No closed form for power of projection test Assume pilot data of size n available Use bootstrap oversampling to estimate power Solve for sample size that attains nominal power 23 / 3
34 Power calculation cont d Bootstrap resample size n from n (b) Pn,n bootstrap empirical distn f Wn = f (P, Pn ) then Wn,n = f (Pn, Pn,n ) (b) (b) Bootstrap sample size for (PWR) is smallest integral n such that PB " inf β Ξ(b) ( (b) (b) Zn,n (β) + min Nn,n (β), n,n,ϑ nη (b) ςbn,n (β) )# zϑ2 exceeds nominal power ( γ) 24 / 3
35 Consistency of bootstrap oversampling Theorem Let ϑ (, ) be fixed, κ, K >, s R be arbitrary, and ηn = η + s/ n. Then, under moment conditions ( lim P n,n sup PB v K P " inf (b),n,ϑ β Ξn (b) Z(b) n,n (β) + min Nn,n (β), nηn Zn (β) + min Nn (β), v β Ξn,ϑ ςbn (β) inf ( nηn (b) )# ) v ςbn,n (β)! >κ =. 25 / 3
36 Sample size formulae with minimal assumptions Projection test + bootstrap oversampling Consistent estimation of power curve (PWR) Analogous procedure for (OPT) (not shown) Did not assume parametric or correctly specified model ncreased complexity Nuisance parameters difficult to elicit pilot/hist data 26 / 3
37 Outline Estimation of optimal treatment regimes SMART sample size formulae via parametric models SMART sample size formulae under minimal assumptions Simulation experiments
38 Simulation experiment Overview of generative model From L. et al. (24), focus on non-standard cases Linear at second stage Normal errors Correct and incorrectly specified Q-functions Consider pilot data of size n = 5 Additional details Level α =.5 Desired power ( γ) % = 9% Probability of being near-optimal ( ζ) =.9 Sample size selected so that both (PWR) and (OPT) hold 27 / 3
39 Simulation experiments cont d Compare parametric and nonparametric sample size procedure with sample size required for comparing fixed regimes (t-test) is difference between mean outcome under optimal regime and best fixed (non-adaptive) regime in η units = represents worst case tailoring has no benefit Consider two methods for selecting σ Known Estimated from pilot/historical data (n = 5) 28 / 3
40 Method for σ (PWR) (OPT) Correctly specified Misspecified Results: parametric sample size formulae n fixed En SDn / 3
41 Method for σ (PWR) (OPT) Correctly specified Misspecified Results: parametric sample size formulae n fixed En SDn / 3
42 Method for σ (PWR) (OPT) Correctly specified Misspecified Results: parametric sample size formulae n fixed En SDn / 3
43 Method for σ (PWR) (OPT) Correctly specified Misspecified Results: parametric sample size formulae n fixed En SDn / 3
44 Method for σ (PWR) (OPT) Correctly specified Misspecified Results: parametric sample size formulae n fixed En SDn / 3
45 (PWR) (OPT) n fixed En SDn n = NA P. Correct P. ncorrect Results: nonparametric sample size formulae / 3
46 (PWR) (OPT) n fixed En SDn n = NA P. Correct P. ncorrect Results: nonparametric sample size formulae / 3
47 (PWR) (OPT) n fixed En SDn n = NA P. Correct P. ncorrect Results: nonparametric sample size formulae / 3
48 Discussion Sample size procedures possible for precision medicine Not excessively conservative* Attain nominal frequentist operating characteristics Range of assumptions on generative model Canonical information requirement in sample size procedures Pilot/historical data Elicitation Modeling assumptions Presented first steps, many interesting open problems! 3 / 3
49 Thank you. laber-labs.com
A Sampling of IMPACT Research:
A Sampling of IMPACT Research: Methods for Analysis with Dropout and Identifying Optimal Treatment Regimes Marie Davidian Department of Statistics North Carolina State University http://www.stat.ncsu.edu/
More informationEstimation of Optimal Treatment Regimes Via Machine Learning. Marie Davidian
Estimation of Optimal Treatment Regimes Via Machine Learning Marie Davidian Department of Statistics North Carolina State University Triangle Machine Learning Day April 3, 2018 1/28 Optimal DTRs Via ML
More informationSet-valued dynamic treatment regimes for competing outcomes
Set-valued dynamic treatment regimes for competing outcomes Eric B. Laber Department of Statistics, North Carolina State University JSM, Montreal, QC, August 5, 2013 Acknowledgments Zhiqiang Tan Jamie
More informationRobustifying Trial-Derived Treatment Rules to a Target Population
1/ 39 Robustifying Trial-Derived Treatment Rules to a Target Population Yingqi Zhao Public Health Sciences Division Fred Hutchinson Cancer Research Center Workshop on Perspectives and Analysis for Personalized
More informationImproving Efficiency of Inferences in Randomized Clinical Trials Using Auxiliary Covariates
Improving Efficiency of Inferences in Randomized Clinical Trials Using Auxiliary Covariates Anastasios (Butch) Tsiatis Department of Statistics North Carolina State University http://www.stat.ncsu.edu/
More informationGroup Sequential Designs: Theory, Computation and Optimisation
Group Sequential Designs: Theory, Computation and Optimisation Christopher Jennison Department of Mathematical Sciences, University of Bath, UK http://people.bath.ac.uk/mascj 8th International Conference
More informationRobust Backtesting Tests for Value-at-Risk Models
Robust Backtesting Tests for Value-at-Risk Models Jose Olmo City University London (joint work with Juan Carlos Escanciano, Indiana University) Far East and South Asia Meeting of the Econometric Society
More informationST745: Survival Analysis: Nonparametric methods
ST745: Survival Analysis: Nonparametric methods Eric B. Laber Department of Statistics, North Carolina State University February 5, 2015 The KM estimator is used ubiquitously in medical studies to estimate
More informationBandits : optimality in exponential families
Bandits : optimality in exponential families Odalric-Ambrym Maillard IHES, January 2016 Odalric-Ambrym Maillard Bandits 1 / 40 Introduction 1 Stochastic multi-armed bandits 2 Boundary crossing probabilities
More informationWeb-based Supplementary Materials for A Robust Method for Estimating. Optimal Treatment Regimes
Biometrics 000, 000 000 DOI: 000 000 0000 Web-based Supplementary Materials for A Robust Method for Estimating Optimal Treatment Regimes Baqun Zhang, Anastasios A. Tsiatis, Eric B. Laber, and Marie Davidian
More informationTargeted Group Sequential Adaptive Designs
Targeted Group Sequential Adaptive Designs Mark van der Laan Department of Biostatistics, University of California, Berkeley School of Public Health Liver Forum, May 10, 2017 Targeted Group Sequential
More informationST495: Survival Analysis: Hypothesis testing and confidence intervals
ST495: Survival Analysis: Hypothesis testing and confidence intervals Eric B. Laber Department of Statistics, North Carolina State University April 3, 2014 I remember that one fateful day when Coach took
More informationProgram Evaluation with High-Dimensional Data
Program Evaluation with High-Dimensional Data Alexandre Belloni Duke Victor Chernozhukov MIT Iván Fernández-Val BU Christian Hansen Booth ESWC 215 August 17, 215 Introduction Goal is to perform inference
More informationInference on distributions and quantiles using a finite-sample Dirichlet process
Dirichlet IDEAL Theory/methods Simulations Inference on distributions and quantiles using a finite-sample Dirichlet process David M. Kaplan University of Missouri Matt Goldman UC San Diego Midwest Econometrics
More informationPubh 8482: Sequential Analysis
Pubh 8482: Sequential Analysis Joseph S. Koopmeiners Division of Biostatistics University of Minnesota Week 12 Review So far... We have discussed the role of phase III clinical trials in drug development
More informationOptimal Treatment Regimes for Survival Endpoints from a Classification Perspective. Anastasios (Butch) Tsiatis and Xiaofei Bai
Optimal Treatment Regimes for Survival Endpoints from a Classification Perspective Anastasios (Butch) Tsiatis and Xiaofei Bai Department of Statistics North Carolina State University 1/35 Optimal Treatment
More informationPersonalized Treatment Selection Based on Randomized Clinical Trials. Tianxi Cai Department of Biostatistics Harvard School of Public Health
Personalized Treatment Selection Based on Randomized Clinical Trials Tianxi Cai Department of Biostatistics Harvard School of Public Health Outline Motivation A systematic approach to separating subpopulations
More informationQED. Queen s Economics Department Working Paper No Hypothesis Testing for Arbitrary Bounds. Jeffrey Penney Queen s University
QED Queen s Economics Department Working Paper No. 1319 Hypothesis Testing for Arbitrary Bounds Jeffrey Penney Queen s University Department of Economics Queen s University 94 University Avenue Kingston,
More informationAdaptive Designs: Why, How and When?
Adaptive Designs: Why, How and When? Christopher Jennison Department of Mathematical Sciences, University of Bath, UK http://people.bath.ac.uk/mascj ISBS Conference Shanghai, July 2008 1 Adaptive designs:
More informationImplementing Precision Medicine: Optimal Treatment Regimes and SMARTs. Anastasios (Butch) Tsiatis and Marie Davidian
Implementing Precision Medicine: Optimal Treatment Regimes and SMARTs Anastasios (Butch) Tsiatis and Marie Davidian Department of Statistics North Carolina State University http://www4.stat.ncsu.edu/~davidian
More informationExtending causal inferences from a randomized trial to a target population
Extending causal inferences from a randomized trial to a target population Issa Dahabreh Center for Evidence Synthesis in Health, Brown University issa dahabreh@brown.edu January 16, 2019 Issa Dahabreh
More informationIntroduction to Empirical Processes and Semiparametric Inference Lecture 01: Introduction and Overview
Introduction to Empirical Processes and Semiparametric Inference Lecture 01: Introduction and Overview Michael R. Kosorok, Ph.D. Professor and Chair of Biostatistics Professor of Statistics and Operations
More informationGroup Sequential Tests for Delayed Responses. Christopher Jennison. Lisa Hampson. Workshop on Special Topics on Sequential Methodology
Group Sequential Tests for Delayed Responses Christopher Jennison Department of Mathematical Sciences, University of Bath, UK http://people.bath.ac.uk/mascj Lisa Hampson Department of Mathematics and Statistics,
More information11. Bootstrap Methods
11. Bootstrap Methods c A. Colin Cameron & Pravin K. Trivedi 2006 These transparencies were prepared in 20043. They can be used as an adjunct to Chapter 11 of our subsequent book Microeconometrics: Methods
More informationLecture 8: Information Theory and Statistics
Lecture 8: Information Theory and Statistics Part II: Hypothesis Testing and I-Hsiang Wang Department of Electrical Engineering National Taiwan University ihwang@ntu.edu.tw December 23, 2015 1 / 50 I-Hsiang
More informationReader Reaction to A Robust Method for Estimating Optimal Treatment Regimes by Zhang et al. (2012)
Biometrics 71, 267 273 March 2015 DOI: 10.1111/biom.12228 READER REACTION Reader Reaction to A Robust Method for Estimating Optimal Treatment Regimes by Zhang et al. (2012) Jeremy M. G. Taylor,* Wenting
More informatione author and the promoter give permission to consult this master dissertation and to copy it or parts of it for personal use. Each other use falls
e author and the promoter give permission to consult this master dissertation and to copy it or parts of it for personal use. Each other use falls under the restrictions of the copyright, in particular
More informationOptimising Group Sequential Designs. Decision Theory, Dynamic Programming. and Optimal Stopping
: Decision Theory, Dynamic Programming and Optimal Stopping Christopher Jennison Department of Mathematical Sciences, University of Bath, UK http://people.bath.ac.uk/mascj InSPiRe Conference on Methodology
More informationHypothesis testing (cont d)
Hypothesis testing (cont d) Ulrich Heintz Brown University 4/12/2016 Ulrich Heintz - PHYS 1560 Lecture 11 1 Hypothesis testing Is our hypothesis about the fundamental physics correct? We will not be able
More informationInterim Monitoring of Clinical Trials: Decision Theory, Dynamic Programming. and Optimal Stopping
Interim Monitoring of Clinical Trials: Decision Theory, Dynamic Programming and Optimal Stopping Christopher Jennison Department of Mathematical Sciences, University of Bath, UK http://people.bath.ac.uk/mascj
More informationSize Distortion and Modi cation of Classical Vuong Tests
Size Distortion and Modi cation of Classical Vuong Tests Xiaoxia Shi University of Wisconsin at Madison March 2011 X. Shi (UW-Mdsn) H 0 : LR = 0 IUPUI 1 / 30 Vuong Test (Vuong, 1989) Data fx i g n i=1.
More informationInteractive Q-learning for Probabilities and Quantiles
Interactive Q-learning for Probabilities and Quantiles Kristin A. Linn 1, Eric B. Laber 2, Leonard A. Stefanski 2 arxiv:1407.3414v2 [stat.me] 21 May 2015 1 Department of Biostatistics and Epidemiology
More informationInference For High Dimensional M-estimates. Fixed Design Results
: Fixed Design Results Lihua Lei Advisors: Peter J. Bickel, Michael I. Jordan joint work with Peter J. Bickel and Noureddine El Karoui Dec. 8, 2016 1/57 Table of Contents 1 Background 2 Main Results and
More informationPerformance Evaluation and Comparison
Outline Hong Chang Institute of Computing Technology, Chinese Academy of Sciences Machine Learning Methods (Fall 2012) Outline Outline I 1 Introduction 2 Cross Validation and Resampling 3 Interval Estimation
More informationMonitoring clinical trial outcomes with delayed response: incorporating pipeline data in group sequential designs. Christopher Jennison
Monitoring clinical trial outcomes with delayed response: incorporating pipeline data in group sequential designs Christopher Jennison Department of Mathematical Sciences, University of Bath http://people.bath.ac.uk/mascj
More informationLecture 9: Learning Optimal Dynamic Treatment Regimes. Donglin Zeng, Department of Biostatistics, University of North Carolina
Lecture 9: Learning Optimal Dynamic Treatment Regimes Introduction Refresh: Dynamic Treatment Regimes (DTRs) DTRs: sequential decision rules, tailored at each stage by patients time-varying features and
More informationMultiple Testing in Group Sequential Clinical Trials
Multiple Testing in Group Sequential Clinical Trials Tian Zhao Supervisor: Michael Baron Department of Mathematical Sciences University of Texas at Dallas txz122@utdallas.edu 7/2/213 1 Sequential statistics
More informationNonparametric Inference In Functional Data
Nonparametric Inference In Functional Data Zuofeng Shang Purdue University Joint work with Guang Cheng from Purdue Univ. An Example Consider the functional linear model: Y = α + where 1 0 X(t)β(t)dt +
More informationSTATS 200: Introduction to Statistical Inference. Lecture 29: Course review
STATS 200: Introduction to Statistical Inference Lecture 29: Course review Course review We started in Lecture 1 with a fundamental assumption: Data is a realization of a random process. The goal throughout
More informationHigh Dimensional Empirical Likelihood for Generalized Estimating Equations with Dependent Data
High Dimensional Empirical Likelihood for Generalized Estimating Equations with Dependent Data Song Xi CHEN Guanghua School of Management and Center for Statistical Science, Peking University Department
More informationMIT Spring 2016
MIT 18.655 Dr. Kempthorne Spring 2016 1 MIT 18.655 Outline 1 2 MIT 18.655 3 Decision Problem: Basic Components P = {P θ : θ Θ} : parametric model. Θ = {θ}: Parameter space. A{a} : Action space. L(θ, a)
More informationWeighting Methods. Harvard University STAT186/GOV2002 CAUSAL INFERENCE. Fall Kosuke Imai
Weighting Methods Kosuke Imai Harvard University STAT186/GOV2002 CAUSAL INFERENCE Fall 2018 Kosuke Imai (Harvard) Weighting Methods Stat186/Gov2002 Fall 2018 1 / 13 Motivation Matching methods for improving
More informationDeductive Derivation and Computerization of Semiparametric Efficient Estimation
Deductive Derivation and Computerization of Semiparametric Efficient Estimation Constantine Frangakis, Tianchen Qian, Zhenke Wu, and Ivan Diaz Department of Biostatistics Johns Hopkins Bloomberg School
More informationMethods for Interpretable Treatment Regimes
Methods for Interpretable Treatment Regimes John Sperger University of North Carolina at Chapel Hill May 4 th 2018 Sperger (UNC) Interpretable Treatment Regimes May 4 th 2018 1 / 16 Why Interpretable Methods?
More informationBootstrap inference for the finite population total under complex sampling designs
Bootstrap inference for the finite population total under complex sampling designs Zhonglei Wang (Joint work with Dr. Jae Kwang Kim) Center for Survey Statistics and Methodology Iowa State University Jan.
More informationQuantifying Stochastic Model Errors via Robust Optimization
Quantifying Stochastic Model Errors via Robust Optimization IPAM Workshop on Uncertainty Quantification for Multiscale Stochastic Systems and Applications Jan 19, 2016 Henry Lam Industrial & Operations
More informationECO Class 6 Nonparametric Econometrics
ECO 523 - Class 6 Nonparametric Econometrics Carolina Caetano Contents 1 Nonparametric instrumental variable regression 1 2 Nonparametric Estimation of Average Treatment Effects 3 2.1 Asymptotic results................................
More informationDiscussion of Sensitivity and Informativeness under Local Misspecification
Discussion of Sensitivity and Informativeness under Local Misspecification Jinyong Hahn April 4, 2019 Jinyong Hahn () Discussion of Sensitivity and Informativeness under Local Misspecification April 4,
More informationSupplement to Q- and A-learning Methods for Estimating Optimal Dynamic Treatment Regimes
Submitted to the Statistical Science Supplement to - and -learning Methods for Estimating Optimal Dynamic Treatment Regimes Phillip J. Schulte, nastasios. Tsiatis, Eric B. Laber, and Marie Davidian bstract.
More informationGroup Sequential Tests for Delayed Responses
Group Sequential Tests for Delayed Responses Lisa Hampson Department of Mathematics and Statistics, Lancaster University, UK Chris Jennison Department of Mathematical Sciences, University of Bath, UK Read
More informationAdvanced Statistics II: Non Parametric Tests
Advanced Statistics II: Non Parametric Tests Aurélien Garivier ParisTech February 27, 2011 Outline Fitting a distribution Rank Tests for the comparison of two samples Two unrelated samples: Mann-Whitney
More informationNonparametric Tests for Multi-parameter M-estimators
Nonparametric Tests for Multi-parameter M-estimators John Robinson School of Mathematics and Statistics University of Sydney The talk is based on joint work with John Kolassa. It follows from work over
More informationInference in Regression Analysis
Inference in Regression Analysis Dr. Frank Wood Frank Wood, fwood@stat.columbia.edu Linear Regression Models Lecture 4, Slide 1 Today: Normal Error Regression Model Y i = β 0 + β 1 X i + ǫ i Y i value
More informationLecture 7 Introduction to Statistical Decision Theory
Lecture 7 Introduction to Statistical Decision Theory I-Hsiang Wang Department of Electrical Engineering National Taiwan University ihwang@ntu.edu.tw December 20, 2016 1 / 55 I-Hsiang Wang IT Lecture 7
More informationA multiple testing procedure for input variable selection in neural networks
A multiple testing procedure for input variable selection in neural networks MicheleLaRoccaandCiraPerna Department of Economics and Statistics - University of Salerno Via Ponte Don Melillo, 84084, Fisciano
More informationA Conditional Approach to Modeling Multivariate Extremes
A Approach to ing Multivariate Extremes By Heffernan & Tawn Department of Statistics Purdue University s April 30, 2014 Outline s s Multivariate Extremes s A central aim of multivariate extremes is trying
More informationBios 6649: Clinical Trials - Statistical Design and Monitoring
Bios 6649: Clinical Trials - Statistical Design and Monitoring Spring Semester 2015 John M. Kittelson Department of Biostatistics & Informatics Colorado School of Public Health University of Colorado Denver
More informationarxiv: v1 [math.st] 29 Jul 2014
On Estimation of Optimal Treatment Regimes For Maximizing t-year Survival Probability arxiv:1407.7820v1 [math.st] 29 Jul 2014 Runchao Jiang 1, Wenbin Lu, Rui Song, and Marie Davidian North Carolina State
More informationarxiv: v3 [math.oc] 25 Apr 2018
Problem-driven scenario generation: an analytical approach for stochastic programs with tail risk measure Jamie Fairbrother *, Amanda Turner *, and Stein W. Wallace ** * STOR-i Centre for Doctoral Training,
More informationSample size determination for a binary response in a superiority clinical trial using a hybrid classical and Bayesian procedure
Ciarleglio and Arendt Trials (2017) 18:83 DOI 10.1186/s13063-017-1791-0 METHODOLOGY Open Access Sample size determination for a binary response in a superiority clinical trial using a hybrid classical
More informationBias Variance Trade-off
Bias Variance Trade-off The mean squared error of an estimator MSE(ˆθ) = E([ˆθ θ] 2 ) Can be re-expressed MSE(ˆθ) = Var(ˆθ) + (B(ˆθ) 2 ) MSE = VAR + BIAS 2 Proof MSE(ˆθ) = E((ˆθ θ) 2 ) = E(([ˆθ E(ˆθ)]
More informationComparing Adaptive Designs and the. Classical Group Sequential Approach. to Clinical Trial Design
Comparing Adaptive Designs and the Classical Group Sequential Approach to Clinical Trial Design Christopher Jennison Department of Mathematical Sciences, University of Bath, UK http://people.bath.ac.uk/mascj
More informationEmpirical likelihood and self-weighting approach for hypothesis testing of infinite variance processes and its applications
Empirical likelihood and self-weighting approach for hypothesis testing of infinite variance processes and its applications Fumiya Akashi Research Associate Department of Applied Mathematics Waseda University
More informationThe International Journal of Biostatistics
The International Journal of Biostatistics Volume 2, Issue 1 2006 Article 2 Statistical Inference for Variable Importance Mark J. van der Laan, Division of Biostatistics, School of Public Health, University
More informationPolitical Science 236 Hypothesis Testing: Review and Bootstrapping
Political Science 236 Hypothesis Testing: Review and Bootstrapping Rocío Titiunik Fall 2007 1 Hypothesis Testing Definition 1.1 Hypothesis. A hypothesis is a statement about a population parameter The
More informationGARCH Models Estimation and Inference
GARCH Models Estimation and Inference Eduardo Rossi University of Pavia December 013 Rossi GARCH Financial Econometrics - 013 1 / 1 Likelihood function The procedure most often used in estimating θ 0 in
More informationA Gate-keeping Approach for Selecting Adaptive Interventions under General SMART Designs
1 / 32 A Gate-keeping Approach for Selecting Adaptive Interventions under General SMART Designs Tony Zhong, DrPH Icahn School of Medicine at Mount Sinai (Feb 20, 2019) Workshop on Design of mhealth Intervention
More informationSemi-Nonparametric Inferences for Massive Data
Semi-Nonparametric Inferences for Massive Data Guang Cheng 1 Department of Statistics Purdue University Statistics Seminar at NCSU October, 2015 1 Acknowledge NSF, Simons Foundation and ONR. A Joint Work
More informationGARCH Models Estimation and Inference. Eduardo Rossi University of Pavia
GARCH Models Estimation and Inference Eduardo Rossi University of Pavia Likelihood function The procedure most often used in estimating θ 0 in ARCH models involves the maximization of a likelihood function
More informationUniformly and Restricted Most Powerful Bayesian Tests
Uniformly and Restricted Most Powerful Bayesian Tests Valen E. Johnson and Scott Goddard Texas A&M University June 6, 2014 Valen E. Johnson and Scott Goddard Texas A&MUniformly University Most Powerful
More informationStructural Nested Mean Models for Assessing Time-Varying Effect Moderation. Daniel Almirall
1 Structural Nested Mean Models for Assessing Time-Varying Effect Moderation Daniel Almirall Center for Health Services Research, Durham VAMC & Dept. of Biostatistics, Duke University Medical Joint work
More informationMaster s Written Examination
Master s Written Examination Option: Statistics and Probability Spring 016 Full points may be obtained for correct answers to eight questions. Each numbered question which may have several parts is worth
More informationThe information complexity of sequential resource allocation
The information complexity of sequential resource allocation Emilie Kaufmann, joint work with Olivier Cappé, Aurélien Garivier and Shivaram Kalyanakrishan SMILE Seminar, ENS, June 8th, 205 Sequential allocation
More informationNonconcave Penalized Likelihood with A Diverging Number of Parameters
Nonconcave Penalized Likelihood with A Diverging Number of Parameters Jianqing Fan and Heng Peng Presenter: Jiale Xu March 12, 2010 Jianqing Fan and Heng Peng Presenter: JialeNonconcave Xu () Penalized
More informationST495: Survival Analysis: Maximum likelihood
ST495: Survival Analysis: Maximum likelihood Eric B. Laber Department of Statistics, North Carolina State University February 11, 2014 Everything is deception: seeking the minimum of illusion, keeping
More informationf-divergence Estimation and Two-Sample Homogeneity Test under Semiparametric Density-Ratio Models
IEEE Transactions on Information Theory, vol.58, no.2, pp.708 720, 2012. 1 f-divergence Estimation and Two-Sample Homogeneity Test under Semiparametric Density-Ratio Models Takafumi Kanamori Nagoya University,
More informationControlling the False Discovery Rate: Understanding and Extending the Benjamini-Hochberg Method
Controlling the False Discovery Rate: Understanding and Extending the Benjamini-Hochberg Method Christopher R. Genovese Department of Statistics Carnegie Mellon University joint work with Larry Wasserman
More informationThe consequences of misspecifying the random effects distribution when fitting generalized linear mixed models
The consequences of misspecifying the random effects distribution when fitting generalized linear mixed models John M. Neuhaus Charles E. McCulloch Division of Biostatistics University of California, San
More informationCross-Validation with Confidence
Cross-Validation with Confidence Jing Lei Department of Statistics, Carnegie Mellon University UMN Statistics Seminar, Mar 30, 2017 Overview Parameter est. Model selection Point est. MLE, M-est.,... Cross-validation
More informationNotes, March 4, 2013, R. Dudley Maximum likelihood estimation: actual or supposed
18.466 Notes, March 4, 2013, R. Dudley Maximum likelihood estimation: actual or supposed 1. MLEs in exponential families Let f(x,θ) for x X and θ Θ be a likelihood function, that is, for present purposes,
More informationFair Inference Through Semiparametric-Efficient Estimation Over Constraint-Specific Paths
Fair Inference Through Semiparametric-Efficient Estimation Over Constraint-Specific Paths for New Developments in Nonparametric and Semiparametric Statistics, Joint Statistical Meetings; Vancouver, BC,
More informationStratégies bayésiennes et fréquentistes dans un modèle de bandit
Stratégies bayésiennes et fréquentistes dans un modèle de bandit thèse effectuée à Telecom ParisTech, co-dirigée par Olivier Cappé, Aurélien Garivier et Rémi Munos Journées MAS, Grenoble, 30 août 2016
More informationStatistics - Lecture One. Outline. Charlotte Wickham 1. Basic ideas about estimation
Statistics - Lecture One Charlotte Wickham wickham@stat.berkeley.edu http://www.stat.berkeley.edu/~wickham/ Outline 1. Basic ideas about estimation 2. Method of Moments 3. Maximum Likelihood 4. Confidence
More informationTesting a secondary endpoint after a group sequential test. Chris Jennison. 9th Annual Adaptive Designs in Clinical Trials
Testing a secondary endpoint after a group sequential test Christopher Jennison Department of Mathematical Sciences, University of Bath, UK http://people.bath.ac.uk/mascj 9th Annual Adaptive Designs in
More informationPropensity Score Weighting with Multilevel Data
Propensity Score Weighting with Multilevel Data Fan Li Department of Statistical Science Duke University October 25, 2012 Joint work with Alan Zaslavsky and Mary Beth Landrum Introduction In comparative
More informationSTAT 512 sp 2018 Summary Sheet
STAT 5 sp 08 Summary Sheet Karl B. Gregory Spring 08. Transformations of a random variable Let X be a rv with support X and let g be a function mapping X to Y with inverse mapping g (A = {x X : g(x A}
More informationBIOS 312: Precision of Statistical Inference
and Power/Sample Size and Standard Errors BIOS 312: of Statistical Inference Chris Slaughter Department of Biostatistics, Vanderbilt University School of Medicine January 3, 2013 Outline Overview and Power/Sample
More informationConstruction and statistical analysis of adaptive group sequential designs for randomized clinical trials
Construction and statistical analysis of adaptive group sequential designs for randomized clinical trials Antoine Chambaz (MAP5, Université Paris Descartes) joint work with Mark van der Laan Atelier INSERM
More information14.30 Introduction to Statistical Methods in Economics Spring 2009
MIT OpenCourseWare http://ocw.mit.edu 4.0 Introduction to Statistical Methods in Economics Spring 009 For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms.
More informationOnline Learning and Sequential Decision Making
Online Learning and Sequential Decision Making Emilie Kaufmann CNRS & CRIStAL, Inria SequeL, emilie.kaufmann@univ-lille.fr Research School, ENS Lyon, Novembre 12-13th 2018 Emilie Kaufmann Sequential Decision
More informationChapter 2: Fundamentals of Statistics Lecture 15: Models and statistics
Chapter 2: Fundamentals of Statistics Lecture 15: Models and statistics Data from one or a series of random experiments are collected. Planning experiments and collecting data (not discussed here). Analysis:
More informationPB HLTH 240A: Advanced Categorical Data Analysis Fall 2007
Cohort study s formulations PB HLTH 240A: Advanced Categorical Data Analysis Fall 2007 Srine Dudoit Division of Biostatistics Department of Statistics University of California, Berkeley www.stat.berkeley.edu/~srine
More informationSequential Detection. Changes: an overview. George V. Moustakides
Sequential Detection of Changes: an overview George V. Moustakides Outline Sequential hypothesis testing and Sequential detection of changes The Sequential Probability Ratio Test (SPRT) for optimum hypothesis
More informationStatistical Applications in Genetics and Molecular Biology
Statistical Applications in Genetics and Molecular Biology Volume 3, Issue 1 2004 Article 14 Multiple Testing. Part II. Step-Down Procedures for Control of the Family-Wise Error Rate Mark J. van der Laan
More informationGeneralized Neyman Pearson optimality of empirical likelihood for testing parameter hypotheses
Ann Inst Stat Math (2009) 61:773 787 DOI 10.1007/s10463-008-0172-6 Generalized Neyman Pearson optimality of empirical likelihood for testing parameter hypotheses Taisuke Otsu Received: 1 June 2007 / Revised:
More informationA Type of Sample Size Planning for Mean Comparison in Clinical Trials
Journal of Data Science 13(2015), 115-126 A Type of Sample Size Planning for Mean Comparison in Clinical Trials Junfeng Liu 1 and Dipak K. Dey 2 1 GCE Solutions, Inc. 2 Department of Statistics, University
More informationBetter Bootstrap Confidence Intervals
by Bradley Efron University of Washington, Department of Statistics April 12, 2012 An example Suppose we wish to make inference on some parameter θ T (F ) (e.g. θ = E F X ), based on data We might suppose
More informationQ learning. A data analysis method for constructing adaptive interventions
Q learning A data analysis method for constructing adaptive interventions SMART First stage intervention options coded as 1(M) and 1(B) Second stage intervention options coded as 1(M) and 1(B) O1 A1 O2
More informationLecture 8: Information Theory and Statistics
Lecture 8: Information Theory and Statistics Part II: Hypothesis Testing and Estimation I-Hsiang Wang Department of Electrical Engineering National Taiwan University ihwang@ntu.edu.tw December 22, 2015
More informationBios 6649: Clinical Trials - Statistical Design and Monitoring
Bios 6649: Clinical Trials - Statistical Design and Monitoring Spring Semester 2015 John M. Kittelson Department of Biostatistics & nformatics Colorado School of Public Health University of Colorado Denver
More information