Sequential Implementation of Monte Carlo Tests with Uniformly Bounded Resampling Risk
|
|
- Benedict Green
- 6 years ago
- Views:
Transcription
1 Sequential Implementation of Monte Carlo Tests with Uniformly Bounded Resampling Risk Axel Gandy Department of Mathematics Imperial College London user! 2009, Rennes July 8-10, 2009
2 Introduction Test statistic T, reject for large values. Observation: t. p-value: p = P(T t) Often not available in closed form. Monte Carlo Test: ˆp naive = 1 n where T, T 1,... T n i.i.d. Examples: Bootstrap, Permutation tests. Goal: Estimate p using few X i n I(T i t), i=1 Mainly interested in deciding if p α for some α. Axel Gandy Sequential Implementation of Monte Carlo Tests with Uniformly Bounded Resampling Risk 2
3 Introduction Test statistic T, reject for large values. Observation: t. p-value: p = P(T t) Often not available in closed form. Monte Carlo Test: ˆp naive = 1 n where T, T 1,... T n i.i.d. Examples: Bootstrap, Permutation tests. Goal: Estimate p using few X i n I(T i t), }{{} =:X i B(1,p) i=1 Mainly interested in deciding if p α for some α. Axel Gandy Sequential Implementation of Monte Carlo Tests with Uniformly Bounded Resampling Risk 2
4 Introduction Test statistic T, reject for large values. Observation: t. p-value: p = P(T t) Often not available in closed form. Monte Carlo Test: ˆp naive = 1 n where T, T 1,... T n i.i.d. Examples: Bootstrap, Permutation tests. Goal: Estimate p using few X i n I(T i t), }{{} =:X i B(1,p) i=1 Mainly interested in deciding if p α for some α. Axel Gandy Sequential Implementation of Monte Carlo Tests with Uniformly Bounded Resampling Risk 2
5 Introduction Test statistic T, reject for large values. Observation: t. p-value: p = P(T t) Often not available in closed form. Monte Carlo Test: ˆp naive = 1 n where T, T 1,... T n i.i.d. Examples: Bootstrap, Permutation tests. Goal: Estimate p using few X i n I(T i t), }{{} =:X i B(1,p) i=1 Mainly interested in deciding if p α for some α. Axel Gandy Sequential Implementation of Monte Carlo Tests with Uniformly Bounded Resampling Risk 2
6 Sequential approaches based on S n = n i=1 X i S n Stop once S n U n or S n L n τ: hitting time Compute ˆp based on S τ and τ. Hit B U : decide p > α, Hit B L : decide p α, n Axel Gandy Sequential Implementation of Monte Carlo Tests with Uniformly Bounded Resampling Risk 3
7 Sequential approaches based on S n = n i=1 X i S n Stop once S n U n or S n L n τ: hitting time Compute ˆp based on S τ and τ. Hit B U : decide p > α, Hit B L : decide p α, n Axel Gandy Sequential Implementation of Monte Carlo Tests with Uniformly Bounded Resampling Risk 3
8 Sequential approaches based on S n = n i=1 X i S n Stop once S n U n or S n L n τ: hitting time Compute ˆp based on S τ and τ. Hit B U : decide p > α, Hit B L : decide p α, n Axel Gandy Sequential Implementation of Monte Carlo Tests with Uniformly Bounded Resampling Risk 3
9 Previous Approaches Besag & Clifford (1991): h S n 0 n 0 m (Truncated) Sequential Probability Ratio Test, Fay et al. (2007) h S n 0 0 m R-package MChtest. n Axel Gandy Sequential Implementation of Monte Carlo Tests with Uniformly Bounded Resampling Risk 4
10 What do we really want? Is p α? Two individuals using the same statistical method on the same data should arrive at the same conclusion. First law of applied statistics, Gleser (1996) Consider the resampling risk Want: RR p (ˆp) { P p (ˆp > α) if p α, P p (ˆp α) if p > α. sup RR p (ˆp) ɛ p [0,1] for some (small) ɛ > 0. For Besag & Clifford (1991), SPRT: sup p RR P 0.5 Axel Gandy Sequential Implementation of Monte Carlo Tests with Uniformly Bounded Resampling Risk 5
11 What do we really want? Is p α? Two individuals using the same statistical method on the same data should arrive at the same conclusion. First law of applied statistics, Gleser (1996) Consider the resampling risk Want: RR p (ˆp) { P p (ˆp > α) if p α, P p (ˆp α) if p > α. sup RR p (ˆp) ɛ p [0,1] for some (small) ɛ > 0. For Besag & Clifford (1991), SPRT: sup p RR P 0.5 Axel Gandy Sequential Implementation of Monte Carlo Tests with Uniformly Bounded Resampling Risk 5
12 What do we really want? Is p α? Two individuals using the same statistical method on the same data should arrive at the same conclusion. First law of applied statistics, Gleser (1996) Consider the resampling risk Want: RR p (ˆp) { P p (ˆp > α) if p α, P p (ˆp α) if p > α. sup RR p (ˆp) ɛ p [0,1] for some (small) ɛ > 0. For Besag & Clifford (1991), SPRT: sup p RR P 0.5 Axel Gandy Sequential Implementation of Monte Carlo Tests with Uniformly Bounded Resampling Risk 5
13 What do we really want? Is p α? Two individuals using the same statistical method on the same data should arrive at the same conclusion. First law of applied statistics, Gleser (1996) Consider the resampling risk Want: RR p (ˆp) { P p (ˆp > α) if p α, P p (ˆp α) if p > α. sup RR p (ˆp) ɛ p [0,1] for some (small) ɛ > 0. For Besag & Clifford (1991), SPRT: sup p RR P 0.5 Axel Gandy Sequential Implementation of Monte Carlo Tests with Uniformly Bounded Resampling Risk 5
14 What do we really want? Is p α? Two individuals using the same statistical method on the same data should arrive at the same conclusion. First law of applied statistics, Gleser (1996) Consider the resampling risk Want: RR p (ˆp) { P p (ˆp > α) if p α, P p (ˆp α) if p > α. sup RR p (ˆp) ɛ p [0,1] for some (small) ɛ > 0. For Besag & Clifford (1991), SPRT: sup p RR P 0.5 Axel Gandy Sequential Implementation of Monte Carlo Tests with Uniformly Bounded Resampling Risk 5
15 What do we really want? Is p α? Two individuals using the same statistical method on the same data should arrive at the same conclusion. First law of applied statistics, Gleser (1996) Consider the resampling risk Want: RR p (ˆp) { P p (ˆp > α) if p α, P p (ˆp α) if p > α. sup RR p (ˆp) ɛ p [0,1] for some (small) ɛ > 0. For Besag & Clifford (1991), SPRT: sup p RR P 0.5 Axel Gandy Sequential Implementation of Monte Carlo Tests with Uniformly Bounded Resampling Risk 5
16 Recursive Definition of the Boundaries Want: sup RR p (ˆp) ɛ p Suffices to ensure P α (hit B U ) ɛ P α (hit B L ) ɛ Recursive definition: Given U 1,..., U n 1 and L 1,..., L n 1, define U n as the minimal value such that P α (hit B U until n) ɛ n and L n as the maximal value such that P α (hit B L until n) ɛ n where ɛ n 0 with ɛ n ɛ (spending sequence). Axel Gandy Sequential Implementation of Monte Carlo Tests with Uniformly Bounded Resampling Risk 6
17 Recursive Definition of the Boundaries Want: sup RR p (ˆp) ɛ p Suffices to ensure P α (hit B U ) ɛ P α (hit B L ) ɛ Recursive definition: Given U 1,..., U n 1 and L 1,..., L n 1, define U n as the minimal value such that P α (hit B U until n) ɛ n and L n as the maximal value such that P α (hit B L until n) ɛ n where ɛ n 0 with ɛ n ɛ (spending sequence). Axel Gandy Sequential Implementation of Monte Carlo Tests with Uniformly Bounded Resampling Risk 6
18 Recursive Definition of the Boundaries Want: sup RR p (ˆp) ɛ p Suffices to ensure P α (hit B U ) ɛ P α (hit B L ) ɛ Recursive definition: Given U 1,..., U n 1 and L 1,..., L n 1, define U n as the minimal value such that P α (hit B U until n) ɛ n and L n as the maximal value such that P α (hit B L until n) ɛ n where ɛ n 0 with ɛ n ɛ (spending sequence). Axel Gandy Sequential Implementation of Monte Carlo Tests with Uniformly Bounded Resampling Risk 6
19 Recursive Definition of the Boundaries Want: sup RR p (ˆp) ɛ p Suffices to ensure P α (hit B U ) ɛ P α (hit B L ) ɛ Recursive definition: Given U 1,..., U n 1 and L 1,..., L n 1, define U n as the minimal value such that P α (hit B U until n) ɛ n and L n as the maximal value such that P α (hit B L until n) ɛ n where ɛ n 0 with ɛ n ɛ (spending sequence). Axel Gandy Sequential Implementation of Monte Carlo Tests with Uniformly Bounded Resampling Risk 6
20 Recursive Definition - Example α = 0.2, ɛ n = 0.4 n 5+n. U n =the minimal value such that L n = maximal value such that P α (hit B U until n) ɛ n P α (hit B L until n) ɛ n P α (S n =k, τ n) 0 k= 3 k= 2 k= 1 k= 0 1 ɛ n 0 U n 1 L n -1 n = Axel Gandy Sequential Implementation of Monte Carlo Tests with Uniformly Bounded Resampling Risk 7
21 Recursive Definition - Example α = 0.2, ɛ n = 0.4 n 5+n. U n =the minimal value such that L n = maximal value such that P α (hit B U until n) ɛ n P α (hit B L until n) ɛ n P α (S n =k, τ n) 0 1 k= 3 k= 2 k= 1.2 k= ɛ n 0.07 U n 1 2 L n -1-1 n = Axel Gandy Sequential Implementation of Monte Carlo Tests with Uniformly Bounded Resampling Risk 7
22 Recursive Definition - Example α = 0.2, ɛ n = 0.4 n 5+n. U n =the minimal value such that L n = maximal value such that P α (hit B U until n) ɛ n P α (hit B L until n) ɛ n P α (S n =k, τ n) k= 3 k= 2.04 k= k= ɛ n U n L n n = Axel Gandy Sequential Implementation of Monte Carlo Tests with Uniformly Bounded Resampling Risk 7
23 Recursive Definition - Example α = 0.2, ɛ n = 0.4 n 5+n. U n =the minimal value such that L n = maximal value such that P α (hit B U until n) ɛ n P α (hit B L until n) ɛ n P α (S n =k, τ n) k= 3 k= k= k= ɛ n U n L n n = Axel Gandy Sequential Implementation of Monte Carlo Tests with Uniformly Bounded Resampling Risk 7
24 Recursive Definition - Example α = 0.2, ɛ n = 0.4 n 5+n. U n =the minimal value such that L n = maximal value such that P α (hit B U until n) ɛ n P α (hit B L until n) ɛ n n = P α (S n =k, τ n) k= 3 k= k= k= ɛ n U n L n Axel Gandy Sequential Implementation of Monte Carlo Tests with Uniformly Bounded Resampling Risk 7
25 Recursive Definition - Example α = 0.2, ɛ n = 0.4 n 5+n. U n =the minimal value such that L n = maximal value such that P α (hit B U until n) ɛ n P α (hit B L until n) ɛ n n = P α (S n =k, τ n) k= 3.02 k= k= k= ɛ n U n L n Axel Gandy Sequential Implementation of Monte Carlo Tests with Uniformly Bounded Resampling Risk 7
26 Recursive Definition - Example α = 0.2, ɛ n = 0.4 n 5+n. U n =the minimal value such that L n = maximal value such that P α (hit B U until n) ɛ n P α (hit B L until n) ɛ n n = P α (S n =k, τ n) k= k= k= k= ɛ n U n L n Axel Gandy Sequential Implementation of Monte Carlo Tests with Uniformly Bounded Resampling Risk 7
27 Recursive Definition - Example α = 0.2, ɛ n = 0.4 n 5+n. U n =the minimal value such that L n = maximal value such that P α (hit B U until n) ɛ n P α (hit B L until n) ɛ n n = P α (S n =k, τ n) k= k= k= k= ɛ n U n L n Axel Gandy Sequential Implementation of Monte Carlo Tests with Uniformly Bounded Resampling Risk 7
28 Recursive Definition - Example α = 0.2, ɛ n = 0.4 n 5+n. U n =the minimal value such that L n = maximal value such that P α (hit B U until n) ɛ n P α (hit B L until n) ɛ n n = P α (S n =k, τ n) k= k= k= k= ɛ n U n L n Axel Gandy Sequential Implementation of Monte Carlo Tests with Uniformly Bounded Resampling Risk 7
29 Sequential Decision Procedure - Example α = 0.2, ɛ n = 0.4 n 5+n n Axel Gandy Sequential Implementation of Monte Carlo Tests with Uniformly Bounded Resampling Risk 8
30 Influence of ɛ on the stopping rule ɛ = 0.1, 0.001, 10 5, 10 7 ; ɛ n = ɛ n 1000+n n Axel Gandy Sequential Implementation of Monte Carlo Tests with Uniformly Bounded Resampling Risk 9
31 Sequential Estimation based on the MLE One can show: S τ ˆp = τ, τ < α, τ =, hitting the upper boundary implies ˆp > α, hitting the lower boundary implies ˆp < α. Hence, sup RR p (ˆp) ɛ p Furthermore, random interval I n s.t. In only depends on X 1,..., X n, ˆp In. Axel Gandy Sequential Implementation of Monte Carlo Tests with Uniformly Bounded Resampling Risk 10
32 Example - Two-way sparse contingency table H 0 : variables are independent. Reject for large values of the likelihood ratio test statistic T T d χ 2 (7 1)(5 1) under H 0. Based on this: p = Matrix sparse - approximation poor? Use parametric bootstrap based on row and column sums. Naive test statistic ˆp naive with n = 1,000 replicates: p = < Probability of reporting p > 0.05: roughly Axel Gandy Sequential Implementation of Monte Carlo Tests with Uniformly Bounded Resampling Risk 11
33 Example - Two-way sparse contingency table H 0 : variables are independent. Reject for large values of the likelihood ratio test statistic T T d χ 2 (7 1)(5 1) under H 0. Based on this: p = Matrix sparse - approximation poor? Use parametric bootstrap based on row and column sums. Naive test statistic ˆp naive with n = 1,000 replicates: p = < Probability of reporting p > 0.05: roughly Axel Gandy Sequential Implementation of Monte Carlo Tests with Uniformly Bounded Resampling Risk 11
34 Example - Two-way sparse contingency table H 0 : variables are independent. Reject for large values of the likelihood ratio test statistic T T d χ 2 (7 1)(5 1) under H 0. Based on this: p = Matrix sparse - approximation poor? Use parametric bootstrap based on row and column sums. Naive test statistic ˆp naive with n = 1,000 replicates: p = < Probability of reporting p > 0.05: roughly Axel Gandy Sequential Implementation of Monte Carlo Tests with Uniformly Bounded Resampling Risk 11
35 Example - Bootstrap and Sequential Algorithm > dat <- matrix(c(1,2,2,1,1,0,1, 2,0,0,2,3,0,0, 0,1,1,1,2,7,3, 1,1,2,0,0,0,1, + 0,1,1,1,1,0,0), nrow=5,ncol=7,byrow=true) > loglikrat <- function(data){ + cs <- colsums(data);rs <- rowsums(data); mu <- outer(rs,cs)/sum(rs) + 2*sum(ifelse(data<=0.5, 0,data*log(data/mu))) + } > resample <- function(data){ + cs <- colsums(data);rs <- rowsums(data); n <- sum(rs) + mu <- outer(rs,cs)/n/n + matrix(rmultinom(1,n,c(mu)),nrow=dim(data)[1],ncol=dim(data)[2]) + } > t <- loglikrat(dat); > library(simctest) > res <- simctest(function(){loglikrat(resample(dat))>=t},maxsteps=1000) > res No decision reached. Final estimate will be in [ , ] Current estimate of the p.value: Number of samples: 1000 > cont(res, steps=10000) p.value: Number of samples: 8574
36 Further Uses of the Algorithm Simulation study to evaluate whether a test is liberal/conservative. Determining the sample size to achieve a certain power. Iterated Use: Determining the power of a bootstrap test. Simulation study to evaluate whether a bootstrap test is liberal/conservative. Double bootstrap test. Axel Gandy Sequential Implementation of Monte Carlo Tests with Uniformly Bounded Resampling Risk 13
37 Expected Hitting Time Result: E p (τ) < p α Example with α = 0.05, ɛ n = ɛ n 1000+n : E p (τ) ε = ε = 1e 05 ε = 1e 07 E p (τ) µ p p µ p = theoretical lower bound on E p (τ). Note: 1 0 µ pdp = ; for iterated use: Need to limit the number of steps.
38 Expected Hitting Time Result: E p (τ) < p α Example with α = 0.05, ɛ n = ɛ n 1000+n : E p (τ) ε = ε = 1e 05 ε = 1e 07 E p (τ) µ p p µ p = theoretical lower bound on E p (τ). Note: 1 0 µ pdp = ; for iterated use: Need to limit the number of steps.
39 Summary Sequential implementation of Monte Carlo Tests and computation of p-values. Useful when implementing tests in packages. After a finite number of steps: ˆp or interval [ˆp L n, ˆp U n ] in which ˆp will lie. Guarantee (up to a very small error probability): ˆp is on the correct side of α. R-package simctest available on CRAN. (efficient implementation with C-code) For details see Gandy (2009). Axel Gandy Sequential Implementation of Monte Carlo Tests with Uniformly Bounded Resampling Risk 15
40 References Besag, J. & Clifford, P. (1991). Sequential Monte Carlo p-values. Biometrika 78, Davison, A. & Hinkley, D. (1997). Bootstrap methods and their application. Cambridge University Press. Fay, M. P., Kim, H.-J. & Hachey, M. (2007). On using truncated sequential probability ratio test boundaries for Monte Carlo implementation of hypothesis tests. Journal of Computational & Graphical Statistics 16, Gandy, A. (2009). Sequential implementation of Monte Carlo tests with uniformly bounded resampling risk. Accepted for publication in JASA. Gleser, L. J. (1996). Comment on Bootstrap Confidence Intervals by T. J. DiCiccio and B. Efron. Statistical Science 11, Axel Gandy Sequential Implementation of Monte Carlo Tests with Uniformly Bounded Resampling Risk 16
arxiv: v4 [stat.me] 20 Oct 2017
A simple method for implementing Monte Carlo tests Dong Ding, Axel Gandy, and Georg Hahn Department of Mathematics, Imperial College London arxiv:1611.01675v4 [stat.me] 20 Oct 2017 Abstract We consider
More informationOn Using Truncated Sequential Probability Ratio Test Boundaries for Monte Carlo Implementation of Hypothesis Tests
On Using Truncated Sequential Probability Ratio Test Boundaries for Monte Carlo Implementation of Hypothesis Tests (to appear in Journal of Computational and Graphical Statistics) Michael P. Fay National
More informationOptimal Generalized Sequential Monte Carlo Test
Optimal Generalized Sequential Monte Carlo Test I. R. Silva, R. M. Assunção Departamento de Estatística, Universidade Federal de Minas Gerais, Belo Horizonte, Minas Gerais, Brazil Abstract Conventional
More informationResampling and the Bootstrap
Resampling and the Bootstrap Axel Benner Biostatistics, German Cancer Research Center INF 280, D-69120 Heidelberg benner@dkfz.de Resampling and the Bootstrap 2 Topics Estimation and Statistical Testing
More informationPolitical Science 236 Hypothesis Testing: Review and Bootstrapping
Political Science 236 Hypothesis Testing: Review and Bootstrapping Rocío Titiunik Fall 2007 1 Hypothesis Testing Definition 1.1 Hypothesis. A hypothesis is a statement about a population parameter The
More informationPerformance Evaluation and Comparison
Outline Hong Chang Institute of Computing Technology, Chinese Academy of Sciences Machine Learning Methods (Fall 2012) Outline Outline I 1 Introduction 2 Cross Validation and Resampling 3 Interval Estimation
More informationMultiple Sample Categorical Data
Multiple Sample Categorical Data paired and unpaired data, goodness-of-fit testing, testing for independence University of California, San Diego Instructor: Ery Arias-Castro http://math.ucsd.edu/~eariasca/teaching.html
More information(a) (3 points) Construct a 95% confidence interval for β 2 in Equation 1.
Problem 1 (21 points) An economist runs the regression y i = β 0 + x 1i β 1 + x 2i β 2 + x 3i β 3 + ε i (1) The results are summarized in the following table: Equation 1. Variable Coefficient Std. Error
More informationMA 575 Linear Models: Cedric E. Ginestet, Boston University Non-parametric Inference, Polynomial Regression Week 9, Lecture 2
MA 575 Linear Models: Cedric E. Ginestet, Boston University Non-parametric Inference, Polynomial Regression Week 9, Lecture 2 1 Bootstrapped Bias and CIs Given a multiple regression model with mean and
More informationPermutation Tests. Noa Haas Statistics M.Sc. Seminar, Spring 2017 Bootstrap and Resampling Methods
Permutation Tests Noa Haas Statistics M.Sc. Seminar, Spring 2017 Bootstrap and Resampling Methods The Two-Sample Problem We observe two independent random samples: F z = z 1, z 2,, z n independently of
More informationMonte Carlo test under general conditions: Power and number of simulations
Monte Carlo test under general conditions: Power and number of simulations I.R.Silva, R.M.Assunção Departamento de Estatística, Universidade Federal de Minas Gerais, Belo Horizonte, Minas Gerais, Brazil
More informationRobust Backtesting Tests for Value-at-Risk Models
Robust Backtesting Tests for Value-at-Risk Models Jose Olmo City University London (joint work with Juan Carlos Escanciano, Indiana University) Far East and South Asia Meeting of the Econometric Society
More informationRandom Numbers and Simulation
Random Numbers and Simulation Generating random numbers: Typically impossible/unfeasible to obtain truly random numbers Programs have been developed to generate pseudo-random numbers: Values generated
More informationLikelihood-based inference with missing data under missing-at-random
Likelihood-based inference with missing data under missing-at-random Jae-kwang Kim Joint work with Shu Yang Department of Statistics, Iowa State University May 4, 014 Outline 1. Introduction. Parametric
More informationBootstrap Testing in Econometrics
Presented May 29, 1999 at the CEA Annual Meeting Bootstrap Testing in Econometrics James G MacKinnon Queen s University at Kingston Introduction: Economists routinely compute test statistics of which the
More informationMultiple Testing of One-Sided Hypotheses: Combining Bonferroni and the Bootstrap
University of Zurich Department of Economics Working Paper Series ISSN 1664-7041 (print) ISSN 1664-705X (online) Working Paper No. 254 Multiple Testing of One-Sided Hypotheses: Combining Bonferroni and
More informationFall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.
1. Let P be a probability measure on a collection of sets A. (a) For each n N, let H n be a set in A such that H n H n+1. Show that P (H n ) monotonically converges to P ( k=1 H k) as n. (b) For each n
More information11. Bootstrap Methods
11. Bootstrap Methods c A. Colin Cameron & Pravin K. Trivedi 2006 These transparencies were prepared in 20043. They can be used as an adjunct to Chapter 11 of our subsequent book Microeconometrics: Methods
More informationExperimental Design and Data Analysis for Biologists
Experimental Design and Data Analysis for Biologists Gerry P. Quinn Monash University Michael J. Keough University of Melbourne CAMBRIDGE UNIVERSITY PRESS Contents Preface page xv I I Introduction 1 1.1
More informationBootstrap and Parametric Inference: Successes and Challenges
Bootstrap and Parametric Inference: Successes and Challenges G. Alastair Young Department of Mathematics Imperial College London Newton Institute, January 2008 Overview Overview Review key aspects of frequentist
More informationBayesian Econometrics
Bayesian Econometrics Christopher A. Sims Princeton University sims@princeton.edu September 20, 2016 Outline I. The difference between Bayesian and non-bayesian inference. II. Confidence sets and confidence
More informationPOLI 443 Applied Political Research
POLI 443 Applied Political Research Session 6: Tests of Hypotheses Contingency Analysis Lecturer: Prof. A. Essuman-Johnson, Dept. of Political Science Contact Information: aessuman-johnson@ug.edu.gh College
More informationPreliminaries The bootstrap Bias reduction Hypothesis tests Regression Confidence intervals Time series Final remark. Bootstrap inference
1 / 171 Bootstrap inference Francisco Cribari-Neto Departamento de Estatística Universidade Federal de Pernambuco Recife / PE, Brazil email: cribari@gmail.com October 2013 2 / 171 Unpaid advertisement
More informationApril 20th, Advanced Topics in Machine Learning California Institute of Technology. Markov Chain Monte Carlo for Machine Learning
for for Advanced Topics in California Institute of Technology April 20th, 2017 1 / 50 Table of Contents for 1 2 3 4 2 / 50 History of methods for Enrico Fermi used to calculate incredibly accurate predictions
More informationUNIVERSITÄT POTSDAM Institut für Mathematik
UNIVERSITÄT POTSDAM Institut für Mathematik Testing the Acceleration Function in Life Time Models Hannelore Liero Matthias Liero Mathematische Statistik und Wahrscheinlichkeitstheorie Universität Potsdam
More informationAppendix of the paper: Are interest rate options important for the assessment of interest rate risk?
Appendix of the paper: Are interest rate options important for the assessment of interest rate risk? Caio Almeida,a, José Vicente b a Graduate School of Economics, Getulio Vargas Foundation b Research
More informationPreliminaries The bootstrap Bias reduction Hypothesis tests Regression Confidence intervals Time series Final remark. Bootstrap inference
1 / 172 Bootstrap inference Francisco Cribari-Neto Departamento de Estatística Universidade Federal de Pernambuco Recife / PE, Brazil email: cribari@gmail.com October 2014 2 / 172 Unpaid advertisement
More informationResampling and the Bootstrap
Resampling and the Bootstrap Axel Benner Biostatistics, German Cancer Research Center INF 280, D-69120 Heidelberg benner@dkfz.de Resampling and the Bootstrap 2 Topics Estimation and Statistical Testing
More informationHypothesis Testing (May 30, 2016)
Ch. 5 Hypothesis Testing (May 30, 2016) 1 Introduction Inference, so far as we have seen, often take the form of numerical estimates, either as single points as confidence intervals. But not always. In
More informationBootstrap, Jackknife and other resampling methods
Bootstrap, Jackknife and other resampling methods Part VI: Cross-validation Rozenn Dahyot Room 128, Department of Statistics Trinity College Dublin, Ireland dahyot@mee.tcd.ie 2005 R. Dahyot (TCD) 453 Modern
More informationThe chopthin algorithm for resampling
The chopthin algorithm for resampling Axel Gandy F. Din-Houn Lau Department of Mathematics, Imperial College London Abstract Resampling is a standard step in particle filters and more generally sequential
More informationSTAT 730 Chapter 5: Hypothesis Testing
STAT 730 Chapter 5: Hypothesis Testing Timothy Hanson Department of Statistics, University of South Carolina Stat 730: Multivariate Analysis 1 / 28 Likelihood ratio test def n: Data X depend on θ. The
More informationStatistical Inference
Statistical Inference Robert L. Wolpert Institute of Statistics and Decision Sciences Duke University, Durham, NC, USA Spring, 2006 1. DeGroot 1973 In (DeGroot 1973), Morrie DeGroot considers testing the
More informationSequential Procedure for Testing Hypothesis about Mean of Latent Gaussian Process
Applied Mathematical Sciences, Vol. 4, 2010, no. 62, 3083-3093 Sequential Procedure for Testing Hypothesis about Mean of Latent Gaussian Process Julia Bondarenko Helmut-Schmidt University Hamburg University
More informationStatistical Inference
Statistical Inference Classical and Bayesian Methods Revision Class for Midterm Exam AMS-UCSC Th Feb 9, 2012 Winter 2012. Session 1 (Revision Class) AMS-132/206 Th Feb 9, 2012 1 / 23 Topics Topics We will
More informationAnalytical Bootstrap Methods for Censored Data
JOURNAL OF APPLIED MATHEMATICS AND DECISION SCIENCES, 6(2, 129 141 Copyright c 2002, Lawrence Erlbaum Associates, Inc. Analytical Bootstrap Methods for Censored Data ALAN D. HUTSON Division of Biostatistics,
More informationThe bootstrap. Patrick Breheny. December 6. The empirical distribution function The bootstrap
Patrick Breheny December 6 Patrick Breheny BST 764: Applied Statistical Modeling 1/21 The empirical distribution function Suppose X F, where F (x) = Pr(X x) is a distribution function, and we wish to estimate
More informationHigh-Throughput Sequencing Course. Introduction. Introduction. Multiple Testing. Biostatistics and Bioinformatics. Summer 2018
High-Throughput Sequencing Course Multiple Testing Biostatistics and Bioinformatics Summer 2018 Introduction You have previously considered the significance of a single gene Introduction You have previously
More informationOptimal exact tests for complex alternative hypotheses on cross tabulated data
Optimal exact tests for complex alternative hypotheses on cross tabulated data Daniel Yekutieli Statistics and OR Tel Aviv University CDA course 29 July 2017 Yekutieli (TAU) Optimal exact tests for complex
More informationSequential Detection. Changes: an overview. George V. Moustakides
Sequential Detection of Changes: an overview George V. Moustakides Outline Sequential hypothesis testing and Sequential detection of changes The Sequential Probability Ratio Test (SPRT) for optimum hypothesis
More informationBetter Bootstrap Confidence Intervals
by Bradley Efron University of Washington, Department of Statistics April 12, 2012 An example Suppose we wish to make inference on some parameter θ T (F ) (e.g. θ = E F X ), based on data We might suppose
More informationFormal Statement of Simple Linear Regression Model
Formal Statement of Simple Linear Regression Model Y i = β 0 + β 1 X i + ɛ i Y i value of the response variable in the i th trial β 0 and β 1 are parameters X i is a known constant, the value of the predictor
More informationStatistical Inference: Estimation and Confidence Intervals Hypothesis Testing
Statistical Inference: Estimation and Confidence Intervals Hypothesis Testing 1 In most statistics problems, we assume that the data have been generated from some unknown probability distribution. We desire
More informationA fast sampler for data simulation from spatial, and other, Markov random fields
A fast sampler for data simulation from spatial, and other, Markov random fields Andee Kaplan Iowa State University ajkaplan@iastate.edu June 22, 2017 Slides available at http://bit.ly/kaplan-phd Joint
More informationCER Prediction Uncertainty
CER Prediction Uncertainty Lessons learned from a Monte Carlo experiment with an inherently nonlinear cost estimation relationship (CER) April 30, 2007 Dr. Matthew S. Goldberg, CBO Dr. Richard Sperling,
More informationLARGE NUMBERS OF EXPLANATORY VARIABLES. H.S. Battey. WHAO-PSI, St Louis, 9 September 2018
LARGE NUMBERS OF EXPLANATORY VARIABLES HS Battey Department of Mathematics, Imperial College London WHAO-PSI, St Louis, 9 September 2018 Regression, broadly defined Response variable Y i, eg, blood pressure,
More informationHypothesis testing: theory and methods
Statistical Methods Warsaw School of Economics November 3, 2017 Statistical hypothesis is the name of any conjecture about unknown parameters of a population distribution. The hypothesis should be verifiable
More informationAnalysis of Type-II Progressively Hybrid Censored Data
Analysis of Type-II Progressively Hybrid Censored Data Debasis Kundu & Avijit Joarder Abstract The mixture of Type-I and Type-II censoring schemes, called the hybrid censoring scheme is quite common in
More informationECE 5424: Introduction to Machine Learning
ECE 5424: Introduction to Machine Learning Topics: Ensemble Methods: Bagging, Boosting PAC Learning Readings: Murphy 16.4;; Hastie 16 Stefan Lee Virginia Tech Fighting the bias-variance tradeoff Simple
More informationLifetime prediction and confidence bounds in accelerated degradation testing for lognormal response distributions with an Arrhenius rate relationship
Scholars' Mine Doctoral Dissertations Student Research & Creative Works Spring 01 Lifetime prediction and confidence bounds in accelerated degradation testing for lognormal response distributions with
More informationLet us first identify some classes of hypotheses. simple versus simple. H 0 : θ = θ 0 versus H 1 : θ = θ 1. (1) one-sided
Let us first identify some classes of hypotheses. simple versus simple H 0 : θ = θ 0 versus H 1 : θ = θ 1. (1) one-sided H 0 : θ θ 0 versus H 1 : θ > θ 0. (2) two-sided; null on extremes H 0 : θ θ 1 or
More informationAdvanced Statistical Methods: Beyond Linear Regression
Advanced Statistical Methods: Beyond Linear Regression John R. Stevens Utah State University Notes 3. Statistical Methods II Mathematics Educators Worshop 28 March 2009 1 http://www.stat.usu.edu/~jrstevens/pcmi
More informationEmpirical likelihood ratio with arbitrarily censored/truncated data by EM algorithm
Empirical likelihood ratio with arbitrarily censored/truncated data by EM algorithm Mai Zhou 1 University of Kentucky, Lexington, KY 40506 USA Summary. Empirical likelihood ratio method (Thomas and Grunkmier
More informationLecture 10: Generalized likelihood ratio test
Stat 200: Introduction to Statistical Inference Autumn 2018/19 Lecture 10: Generalized likelihood ratio test Lecturer: Art B. Owen October 25 Disclaimer: These notes have not been subjected to the usual
More informationCase study: stochastic simulation via Rademacher bootstrap
Case study: stochastic simulation via Rademacher bootstrap Maxim Raginsky December 4, 2013 In this lecture, we will look at an application of statistical learning theory to the problem of efficient stochastic
More informationA better way to bootstrap pairs
A better way to bootstrap pairs Emmanuel Flachaire GREQAM - Université de la Méditerranée CORE - Université Catholique de Louvain April 999 Abstract In this paper we are interested in heteroskedastic regression
More informationLecture Particle Filters. Magnus Wiktorsson
Lecture Particle Filters Magnus Wiktorsson Monte Carlo filters The filter recursions could only be solved for HMMs and for linear, Gaussian models. Idea: Approximate any model with a HMM. Replace p(x)
More informationPackage CCP. R topics documented: December 17, Type Package. Title Significance Tests for Canonical Correlation Analysis (CCA) Version 0.
Package CCP December 17, 2009 Type Package Title Significance Tests for Canonical Correlation Analysis (CCA) Version 0.1 Date 2009-12-14 Author Uwe Menzel Maintainer Uwe Menzel to
More informationProbability and Statistics Notes
Probability and Statistics Notes Chapter Seven Jesse Crawford Department of Mathematics Tarleton State University Spring 2011 (Tarleton State University) Chapter Seven Notes Spring 2011 1 / 42 Outline
More informationMaximum Non-extensive Entropy Block Bootstrap
Overview&Motivation MEB Simulation of the MnEBB Conclusions References Maximum Non-extensive Entropy Block Bootstrap Jan Novotny CEA, Cass Business School & CERGE-EI (with Michele Bergamelli & Giovanni
More informationQuestion. Hypothesis testing. Example. Answer: hypothesis. Test: true or not? Question. Average is not the mean! μ average. Random deviation or not?
Hypothesis testing Question Very frequently: what is the possible value of μ? Sample: we know only the average! μ average. Random deviation or not? Standard error: the measure of the random deviation.
More informationPost-exam 2 practice questions 18.05, Spring 2014
Post-exam 2 practice questions 18.05, Spring 2014 Note: This is a set of practice problems for the material that came after exam 2. In preparing for the final you should use the previous review materials,
More informationRobustness and Distribution Assumptions
Chapter 1 Robustness and Distribution Assumptions 1.1 Introduction In statistics, one often works with model assumptions, i.e., one assumes that data follow a certain model. Then one makes use of methodology
More informationEconomics 520. Lecture Note 19: Hypothesis Testing via the Neyman-Pearson Lemma CB 8.1,
Economics 520 Lecture Note 9: Hypothesis Testing via the Neyman-Pearson Lemma CB 8., 8.3.-8.3.3 Uniformly Most Powerful Tests and the Neyman-Pearson Lemma Let s return to the hypothesis testing problem
More informationChapter 7. Hypothesis Testing
Chapter 7. Hypothesis Testing Joonpyo Kim June 24, 2017 Joonpyo Kim Ch7 June 24, 2017 1 / 63 Basic Concepts of Testing Suppose that our interest centers on a random variable X which has density function
More informationPackage dhsic. R topics documented: July 27, 2017
Package dhsic July 27, 2017 Type Package Title Independence Testing via Hilbert Schmidt Independence Criterion Version 2.0 Date 2017-07-24 Author Niklas Pfister and Jonas Peters Maintainer Niklas Pfister
More informationPubh 8482: Sequential Analysis
Pubh 8482: Sequential Analysis Joseph S. Koopmeiners Division of Biostatistics University of Minnesota Week 8 P-values When reporting results, we usually report p-values in place of reporting whether or
More informationMachine Learning, Midterm Exam: Spring 2008 SOLUTIONS. Q Topic Max. Score Score. 1 Short answer questions 20.
10-601 Machine Learning, Midterm Exam: Spring 2008 Please put your name on this cover sheet If you need more room to work out your answer to a question, use the back of the page and clearly mark on the
More informationA comparison study of the nonparametric tests based on the empirical distributions
통계연구 (2015), 제 20 권제 3 호, 1-12 A comparison study of the nonparametric tests based on the empirical distributions Hyo-Il Park 1) Abstract In this study, we propose a nonparametric test based on the empirical
More informationDistributed Estimation, Information Loss and Exponential Families. Qiang Liu Department of Computer Science Dartmouth College
Distributed Estimation, Information Loss and Exponential Families Qiang Liu Department of Computer Science Dartmouth College Statistical Learning / Estimation Learning generative models from data Topic
More informationDirect Simulation Methods #2
Direct Simulation Methods #2 Econ 690 Purdue University Outline 1 A Generalized Rejection Sampling Algorithm 2 The Weighted Bootstrap 3 Importance Sampling Rejection Sampling Algorithm #2 Suppose we wish
More informationDecentralized Sequential Hypothesis Testing. Change Detection
Decentralized Sequential Hypothesis Testing & Change Detection Giorgos Fellouris, Columbia University, NY, USA George V. Moustakides, University of Patras, Greece Outline Sequential hypothesis testing
More informationFinite Population Correction Methods
Finite Population Correction Methods Moses Obiri May 5, 2017 Contents 1 Introduction 1 2 Normal-based Confidence Interval 2 3 Bootstrap Confidence Interval 3 4 Finite Population Bootstrap Sampling 5 4.1
More informationBlind Equalization via Particle Filtering
Blind Equalization via Particle Filtering Yuki Yoshida, Kazunori Hayashi, Hideaki Sakai Department of System Science, Graduate School of Informatics, Kyoto University Historical Remarks A sequential Monte
More informationLecture 21. December 19, Department of Biostatistics Johns Hopkins Bloomberg School of Public Health Johns Hopkins University.
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this
More informationDynamic Call Center Routing Policies Using Call Waiting and Agent Idle Times Online Supplement
Submitted to imanufacturing & Service Operations Management manuscript MSOM-11-370.R3 Dynamic Call Center Routing Policies Using Call Waiting and Agent Idle Times Online Supplement (Authors names blinded
More informationCUSUM(t) D data. Supplementary Figure 1: Examples of changes in linear trend and CUSUM. (a) The existence of
Supplementary Figures a b a(t) a(t) c t d t d(t) d(t) t t e f CUSUM(t) D data t CUSUM(t) D data t Supplementary Figure 1: Examples of changes in linear trend and CUSUM. (a) The existence of an abrupt jump
More informationAn Introduction to Statistical Machine Learning - Theoretical Aspects -
An Introduction to Statistical Machine Learning - Theoretical Aspects - Samy Bengio bengio@idiap.ch Dalle Molle Institute for Perceptual Artificial Intelligence (IDIAP) CP 592, rue du Simplon 4 1920 Martigny,
More informationEmpirical likelihood-based methods for the difference of two trimmed means
Empirical likelihood-based methods for the difference of two trimmed means 24.09.2012. Latvijas Universitate Contents 1 Introduction 2 Trimmed mean 3 Empirical likelihood 4 Empirical likelihood for the
More informationLecture 1: Random number generation, permutation test, and the bootstrap. August 25, 2016
Lecture 1: Random number generation, permutation test, and the bootstrap August 25, 2016 Statistical simulation 1/21 Statistical simulation (Monte Carlo) is an important part of statistical method research.
More informationA Resampling Method on Pivotal Estimating Functions
A Resampling Method on Pivotal Estimating Functions Kun Nie Biostat 277,Winter 2004 March 17, 2004 Outline Introduction A General Resampling Method Examples - Quantile Regression -Rank Regression -Simulation
More informationA Nonparametric Sequential Test for Online Randomized Experiments
A Nonparametric Sequential Test for Online Randomized Experiments Vineet Abhishek Walmart Labs Sunnyvale, CA, USA vabhishek@walmartlabs.com Shie Mannor Technion Haifa, Israel shie@ee.technion.ac.il ABSTRACT
More informationSTAT 461/561- Assignments, Year 2015
STAT 461/561- Assignments, Year 2015 This is the second set of assignment problems. When you hand in any problem, include the problem itself and its number. pdf are welcome. If so, use large fonts and
More informationECE531 Lecture 13: Sequential Detection of Discrete-Time Signals
ECE531 Lecture 13: Sequential Detection of Discrete-Time Signals D. Richard Brown III Worcester Polytechnic Institute 30-Apr-2009 Worcester Polytechnic Institute D. Richard Brown III 30-Apr-2009 1 / 32
More informationBias-Variance Tradeoff
What s learning, revisited Overfitting Generative versus Discriminative Logistic Regression Machine Learning 10701/15781 Carlos Guestrin Carnegie Mellon University September 19 th, 2007 Bias-Variance Tradeoff
More informationRobert Collins CSE586, PSU Intro to Sampling Methods
Intro to Sampling Methods CSE586 Computer Vision II Penn State Univ Topics to be Covered Monte Carlo Integration Sampling and Expected Values Inverse Transform Sampling (CDF) Ancestral Sampling Rejection
More informationHybrid Censoring; An Introduction 2
Hybrid Censoring; An Introduction 2 Debasis Kundu Department of Mathematics & Statistics Indian Institute of Technology Kanpur 23-rd November, 2010 2 This is a joint work with N. Balakrishnan Debasis Kundu
More informationBootstrap Tests: How Many Bootstraps?
Bootstrap Tests: How Many Bootstraps? Russell Davidson James G. MacKinnon GREQAM Department of Economics Centre de la Vieille Charité Queen s University 2 rue de la Charité Kingston, Ontario, Canada 13002
More informationTesting Statistical Hypotheses
E.L. Lehmann Joseph P. Romano Testing Statistical Hypotheses Third Edition 4y Springer Preface vii I Small-Sample Theory 1 1 The General Decision Problem 3 1.1 Statistical Inference and Statistical Decisions
More informationThe Goodness-of-fit Test for Gumbel Distribution: A Comparative Study
MATEMATIKA, 2012, Volume 28, Number 1, 35 48 c Department of Mathematics, UTM. The Goodness-of-fit Test for Gumbel Distribution: A Comparative Study 1 Nahdiya Zainal Abidin, 2 Mohd Bakri Adam and 3 Habshah
More informationHypothesis Testing with the Bootstrap. Noa Haas Statistics M.Sc. Seminar, Spring 2017 Bootstrap and Resampling Methods
Hypothesis Testing with the Bootstrap Noa Haas Statistics M.Sc. Seminar, Spring 2017 Bootstrap and Resampling Methods Bootstrap Hypothesis Testing A bootstrap hypothesis test starts with a test statistic
More informationComputational statistics
Computational statistics Markov Chain Monte Carlo methods Thierry Denœux March 2017 Thierry Denœux Computational statistics March 2017 1 / 71 Contents of this chapter When a target density f can be evaluated
More informationModel Selection, Estimation, and Bootstrap Smoothing. Bradley Efron Stanford University
Model Selection, Estimation, and Bootstrap Smoothing Bradley Efron Stanford University Estimation After Model Selection Usually: (a) look at data (b) choose model (linear, quad, cubic...?) (c) fit estimates
More informationSome General Types of Tests
Some General Types of Tests We may not be able to find a UMP or UMPU test in a given situation. In that case, we may use test of some general class of tests that often have good asymptotic properties.
More informationLecture Particle Filters
FMS161/MASM18 Financial Statistics November 29, 2010 Monte Carlo filters The filter recursions could only be solved for HMMs and for linear, Gaussian models. Idea: Approximate any model with a HMM. Replace
More informationMidterm exam CS 189/289, Fall 2015
Midterm exam CS 189/289, Fall 2015 You have 80 minutes for the exam. Total 100 points: 1. True/False: 36 points (18 questions, 2 points each). 2. Multiple-choice questions: 24 points (8 questions, 3 points
More informationMonte Carlo Methods. Leon Gu CSD, CMU
Monte Carlo Methods Leon Gu CSD, CMU Approximate Inference EM: y-observed variables; x-hidden variables; θ-parameters; E-step: q(x) = p(x y, θ t 1 ) M-step: θ t = arg max E q(x) [log p(y, x θ)] θ Monte
More informationLarge sample distribution for fully functional periodicity tests
Large sample distribution for fully functional periodicity tests Siegfried Hörmann Institute for Statistics Graz University of Technology Based on joint work with Piotr Kokoszka (Colorado State) and Gilles
More information6. Fractional Imputation in Survey Sampling
6. Fractional Imputation in Survey Sampling 1 Introduction Consider a finite population of N units identified by a set of indices U = {1, 2,, N} with N known. Associated with each unit i in the population
More informationComparing non-nested models in the search for new physics
Comparing non-nested models in the search for new physics Sara Algeri 1,2 In collaboration with: Prof. David van Dyk 1 and Prof. Jan Conrad 2 1 Department of Mathematics, Imperial College London, London,
More information