Discussion on Change-Points: From Sequential Detection to Biology and Back by David Siegmund

Similar documents
Testing Goodness-of-Fit for Exponential Distribution Based on Cumulative Residual Entropy

University, Tempe, Arizona, USA b Department of Mathematics and Statistics, University of New. Mexico, Albuquerque, New Mexico, USA

Online publication date: 01 March 2010 PLEASE SCROLL DOWN FOR ARTICLE

Precise Large Deviations for Sums of Negatively Dependent Random Variables with Common Long-Tailed Distributions

Communications in Algebra Publication details, including instructions for authors and subscription information:

Guangzhou, P.R. China

Online publication date: 22 March 2010

Characterizations of Student's t-distribution via regressions of order statistics George P. Yanev a ; M. Ahsanullah b a

Dissipation Function in Hyperbolic Thermoelasticity

Gilles Bourgeois a, Richard A. Cunjak a, Daniel Caissie a & Nassir El-Jabi b a Science Brunch, Department of Fisheries and Oceans, Box

The American Statistician Publication details, including instructions for authors and subscription information:

OF SCIENCE AND TECHNOLOGY, TAEJON, KOREA

Published online: 05 Oct 2006.

Derivation of SPDEs for Correlated Random Walk Transport Models in One and Two Dimensions

Online publication date: 12 January 2010

Use and Abuse of Regression

Park, Pennsylvania, USA. Full terms and conditions of use:

Diatom Research Publication details, including instructions for authors and subscription information:

Full terms and conditions of use:

Online publication date: 30 March 2011

Full terms and conditions of use:

Published online: 17 May 2012.

Tore Henriksen a & Geir Ulfstein b a Faculty of Law, University of Tromsø, Tromsø, Norway. Available online: 18 Feb 2011

The Homogeneous Markov System (HMS) as an Elastic Medium. The Three-Dimensional Case

Nacional de La Pampa, Santa Rosa, La Pampa, Argentina b Instituto de Matemática Aplicada San Luis, Consejo Nacional de Investigaciones Científicas

Version of record first published: 01 Sep 2006.

Erciyes University, Kayseri, Turkey

Open problems. Christian Berg a a Department of Mathematical Sciences, University of. Copenhagen, Copenhagen, Denmark Published online: 07 Nov 2014.

A Simple Approximate Procedure for Constructing Binomial and Poisson Tolerance Intervals

PLEASE SCROLL DOWN FOR ARTICLE

The Fourier transform of the unit step function B. L. Burrows a ; D. J. Colwell a a

G. S. Denisov a, G. V. Gusakova b & A. L. Smolyansky b a Institute of Physics, Leningrad State University, Leningrad, B-

George L. Fischer a, Thomas R. Moore b c & Robert W. Boyd b a Department of Physics and The Institute of Optics,

Geometrical optics and blackbody radiation Pablo BenÍTez ab ; Roland Winston a ;Juan C. Miñano b a

Communications in Algebra Publication details, including instructions for authors and subscription information:

Ankara, Turkey Published online: 20 Sep 2013.

A note on adaptation in garch models Gloria González-Rivera a a

CCSM: Cross correlogram spectral matching F. Van Der Meer & W. Bakker Published online: 25 Nov 2010.

MSE Performance and Minimax Regret Significance Points for a HPT Estimator when each Individual Regression Coefficient is Estimated

To cite this article: Edward E. Roskam & Jules Ellis (1992) Reaction to Other Commentaries, Multivariate Behavioral Research, 27:2,

University, Wuhan, China c College of Physical Science and Technology, Central China Normal. University, Wuhan, China Published online: 25 Apr 2014.

University of Thessaloniki, Thessaloniki, Greece

PLEASE SCROLL DOWN FOR ARTICLE

Acyclic, Cyclic and Polycyclic P n

Published online: 10 Apr 2012.

PLEASE SCROLL DOWN FOR ARTICLE

FB 4, University of Osnabrück, Osnabrück

Sequential Procedure for Testing Hypothesis about Mean of Latent Gaussian Process

Geometric View of Measurement Errors

P. C. Wason a & P. N. Johnson-Laird a a Psycholinguistics Research Unit and Department of

Asymptotically Optimal Quickest Change Detection in Distributed Sensor Systems

Projective spaces in flag varieties

PLEASE SCROLL DOWN FOR ARTICLE

PLEASE SCROLL DOWN FOR ARTICLE. Full terms and conditions of use:

A Strongly Convergent Method for Nonsmooth Convex Minimization in Hilbert Spaces

PLEASE SCROLL DOWN FOR ARTICLE

András István Fazekas a b & Éva V. Nagy c a Hungarian Power Companies Ltd., Budapest, Hungary. Available online: 29 Jun 2011

Tong University, Shanghai , China Published online: 27 May 2014.

PLEASE SCROLL DOWN FOR ARTICLE. Full terms and conditions of use:

Global Existence of Large BV Solutions in a Model of Granular Flow

Marko A. A. Boon a, John H. J. Einmahl b & Ian W. McKeague c a Department of Mathematics and Computer Science, Eindhoven

Móstoles, Madrid. Spain. Universidad de Málaga. Málaga. Spain

Avenue G. Pompidou, BP 56, La Valette du Var cédex, 83162, France

Melbourne, Victoria, 3010, Australia. To link to this article:

Online publication date: 29 April 2011

Dresden, Dresden, Germany Published online: 09 Jan 2009.

PLEASE SCROLL DOWN FOR ARTICLE

Fokker-Planck Solution for a Neuronal Spiking Model Derek J. Daniel a a

To link to this article:

Sports Technology Publication details, including instructions for authors and subscription information:

Xiaojun Yang a a Department of Geography, Florida State University, Tallahassee, FL32306, USA Available online: 22 Feb 2007

Osnabrück, Germany. Nanotechnology, Münster, Germany

Raman Research Institute, Bangalore, India

Uncertainty. Jayakrishnan Unnikrishnan. CSL June PhD Defense ECE Department

Astrophysical Observatory, Smithsonian Institution, PLEASE SCROLL DOWN FOR ARTICLE

PLEASE SCROLL DOWN FOR ARTICLE

35-959, Rzeszów, Poland b Institute of Computer Science, Jagiellonian University,

Fractal Dimension of Turbulent Premixed Flames Alan R. Kerstein a a

A. H. Abuzaid a, A. G. Hussin b & I. B. Mohamed a a Institute of Mathematical Sciences, University of Malaya, 50603,

PLEASE SCROLL DOWN FOR ARTICLE

Accepted author version posted online: 28 Nov 2012.

SAMPLE SIZE RE-ESTIMATION FOR ADAPTIVE SEQUENTIAL DESIGN IN CLINICAL TRIALS

Quan Wang a, Yuanxin Xi a, Zhiliang Wan a, Yanqing Lu a, Yongyuan Zhu a, Yanfeng Chen a & Naiben Ming a a National Laboratory of Solid State

MULTISTAGE AND MIXTURE PARALLEL GATEKEEPING PROCEDURES IN CLINICAL TRIALS

The use of the Normalized Difference Water Index (NDWI) in the delineation of open water features

Distribution Theory. Comparison Between Two Quantiles: The Normal and Exponential Cases

Miami, Florida, USA. Engineering, University of California, Irvine, California, USA. Science, Rehovot, Israel

INFORMATION APPROACH FOR CHANGE POINT DETECTION OF WEIBULL MODELS WITH APPLICATIONS. Tao Jiang. A Thesis

Published online: 04 Jun 2015.

Depth Measures for Multivariate Functional Data

A Hypothesis Test for the End of a Common Source Outbreak

Horizonte, MG, Brazil b Departamento de F sica e Matem tica, Universidade Federal Rural de. Pernambuco, Recife, PE, Brazil

France. Published online: 23 Apr 2007.

Early Detection of Epidemics as a Sequential Change-Point Problem

COPYRIGHTED MATERIAL CONTENTS. Preface Preface to the First Edition

Lecture Notes in Statistics 180 Edited by P. Bickel, P. Diggle, S. Fienberg, U. Gather, I. Olkin, S. Zeger

Research Article A Nonparametric Two-Sample Wald Test of Equality of Variances

Do Markov-Switching Models Capture Nonlinearities in the Data? Tests using Nonparametric Methods

Analysis of Leakage Current Mechanisms in BiFeO 3. Thin Films P. Pipinys a ; A. Rimeika a ; V. Lapeika a a

Transcription:

This article was downloaded by: [Michael Baron] On: 2 February 213, At: 21:2 Publisher: Taylor & Francis Informa Ltd Registered in England and Wales Registered Number: 172954 Registered office: Mortimer House, 37-41 Mortimer Street, London W1T 3JH, UK Sequential Analysis: Design Methods and Applications Publication details, including instructions for authors and subscription information: http://www.tandfonline.com/loi/lsqa2 Discussion on Change-Points: From Sequential Detection to Biology and Back by David Siegmund Michael Baron a a Department of Mathematical Sciences, University of Texas at Dallas, Richardson, Texas, USA To cite this article: Michael Baron (213): Discussion on Change-Points: From Sequential Detection to Biology and Back by David Siegmund, Sequential Analysis: Design Methods and Applications, 32:1, 15-18 To link to this article: http://dx.doi.org/1.18/7474946.213.751837 PLEASE SCROLL DOWN FOR ARTICLE Full terms and conditions of use: http://www.tandfonline.com/page/terms-and-conditions This article may be used for research, teaching, and private study purposes. Any substantial or systematic reproduction, redistribution, reselling, loan, sub-licensing, systematic supply, or distribution in any form to anyone is expressly forbidden. The publisher does not give any warranty express or implied or make any representation that the contents will be complete or accurate or up to date. The accuracy of any instructions, formulae, and drug doses should be independently verified with primary sources. The publisher shall not be liable for any loss, actions, claims, proceedings, demand, or costs or damages whatsoever or howsoever caused arising directly or indirectly in connection with or arising out of the use of this material.

Sequential Analysis, 32: 15 18, 213 Copyright Taylor & Francis Group, LLC ISSN: 747-4946 print/1532-4176 online DOI: 1.18/7474946.213.751837 Discussion on Change-Points: From Sequential Detection to Biology and Back by David Siegmund Michael Baron Department of Mathematical Sciences, University of Texas at Dallas, Richardson, Texas, USA Abstract: Motivated by Professor Siegmund s article and his other works in the area of change-point analysis and its biomedical applications, we discuss approaches to change-point related problems that involve parameters of unknown dimension. Keywords: Change point; CUSUM; Error rate; Multiple sequences; Unknown dimension. Subject Classifications: 62L1; 62L15; 62P1. 1. INTRODUCTION In March 1993, the Applied Change Point Conference hosted by my graduate school at the University of Maryland Baltimore County (UMBC) discussed the latest 2th century applications of change-point analysis to diverse fields, from system reliability to baseball (Ahsanullah et al., 1995). It is amazing how quickly this range of applications developed and expanded to include rather complex (21st century) models arising in biology and medicine that are presented in Professor Siegmund s inspiring and thought-provoking paper. Statistical analysis of complicated stochastic models often meets a challenge when it involves estimation of parameters of unknown dimensions. Without a proper penalty for overfitting (such as, e.g., Chen and Gupta (1997) or Harchaoui and Lévy-Leduc (21)), conventional methods tend to overestimate the number of unknown parameters. The choice of a penalty is typically subjective; however, it has a major effect on the estimation results. Here we discuss two unknown dimension problems that appear in the context of change-point estimation in genetic mapping. One of them is estimation of multiple Received July 28, 212, Revised October 5, 212, Accepted October 6, 212 Recommended by A. G. Tartakovsky Address correspondence to M. Baron, Department of Mathematical Sciences, University of Texas at Dallas, FO-35, 8 W. Campbell Rd., Richardson, TX 758, USA; Fax: +1 (972) 883-6622; E-mail: mbaron@ utdallas.edu

16 Baron change-points, for example, in the case of multiple variant intervals, where the number of variant intervals is a priori unknown. The other problem is about multiple channels such as DNA sequences, an unknown portion of which experience a change-point. 2. MULTIPLE CHANGE POINTS In DNA sequencing as well as quality control, psychology, economics, finance, and other areas, the observed process may experience several change-points. In this case, the maximum likelihood routines (Fu and Curnow, 199) including the Vierbi algorithm (Rabiner, 1989) tend to detect too many false change-points. Indeed, the likelihood may only increase when the parameter space is expanded. On the other hand, the binary segmentation scheme (Vostrikova, 1981) tends to miss some of the change-points because it assumes only one change-point at each step. It is tempting to impose constraints on the dispersion of change-points and to use restricted maximum likelihood estimation methods under deterministic constraints or Bayesian computation (Chib, 1998) under a prior distribution. These constraints turn out to have a strong impact on the estimation result and, typically, the procedure will detect as many change-points as it is allowed. Thus, the best option seems to be detecting the multiple change-points sequentially, or one at a time, as mentioned by Professor Siegmund. No wrong assumptions have to be taken, each change-point has a chance to be detected, and the nuisance parameters are estimated using the corresponding blocks of data between the estimated change points. Noticeably, these parameters are likely to be estimated from contaminated data. Initially detected change points are not estimated accurately and, thus, the pre- and post-change parameters may be estimated from subsamples that inadvertently overlap with the adjacent inter-change-point segments. For this reason, it is even possible to miss a change-point without ever detecting it (Baron, 2). A solution is to refine the set of estimated change-points and nuisance parameters iteratively. During this process, similar adjacent segments may be merged, if a change-point separating them is not found to be significant at some step. In a reverse situation, a segment may be divided, generating a new change-point, if such a split is found to create two significantly different distributions. Hopefully, a more accurately estimated set of multiple change-points is computed at each iteration, and less contamination appears in the estimation of nuisance parameters (Baron, 24). This, however, cannot be guaranteed, as long as the underlying distributions have overlapping supports. 3. MULTIPLE SEQUENCES WITH POSSIBLE CHANGE POINTS The problem of change-point detection in multiple DNA sequences is certainly important and interesting but it also contains a challenge. Any of N sequences may have a variant interval that marks two change-points, 1 and 2. Then, in the presence of nuisance parameters, it is not clear which sequences should be modeled with one parameter (no change-point) and which ones with two parameters (one before 1 and after 2 and another between 1 and 2 ). If one allows the maximum likelihood method to choose between these two options, it should always support the latter, simply because the maximum over a larger set is higher.

Discussion on Change-Points 17 Different approaches have been proposed for this problem. Tartakovsky et al. (23) and Tartakovsky and Veeravalli (24) allowed the occurrence of a changepoint in exactly one of N sequences. More generally, Professor Siegmund assumes a known fraction p of sequences with a change-point. In the joint likelihood, p plays the role of a prior probability for each sequence to have a change-point. The asymptotic behavior of the resulting change-point estimator is interesting. It is generally known that no consistent change-point estimator exists, in the current setting, where the change-point is defined as the index of the first data point from a new distribution (e.g., Hinkley, 197). On the other hand, when subsamples before and after the change-point are sufficiently large to allow consistent estimation of each nuisance parameter, the number of change-points should be estimated consistently, so no change remains unnoticed. Intuitively, since large data sets should dominate over the prior distribution, the fraction of sequences with detected change-points should converge to the true proportion. Perhaps for this reason the procedure is robust with respect to the choice of p, as mentioned by Professor Siegmund. Then, if the limiting proportion of detected change-points differs from p, does it contradict the initial choice of p? Seeking an alternative method that is not based on arbitrary assumptions, one might recall that the cumulative sum (CUSUM) procedure itself was derived by Page (1954) as a result of a sequence of likelihood ratio tests. Page s CUSUM algorithm reports a change-point the first time when the no-change null hypothesis is rejected. Furthermore, as seen from Theorem 4.7 of Shiryaev (1978), the Bayes stopping rule for change-point detection under the geometric prior can also be obtained from a sequence of Bayes tests as the moment of the first rejection of the no-change hypothesis. The same can be stated about its limiting case, the Shiryaev-Roberts procedure (Roberts, 1966), that can be obtained from a sequence of Bayes tests under the non informative generalized prior. Applying Page s idea to multiple sequences, one may test multiple hypotheses of no change in each sequence and define the stopping time to be the first moment of (at least one) rejection. For the set of N parallel sequences, there are N hypotheses, 1 2 1 m and n such that t n = n + n I 1 <t 2, for n = 1 N. Two directions can be considered here. A simpler problem arises when one is interested in the time of changes only. Then it suffices to test one composite hypothesis H = H n ; that is, no change H n t n = n for all t 1 m vs. H n A in any sequence so far, vs. H A = H n A ; that is, a change has occurred in the distribution of at least one sequence. In fact, the maximum log-likelihood ratio proposed in Tartakovsky et al. (23) can be viewed as a test statistic for this H yielding significance level N when each individual Wald s stopping boundary is based on level. As briefly mentioned by Professor Siegmund, one may also be interested in finding which sequences experienced change-points. A composite hypothesis H will no longer serve this purpose. Instead, N individual hypotheses H n are to be tested sequentially until a decision is made on the occurrence of a change in each of them. The needed multiple testing methodology is well developed for non-sequential data, although only a few algorithms are designed for sequential experiments. The methods of Bartroff and Lai (21) or De and Baron (212a,b) can be used to test H n n= 1 N controlling the family-wise error across N sequences. Application of such sequential multiple testing will detect, with a certain false alarm

18 Baron rate, a change and identify, with a certain family-wise error rate, the sequences that are affected by this change. In practice, for large N, it may suffice to determine only a portion of sequences that experienced a change. ACKNOWLEDGMENTS The author greatly appreciates his research funding by the National Science Foundation (grant DMS 17775) and the National Security Agency (grant H9823-11-1-147). REFERENCES Ahsanullah, M., Rukhin, A. L., and Sinha, B. (1995). Applied Change Point Problems in Statistics, New York: Nova Science Publishers. Baron, M. (2). Nonparametric Adaptive Change-Point Estimation and On-Line Detection, Sequential Analysis 19: 1 23. Baron, M. (24). Sequential Methods for Multistate Processes, in Applications of Sequential Methodologies, N. Mukhopadhyay, S. Datta, and S. Chattopadhyay, eds., pp. 55 73, New York: Marcel Dekker. Bartroff, J. and Lai, T.-L. (21). Multistage Tests of Multiple Hypotheses, Communications in Statistics - Theory and Methods 39: 1597 167. Chen, J. and Gupta, A. K. (1997). Testing and Locating Variance Changepoints with Application to Stock Prices, Journal of American Statistical Association 92: 739 747. Chib, S. (1998). Estimation and Comparison of Multiple Change-Point Models, Journal of Econometrics 86: 221 241. De, S. and Baron, M. (212a). Sequential Bonferroni Methods for Multiple Hypothesis Testing with Strong Control of Familywise Error Rates I and II, Sequential Analysis 31: 238 262. De, S. and Baron, M. (212b). Step-Up and Step-Down Methods for Testing Multiple Hypotheses in Sequential Experiments, Journal of Statistical Planning and Inference 142: 259 27. Fu, Y.-X. and Curnow, R. N. (199). Maximum Likelihood Estimation of Multiple Change Points, Biometrika 77: 563 573. Harchaoui, Z. and Lévy-Leduc, C. (21). Multiple Change-Point Estimation with a Total Variation Penalty, Journal of American Statistical Association 15: 148 1493. Hinkley, D. V. (197). Inference about the Change-Point in a Sequence of Random Variables, Biometrika 57: 1 17. Page, E. S. (1954). Continuous Inspection Schemes, Biometrika 41: 1 115. Rabiner, L. R. (1989). A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition, IEEE Proceedings 77: 257 285. Roberts, S. W. (1966). A Comparison of Some Control Chart Procedures, Technometrics 8: 411 43. Shiryaev, A. N. (1978). Optimal Stopping Rules, New York: Springer-Verlag. Tartakovsky, A. G., Li, X. R., and Yaralov, G. (23). Sequential Detection of Targets in Multichannel Systems, IEEE Transactions on Information Theory 49: 425 445. Tartakovsky, A. G. and Veeravalli, V. V. (24). Change-Point Detection in Multichannel and Distributed Systems with Applications, in Applications of Sequential Methodologies, N. Mukhopadhyay, S. Datta, and S. Chattopadhyay, eds., pp. 339 37, New York: Marcel Dekker. Vostrikova, L. J. (1981). Detecting Disorder in Multidimensional Random Processes, Soviet Mathematics Doklady 24: 55 59.