Pruscha: Semiparametric Point Process and Time Series Models for Series of Events
|
|
- Merry Stewart
- 6 years ago
- Views:
Transcription
1 Pruscha: Semiparametric Point Process and ime Series Models for Series of Events Sonderforschungsbereich 386, Paper 4 (998) Online unter: Projektpartner
2 Semiparametric Point Process and ime Series Models for Series of Events Helmut Pruscha University of Munich, Mathematical Institute, heresienstr. 39 D Munich, Summary: We are dealing with series of events occurring at random times n and carrying further quantitive information n. Examples are sequences of extrasystoles in ECG-records. We will present two approaches for analyzing such (typically long) sequences ( n n ), n = :::. (i) A point process model is based on an intensity of the form (t) b t (), t 0, with b t astochastic intensity of the self-exciting type. (ii) A time series approach is based on a transitional GLM. he conditional expectation of the waiting time n+ = n+ ; n is set to be ( n ) h( n ()), with h a response function and n a regression term. he deterministic functions and, respectively, describe the long-term trend of the process. Keywords: Semiparametric estimation rend function Penalized least squares Point process Intensity process GLM-based time series ECG-data Introduction Series of events can be analyzed in a dual way, namely by establishing point process models for the sequence ::: of occurrence times time series models for the sequence ::: of the lengths of time intervals between events ( n = n ; n;, 0 =0) (see Cox & Lewis, 966). In the present paper, we employ intensity based point process models of the self-exciting type (Hawkes, 97) GLM-based time series models of the transitional type (Zeger & Qaqish, 988 Fahrmeir & utz, 994) In both cases the model equation is factorized into a deterministic trend function and into a stochastic part describing the short-term oscillation. As in many semiparametric problems a major task is to separate the two factors by the estimation procedure. In our situation of one single realization over a long time interval we will need a kind of ergodic behaviour of the process. he trend function is estimated by penalized least squares (l.s.) methods based on aggregated data, gained byanaveraging procedure over longer time intervals (preserving enough short-term information). For the parameter of the stochastic component we will then employ likelihood methods, thus reversing the
3 order of estimation known from partial likelihood approaches in survival time analysis. In addition to the occurrence times n metrically scaled marks n are observed, forming p-dimensional vectors and playing the role of covariates. he results will show that both approaches, the time series and the point process approach, yield satisfactory trend estimations and give insight into the inner dynamics of the process. Semiparametric Point Process Model he sequence n, n, of occurrence times is described by anintensity process t, t 0, in the multiplicative form t = (t)b t () IR d () with a deterministic and smooth function and a stochastic intensity process b. he latter is modelled in a self-exciting form by b t = b (n) (t ) =f(w n t; n ) for t ( n n+ ] () with a regression term w n being iteratively dened by w n = u(w n; ( n n )): Ex.. f(w s) =w u(w (s x)) = e ;s w + e > x we are faced with a piecewise constant intensity function b t (Pruscha, 983). With w 0 = equation () can be written in the closed form b (n) (t ) = nx i=0 n;i e ;(n; i) e > i 0 =0 0 =0: Ex.. f(w s) = + e ;s w u(w (s x)) = e ;s w + e > x the intensity function b t is of the form of Hawkes's self-exciting point process (Hawkes, 97). In fact, with w 0 =0we can write equation () as b (n) (t ) = + nx i= e ;(t; i) e > i : We will close this section by deriving the conditional expectation of the waiting times n = n ; n;. In terms of the intensity, the transition probabilities for n+, given the information F n =( ::: n n ) of the past, can be written by IP( n+ s jf n )=; exp(; n n+s) where s t = R t s u du. Conseqently, the conditional expectation of n+ is IE( n+ jf n )= Z 0 exp(; n n+s)ds : (3) Assuming (for the moment) a piecewise constant intensityon( n n+ ], say (n),weget = (n) on the ride hand side of (3): the factors in the following model equation (4) will stand in a reciprocal relation to those in ().
4 3 Semiparametric ime Series Model he sequence n, n, of the waiting times between events is now analyzed within the framework of time series. Let F n =( ::: n n ) comprise the information up to time n as above. hen the conditional expectation of n+, given F n,ismodelled by a transitional GLM of the form IE( n+ jf n )=( n )h( n ()) (4) with a smooth, deterministic trend component, a suitable response function h, and a regression term n iteratively dened by n = n; + n + G( > n ): Ex.. h() = or h() ==, with 0 =, and nonnegative, G(x) = exp(x) or G(x) =x, in the latter case with a side condition > >0 Ex.. h() =e, with 0 =0,G(x) =x. Observe that in both cases positivity ofh() is guaranteed. Zeger and Qaqish (988,.(iv)) used h() == and put = n (instead n ) in the regression term n. hese types of response functions h are in fact suggested by the two examples in sec.. Assuming for simplicity = then in the case of Ex. in sec. equation (3) yields =w n, leading to the choice h() == in the case of Ex. we getc exp(;c w n ) after some calculations, proposing the response function h() = exp(). 4 Nonparametric Estimation of the rend In order to estimate or a method of rough averaging is suggested. o this end, we divide the observation period (0 ]into K subintervals I j =(u j; u j ] of length j = u j ; u j;, j = ::: K, where K is assumed to be considerably smaller than the observed number N of events in (0 ]. We are going to base the l.s. function on the dierence Y j ; IEY j, where Y j is the rate of occurrence of events or the averaged length of time intervals between events, respectively, in the interval I j. Let for the following N j = N uj ; N uj; be the number of events in I j 4. Point process model Putting Y j = N j j wehave on the basis of model () IEY j = j Z uj u j; IE s ds = (t j) Z uj IEb s () ds j u j; for some t j [u j; u j ]. We assume, that the approximation (t j) (t j ) t j = (u j; + u j ) (5) 3
5 is possible, as well as, for larger j = u j ; u j;, Z uj IEb s () ds b () () (6) j u j; where b () () is independent ofj. Putting a(t) =(t) b () (), and using the approximations (5) and (6), we can write IEY j a(t j ). In Ex. of sec. the limit intensity value is identied as (Pruscha, 997) b () ( ) = ; () where () = a.s.- lim( n P n i= i ). In Ex. of sec. the limit intensity is ; exp(; () ). 4. ime series model We divide the interval (0 ], = N,asabove and dene Y j = j N j. Assuming for the moment that I j =( k k+ ], i.e. N j =, for some k = k(j), we have on the basis of model (4) IEY j = X i= IE( k+i )=(t j) X i= IEh( k+i; ()) for some t j Ij. By means of approximations similar to (5) and (6) above, namely (t ) (t j j), t j = (u j; + u j ), and, for larger, X i= IEh( k+i; ()) h () () (7) we arrive atiey j a(t j ), where we nowhave a(t) =(t) h () (). For the regression term n and the examples given in sec. 3, explicit formulas for h () () can be given, assuming the existence of the Cesaro limits (), (), (), G () () of the sequences n, n, n, G( > n ). Indeed, we rst have forjj < () () = ; () + G () () (8) where G () () = > () in the case G(x) =x. hen for the examples in sec. 3 Ex. (with h() =): h () () = () () Ex. (with h() =e ): If centered variables i and i are used, we get () =0 from (8), such thatwehave h () () + () = for all i.e. we can assume that the side condition (9) below is automatically fullled. 4.3 An asymptotic set-up justifying the various approximations will be given below in sec 6. We now dene for both models the penalized l.s. criterion (a) = SSE(a)+ K H () (a), with >0 a smoothing parameter and with SSE(a) =(Y ; a) > W (Y ; a) H () (a) = 4 Z 0 (a 00 (s)) ds
6 where (Y ; a) isak {vector having components Y j ; a(t j ) and where W is an appropriate K K{weight matrix. It is well known, that (a) = min is solved by natural cubic spline functions a (Green and Silverman, 994). Note that or can be estimated only in the form a = b () or a = h (), where the second factor does not depend on t, buton. 5 Parametric Estimation For the parametric part of the problem we apply likelihood methods. Maximization of the log-likelihood function l (), IR d, = N,isdoneby plugging into l () the spline solution ^a from sec. 4, leading to a function ^l (), and then solving ^l () = max under b () () =orh () () =: (9) he side condition should guarantee that the two factors and b in () or and h in (4) can be separated by the estimation procedure. Point process model: he log-likelihood function of a realization ( ::: N )ofthe point process has the form l () = NX i= log(( i )b (i;) ( i )) ; where we have to insert ^(t) =^a(t)=b () (). Z i i;! (s)b (i;) (s ) ds ime series model: Assuming that the i 's are (conditionally) exponentially distributed, the log-likelihood function of a time series realization ( ::: N ) is given by P N i= log( i; exp(; i; i )), where i ==(( i ) h( i )) cf (4). We arrive at l () = NX i= ; log ( i; )h( i; ()) ; i ( i; )h( i; ) Plugging in the solution ^(t) =^a(t)=h () () from sec. 4 and neglecting irrelevant terms we areledtoafunction^l () which can (under the side condition h () () =) be interpreted as the log-likelihood function of a realization (^ ::: ^ N ), with ^ n = n =^a( n; ), fullling the (purely parametric) model equation IE(^ n+ jf n )=h( n ()): (0) For the transitional GLM (0) quasi-likelihood methods from GLM software can be employed.! : 6 Asymptotic Set-up 6. rend function he following set-up is known from nonparametric regression (Eubank, 988) and from non-stationary time series analysis (Dahlhaus, 996). Let (t), t [0 ], be a positive 5
7 function with a continous derivative dene for >0 t (t) = t [0 ]: Let further I j =(u j; u j ], j = :::, be a division of (0 ]into subintervals of length j. Putting (0) =min j j, () =max j j,thenwe consider limits of the kind! (0) ()!! 0: () Since for s s 0 I j j (s) ; (s 0 )j j max j 0 (t)j () I j max j 0 (t)j [0 ] which tends to 0 under (), the setting (5) is established. An analogous device is used for the trend function (t) = (t= ), = N, in the time series case. 6. Ergodic properties Point process model: For each >0, the counting process N t, t [0 ], with intensity process t = (t) b t () t [0 ], may full the ergodic law Z IE b s () ds! b () () I denotes an interval of length. hen the ap- for limits of the kind (), where I proximation (6) is justied. ime series model: For each N IN thevariables n, n f ::: Ng, satisfying the equation IE N ( n+ jf n )= ( n )h( n ()) ( = N ), may full the ergodic law n N X ii N IE N h( i ())! h () () where I N comprises n N adjacent integers. hen the approximation (7) is justied. A more detailed analysis will be given elsewhere. 7 Application Data sets on long-term ECGs of patients suering from heart arrhythmias consist of the occurrence times n of ventricular extrasystoles. he covariates n are the strength of these events measured as the relative deviation from the normal beat (see Pruscha, Ulm & Schmid, 997). As an example wechoose a patient with 74 extrasystoles within 6
8 a 0-hours observation period. he reciprocal relationship of the trend functions (t) and (t) becomes apparent in the plot of Fig.. Parameters were estimated from the data of 48 patients 5 persons died of a later sudden heart attack, while 33 survived. We used point process model (), Ex., with = and normalized i 's the mean cluster size m c is also calculated from the estimated parameters, according to a cluster process representation by Hawkes&Oakes (974) ( will be denoted by PP ) time series model (4), Ex.., employing S+,glm,quasi(link=log),dispersion estimated for equation (0), with standardized i 's, ( will here be denoted by S ) he able shows the tendency, that larger positive PP -values and smaller mean clustering sizes m c correspond to smaller S -values and smaller autocorrelations r^ (), where ^ i = i =^a( i; ) as in (0). Patient N PP m c S r^ () q^ ^h() ODJFZ OVXSBH PFQWGU OEMQL he data of the 48 patients, labelled by their status =survival or =death, were submitted to a binary logistic regression, with x = Sz, y = p m c ; andz = VES/h as regressors. Hereby, VES/h is the average rate of extrasystoles per hour, and Sz is the standard deviation of the time interval lengths between those two heart beats being qualied as normal beats (not being designated as extrasystoles), vgl. ab. of Pruscha, Ulm & Schmid (997). hen in the scatter diagram of Fig., the(x 0 y 0 ) values of the 48 patients were plotted, with x 0 y 0 being the x and y values, respectively, corrected by thevariable z = VES/h on the basis of the logistic regression equation. We have three distinct outlier cases from the group lying in the "area" of group. he two most extremes of them are also outliers in the evaluation system of nonlinear dynamics (Schmid et al, 996), where the variables x = sin and y = VES were used. With respect to the quality of prediction both systems perform nearly equally well. Acknowledgments his work is supported by the Deutsche Forschungsgemeinschaft (SFB 386). he ECG data were kindly placed at my disposal by K.Ulm(echnical University,Munich). he help of dipl. math. S. Siegle with respect to Fortran programming and S+ data analysis is thankfully acknowledged. 7
9 References Cox, D. R. & Lewis, P.A.W. (966). Methuen, London. he Statistical Analysis of Series of Events. Dahlhaus, R. (996). Maximum likelihood estimation and model selection for locally stationary processes. Nonparametric Statistics, 6, 7-9. Eubank, R. L. (988). Spline Smoothing and Nonparametric Regression. M. Decker, N.Y. Fahrmeir, L. & utz, G. (994). Multivariate Statistical Modelling Based on Generalized Linear Models. Springer-Verlag, N.Y. Green, P. J. & Silverman, B. W. (994). Nonparametric Regression and Generalized Linear Models. Chapman and Hall, London. Hawkes, A. G. (97). Spectra of some self-exciting point processes. Biometrika, 58, Hawkes, A.G. & Oakes, D. (974). A cluster process representation of a self-exciting process. J. Appl. Prob.,, Pruscha, H. (983). Learning models with continuous time parameter and multivariate point processes. J. Appl. Prob., 0, Pruscha, H. (997). Semiparametric estimation in regression models for point processes based on one realization. Proceedings of the th International Workshop on Statistical Modelling, Biel 997, Pruscha, H., Ulm, K. & Schmid, G. (997). Statistische Analyse des Einusses von Herzrythmus-Storungen auf das Mortalitatsrisiko. Discussion papers, 56, SFB 386 Munchen. Schmid, G., Morll, G. E. et al. (996). Variabilityof ventricular premature complexes and mortality risk. PACE, 9, Zeger, S. L. & Qaqish, B. (988). Markov regression models for time series: a quasi likelihood approach. Biometrics, 44, Legends to Figs. Fig. (above) Extrasystoles within a 0-hours ECG record of a patient (OVXSBH). he n = 74 time points of occurrence are marked on the time scale. he occurrence frequencies (triangles) and the mean waiting times (circles) were plotted over the j = ::: 0 hours, together with the estimated trend functions and, respectively. Fig. (below) Scatterplot of the variable Cluster Size over the variable Standard Deviation, both corrected by VES/h on the basis of a logistic regression equation, for 48 patients. 8
10 Extrasystoles in a 0-Hours ECG Interval Frequency (.0) (53) Inverse of Interval Frequency ime [h] Plot of 48 Patients Suffering from Cardiac Arrythmias ClusterSize corrected by VES/h = Survived (33) = Sudden Cardiac Death (5) StandDev corrected by VES/h 9
Pruscha: Semiparametric Estimation in Regression Models for Point Processes based on One Realization
Pruscha: Semiparametric Estimation in Regression Models for Point Processes based on One Realization Sonderforschungsbereich 386, Paper 66 (1997) Online unter: http://epub.ub.uni-muenchen.de/ Projektpartner
More informationPruscha, Göttlein: Regression Analysis for Forest Inventory Data with Time and Space Dependencies
Pruscha, Göttlein: Regression Analysis for Forest Inventory Data with Time and Space Dependencies Sonderforschungsbereich 386, Paper 173 (1999) Online unter: http://epub.ub.uni-muenchen.de/ Projektpartner
More informationPENALIZED LIKELIHOOD PARAMETER ESTIMATION FOR ADDITIVE HAZARD MODELS WITH INTERVAL CENSORED DATA
PENALIZED LIKELIHOOD PARAMETER ESTIMATION FOR ADDITIVE HAZARD MODELS WITH INTERVAL CENSORED DATA Kasun Rathnayake ; A/Prof Jun Ma Department of Statistics Faculty of Science and Engineering Macquarie University
More informationKüchenhoff: An exact algorithm for estimating breakpoints in segmented generalized linear models
Küchenhoff: An exact algorithm for estimating breakpoints in segmented generalized linear models Sonderforschungsbereich 386, Paper 27 (996) Online unter: http://epububuni-muenchende/ Projektpartner An
More informationFahrmeir: Discrete failure time models
Fahrmeir: Discrete failure time models Sonderforschungsbereich 386, Paper 91 (1997) Online unter: http://epub.ub.uni-muenchen.de/ Projektpartner Discrete failure time models Ludwig Fahrmeir, Universitat
More informationModelling geoadditive survival data
Modelling geoadditive survival data Thomas Kneib & Ludwig Fahrmeir Department of Statistics, Ludwig-Maximilians-University Munich 1. Leukemia survival data 2. Structured hazard regression 3. Mixed model
More informationRegularization in Cox Frailty Models
Regularization in Cox Frailty Models Andreas Groll 1, Trevor Hastie 2, Gerhard Tutz 3 1 Ludwig-Maximilians-Universität Munich, Department of Mathematics, Theresienstraße 39, 80333 Munich, Germany 2 University
More informationAnalysing geoadditive regression data: a mixed model approach
Analysing geoadditive regression data: a mixed model approach Institut für Statistik, Ludwig-Maximilians-Universität München Joint work with Ludwig Fahrmeir & Stefan Lang 25.11.2005 Spatio-temporal regression
More informationAnalysis of competing risks data and simulation of data following predened subdistribution hazards
Analysis of competing risks data and simulation of data following predened subdistribution hazards Bernhard Haller Institut für Medizinische Statistik und Epidemiologie Technische Universität München 27.05.2013
More informationAugustin: Some Basic Results on the Extension of Quasi-Likelihood Based Measurement Error Correction to Multivariate and Flexible Structural Models
Augustin: Some Basic Results on the Extension of Quasi-Likelihood Based Measurement Error Correction to Multivariate and Flexible Structural Models Sonderforschungsbereich 386, Paper 196 (2000) Online
More informationVariable Selection for Generalized Additive Mixed Models by Likelihood-based Boosting
Variable Selection for Generalized Additive Mixed Models by Likelihood-based Boosting Andreas Groll 1 and Gerhard Tutz 2 1 Department of Statistics, University of Munich, Akademiestrasse 1, D-80799, Munich,
More informationInversion Base Height. Daggot Pressure Gradient Visibility (miles)
Stanford University June 2, 1998 Bayesian Backtting: 1 Bayesian Backtting Trevor Hastie Stanford University Rob Tibshirani University of Toronto Email: trevor@stat.stanford.edu Ftp: stat.stanford.edu:
More informationFahrmeir: Recent Advances in Semiparametric Bayesian Function Estimation
Fahrmeir: Recent Advances in Semiparametric Bayesian Function Estimation Sonderforschungsbereich 386, Paper 137 (1998) Online unter: http://epub.ub.uni-muenchen.de/ Projektpartner Recent Advances in Semiparametric
More informationOn prediction and density estimation Peter McCullagh University of Chicago December 2004
On prediction and density estimation Peter McCullagh University of Chicago December 2004 Summary Having observed the initial segment of a random sequence, subsequent values may be predicted by calculating
More informationA general mixed model approach for spatio-temporal regression data
A general mixed model approach for spatio-temporal regression data Thomas Kneib, Ludwig Fahrmeir & Stefan Lang Department of Statistics, Ludwig-Maximilians-University Munich 1. Spatio-temporal regression
More informationKneib, Fahrmeir: Supplement to "Structured additive regression for categorical space-time data: A mixed model approach"
Kneib, Fahrmeir: Supplement to "Structured additive regression for categorical space-time data: A mixed model approach" Sonderforschungsbereich 386, Paper 43 (25) Online unter: http://epub.ub.uni-muenchen.de/
More informationRegression with Numerical Optimization. Logistic
CSG220 Machine Learning Fall 2008 Regression with Numerical Optimization. Logistic regression Regression with Numerical Optimization. Logistic regression based on a document by Andrew Ng October 3, 204
More informationFrailty Models and Copulas: Similarities and Differences
Frailty Models and Copulas: Similarities and Differences KLARA GOETHALS, PAUL JANSSEN & LUC DUCHATEAU Department of Physiology and Biometrics, Ghent University, Belgium; Center for Statistics, Hasselt
More informationGeneralized Additive Models
By Trevor Hastie and R. Tibshirani Regression models play an important role in many applied settings, by enabling predictive analysis, revealing classification rules, and providing data-analytic tools
More informationStatistics 203: Introduction to Regression and Analysis of Variance Course review
Statistics 203: Introduction to Regression and Analysis of Variance Course review Jonathan Taylor - p. 1/?? Today Review / overview of what we learned. - p. 2/?? General themes in regression models Specifying
More informationECE521 week 3: 23/26 January 2017
ECE521 week 3: 23/26 January 2017 Outline Probabilistic interpretation of linear regression - Maximum likelihood estimation (MLE) - Maximum a posteriori (MAP) estimation Bias-variance trade-off Linear
More informationExpressions for the covariance matrix of covariance data
Expressions for the covariance matrix of covariance data Torsten Söderström Division of Systems and Control, Department of Information Technology, Uppsala University, P O Box 337, SE-7505 Uppsala, Sweden
More informationConsider Table 1 (Note connection to start-stop process).
Discrete-Time Data and Models Discretized duration data are still duration data! Consider Table 1 (Note connection to start-stop process). Table 1: Example of Discrete-Time Event History Data Case Event
More informationUsing Estimating Equations for Spatially Correlated A
Using Estimating Equations for Spatially Correlated Areal Data December 8, 2009 Introduction GEEs Spatial Estimating Equations Implementation Simulation Conclusion Typical Problem Assess the relationship
More informationBias-corrected AIC for selecting variables in Poisson regression models
Bias-corrected AIC for selecting variables in Poisson regression models Ken-ichi Kamo (a), Hirokazu Yanagihara (b) and Kenichi Satoh (c) (a) Corresponding author: Department of Liberal Arts and Sciences,
More informationdata lam=36.9 lam=6.69 lam=4.18 lam=2.92 lam=2.21 time max wavelength modulus of max wavelength cycle
AUTOREGRESSIVE LINEAR MODELS AR(1) MODELS The zero-mean AR(1) model x t = x t,1 + t is a linear regression of the current value of the time series on the previous value. For > 0 it generates positively
More informationMulti-state Models: An Overview
Multi-state Models: An Overview Andrew Titman Lancaster University 14 April 2016 Overview Introduction to multi-state modelling Examples of applications Continuously observed processes Intermittently observed
More informationRecap. Probability, stochastic processes, Markov chains. ELEC-C7210 Modeling and analysis of communication networks
Recap Probability, stochastic processes, Markov chains ELEC-C7210 Modeling and analysis of communication networks 1 Recap: Probability theory important distributions Discrete distributions Geometric distribution
More informationStatistícal Methods for Spatial Data Analysis
Texts in Statistícal Science Statistícal Methods for Spatial Data Analysis V- Oliver Schabenberger Carol A. Gotway PCT CHAPMAN & K Contents Preface xv 1 Introduction 1 1.1 The Need for Spatial Analysis
More informationCh 4. Linear Models for Classification
Ch 4. Linear Models for Classification Pattern Recognition and Machine Learning, C. M. Bishop, 2006. Department of Computer Science and Engineering Pohang University of Science and echnology 77 Cheongam-ro,
More informationLocally stationary Hawkes processes
Locally stationary Hawkes processes François Roue LTCI, CNRS, Télécom ParisTech, Université Paris-Saclay Talk based on Roue, von Sachs, and Sansonnet [2016] 1 / 40 Outline Introduction Non-stationary Hawkes
More informationMultivariate Survival Data With Censoring.
1 Multivariate Survival Data With Censoring. Shulamith Gross and Catherine Huber-Carol Baruch College of the City University of New York, Dept of Statistics and CIS, Box 11-220, 1 Baruch way, 10010 NY.
More informationDuration Analysis. Joan Llull
Duration Analysis Joan Llull Panel Data and Duration Models Barcelona GSE joan.llull [at] movebarcelona [dot] eu Introduction Duration Analysis 2 Duration analysis Duration data: how long has an individual
More informationRejoinder. 1 Phase I and Phase II Profile Monitoring. Peihua Qiu 1, Changliang Zou 2 and Zhaojun Wang 2
Rejoinder Peihua Qiu 1, Changliang Zou 2 and Zhaojun Wang 2 1 School of Statistics, University of Minnesota 2 LPMC and Department of Statistics, Nankai University, China We thank the editor Professor David
More informationA NOTE ON ROBUST ESTIMATION IN LOGISTIC REGRESSION MODEL
Discussiones Mathematicae Probability and Statistics 36 206 43 5 doi:0.75/dmps.80 A NOTE ON ROBUST ESTIMATION IN LOGISTIC REGRESSION MODEL Tadeusz Bednarski Wroclaw University e-mail: t.bednarski@prawo.uni.wroc.pl
More information* * * * * * * * * * * * * * * ** * **
Generalized Additive Models Trevor Hastie y and Robert Tibshirani z Department of Statistics and Division of Biostatistics Stanford University 12th May, 1995 Regression models play an important role in
More informationPart 8: GLMs and Hierarchical LMs and GLMs
Part 8: GLMs and Hierarchical LMs and GLMs 1 Example: Song sparrow reproductive success Arcese et al., (1992) provide data on a sample from a population of 52 female song sparrows studied over the course
More informationCharles E. McCulloch Biometrics Unit and Statistics Center Cornell University
A SURVEY OF VARIANCE COMPONENTS ESTIMATION FROM BINARY DATA by Charles E. McCulloch Biometrics Unit and Statistics Center Cornell University BU-1211-M May 1993 ABSTRACT The basic problem of variance components
More informationThe classifier. Theorem. where the min is over all possible classifiers. To calculate the Bayes classifier/bayes risk, we need to know
The Bayes classifier Theorem The classifier satisfies where the min is over all possible classifiers. To calculate the Bayes classifier/bayes risk, we need to know Alternatively, since the maximum it is
More informationThe classifier. Linear discriminant analysis (LDA) Example. Challenges for LDA
The Bayes classifier Linear discriminant analysis (LDA) Theorem The classifier satisfies In linear discriminant analysis (LDA), we make the (strong) assumption that where the min is over all possible classifiers.
More information13 Endogeneity and Nonparametric IV
13 Endogeneity and Nonparametric IV 13.1 Nonparametric Endogeneity A nonparametric IV equation is Y i = g (X i ) + e i (1) E (e i j i ) = 0 In this model, some elements of X i are potentially endogenous,
More informationClassification. Chapter Introduction. 6.2 The Bayes classifier
Chapter 6 Classification 6.1 Introduction Often encountered in applications is the situation where the response variable Y takes values in a finite set of labels. For example, the response Y could encode
More informationNon-maximum likelihood estimation and statistical inference for linear and nonlinear mixed models
Optimum Design for Mixed Effects Non-Linear and generalized Linear Models Cambridge, August 9-12, 2011 Non-maximum likelihood estimation and statistical inference for linear and nonlinear mixed models
More informationIndex. Regression Models for Time Series Analysis. Benjamin Kedem, Konstantinos Fokianos Copyright John Wiley & Sons, Inc. ISBN.
Regression Models for Time Series Analysis. Benjamin Kedem, Konstantinos Fokianos Copyright 0 2002 John Wiley & Sons, Inc. ISBN. 0-471-36355-3 Index Adaptive rejection sampling, 233 Adjacent categories
More informationHandbook of Stochastic Methods
Springer Series in Synergetics 13 Handbook of Stochastic Methods for Physics, Chemistry and the Natural Sciences von Crispin W Gardiner Neuausgabe Handbook of Stochastic Methods Gardiner schnell und portofrei
More informationPairwise rank based likelihood for estimating the relationship between two homogeneous populations and their mixture proportion
Pairwise rank based likelihood for estimating the relationship between two homogeneous populations and their mixture proportion Glenn Heller and Jing Qin Department of Epidemiology and Biostatistics Memorial
More informationStatistical Data Mining and Machine Learning Hilary Term 2016
Statistical Data Mining and Machine Learning Hilary Term 2016 Dino Sejdinovic Department of Statistics Oxford Slides and other materials available at: http://www.stats.ox.ac.uk/~sejdinov/sdmml Naïve Bayes
More informationComments on \Wavelets in Statistics: A Review" by. A. Antoniadis. Jianqing Fan. University of North Carolina, Chapel Hill
Comments on \Wavelets in Statistics: A Review" by A. Antoniadis Jianqing Fan University of North Carolina, Chapel Hill and University of California, Los Angeles I would like to congratulate Professor Antoniadis
More informationChapter 2 SOME ANALYTICAL TOOLS USED IN THE THESIS
Chapter 2 SOME ANALYTICAL TOOLS USED IN THE THESIS 63 2.1 Introduction In this chapter we describe the analytical tools used in this thesis. They are Markov Decision Processes(MDP), Markov Renewal process
More informationOne important issue in the study of queueing systems is to characterize departure processes. Study on departure processes was rst initiated by Burke (
The Departure Process of the GI/G/ Queue and Its MacLaurin Series Jian-Qiang Hu Department of Manufacturing Engineering Boston University 5 St. Mary's Street Brookline, MA 2446 Email: hqiang@bu.edu June
More informationSTA 4273H: Statistical Machine Learning
STA 4273H: Statistical Machine Learning Russ Salakhutdinov Department of Statistics! rsalakhu@utstat.toronto.edu! http://www.utstat.utoronto.ca/~rsalakhu/ Sidney Smith Hall, Room 6002 Lecture 3 Linear
More informationBIAS OF MAXIMUM-LIKELIHOOD ESTIMATES IN LOGISTIC AND COX REGRESSION MODELS: A COMPARATIVE SIMULATION STUDY
BIAS OF MAXIMUM-LIKELIHOOD ESTIMATES IN LOGISTIC AND COX REGRESSION MODELS: A COMPARATIVE SIMULATION STUDY Ingo Langner 1, Ralf Bender 2, Rebecca Lenz-Tönjes 1, Helmut Küchenhoff 2, Maria Blettner 2 1
More informationSemi-parametric estimation of non-stationary Pickands functions
Semi-parametric estimation of non-stationary Pickands functions Linda Mhalla 1 Joint work with: Valérie Chavez-Demoulin 2 and Philippe Naveau 3 1 Geneva School of Economics and Management, University of
More informationHigh-dimensional Markov Chain Models for Categorical Data Sequences with Applications Wai-Ki CHING AMACL, Department of Mathematics HKU 19 March 2013
High-dimensional Markov Chain Models for Categorical Data Sequences with Applications Wai-Ki CHING AMACL, Department of Mathematics HKU 19 March 2013 Abstract: Markov chains are popular models for a modelling
More informationLOGISTIC REGRESSION Joseph M. Hilbe
LOGISTIC REGRESSION Joseph M. Hilbe Arizona State University Logistic regression is the most common method used to model binary response data. When the response is binary, it typically takes the form of
More informationMachine Learning Lecture 5
Machine Learning Lecture 5 Linear Discriminant Functions 26.10.2017 Bastian Leibe RWTH Aachen http://www.vision.rwth-aachen.de leibe@vision.rwth-aachen.de Course Outline Fundamentals Bayes Decision Theory
More informationSTA 216: GENERALIZED LINEAR MODELS. Lecture 1. Review and Introduction. Much of statistics is based on the assumption that random
STA 216: GENERALIZED LINEAR MODELS Lecture 1. Review and Introduction Much of statistics is based on the assumption that random variables are continuous & normally distributed. Normal linear regression
More informationA comparison study of the nonparametric tests based on the empirical distributions
통계연구 (2015), 제 20 권제 3 호, 1-12 A comparison study of the nonparametric tests based on the empirical distributions Hyo-Il Park 1) Abstract In this study, we propose a nonparametric test based on the empirical
More informationYou know I m not goin diss you on the internet Cause my mama taught me better than that I m a survivor (What?) I m not goin give up (What?
You know I m not goin diss you on the internet Cause my mama taught me better than that I m a survivor (What?) I m not goin give up (What?) I m not goin stop (What?) I m goin work harder (What?) Sir David
More information( t) Cox regression part 2. Outline: Recapitulation. Estimation of cumulative hazards and survival probabilites. Ørnulf Borgan
Outline: Cox regression part 2 Ørnulf Borgan Department of Mathematics University of Oslo Recapitulation Estimation of cumulative hazards and survival probabilites Assumptions for Cox regression and check
More informationGROUPED SURVIVAL DATA. Florida State University and Medical College of Wisconsin
FITTING COX'S PROPORTIONAL HAZARDS MODEL USING GROUPED SURVIVAL DATA Ian W. McKeague and Mei-Jie Zhang Florida State University and Medical College of Wisconsin Cox's proportional hazard model is often
More informationFULL LIKELIHOOD INFERENCES IN THE COX MODEL
October 20, 2007 FULL LIKELIHOOD INFERENCES IN THE COX MODEL BY JIAN-JIAN REN 1 AND MAI ZHOU 2 University of Central Florida and University of Kentucky Abstract We use the empirical likelihood approach
More informationIntroduction to Econometrics
Introduction to Econometrics T H I R D E D I T I O N Global Edition James H. Stock Harvard University Mark W. Watson Princeton University Boston Columbus Indianapolis New York San Francisco Upper Saddle
More informationGaussian Processes. Le Song. Machine Learning II: Advanced Topics CSE 8803ML, Spring 2012
Gaussian Processes Le Song Machine Learning II: Advanced Topics CSE 8803ML, Spring 01 Pictorial view of embedding distribution Transform the entire distribution to expected features Feature space Feature
More informationEstimation of cumulative distribution function with spline functions
INTERNATIONAL JOURNAL OF ECONOMICS AND STATISTICS Volume 5, 017 Estimation of cumulative distribution function with functions Akhlitdin Nizamitdinov, Aladdin Shamilov Abstract The estimation of the cumulative
More informationSpectra of some self-exciting and mutually exciting point processes
Biometrika (1971), 58, 1, p. 83 83 Printed in Great Britain Spectra of some self-exciting and mutually exciting point processes BY ALAN G. HAWKES University of Durham SUMMAKY In recent years methods of
More informationApproximation of Survival Function by Taylor Series for General Partly Interval Censored Data
Malaysian Journal of Mathematical Sciences 11(3): 33 315 (217) MALAYSIAN JOURNAL OF MATHEMATICAL SCIENCES Journal homepage: http://einspem.upm.edu.my/journal Approximation of Survival Function by Taylor
More informationIntroduction to Machine Learning Midterm, Tues April 8
Introduction to Machine Learning 10-701 Midterm, Tues April 8 [1 point] Name: Andrew ID: Instructions: You are allowed a (two-sided) sheet of notes. Exam ends at 2:45pm Take a deep breath and don t spend
More informationToutenburg, Nittner: The Classical Linear Regression Model with one Incomplete Binary Variable
Toutenburg, Nittner: The Classical Linear Regression Model with one Incomplete Binary Variable Sonderforschungsbereich 386, Paper 178 (1999) Online unter: http://epub.ub.uni-muenchen.de/ Projektpartner
More informationModelling Non-linear and Non-stationary Time Series
Modelling Non-linear and Non-stationary Time Series Chapter 2: Non-parametric methods Henrik Madsen Advanced Time Series Analysis September 206 Henrik Madsen (02427 Adv. TS Analysis) Lecture Notes September
More informationPoint processes, temporal
Point processes, temporal David R. Brillinger, Peter M. Guttorp & Frederic Paik Schoenberg Volume 3, pp 1577 1581 in Encyclopedia of Environmetrics (ISBN 0471 899976) Edited by Abdel H. El-Shaarawi and
More informationProblems. HW problem 5.7 Math 504. Spring CSUF by Nasser Abbasi
Problems HW problem 5.7 Math 504. Spring 2008. CSUF by Nasser Abbasi 1 Problem 6.3 Part(A) Let I n be an indicator variable de ned as 1 when (n = jj I n = 0 = i) 0 otherwise Hence Now we see that E (V
More informationYour use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at
On the Estimation of the Intensity Function of a Stationary Point Process Author(s): D. R. Cox Source: Journal of the Royal Statistical Society. Series B (Methodological), Vol. 27, No. 2 (1965), pp. 332-337
More informationFrontier estimation based on extreme risk measures
Frontier estimation based on extreme risk measures by Jonathan EL METHNI in collaboration with Ste phane GIRARD & Laurent GARDES CMStatistics 2016 University of Seville December 2016 1 Risk measures 2
More informationToutenburg, Fieger: Using diagnostic measures to detect non-mcar processes in linear regression models with missing covariates
Toutenburg, Fieger: Using diagnostic measures to detect non-mcar processes in linear regression models with missing covariates Sonderforschungsbereich 386, Paper 24 (2) Online unter: http://epub.ub.uni-muenchen.de/
More informationImproving Efficiency of Inferences in Randomized Clinical Trials Using Auxiliary Covariates
Improving Efficiency of Inferences in Randomized Clinical Trials Using Auxiliary Covariates Anastasios (Butch) Tsiatis Department of Statistics North Carolina State University http://www.stat.ncsu.edu/
More informationA COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky
A COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky Empirical likelihood with right censored data were studied by Thomas and Grunkmier (1975), Li (1995),
More informationLecture 7 Time-dependent Covariates in Cox Regression
Lecture 7 Time-dependent Covariates in Cox Regression So far, we ve been considering the following Cox PH model: λ(t Z) = λ 0 (t) exp(β Z) = λ 0 (t) exp( β j Z j ) where β j is the parameter for the the
More informationBayesian Estimation and Inference for the Generalized Partial Linear Model
Bayesian Estimation Inference for the Generalized Partial Linear Model Haitham M. Yousof 1, Ahmed M. Gad 2 1 Department of Statistics, Mathematics Insurance, Benha University, Egypt. 2 Department of Statistics,
More informationWarm up. Regrade requests submitted directly in Gradescope, do not instructors.
Warm up Regrade requests submitted directly in Gradescope, do not email instructors. 1 float in NumPy = 8 bytes 10 6 2 20 bytes = 1 MB 10 9 2 30 bytes = 1 GB For each block compute the memory required
More informationEfficient Estimation for the Partially Linear Models with Random Effects
A^VÇÚO 1 33 ò 1 5 Ï 2017 c 10 Chinese Journal of Applied Probability and Statistics Oct., 2017, Vol. 33, No. 5, pp. 529-537 doi: 10.3969/j.issn.1001-4268.2017.05.009 Efficient Estimation for the Partially
More informationMachine Learning Lecture 7
Course Outline Machine Learning Lecture 7 Fundamentals (2 weeks) Bayes Decision Theory Probability Density Estimation Statistical Learning Theory 23.05.2016 Discriminative Approaches (5 weeks) Linear Discriminant
More informationChapter 2 Inference on Mean Residual Life-Overview
Chapter 2 Inference on Mean Residual Life-Overview Statistical inference based on the remaining lifetimes would be intuitively more appealing than the popular hazard function defined as the risk of immediate
More informationWarm up: risk prediction with logistic regression
Warm up: risk prediction with logistic regression Boss gives you a bunch of data on loans defaulting or not: {(x i,y i )} n i= x i 2 R d, y i 2 {, } You model the data as: P (Y = y x, w) = + exp( yw T
More informationINRIA Rh^one-Alpes. Abstract. Friedman (1989) has proposed a regularization technique (RDA) of discriminant analysis
Regularized Gaussian Discriminant Analysis through Eigenvalue Decomposition Halima Bensmail Universite Paris 6 Gilles Celeux INRIA Rh^one-Alpes Abstract Friedman (1989) has proposed a regularization technique
More informationOutline of GLMs. Definitions
Outline of GLMs Definitions This is a short outline of GLM details, adapted from the book Nonparametric Regression and Generalized Linear Models, by Green and Silverman. The responses Y i have density
More informationFahrmeir, Pritscher: Regression analysis of forest damage by marginal models for correlated ordinal responses
Fahrmeir, Pritscher: Regression analysis of forest damage by marginal models for correlated ordinal responses Sonderforschungsbereich 386, Paper 9 (1995) Online unter: http://epub.ub.uni-muenchen.de/ Projektpartner
More informationGeneralized Linear Models (GLZ)
Generalized Linear Models (GLZ) Generalized Linear Models (GLZ) are an extension of the linear modeling process that allows models to be fit to data that follow probability distributions other than the
More informationIntroduction to the Generalized Linear Model: Logistic regression and Poisson regression
Introduction to the Generalized Linear Model: Logistic regression and Poisson regression Statistical modelling: Theory and practice Gilles Guillot gigu@dtu.dk November 4, 2013 Gilles Guillot (gigu@dtu.dk)
More informationFINITE MIXTURE DISTRIBUTIONS
MONOGRAPHS ON APPLl~[) PROBABILITY AND STATISTICS FINITE MIXTURE DISTRIBUTIONS MONOGRAPHS ON APPLIED PROBABILITY AND STATISTICS General Editor D.R. COX, FRS Also available in the series Probability, Statistics
More informationPiotr Majer Risk Patterns and Correlated Brain Activities
Alena My²i ková Piotr Majer Song Song Alena Myšičková Peter N. C. Mohr Peter N. C. Mohr Wolfgang K. Härdle Song Song Hauke R. Heekeren Wolfgang K. Härdle Hauke R. Heekeren C.A.S.E. Centre C.A.S.E. for
More informationExponential martingales: uniform integrability results and applications to point processes
Exponential martingales: uniform integrability results and applications to point processes Alexander Sokol Department of Mathematical Sciences, University of Copenhagen 26 September, 2012 1 / 39 Agenda
More informationCurrie, Iain Heriot-Watt University, Department of Actuarial Mathematics & Statistics Edinburgh EH14 4AS, UK
An Introduction to Generalized Linear Array Models Currie, Iain Heriot-Watt University, Department of Actuarial Mathematics & Statistics Edinburgh EH14 4AS, UK E-mail: I.D.Currie@hw.ac.uk 1 Motivating
More informationMarkov Chains. Arnoldo Frigessi Bernd Heidergott November 4, 2015
Markov Chains Arnoldo Frigessi Bernd Heidergott November 4, 2015 1 Introduction Markov chains are stochastic models which play an important role in many applications in areas as diverse as biology, finance,
More informationGeneralized Linear Models
Generalized Linear Models David Rosenberg New York University April 12, 2015 David Rosenberg (New York University) DS-GA 1003 April 12, 2015 1 / 20 Conditional Gaussian Regression Gaussian Regression Input
More informationMarkovian Combination of Decomposable Model Structures: MCMoSt
Markovian Combination 1/45 London Math Society Durham Symposium on Mathematical Aspects of Graphical Models. June 30 - July, 2008 Markovian Combination of Decomposable Model Structures: MCMoSt Sung-Ho
More informationHigh-Throughput Sequencing Course
High-Throughput Sequencing Course DESeq Model for RNA-Seq Biostatistics and Bioinformatics Summer 2017 Outline Review: Standard linear regression model (e.g., to model gene expression as function of an
More informationModelling Survival Data using Generalized Additive Models with Flexible Link
Modelling Survival Data using Generalized Additive Models with Flexible Link Ana L. Papoila 1 and Cristina S. Rocha 2 1 Faculdade de Ciências Médicas, Dep. de Bioestatística e Informática, Universidade
More information1 The problem of survival analysis
1 The problem of survival analysis Survival analysis concerns analyzing the time to the occurrence of an event. For instance, we have a dataset in which the times are 1, 5, 9, 20, and 22. Perhaps those
More information