Pruscha: Semiparametric Point Process and Time Series Models for Series of Events

Size: px
Start display at page:

Download "Pruscha: Semiparametric Point Process and Time Series Models for Series of Events"

Transcription

1 Pruscha: Semiparametric Point Process and ime Series Models for Series of Events Sonderforschungsbereich 386, Paper 4 (998) Online unter: Projektpartner

2 Semiparametric Point Process and ime Series Models for Series of Events Helmut Pruscha University of Munich, Mathematical Institute, heresienstr. 39 D Munich, Summary: We are dealing with series of events occurring at random times n and carrying further quantitive information n. Examples are sequences of extrasystoles in ECG-records. We will present two approaches for analyzing such (typically long) sequences ( n n ), n = :::. (i) A point process model is based on an intensity of the form (t) b t (), t 0, with b t astochastic intensity of the self-exciting type. (ii) A time series approach is based on a transitional GLM. he conditional expectation of the waiting time n+ = n+ ; n is set to be ( n ) h( n ()), with h a response function and n a regression term. he deterministic functions and, respectively, describe the long-term trend of the process. Keywords: Semiparametric estimation rend function Penalized least squares Point process Intensity process GLM-based time series ECG-data Introduction Series of events can be analyzed in a dual way, namely by establishing point process models for the sequence ::: of occurrence times time series models for the sequence ::: of the lengths of time intervals between events ( n = n ; n;, 0 =0) (see Cox & Lewis, 966). In the present paper, we employ intensity based point process models of the self-exciting type (Hawkes, 97) GLM-based time series models of the transitional type (Zeger & Qaqish, 988 Fahrmeir & utz, 994) In both cases the model equation is factorized into a deterministic trend function and into a stochastic part describing the short-term oscillation. As in many semiparametric problems a major task is to separate the two factors by the estimation procedure. In our situation of one single realization over a long time interval we will need a kind of ergodic behaviour of the process. he trend function is estimated by penalized least squares (l.s.) methods based on aggregated data, gained byanaveraging procedure over longer time intervals (preserving enough short-term information). For the parameter of the stochastic component we will then employ likelihood methods, thus reversing the

3 order of estimation known from partial likelihood approaches in survival time analysis. In addition to the occurrence times n metrically scaled marks n are observed, forming p-dimensional vectors and playing the role of covariates. he results will show that both approaches, the time series and the point process approach, yield satisfactory trend estimations and give insight into the inner dynamics of the process. Semiparametric Point Process Model he sequence n, n, of occurrence times is described by anintensity process t, t 0, in the multiplicative form t = (t)b t () IR d () with a deterministic and smooth function and a stochastic intensity process b. he latter is modelled in a self-exciting form by b t = b (n) (t ) =f(w n t; n ) for t ( n n+ ] () with a regression term w n being iteratively dened by w n = u(w n; ( n n )): Ex.. f(w s) =w u(w (s x)) = e ;s w + e > x we are faced with a piecewise constant intensity function b t (Pruscha, 983). With w 0 = equation () can be written in the closed form b (n) (t ) = nx i=0 n;i e ;(n; i) e > i 0 =0 0 =0: Ex.. f(w s) = + e ;s w u(w (s x)) = e ;s w + e > x the intensity function b t is of the form of Hawkes's self-exciting point process (Hawkes, 97). In fact, with w 0 =0we can write equation () as b (n) (t ) = + nx i= e ;(t; i) e > i : We will close this section by deriving the conditional expectation of the waiting times n = n ; n;. In terms of the intensity, the transition probabilities for n+, given the information F n =( ::: n n ) of the past, can be written by IP( n+ s jf n )=; exp(; n n+s) where s t = R t s u du. Conseqently, the conditional expectation of n+ is IE( n+ jf n )= Z 0 exp(; n n+s)ds : (3) Assuming (for the moment) a piecewise constant intensityon( n n+ ], say (n),weget = (n) on the ride hand side of (3): the factors in the following model equation (4) will stand in a reciprocal relation to those in ().

4 3 Semiparametric ime Series Model he sequence n, n, of the waiting times between events is now analyzed within the framework of time series. Let F n =( ::: n n ) comprise the information up to time n as above. hen the conditional expectation of n+, given F n,ismodelled by a transitional GLM of the form IE( n+ jf n )=( n )h( n ()) (4) with a smooth, deterministic trend component, a suitable response function h, and a regression term n iteratively dened by n = n; + n + G( > n ): Ex.. h() = or h() ==, with 0 =, and nonnegative, G(x) = exp(x) or G(x) =x, in the latter case with a side condition > >0 Ex.. h() =e, with 0 =0,G(x) =x. Observe that in both cases positivity ofh() is guaranteed. Zeger and Qaqish (988,.(iv)) used h() == and put = n (instead n ) in the regression term n. hese types of response functions h are in fact suggested by the two examples in sec.. Assuming for simplicity = then in the case of Ex. in sec. equation (3) yields =w n, leading to the choice h() == in the case of Ex. we getc exp(;c w n ) after some calculations, proposing the response function h() = exp(). 4 Nonparametric Estimation of the rend In order to estimate or a method of rough averaging is suggested. o this end, we divide the observation period (0 ]into K subintervals I j =(u j; u j ] of length j = u j ; u j;, j = ::: K, where K is assumed to be considerably smaller than the observed number N of events in (0 ]. We are going to base the l.s. function on the dierence Y j ; IEY j, where Y j is the rate of occurrence of events or the averaged length of time intervals between events, respectively, in the interval I j. Let for the following N j = N uj ; N uj; be the number of events in I j 4. Point process model Putting Y j = N j j wehave on the basis of model () IEY j = j Z uj u j; IE s ds = (t j) Z uj IEb s () ds j u j; for some t j [u j; u j ]. We assume, that the approximation (t j) (t j ) t j = (u j; + u j ) (5) 3

5 is possible, as well as, for larger j = u j ; u j;, Z uj IEb s () ds b () () (6) j u j; where b () () is independent ofj. Putting a(t) =(t) b () (), and using the approximations (5) and (6), we can write IEY j a(t j ). In Ex. of sec. the limit intensity value is identied as (Pruscha, 997) b () ( ) = ; () where () = a.s.- lim( n P n i= i ). In Ex. of sec. the limit intensity is ; exp(; () ). 4. ime series model We divide the interval (0 ], = N,asabove and dene Y j = j N j. Assuming for the moment that I j =( k k+ ], i.e. N j =, for some k = k(j), we have on the basis of model (4) IEY j = X i= IE( k+i )=(t j) X i= IEh( k+i; ()) for some t j Ij. By means of approximations similar to (5) and (6) above, namely (t ) (t j j), t j = (u j; + u j ), and, for larger, X i= IEh( k+i; ()) h () () (7) we arrive atiey j a(t j ), where we nowhave a(t) =(t) h () (). For the regression term n and the examples given in sec. 3, explicit formulas for h () () can be given, assuming the existence of the Cesaro limits (), (), (), G () () of the sequences n, n, n, G( > n ). Indeed, we rst have forjj < () () = ; () + G () () (8) where G () () = > () in the case G(x) =x. hen for the examples in sec. 3 Ex. (with h() =): h () () = () () Ex. (with h() =e ): If centered variables i and i are used, we get () =0 from (8), such thatwehave h () () + () = for all i.e. we can assume that the side condition (9) below is automatically fullled. 4.3 An asymptotic set-up justifying the various approximations will be given below in sec 6. We now dene for both models the penalized l.s. criterion (a) = SSE(a)+ K H () (a), with >0 a smoothing parameter and with SSE(a) =(Y ; a) > W (Y ; a) H () (a) = 4 Z 0 (a 00 (s)) ds

6 where (Y ; a) isak {vector having components Y j ; a(t j ) and where W is an appropriate K K{weight matrix. It is well known, that (a) = min is solved by natural cubic spline functions a (Green and Silverman, 994). Note that or can be estimated only in the form a = b () or a = h (), where the second factor does not depend on t, buton. 5 Parametric Estimation For the parametric part of the problem we apply likelihood methods. Maximization of the log-likelihood function l (), IR d, = N,isdoneby plugging into l () the spline solution ^a from sec. 4, leading to a function ^l (), and then solving ^l () = max under b () () =orh () () =: (9) he side condition should guarantee that the two factors and b in () or and h in (4) can be separated by the estimation procedure. Point process model: he log-likelihood function of a realization ( ::: N )ofthe point process has the form l () = NX i= log(( i )b (i;) ( i )) ; where we have to insert ^(t) =^a(t)=b () (). Z i i;! (s)b (i;) (s ) ds ime series model: Assuming that the i 's are (conditionally) exponentially distributed, the log-likelihood function of a time series realization ( ::: N ) is given by P N i= log( i; exp(; i; i )), where i ==(( i ) h( i )) cf (4). We arrive at l () = NX i= ; log ( i; )h( i; ()) ; i ( i; )h( i; ) Plugging in the solution ^(t) =^a(t)=h () () from sec. 4 and neglecting irrelevant terms we areledtoafunction^l () which can (under the side condition h () () =) be interpreted as the log-likelihood function of a realization (^ ::: ^ N ), with ^ n = n =^a( n; ), fullling the (purely parametric) model equation IE(^ n+ jf n )=h( n ()): (0) For the transitional GLM (0) quasi-likelihood methods from GLM software can be employed.! : 6 Asymptotic Set-up 6. rend function he following set-up is known from nonparametric regression (Eubank, 988) and from non-stationary time series analysis (Dahlhaus, 996). Let (t), t [0 ], be a positive 5

7 function with a continous derivative dene for >0 t (t) = t [0 ]: Let further I j =(u j; u j ], j = :::, be a division of (0 ]into subintervals of length j. Putting (0) =min j j, () =max j j,thenwe consider limits of the kind! (0) ()!! 0: () Since for s s 0 I j j (s) ; (s 0 )j j max j 0 (t)j () I j max j 0 (t)j [0 ] which tends to 0 under (), the setting (5) is established. An analogous device is used for the trend function (t) = (t= ), = N, in the time series case. 6. Ergodic properties Point process model: For each >0, the counting process N t, t [0 ], with intensity process t = (t) b t () t [0 ], may full the ergodic law Z IE b s () ds! b () () I denotes an interval of length. hen the ap- for limits of the kind (), where I proximation (6) is justied. ime series model: For each N IN thevariables n, n f ::: Ng, satisfying the equation IE N ( n+ jf n )= ( n )h( n ()) ( = N ), may full the ergodic law n N X ii N IE N h( i ())! h () () where I N comprises n N adjacent integers. hen the approximation (7) is justied. A more detailed analysis will be given elsewhere. 7 Application Data sets on long-term ECGs of patients suering from heart arrhythmias consist of the occurrence times n of ventricular extrasystoles. he covariates n are the strength of these events measured as the relative deviation from the normal beat (see Pruscha, Ulm & Schmid, 997). As an example wechoose a patient with 74 extrasystoles within 6

8 a 0-hours observation period. he reciprocal relationship of the trend functions (t) and (t) becomes apparent in the plot of Fig.. Parameters were estimated from the data of 48 patients 5 persons died of a later sudden heart attack, while 33 survived. We used point process model (), Ex., with = and normalized i 's the mean cluster size m c is also calculated from the estimated parameters, according to a cluster process representation by Hawkes&Oakes (974) ( will be denoted by PP ) time series model (4), Ex.., employing S+,glm,quasi(link=log),dispersion estimated for equation (0), with standardized i 's, ( will here be denoted by S ) he able shows the tendency, that larger positive PP -values and smaller mean clustering sizes m c correspond to smaller S -values and smaller autocorrelations r^ (), where ^ i = i =^a( i; ) as in (0). Patient N PP m c S r^ () q^ ^h() ODJFZ OVXSBH PFQWGU OEMQL he data of the 48 patients, labelled by their status =survival or =death, were submitted to a binary logistic regression, with x = Sz, y = p m c ; andz = VES/h as regressors. Hereby, VES/h is the average rate of extrasystoles per hour, and Sz is the standard deviation of the time interval lengths between those two heart beats being qualied as normal beats (not being designated as extrasystoles), vgl. ab. of Pruscha, Ulm & Schmid (997). hen in the scatter diagram of Fig., the(x 0 y 0 ) values of the 48 patients were plotted, with x 0 y 0 being the x and y values, respectively, corrected by thevariable z = VES/h on the basis of the logistic regression equation. We have three distinct outlier cases from the group lying in the "area" of group. he two most extremes of them are also outliers in the evaluation system of nonlinear dynamics (Schmid et al, 996), where the variables x = sin and y = VES were used. With respect to the quality of prediction both systems perform nearly equally well. Acknowledgments his work is supported by the Deutsche Forschungsgemeinschaft (SFB 386). he ECG data were kindly placed at my disposal by K.Ulm(echnical University,Munich). he help of dipl. math. S. Siegle with respect to Fortran programming and S+ data analysis is thankfully acknowledged. 7

9 References Cox, D. R. & Lewis, P.A.W. (966). Methuen, London. he Statistical Analysis of Series of Events. Dahlhaus, R. (996). Maximum likelihood estimation and model selection for locally stationary processes. Nonparametric Statistics, 6, 7-9. Eubank, R. L. (988). Spline Smoothing and Nonparametric Regression. M. Decker, N.Y. Fahrmeir, L. & utz, G. (994). Multivariate Statistical Modelling Based on Generalized Linear Models. Springer-Verlag, N.Y. Green, P. J. & Silverman, B. W. (994). Nonparametric Regression and Generalized Linear Models. Chapman and Hall, London. Hawkes, A. G. (97). Spectra of some self-exciting point processes. Biometrika, 58, Hawkes, A.G. & Oakes, D. (974). A cluster process representation of a self-exciting process. J. Appl. Prob.,, Pruscha, H. (983). Learning models with continuous time parameter and multivariate point processes. J. Appl. Prob., 0, Pruscha, H. (997). Semiparametric estimation in regression models for point processes based on one realization. Proceedings of the th International Workshop on Statistical Modelling, Biel 997, Pruscha, H., Ulm, K. & Schmid, G. (997). Statistische Analyse des Einusses von Herzrythmus-Storungen auf das Mortalitatsrisiko. Discussion papers, 56, SFB 386 Munchen. Schmid, G., Morll, G. E. et al. (996). Variabilityof ventricular premature complexes and mortality risk. PACE, 9, Zeger, S. L. & Qaqish, B. (988). Markov regression models for time series: a quasi likelihood approach. Biometrics, 44, Legends to Figs. Fig. (above) Extrasystoles within a 0-hours ECG record of a patient (OVXSBH). he n = 74 time points of occurrence are marked on the time scale. he occurrence frequencies (triangles) and the mean waiting times (circles) were plotted over the j = ::: 0 hours, together with the estimated trend functions and, respectively. Fig. (below) Scatterplot of the variable Cluster Size over the variable Standard Deviation, both corrected by VES/h on the basis of a logistic regression equation, for 48 patients. 8

10 Extrasystoles in a 0-Hours ECG Interval Frequency (.0) (53) Inverse of Interval Frequency ime [h] Plot of 48 Patients Suffering from Cardiac Arrythmias ClusterSize corrected by VES/h = Survived (33) = Sudden Cardiac Death (5) StandDev corrected by VES/h 9

Pruscha: Semiparametric Estimation in Regression Models for Point Processes based on One Realization

Pruscha: Semiparametric Estimation in Regression Models for Point Processes based on One Realization Pruscha: Semiparametric Estimation in Regression Models for Point Processes based on One Realization Sonderforschungsbereich 386, Paper 66 (1997) Online unter: http://epub.ub.uni-muenchen.de/ Projektpartner

More information

Pruscha, Göttlein: Regression Analysis for Forest Inventory Data with Time and Space Dependencies

Pruscha, Göttlein: Regression Analysis for Forest Inventory Data with Time and Space Dependencies Pruscha, Göttlein: Regression Analysis for Forest Inventory Data with Time and Space Dependencies Sonderforschungsbereich 386, Paper 173 (1999) Online unter: http://epub.ub.uni-muenchen.de/ Projektpartner

More information

PENALIZED LIKELIHOOD PARAMETER ESTIMATION FOR ADDITIVE HAZARD MODELS WITH INTERVAL CENSORED DATA

PENALIZED LIKELIHOOD PARAMETER ESTIMATION FOR ADDITIVE HAZARD MODELS WITH INTERVAL CENSORED DATA PENALIZED LIKELIHOOD PARAMETER ESTIMATION FOR ADDITIVE HAZARD MODELS WITH INTERVAL CENSORED DATA Kasun Rathnayake ; A/Prof Jun Ma Department of Statistics Faculty of Science and Engineering Macquarie University

More information

Küchenhoff: An exact algorithm for estimating breakpoints in segmented generalized linear models

Küchenhoff: An exact algorithm for estimating breakpoints in segmented generalized linear models Küchenhoff: An exact algorithm for estimating breakpoints in segmented generalized linear models Sonderforschungsbereich 386, Paper 27 (996) Online unter: http://epububuni-muenchende/ Projektpartner An

More information

Fahrmeir: Discrete failure time models

Fahrmeir: Discrete failure time models Fahrmeir: Discrete failure time models Sonderforschungsbereich 386, Paper 91 (1997) Online unter: http://epub.ub.uni-muenchen.de/ Projektpartner Discrete failure time models Ludwig Fahrmeir, Universitat

More information

Modelling geoadditive survival data

Modelling geoadditive survival data Modelling geoadditive survival data Thomas Kneib & Ludwig Fahrmeir Department of Statistics, Ludwig-Maximilians-University Munich 1. Leukemia survival data 2. Structured hazard regression 3. Mixed model

More information

Regularization in Cox Frailty Models

Regularization in Cox Frailty Models Regularization in Cox Frailty Models Andreas Groll 1, Trevor Hastie 2, Gerhard Tutz 3 1 Ludwig-Maximilians-Universität Munich, Department of Mathematics, Theresienstraße 39, 80333 Munich, Germany 2 University

More information

Analysing geoadditive regression data: a mixed model approach

Analysing geoadditive regression data: a mixed model approach Analysing geoadditive regression data: a mixed model approach Institut für Statistik, Ludwig-Maximilians-Universität München Joint work with Ludwig Fahrmeir & Stefan Lang 25.11.2005 Spatio-temporal regression

More information

Analysis of competing risks data and simulation of data following predened subdistribution hazards

Analysis of competing risks data and simulation of data following predened subdistribution hazards Analysis of competing risks data and simulation of data following predened subdistribution hazards Bernhard Haller Institut für Medizinische Statistik und Epidemiologie Technische Universität München 27.05.2013

More information

Augustin: Some Basic Results on the Extension of Quasi-Likelihood Based Measurement Error Correction to Multivariate and Flexible Structural Models

Augustin: Some Basic Results on the Extension of Quasi-Likelihood Based Measurement Error Correction to Multivariate and Flexible Structural Models Augustin: Some Basic Results on the Extension of Quasi-Likelihood Based Measurement Error Correction to Multivariate and Flexible Structural Models Sonderforschungsbereich 386, Paper 196 (2000) Online

More information

Variable Selection for Generalized Additive Mixed Models by Likelihood-based Boosting

Variable Selection for Generalized Additive Mixed Models by Likelihood-based Boosting Variable Selection for Generalized Additive Mixed Models by Likelihood-based Boosting Andreas Groll 1 and Gerhard Tutz 2 1 Department of Statistics, University of Munich, Akademiestrasse 1, D-80799, Munich,

More information

Inversion Base Height. Daggot Pressure Gradient Visibility (miles)

Inversion Base Height. Daggot Pressure Gradient Visibility (miles) Stanford University June 2, 1998 Bayesian Backtting: 1 Bayesian Backtting Trevor Hastie Stanford University Rob Tibshirani University of Toronto Email: trevor@stat.stanford.edu Ftp: stat.stanford.edu:

More information

Fahrmeir: Recent Advances in Semiparametric Bayesian Function Estimation

Fahrmeir: Recent Advances in Semiparametric Bayesian Function Estimation Fahrmeir: Recent Advances in Semiparametric Bayesian Function Estimation Sonderforschungsbereich 386, Paper 137 (1998) Online unter: http://epub.ub.uni-muenchen.de/ Projektpartner Recent Advances in Semiparametric

More information

On prediction and density estimation Peter McCullagh University of Chicago December 2004

On prediction and density estimation Peter McCullagh University of Chicago December 2004 On prediction and density estimation Peter McCullagh University of Chicago December 2004 Summary Having observed the initial segment of a random sequence, subsequent values may be predicted by calculating

More information

A general mixed model approach for spatio-temporal regression data

A general mixed model approach for spatio-temporal regression data A general mixed model approach for spatio-temporal regression data Thomas Kneib, Ludwig Fahrmeir & Stefan Lang Department of Statistics, Ludwig-Maximilians-University Munich 1. Spatio-temporal regression

More information

Kneib, Fahrmeir: Supplement to "Structured additive regression for categorical space-time data: A mixed model approach"

Kneib, Fahrmeir: Supplement to Structured additive regression for categorical space-time data: A mixed model approach Kneib, Fahrmeir: Supplement to "Structured additive regression for categorical space-time data: A mixed model approach" Sonderforschungsbereich 386, Paper 43 (25) Online unter: http://epub.ub.uni-muenchen.de/

More information

Regression with Numerical Optimization. Logistic

Regression with Numerical Optimization. Logistic CSG220 Machine Learning Fall 2008 Regression with Numerical Optimization. Logistic regression Regression with Numerical Optimization. Logistic regression based on a document by Andrew Ng October 3, 204

More information

Frailty Models and Copulas: Similarities and Differences

Frailty Models and Copulas: Similarities and Differences Frailty Models and Copulas: Similarities and Differences KLARA GOETHALS, PAUL JANSSEN & LUC DUCHATEAU Department of Physiology and Biometrics, Ghent University, Belgium; Center for Statistics, Hasselt

More information

Generalized Additive Models

Generalized Additive Models By Trevor Hastie and R. Tibshirani Regression models play an important role in many applied settings, by enabling predictive analysis, revealing classification rules, and providing data-analytic tools

More information

Statistics 203: Introduction to Regression and Analysis of Variance Course review

Statistics 203: Introduction to Regression and Analysis of Variance Course review Statistics 203: Introduction to Regression and Analysis of Variance Course review Jonathan Taylor - p. 1/?? Today Review / overview of what we learned. - p. 2/?? General themes in regression models Specifying

More information

ECE521 week 3: 23/26 January 2017

ECE521 week 3: 23/26 January 2017 ECE521 week 3: 23/26 January 2017 Outline Probabilistic interpretation of linear regression - Maximum likelihood estimation (MLE) - Maximum a posteriori (MAP) estimation Bias-variance trade-off Linear

More information

Expressions for the covariance matrix of covariance data

Expressions for the covariance matrix of covariance data Expressions for the covariance matrix of covariance data Torsten Söderström Division of Systems and Control, Department of Information Technology, Uppsala University, P O Box 337, SE-7505 Uppsala, Sweden

More information

Consider Table 1 (Note connection to start-stop process).

Consider Table 1 (Note connection to start-stop process). Discrete-Time Data and Models Discretized duration data are still duration data! Consider Table 1 (Note connection to start-stop process). Table 1: Example of Discrete-Time Event History Data Case Event

More information

Using Estimating Equations for Spatially Correlated A

Using Estimating Equations for Spatially Correlated A Using Estimating Equations for Spatially Correlated Areal Data December 8, 2009 Introduction GEEs Spatial Estimating Equations Implementation Simulation Conclusion Typical Problem Assess the relationship

More information

Bias-corrected AIC for selecting variables in Poisson regression models

Bias-corrected AIC for selecting variables in Poisson regression models Bias-corrected AIC for selecting variables in Poisson regression models Ken-ichi Kamo (a), Hirokazu Yanagihara (b) and Kenichi Satoh (c) (a) Corresponding author: Department of Liberal Arts and Sciences,

More information

data lam=36.9 lam=6.69 lam=4.18 lam=2.92 lam=2.21 time max wavelength modulus of max wavelength cycle

data lam=36.9 lam=6.69 lam=4.18 lam=2.92 lam=2.21 time max wavelength modulus of max wavelength cycle AUTOREGRESSIVE LINEAR MODELS AR(1) MODELS The zero-mean AR(1) model x t = x t,1 + t is a linear regression of the current value of the time series on the previous value. For > 0 it generates positively

More information

Multi-state Models: An Overview

Multi-state Models: An Overview Multi-state Models: An Overview Andrew Titman Lancaster University 14 April 2016 Overview Introduction to multi-state modelling Examples of applications Continuously observed processes Intermittently observed

More information

Recap. Probability, stochastic processes, Markov chains. ELEC-C7210 Modeling and analysis of communication networks

Recap. Probability, stochastic processes, Markov chains. ELEC-C7210 Modeling and analysis of communication networks Recap Probability, stochastic processes, Markov chains ELEC-C7210 Modeling and analysis of communication networks 1 Recap: Probability theory important distributions Discrete distributions Geometric distribution

More information

Statistícal Methods for Spatial Data Analysis

Statistícal Methods for Spatial Data Analysis Texts in Statistícal Science Statistícal Methods for Spatial Data Analysis V- Oliver Schabenberger Carol A. Gotway PCT CHAPMAN & K Contents Preface xv 1 Introduction 1 1.1 The Need for Spatial Analysis

More information

Ch 4. Linear Models for Classification

Ch 4. Linear Models for Classification Ch 4. Linear Models for Classification Pattern Recognition and Machine Learning, C. M. Bishop, 2006. Department of Computer Science and Engineering Pohang University of Science and echnology 77 Cheongam-ro,

More information

Locally stationary Hawkes processes

Locally stationary Hawkes processes Locally stationary Hawkes processes François Roue LTCI, CNRS, Télécom ParisTech, Université Paris-Saclay Talk based on Roue, von Sachs, and Sansonnet [2016] 1 / 40 Outline Introduction Non-stationary Hawkes

More information

Multivariate Survival Data With Censoring.

Multivariate Survival Data With Censoring. 1 Multivariate Survival Data With Censoring. Shulamith Gross and Catherine Huber-Carol Baruch College of the City University of New York, Dept of Statistics and CIS, Box 11-220, 1 Baruch way, 10010 NY.

More information

Duration Analysis. Joan Llull

Duration Analysis. Joan Llull Duration Analysis Joan Llull Panel Data and Duration Models Barcelona GSE joan.llull [at] movebarcelona [dot] eu Introduction Duration Analysis 2 Duration analysis Duration data: how long has an individual

More information

Rejoinder. 1 Phase I and Phase II Profile Monitoring. Peihua Qiu 1, Changliang Zou 2 and Zhaojun Wang 2

Rejoinder. 1 Phase I and Phase II Profile Monitoring. Peihua Qiu 1, Changliang Zou 2 and Zhaojun Wang 2 Rejoinder Peihua Qiu 1, Changliang Zou 2 and Zhaojun Wang 2 1 School of Statistics, University of Minnesota 2 LPMC and Department of Statistics, Nankai University, China We thank the editor Professor David

More information

A NOTE ON ROBUST ESTIMATION IN LOGISTIC REGRESSION MODEL

A NOTE ON ROBUST ESTIMATION IN LOGISTIC REGRESSION MODEL Discussiones Mathematicae Probability and Statistics 36 206 43 5 doi:0.75/dmps.80 A NOTE ON ROBUST ESTIMATION IN LOGISTIC REGRESSION MODEL Tadeusz Bednarski Wroclaw University e-mail: t.bednarski@prawo.uni.wroc.pl

More information

* * * * * * * * * * * * * * * ** * **

* * * * * * * * * * * * * * * ** * ** Generalized Additive Models Trevor Hastie y and Robert Tibshirani z Department of Statistics and Division of Biostatistics Stanford University 12th May, 1995 Regression models play an important role in

More information

Part 8: GLMs and Hierarchical LMs and GLMs

Part 8: GLMs and Hierarchical LMs and GLMs Part 8: GLMs and Hierarchical LMs and GLMs 1 Example: Song sparrow reproductive success Arcese et al., (1992) provide data on a sample from a population of 52 female song sparrows studied over the course

More information

Charles E. McCulloch Biometrics Unit and Statistics Center Cornell University

Charles E. McCulloch Biometrics Unit and Statistics Center Cornell University A SURVEY OF VARIANCE COMPONENTS ESTIMATION FROM BINARY DATA by Charles E. McCulloch Biometrics Unit and Statistics Center Cornell University BU-1211-M May 1993 ABSTRACT The basic problem of variance components

More information

The classifier. Theorem. where the min is over all possible classifiers. To calculate the Bayes classifier/bayes risk, we need to know

The classifier. Theorem. where the min is over all possible classifiers. To calculate the Bayes classifier/bayes risk, we need to know The Bayes classifier Theorem The classifier satisfies where the min is over all possible classifiers. To calculate the Bayes classifier/bayes risk, we need to know Alternatively, since the maximum it is

More information

The classifier. Linear discriminant analysis (LDA) Example. Challenges for LDA

The classifier. Linear discriminant analysis (LDA) Example. Challenges for LDA The Bayes classifier Linear discriminant analysis (LDA) Theorem The classifier satisfies In linear discriminant analysis (LDA), we make the (strong) assumption that where the min is over all possible classifiers.

More information

13 Endogeneity and Nonparametric IV

13 Endogeneity and Nonparametric IV 13 Endogeneity and Nonparametric IV 13.1 Nonparametric Endogeneity A nonparametric IV equation is Y i = g (X i ) + e i (1) E (e i j i ) = 0 In this model, some elements of X i are potentially endogenous,

More information

Classification. Chapter Introduction. 6.2 The Bayes classifier

Classification. Chapter Introduction. 6.2 The Bayes classifier Chapter 6 Classification 6.1 Introduction Often encountered in applications is the situation where the response variable Y takes values in a finite set of labels. For example, the response Y could encode

More information

Non-maximum likelihood estimation and statistical inference for linear and nonlinear mixed models

Non-maximum likelihood estimation and statistical inference for linear and nonlinear mixed models Optimum Design for Mixed Effects Non-Linear and generalized Linear Models Cambridge, August 9-12, 2011 Non-maximum likelihood estimation and statistical inference for linear and nonlinear mixed models

More information

Index. Regression Models for Time Series Analysis. Benjamin Kedem, Konstantinos Fokianos Copyright John Wiley & Sons, Inc. ISBN.

Index. Regression Models for Time Series Analysis. Benjamin Kedem, Konstantinos Fokianos Copyright John Wiley & Sons, Inc. ISBN. Regression Models for Time Series Analysis. Benjamin Kedem, Konstantinos Fokianos Copyright 0 2002 John Wiley & Sons, Inc. ISBN. 0-471-36355-3 Index Adaptive rejection sampling, 233 Adjacent categories

More information

Handbook of Stochastic Methods

Handbook of Stochastic Methods Springer Series in Synergetics 13 Handbook of Stochastic Methods for Physics, Chemistry and the Natural Sciences von Crispin W Gardiner Neuausgabe Handbook of Stochastic Methods Gardiner schnell und portofrei

More information

Pairwise rank based likelihood for estimating the relationship between two homogeneous populations and their mixture proportion

Pairwise rank based likelihood for estimating the relationship between two homogeneous populations and their mixture proportion Pairwise rank based likelihood for estimating the relationship between two homogeneous populations and their mixture proportion Glenn Heller and Jing Qin Department of Epidemiology and Biostatistics Memorial

More information

Statistical Data Mining and Machine Learning Hilary Term 2016

Statistical Data Mining and Machine Learning Hilary Term 2016 Statistical Data Mining and Machine Learning Hilary Term 2016 Dino Sejdinovic Department of Statistics Oxford Slides and other materials available at: http://www.stats.ox.ac.uk/~sejdinov/sdmml Naïve Bayes

More information

Comments on \Wavelets in Statistics: A Review" by. A. Antoniadis. Jianqing Fan. University of North Carolina, Chapel Hill

Comments on \Wavelets in Statistics: A Review by. A. Antoniadis. Jianqing Fan. University of North Carolina, Chapel Hill Comments on \Wavelets in Statistics: A Review" by A. Antoniadis Jianqing Fan University of North Carolina, Chapel Hill and University of California, Los Angeles I would like to congratulate Professor Antoniadis

More information

Chapter 2 SOME ANALYTICAL TOOLS USED IN THE THESIS

Chapter 2 SOME ANALYTICAL TOOLS USED IN THE THESIS Chapter 2 SOME ANALYTICAL TOOLS USED IN THE THESIS 63 2.1 Introduction In this chapter we describe the analytical tools used in this thesis. They are Markov Decision Processes(MDP), Markov Renewal process

More information

One important issue in the study of queueing systems is to characterize departure processes. Study on departure processes was rst initiated by Burke (

One important issue in the study of queueing systems is to characterize departure processes. Study on departure processes was rst initiated by Burke ( The Departure Process of the GI/G/ Queue and Its MacLaurin Series Jian-Qiang Hu Department of Manufacturing Engineering Boston University 5 St. Mary's Street Brookline, MA 2446 Email: hqiang@bu.edu June

More information

STA 4273H: Statistical Machine Learning

STA 4273H: Statistical Machine Learning STA 4273H: Statistical Machine Learning Russ Salakhutdinov Department of Statistics! rsalakhu@utstat.toronto.edu! http://www.utstat.utoronto.ca/~rsalakhu/ Sidney Smith Hall, Room 6002 Lecture 3 Linear

More information

BIAS OF MAXIMUM-LIKELIHOOD ESTIMATES IN LOGISTIC AND COX REGRESSION MODELS: A COMPARATIVE SIMULATION STUDY

BIAS OF MAXIMUM-LIKELIHOOD ESTIMATES IN LOGISTIC AND COX REGRESSION MODELS: A COMPARATIVE SIMULATION STUDY BIAS OF MAXIMUM-LIKELIHOOD ESTIMATES IN LOGISTIC AND COX REGRESSION MODELS: A COMPARATIVE SIMULATION STUDY Ingo Langner 1, Ralf Bender 2, Rebecca Lenz-Tönjes 1, Helmut Küchenhoff 2, Maria Blettner 2 1

More information

Semi-parametric estimation of non-stationary Pickands functions

Semi-parametric estimation of non-stationary Pickands functions Semi-parametric estimation of non-stationary Pickands functions Linda Mhalla 1 Joint work with: Valérie Chavez-Demoulin 2 and Philippe Naveau 3 1 Geneva School of Economics and Management, University of

More information

High-dimensional Markov Chain Models for Categorical Data Sequences with Applications Wai-Ki CHING AMACL, Department of Mathematics HKU 19 March 2013

High-dimensional Markov Chain Models for Categorical Data Sequences with Applications Wai-Ki CHING AMACL, Department of Mathematics HKU 19 March 2013 High-dimensional Markov Chain Models for Categorical Data Sequences with Applications Wai-Ki CHING AMACL, Department of Mathematics HKU 19 March 2013 Abstract: Markov chains are popular models for a modelling

More information

LOGISTIC REGRESSION Joseph M. Hilbe

LOGISTIC REGRESSION Joseph M. Hilbe LOGISTIC REGRESSION Joseph M. Hilbe Arizona State University Logistic regression is the most common method used to model binary response data. When the response is binary, it typically takes the form of

More information

Machine Learning Lecture 5

Machine Learning Lecture 5 Machine Learning Lecture 5 Linear Discriminant Functions 26.10.2017 Bastian Leibe RWTH Aachen http://www.vision.rwth-aachen.de leibe@vision.rwth-aachen.de Course Outline Fundamentals Bayes Decision Theory

More information

STA 216: GENERALIZED LINEAR MODELS. Lecture 1. Review and Introduction. Much of statistics is based on the assumption that random

STA 216: GENERALIZED LINEAR MODELS. Lecture 1. Review and Introduction. Much of statistics is based on the assumption that random STA 216: GENERALIZED LINEAR MODELS Lecture 1. Review and Introduction Much of statistics is based on the assumption that random variables are continuous & normally distributed. Normal linear regression

More information

A comparison study of the nonparametric tests based on the empirical distributions

A comparison study of the nonparametric tests based on the empirical distributions 통계연구 (2015), 제 20 권제 3 호, 1-12 A comparison study of the nonparametric tests based on the empirical distributions Hyo-Il Park 1) Abstract In this study, we propose a nonparametric test based on the empirical

More information

You know I m not goin diss you on the internet Cause my mama taught me better than that I m a survivor (What?) I m not goin give up (What?

You know I m not goin diss you on the internet Cause my mama taught me better than that I m a survivor (What?) I m not goin give up (What? You know I m not goin diss you on the internet Cause my mama taught me better than that I m a survivor (What?) I m not goin give up (What?) I m not goin stop (What?) I m goin work harder (What?) Sir David

More information

( t) Cox regression part 2. Outline: Recapitulation. Estimation of cumulative hazards and survival probabilites. Ørnulf Borgan

( t) Cox regression part 2. Outline: Recapitulation. Estimation of cumulative hazards and survival probabilites. Ørnulf Borgan Outline: Cox regression part 2 Ørnulf Borgan Department of Mathematics University of Oslo Recapitulation Estimation of cumulative hazards and survival probabilites Assumptions for Cox regression and check

More information

GROUPED SURVIVAL DATA. Florida State University and Medical College of Wisconsin

GROUPED SURVIVAL DATA. Florida State University and Medical College of Wisconsin FITTING COX'S PROPORTIONAL HAZARDS MODEL USING GROUPED SURVIVAL DATA Ian W. McKeague and Mei-Jie Zhang Florida State University and Medical College of Wisconsin Cox's proportional hazard model is often

More information

FULL LIKELIHOOD INFERENCES IN THE COX MODEL

FULL LIKELIHOOD INFERENCES IN THE COX MODEL October 20, 2007 FULL LIKELIHOOD INFERENCES IN THE COX MODEL BY JIAN-JIAN REN 1 AND MAI ZHOU 2 University of Central Florida and University of Kentucky Abstract We use the empirical likelihood approach

More information

Introduction to Econometrics

Introduction to Econometrics Introduction to Econometrics T H I R D E D I T I O N Global Edition James H. Stock Harvard University Mark W. Watson Princeton University Boston Columbus Indianapolis New York San Francisco Upper Saddle

More information

Gaussian Processes. Le Song. Machine Learning II: Advanced Topics CSE 8803ML, Spring 2012

Gaussian Processes. Le Song. Machine Learning II: Advanced Topics CSE 8803ML, Spring 2012 Gaussian Processes Le Song Machine Learning II: Advanced Topics CSE 8803ML, Spring 01 Pictorial view of embedding distribution Transform the entire distribution to expected features Feature space Feature

More information

Estimation of cumulative distribution function with spline functions

Estimation of cumulative distribution function with spline functions INTERNATIONAL JOURNAL OF ECONOMICS AND STATISTICS Volume 5, 017 Estimation of cumulative distribution function with functions Akhlitdin Nizamitdinov, Aladdin Shamilov Abstract The estimation of the cumulative

More information

Spectra of some self-exciting and mutually exciting point processes

Spectra of some self-exciting and mutually exciting point processes Biometrika (1971), 58, 1, p. 83 83 Printed in Great Britain Spectra of some self-exciting and mutually exciting point processes BY ALAN G. HAWKES University of Durham SUMMAKY In recent years methods of

More information

Approximation of Survival Function by Taylor Series for General Partly Interval Censored Data

Approximation of Survival Function by Taylor Series for General Partly Interval Censored Data Malaysian Journal of Mathematical Sciences 11(3): 33 315 (217) MALAYSIAN JOURNAL OF MATHEMATICAL SCIENCES Journal homepage: http://einspem.upm.edu.my/journal Approximation of Survival Function by Taylor

More information

Introduction to Machine Learning Midterm, Tues April 8

Introduction to Machine Learning Midterm, Tues April 8 Introduction to Machine Learning 10-701 Midterm, Tues April 8 [1 point] Name: Andrew ID: Instructions: You are allowed a (two-sided) sheet of notes. Exam ends at 2:45pm Take a deep breath and don t spend

More information

Toutenburg, Nittner: The Classical Linear Regression Model with one Incomplete Binary Variable

Toutenburg, Nittner: The Classical Linear Regression Model with one Incomplete Binary Variable Toutenburg, Nittner: The Classical Linear Regression Model with one Incomplete Binary Variable Sonderforschungsbereich 386, Paper 178 (1999) Online unter: http://epub.ub.uni-muenchen.de/ Projektpartner

More information

Modelling Non-linear and Non-stationary Time Series

Modelling Non-linear and Non-stationary Time Series Modelling Non-linear and Non-stationary Time Series Chapter 2: Non-parametric methods Henrik Madsen Advanced Time Series Analysis September 206 Henrik Madsen (02427 Adv. TS Analysis) Lecture Notes September

More information

Point processes, temporal

Point processes, temporal Point processes, temporal David R. Brillinger, Peter M. Guttorp & Frederic Paik Schoenberg Volume 3, pp 1577 1581 in Encyclopedia of Environmetrics (ISBN 0471 899976) Edited by Abdel H. El-Shaarawi and

More information

Problems. HW problem 5.7 Math 504. Spring CSUF by Nasser Abbasi

Problems. HW problem 5.7 Math 504. Spring CSUF by Nasser Abbasi Problems HW problem 5.7 Math 504. Spring 2008. CSUF by Nasser Abbasi 1 Problem 6.3 Part(A) Let I n be an indicator variable de ned as 1 when (n = jj I n = 0 = i) 0 otherwise Hence Now we see that E (V

More information

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at On the Estimation of the Intensity Function of a Stationary Point Process Author(s): D. R. Cox Source: Journal of the Royal Statistical Society. Series B (Methodological), Vol. 27, No. 2 (1965), pp. 332-337

More information

Frontier estimation based on extreme risk measures

Frontier estimation based on extreme risk measures Frontier estimation based on extreme risk measures by Jonathan EL METHNI in collaboration with Ste phane GIRARD & Laurent GARDES CMStatistics 2016 University of Seville December 2016 1 Risk measures 2

More information

Toutenburg, Fieger: Using diagnostic measures to detect non-mcar processes in linear regression models with missing covariates

Toutenburg, Fieger: Using diagnostic measures to detect non-mcar processes in linear regression models with missing covariates Toutenburg, Fieger: Using diagnostic measures to detect non-mcar processes in linear regression models with missing covariates Sonderforschungsbereich 386, Paper 24 (2) Online unter: http://epub.ub.uni-muenchen.de/

More information

Improving Efficiency of Inferences in Randomized Clinical Trials Using Auxiliary Covariates

Improving Efficiency of Inferences in Randomized Clinical Trials Using Auxiliary Covariates Improving Efficiency of Inferences in Randomized Clinical Trials Using Auxiliary Covariates Anastasios (Butch) Tsiatis Department of Statistics North Carolina State University http://www.stat.ncsu.edu/

More information

A COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky

A COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky A COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky Empirical likelihood with right censored data were studied by Thomas and Grunkmier (1975), Li (1995),

More information

Lecture 7 Time-dependent Covariates in Cox Regression

Lecture 7 Time-dependent Covariates in Cox Regression Lecture 7 Time-dependent Covariates in Cox Regression So far, we ve been considering the following Cox PH model: λ(t Z) = λ 0 (t) exp(β Z) = λ 0 (t) exp( β j Z j ) where β j is the parameter for the the

More information

Bayesian Estimation and Inference for the Generalized Partial Linear Model

Bayesian Estimation and Inference for the Generalized Partial Linear Model Bayesian Estimation Inference for the Generalized Partial Linear Model Haitham M. Yousof 1, Ahmed M. Gad 2 1 Department of Statistics, Mathematics Insurance, Benha University, Egypt. 2 Department of Statistics,

More information

Warm up. Regrade requests submitted directly in Gradescope, do not instructors.

Warm up. Regrade requests submitted directly in Gradescope, do not  instructors. Warm up Regrade requests submitted directly in Gradescope, do not email instructors. 1 float in NumPy = 8 bytes 10 6 2 20 bytes = 1 MB 10 9 2 30 bytes = 1 GB For each block compute the memory required

More information

Efficient Estimation for the Partially Linear Models with Random Effects

Efficient Estimation for the Partially Linear Models with Random Effects A^VÇÚO 1 33 ò 1 5 Ï 2017 c 10 Chinese Journal of Applied Probability and Statistics Oct., 2017, Vol. 33, No. 5, pp. 529-537 doi: 10.3969/j.issn.1001-4268.2017.05.009 Efficient Estimation for the Partially

More information

Machine Learning Lecture 7

Machine Learning Lecture 7 Course Outline Machine Learning Lecture 7 Fundamentals (2 weeks) Bayes Decision Theory Probability Density Estimation Statistical Learning Theory 23.05.2016 Discriminative Approaches (5 weeks) Linear Discriminant

More information

Chapter 2 Inference on Mean Residual Life-Overview

Chapter 2 Inference on Mean Residual Life-Overview Chapter 2 Inference on Mean Residual Life-Overview Statistical inference based on the remaining lifetimes would be intuitively more appealing than the popular hazard function defined as the risk of immediate

More information

Warm up: risk prediction with logistic regression

Warm up: risk prediction with logistic regression Warm up: risk prediction with logistic regression Boss gives you a bunch of data on loans defaulting or not: {(x i,y i )} n i= x i 2 R d, y i 2 {, } You model the data as: P (Y = y x, w) = + exp( yw T

More information

INRIA Rh^one-Alpes. Abstract. Friedman (1989) has proposed a regularization technique (RDA) of discriminant analysis

INRIA Rh^one-Alpes. Abstract. Friedman (1989) has proposed a regularization technique (RDA) of discriminant analysis Regularized Gaussian Discriminant Analysis through Eigenvalue Decomposition Halima Bensmail Universite Paris 6 Gilles Celeux INRIA Rh^one-Alpes Abstract Friedman (1989) has proposed a regularization technique

More information

Outline of GLMs. Definitions

Outline of GLMs. Definitions Outline of GLMs Definitions This is a short outline of GLM details, adapted from the book Nonparametric Regression and Generalized Linear Models, by Green and Silverman. The responses Y i have density

More information

Fahrmeir, Pritscher: Regression analysis of forest damage by marginal models for correlated ordinal responses

Fahrmeir, Pritscher: Regression analysis of forest damage by marginal models for correlated ordinal responses Fahrmeir, Pritscher: Regression analysis of forest damage by marginal models for correlated ordinal responses Sonderforschungsbereich 386, Paper 9 (1995) Online unter: http://epub.ub.uni-muenchen.de/ Projektpartner

More information

Generalized Linear Models (GLZ)

Generalized Linear Models (GLZ) Generalized Linear Models (GLZ) Generalized Linear Models (GLZ) are an extension of the linear modeling process that allows models to be fit to data that follow probability distributions other than the

More information

Introduction to the Generalized Linear Model: Logistic regression and Poisson regression

Introduction to the Generalized Linear Model: Logistic regression and Poisson regression Introduction to the Generalized Linear Model: Logistic regression and Poisson regression Statistical modelling: Theory and practice Gilles Guillot gigu@dtu.dk November 4, 2013 Gilles Guillot (gigu@dtu.dk)

More information

FINITE MIXTURE DISTRIBUTIONS

FINITE MIXTURE DISTRIBUTIONS MONOGRAPHS ON APPLl~[) PROBABILITY AND STATISTICS FINITE MIXTURE DISTRIBUTIONS MONOGRAPHS ON APPLIED PROBABILITY AND STATISTICS General Editor D.R. COX, FRS Also available in the series Probability, Statistics

More information

Piotr Majer Risk Patterns and Correlated Brain Activities

Piotr Majer Risk Patterns and Correlated Brain Activities Alena My²i ková Piotr Majer Song Song Alena Myšičková Peter N. C. Mohr Peter N. C. Mohr Wolfgang K. Härdle Song Song Hauke R. Heekeren Wolfgang K. Härdle Hauke R. Heekeren C.A.S.E. Centre C.A.S.E. for

More information

Exponential martingales: uniform integrability results and applications to point processes

Exponential martingales: uniform integrability results and applications to point processes Exponential martingales: uniform integrability results and applications to point processes Alexander Sokol Department of Mathematical Sciences, University of Copenhagen 26 September, 2012 1 / 39 Agenda

More information

Currie, Iain Heriot-Watt University, Department of Actuarial Mathematics & Statistics Edinburgh EH14 4AS, UK

Currie, Iain Heriot-Watt University, Department of Actuarial Mathematics & Statistics Edinburgh EH14 4AS, UK An Introduction to Generalized Linear Array Models Currie, Iain Heriot-Watt University, Department of Actuarial Mathematics & Statistics Edinburgh EH14 4AS, UK E-mail: I.D.Currie@hw.ac.uk 1 Motivating

More information

Markov Chains. Arnoldo Frigessi Bernd Heidergott November 4, 2015

Markov Chains. Arnoldo Frigessi Bernd Heidergott November 4, 2015 Markov Chains Arnoldo Frigessi Bernd Heidergott November 4, 2015 1 Introduction Markov chains are stochastic models which play an important role in many applications in areas as diverse as biology, finance,

More information

Generalized Linear Models

Generalized Linear Models Generalized Linear Models David Rosenberg New York University April 12, 2015 David Rosenberg (New York University) DS-GA 1003 April 12, 2015 1 / 20 Conditional Gaussian Regression Gaussian Regression Input

More information

Markovian Combination of Decomposable Model Structures: MCMoSt

Markovian Combination of Decomposable Model Structures: MCMoSt Markovian Combination 1/45 London Math Society Durham Symposium on Mathematical Aspects of Graphical Models. June 30 - July, 2008 Markovian Combination of Decomposable Model Structures: MCMoSt Sung-Ho

More information

High-Throughput Sequencing Course

High-Throughput Sequencing Course High-Throughput Sequencing Course DESeq Model for RNA-Seq Biostatistics and Bioinformatics Summer 2017 Outline Review: Standard linear regression model (e.g., to model gene expression as function of an

More information

Modelling Survival Data using Generalized Additive Models with Flexible Link

Modelling Survival Data using Generalized Additive Models with Flexible Link Modelling Survival Data using Generalized Additive Models with Flexible Link Ana L. Papoila 1 and Cristina S. Rocha 2 1 Faculdade de Ciências Médicas, Dep. de Bioestatística e Informática, Universidade

More information

1 The problem of survival analysis

1 The problem of survival analysis 1 The problem of survival analysis Survival analysis concerns analyzing the time to the occurrence of an event. For instance, we have a dataset in which the times are 1, 5, 9, 20, and 22. Perhaps those

More information