On Model Fitting Procedures for Inhomogeneous Neyman-Scott Processes

Size: px
Start display at page:

Download "On Model Fitting Procedures for Inhomogeneous Neyman-Scott Processes"

Transcription

1 On Model Fitting Procedures for Inhomogeneous Neyman-Scott Processes Yongtao Guan July 31, 2006 ABSTRACT In this paper we study computationally efficient procedures to estimate the second-order parameters for a class of inhomogeneous Neyman-Scott processes proposed by Waagepetersen (2006). Specifically, we consider three different estimation procedures: a minimum contrast estimation procedure (MCEP) using the K-function, an MCEP using the pair correlation function, and a procedure based on composite likelihood. We give recommendations on how to select the tuning parameters involved in each of these three estimation procedures. We also discuss about which procedure to use based on some preexaminations of the data at hands and the goal of the study. KEY WORDS: Composite Likelihood, Inhomogeneous Neyman-Scott Process, Minimum Contrast Estimation, Pair Correlation Function. Yongtao Guan is Assistant Professor, Division of Biostatistics, Yale School of Public Health, Yale University, New Haven, CT , yongtao.guan@yale.edu. This research was supported by National Science Foundation grant DMS

2 1. INTRODUCTION Many of the emerging spatial point pattern data are inhomogeneous in nature (e.g., Diggle 2003). Conventional point process models often have difficulty with modeling such data since they were developed under stationarity (i.e., homogeneity). To solve the problem, a large number of new models have been recently proposed, where the majority belong to the general class of Gibbs point process models (e.g., Stoyan and Stoyan 1998, Nielsen and Jensen 2004). Although they are flexible for modeling repulsive spatial interactions, Gibbs point process models often are not appropriate for attractive point patterns, as pointed out by Møller and Waagepetersen (2006). Waagepetersen (2006) proposed a new class of inhomogeneous Neyman-Scott process (INSP) models that allow attractive interactions between events. This class of models is analytically simple yet practically sensible for modeling attractive point patterns especially those arising from ecological studies. In addition, statistical properties for estimators of the first-order structure (i.e., the inhomogeneous parameters) have been developed, which makes inference on these parameters possible. In this paper, we discuss procedures that can be used to estimate the second-order parameters (SOPs) of an INSP, a subject that has not been well studied in literature. The estimation of the SOPs is important for two reasons: 1) they can provide insights on how events interact with each other such as the interaction range and strength, which are often of great interest by themselves, and 2) they are critical for a correct inference on the firstorder parameters (FOPs) since variances of these estimates depend on the SOPs. We will consider three procedures to estimate the SOPs. The first is a minimum contrast estimation procedure (MCEP) based on the K-function as given in Waagepetersen (2006); the second is an MCEP based on the pair correlation function (PCF); and the last is an estimation equation approach based on composite likelihood (CL). It will be seen from Section 3 that each of these three procedures involves at least one unknown tuning parameter. The 1

3 choice of these tuning parameters is critical for the accuracy of the resulting estimates. The main objective of this paper is to give data-driven procedures (whenever available) and/or empirical evidences that are obtained from simulations on how to select these parameters. The remainder of this article is organized as follows. In Section 2 we give some background on INSP. We discuss the aforementioned three estimation procedures in Section 3 and assess the performance of the procedures through simulations in Section 4. The conclusion of the paper is given in Section 5. Some technical details are given in the Appendix. 2. INHOMOGENEOUS NEYMAN-SCOTT PROCESSES Consider a two-dimensional homogeneous spatial Poisson process whose first-order intensity is equal to ρ. Each event of the process is considered as a parent which will in turn generate a Poisson number of offspring with an expected value equal to µ. Conditional on the location of a parent, the offspring are dispersed independently following some common probability density function (pdf). For any location s, let X(s) be a p 1 vector of covariates that are recorded at this location. An offspring at s is retained with a probability exp[x(s) T β]/m, where M = max { exp[x(s) T β] } and β is a p 1 vector of unknown parameters. The resulting offspring process then forms an INSP which will be subsequently denoted by N here and henceforth. From the definition of an INSP, we see that the covariates, which often reflect conditions of the local environment, control the survival rate of an offspring at a given location. Another perspective to understand this formulation is that the initial offspring process is thinned by some thinning probabilities which depend on the covariates process X (Waagepetersen 2006). A nice property about INSPs is that their summary functions are often available in closed forms. For example, the first- and second-order intensity functions of the process are given as follows λ(s) = ρµ exp[x(s) T β]/m, (1) 2

4 λ(s 1, s 2 ) = λ(s 1 )λ(s 2 )g(s 1, s 2 ; θ), (2) respectively, where g(s 1, s 2 ; θ) is the PCF (e.g., Stoyan and Stoyan 1994) for the unthinned offspring process depending some unknown parameter vector θ. For example, if the pdf for an offspring location relative to its parent is a bivariate radially symmetric normal distribution (e.g., Diggle 2003), then g(s 1, s 2 ; θ) = 1 + exp[ s 1 s 2 2 /(4σ 2 )]/(4πρσ 2 ), where σ 2 is the variance parameter associated with the pdf and is the Euclidean norm. We call θ as the SOPs since it affects only the second-order structure of the process. Note that for the above example, θ = (ρ, σ 2 ). Throughout the remainder of this article, we will assume that g(s 1, s 2 ; θ) = g( s 1 s 2, θ), i.e., the PCF is isotropic. The FOPs β are often estimated by an estimation equation approach. Let β 0 = ρµ/m and let D be the domain of interest where a realization of N is observed. Waagpetersen (2006) proposed to estimate β by solving u(β) = s N D [ log(β0 ) + X(s) T β ] D β 0 exp[x(s) T β]ds = 0. (3) An alternative approach to estimate β is to solve the following estimation equation u(β) = s N D [ { }] X(s) T β log exp[x(s) T β]ds = 0. (4) D Our simulation experience suggests that (3) and (4) often give nearly identical estimates, although it appears that (4) may be preferable since it eliminates the need to estimate β 0. The statistical properties of the resulting estimator from (3) are available in Waagpetersen (2006). It should be emphasized here that although neither (3) or (4) requires the estimation of the SOPs, the inference on the FOPs does since their distributions depend on the true SOPs. 3

5 3. PROCEDURES TO ESTIMATE THE SOPS 3.1 MCEP based on the K-function Throughout the remainder of the paper, let ˆβ 0 and ˆβ denote an estimate of β 0 and β by solving either (3) or (4). Following Waagepetersen (2006), we define the empirical K- function for an INSP as follows ˆK(t) = s 1,s 2 N D I(0 < s 1 s 2 t) e(s 1, s 2 ) ( ˆβ 0 ) 2 exp{[x(s 1 ) T + X(s 2 ) T ] ˆβ}, where I( ) is an indicator function and e(s 1, s 2 ) is an edge correction term. For the various forms of e(s 1, s 2 ), see Diggle (2003) and Stoyan and Stoyan (1994). To estimate θ, the empirical K-function is often compared with the theoretical K- function K(t; θ) = t ug(u; θ)du through the following discrepancy measure 0 U K (θ) = r 0 {[ ˆK(t)] c [K(t; θ)] c } 2 dt. (5) The minimizer of U K over θ gives the estimated model parameters, ˆθ. The two additional parameters c and r in (5) are called tuning parameters and need to be preselected. Diggle (2003) suggested using c =.25 or smaller for attractive point patterns in the homogeneous case. Guan and Sherman (2006) found that this recommendation was questionable since using c = 1 with an appropriate choice of r often outperformed using c =.25. In the INSP case, Waagepetersen (2006) used c =.25. To the author s best knowledge, no specific recommendation has been given on the choice of r. Although the choice of c and r may be less of an issue if the sample size is very large, for data with a moderate sample size (say 200 points), it is often very important. We intend to give some recommendations on the choice of these parameters based on the results from our simulation study in Section 4. 4

6 3.2 MCEP based on the PCF The MCEP discussed in the previous section can be extended to the case of using the PCF. Specifically, we may define the following discrepancy measure that is analogous to (5) U g (θ) = r 0 {[ĝ(t; h)] c [g(t; θ)] c } 2 dt, (6) where ĝ(t; h) is an estimate of g(t; θ) by nonparametric smoothing and h is the bandwidth being used to obtain ĝ(t; h). Let k( ) denote a kernel function. We define ĝ(t; h) as ĝ(t; h) = 1 2π s 1,s 2 N D I(s 1 s 2 )k[(t s 1 s 2 )/h] e(s 1, s 2 ) ( ˆβ 0 ) 2 exp{[x(s 1 ) T + X(s 2 ) T ] ˆβ} s 1 s 2 h. (7) Note that there is one more parameter h that has been introduced in (6) as compared to (5). We will discuss how to select h at a later time. One difficulty with using (6) is that unlike the K-function case, there is no general recommendation available on how to select c. It certainly would be desirable if we can eliminate the need to select c. For this purpose, we will extend the method of Guan (2006a) in the homogeneous case to the current setting. Specifically, we derive the asymptotic variance of ĝ(t; h) and obtain an estimate of the variance based on its asymptotic expression. From the results that are detailed in the Appendix, we see that by ignoring some multiplicative constant, V ar[ĝ(t; h)] g(t; θ) t D 2π 0 1 ( ˆβ dψds, (8) 0 ) 2 exp[{x(s) T + X[s + u(t, ψ)] T } ˆβ] where u(t, ψ) = [t cos(ψ), t sin(ψ)]. Assume that r is relatively small compared to the size of D and that X( ) is smooth enough such that X[s + u(t 1, ψ)] X[s + u(t 2, ψ)] for all t 1, t 2 r. Then (8) may be simplified as V ar[ĝ(t; h)] g(t; θ)/t. In light of the above result, we assign w(t) = t/ĝ(t; h) and modify (6) as follows U g (θ) = r 0 w(t) [ĝ(t; h) g(t; θ)] 2 dt, (9) 5

7 Note that the use of the power transformation c in both (5) and (6) is intended to control the sampling fluctuation associated with the estimate of the respective summary function. In (9), this is achieved through the use of the weight function w(t). An advantage of using (9) over (6) is there is one fewer tuning parameter to be selected. To use (9), it is also important to select the bandwidth h, preferably by some data-driven methods. In this section, we discuss one method based on least-squares cross validation. In what follows, let r h be a preselected constant and ĝ (s 1,s 2 ) (t; h) denote the estimate of g(t; θ) by deleting s 1 and s 2 from the observed events. Define D D u = {s : s D, s + u D}. Guan (2006b) proposed to select h by minimizing the following criterion M(h) = [ĝ( u ; h)] 2 du u r h 2 I(0 < s 1 s 2 r h )ĝ (s 1,s 2 ) ( s 1 s 2 ; h) ( ˆβ. (10) 0 ) 2 s 1,s 2 D N D D s 1 + s 2 exp{[x(s 1 ) T + X(s 2 ) T ] ˆβ} In the asymptotic sense, Guan (2006b) showed that minimizing M(h) was equivalent to minimizing R(h) = u r h [ĝ( u ; h) g( u ; θ)] 2 du under some mild regulatory conditions. Note that to minimize (10), we need the value of r h. Our numerical experience is that setting r h to be slightly larger than the range of the process (i.e., the distance beyond which the dependence between events dies out) often works fine. Only a rough estimate of the range is necessary. This can be obtained by examining the empirical PCF plot using a pilot bandwidth or from a fitted model using a different procedure, say an MCEP using the K-function. 3.3 Estimation procedures based on CL The estimation procedures to be discussed in this section are motivated by the recent work of Guan (2006a) and Guan (2006c). Let W (s 1, s 2 ) denote an appropriately selected weight function. Guan (2006c) proposed to estimate θ for a homogeneous spatial point process by 6

8 maximizing the following CL criterion s 1,s 2 D N { [ W (s 1, s 2 ) log[λ 2 (s 1, s 2 ; θ)] log W (u, v)λ 2 (u, v; θ)dudv] }. (11) Here the integrations in (11) are both over D. For the weight function, Guan (2006c) recommended using W (s 1, s 2 ) = I(0 < s 1 s 2 r c )g( s 1 s 2 ; ˆθ) for attractive point patterns, where r c is a preselected constant and ˆθ is an estimate of θ. We note that (11) is a CL criterion because each term in the summations is a legitimate (log) likelihood due to the fact that f(s 1, s 2 ; θ) = λ 2 (s 1, s 2 ; θ) λ2 (u, v; θ)dudv (12) is the pdf for two arbitrary events in D N to be at s 1 and s 2. In another word, the criterion given by (11) simply sums up all the pairwise (log) likelihoods. For INSP processes, (11) can still be used in principle since (12) does not depend on homogeneity and thus is still a valid pdf. However, the calculation of the integrals in (11) may be computationally cumbersome due to the introduction of inhomogeneity. To solve this problem, we assign W (s 1, s 2 ) = g ( s 1 s 2 ; ˆθ) ( ˆβ 0 ) 2 exp{[x(s 1 ) T + X(s 2 ) T ] ˆβ} for some isotropic function g ( ), e.g., g (u) = I(0 < u r c )g(u; ˆθ). Then λ 2 (u, v; θ)dudv g ( u v ; ˆθ)g( u v ; θ)dudv, assuming that ˆβ 0 and ˆβ are close to β 0 and β, respectively. The fact that both g and g are isotropic will greatly simply the computational effort. In particular, the computationally efficient algorithms in Guan (2006c) can be applied without difficulty. Another use of criterion (10) is to select the bandwidth h in (7). To see this, first define W (s 1, s 2 ) = g ( s 1 s 2 ; ˆθ) ( ˆβ 0 ) 2 exp{[x(s 1 ) T + X(s 2 ) T ] ˆβ} 1 D D s 1 + s 2, 7

9 where the term 1 D D s 1 +s 2 is an edge correction term which simplifies the computation. By ignoring some terms that do not depend on h, we propose the following composite likelihood cross-validation criterion C(h) = s 1,s 2 D N { [ W (s 1, s 2 ) log[ĝ( s 1 s 2 ; h)] log ug (u)ĝ(u; h)du] }. (13) R The bandwidth h can then be selected by maximizing C(h). For an m-dependent (i.e., events separated by a distance larger than m are independent) and homogeneous spatial point process, Guan (2006a) argued that maximizing (13) was asymptotically equivalent to minimizing a modified Kullback-Leibler information distance measure (e.g., Silverman 1998) between the pdf f defined by (12) and its nonparametric estimates that is obtained by replacing λ 2 and θ in (12) with ˆλ 2 and h, respectively. 4. A SIMULATION STUDY To compare the performance of the proposed methods, we applied them to realizations from INSPs on a unit square, where the offspring dispersion of the processes was a bivariate radially symmetric normal distribution. We added inhomogeneity to the model by assigning X(s) = s x where s = (s x, s y ) and β = 1. Thus the likelihood for an offspring to survive increases as s x increases. The expected numbers of events per simulation (denoted by λ) was 100 and 200, whereas the expected number of parents ρ = 12.5, 25 for λ = 100 and ρ = 25, 50 for λ = 200. The cluster spread parameter σ was.01,.02,.04. To assess the effects of the tuning parameters, we set r = 3σ, 4σ, 5σ, 6σ and c =.125,.25,.5, 1 for MCEP using the K-function, r h = 5σ and r = 3σ, 4σ, 5σ, 6σ for MCEP using the PCF, and r c =.2,.3,.4 for the CL estimation procedure. Note that 4σ is often regarded as the dependence range for this type of process (e.g., Diggle 2003). Thus we simply intended to link the tuning parameter r to the range of the process. A recommendation on the choice of r in terms of the range of a process is practically appealing since a rough estimate of 8

10 the range can often be obtained with relative ease. For the convenience of presentation, we will use MCEPK, MCEPP and CLEP to denote the MCEP based on the K-function, the PCF and the CL estimation procedure, respectively. Table 1 lists the mean squared errors (MSE) for MCEPK and MCEPP. For MCEPK, we omit the results for c = 1 since they are similar to (and in fact often inferior to) those for c =.5. Similarly, we present only the results for MCEPP when (13) was used to select the bandwidth. Due to space constraint, we also do not include the results for ρ = 50. These omissions do not affect our main conclusions. From Table 1 we see that the choice of tuning parameters plays an important role in the performance of MCEPK. Note that the combination of r = 3σ and c =.125 often led to unstable estimations. In contrast, MCEPP is fairly insensitive to the choice of the tuning parameter (i.e., r). Nevertheless it appears that r = 3σ, 4σ yielded overall the best results. From a practical point of view, this means that we should choose r to be around or slightly smaller than the dependence range of the process. When compared to the best MCEPK, we see that MCEPP gave competitive results for ˆρ and generally comparative results for ˆσ across all levels of r. It s worth noting that for MCEPK, a small MSE for ˆρ is often accompanied by a large MSE for ˆσ and vice verse. This is especially true when σ =.04. We thus recommend MCEPP over MCEPK in general. We also comment on how to select r and c for MCEPK since this is a very popular approach for many practitioners. As being noted in the foregoing paragraph, a combination of a small r value relative to the range of the process and a small c value should be avoided. For a process with a short dependence range (i.e., tight clusters as is the case for σ =.01 in the simulation), c =.5 consistently yielded the best results across all r values. Note that this contradicts to the general perception that c =.25 or less gave better results for attractive point patterns. To select r in this case, we recommend using r = 4σ, 5σ, i.e., around or slightly larger than the dependence range. For a process with a medium 9

11 dependence range that is comparable to the case of σ =.02, c =.25 or.5 worked well. Although c =.125 sometime gave better results when estimating ρ, this was achieved often at a price of a highly variable estimate for σ. To select r in this case, we recommend using r = 4σ, i.e., around the range of the process. For a process with a long dependence range (e.g., σ =.04 in our simulation), c =.125 often performed better than c =.25 and.5 except when r = 3σ. However, one should be aware of that it may also result in an unstable estimate for σ, especially when µ (i.e., cluster size) is large. We thus do not recommend using c =.125 in this case. To select r, we recommend using r = 4σ with c =.25 or r = 3σ for c =.5. Table 2 lists the MSE for CLEP. Overall r c =.3 or.4 worked well. This agrees with the findings of Guan (2006b) in the homogeneous case. When compared to MCEPK and MCEPP, we see that CLEP often yielded better estimates for σ but slightly worse estimates for ρ. Thus only if the main interest of a study is to assess the interaction range between events, CLEP should be preferred. The performances of all three procedures were affected by the model parameters ρ, µ, σ and λ. For the same λ, σ can be estimated more accurately when ρ is larger. This is due to the fact that σ is a cluster level parameter. A larger number of clusters (i.e., ρ) means a bigger sample size for the estimation of σ, which in turn will lead to a better estimate. For the estimation of ρ, more accurate estimates can be achieved when the cluster is tighter (i.e., smaller σ) and the cluster size (i.e., µ) is large. This is because tighter and larger clusters can be more easily distinguished from each other. When the sample size increases, both ρ and σ can be estimated more accurately when µ and σ are fixed. 5. CONCLUSION We have discussed three estimation procedures (i.e., MCEPK, MCEPP and CLEP) that can be used to estimate the SOPs of an INSP. The estimation of the SOPs is important 10

12 since they play a critical role for the inference on the FOPs. We have given specific recommendations on how to select the tuning parameters involved in each procedure. These recommendations are based either on theoretical justifications or on simulation evidences as being presented in Section 3 and Section 4, respectively. The MCEPP is easier to use than the MCEPK due to the facts that only one tuning parameter is involved (as compared to two for MCEPK) and that the performance of it is fairly insensitive to the choice of the tuning parameter. We thus recommend the use of MCEPP over MCEPK in general, especially when the dependence range of the process is relatively large. If the main goal of the study is to obtain the interaction range among events, we recommend the use of the CLEP. APPENDIX: DERIVATIONS OF (8) Assume that λ(s) is bounded below from zero. Consider a sequence of regions D n and bandwidths h n. Let D n denote the boundary of D n and D n denote the length of D n. We assume the following condition on D n and h n : D n = O(n 2 ), D n = O(n), and h n = O(n β ) for some β (0, 2). (14) For the ease of mathematical representations, let e(s 1, s 2 ) = 1/ D n, that is, we do not consider any edge correction term. Also assume that ˆβ 0 and ˆβ converge to the target parameters faster than ĝ n (t; h) converges to g(t; θ). We thus substitute them with the true values without altering the asymptotic results. Also define λ(s) = β 0 exp[x(s)β]. Let λ k (s 1,, s k ) denote the kth-order intensity function of the process. Note that λ k (s 1,, s k ) has the general form of λ(s 1 ) λ(s k )g k (s 2 s 1,, s k s 1 ) for k 2. Then 4π 2 ( D n ) 2 V ar[ĝ n (t; h)] {k[(t s1 s 2 )/h n ]} 2 g( s 1 s 2 ) = 2 ds λ(s 1 )λ(s 2 ) s 1 s 2 2 (h n ) 2 1 ds 2 k[(t s1 s 2 )/h n ]k[(t s 1 s 3 )/h n ]g 3 (s 2 s 1, s 3 s 1 ) + 4 ds λ(s 1 ) s 1 s 2 s 1 s 3 (h n ) 2 1 ds 2 ds 3 11

13 + k[(t s1 s 2 )/h n ]k[(t s 3 s 4 )/h n ] s 1 s 2 s 3 s 4 (h n ) 2 [g 4 (s 2 s 1, s 3 s 1, s 4 s 1 ) g( s 1 s 2 )g( s 3 s 4 )]ds 1 ds 2 ds 3 ds 4, where the integrations are all over D. Lengthy yet straightforward algebra not included in the paper yields that the first term on the right side of the equality dominates over the other two. This is due to condition (14) and the fact that any INSP is Brillinger mixing (Heinrich 1988). Furthermore, the first term is approximately g(t; θ) h n t 2π 0 1 λ(s)λ[s + u(t, ψ)] dψds [k(u)] 2 du. Thus we have proved (8). REFERENCES Diggle, P. J. (2003), Statistical Analysis of Spatial Point Patterns, New York: Oxford University Press Inc. Guan, Y. (2006a), A Composite Likelihood Cross-Validation Approach in Selecting Bandwidth for the Estimation of the Pair Correlation Function, Scandinavian Journal of Statistics, to appear. Guan, Y. (2006b), A Least-Squares Cross-Validation Bandwidth Selection Approach in Pair Correlation Estimations, Statistics & Probability Letters, submitted. Guan, Y. (2006c), A Composite Likelihood Approach in Fitting Spatial Point Process Models, Journal of the American Statistical Association, to appear. Guan, Y. and Sherman, M. (2006), On Least Squares Fitting for Spatial Point Processes, Journal of the Royal Statistical Society, Ser. B, submitted revision. 12

14 Heinrich, L. (1988), Asymptotic Gaussianity of Some Estimators for Reduced Factorial Moment Measures and Product Densities of Stationary Poisson Cluster Processes, Statistics, 19, Møller, J. and Waagepetersen, R. P. (2004), Statistical Inference and Simulation for Spatial Point Processes, New York: Chapman & Hall. Møller, J. and Waagepetersen, R. P. (2006), Modern Statistics for Spatial Point Processes, Scandinavian Journal of Statistics, submitted. Nielsen, L. S. and Jensen, E. B..V. (2004), Statistical Inference for Transformation Inhomogeneous Point Processes, Scandinavian Journal of Statistics, 28, Silverman, B. W. (1998), Density Estimation for Statistics and Data Analysis, New York: Chapman & Hall. Stoyan, D. and Stoyan, H. (1994), Fractals, Random Shapes and Point Fields, New York: Wiley. Stoyan, D. and Stoyan, H. (1998), Non-Homogeneous Gibbs Process Models for Forestry - A Case Study, Biometrical Journal, 40, Waagepetersen, R. P. (2006), An Estimating Function Approach to Inference for Inhomogeneous Neyman-Scott Processes, Biometrics, to appear. 13

15 Table 1. MSEs of ˆρ and ˆσ for MCEPs using the K-function and the PCF. Each MSE is divided by the squared target parameter. ρ = 12.5, λ = 100 ρ = 25, λ = 100 ρ = 25, λ = 200 K PCF K PCF K PCF σ r ˆρ.01 3σ σ σ σ σ σ σ σ σ σ σ σ ˆσ.01 3σ σ σ σ σ σ σ σ σ σ σ σ

16 Table 2. MSEs of ˆρ and ˆσ for CLEP. Each MSE is divided by the squared target parameter. ρ = 12.5, λ = 100 ρ = 25, λ = 100 ρ = 25, λ = ˆρ ˆσ

Variance Estimation for Statistics Computed from. Inhomogeneous Spatial Point Processes

Variance Estimation for Statistics Computed from. Inhomogeneous Spatial Point Processes Variance Estimation for Statistics Computed from Inhomogeneous Spatial Point Processes Yongtao Guan April 14, 2007 Abstract This paper introduces a new approach to estimate the variance of statistics that

More information

A Thinned Block Bootstrap Variance Estimation. Procedure for Inhomogeneous Spatial Point Patterns

A Thinned Block Bootstrap Variance Estimation. Procedure for Inhomogeneous Spatial Point Patterns A Thinned Block Bootstrap Variance Estimation Procedure for Inhomogeneous Spatial Point Patterns May 22, 2007 Abstract When modeling inhomogeneous spatial point patterns, it is of interest to fit a parametric

More information

Second-Order Analysis of Spatial Point Processes

Second-Order Analysis of Spatial Point Processes Title Second-Order Analysis of Spatial Point Process Tonglin Zhang Outline Outline Spatial Point Processes Intensity Functions Mean and Variance Pair Correlation Functions Stationarity K-functions Some

More information

Point Processes. Tonglin Zhang. Spatial Statistics for Point and Lattice Data (Part II)

Point Processes. Tonglin Zhang. Spatial Statistics for Point and Lattice Data (Part II) Title: patial tatistics for Point Processes and Lattice Data (Part II) Point Processes Tonglin Zhang Outline Outline imulated Examples Interesting Problems Analysis under tationarity Analysis under Nonstationarity

More information

ESTIMATING FUNCTIONS FOR INHOMOGENEOUS COX PROCESSES

ESTIMATING FUNCTIONS FOR INHOMOGENEOUS COX PROCESSES ESTIMATING FUNCTIONS FOR INHOMOGENEOUS COX PROCESSES Rasmus Waagepetersen Department of Mathematics, Aalborg University, Fredrik Bajersvej 7G, DK-9220 Aalborg, Denmark (rw@math.aau.dk) Abstract. Estimation

More information

Spatial analysis of tropical rain forest plot data

Spatial analysis of tropical rain forest plot data Spatial analysis of tropical rain forest plot data Rasmus Waagepetersen Department of Mathematical Sciences Aalborg University December 11, 2010 1/45 Tropical rain forest ecology Fundamental questions:

More information

Statistics: Learning models from data

Statistics: Learning models from data DS-GA 1002 Lecture notes 5 October 19, 2015 Statistics: Learning models from data Learning models from data that are assumed to be generated probabilistically from a certain unknown distribution is a crucial

More information

Minimum Hellinger Distance Estimation in a. Semiparametric Mixture Model

Minimum Hellinger Distance Estimation in a. Semiparametric Mixture Model Minimum Hellinger Distance Estimation in a Semiparametric Mixture Model Sijia Xiang 1, Weixin Yao 1, and Jingjing Wu 2 1 Department of Statistics, Kansas State University, Manhattan, Kansas, USA 66506-0802.

More information

Estimating functions for inhomogeneous spatial point processes with incomplete covariate data

Estimating functions for inhomogeneous spatial point processes with incomplete covariate data Estimating functions for inhomogeneous spatial point processes with incomplete covariate data Rasmus aagepetersen Department of Mathematics Aalborg University Denmark August 15, 2007 1 / 23 Data (Barro

More information

On prediction and density estimation Peter McCullagh University of Chicago December 2004

On prediction and density estimation Peter McCullagh University of Chicago December 2004 On prediction and density estimation Peter McCullagh University of Chicago December 2004 Summary Having observed the initial segment of a random sequence, subsequent values may be predicted by calculating

More information

Introduction to Spatial Data and Models

Introduction to Spatial Data and Models Introduction to Spatial Data and Models Sudipto Banerjee 1 and Andrew O. Finley 2 1 Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota, U.S.A. 2 Department of Forestry

More information

Introduction to Spatial Data and Models

Introduction to Spatial Data and Models Introduction to Spatial Data and Models Sudipto Banerjee 1 and Andrew O. Finley 2 1 Department of Forestry & Department of Geography, Michigan State University, Lansing Michigan, U.S.A. 2 Biostatistics,

More information

Estimation of a Neyman-Scott process from line transect data by a one-dimensional K-function

Estimation of a Neyman-Scott process from line transect data by a one-dimensional K-function Estimation of a Neyman-Scott process from line transect data by a one-dimensional K-function Magne Aldrin Norwegian Computing Center, P.O. Box 114 Blindern, N-0314 Oslo, Norway email: magne.aldrin@nr.no

More information

Discussion of the paper Inference for Semiparametric Models: Some Questions and an Answer by Bickel and Kwon

Discussion of the paper Inference for Semiparametric Models: Some Questions and an Answer by Bickel and Kwon Discussion of the paper Inference for Semiparametric Models: Some Questions and an Answer by Bickel and Kwon Jianqing Fan Department of Statistics Chinese University of Hong Kong AND Department of Statistics

More information

Spatial point process. Odd Kolbjørnsen 13.March 2017

Spatial point process. Odd Kolbjørnsen 13.March 2017 Spatial point process Odd Kolbjørnsen 13.March 2017 Today Point process Poisson process Intensity Homogeneous Inhomogeneous Other processes Random intensity (Cox process) Log Gaussian cox process (LGCP)

More information

Selection Criteria Based on Monte Carlo Simulation and Cross Validation in Mixed Models

Selection Criteria Based on Monte Carlo Simulation and Cross Validation in Mixed Models Selection Criteria Based on Monte Carlo Simulation and Cross Validation in Mixed Models Junfeng Shang Bowling Green State University, USA Abstract In the mixed modeling framework, Monte Carlo simulation

More information

Chapter 2. Poisson point processes

Chapter 2. Poisson point processes Chapter 2. Poisson point processes Jean-François Coeurjolly http://www-ljk.imag.fr/membres/jean-francois.coeurjolly/ Laboratoire Jean Kuntzmann (LJK), Grenoble University Setting for this chapter To ease

More information

Log-Density Estimation with Application to Approximate Likelihood Inference

Log-Density Estimation with Application to Approximate Likelihood Inference Log-Density Estimation with Application to Approximate Likelihood Inference Martin Hazelton 1 Institute of Fundamental Sciences Massey University 19 November 2015 1 Email: m.hazelton@massey.ac.nz WWPMS,

More information

Nonparametric Density Estimation

Nonparametric Density Estimation Nonparametric Density Estimation Econ 690 Purdue University Justin L. Tobias (Purdue) Nonparametric Density Estimation 1 / 29 Density Estimation Suppose that you had some data, say on wages, and you wanted

More information

Two step estimation for Neyman-Scott point process with inhomogeneous cluster centers. May 2012

Two step estimation for Neyman-Scott point process with inhomogeneous cluster centers. May 2012 Two step estimation for Neyman-Scott point process with inhomogeneous cluster centers Tomáš Mrkvička, Milan Muška, Jan Kubečka May 2012 Motivation Study of the influence of covariates on the occurrence

More information

Regularization in Cox Frailty Models

Regularization in Cox Frailty Models Regularization in Cox Frailty Models Andreas Groll 1, Trevor Hastie 2, Gerhard Tutz 3 1 Ludwig-Maximilians-Universität Munich, Department of Mathematics, Theresienstraße 39, 80333 Munich, Germany 2 University

More information

Akaike Information Criterion to Select the Parametric Detection Function for Kernel Estimator Using Line Transect Data

Akaike Information Criterion to Select the Parametric Detection Function for Kernel Estimator Using Line Transect Data Journal of Modern Applied Statistical Methods Volume 12 Issue 2 Article 21 11-1-2013 Akaike Information Criterion to Select the Parametric Detection Function for Kernel Estimator Using Line Transect Data

More information

Ripley s K function. Philip M. Dixon Iowa State University,

Ripley s K function. Philip M. Dixon Iowa State University, Statistics Preprints Statistics 12-2001 Ripley s K function Philip M. Dixon Iowa State University, pdixon@iastate.edu Follow this and additional works at: http://lib.dr.iastate.edu/stat_las_preprints Part

More information

Supplemental Material for KERNEL-BASED INFERENCE IN TIME-VARYING COEFFICIENT COINTEGRATING REGRESSION. September 2017

Supplemental Material for KERNEL-BASED INFERENCE IN TIME-VARYING COEFFICIENT COINTEGRATING REGRESSION. September 2017 Supplemental Material for KERNEL-BASED INFERENCE IN TIME-VARYING COEFFICIENT COINTEGRATING REGRESSION By Degui Li, Peter C. B. Phillips, and Jiti Gao September 017 COWLES FOUNDATION DISCUSSION PAPER NO.

More information

A Bivariate Point Process Model with Application to Social Media User Content Generation

A Bivariate Point Process Model with Application to Social Media User Content Generation 1 / 33 A Bivariate Point Process Model with Application to Social Media User Content Generation Emma Jingfei Zhang ezhang@bus.miami.edu Yongtao Guan yguan@bus.miami.edu Department of Management Science

More information

Lecture 10 Spatio-Temporal Point Processes

Lecture 10 Spatio-Temporal Point Processes Lecture 10 Spatio-Temporal Point Processes Dennis Sun Stats 253 July 23, 2014 Outline of Lecture 1 Review of Last Lecture 2 Spatio-temporal Point Processes 3 The Spatio-temporal Poisson Process 4 Modeling

More information

ECE 275A Homework 7 Solutions

ECE 275A Homework 7 Solutions ECE 275A Homework 7 Solutions Solutions 1. For the same specification as in Homework Problem 6.11 we want to determine an estimator for θ using the Method of Moments (MOM). In general, the MOM estimator

More information

Research Article A Nonparametric Two-Sample Wald Test of Equality of Variances

Research Article A Nonparametric Two-Sample Wald Test of Equality of Variances Advances in Decision Sciences Volume 211, Article ID 74858, 8 pages doi:1.1155/211/74858 Research Article A Nonparametric Two-Sample Wald Test of Equality of Variances David Allingham 1 andj.c.w.rayner

More information

RESEARCH REPORT. A note on gaps in proofs of central limit theorems. Christophe A.N. Biscio, Arnaud Poinas and Rasmus Waagepetersen

RESEARCH REPORT. A note on gaps in proofs of central limit theorems.   Christophe A.N. Biscio, Arnaud Poinas and Rasmus Waagepetersen CENTRE FOR STOCHASTIC GEOMETRY AND ADVANCED BIOIMAGING 2017 www.csgb.dk RESEARCH REPORT Christophe A.N. Biscio, Arnaud Poinas and Rasmus Waagepetersen A note on gaps in proofs of central limit theorems

More information

DIAGNOSTICS FOR STRATIFIED CLINICAL TRIALS IN PROPORTIONAL ODDS MODELS

DIAGNOSTICS FOR STRATIFIED CLINICAL TRIALS IN PROPORTIONAL ODDS MODELS DIAGNOSTICS FOR STRATIFIED CLINICAL TRIALS IN PROPORTIONAL ODDS MODELS Ivy Liu and Dong Q. Wang School of Mathematics, Statistics and Computer Science Victoria University of Wellington New Zealand Corresponding

More information

On the Behavior of Marginal and Conditional Akaike Information Criteria in Linear Mixed Models

On the Behavior of Marginal and Conditional Akaike Information Criteria in Linear Mixed Models On the Behavior of Marginal and Conditional Akaike Information Criteria in Linear Mixed Models Thomas Kneib Department of Mathematics Carl von Ossietzky University Oldenburg Sonja Greven Department of

More information

Performance Comparison of K-Means and Expectation Maximization with Gaussian Mixture Models for Clustering EE6540 Final Project

Performance Comparison of K-Means and Expectation Maximization with Gaussian Mixture Models for Clustering EE6540 Final Project Performance Comparison of K-Means and Expectation Maximization with Gaussian Mixture Models for Clustering EE6540 Final Project Devin Cornell & Sushruth Sastry May 2015 1 Abstract In this article, we explore

More information

Chapter 9. Non-Parametric Density Function Estimation

Chapter 9. Non-Parametric Density Function Estimation 9-1 Density Estimation Version 1.1 Chapter 9 Non-Parametric Density Function Estimation 9.1. Introduction We have discussed several estimation techniques: method of moments, maximum likelihood, and least

More information

Lecture 2: Poisson point processes: properties and statistical inference

Lecture 2: Poisson point processes: properties and statistical inference Lecture 2: Poisson point processes: properties and statistical inference Jean-François Coeurjolly http://www-ljk.imag.fr/membres/jean-francois.coeurjolly/ 1 / 20 Definition, properties and simulation Statistical

More information

Decomposition of variance for spatial Cox processes

Decomposition of variance for spatial Cox processes Decomposition of variance for spatial Cox processes Rasmus Waagepetersen Department of Mathematical Sciences Aalborg University Joint work with Abdollah Jalilian and Yongtao Guan November 8, 2010 1/34

More information

Rejoinder. 1 Phase I and Phase II Profile Monitoring. Peihua Qiu 1, Changliang Zou 2 and Zhaojun Wang 2

Rejoinder. 1 Phase I and Phase II Profile Monitoring. Peihua Qiu 1, Changliang Zou 2 and Zhaojun Wang 2 Rejoinder Peihua Qiu 1, Changliang Zou 2 and Zhaojun Wang 2 1 School of Statistics, University of Minnesota 2 LPMC and Department of Statistics, Nankai University, China We thank the editor Professor David

More information

Likelihood-Based Methods

Likelihood-Based Methods Likelihood-Based Methods Handbook of Spatial Statistics, Chapter 4 Susheela Singh September 22, 2016 OVERVIEW INTRODUCTION MAXIMUM LIKELIHOOD ESTIMATION (ML) RESTRICTED MAXIMUM LIKELIHOOD ESTIMATION (REML)

More information

Statistical Methods for Handling Incomplete Data Chapter 2: Likelihood-based approach

Statistical Methods for Handling Incomplete Data Chapter 2: Likelihood-based approach Statistical Methods for Handling Incomplete Data Chapter 2: Likelihood-based approach Jae-Kwang Kim Department of Statistics, Iowa State University Outline 1 Introduction 2 Observed likelihood 3 Mean Score

More information

Point process models for earthquakes with applications to Groningen and Kashmir data

Point process models for earthquakes with applications to Groningen and Kashmir data Point process models for earthquakes with applications to Groningen and Kashmir data Marie-Colette van Lieshout colette@cwi.nl CWI & Twente The Netherlands Point process models for earthquakes with applications

More information

An introduction to spatial point processes

An introduction to spatial point processes An introduction to spatial point processes Jean-François Coeurjolly 1 Examples of spatial data 2 Intensities and Poisson p.p. 3 Summary statistics 4 Models for point processes A very very brief introduction...

More information

Quasi-likelihood Scan Statistics for Detection of

Quasi-likelihood Scan Statistics for Detection of for Quasi-likelihood for Division of Biostatistics and Bioinformatics, National Health Research Institutes & Department of Mathematics, National Chung Cheng University 17 December 2011 1 / 25 Outline for

More information

Quantile Regression for Residual Life and Empirical Likelihood

Quantile Regression for Residual Life and Empirical Likelihood Quantile Regression for Residual Life and Empirical Likelihood Mai Zhou email: mai@ms.uky.edu Department of Statistics, University of Kentucky, Lexington, KY 40506-0027, USA Jong-Hyeon Jeong email: jeong@nsabp.pitt.edu

More information

Professors Lin and Ying are to be congratulated for an interesting paper on a challenging topic and for introducing survival analysis techniques to th

Professors Lin and Ying are to be congratulated for an interesting paper on a challenging topic and for introducing survival analysis techniques to th DISCUSSION OF THE PAPER BY LIN AND YING Xihong Lin and Raymond J. Carroll Λ July 21, 2000 Λ Xihong Lin (xlin@sph.umich.edu) is Associate Professor, Department ofbiostatistics, University of Michigan, Ann

More information

Overall Objective Priors

Overall Objective Priors Overall Objective Priors Jim Berger, Jose Bernardo and Dongchu Sun Duke University, University of Valencia and University of Missouri Recent advances in statistical inference: theory and case studies University

More information

Chapter 9. Non-Parametric Density Function Estimation

Chapter 9. Non-Parametric Density Function Estimation 9-1 Density Estimation Version 1.2 Chapter 9 Non-Parametric Density Function Estimation 9.1. Introduction We have discussed several estimation techniques: method of moments, maximum likelihood, and least

More information

FULL LIKELIHOOD INFERENCES IN THE COX MODEL

FULL LIKELIHOOD INFERENCES IN THE COX MODEL October 20, 2007 FULL LIKELIHOOD INFERENCES IN THE COX MODEL BY JIAN-JIAN REN 1 AND MAI ZHOU 2 University of Central Florida and University of Kentucky Abstract We use the empirical likelihood approach

More information

Information geometry for bivariate distribution control

Information geometry for bivariate distribution control Information geometry for bivariate distribution control C.T.J.Dodson + Hong Wang Mathematics + Control Systems Centre, University of Manchester Institute of Science and Technology Optimal control of stochastic

More information

On Backtesting Risk Measurement Models

On Backtesting Risk Measurement Models On Backtesting Risk Measurement Models Hideatsu Tsukahara Department of Economics, Seijo University e-mail address: tsukahar@seijo.ac.jp 1 Introduction In general, the purpose of backtesting is twofold:

More information

A Novel Nonparametric Density Estimator

A Novel Nonparametric Density Estimator A Novel Nonparametric Density Estimator Z. I. Botev The University of Queensland Australia Abstract We present a novel nonparametric density estimator and a new data-driven bandwidth selection method with

More information

On robust and efficient estimation of the center of. Symmetry.

On robust and efficient estimation of the center of. Symmetry. On robust and efficient estimation of the center of symmetry Howard D. Bondell Department of Statistics, North Carolina State University Raleigh, NC 27695-8203, U.S.A (email: bondell@stat.ncsu.edu) Abstract

More information

MA/ST 810 Mathematical-Statistical Modeling and Analysis of Complex Systems

MA/ST 810 Mathematical-Statistical Modeling and Analysis of Complex Systems MA/ST 810 Mathematical-Statistical Modeling and Analysis of Complex Systems Principles of Statistical Inference Recap of statistical models Statistical inference (frequentist) Parametric vs. semiparametric

More information

The Mixture Approach for Simulating New Families of Bivariate Distributions with Specified Correlations

The Mixture Approach for Simulating New Families of Bivariate Distributions with Specified Correlations The Mixture Approach for Simulating New Families of Bivariate Distributions with Specified Correlations John R. Michael, Significance, Inc. and William R. Schucany, Southern Methodist University The mixture

More information

More on Estimation. Maximum Likelihood Estimation.

More on Estimation. Maximum Likelihood Estimation. More on Estimation. In the previous chapter we looked at the properties of estimators and the criteria we could use to choose between types of estimators. Here we examine more closely some very popular

More information

Decomposition of variance for spatial Cox processes

Decomposition of variance for spatial Cox processes Decomposition of variance for spatial Cox processes Rasmus Waagepetersen Department of Mathematical Sciences Aalborg University Joint work with Abdollah Jalilian and Yongtao Guan December 13, 2010 1/25

More information

POLI 8501 Introduction to Maximum Likelihood Estimation

POLI 8501 Introduction to Maximum Likelihood Estimation POLI 8501 Introduction to Maximum Likelihood Estimation Maximum Likelihood Intuition Consider a model that looks like this: Y i N(µ, σ 2 ) So: E(Y ) = µ V ar(y ) = σ 2 Suppose you have some data on Y,

More information

A Bayesian perspective on GMM and IV

A Bayesian perspective on GMM and IV A Bayesian perspective on GMM and IV Christopher A. Sims Princeton University sims@princeton.edu November 26, 2013 What is a Bayesian perspective? A Bayesian perspective on scientific reporting views all

More information

Generalized Linear Models

Generalized Linear Models Generalized Linear Models Advanced Methods for Data Analysis (36-402/36-608 Spring 2014 1 Generalized linear models 1.1 Introduction: two regressions So far we ve seen two canonical settings for regression.

More information

Model comparison: Deviance-based approaches

Model comparison: Deviance-based approaches Model comparison: Deviance-based approaches Patrick Breheny February 19 Patrick Breheny BST 701: Bayesian Modeling in Biostatistics 1/23 Model comparison Thus far, we have looked at residuals in a fairly

More information

Modelling Non-linear and Non-stationary Time Series

Modelling Non-linear and Non-stationary Time Series Modelling Non-linear and Non-stationary Time Series Chapter 2: Non-parametric methods Henrik Madsen Advanced Time Series Analysis September 206 Henrik Madsen (02427 Adv. TS Analysis) Lecture Notes September

More information

Properties of spatial Cox process models

Properties of spatial Cox process models Properties of spatial Cox process models Jesper Møller Department of Mathematical Sciences, Aalborg University Abstract: Probabilistic properties of Cox processes of relevance for statistical modelling

More information

Introduction to Geostatistics

Introduction to Geostatistics Introduction to Geostatistics Abhi Datta 1, Sudipto Banerjee 2 and Andrew O. Finley 3 July 31, 2017 1 Department of Biostatistics, Bloomberg School of Public Health, Johns Hopkins University, Baltimore,

More information

Regression I: Mean Squared Error and Measuring Quality of Fit

Regression I: Mean Squared Error and Measuring Quality of Fit Regression I: Mean Squared Error and Measuring Quality of Fit -Applied Multivariate Analysis- Lecturer: Darren Homrighausen, PhD 1 The Setup Suppose there is a scientific problem we are interested in solving

More information

Tests for spatial randomness based on spacings

Tests for spatial randomness based on spacings Tests for spatial randomness based on spacings Lionel Cucala and Christine Thomas-Agnan LSP, Université Paul Sabatier and GREMAQ, Université Sciences-Sociales, Toulouse, France E-mail addresses : cucala@cict.fr,

More information

Lecture 9 Point Processes

Lecture 9 Point Processes Lecture 9 Point Processes Dennis Sun Stats 253 July 21, 2014 Outline of Lecture 1 Last Words about the Frequency Domain 2 Point Processes in Time and Space 3 Inhomogeneous Poisson Processes 4 Second-Order

More information

Statistics for analyzing and modeling precipitation isotope ratios in IsoMAP

Statistics for analyzing and modeling precipitation isotope ratios in IsoMAP Statistics for analyzing and modeling precipitation isotope ratios in IsoMAP The IsoMAP uses the multiple linear regression and geostatistical methods to analyze isotope data Suppose the response variable

More information

Notes on the Multivariate Normal and Related Topics

Notes on the Multivariate Normal and Related Topics Version: July 10, 2013 Notes on the Multivariate Normal and Related Topics Let me refresh your memory about the distinctions between population and sample; parameters and statistics; population distributions

More information

Calibration Estimation of Semiparametric Copula Models with Data Missing at Random

Calibration Estimation of Semiparametric Copula Models with Data Missing at Random Calibration Estimation of Semiparametric Copula Models with Data Missing at Random Shigeyuki Hamori 1 Kaiji Motegi 1 Zheng Zhang 2 1 Kobe University 2 Renmin University of China Econometrics Workshop UNC

More information

A nonparametric two-sample wald test of equality of variances

A nonparametric two-sample wald test of equality of variances University of Wollongong Research Online Faculty of Informatics - Papers (Archive) Faculty of Engineering and Information Sciences 211 A nonparametric two-sample wald test of equality of variances David

More information

Points. Luc Anselin. Copyright 2017 by Luc Anselin, All Rights Reserved

Points. Luc Anselin.   Copyright 2017 by Luc Anselin, All Rights Reserved Points Luc Anselin http://spatial.uchicago.edu 1 classic point pattern analysis spatial randomness intensity distance-based statistics points on networks 2 Classic Point Pattern Analysis 3 Classic Examples

More information

Decomposition of variance for spatial Cox processes Jalilian, Abdollah; Guan, Yongtao; Waagepetersen, Rasmus Plenge

Decomposition of variance for spatial Cox processes Jalilian, Abdollah; Guan, Yongtao; Waagepetersen, Rasmus Plenge Aalborg Universitet Decomposition of variance for spatial Cox processes Jalilian, Abdollah; Guan, Yongtao; Waagepetersen, Rasmus Plenge Publication date: 2011 Document Version Early version, also known

More information

Bias and variance reduction techniques for bootstrap information criteria

Bias and variance reduction techniques for bootstrap information criteria Ann Inst Stat Math (2010) 62:209 234 DOI 10.1007/s10463-009-0237-1 Bias and variance reduction techniques for bootstrap information criteria Genshiro Kitagawa Sadanori Konishi Received: 17 November 2008

More information

Integrated Likelihood Estimation in Semiparametric Regression Models. Thomas A. Severini Department of Statistics Northwestern University

Integrated Likelihood Estimation in Semiparametric Regression Models. Thomas A. Severini Department of Statistics Northwestern University Integrated Likelihood Estimation in Semiparametric Regression Models Thomas A. Severini Department of Statistics Northwestern University Joint work with Heping He, University of York Introduction Let Y

More information

MS&E 226: Small Data. Lecture 11: Maximum likelihood (v2) Ramesh Johari

MS&E 226: Small Data. Lecture 11: Maximum likelihood (v2) Ramesh Johari MS&E 226: Small Data Lecture 11: Maximum likelihood (v2) Ramesh Johari ramesh.johari@stanford.edu 1 / 18 The likelihood function 2 / 18 Estimating the parameter This lecture develops the methodology behind

More information

Analysis of the AIC Statistic for Optimal Detection of Small Changes in Dynamic Systems

Analysis of the AIC Statistic for Optimal Detection of Small Changes in Dynamic Systems Analysis of the AIC Statistic for Optimal Detection of Small Changes in Dynamic Systems Jeremy S. Conner and Dale E. Seborg Department of Chemical Engineering University of California, Santa Barbara, CA

More information

Multiplicative component models for replicated point processes

Multiplicative component models for replicated point processes arxiv:1705.09693v1 [stat.me] 26 May 2017 Multiplicative component models for replicated point processes Daniel Gervini Department of Mathematical Sciences University of Wisconsin Milwaukee October 8, 2018

More information

Bias-corrected AIC for selecting variables in Poisson regression models

Bias-corrected AIC for selecting variables in Poisson regression models Bias-corrected AIC for selecting variables in Poisson regression models Ken-ichi Kamo (a), Hirokazu Yanagihara (b) and Kenichi Satoh (c) (a) Corresponding author: Department of Liberal Arts and Sciences,

More information

University of California, Berkeley

University of California, Berkeley University of California, Berkeley U.C. Berkeley Division of Biostatistics Working Paper Series Year 24 Paper 153 A Note on Empirical Likelihood Inference of Residual Life Regression Ying Qing Chen Yichuan

More information

Statistics 222, Spatial Statistics. Outline for the day: 1. Problems and code from last lecture. 2. Likelihood. 3. MLE. 4. Simulation.

Statistics 222, Spatial Statistics. Outline for the day: 1. Problems and code from last lecture. 2. Likelihood. 3. MLE. 4. Simulation. Statistics 222, Spatial Statistics. Outline for the day: 1. Problems and code from last lecture. 2. Likelihood. 3. MLE. 4. Simulation. 1. Questions and code from last time. The difference between ETAS

More information

ICML Scalable Bayesian Inference on Point processes. with Gaussian Processes. Yves-Laurent Kom Samo & Stephen Roberts

ICML Scalable Bayesian Inference on Point processes. with Gaussian Processes. Yves-Laurent Kom Samo & Stephen Roberts ICML 2015 Scalable Nonparametric Bayesian Inference on Point Processes with Gaussian Processes Machine Learning Research Group and Oxford-Man Institute University of Oxford July 8, 2015 Point Processes

More information

On the Behavior of Marginal and Conditional Akaike Information Criteria in Linear Mixed Models

On the Behavior of Marginal and Conditional Akaike Information Criteria in Linear Mixed Models On the Behavior of Marginal and Conditional Akaike Information Criteria in Linear Mixed Models Thomas Kneib Institute of Statistics and Econometrics Georg-August-University Göttingen Department of Statistics

More information

Semi-parametric estimation of non-stationary Pickands functions

Semi-parametric estimation of non-stationary Pickands functions Semi-parametric estimation of non-stationary Pickands functions Linda Mhalla 1 Joint work with: Valérie Chavez-Demoulin 2 and Philippe Naveau 3 1 Geneva School of Economics and Management, University of

More information

Hypothesis Testing Based on the Maximum of Two Statistics from Weighted and Unweighted Estimating Equations

Hypothesis Testing Based on the Maximum of Two Statistics from Weighted and Unweighted Estimating Equations Hypothesis Testing Based on the Maximum of Two Statistics from Weighted and Unweighted Estimating Equations Takeshi Emura and Hisayuki Tsukuma Abstract For testing the regression parameter in multivariate

More information

Test of Complete Spatial Randomness on Networks

Test of Complete Spatial Randomness on Networks Test of Complete Spatial Randomness on Networks A PROJECT SUBMITTED TO THE FACULTY OF THE GRADUATE SCHOOL OF THE UNIVERSITY OF MINNESOTA BY Xinyue Chang IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE

More information

Model Selection and Geometry

Model Selection and Geometry Model Selection and Geometry Pascal Massart Université Paris-Sud, Orsay Leipzig, February Purpose of the talk! Concentration of measure plays a fundamental role in the theory of model selection! Model

More information

Using all observations when forecasting under structural breaks

Using all observations when forecasting under structural breaks Using all observations when forecasting under structural breaks Stanislav Anatolyev New Economic School Victor Kitov Moscow State University December 2007 Abstract We extend the idea of the trade-off window

More information

Statistical Applications in Genetics and Molecular Biology

Statistical Applications in Genetics and Molecular Biology Statistical Applications in Genetics and Molecular Biology Volume, Issue 006 Article 9 A Method to Increase the Power of Multiple Testing Procedures Through Sample Splitting Daniel Rubin Sandrine Dudoit

More information

CONCEPT OF DENSITY FOR FUNCTIONAL DATA

CONCEPT OF DENSITY FOR FUNCTIONAL DATA CONCEPT OF DENSITY FOR FUNCTIONAL DATA AURORE DELAIGLE U MELBOURNE & U BRISTOL PETER HALL U MELBOURNE & UC DAVIS 1 CONCEPT OF DENSITY IN FUNCTIONAL DATA ANALYSIS The notion of probability density for a

More information

Unsupervised Learning with Permuted Data

Unsupervised Learning with Permuted Data Unsupervised Learning with Permuted Data Sergey Kirshner skirshne@ics.uci.edu Sridevi Parise sparise@ics.uci.edu Padhraic Smyth smyth@ics.uci.edu School of Information and Computer Science, University

More information

Statistics - Lecture One. Outline. Charlotte Wickham 1. Basic ideas about estimation

Statistics - Lecture One. Outline. Charlotte Wickham  1. Basic ideas about estimation Statistics - Lecture One Charlotte Wickham wickham@stat.berkeley.edu http://www.stat.berkeley.edu/~wickham/ Outline 1. Basic ideas about estimation 2. Method of Moments 3. Maximum Likelihood 4. Confidence

More information

ELEMENTS OF PROBABILITY THEORY

ELEMENTS OF PROBABILITY THEORY ELEMENTS OF PROBABILITY THEORY Elements of Probability Theory A collection of subsets of a set Ω is called a σ algebra if it contains Ω and is closed under the operations of taking complements and countable

More information

The Bayesian Approach to Multi-equation Econometric Model Estimation

The Bayesian Approach to Multi-equation Econometric Model Estimation Journal of Statistical and Econometric Methods, vol.3, no.1, 2014, 85-96 ISSN: 2241-0384 (print), 2241-0376 (online) Scienpress Ltd, 2014 The Bayesian Approach to Multi-equation Econometric Model Estimation

More information

A Shape Constrained Estimator of Bidding Function of First-Price Sealed-Bid Auctions

A Shape Constrained Estimator of Bidding Function of First-Price Sealed-Bid Auctions A Shape Constrained Estimator of Bidding Function of First-Price Sealed-Bid Auctions Yu Yvette Zhang Abstract This paper is concerned with economic analysis of first-price sealed-bid auctions with risk

More information

ECE534, Spring 2018: Solutions for Problem Set #3

ECE534, Spring 2018: Solutions for Problem Set #3 ECE534, Spring 08: Solutions for Problem Set #3 Jointly Gaussian Random Variables and MMSE Estimation Suppose that X, Y are jointly Gaussian random variables with µ X = µ Y = 0 and σ X = σ Y = Let their

More information

RESEARCH REPORT. Inhomogeneous spatial point processes with hidden second-order stationarity. Ute Hahn and Eva B.

RESEARCH REPORT. Inhomogeneous spatial point processes with hidden second-order stationarity.  Ute Hahn and Eva B. CENTRE FOR STOCHASTIC GEOMETRY AND ADVANCED BIOIMAGING www.csgb.dk RESEARCH REPORT 2013 Ute Hahn and Eva B. Vedel Jensen Inhomogeneous spatial point processes with hidden second-order stationarity No.

More information

Nonparametric Estimation of Luminosity Functions

Nonparametric Estimation of Luminosity Functions x x Nonparametric Estimation of Luminosity Functions Chad Schafer Department of Statistics, Carnegie Mellon University cschafer@stat.cmu.edu 1 Luminosity Functions The luminosity function gives the number

More information

Point Process Control

Point Process Control Point Process Control The following note is based on Chapters I, II and VII in Brémaud s book Point Processes and Queues (1981). 1 Basic Definitions Consider some probability space (Ω, F, P). A real-valued

More information

Econ 582 Nonparametric Regression

Econ 582 Nonparametric Regression Econ 582 Nonparametric Regression Eric Zivot May 28, 2013 Nonparametric Regression Sofarwehaveonlyconsideredlinearregressionmodels = x 0 β + [ x ]=0 [ x = x] =x 0 β = [ x = x] [ x = x] x = β The assume

More information

A NOTE ON TESTING THE SHOULDER CONDITION IN LINE TRANSECT SAMPLING

A NOTE ON TESTING THE SHOULDER CONDITION IN LINE TRANSECT SAMPLING A NOTE ON TESTING THE SHOULDER CONDITION IN LINE TRANSECT SAMPLING Shunpu Zhang Department of Mathematical Sciences University of Alaska, Fairbanks, Alaska, U.S.A. 99775 ABSTRACT We propose a new method

More information

Forecasting Levels of log Variables in Vector Autoregressions

Forecasting Levels of log Variables in Vector Autoregressions September 24, 200 Forecasting Levels of log Variables in Vector Autoregressions Gunnar Bårdsen Department of Economics, Dragvoll, NTNU, N-749 Trondheim, NORWAY email: gunnar.bardsen@svt.ntnu.no Helmut

More information

Fisher information for generalised linear mixed models

Fisher information for generalised linear mixed models Journal of Multivariate Analysis 98 2007 1412 1416 www.elsevier.com/locate/jmva Fisher information for generalised linear mixed models M.P. Wand Department of Statistics, School of Mathematics and Statistics,

More information