Estimating function analysis for a class of Tweedie regression models


Author
Wagner Hugo Bonat
Departamento de Estatística - DEST, Laboratório de Estatística e Geoinformação - LEG, Universidade Federal do Paraná - UFPR, Curitiba, Paraná, Brasil, wbonat@ufpr.br
and
Department of Mathematics and Computer Science - IMADA, University of Southern Denmark - SDU, Odense, Denmark, wbonat@sdu.dk

Abstract
We propose a new way to make inference on Tweedie regression models based on the estimating function approach. We adopt the quasi-score function for the regression parameters and the Pearson estimating function for the dispersion parameters. We perform a simulation study to compare our approach with the maximum likelihood method. The results show that both methods perform similarly, but the estimating function approach is better at estimating small values of the power parameter. Some advantages of using estimating functions are: (i) there is no need to evaluate the density function; (ii) values of the power parameter that are negative or between 0 and 1 can be estimated; (iii) the specification is robust, being based only on second-moment assumptions. We provide an R implementation of our new approach.

Keywords
Tweedie regression, power variance function, estimating function, maximum likelihood.

Supplement materials
http://

1 Introduction

Statistical modeling is one of the most significant fields of applied statistics, with applications in many areas of scientific study, such as sociology, economics, agronomy, medicine and others. There exists an infinity of different statistical models, but the class of Generalized Linear Models (GLM) (Nelder and Wedderburn, 1972) has been the most used over the last three decades. The success of this approach is due to its ability to deal with different types of response variables, such as binary, count and continuous, inside a general framework with a powerful scheme of inference based on the likelihood paradigm.

Some of the most important particular cases of the GLM class are: the linear regression model based on the Gaussian distribution for real response variables, Gamma and inverse Gaussian regression models for positive real response variables, logistic regression based on the Binomial distribution for binary data, and Poisson regression for count data. All these models are linked because they belong to the class of exponential dispersion models (Jørgensen, 1997), and share an amazing characteristic: they are described by their first two moments (mean and variance). Furthermore, the variance function describes the relationship between the mean and the variance of the response variable.

Let $Y$ denote the response variable and assume that the probability function or probability density function of $Y$ belongs to the class of exponential dispersion models, and assume also that $E(Y) = \mu$ and $Var(Y) = \phi V(\mu) = \phi\mu^p$. Then $Y \sim Tw_p(\mu, \phi)$, where $Tw_p(\mu, \phi)$ denotes a Tweedie (Tweedie, 1984; Jørgensen, 1997) random variable with mean $\mu$ and variance $\phi\mu^p$, and $\phi > 0$ and $p \in (-\infty, 0] \cup [1, \infty)$ are parameters that describe the variance structure of $Y$. The Tweedie distribution has many interesting theoretical properties; for a detailed description, see Jørgensen (1997).
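The mean-variance relationship just described can be illustrated with a few lines of Python. This is only a sketch: the function names `power_variance` and `is_tweedie_index` are hypothetical and are not part of the paper's R implementation.

```python
def power_variance(mu, phi, p):
    """Return Var(Y) = phi * mu**p, the power variance function."""
    if phi <= 0:
        raise ValueError("the dispersion parameter phi must be positive")
    return phi * mu ** p


def is_tweedie_index(p):
    """True when p lies in (-inf, 0] or [1, inf), where a Tweedie model exists."""
    return p <= 0 or p >= 1


# Variance at mu = 4 with phi = 1 for several values of the power parameter.
variances = {p: power_variance(4.0, 1.0, p) for p in (0, 1, 2, 3)}
```

The variance grows with the mean at a rate set by $p$, while $\phi$ only rescales it. The check in `is_tweedie_index` reflects the parameter space of the distribution itself, not of the variance function, which is defined for any real $p$.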
For practical situations in statistical modeling the Tweedie distribution is interesting because it delivers many important particular cases, which are identified by the parameter $p$. For example, for $p = 0$ we have the Gaussian distribution, for $p = 1$ and $\phi = 1$ we have the Poisson distribution, and $p = 2$ and $p = 3$ correspond to the Gamma and inverse Gaussian distributions, respectively. Another important case is $1 < p < 2$, which corresponds to the compound Poisson distribution. The Tweedie distribution is already important for statistical modeling because of its particular cases alone, but there exists an infinity of models, since the parameter $p$ may be estimated from a data set, making this simple relationship between mean and variance a rich class of statistical models.

For practical applications the estimation of the parameters that describe the variance structure ($\phi$ and $p$) is important and deserves the same attention devoted to the regression parameters. The orthodox approach is based on the likelihood paradigm, which is an efficient estimation method. A particularity of the Tweedie distribution is that, outside the special cases, its probability density function cannot be written in closed form and must be evaluated numerically. Dunn and Smyth (2001) proposed some methods to evaluate the density function of the Tweedie distribution, but these methods are computationally demanding and show different levels of accuracy in different regions of the parameter space. This makes likelihood-based inference difficult and sometimes slow.

The main objective of the paper is to propose a new way to estimate the parameters ($\phi$ and $p$) based on Pearson estimating functions (Jørgensen and Knudsen, 2004). This method is computationally very fast, because it employs merely the first two moments (mean and variance) and in this way avoids evaluating the probability density function. Furthermore, we present an efficient and stable algorithm to obtain the point estimates.
The inference is based on asymptotic results, and we show expressions for the sensitivity and the Godambe information matrix. The variance of the Pearson estimating function is approximated based on empirical third and fourth moments. We run a simulation study to show the properties of our approach and compare it with the maximum likelihood estimator in finite samples.

In the next Section we give some background about the Tweedie distribution. In Section 3 we present the Tweedie regression models and two approaches to make inference about the model parameters: maximum likelihood and estimating functions. Section 4 shows the main results from our simulation study and Section 5 reports some final remarks.

2 Background

The Tweedie distribution belongs to the class of exponential dispersion models (EDM) (Jørgensen, 1997). Thus, for a random variable $Y$ which follows an EDM, the density function can be written as

$$f_Y(y; \mu, \phi) = a(y, \phi) \exp\{(y\theta - k(\theta))/\phi\}, \tag{1}$$

where $\mu = E(Y) = k'(\theta)$ is the mean, $\phi > 0$ is the dispersion parameter, $\theta$ is the canonical parameter, and $k(\theta)$ is the cumulant function. The function $a(y, \phi)$ cannot be written in closed form apart from the particular cases cited. The variance is given by $Var(Y) = \phi V(\mu)$, where $V(\mu) = k''(\theta)$ is called the variance function. Tweedie densities are characterized by power variance functions of the form $V(\mu) = \mu^p$, where $p \in (-\infty, 0] \cup [1, \infty)$ is the index determining the distribution. Although Tweedie densities are not known in closed form, their cumulant generating function (cgf) is simple. The cgf is given by

$$K(t) = \{k(\theta + t\phi) - k(\theta)\}/\phi,$$

where $k(\theta)$ is the cumulant function, and

$$\theta = \begin{cases} \dfrac{\mu^{1-p}}{1-p}, & p \neq 1 \\ \log \mu, & p = 1 \end{cases} \qquad k(\theta) = \begin{cases} \dfrac{\mu^{2-p}}{2-p}, & p \neq 2 \\ \log \mu, & p = 2. \end{cases}$$

The remaining factor in the density, $a(y, \phi)$, needs to be evaluated numerically. Jørgensen (1997) presents two series expressions for evaluating the density: one for $1 < p < 2$ and one for $p > 2$. In the first case it can be shown that

$$P(Y = 0) = \exp\left\{-\frac{\mu^{2-p}}{\phi(2-p)}\right\}$$

and, for $y > 0$, that

$$a(y, \phi) = \frac{1}{y} W(y, \phi, p)$$

with $W(y, \phi, p) = \sum_{j=1}^{\infty} W_j$ and

$$W_j = \frac{y^{-j\alpha} (p-1)^{\alpha j}}{\phi^{j(1-\alpha)} (2-p)^j \, j! \, \Gamma(-j\alpha)},$$

where $\alpha = (2-p)/(1-p)$.
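The series for $1 < p < 2$ can be sketched in Python as follows. This is a minimal illustration, not the algorithm of the `tweedie` package: it sums a fixed number of terms in log space, and the function name `tweedie_density_series` and the truncation argument `max_terms` are assumptions made here for illustration.

```python
import math


def tweedie_density_series(y, mu, phi, p, max_terms=200):
    """Tweedie density for 1 < p < 2 from the series W = sum_j W_j.

    Returns P(Y = 0) when y == 0. The fixed truncation at max_terms is an
    assumption of this sketch, not an adaptive rule.
    """
    if not 1.0 < p < 2.0:
        raise ValueError("this sketch covers only 1 < p < 2")
    theta = mu ** (1.0 - p) / (1.0 - p)   # canonical parameter
    kappa = mu ** (2.0 - p) / (2.0 - p)   # cumulant function k(theta)
    if y == 0.0:
        return math.exp(-kappa / phi)
    alpha = (2.0 - p) / (1.0 - p)         # negative for 1 < p < 2
    log_terms = []
    for j in range(1, max_terms + 1):
        # log W_j, term by term as in the series expression
        log_terms.append(
            -j * alpha * math.log(y)
            + alpha * j * math.log(p - 1.0)
            - j * (1.0 - alpha) * math.log(phi)
            - j * math.log(2.0 - p)
            - math.lgamma(j + 1.0)
            - math.lgamma(-j * alpha)
        )
    # log-sum-exp to avoid overflow when adding the terms
    top = max(log_terms)
    log_w = top + math.log(sum(math.exp(t - top) for t in log_terms))
    return math.exp(log_w - math.log(y) + (y * theta - kappa) / phi)
```

A simple sanity check is that the point mass at zero plus the integral of the density over $y > 0$ is close to one, and that the mean is close to $\mu$; both can be verified with a midpoint rule on a fine grid.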

A similar series expansion exists for $p > 2$ and is given by

$$a(y, \phi) = \frac{1}{\pi y} V(y, \phi, p)$$

with $V = \sum_{k=1}^{\infty} V_k$ and

$$V_k = \frac{\Gamma(1 + \alpha k) \, \phi^{k(\alpha - 1)} (p-1)^{\alpha k}}{\Gamma(1 + k) (p-2)^k \, y^{\alpha k}} (-1)^k \sin(-k\pi\alpha).$$

Dunn and Smyth (2001) present a detailed study of these series and an algorithm to evaluate the Tweedie density function based on these series expansions. The algorithm is implemented in the package tweedie (Dunn, 2013) for the statistical software R (R Core Team, 2014) through the function dtweedie.series. Dunn and Smyth (2005) and Dunn and Smyth (2008) studied two more methods to evaluate the density function of the Tweedie distributions, one based on Fourier inversion of the cumulant generating function and the other on the saddlepoint approximation; for more details see Dunn (2013). In this paper we use only the series approach described above.

3 Tweedie regression models

The Tweedie regression models were presented by Jørgensen and Paes De Souza (1994), Dunn and Smyth (2005), Hasan and Dunn (2011) and others. Consider independent responses $Y_1, Y_2, \ldots, Y_n$ such that $Y_i \sim Tw_p(\mu_i, \phi)$, where the mean $\mu_i$ is linked to the linear predictor through a known link function $g$, $g(\mu_i) = x_i^T \beta$, where $x_i$ is a vector of covariates and $\beta$ is a vector of unknown regression parameters. Let $q$ be the dimension of $\beta$. Equivalently, we can define the model using matrix notation. Let $Y$ be the vector of response variables; then the Tweedie regression model can be defined by

$$Y \sim Tw_p(\mu, \phi I), \tag{2}$$

where $I$ is an $n \times n$ identity matrix. In this case it is easy to see that $E(Y) = \mu = g^{-1}(X\beta)$ and $Var(Y) = C = diag(\phi\mu^p)$. In this paper we take the link function $g$ to be the logarithm. Note that the model is equivalently defined by its joint distribution given in (2) or by its first two moments (mean and variance). Denote the vector of parameters by $\theta = (\beta, \lambda = (\phi, p))$.
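As an illustration of the two moments that define the model, the sketch below computes $\mu = \exp(X\beta)$ and the diagonal of $C = diag(\phi\mu^p)$ under the log link. The function name `tweedie_moments` and the small design matrix are hypothetical and chosen only for this example.

```python
import math


def tweedie_moments(X, beta, phi, p):
    """Mean vector and diagonal of C = diag(phi * mu**p) under a log link."""
    eta = [sum(x_ij * b_j for x_ij, b_j in zip(row, beta)) for row in X]
    mu = [math.exp(e) for e in eta]       # inverse of the log link
    c_diag = [phi * m ** p for m in mu]   # variances of the responses
    return mu, c_diag


# Intercept plus one covariate taking the values 0, 1 and 2.
X = [[1.0, 0.0], [1.0, 1.0], [1.0, 2.0]]
mu, c_diag = tweedie_moments(X, beta=[0.5, 1.0], phi=0.5, p=2.0)
```

Because the responses are independent, $C$ is diagonal and only its diagonal needs to be stored.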
The parameter vector can be divided into two sets: the first contains the regression parameters and the second the parameters that describe the variance structure. In this paper we are interested in making inference about the second set. The orthodox method is based on the likelihood function and is described in the next section.

3.1 Maximum likelihood

Let $f_Y(y; \beta, \lambda)$ denote the Tweedie probability function or probability density function as given in equation (1) and evaluated as described in Section 2. Then the log-likelihood function for a sample of size $n$ is given by

$$l(\beta, \lambda) = \sum_{i=1}^{n} \log f_Y(y_i; \beta, \lambda). \tag{3}$$

Maximizing equation (3) with respect to $\beta$ and $\lambda$ we obtain the maximum likelihood estimators, denoted by $\hat{\beta}_M$ and $\hat{\lambda}_M$. The maximization can be done in different ways: Dunn and Smyth (2005) proposed a method based on the BFGS algorithm and Jørgensen and Paes De Souza (1994) proposed a different scheme based on the profile likelihood. In this paper we propose to use the Nelder-Mead (Nelder and Mead, 1965) method as implemented in the function optim of the R statistical software (R Core Team, 2014). In our simulation studies the Nelder-Mead method showed stable and efficient results.

To make inference about $\hat{\theta} = (\hat{\beta}, \hat{\lambda})^T$ we use the well-known asymptotic distribution of the maximum likelihood estimator,

$$\hat{\theta} \sim N(\theta, I_o(\hat{\theta})^{-1}),$$

where $I_o(\theta)$ denotes the observed information of $\theta$. Note that in the Tweedie regression models we cannot compute the Fisher information, because the second derivatives of the log-likelihood function are not available in closed form. Therefore, we use the observed information matrix computed numerically using the Richardson method (Soetaert and Herman, 2009) at the point $\hat{\theta}$. In short, our algorithm obtains the maximum likelihood estimates using the Nelder-Mead algorithm and computes the asymptotic variance of $\hat{\theta}$ from the inverse of the negative Hessian matrix, computed numerically by the Richardson method. Note that this approach is computationally expensive, because we need to evaluate the probability function or probability density function of the Tweedie distribution many times during the maximization. In the next section we present a new way to make inference about $\beta$ and $\lambda$ based on estimating functions.

3.2 Estimating functions

In this Section we describe the estimating function approach to estimate $\theta = (\beta, \lambda)$. We adopt the quasi-score function for the regression parameters and the Pearson estimating function for the dispersion parameters. Jørgensen and Knudsen (2004) describe the estimating function approach, as well as its properties.
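The quasi-score function is defined by

$$\psi_\beta(\beta, \lambda) = D^T C^{-1} (Y - \mu), \tag{4}$$

where $D = \nabla_\beta \mu$. The $q \times q$ matrix

$$S_\beta = E(\nabla_\beta \psi_\beta) = -D^T C^{-1} D \tag{5}$$

is called the sensitivity matrix of $\psi_\beta$, and the $q \times q$ matrix

$$V_\beta = Var(\psi_\beta) = D^T C^{-1} D \tag{6}$$

is called the variability matrix of $\psi_\beta$. In a similar way, the Pearson estimating function is defined by

$$\psi_{\lambda_i}(\beta, \lambda) = r^T W_{\lambda_i} r - tr(W_{\lambda_i} C), \tag{7}$$

where $W_{\lambda_i} = C^{-1} \frac{\partial C}{\partial \lambda_i} C^{-1}$ and $r = (Y - \mu)$. Since $E(r^T W_{\lambda_i} r) = tr(W_{\lambda_i} C)$, this estimating function is unbiased. Note that all we need to evaluate these equations are the derivatives of $C$ with respect to $\lambda_1 = \phi$ and $\lambda_2 = p$. It is easy to show that

$$\frac{\partial C}{\partial \phi} = diag(\mu^p) \quad \text{and} \quad \frac{\partial C}{\partial p} = diag(\phi \log(\mu) \mu^p). \tag{8}$$

The entries $(i, j)$ of the $2 \times 2$ sensitivity matrix of $\psi_\lambda$ are given by

$$S_{\lambda_{ij}} = E\left(\frac{\partial}{\partial \lambda_i} \psi_{\lambda_j}\right) = -tr\left(C^{-1} \frac{\partial C}{\partial \lambda_i} C^{-1} \frac{\partial C}{\partial \lambda_j}\right). \tag{9}$$

Using results about the characteristic function of linear and quadratic forms of non-normal variables (Knight, 1985), we can show that the entries of the variability matrix of $\psi_\lambda$ are given by

$$V_{\lambda_{ij}} = Cov(\psi_{\lambda_i}, \psi_{\lambda_j}) = 2\,tr(W_{\lambda_i} C W_{\lambda_j} C) + \sum_{l=1}^{n} k_l^{(4)} (W_{\lambda_i})_{ll} (W_{\lambda_j})_{ll}, \tag{10}$$

where $k_l^{(4)}$ denotes the fourth cumulant of $Y_l$. To take into account the covariance between the vectors $\beta$ and $\lambda$, we need to compute the cross sensitivity and variability matrices. The entries of the cross sensitivity matrix between $\beta$ and $\lambda$ are given by

$$S_{\beta_i \lambda_j} = E\left(\frac{\partial}{\partial \lambda_j} \psi_{\beta_i}\right) = 0. \tag{11}$$

In a similar way, the entries of the cross sensitivity matrix between $\lambda$ and $\beta$ are given by

$$S_{\lambda_i \beta_j} = E\left(\frac{\partial}{\partial \beta_j} \psi_{\lambda_i}\right) = -tr\left(C^{-1} \frac{\partial C}{\partial \lambda_i} C^{-1} \frac{\partial C}{\partial \beta_j}\right). \tag{12}$$

Finally, we can show that the entries of the cross variability matrix between $\beta$ and $\lambda$ are given by

$$V_{\lambda_i \beta_j} = E\left(\sum_{l=1}^{n} \sum_{j=1}^{n} \sum_{k=1}^{n} A_{(j)} W^{(ij)}_{\lambda_i} r_l r_j r_k\right), \tag{13}$$

where $A = D^T C^{-1}$ and $A_{(j)}$ denotes the $j$th column of $A$. In a similar way, $W^{(ij)}_{\lambda_i}$ denotes the $(i, j)$th entry of the matrix $W_{\lambda_i}$. Furthermore, the joint sensitivity matrix of $\psi_\beta$ and $\psi_\lambda$ is given by

$$S_\theta = \begin{pmatrix} S_\beta & S_{\beta\lambda} \\ S_{\lambda\beta} & S_\lambda \end{pmatrix}, \tag{14}$$

whose entries are defined in equations (5), (9), (11) and (12). Likewise, the joint variability matrix of $\psi_\beta$ and $\psi_\lambda$ is given by

$$V_\theta = \begin{pmatrix} V_\beta & V_{\beta\lambda} \\ V_{\lambda\beta} & V_\lambda \end{pmatrix}, \tag{15}$$

whose entries are defined in equations (6), (10) and (13). Denote by $\hat{\theta}_e = (\hat{\beta}_e, \hat{\lambda}_e)$ the estimate of $\theta$; then the asymptotic distribution of $\hat{\theta}_e$ is

$$\hat{\theta}_e \sim N(\theta, J_\theta^{-1}), \tag{16}$$

where $J_\theta^{-1}$ is the inverse of the Godambe information matrix,

$$J_\theta^{-1} = S_\theta^{-1} V_\theta S_\theta^{-T}, \tag{17}$$

where $S_\theta^{-T} = (S_\theta^{-1})^T$.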
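Because $C$ is diagonal in the Tweedie regression model, the Pearson estimating functions and their sensitivity matrix reduce to sums over the observations. The following worked specialization is a sketch derived from equations (7), (8) and (9), reading the trace term in (7) as $tr(W_{\lambda_i} C)$; the subscript $l$ indexing observations is introduced here only for illustration.

```latex
\begin{align*}
W_{\phi} &= \mathrm{diag}\!\left(\frac{1}{\phi^{2}\mu_l^{p}}\right), &
W_{p} &= \mathrm{diag}\!\left(\frac{\log \mu_l}{\phi\,\mu_l^{p}}\right), \\
\psi_{\phi} &= \sum_{l=1}^{n}\left(\frac{r_l^{2}}{\phi^{2}\mu_l^{p}} - \frac{1}{\phi}\right), &
\psi_{p} &= \sum_{l=1}^{n}\log \mu_l\left(\frac{r_l^{2}}{\phi\,\mu_l^{p}} - 1\right), \\
S_{\lambda} &= -\begin{pmatrix}
\dfrac{n}{\phi^{2}} & \dfrac{1}{\phi}\sum_{l}\log \mu_l \\[2ex]
\dfrac{1}{\phi}\sum_{l}\log \mu_l & \sum_{l}(\log \mu_l)^{2}
\end{pmatrix}.
\end{align*}
```

Both estimating functions vanish when each squared residual $r_l^2$ equals its model variance $\phi\mu_l^p$, which shows that they match the empirical second moments to the assumed variance function.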

Jørgensen and Knudsen (2004) proposed the chaser algorithm to solve the system of equations $\psi_\beta = 0$ and $\psi_\lambda = 0$:

$$\beta^{(i+1)} = \beta^{(i)} - S_\beta^{-1} \psi_\beta(\beta^{(i)}, \lambda^{(i)})$$
$$\lambda^{(i+1)} = \lambda^{(i)} - S_\lambda^{-1} \psi_\lambda(\beta^{(i+1)}, \lambda^{(i)})$$

The chaser algorithm uses the insensitivity property, which allows us to use two separate equations to update $\beta$ and $\lambda$; for details see Jørgensen and Knudsen (2004). The described procedure was implemented in R and a generic function called glm.tw() is made available on the supplement material web page.

To compute the variance of the dispersion parameters we need information about the third central moment and the fourth cumulant. In the case of the Tweedie regression models we can compute these quantities based on the equations presented in Section 2. An alternative approach is to compute their empirical versions, which avoids assuming a multivariate Tweedie distribution for the data. The empirical fourth cumulant may be computed from the data by the following equation:

$$k_l^{(4)} = (y_l - \hat{y}_l)^4 - 3(\hat{\phi}\hat{\mu}_l^{\hat{p}})^2.$$

The empirical third central moment may be computed based on equation (13), ignoring the expectation. The main drawback of using empirical cumulants instead of the theoretical cumulants is that the variance tends to be overestimated, so confidence intervals based on this approach tend to be somewhat wider than they should be.

4 Simulation study

4.1 Design of the study

We carried out a simulation study to evaluate the properties of the estimator based on the estimating function approach and to compare them with those of the maximum likelihood estimator in finite samples. Our focus here is on the parameters that describe the variance structure of Tweedie regression models ($\phi$, $p$). We use five different sample sizes ($n$ = 50, 100, 250, 500 and 1000), and compare two measures of estimator quality (bias and coverage rate). In this manner, we have one quality measure based on point estimates and another based on confidence intervals.
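The chaser iteration of Section 3.2 can be sketched in Python for a log-link model with an intercept and one covariate. This is a minimal sketch under stated assumptions, not the paper's glm.tw() function: the names `solve2` and `chaser_fit` are hypothetical, the number of sweeps is fixed instead of using a convergence criterion, and the data set is an artificial one built so that the chosen parameter values solve both estimating equations exactly (each covariate value appears twice, with residuals of plus and minus one standard deviation).

```python
import math


def solve2(a11, a12, a22, b1, b2):
    """Solve [[a11, a12], [a12, a22]] z = (b1, b2) for a symmetric 2x2 matrix."""
    det = a11 * a22 - a12 * a12
    return (a22 * b1 - a12 * b2) / det, (a11 * b2 - a12 * b1) / det


def chaser_fit(x, y, beta, phi, p, n_iter=50):
    """Alternate the beta and lambda = (phi, p) updates for a fixed number of sweeps."""
    b0, b1 = beta
    n = len(y)
    for _ in range(n_iter):
        # beta step: beta - S_beta^{-1} psi_beta, with -S_beta = D' C^{-1} D.
        mu = [math.exp(b0 + b1 * xi) for xi in x]
        w = [m ** (2.0 - p) / phi for m in mu]  # weights of D' C^{-1} D
        s = [m ** (1.0 - p) * (yi - m) / phi for yi, m in zip(y, mu)]  # score terms
        d0, d1 = solve2(
            sum(w),
            sum(wi * xi for wi, xi in zip(w, x)),
            sum(wi * xi * xi for wi, xi in zip(w, x)),
            sum(s),
            sum(si * xi for si, xi in zip(s, x)),
        )
        b0, b1 = b0 + d0, b1 + d1
        # lambda step at the updated beta: lambda - S_lambda^{-1} psi_lambda,
        # written in terms of the relative change in phi and the change in p.
        u = [b0 + b1 * xi for xi in x]          # log(mu) under the log link
        e = [(yi - math.exp(ui)) ** 2 / (phi * math.exp(ui) ** p) - 1.0
             for yi, ui in zip(y, u)]           # scaled squared residual minus 1
        r0, r1 = solve2(
            float(n),
            sum(u),
            sum(ui * ui for ui in u),
            sum(e),                             # equals phi * psi_phi
            sum(ui * ei for ui, ei in zip(u, e)),  # equals psi_p
        )
        phi, p = phi * (1.0 + r0), p + r1
    return (b0, b1), phi, p


# Artificial data: pairs y = mu +/- sqrt(phi * mu**p) at each covariate value.
true_beta, true_phi, true_p = (0.5, 1.0), 0.5, 1.5
x, y = [], []
for i in range(25):
    xi = 2.0 * i / 24
    m = math.exp(true_beta[0] + true_beta[1] * xi)
    sd = math.sqrt(true_phi * m ** true_p)
    x += [xi, xi]
    y += [m + sd, m - sd]

beta_hat, phi_hat, p_hat = chaser_fit(x, y, beta=(0.4, 1.1), phi=0.8, p=1.2)
```

On this constructed data set the iteration should recover the values used to build it. With real data the starting values matter more, and a practical implementation would add a stopping rule and guard against $\phi$ becoming non-positive during the updates.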
To decide which values of $p$ and $\phi$ to consider in the simulation study, we first plotted the likelihood contours for $\phi = 0.5$ and $p = 1.1$ (near the Poisson distribution), $p = 2$ (the Gamma distribution) and $p = 3$ (the inverse Gaussian distribution). Figure 1 shows these plots.

The plots presented in Figure 1 show that the likelihood contours are close to a quadratic function for $p = 2$ and $p = 3$, indicating that for these values of $p$ the asymptotic distribution is well-behaved and near the multivariate Gaussian distribution. However, for $p = 1.1$ the likelihood behavior is non-quadratic, which shows that small values of $p$ represent a more challenging setup for inference. Thus, we chose the values $p$ = 1.1, 1.3, 1.5, 1.7 and 1.9 for the simulation study. The parameter $\phi$ measures the variability, so bigger values of $\phi$ represent a more challenging setup for inference. We chose the values $\phi$ = 0.5, 1, 1.5, 2 and 2.5. Combining five sample sizes, five values of $p$ and five values of $\phi$ we have 125 different scenarios for our simulation study. All simulations were done using the R software and the package tweedie (Dunn, 2013).
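The design above can be sketched in Python as a grid of scenarios plus a sampler. The sampler below is a stand-in for the R package used in the paper: it relies on the compound Poisson representation available for $1 < p < 2$ (a Poisson number of gamma-distributed jumps), and the names `rpoisson` and `rtweedie_cp` are hypothetical.

```python
import math
import random
from itertools import product

# The 5 x 5 x 5 = 125 scenarios: (sample size, p, phi).
scenarios = list(product((50, 100, 250, 500, 1000),
                         (1.1, 1.3, 1.5, 1.7, 1.9),
                         (0.5, 1.0, 1.5, 2.0, 2.5)))


def rpoisson(lam, rng):
    """Poisson draw by Knuth's multiplication method (adequate for moderate lam)."""
    limit = math.exp(-lam)
    k, prod = 0, rng.random()
    while prod > limit:
        k += 1
        prod *= rng.random()
    return k


def rtweedie_cp(mu, phi, p, rng):
    """One Tweedie draw for 1 < p < 2 as a Poisson sum of gamma variables."""
    lam = mu ** (2.0 - p) / (phi * (2.0 - p))   # Poisson rate
    shape = (2.0 - p) / (p - 1.0)               # gamma shape of each jump
    scale = phi * (p - 1.0) * mu ** (p - 1.0)   # gamma scale of each jump
    jumps = rpoisson(lam, rng)
    return sum((rng.gammavariate(shape, scale) for _ in range(jumps)), 0.0)


rng = random.Random(2014)  # arbitrary seed, for reproducibility only
draws = [rtweedie_cp(2.0, 1.0, 1.5, rng) for _ in range(20000)]
```

With these choices the draws have mean $\mu$ and variance $\phi\mu^p$, and exact zeros occur with positive probability, which is the characteristic feature of the compound Poisson case.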

[Figure 1: Likelihood contours for different values of $p$ and sample sizes. Panels are titled by sample size ($n$ = 50, 100, 250, 500 and 1000) and by the value of $p$; the first row corresponds to $p = 1.1$.]

4.2 Results

We performed simulations to compare the performance of estimators based on Pearson estimating functions with that of maximum likelihood estimators. We used two measures of estimator quality: the bias $b = (\theta - \hat{\theta})$ and the coverage rate. Our simulations consist of 1000 realizations from the Tweedie regression model (Section 3). We used a regression structure with $\beta_0 = 0.5$ and $\beta_1 = 1$; our model has one covariate, which was generated as a sequence from 0 to 2 with length depending on the sample size. We used five sample sizes ($n$ = 50, 100, 250, 500 and 1000) and different combinations of the parameters $p$ and $\phi$, see Section 4.1.

We chose to present the results graphically. Figure 2 presents the expected bias of $\hat{\phi}$ for different sample sizes, values of $p$ and $\phi$ and estimation methods, PEF (Pearson Estimating Function) and MLE (Maximum Likelihood Estimator). Figure 2 shows that in general the PEF estimator overestimates $\phi$ and the MLE estimator underestimates $\phi$ for small sample sizes, but the bias decreases when the sample size increases, as required. The bias of the PEF estimator increases when the value of [...] increases. The bias of the MLE estimator is similar for all values of [...]. In general the values of [...] do not affect the bias of $\hat{\phi}$. The bias of the MLE estimator is smaller than that of the PEF estimator, but for sample sizes around 100 the bias of the PEF estimator is small enough to be useful in practical situations.

In a similar way, Figure 3 presents the expected bias of $\hat{p}$ for different sample sizes, values of $p$ and $\phi$ and estimation methods. The results presented in Figure 3 show that the bias of the PEF estimator is small for all sample sizes and parameter combinations. In fact, the parameter $p$ is well estimated using the PEF estimator for any configuration.
The MLE estimator is less accurate when estimating small values of $p$ with a small sample size; in this case the MLE estimator overestimates $p$, but again the bias decreases fast when the sample size increases, and for sample sizes around 100 the bias is small enough for practical applications. In general the results indicate that both methods give good point estimates for the parameters $\phi$ and $p$.

[Figure 2: Expected bias of $\hat{\phi}$ for different methods, sample sizes and parameter combinations. Panels: PEF and MLE by value of $\phi$; legend: $p$ = 1.1, 1.3, 1.5, 1.7, 1.9 and the true value; horizontal axis: sample size.]

[Figure 3: Expected bias of $\hat{p}$ for different methods, sample sizes and parameter combinations. Panels: PEF and MLE by value of $p$ ($p$ = 1.1, 1.3, 1.5, 1.7, 1.9); legend: $\phi$ = 0.5, 1, 1.5, 2, 2.5; horizontal axis: sample size.]

So far we have evaluated the quality of the point estimates. Now we need to evaluate the quality of the confidence intervals. For this task, we computed the coverage rate of the confidence intervals for $\phi$ and $p$ for different sample sizes, combinations of $\phi$ and $p$ and estimation methods. The coverage rate of the confidence interval for $\phi$ is shown in Figure 4.

[Figure 4: Coverage rate for $\phi$ for different methods, sample sizes and parameter combinations. Panels: PEF and MLE by value of $\phi$; legend: $p$ = 1.1, 1.3, 1.5, 1.7, 1.9; horizontal axis: sample size.]

The results presented in Figure 4 show that for both methods the coverage rate is lower than the nominal level for small sample sizes. The confidence intervals based on the PEF approach achieve the nominal level for sample sizes around 250 in all configurations, although for big values of [...] the coverage rate is slightly higher than the nominal level. The confidence interval based on the MLE approach is not realistic for small sample sizes and small values of $\phi$ and $p$. For example, for $\phi = 0.5$ and $p = 1.1$ the confidence interval based on the MLE approach does not achieve the nominal level even with sample size equal to [...]. For bigger values of [...] the situation is better, and for sample sizes around 250 the confidence intervals show coverage rates near the nominal level. In general the results demonstrate that confidence intervals based on the PEF approach do not depend on the combination of $\phi$ and $p$ values; for all configurations the results show that for sample sizes around 250 the confidence intervals are well estimated. On the other hand, the MLE approach has difficulty estimating confidence intervals for small values of $\phi$ and $p$. A similar analysis is presented in Figure 5 for the parameter $p$.
Figure 5 shows that the coverage rates of confidence intervals based on the PEF approach are near the nominal level for all configurations considered, but again for big values of [...] the coverage rate is slightly higher than the nominal level. These results indicate that in general this approach yields confidence intervals that are wider than they should be. These results were expected, because we are using empirical third and fourth moments to compute the variance of $\hat{\phi}$ and $\hat{p}$. We argue that for practical data analysis these confidence intervals are accurate enough. The confidence intervals based on the MLE approach are not realistic for small values of $\phi$ and $p$; for example, for $p = 1.1$ the confidence interval based on the MLE approach does not achieve the nominal level even with sample size equal to [...]. When the values of $\phi$ and $p$ increase the results improve and are near the nominal level for all sample sizes.
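The two quality measures used in this section can be computed from replicated estimates as sketched below. The bias follows the convention $b = \theta - \hat{\theta}$ stated in Section 4.2, averaged over replicates. The coverage rate is based on Wald intervals, and the 95% level is an assumption of this sketch, since the nominal level is not stated in the text; the function names and the four illustrative replicates are hypothetical and are not results from the study.

```python
from statistics import NormalDist, mean


def bias(true_value, estimates):
    """Average bias in the convention b = theta - theta_hat."""
    return true_value - mean(estimates)


def coverage_rate(true_value, estimates, std_errors, level=0.95):
    """Share of Wald intervals estimate +/- z * se that contain the true value."""
    z = NormalDist().inv_cdf(0.5 + level / 2.0)
    hits = sum(1 for est, se in zip(estimates, std_errors)
               if est - z * se <= true_value <= est + z * se)
    return hits / len(estimates)


# Made-up replicates for illustration only.
estimates = [1.45, 1.52, 1.60, 1.38]
std_errors = [0.05, 0.05, 0.04, 0.05]
b = bias(1.5, estimates)
cover = coverage_rate(1.5, estimates, std_errors)
```

In the study itself each scenario would supply 1000 estimates and standard errors, the latter taken from the inverse observed information (MLE) or the inverse Godambe information (PEF).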

[Figure 5: Coverage rate for $p$ for different methods, sample sizes and parameter combinations. Panels: PEF and MLE by value of $p$; legend: $\phi$ = 0.5, 1, 1.5, 2, 2.5; horizontal axis: sample size.]

In general, both methods are able to produce confidence intervals with a coverage rate near the nominal level. Of course, the results improve when the sample size increases, which is expected because our inferential methods are based on asymptotic results.

5 Conclusion

In this paper, we presented a new approach to make inference about the parameters of Tweedie regression models. Our approach is based on the quasi-score function for the regression parameters and the Pearson estimating function for the dispersion parameters. It is a well-known result that the quasi-score function yields the same estimator as the maximum likelihood approach for the regression parameters. Thus, we focused on the dispersion parameters, that is, the parameters that describe the variance structure of the Tweedie regression models. We performed a simulation study to evaluate the quality of our estimator and compare it with the maximum likelihood approach. The results show that both methods are similar, but the results based on the Pearson estimating function are robust in the sense that the PEF approach shows good results for all parameter combinations considered in the simulation study. On the other hand, the MLE approach had difficulty estimating small values of $\phi$ and $p$.

Furthermore, the estimating function approach has many advantages. First, we do not need to evaluate the density function, which is a hard computational task. Second, we do not need to worry about negative values of $p$ or values near 1, since our approach deals with these situations naturally.
Moreover, we can estimate values of $p$ between 0 and 1, because our approach is based on second-moment assumptions. In this way we do not need to assume that the response variable follows the Tweedie distribution, which makes our approach robust to misspecification.

A suggestion for future work with the estimating function approach and Tweedie regression models is to extend the Tweedie models to non-independent data, for example in longitudinal data analysis or repeated measures experiments. Tweedie models may be good models for rainfall data, and in this case it is important to be able to analyze data with spatial and space-time structures. Extending Tweedie models to deal with dependent data is therefore a promising direction, and the use of estimating functions makes it possible to do so in an elegant way.

References

Dunn, P. K. (2013). tweedie: Tweedie exponential family models. R package.
Dunn, P. K. and Smyth, G. K. (2001). Tweedie family densities: methods of evaluation, Proceedings of the 16th International Workshop on Statistical Modelling, Odense, Denmark.
Dunn, P. and Smyth, G. (2005). Series evaluation of Tweedie exponential dispersion model densities, Statistics and Computing 15(4).
Dunn, P. and Smyth, G. (2008). Evaluation of Tweedie exponential dispersion model densities by Fourier inversion, Statistics and Computing 18(1).
Hasan, M. M. and Dunn, P. K. (2011). Two Tweedie distributions that are near-optimal for modelling monthly rainfall in Australia, International Journal of Climatology 31(9).
Jørgensen, B. (1997). The Theory of Dispersion Models, Chapman & Hall.
Jørgensen, B. and Knudsen, S. J. (2004). Parameter orthogonality and bias adjustment for estimating functions, Scandinavian Journal of Statistics 31(1).
Jørgensen, B. and Paes De Souza, M. C. (1994). Fitting Tweedie's compound Poisson model to insurance claims data, Scandinavian Actuarial Journal 1994(1).
Knight, J. L. (1985). The joint characteristic function of linear and quadratic forms of non-normal variables, Sankhyā: The Indian Journal of Statistics, Series A 47(2).
Nelder, J. A. and Mead, R. (1965). A simplex method for function minimization, The Computer Journal 7(4).
Nelder, J. A. and Wedderburn, R. W. M. (1972). Generalized linear models, Journal of the Royal Statistical Society. Series A 135(3).
R Core Team (2014). R: A Language and Environment for Statistical Computing, R Foundation for Statistical Computing, Vienna, Austria.
Soetaert, K. and Herman, P. M. (2009). A Practical Guide to Ecological Modelling. Using R as a Simulation Platform, Springer.
Tweedie, M. C. K. (1984). An index which distinguishes between some important exponential families, in J. K. Ghosh and J. Roy (eds), Statistics: Applications and New Directions, Proceedings of the Indian Statistical Institute Golden Jubilee International Conference, Calcutta: Indian Statistical Institute.


More information

arxiv: v2 [stat.me] 3 Nov 2014

arxiv: v2 [stat.me] 3 Nov 2014 onarametric Stein-tye Shrinkage Covariance Matrix Estimators in High-Dimensional Settings Anestis Touloumis Cancer Research UK Cambridge Institute University of Cambridge Cambridge CB2 0RE, U.K. Anestis.Touloumis@cruk.cam.ac.uk

More information

Research of power plant parameter based on the Principal Component Analysis method

Research of power plant parameter based on the Principal Component Analysis method Research of ower lant arameter based on the Princial Comonent Analysis method Yang Yang *a, Di Zhang b a b School of Engineering, Bohai University, Liaoning Jinzhou, 3; Liaoning Datang international Jinzhou

More information

Combining Logistic Regression with Kriging for Mapping the Risk of Occurrence of Unexploded Ordnance (UXO)

Combining Logistic Regression with Kriging for Mapping the Risk of Occurrence of Unexploded Ordnance (UXO) Combining Logistic Regression with Kriging for Maing the Risk of Occurrence of Unexloded Ordnance (UXO) H. Saito (), P. Goovaerts (), S. A. McKenna (2) Environmental and Water Resources Engineering, Deartment

More information

LOGISTIC REGRESSION. VINAYANAND KANDALA M.Sc. (Agricultural Statistics), Roll No I.A.S.R.I, Library Avenue, New Delhi

LOGISTIC REGRESSION. VINAYANAND KANDALA M.Sc. (Agricultural Statistics), Roll No I.A.S.R.I, Library Avenue, New Delhi LOGISTIC REGRESSION VINAANAND KANDALA M.Sc. (Agricultural Statistics), Roll No. 444 I.A.S.R.I, Library Avenue, New Delhi- Chairerson: Dr. Ranjana Agarwal Abstract: Logistic regression is widely used when

More information

Using the Divergence Information Criterion for the Determination of the Order of an Autoregressive Process

Using the Divergence Information Criterion for the Determination of the Order of an Autoregressive Process Using the Divergence Information Criterion for the Determination of the Order of an Autoregressive Process P. Mantalos a1, K. Mattheou b, A. Karagrigoriou b a.deartment of Statistics University of Lund

More information

Estimation of Separable Representations in Psychophysical Experiments

Estimation of Separable Representations in Psychophysical Experiments Estimation of Searable Reresentations in Psychohysical Exeriments Michele Bernasconi (mbernasconi@eco.uninsubria.it) Christine Choirat (cchoirat@eco.uninsubria.it) Raffaello Seri (rseri@eco.uninsubria.it)

More information

General Linear Model Introduction, Classes of Linear models and Estimation

General Linear Model Introduction, Classes of Linear models and Estimation Stat 740 General Linear Model Introduction, Classes of Linear models and Estimation An aim of scientific enquiry: To describe or to discover relationshis among events (variables) in the controlled (laboratory)

More information

ESTIMATION OF THE RECIPROCAL OF THE MEAN OF THE INVERSE GAUSSIAN DISTRIBUTION WITH PRIOR INFORMATION

ESTIMATION OF THE RECIPROCAL OF THE MEAN OF THE INVERSE GAUSSIAN DISTRIBUTION WITH PRIOR INFORMATION STATISTICA, anno LXVIII, n., 008 ESTIMATION OF THE RECIPROCAL OF THE MEAN OF THE INVERSE GAUSSIAN DISTRIBUTION WITH PRIOR INFORMATION 1. INTRODUCTION The Inverse Gaussian distribution was first introduced

More information

Use of Transformations and the Repeated Statement in PROC GLM in SAS Ed Stanek

Use of Transformations and the Repeated Statement in PROC GLM in SAS Ed Stanek Use of Transformations and the Reeated Statement in PROC GLM in SAS Ed Stanek Introduction We describe how the Reeated Statement in PROC GLM in SAS transforms the data to rovide tests of hyotheses of interest.

More information
