Zero inflated negative binomial-generalized exponential distribution and its applications

Size: px
Start display at page:

Download "Zero inflated negative binomial-generalized exponential distribution and its applications"

Transcription

1 Songklanakarin J. Sci. Technol. 6 (4), , Jul. - Aug Original Article Zero inflated negative binomial-generalized eponential distribution and its applications Sirinapa Aryuyuen, Winai Bodhisuwan*, and Thidaporn Supapakorn Department of Statistics, Faculty of Science, Kasetsart University, Chatuchak, Bangkok, Thailand. Received December 01; Accepted 16 April 014 Abstract In this paper, we propose a new zero inflated distribution, namely, the zero inflated negative binomial-generalized eponential (ZINB-GE) distribution. The new distribution is used for count data with etra zeros and is an alternative for data analysis with over-dispersed count data. Some characteristics of the distribution are given, such as mean, variance, skewness, and kurtosis. Parameter estimation of the ZINB-GE distribution uses maimum likelihood estimation (LE) method. Simulated and observed data are employed to eamine this distribution. The results show that the LE method seems to he highefficiency for large sample sizes. oreover, the mean square error of parameter estimation is increased when the zero proportion is higher. For the real data sets, this new zero inflated distribution provides a better fit than the zero inflated Poisson and zero inflated negative binomial distributions. Keywords: overdispersion, zero inflated distribution, count data with etra zeros, random variate generation, LE 1. Introduction Poisson distribution provides a standard model for the analysis of count data with the assumption of equal mean and variance. However, in practice, count data often shows overdispersion, the variance is greater than the mean. In practice, negative binomial (NB) distribution was introduced to solve this problem, and it has become increasingly popular as a more fleible alternative to fit models. The NB distribution is a better fit for over-dispersed count data which is not necessarily hey-tailed (Wang, 011). For over-dispersed count data, some mied NB distributions offer a better fit when compared with the Poisson and NB distributions. Such findings are related to those of Gomez-Dèniz et al. (008), Zamani and Ismail (010), Wang (011), and Pudprommarat et al. (01). Recently, a negative binomial-generalized eponential (NB-GE) distribution was * Corresponding author. address: fsciwnb@ku.ac.th employed to fit count data. It was obtained by miing the NB distribution with a generalized eponential (GE) distribution. Also, the NB-GE distribution is more appropriate to fit count data with overdispersion and hey-tailed datasets than the Poisson and NB distributions (Aryuyuen and Bodhisuwan, 01). oreover, in practice, one frequent demonstration of overdispersion is a zero count incidence. Count data with etra zeros occur in many fields, such as public health, epidemiology, medicine, sociology, engineering and agriculture. Standard discrete distribution may fail to fit such data either because of zero inflation or over/underdispersion. Now, there is increased interest in a zero inflated distribution to account for etra zeros in data (Xie et al., 009). Count data with etra zeros create problems of violating basic assumptions implicit in the standard distribution. Failure to account for etra zeros may result in biased parameter estimates and misleading inference. Because the zero inflated distributions usually provide better statistical fit, some researchers, e.g., Lambert (199), Greene (1994), Hall (000), Famoye and Singh (00), Bodhisuwan (011) proposed these distributions.

2 484 S. Aryuyuen et al. / Songklanakarin J. Sci. Technol. 6 (4), , 014 As mentioned above, the mied distribution defines one of the most important ways to obtain a new probability distribution in applied probability and operational research (Gomez-Dèniz et al., 008). For the purpose, we are looking for a new zero inflated distribution which is a more fleible alternative to fit count data with ecess zeros. The rest of this article is organized as follows. We propose the new zero inflated distribution that is a zero inflated negative binomial-generalized eponential (ZINB- GE) distribution. Some characteristics, graphs of probability mass function (pmf) and a random variate generation of ZINB-GE distribution are introduced in Section. In Section the maimum likelihood estimation (LE) method is handled to estimate the parameters of ZINB-GE distribution. Numerical eamples are provided in Section 4, including simulated data which are utilized to eamine the efficiency of LE method, and real data sets that are used to evaluate the performance of the proposed distribution. Finally, some conclusions are presented in Section 5.. New zero inflated negative binomial distribution The zero inflated (ZI) distribution can be used to fit count data with etra zeros, and assumes that the observed data are the result of a two-part process; a process that generates structural zeros and a process that generates random counts. The model of ZI can be summarized as follows: P( X ) ( ) (1 ) f ( ; ), (1) 0 where X is the count variable, is the etra proportion of zeros, f ( ; ) is the pmf of X with the parameter, and 0( ) 1 if 0; otherwise, ( ) The ZINB-GE distribution The ZINB-GE distribution is a new miture distribution, combining Bernoulli and NB-GE distributions. First, we introduce a definition and some characteristics of NB-GE distribution as follows. Definition 1 Let Y be a random variable of the NB-GE distribution. Random variable Y has the NB distribution with the parameters r and p ep( ), where is distributed as the GE distribution with the positive parameters and, i.e., y ~ NB( r, p ep( )) and ~ GE(, ). The pmf of Y is given by y r y 1 y f ( y) ( 1) ( r ), y 0,1,,, () y 0 Γ( 1)Γ 1 u / for,, 0 ( u r ) Γ u / 1 where Γ( ) is a gamma function denoted as for t 0, and () 1 Γ( ) t y t y e dy, 0 Net, the mean and variance of Y are respectively given by where is ( u) E( Y ) r( 1) and (1) Var( Y ) Y r r 1 () r r (1) 1 (1) ( u), ( 1) (1 u / ). (4) ( u / 1) Now, by using the epression (1) and Definition 1, we obtain the closed formulas for the pmf and some characteristics of ZINB-GE as follows. Definition Let X be a random variable of ZINB-GE distribution with the parameters r,, and, denoted as X~ ZINB-GE( r,,, ). The pmf of X is given by (1 ) ( r) if 0, f ( ) 1 r (1 ) ( 1) ( r ) if 1,, 0 where 0 1, r,, 0, and ( u) is defined in (). The graphs of ZINB-GE s pmf distribution with specified parameters r,, and, offered in Figure 1, shows that the distribution will be flat when parameter is increasing. However, the distribution will be flat when the parameter is decreasing. In addition, the pmf of ZINB-GE distribution can take different shapes when values of differ. Theorem 1 If X~ ZINB-GE(, r,, ) then some characteristics of X are as follows. (a) The mean and variance of X are given by E( ) 1 r r and Var( X ) X (1) where ( u) is defined in (4), and (5) X, (6) X r ( r 1) () (r 1) (1) r 1 r( (1) 1) 1. (b) The skewness and kurtosis of X are Sk( X ) r ( r 1)( r ) ( r 1) (r r 1) r 1 r 1 () () (1) (1) ( r 1) () (r 1) (1) r 1 r (1) 11 / X, (7) Ku( X ) (4) () r ( r 1)( r )( r ) ( r 1)(r 7r 6) ( r 1) (6r 1r 7) () (r 1)( r r 1) (1) r 1 4r (1) 1 1/ ( r 1)( r ) () ( r 1) () (r r 1) (1) r 1 6r (1) () (1) (1) X ( r 1) (r 1) r 1 r 1 1 /. (8)

3 S. Aryuyuen et al. / Songklanakarin J. Sci. Technol. 6 (4), , Figure 1. The pmf of ZINB-GE distribution with specified parameters The proof of Theorem 1 is in the Appendi.. Random variate generation of ZINB-GE distribution To generate a random variable X from the ZINB-GE (, r,, ), one can use the following algorithm: 1) Generate U from the uniform distribution, U(0,1). 1 log 1 U from the GE distribution, 1/ ) Set ~GE(, ). ) Generate Y from the NB( r, p ep( ) ) distribution. * 4) Generate U from the uniform distribution, U(0,1). 5) If * U, then set X Y ; otherwise, X = 0.. Parameter estimation of ZINB-GE distribution In this section, the parameter estimation of ZINB-GE distribution via LE procedure is provided. The likelihood function of ZINB-GE (, r,, ) is given by n L(, r,, ) I (1 ) I i1 ( i 0) ( r), r 1 i i i (1 ) ( 1) ( i 0) ( r ) i 0 then we can write the log-likelihood of the ZINB-GE (, r,, ) as log L(, r,, ) n I log (1 ) i1 ( i 0) ( r ) I log(1 ) log Γ( r ) log Γ( r) log Γ( 1) ( 0) i i i i i Γ 1 ( r ) / log Γ( 1) log ( 1) 0 Γ ( r ) / 1. By differentiating the log-likelihood function of ZINB-GE distribution, partial derivatives of the log likelihood function with respect to, r, and are given by n r r I( 0) i1 r r ( 0) Γ / 1 (1 )Γ( 1)Γ 1 / 1 i i Γ / 1 Γ( 1)Γ 1 / 1 I n (1 )Γ( 1) Γ 1 r / I( i 0) r i1 (1 ) ( r ) r Γ r / 1 1 I ( r ) ( ) (,, ) ( i 0) i r r ( r,, ) r Γ ( s) where ( s), and Γ( s) i i Γ 1 ( r ) / ( r,, ) ( 1) 0 Γ ( r ) /. 1,

4 486 S. Aryuyuen et al. / Songklanakarin J. Sci. Technol. 6 (4), , 014 n (1 )Γ 1 r / Γ( 1) I( i 0) i1 (1 ) ( r ) Γ r / 1 1 I ( 1) (,, ) ( i r 0) ( r,, ), n (1 )Γ( 1) Γ 1 r / I( i 0) i1 (1 ) ( r) Γ r / 1 1 I (,, ) ( i r 0). ( r,, ) The LE solutions of the parameter estimates can be obtained by using numerical optimization with the nlm function in the R program (R Core Team, 01). The R code of parameter estimation of ZINB-GE distribution using the LE method is given in the Appendi. 4. Numerical study This section presents the efficiency of the LE method for parameter estimation of ZINB-GE distribution by using simulated data. In addition, we illustrate the application study of ZINB-GE distribution compared to the zero inflated Poisson (ZIP) and zero inflated negative binomial (ZINB) distributions. 4.1 Simulation study In illustrating the simulation study, the sample data generated from the ZINB-GE distribution with specified parameters (r = 10, = 10, = 10) and four values for proportion of zero (0., 0.4, 0.6, 0.8) for the sample sizes (n) as 50, 100 and 00, respectively. In each situation, the parameters are estimated from 500 replications. Hence the maimum likelihood estimators may be biased. The biased value is a difference value between the estimator and true parameter values. The sample erage of the estimated parameter, bias, variance, standard deviation (SD), and mean squared error (SE) are computed by the formulas: ˆ ˆ t1 t, Bias( ˆ ) ˆ, ˆ Var( ) ( ˆ ˆ ) t t 500 1, SD( ˆ ) Var( ˆ ) and SE( ˆ ) Var( ˆ ) ˆ Bias ( ). Table 1 illustrates statistic values from the results of studies. From these results, LE seems to he highefficiency when the sample size is large. The efficiency of LE seems to be poor for small sample sizes. In addition, the SE of parameter estimation is increasing when the zero proportion is higher as in Figure, which displays the graphs of SE for parameter estimates of ZINB-GE distribution. 4. Application study We used two data sets for this part of the analysis to illustrate the applications of the ZINB-GE distribution. The first data set has the number of hospital stays by United States residents aged 66 and over (see Flynn, 009), which is shown in Table. This data has 80.7% of zeros and the sample inde of dispersion is In addition, Table shows the number of units of consumers good purchased by households over 6 weeks (see Lindsey, 1995). This data set has 80.60% of zeros and the sample inde of dispersion is.7. These eamples he sample indices of dispersion bigger than 1 and a high percentage of zeros. For model selection in this study, we use the criteria of AIC (Akaike information criterion) and BIC (Bayesian information criterion), the goodness of fit test (chi-squared test: -test) is used to compare between observed and epected values of data. From the results in Table -, we found that the AIC and BIC values for ZINB-GE distribution are the smallest when compared with eisting models. Thus, the ZINB-GE distribution can be chosen as the best model. Also, based on the p-values of chi-squared test, the proposed distribution is appropriate to fit the data unlike the ZIP and ZINB distributions. These three criteria indicate that the ZINB-GE distribution is the best fit, while the ZIP and ZINB distributions are very poor fits. 5. Conclusions This work proposes the new zero inflated distribution, which is called the zero inflated negative binomial-general- Figure. The SE of parameter estimates of ZINB-GE distribution

5 S. Aryuyuen et al. / Songklanakarin J. Sci. Technol. 6 (4), , Table 1. The parameter estimates of ZINB-GE distribution with different parameter n = 50 n = 100 n = 00 Parameter Estimate SE Estimate SE Estimate SE (SD) (SD) (SD) = (0.08) (0.07) (0.05) r (7.6) (5.04) (.87) (8.06) (5.54) (4.47) (9.1) (5.98) (4.01) = (0.09) (0.07) (0.05) r (8.49) (5.55) (4.01) (9.0) (6.67) (4.85) (11.0) (6.58 ) (4.57) = (0.09) (0.06) (0.05) r (10.85) (6.75) (4.07) (1.) (7.6) (5.15) (14.87) (8.4) (4.7) = (0.08) (0.05) (0.0) r (11.86) (7.7) (5.58) (1.9) (1.74) (5.58) (16.9) (8.9) (6.1) ized eponential distribution. In particular, the closed form and some characteristics of the proposed distribution are introduced. Parameter estimation is also implemented by using LE, and the usefulness of the ZINB-GE distribution is illustrated by real and generated data. Based on the results, the proposed distribution is the best fit while the ZIP and ZINB distributions are very poor fits for count data with etra zeros. In conclusion, the ZINB-GE distribution is a fleible alternative for analysis of count data characterized by etra zeros. Also, the LE method seems to he high-efficiency for large sample sizes, and the SE of parameter estimation increases when the zero proportion is higher. Acknowledgements The authors wish to gratefully acknowledge to the referee of this paper who helped to clarify and improve the presentation. The first author would like to thank the Research Professional Development Proect under the Science Achievement Scholarship of Thailand (SAST) for funding. References Aryuyuen, S. and Bodhisuwan, W. 01. The negative binomial-generalized eponential (NB-GE) distribution. Applied athematical Sciences. 7,

6 488 S. Aryuyuen et al. / Songklanakarin J. Sci. Technol. 6 (4), , 014 Table. Observed and epected frequencies of the number of hospital stays of United States residents aged 66 and over Number of hospital stays Observed value Epected value by fitting distribution ZIP ZINB ZINB-GE Parameter estimates ˆ ˆ ˆ ˆ ˆr.968 ˆr.040 ˆp ˆ 1.01 ˆ 7.57 AIC BIC Degree of freedom p-value < < Table. Observed and epected frequencies of the number of units of consumers good purchased by households over 6 weeks Units of consumers good Households/ Observed value Epected value by fitting distribution ZIP ZINB ZINB-GE Parameter estimates ˆ ˆ ˆ ˆ.0609 ˆr ˆr ˆp 0.6 ˆ 0.45 ˆ AIC BIC Degree of freedom p-value < <

7 S. Aryuyuen et al. / Songklanakarin J. Sci. Technol. 6 (4), , Bodhisuwan, W Zero inflated Waring distribution and its application. Procedings of the 7th Congress on Science and Technoloby of Thailand, Thailand. Famoye, F. and Singh, K.P. 00. On inated generalized Poisson regression models. Advances and Applications in Statistics., Flynn, ore fleible GLs zero-inflated models and hybrid models. Casualty Actuarial Society E- Forum, Winter, U.S.A., pp Gomez-Dèniz E., Sarabia, J.. and Calderín-Oeda, E Univariate and multivariate versions of the negative binomial-inverse Gaussian distributions with applications. Insurance athematics and Economics, 4, Greene, W.H Accounting for ecess zeros and sample selection in Poisson and negative binomial regression models. Working paper No , Department of Econometrics, Stern School of Business, New York University, New York, U.S.A. Hall, D.B Zero-inated Poisson and binomial regression with random effects: a case study. Biometrics. 56, Lambert, D Zero-inflated Poisson regression, with an application to defects in manufacturing. Technometrics. 4, Lindsey, J. K odelling Frequency and Count Data. Oford science publications, Clarendon Press, UK., p Pudprommarat, C., Bodhisuwan, W. and Zeephongsekul, P. 01. A new mied negative binomial distribution. Journal of Applied Sciences. 17, R Core Team, 01. R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. Wang, Z One mied negative binomial distribution with application. Statistical Planning and Inference. 141, Xie, F., Wei B. and Lin, J Score tests for zero-inflated generalized Poisson mied regression models. Computational Statistics and Data Analysis. 5, Zamani, H. and Ismail, N Negative binomial-lindley distribution and its application. Journal of athematics and Statistics. 1, 4-9.

8 490 S. Aryuyuen et al. / Songklanakarin J. Sci. Technol. 6 (4), , 014 Proof of Theorem 1: Appendi From Definition 1, let Y~ NB-GE( r,, ) be a random variable of the NB-GE distribution and some epected values of Y are E( Y ) ( r r) ( r r) r, () (1) E( Y ) ( r r r) (r 6r r) (r r r) r, () () (1) E( Y ) ( r 6r 11r 6 r) (4r 18r 6r 1 r) (4) () (6r 18r 19r 7 r) (4r 6r 4 r r) r If X~ ZINB-GE(, r,, ), then the epected value of X is given by E( X ) f ( ) 1 (0) (1 r ) ( ) (1 r ) ( 1) ( r ) y r y 1 y From E( Y ) y f ( y) y ( 1) ( r ) r( (1) 1) y0 y1 y 0., we he E( X ) (1 )E( Y ) (1 ) r (1) 1. Consequently, we obtained the epected values of X as follows E( X ) E( X ) 4 E( X ) r 1 (1 ) ( 1) ( r ) 1 0 Y r (1 ) ( 1) r 1 ( r ) 4 (1 ) ( 1) ( r ) 1 0 From Var( ) E( X ) E( X ), the variance of X is X (1 )E( ), (1 )E( Y ), 4 (1 )E( Y ). Var( ) r ( r 1) () (r 1) (1) r 1 r( (1) 1) 1. The skewness and kurtosis of X are X Sk( X ) [E( X ) E( X ) E( X ) E( X ) ] / X, Ku( X ) [E( X ) 4E( X )E( X ) 6E( X ) E( X ) E( X ) ] / X. Now, by using the epression as below, the skewness and kurtosis of X can be written in (7) and (8), respectively.

9 S. Aryuyuen et al. / Songklanakarin J. Sci. Technol. 6 (4), , The R code of the parameter estimation of the ZINB-GE distribution with the LE method: mlogl<-function(theta,){ fzinbge<-function(theta,){ mm<-length(); k<-numeric(mm); zinbge<-function(theta,){ if(==0){ p<-(-log(theta[4]+((1-theta[4])*((gamma(theta[]+1) *gamma(1+theta[1]/theta[]))/gamma(theta[]+theta[1] /theta[]+1))))); }else if(>0){ pp1<-(gamma(theta[]+1)*(gamma(1+theta[1]/theta[]))) /(gamma(theta[]+theta[1]/theta[]+1)); for( in 1:){ p1<-((factorial()/(factorial()*factorial(-))) *(-1)^)*((gamma(theta[]+1)*gamma(1+(theta[1]+) /theta[]))/gamma(theta[]+(theta[1]+)/theta[]+1)); pp1<-pp1+p1; } p<-(-log(1-theta[4]))-log(factorial(theta[1]+-1)) +log(factorial(theta[1]-1))+log(factorial())-log(pp1); }p} for(i in 1 : length()){ k[i]<-zinbge(theta,[i])}k} sum(fzinbge(theta,))} theta.start<-c(start_r,start_alpha,start_beta,start_phi) out<-nlm(mlogl,theta.start, = ) r_le<-out$estimate[1] alpha_le<-out$estimate[] beta_le<-out$estimate[] phi_le<-out$estimate[4]

Parameters Estimation Methods for the Negative Binomial-Crack Distribution and Its Application

Parameters Estimation Methods for the Negative Binomial-Crack Distribution and Its Application Original Parameters Estimation Methods for the Negative Binomial-Crack Distribution and Its Application Pornpop Saengthong 1*, Winai Bodhisuwan 2 Received: 29 March 2013 Accepted: 15 May 2013 Abstract

More information

Songklanakarin Journal of Science and Technology SJST R4 Samutwachirawong

Songklanakarin Journal of Science and Technology SJST R4 Samutwachirawong Songlanaarin Journal of Science and Technology SJST-0-0.R Samutwachirawong The Zero-inflated Negative Binomial-Erlang Distribution: Application to Highly Pathogenic Avian Influenza HN in Thailand Journal:

More information

On the Negative Binomial-Generalized Exponential Distribution and its Applications

On the Negative Binomial-Generalized Exponential Distribution and its Applications Applied Mathematical Sciences, Vol. 11, 2017, no. 8, 345-360 HIKARI Ltd, www.m-hikari.com https://doi.org/10.12988/ams.2017.612288 On the Negative Binomial-Generalized Exponential Distribution and its

More information

Mohammed. Research in Pharmacoepidemiology National School of Pharmacy, University of Otago

Mohammed. Research in Pharmacoepidemiology National School of Pharmacy, University of Otago Mohammed Research in Pharmacoepidemiology (RIPE) @ National School of Pharmacy, University of Otago What is zero inflation? Suppose you want to study hippos and the effect of habitat variables on their

More information

Parametric Modelling of Over-dispersed Count Data. Part III / MMath (Applied Statistics) 1

Parametric Modelling of Over-dispersed Count Data. Part III / MMath (Applied Statistics) 1 Parametric Modelling of Over-dispersed Count Data Part III / MMath (Applied Statistics) 1 Introduction Poisson regression is the de facto approach for handling count data What happens then when Poisson

More information

A Practitioner s Guide to Generalized Linear Models

A Practitioner s Guide to Generalized Linear Models A Practitioners Guide to Generalized Linear Models Background The classical linear models and most of the minimum bias procedures are special cases of generalized linear models (GLMs). GLMs are more technically

More information

A THREE-PARAMETER WEIGHTED LINDLEY DISTRIBUTION AND ITS APPLICATIONS TO MODEL SURVIVAL TIME

A THREE-PARAMETER WEIGHTED LINDLEY DISTRIBUTION AND ITS APPLICATIONS TO MODEL SURVIVAL TIME STATISTICS IN TRANSITION new series, June 07 Vol. 8, No., pp. 9 30, DOI: 0.307/stattrans-06-07 A THREE-PARAMETER WEIGHTED LINDLEY DISTRIBUTION AND ITS APPLICATIONS TO MODEL SURVIVAL TIME Rama Shanker,

More information

Discrete Choice Modeling

Discrete Choice Modeling [Part 6] 1/55 0 Introduction 1 Summary 2 Binary Choice 3 Panel Data 4 Bivariate Probit 5 Ordered Choice 6 7 Multinomial Choice 8 Nested Logit 9 Heterogeneity 10 Latent Class 11 Mixed Logit 12 Stated Preference

More information

Introduction to Probability Theory for Graduate Economics Fall 2008

Introduction to Probability Theory for Graduate Economics Fall 2008 Introduction to Probability Theory for Graduate Economics Fall 008 Yiğit Sağlam October 10, 008 CHAPTER - RANDOM VARIABLES AND EXPECTATION 1 1 Random Variables A random variable (RV) is a real-valued function

More information

Analysis of Count Data A Business Perspective. George J. Hurley Sr. Research Manager The Hershey Company Milwaukee June 2013

Analysis of Count Data A Business Perspective. George J. Hurley Sr. Research Manager The Hershey Company Milwaukee June 2013 Analysis of Count Data A Business Perspective George J. Hurley Sr. Research Manager The Hershey Company Milwaukee June 2013 Overview Count data Methods Conclusions 2 Count data Count data Anything with

More information

Control Charts for Monitoring the Zero-Inflated Generalized Poisson Processes

Control Charts for Monitoring the Zero-Inflated Generalized Poisson Processes Thai Journal of Mathematics Volume 11 (2013) Number 1 : 237 249 http://thaijmath.in.cmu.ac.th ISSN 1686-0209 Control Charts for Monitoring the Zero-Inflated Generalized Poisson Processes Narunchara Katemee

More information

Songklanakarin Journal of Science and Technology SJST R2 Atikankul

Songklanakarin Journal of Science and Technology SJST R2 Atikankul The new Poisson mixed weighted Lindley distribution with applications to insurance claims data Journal: Songklanakarin Journal of Science and Technology Manuscript ID SJST--.R Manuscript Type: Date Submitted

More information

BINOMIAL MIXTURE OF ERLANG DISTRIBUTION

BINOMIAL MIXTURE OF ERLANG DISTRIBUTION Vol.4, No.2, pp.28-38, April 216 BINOMIAL MIXTURE OF ERLANG DISTRIBUTION Kareema Abed Al-Kadim and Rusul Nasir AL-Hussani College of Education for Pure Science, University of Babylon, Dept. of Mathematics

More information

Plotting data is one method for selecting a probability distribution. The following

Plotting data is one method for selecting a probability distribution. The following Advanced Analytical Models: Over 800 Models and 300 Applications from the Basel II Accord to Wall Street and Beyond By Johnathan Mun Copyright 008 by Johnathan Mun APPENDIX C Understanding and Choosing

More information

Statistical Model Of Road Traffic Crashes Data In Anambra State, Nigeria: A Poisson Regression Approach

Statistical Model Of Road Traffic Crashes Data In Anambra State, Nigeria: A Poisson Regression Approach Statistical Model Of Road Traffic Crashes Data In Anambra State, Nigeria: A Poisson Regression Approach Nwankwo Chike H., Nwaigwe Godwin I Abstract: Road traffic crashes are count (discrete) in nature.

More information

MODELING COUNT DATA Joseph M. Hilbe

MODELING COUNT DATA Joseph M. Hilbe MODELING COUNT DATA Joseph M. Hilbe Arizona State University Count models are a subset of discrete response regression models. Count data are distributed as non-negative integers, are intrinsically heteroskedastic,

More information

Exploring the Application of the Negative Binomial-Generalized Exponential Model for Analyzing Traffic Crash Data with Excess Zeros

Exploring the Application of the Negative Binomial-Generalized Exponential Model for Analyzing Traffic Crash Data with Excess Zeros Exploring the Application of the Negative Binomial-Generalized Exponential Model for Analyzing Traffic Crash Data with Excess Zeros Prathyusha Vangala Graduate Student Zachry Department of Civil Engineering

More information

The LmB Conferences on Multivariate Count Analysis

The LmB Conferences on Multivariate Count Analysis The LmB Conferences on Multivariate Count Analysis Title: On Poisson-exponential-Tweedie regression models for ultra-overdispersed count data Rahma ABID, C.C. Kokonendji & A. Masmoudi Email Address: rahma.abid.ch@gmail.com

More information

Poisson Inverse Gaussian (PIG) Model for Infectious Disease Count Data

Poisson Inverse Gaussian (PIG) Model for Infectious Disease Count Data American Journal of Theoretical and Applied Statistics 2016; 5(5): 326-333 http://www.sciencepublishinggroup.com/j/ajtas doi: 10.11648/j.ajtas.20160505.22 ISSN: 2326-8999 (Print); ISSN: 2326-9006 (Online)

More information

Counts using Jitters joint work with Peng Shi, Northern Illinois University

Counts using Jitters joint work with Peng Shi, Northern Illinois University of Claim Longitudinal of Claim joint work with Peng Shi, Northern Illinois University UConn Actuarial Science Seminar 2 December 2011 Department of Mathematics University of Connecticut Storrs, Connecticut,

More information

Model selection and comparison

Model selection and comparison Model selection and comparison an example with package Countr Tarak Kharrat 1 and Georgi N. Boshnakov 2 1 Salford Business School, University of Salford, UK. 2 School of Mathematics, University of Manchester,

More information

Estimators as Random Variables

Estimators as Random Variables Estimation Theory Overview Properties Bias, Variance, and Mean Square Error Cramér-Rao lower bound Maimum likelihood Consistency Confidence intervals Properties of the mean estimator Introduction Up until

More information

Prediction of Bike Rental using Model Reuse Strategy

Prediction of Bike Rental using Model Reuse Strategy Prediction of Bike Rental using Model Reuse Strategy Arun Bala Subramaniyan and Rong Pan School of Computing, Informatics, Decision Systems Engineering, Arizona State University, Tempe, USA. {bsarun, rong.pan}@asu.edu

More information

NELS 88. Latent Response Variable Formulation Versus Probability Curve Formulation

NELS 88. Latent Response Variable Formulation Versus Probability Curve Formulation NELS 88 Table 2.3 Adjusted odds ratios of eighth-grade students in 988 performing below basic levels of reading and mathematics in 988 and dropping out of school, 988 to 990, by basic demographics Variable

More information

Properties a Special Case of Weighted Distribution.

Properties a Special Case of Weighted Distribution. Applied Mathematics, 017, 8, 808-819 http://www.scirp.org/journal/am ISSN Online: 15-79 ISSN Print: 15-785 Size Biased Lindley Distribution and Its Properties a Special Case of Weighted Distribution Arooj

More information

Poisson Regression. Ryan Godwin. ECON University of Manitoba

Poisson Regression. Ryan Godwin. ECON University of Manitoba Poisson Regression Ryan Godwin ECON 7010 - University of Manitoba Abstract. These lecture notes introduce Maximum Likelihood Estimation (MLE) of a Poisson regression model. 1 Motivating the Poisson Regression

More information

Mixtures of Negative Binomial distributions for modelling overdispersion in RNA-Seq data

Mixtures of Negative Binomial distributions for modelling overdispersion in RNA-Seq data Mixtures of Negative Binomial distributions for modelling overdispersion in RNA-Seq data Cinzia Viroli 1 joint with E. Bonafede 1, S. Robin 2 & F. Picard 3 1 Department of Statistical Sciences, University

More information

DEEP, University of Lausanne Lectures on Econometric Analysis of Count Data Pravin K. Trivedi May 2005

DEEP, University of Lausanne Lectures on Econometric Analysis of Count Data Pravin K. Trivedi May 2005 DEEP, University of Lausanne Lectures on Econometric Analysis of Count Data Pravin K. Trivedi May 2005 The lectures will survey the topic of count regression with emphasis on the role on unobserved heterogeneity.

More information

Modelling using ARMA processes

Modelling using ARMA processes Modelling using ARMA processes Step 1. ARMA model identification; Step 2. ARMA parameter estimation Step 3. ARMA model selection ; Step 4. ARMA model checking; Step 5. forecasting from ARMA models. 33

More information

A Size-Biased Poisson-Aradhana Distribution with Applications

A Size-Biased Poisson-Aradhana Distribution with Applications American Journal of Mathematics and Statistics 18, 8(5): 16-15 DOI: 1.59/j.ajms.1885. A Size-Biased Poisson-Aradhana Distribution with Applications Rama Shanker 1,*, Kamlesh Kumar Shukla 1, Ravi Shanker,

More information

ebay/google short course: Problem set 2

ebay/google short course: Problem set 2 18 Jan 013 ebay/google short course: Problem set 1. (the Echange Parado) You are playing the following game against an opponent, with a referee also taking part. The referee has two envelopes (numbered

More information

Chapter 2. Discrete Distributions

Chapter 2. Discrete Distributions Chapter. Discrete Distributions Objectives ˆ Basic Concepts & Epectations ˆ Binomial, Poisson, Geometric, Negative Binomial, and Hypergeometric Distributions ˆ Introduction to the Maimum Likelihood Estimation

More information

Bivariate Weibull-power series class of distributions

Bivariate Weibull-power series class of distributions Bivariate Weibull-power series class of distributions Saralees Nadarajah and Rasool Roozegar EM algorithm, Maximum likelihood estimation, Power series distri- Keywords: bution. Abstract We point out that

More information

Class Notes: Week 8. Probit versus Logit Link Functions and Count Data

Class Notes: Week 8. Probit versus Logit Link Functions and Count Data Ronald Heck Class Notes: Week 8 1 Class Notes: Week 8 Probit versus Logit Link Functions and Count Data This week we ll take up a couple of issues. The first is working with a probit link function. While

More information

A Size-Biased Poisson-Amarendra Distribution and Its Applications

A Size-Biased Poisson-Amarendra Distribution and Its Applications International Journal of Statistics and Applications 6, 6(6): 376-38 DOI:.93/j.statistics.666.6 A Size-Biased Poisson-Amarendra Distribution and Its Applications Rama Shanker,*, Hagos Fesshaye Department

More information

Chapter 5. Statistical Models in Simulations 5.1. Prof. Dr. Mesut Güneş Ch. 5 Statistical Models in Simulations

Chapter 5. Statistical Models in Simulations 5.1. Prof. Dr. Mesut Güneş Ch. 5 Statistical Models in Simulations Chapter 5 Statistical Models in Simulations 5.1 Contents Basic Probability Theory Concepts Discrete Distributions Continuous Distributions Poisson Process Empirical Distributions Useful Statistical Models

More information

Estimators in simple random sampling: Searls approach

Estimators in simple random sampling: Searls approach Songklanakarin J Sci Technol 35 (6 749-760 Nov - Dec 013 http://wwwsjstpsuacth Original Article Estimators in simple random sampling: Searls approach Jirawan Jitthavech* and Vichit Lorchirachoonkul School

More information

Partitioning variation in multilevel models.

Partitioning variation in multilevel models. Partitioning variation in multilevel models. by Harvey Goldstein, William Browne and Jon Rasbash Institute of Education, London, UK. Summary. In multilevel modelling, the residual variation in a response

More information

Generalized Linear Models for Count, Skewed, and If and How Much Outcomes

Generalized Linear Models for Count, Skewed, and If and How Much Outcomes Generalized Linear Models for Count, Skewed, and If and How Much Outcomes Today s Class: Review of 3 parts of a generalized model Models for discrete count or continuous skewed outcomes Models for two-part

More information

On Size- Biased Two Parameter Poisson-Lindley Distribution and Its Applications

On Size- Biased Two Parameter Poisson-Lindley Distribution and Its Applications American Journal of Mathematics and Statistics 7, 7(): 99-7 DOI:.9/j.ajms.77. On Size- Biased Two Parameter Poisson-Lindley Distribution and Its Applications Rama Shaner,*, A. Mishra Department of Statistics,

More information

Deal with Excess Zeros in the Discrete Dependent Variable, the Number of Homicide in Chicago Census Tract

Deal with Excess Zeros in the Discrete Dependent Variable, the Number of Homicide in Chicago Census Tract Deal with Excess Zeros in the Discrete Dependent Variable, the Number of Homicide in Chicago Census Tract Gideon D. Bahn 1, Raymond Massenburg 2 1 Loyola University Chicago, 1 E. Pearson Rm 460 Chicago,

More information

Model Estimation Example

Model Estimation Example Ronald H. Heck 1 EDEP 606: Multivariate Methods (S2013) April 7, 2013 Model Estimation Example As we have moved through the course this semester, we have encountered the concept of model estimation. Discussions

More information

LOGISTIC REGRESSION Joseph M. Hilbe

LOGISTIC REGRESSION Joseph M. Hilbe LOGISTIC REGRESSION Joseph M. Hilbe Arizona State University Logistic regression is the most common method used to model binary response data. When the response is binary, it typically takes the form of

More information

Testing and Model Selection

Testing and Model Selection Testing and Model Selection This is another digression on general statistics: see PE App C.8.4. The EViews output for least squares, probit and logit includes some statistics relevant to testing hypotheses

More information

Determination of Quick Switching System by Attributes under the Conditions of Zero-Inflated Poisson Distribution

Determination of Quick Switching System by Attributes under the Conditions of Zero-Inflated Poisson Distribution International Journal of Statistics and Systems ISSN 0973-2675 Volume 11, Number 2 (2016), pp. 157-165 Research India Publications http://www.ripublication.com Determination of Quick Switching System by

More information

A BIMODAL EXPONENTIAL POWER DISTRIBUTION

A BIMODAL EXPONENTIAL POWER DISTRIBUTION Pak. J. Statist. Vol. 6(), 379-396 A BIMODAL EXPONENTIAL POWER DISTRIBUTION Mohamed Y. Hassan and Rafiq H. Hijazi Department of Statistics United Arab Emirates University P.O. Box 7555, Al-Ain, U.A.E.

More information

Random variable X is a mapping that maps each outcome s in the sample space to a unique real number x, x. X s. Real Line

Random variable X is a mapping that maps each outcome s in the sample space to a unique real number x, x. X s. Real Line Random Variable Random variable is a mapping that maps each outcome s in the sample space to a unique real number,. s s : outcome Sample Space Real Line Eamples Toss a coin. Define the random variable

More information

Size Biased Lindley Distribution Properties and its Applications: A Special Case of Weighted Distribution

Size Biased Lindley Distribution Properties and its Applications: A Special Case of Weighted Distribution Size Biased Lindley Distribution Properties and its Applications: A Special Case of Weighted Distribution Arooj Ayesha* Department of Mathematics and Statistics, University of agriculture Faisalabad, Pakistan

More information

Performance of Autoregressive Order Selection Criteria: A Simulation Study

Performance of Autoregressive Order Selection Criteria: A Simulation Study Pertanika J. Sci. & Technol. 6 (2): 7-76 (2008) ISSN: 028-7680 Universiti Putra Malaysia Press Performance of Autoregressive Order Selection Criteria: A Simulation Study Venus Khim-Sen Liew, Mahendran

More information

2 The Polygonal Distribution

2 The Polygonal Distribution 2 The Polygonal Distribution Dimitris Karlis 1 and Evdokia Xekalaki 2 1 Department of Statistics, Athens University of Economics and Business 2 Department of Statistics, Athens University of Economics

More information

Estimation in zero-inflated Generalized Poisson distribution

Estimation in zero-inflated Generalized Poisson distribution Journal of Data Science 18(2018), 183-206 Estimation in zero-inflated Generalized Poisson distribution Kirtee K. Kamalja and Yogita S. Wagh Department of Statistics, School of Mathematical Sciences, North

More information

EE/CpE 345. Modeling and Simulation. Fall Class 5 September 30, 2002

EE/CpE 345. Modeling and Simulation. Fall Class 5 September 30, 2002 EE/CpE 345 Modeling and Simulation Class 5 September 30, 2002 Statistical Models in Simulation Real World phenomena of interest Sample phenomena select distribution Probabilistic, not deterministic Model

More information

ARCONES MANUAL FOR THE SOA EXAM P/CAS EXAM 1, PROBABILITY, SPRING 2010 EDITION.

ARCONES MANUAL FOR THE SOA EXAM P/CAS EXAM 1, PROBABILITY, SPRING 2010 EDITION. A self published manuscript ARCONES MANUAL FOR THE SOA EXAM P/CAS EXAM 1, PROBABILITY, SPRING 21 EDITION. M I G U E L A R C O N E S Miguel A. Arcones, Ph. D. c 28. All rights reserved. Author Miguel A.

More information

L06. Chapter 6: Continuous Probability Distributions

L06. Chapter 6: Continuous Probability Distributions L06 Chapter 6: Continuous Probability Distributions Probability Chapter 6 Continuous Probability Distributions Recall Discrete Probability Distributions Could only take on particular values Continuous

More information

Chapter 9 Regression. 9.1 Simple linear regression Linear models Least squares Predictions and residuals.

Chapter 9 Regression. 9.1 Simple linear regression Linear models Least squares Predictions and residuals. 9.1 Simple linear regression 9.1.1 Linear models Response and eplanatory variables Chapter 9 Regression With bivariate data, it is often useful to predict the value of one variable (the response variable,

More information

Applied Econometrics (QEM)

Applied Econometrics (QEM) Applied Econometrics (QEM) based on Prinicples of Econometrics Jakub Mućk Department of Quantitative Economics Jakub Mućk Applied Econometrics (QEM) Meeting #3 1 / 42 Outline 1 2 3 t-test P-value Linear

More information

Model Selection for Semiparametric Bayesian Models with Application to Overdispersion

Model Selection for Semiparametric Bayesian Models with Application to Overdispersion Proceedings 59th ISI World Statistics Congress, 25-30 August 2013, Hong Kong (Session CPS020) p.3863 Model Selection for Semiparametric Bayesian Models with Application to Overdispersion Jinfang Wang and

More information

Statistical Models for Defective Count Data

Statistical Models for Defective Count Data Statistical Models for Defective Count Data Gerhard Neubauer a, Gordana -Duraš a, and Herwig Friedl b a Statistical Applications, Joanneum Research, Graz, Austria b Institute of Statistics, University

More information

The In-and-Out-of-Sample (IOS) Likelihood Ratio Test for Model Misspecification p.1/27

The In-and-Out-of-Sample (IOS) Likelihood Ratio Test for Model Misspecification p.1/27 The In-and-Out-of-Sample (IOS) Likelihood Ratio Test for Model Misspecification Brett Presnell Dennis Boos Department of Statistics University of Florida and Department of Statistics North Carolina State

More information

Variable Selection in Restricted Linear Regression Models. Y. Tuaç 1 and O. Arslan 1

Variable Selection in Restricted Linear Regression Models. Y. Tuaç 1 and O. Arslan 1 Variable Selection in Restricted Linear Regression Models Y. Tuaç 1 and O. Arslan 1 Ankara University, Faculty of Science, Department of Statistics, 06100 Ankara/Turkey ytuac@ankara.edu.tr, oarslan@ankara.edu.tr

More information

CHOOSING AMONG GENERALIZED LINEAR MODELS APPLIED TO MEDICAL DATA

CHOOSING AMONG GENERALIZED LINEAR MODELS APPLIED TO MEDICAL DATA STATISTICS IN MEDICINE, VOL. 17, 59 68 (1998) CHOOSING AMONG GENERALIZED LINEAR MODELS APPLIED TO MEDICAL DATA J. K. LINDSEY AND B. JONES* Department of Medical Statistics, School of Computing Sciences,

More information

ON MODEL SELECTION FOR STATE ESTIMATION FOR NONLINEAR SYSTEMS. Robert Bos,1 Xavier Bombois Paul M. J. Van den Hof

ON MODEL SELECTION FOR STATE ESTIMATION FOR NONLINEAR SYSTEMS. Robert Bos,1 Xavier Bombois Paul M. J. Van den Hof ON MODEL SELECTION FOR STATE ESTIMATION FOR NONLINEAR SYSTEMS Robert Bos,1 Xavier Bombois Paul M. J. Van den Hof Delft Center for Systems and Control, Delft University of Technology, Mekelweg 2, 2628 CD

More information

Overdispersion Workshop in generalized linear models Uppsala, June 11-12, Outline. Overdispersion

Overdispersion Workshop in generalized linear models Uppsala, June 11-12, Outline. Overdispersion Biostokastikum Overdispersion is not uncommon in practice. In fact, some would maintain that overdispersion is the norm in practice and nominal dispersion the exception McCullagh and Nelder (1989) Overdispersion

More information

PENALIZING YOUR MODELS

PENALIZING YOUR MODELS PENALIZING YOUR MODELS AN OVERVIEW OF THE GENERALIZED REGRESSION PLATFORM Michael Crotty & Clay Barker Research Statisticians JMP Division, SAS Institute Copyr i g ht 2012, SAS Ins titut e Inc. All rights

More information

Songklanakarin Journal of Science and Technology SJST R3 LAWSON

Songklanakarin Journal of Science and Technology SJST R3 LAWSON Songklanakarin Journal of Science and Technology SJST-0-00.R LAWSON Ratio Estimators of Population Means Using Quartile Function of Auxiliary Variable in Double Sampling Journal: Songklanakarin Journal

More information

Basics of Experimental Design. Review of Statistics. Basic Study. Experimental Design. When an Experiment is Not Possible. Studying Relations

Basics of Experimental Design. Review of Statistics. Basic Study. Experimental Design. When an Experiment is Not Possible. Studying Relations Basics of Experimental Design Review of Statistics And Experimental Design Scientists study relation between variables In the context of experiments these variables are called independent and dependent

More information

Multivariate negative binomial models for insurance claim counts

Multivariate negative binomial models for insurance claim counts Multivariate negative binomial models for insurance claim counts Peng Shi (Northern Illinois University) and Emiliano A. Valdez (University of Connecticut) 9 November 0, Montréal, Quebec Université de

More information

McGill University. Department of Epidemiology and Biostatistics. Bayesian Analysis for the Health Sciences. Course EPIB-675.

McGill University. Department of Epidemiology and Biostatistics. Bayesian Analysis for the Health Sciences. Course EPIB-675. McGill University Department of Epidemiology and Biostatistics Bayesian Analysis for the Health Sciences Course EPIB-675 Lawrence Joseph Bayesian Analysis for the Health Sciences EPIB-675 3 credits Instructor:

More information

Exponential modified discrete Lindley distribution

Exponential modified discrete Lindley distribution Yilmaz et al. SpringerPlus 6 5:66 DOI.86/s464-6-33- RESEARCH Open Access Eponential modified discrete Lindley distribution Mehmet Yilmaz *, Monireh Hameldarbandi and Sibel Aci Kemaloglu *Correspondence:

More information

Spatial Variation in Infant Mortality with Geographically Weighted Poisson Regression (GWPR) Approach

Spatial Variation in Infant Mortality with Geographically Weighted Poisson Regression (GWPR) Approach Spatial Variation in Infant Mortality with Geographically Weighted Poisson Regression (GWPR) Approach Kristina Pestaria Sinaga, Manuntun Hutahaean 2, Petrus Gea 3 1, 2, 3 University of Sumatera Utara,

More information

On rate of convergence in distribution of asymptotically normal statistics based on samples of random size

On rate of convergence in distribution of asymptotically normal statistics based on samples of random size Annales Mathematicae et Informaticae 39 212 pp. 17 28 Proceedings of the Conference on Stochastic Models and their Applications Faculty of Informatics, University of Debrecen, Debrecen, Hungary, August

More information

MACHINE LEARNING ADVANCED MACHINE LEARNING

MACHINE LEARNING ADVANCED MACHINE LEARNING MACHINE LEARNING ADVANCED MACHINE LEARNING Recap of Important Notions on Estimation of Probability Density Functions 2 2 MACHINE LEARNING Overview Definition pdf Definition joint, condition, marginal,

More information

Lecture-19: Modeling Count Data II

Lecture-19: Modeling Count Data II Lecture-19: Modeling Count Data II 1 In Today s Class Recap of Count data models Truncated count data models Zero-inflated models Panel count data models R-implementation 2 Count Data In many a phenomena

More information

Arma-Arch Modeling Of The Returns Of First Bank Of Nigeria

Arma-Arch Modeling Of The Returns Of First Bank Of Nigeria Arma-Arch Modeling Of The Returns Of First Bank Of Nigeria Emmanuel Alphonsus Akpan Imoh Udo Moffat Department of Mathematics and Statistics University of Uyo, Nigeria Ntiedo Bassey Ekpo Department of

More information

Application of Poisson and Negative Binomial Regression Models in Modelling Oil Spill Data in the Niger Delta

Application of Poisson and Negative Binomial Regression Models in Modelling Oil Spill Data in the Niger Delta International Journal of Science and Engineering Investigations vol. 7, issue 77, June 2018 ISSN: 2251-8843 Application of Poisson and Negative Binomial Regression Models in Modelling Oil Spill Data in

More information

An Introduction to Path Analysis

An Introduction to Path Analysis An Introduction to Path Analysis PRE 905: Multivariate Analysis Lecture 10: April 15, 2014 PRE 905: Lecture 10 Path Analysis Today s Lecture Path analysis starting with multivariate regression then arriving

More information

Computer Science, Informatik 4 Communication and Distributed Systems. Simulation. Discrete-Event System Simulation. Dr.

Computer Science, Informatik 4 Communication and Distributed Systems. Simulation. Discrete-Event System Simulation. Dr. Simulation Discrete-Event System Simulation Chapter 4 Statistical Models in Simulation Purpose & Overview The world the model-builder sees is probabilistic rather than deterministic. Some statistical model

More information

6. Assessing studies based on multiple regression

6. Assessing studies based on multiple regression 6. Assessing studies based on multiple regression Questions of this section: What makes a study using multiple regression (un)reliable? When does multiple regression provide a useful estimate of the causal

More information

A Modified Poisson Exponentially Weighted Moving Average Chart Based on Improved Square Root Transformation

A Modified Poisson Exponentially Weighted Moving Average Chart Based on Improved Square Root Transformation Thailand Statistician July 216; 14(2): 197-22 http://statassoc.or.th Contributed paper A Modified Poisson Exponentially Weighted Moving Average Chart Based on Improved Square Root Transformation Saowanit

More information

Rejection sampling - Acceptance probability. Review: How to sample from a multivariate normal in R. Review: Rejection sampling. Weighted resampling

Rejection sampling - Acceptance probability. Review: How to sample from a multivariate normal in R. Review: Rejection sampling. Weighted resampling Rejection sampling - Acceptance probability Review: How to sample from a multivariate normal in R Goal: Simulate from N d (µ,σ)? Note: For c to be small, g() must be similar to f(). The art of rejection

More information

Bootstrap Simulation Procedure Applied to the Selection of the Multiple Linear Regressions

Bootstrap Simulation Procedure Applied to the Selection of the Multiple Linear Regressions JKAU: Sci., Vol. 21 No. 2, pp: 197-212 (2009 A.D. / 1430 A.H.); DOI: 10.4197 / Sci. 21-2.2 Bootstrap Simulation Procedure Applied to the Selection of the Multiple Linear Regressions Ali Hussein Al-Marshadi

More information

CARLETON ECONOMIC PAPERS

CARLETON ECONOMIC PAPERS CEP 16-01 The Role of Model Specification in Estimating Health Care Demand Hossein Kavand Carleton University Marcel-Cristian Voia Carleton University January 2016 CARLETON ECONOMIC PAPERS Department of

More information

Stat 5101 Lecture Notes

Stat 5101 Lecture Notes Stat 5101 Lecture Notes Charles J. Geyer Copyright 1998, 1999, 2000, 2001 by Charles J. Geyer May 7, 2001 ii Stat 5101 (Geyer) Course Notes Contents 1 Random Variables and Change of Variables 1 1.1 Random

More information

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages:

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages: Glossary The ISI glossary of statistical terms provides definitions in a number of different languages: http://isi.cbs.nl/glossary/index.htm Adjusted r 2 Adjusted R squared measures the proportion of the

More information

2. A Basic Statistical Toolbox

2. A Basic Statistical Toolbox . A Basic Statistical Toolbo Statistics is a mathematical science pertaining to the collection, analysis, interpretation, and presentation of data. Wikipedia definition Mathematical statistics: concerned

More information

Generalized Linear Models for Non-Normal Data

Generalized Linear Models for Non-Normal Data Generalized Linear Models for Non-Normal Data Today s Class: 3 parts of a generalized model Models for binary outcomes Complications for generalized multivariate or multilevel models SPLH 861: Lecture

More information

36-309/749 Experimental Design for Behavioral and Social Sciences. Dec 1, 2015 Lecture 11: Mixed Models (HLMs)

36-309/749 Experimental Design for Behavioral and Social Sciences. Dec 1, 2015 Lecture 11: Mixed Models (HLMs) 36-309/749 Experimental Design for Behavioral and Social Sciences Dec 1, 2015 Lecture 11: Mixed Models (HLMs) Independent Errors Assumption An error is the deviation of an individual observed outcome (DV)

More information

Hidden Markov models for time series of counts with excess zeros

Hidden Markov models for time series of counts with excess zeros Hidden Markov models for time series of counts with excess zeros Madalina Olteanu and James Ridgway University Paris 1 Pantheon-Sorbonne - SAMM, EA4543 90 Rue de Tolbiac, 75013 Paris - France Abstract.

More information

On estimation of the Poisson parameter in zero-modied Poisson models

On estimation of the Poisson parameter in zero-modied Poisson models Computational Statistics & Data Analysis 34 (2000) 441 459 www.elsevier.com/locate/csda On estimation of the Poisson parameter in zero-modied Poisson models Ekkehart Dietz, Dankmar Bohning Department of

More information

Introduction and Overview STAT 421, SP Course Instructor

Introduction and Overview STAT 421, SP Course Instructor Introduction and Overview STAT 421, SP 212 Prof. Prem K. Goel Mon, Wed, Fri 3:3PM 4:48PM Postle Hall 118 Course Instructor Prof. Goel, Prem E mail: goel.1@osu.edu Office: CH 24C (Cockins Hall) Phone: 614

More information

Confidence Intervals for the Coefficient of Variation in a Normal Distribution with a Known Mean and a Bounded Standard Deviation

Confidence Intervals for the Coefficient of Variation in a Normal Distribution with a Known Mean and a Bounded Standard Deviation KMUTNB Int J Appl Sci Technol, Vol. 10, No. 2, pp. 79 88, 2017 Research Article Confidence Intervals for the Coefficient of Variation in a Normal Distribution with a Known Mean and a Bounded Standard Deviation

More information

Marginal Specifications and a Gaussian Copula Estimation

Marginal Specifications and a Gaussian Copula Estimation Marginal Specifications and a Gaussian Copula Estimation Kazim Azam Abstract Multivariate analysis involving random variables of different type like count, continuous or mixture of both is frequently required

More information

Expectation Maximization Deconvolution Algorithm

Expectation Maximization Deconvolution Algorithm Epectation Maimization Deconvolution Algorithm Miaomiao ZHANG March 30, 2011 Abstract In this paper, we use a general mathematical and eperimental methodology to analyze image deconvolution. The main procedure

More information

ON INTERVENED NEGATIVE BINOMIAL DISTRIBUTION AND SOME OF ITS PROPERTIES

ON INTERVENED NEGATIVE BINOMIAL DISTRIBUTION AND SOME OF ITS PROPERTIES STATISTICA, anno LII, n. 4, 01 ON INTERVENED NEGATIVE BINOMIAL DISTRIBUTION AND SOME OF ITS PROPERTIES 1. INTRODUCTION Since 1985, intervened tpe distributions have received much attention in the literature.

More information

Chapter 3 Single Random Variables and Probability Distributions (Part 1)

Chapter 3 Single Random Variables and Probability Distributions (Part 1) Chapter 3 Single Random Variables and Probability Distributions (Part 1) Contents What is a Random Variable? Probability Distribution Functions Cumulative Distribution Function Probability Density Function

More information

Chapter 2. Random Variable. Define single random variables in terms of their PDF and CDF, and calculate moments such as the mean and variance.

Chapter 2. Random Variable. Define single random variables in terms of their PDF and CDF, and calculate moments such as the mean and variance. Chapter 2 Random Variable CLO2 Define single random variables in terms of their PDF and CDF, and calculate moments such as the mean and variance. 1 1. Introduction In Chapter 1, we introduced the concept

More information

Reconstruction of individual patient data for meta analysis via Bayesian approach

Reconstruction of individual patient data for meta analysis via Bayesian approach Reconstruction of individual patient data for meta analysis via Bayesian approach Yusuke Yamaguchi, Wataru Sakamoto and Shingo Shirahata Graduate School of Engineering Science, Osaka University Masashi

More information

Psychology 282 Lecture #4 Outline Inferences in SLR

Psychology 282 Lecture #4 Outline Inferences in SLR Psychology 282 Lecture #4 Outline Inferences in SLR Assumptions To this point we have not had to make any distributional assumptions. Principle of least squares requires no assumptions. Can use correlations

More information

A Comparison of Different Methods of Zero- Inflated Data Analysis and an Application in Health Surveys

A Comparison of Different Methods of Zero- Inflated Data Analysis and an Application in Health Surveys Journal of Modern Applied Statistical Methods Volume 16 Issue 1 Article 29 5-1-2017 A Comparison of Different Methods of Zero- Inflated Data Analysis and an Application in Health Surveys Si Yang University

More information

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A. 1. Let P be a probability measure on a collection of sets A. (a) For each n N, let H n be a set in A such that H n H n+1. Show that P (H n ) monotonically converges to P ( k=1 H k) as n. (b) For each n

More information