CHAPTER 4 On q-gamma Distributions, Marshall-Olkin q-gamma Distributions and Minification Processes 4.1 Introduction Several skewed distributions such as logistic, Weibull, gamma and beta distributions have been used for modelling various random phenomena which arise in engineering, time series modelling, reliability assessment studies etc. Mathai (2005) introduced a real matrix variate probability model which cover almost all real matrix-variate densities used in multivariate statistical analysis. Through the new density introduced, a pathway is created from matrix- variate type I to matrix- variate type II beta to matrix- variate gamma to matrixvariate Gaussian densities. In this chapter we consider the scalar version of the matrixvariate gamma distribution and study the various properties. Recently several q-type distributions such as q-exponential, q-weibull, and q-logistic and various pathway models are 65
developed in the context of information theory, statistical mechanics, reliability modelling etc. See Wilk and Wlodarczyk (2000, 2001), Tsallis (1988),Jose and Naik (2009). Marshall and Olkin (1997) introduced a method of construction of a family of distributions by adding a new parameter to an existing distribution which resulted in added flexibility of the distribution. Alice and Jose (2003, 2004, 2005a, 2005b), conducted detailed studies of various Marshall-Olkin distributions. Ristic et al. (2007) introduced and studied the various properties of Marshall-Olkin gamma distribution. Jose, Naik and Ristic (2008) introduced Marshall-Olkin q- Weibull distribution and also developed time series models having this distribution as marginals. Different first order autoregressive (AR(1)) time series models can be developed using the Marshall-Olkin distribution which are very useful in various fields like time series modelling,reliability studies etc. Here we also extend the q-gamma distribution to obtain a more general class called Marshall-Olkin q- gamma distribution. In this chapter we give a detailed study of q-gamma distribution and its properties. The estimation of the parameters is discussed and a theorem which is useful in stressstrength analysis is established. The new distribution is found to be a better fit than the standard gamma distribution using a real data set. We introduce the Marshall-Olkin q- gamma distribution and develop AR(1) minification model and max-min process. Also the sample path properties are studied. 4.2 The q-analogue of the Gamma Distribution Mathai (2005) introduced a pathway model connecting matrix variate gamma distribution and normal densities. The following is the scalar version of the matrix-variate density introduced by Mathai (2005). When the pathway parameter q > 1: f 1 (x) = c 1 x α 1 [1 + a(q 1) x δ ] ; (4.2.1) < x <, a > 0, > 0, δ > 0, q > 1 66
where c 1 = ( ) δ[a(q 1)] α δ Γ q 1 ( α ) ( 2Γ Γ δ q 1 α δ ), R ( q 1 > α ), R(α) > 0 δ where R( ) denotes the real part of ( ). Since in most statistical problems the parameters are real, we will consider all parameters to be real. Note that for q < 1, writing q 1 = (1 q), the density in (4.2.1) reduces to the following form: f 2 (x) = c 2 x α 1 [1 a(1 q) x δ ] 1 q ; (4.2.2) q < 1, a > 0, > 0, δ > 0, 1 a(1 q) x δ > 0 where the normalizing constant c 2 is given by c 2 = ( ) δ(a(1 q)) α δ Γ + α + 1 1 q δ 2Γ ( ) ( ), R(α) > 0. α δ Γ + 1 1 q As q 1, f 1 (x) and f 2 (x) tend to f 3 (x), which is the generalized gamma density. f 3 (x) is given by f 3 (x) = δ(a) α δ 2Γ ( ) x α 1 exp( a x δ ); < x < ; a, α,, δ > 0. (4.2.3) α δ For different values of the parameters in (4.2.1) and (4.2.2), we get different distributions like beta type-1, beta type-2, etc. More results are available in Mathai and Haubold (2007) and Mathai and Provost (2006). Here we introduce a q-analogue of the gamma distribution as a special case of Mathai (2005), by taking δ = 1, a = 1 in f 1 (x). Then for x > 0 and for q > 1, f 1 (x) reduces to the following density which we call the q-gamma distribution: f 4 (x) = k 1 x α 1 [1 + (q 1)x)] (4.2.4) where the normalizing constant k 1 is given by 67
Table 4.1: The q-gamma distribution characteristics when q > 1 Characteristics Functional form p.d.f f 4 (x) = ()α Γ [1 + (q 1)x)] ΓαΓ( ; x > 0, α, > 0, α)xα 1 CDF F 4 (x) = B(x(),α, α) B(α, α) HRF h 4 (x) = ()α x α 1 [1+()x)] CHR H 4 (x) = B(α, α) B(x(),α, α) (x(),α, α) B(α, α) (x(),α, α) s th moment E(x s ) = Γ(α+s)Γ α s Γ(α)Γ( α). ()s mean value E(x) = α (α 1)() k 1 = (q 1)α Γ ( ) (4.2.5) ΓαΓ α For x > 0 and q < 1, f 2 (x) reduces to the following density function of the q-gamma distribution: where k 2 is given by f 5 (x) = k 2 x α 1 [1 (1 q)x)] 1 q (4.2.6) k 2 = ( ) (1 q) α Γ + α + 1 1 q ( ) (4.2.7) ΓαΓ + 1 1 q for 0 x 1 (1 q), α > 0. The q-gamma distributions exhibits various shapes as can be seen from the figures 4.1 and 4.2. and hence it can be applied for modelling different types of data from various fields. The graph of the hazard rate function of MO-q-G is flexible and by giving various values for the pathway parameter q we can get different shapes which makes it applicable in a variety of situations (see figure 4.3). 68
Table 4.2: The q-gamma distribution characteristics when q < 1 Characteristics Functional form p.d.f f 4 (x) = (1 q)α Γ 1 q +α+1 x α 1 [1 (1 q)x] 1 q CDF ΓαΓ( α) 1 q F 4 (x) = B(x(1 q),α, 1 q α) B(α, 1 q α) 1 q HRF h 4 (x) = (1 q)α x α 1 [1 (1 q)x)] CHR H 4 (x) = s th moment E(x s ) = B(α, 1 q α) B(x(1 q),α, 1 q α) B(x(1 q),α, 1 q α) B(α, 1 q α) B(x(1 q),α, 1 q α) Γ(α+s)Γ 1 q +α+1 Γ(α)Γ( +α+s+1)(1 q)6s. 1 q mean value E(x) = α +(1 q)(α+1) Figure 4.1: The q-gamma pdf for q > 1 69
Figure 4.2: The q-gamma pdf for q < 1 Figure 4.3: Hazard rate function for various values of q 70
Figure 4.4: Estimated densities of gamma and q-gamma. 4.3 Estimation (i) First we consider method of moments. Since the distribution has explicit expressions for the moments we can get the values of the parameters by equating sample and population moments. We kept = 1 and estimated the values of the parameters for the data given in Gross and Clark(1975)(Data given in chapter 3). The estimated densities are given and the graph shows that q-gamma better fits the data. The estimates were obtained as α = 4.0127 and q = 0.7782. The estimated densities are given in figure (4.4). (ii) Now we consider maximum likelihood estimation for the parameters. When q > 1 the likelihood function is L = (q 1)n α ((Γ(/(q 1)) n x α 1 Γ(α) n (Γ(/(q 1) α) n i [1 + (q 1)x i )] 71
log L = nα log(q 1) + n log(γ(/(q 1))) + (α 1) (log((x i ) /(q 1))) log L α log L log L q (log((1 + (q 1)x i ))) n log(γ(α)) n log(γ(beta/(q 1) α)) = n log(q 1) + (log(y)) nψ(α) + nψ(/(q 1) α) 1 = nψ(/(q 1)) q 1 1 (log(1 + (q 1)y)) q 1 n ψ(/(q 1) α) q 1 = nα q 1 n (q 1) ψ(/(q 1)) 1 2 q 1 1 (log(1 + (q 1)y)) q 1 n ψ(/(q 1) α). q 1 The function nlm in R package can be used to solve these normal equations. The case q < 1 can be considered similarly. 4.4 Application in Stress-Strength Analysis Stress-Strength analysis is concerned with the probability of failure which is the probability of stress exceeding strength. Let X represent the stress and Y represent the strength of the system. Then R = P (X < Y ) is the probability that the system will function. More details on stress-strength analysis can be found in Kotz et al. (2007). H-function is defined as follows: [ Hp,q m,n z ] (a 1,α 1 ),(a 2,α 2 ),,(a k,α k ) (b 1, 1 ),(b 2, 2 ),,(b q, q) = 1 h(s)z s ds 2πi L where h(s) = { m } { n } Γ(b j + j s) Γ(1 a j α j s) j=1 { q j=1 } { p } Γ(1 b j j s) Γ(a j + α j s) j=m+1 j=n+1 and L is a suitable path. Here a 1, a 2,..., a p ; b 1, b 2,..., b q are complex numbers and α 1, α 2,..., α p, 1, 2,..., q 72
are positive real numbers. The poles of Γ(b j + j s), j = 1, 2,..., m are at the points s = b j+ν j where j = 1, 2,..., m, ν = 0, 1,... and the poles of Γ(1 a j α j s), j = 1, 2,..., n are at s = 1 a k+λ α k where k = 1, 2,..., n, λ = 0, 1,.... The details of the existence conditions, various properties and applications of H-functions are available in Mathai (1993). Now we state a theorem which is useful in the context of stress-strength analysis of a multi component system subjected to a number of independent stresses X 1, X 2,..., X k, whose cumulative effect is the actual effective stress against a single strength Y. Then R = P ( k i=1 X i < Y ) is the probability that the system will function and it is called the reliability of the system. Theorem 4.4.1. Let {X i, i = 1, 2,..., k} be i.i.d. random variables having a q- Gamma distribution function and Y be distributed as exponential distribution with parameter θ. Then P ( k i=1 X i < Y ) is an H-function. Proof: Let X 1, X 2,..., X k be continuous i.i.d. q-gamma r.v.s and let Y follow the exponential distribution with parameter θ. Let g 1 ( ) and g 2 ( ) be the probability density functions of the q-gamma distribution for q < 1 and for q > 1. Let g k ( ) and G k ( ) be the probability density function and cumulative distribution function of the random variable k i=1 X i. Let g 1( ) and g 2( ) be the corresponding Laplace transforms of the q-gamma distribution for q > 1 and for q < 1 respectively. Then P ( k X i < Y ) = θ i=1 = 0 G k (x)e θx dx {g1(θ)} k for q > 1 {g2(θ)} k for q < 1. Now for q > 1 we have to evaluate the integral I 1(α, λ, q : θ) = 0 e θx x α 1 [1 + (q 1)x)] dx. The integrand can be taken as a product of two integrable functions. Let x 1 and x 2 be two 73
independently distributed real scalar random variables with probability density functions f 1 (x) and f 2 (x). Consider the transformation u = x 1 x 2 and v = x 2 dx 1 dx 2 = v du dv where is the wedge product discussed in ( Mathai1997). Then the joint density of u and v is g(u, v) = vf 1 (uv)f 2 (v). Now g 1 (u) is obtained by integrating out the joint probability density function g(u, v) with respect to v. That is g 1 (u) = v vf 1(uv)f 2 (v)dv. Let f 1 (x 1 ) = c 1 e θx 1, x 1 0 and f 2 (x 2 ) = c 2 x α 1 2 [1 + (q 1)x 2 ], x2 0, α, λ > 0 where c 1 and c 2 are normalizing constants. These constants can be obtained by integrating f 1 (x) and f 2 (x) with respect to x. Thus we have g 1 (u) = c 1 c 2 Now E(x1 s 1 ) = c 1 E(x 1 s 0 2 ) = c 2 = = 0 0 c 2 (q 1) α s+1 e θuv v α 1 {1 + (q 1)v] dv. (4.4.1) x s 1 1 e θx 1 dx 1 = θ s Γ(s), R(s) > 0 x α s 2 [1 + (q 1)x 2 ] dx2 ω α s (1 + ω) dω 0 c 2 Γ(α s + 1) Γ( (q 1) α s+1 α + s 1) Γ( ), for R(1 s α ) > 0, R( 1 q 1 + s 1) > 0. α Then E(u) s 1 = E(x 1 ) s 1 E(x 2 ) 1 s θ s Γ(s)Γ(1 + α s)γ( = c 1 c α + s 1) 2 (q 1) α s+1 Γ( ), q > 1 for R(s) > 0, R(α s) > 0, R( α + s 1) > 0. Now the density of u is obtained by the inverse Mellin transform. The detailed existence conditions for Mellin and inverse Mellin transforms are available in Mathai (1993). 74
Thus g 1 (u) = 1 2πi = c+i c i c 1 c 2 (q 1) α+1 Γ u s E(u) s 1 1 c+i 2πi c i Γ(s)Γ(1 + α s)γ( q 1 α + s 1) ( ) θu s ds q 1 for R(s) > 0, R(α s) > 0, R( α s) > 0, q > 1. I 1(α,, q : θ) = [ 1 )H2,1 θ αγ( 1,2 (q 1) (0,α) (0,1),( q+1,α) ], q > 1. On substituting this we get, k P ( X i < Y ) = i=1 { [ q 1 α)h2,1 θ Γ( 1,2 (q 1) (0,α) (0,1),( q+1,α) ]} k, q > 1. Now for q < 1, we have to evaluate the integral I 2(α,, q : θ) = 0 e θx x α 1 [1 (1 q)x] 1 q dx. Let f 1 (x 1 ) = c 1 e θx 1, x 1 0 c 2 x α 1 2 [1 (1 q)x] 1 q for 1 (1 q)x > 0 and f 2 (x 2 ) = 0 otherwise where c 1 and c 2 are the normalizing constants. Then proceeding as in the case for q > 1, we have P ( k X i < Y ) = i=1 { [ 1 q 1)H2,1 θ Γ( 1 q + α + 1,2 (q 1) (0,α) (0,1),( αq+α,α) ]} k, q > 1. 75
4.5 Marshall-Olkin q-gamma Distribution Marshall and Olkin (1997) introduced a new family of survival functions which is constructed by adding a new parameter to an existing distribution. The introduction of a new parameter will result in flexibility in the distribution. Different first order autoregressive (AR(1)) time series models can be developed using the Marshall-Olkin Gamma distribution. Now we shall extend the q-gamma distributions to obtain a more general class called Marshall-Olkin q-gamma distribution using (??). Now consider the case q > 1. Substituting the q-gamma distribution function defined in F 4 (x) in (??), we get a new family of distributions called Marshall-Olkin q-gamma distribution (MO-q-G), whose distribution function and the survival function Ḡ1( )are given by G 1 (x; p, q, α, ) = B(x(q 1), α, α) pb(α, α) + (1 p)b(x(q 1), α,. (4.5.1) α) Ḡ 1 (x; p, q, α, ) = p [B(α, α) B(x(q 1), α, α)] pb(α, α) + (1 p)b(x(q 1), α,. (4.5.2) α) Then the p.d.f. corresponding to G 1 ( ) is given by g 1 (x; p, q, α, ) = p(q 1)α x α 1 [1 + (q 1)x)] B(α, α) pb(α, α) + (1 p)b(x(q 1), α,, x > 0, q > 1. α)2 Similarly when q < 1, the cdf is given by G 2 (x; p, q, α, ) = B(x(1 q), α, + 1) 1 q pb(α, + 1) + (1 p)b(x(1 q), α,. (4.5.3) + 1) 1 q 1 q 76
and the p.d.f. corresponding to G 2 ( ) is given by is given by g 2 (x; p, q, α, ) = p(1 q)α x α 1 [1 (1 q)x)] 1 q B(α, + 1) 1 q pb(α, + 1) + (1 p)b(x(1 q), α,, 0 < x < (1/1 q). + 1)2 1 q 1 q The hazard rate functions are given by h 1 (x) = pb(α, (q 1) α x α 1 [1 + (q 1)x)] B(α, α) α) + (1 p)u(x) (B(α, ) α) U(x) for q > 1 and h 2 (x) = pb(α, (1 q) α x α 1 [1 (1 q)x)] 1 q B(α, 1 q + 1) 1 q + 1) + (1 p)v (x) (B(α, ) + 1) V (x) 1 q where and U(x) = B(x(q 1), α, V (x) = B(x(1 q), α, q 1 α). 1 q + 1). Theorem 4.5.1. Let {X i, i 1} be a sequence of i.i.d. random variables with common distribution function F (x) and let N be a geometric random variable independently distributed of {X i } such that P [N = n] = θ(1 θ) n 1, n = 1, 2,..., 0 < θ < 1, which is for all i 1. Let U N = min 1 i N X i. Then {U N } is distributed as MO-q-G if and only if {X i } follows q-gamma distribution. Proof: The survival function of the random variable U N is H(x) = P (U N > x) = θ [ F (x)] n (1 θ) n 1 = n=1 θ F (x) 1 (1 θ) F (x). If X i has the survival function of the q-gamma distribution given by (6.2), then U N has the survival function of the MO-q-G distribution. The converse easily follows. 77
4.6 AR(1)models with q-gamma Marginal Distribution Tavares(1977, 1980) introduced two stationary Markov processes with similar structural forms which had applications in hydrological applications. Lewis and McKenzie (1991) discuss various aspects of first order autoregressive minification processes. In this section we develop autoregressive minification processes of order one and order k with minification structures where MO-q-G distribution is the stationary marginal distribution. We call the process as MO-q-G AR(1) process. Now we have the following theorem. Theorem 4.6.1. Consider an AR(1) structure given by ɛ n, w.p. p 1 X n = min(x n 1, ɛ n ), w.p. 1 p 1 0 < p 1 < 1 and {ɛ n } is a sequence of i.i.d. random variables independently distributed of X n. Then {X n } is a stationary Markovian AR(1) process with MO-q-G marginal if and only if {ɛ n } is distributed as q-gamma distribution. Proof: From the given structure it follows that F Xn (x) = p 1 Fɛn (x) + (1 p 1 ) F Xn 1 (x) F ɛn (x). Under stationary equilibrium, it reduces to F X (x) = On substituting the survival function F 1 (x) of ɛ, for q > 1, we get p 1 Fɛ (x) 1 (1 p 1 ) F ɛ (x). (4.6.1) F 1X (x) = p 1 [(α, α) (x(q 1), α, α] p 1 (α, α) + (1 p 1) (x(q 1), α, α) which resembles the survival function Ḡ1( ) of the MO-q-G distribution. Conversely, if 78
we take the survival function of the above form, we get the survival function of ɛ n as the q-gamma distribution under stationary equilibrium. Similar is the case when q < 1. The following theorem generalizes the results to a k th order autoregressive structure. Theorem 4.6.2. Consider an AR(k) structure given by ɛ n, w.p. p 0 X n = min(x n 1, ɛ n ), w.p. p 1 min(x n 2, ɛ n ), w.p. p 2.. min(x n k, ɛ n ), w.p. p k where {ɛ n } is a sequence of i.i.d. random variables independently distributed of X n, 0 < p i < 1, p 1 + p 2 + + p k = 1 p 0. Then the stationary marginal distribution of {X n } is MO-q-G if and only if {ɛ n } is distributed as q-gamma distribution. Proof: From the given structure the survival function is given as follows: F Xn (x) = p 0 Fɛn (x) + p 1 FXn 1 (x) F ɛn (x) + + p k FXn k (x) F ɛn (x). Under stationary equilibrium, this yields F X (x) = p 0 Fɛ (x) + p 1 FX (x) F ɛ (x) + + p k FX (x) F ɛ (x). This reduces to F X (x) = p 0 Fɛ (x) 1 (1 p 0 ) F ɛ (x). Then the theorem easily follows by similar arguments as in Theorem (4.6.1). For q > 1 and for q < 1, if we substitute the survival function of q-gama distribution we get the survival function of the MO-q-G distribution. The sample path of the minification process is given in figure 4.5. 79
Figure 4.5: Sample path of MO-q-gamma process for α = 1.7, p = 0.4, q = 1.2, = 1.8. 4.7 The Max-min AR(1) Processes Next we introduce the model called the max-min process given by Jose et al.(2008b). The structure is given as follows. Theorem 4.7.1. Consider an AR(1) structure given by X n = ɛ n, w.p. p 1 max(x n 1, ɛ n ), w.p. p 2 min(x n 1, ɛ n ), w.p. p 3 subject to the conditions 0 < p 1, p 2 < 1, p 2 < p 1 and p 1 + p 2 + p 3 = 1, where {ɛ n } is a sequence of i.i.d. random variables independently distributed of X n. Then {X n } is a stationary Markovian AR(1) max-min process with q-gamma stationary marginal distribution if and only if {ɛ n } follows Marshall-Olkin q-gamma distribution. 80
Proof: From the given structure it follows that P (X n > x) = p 1 P (ɛ n ) > x) + p 2 P (max(x n 1, ɛ n ) > x) + p 3 P (min(x n 1, ɛ n ) > x) On simplification we get, ] P (X n > x) = p 1 P (ɛ n ) > x) + p 2 [1 P (max(x n 1, ɛ n ) x) + p 3 FXn 1 (x) F ɛn (x) ] = p 1 Fɛn (x) + p 2 [1 (1 F Xn 1 (x))(1 F ɛn (x)) + p 3 FXn 1 (x) F ɛn (x). Under stationary equilibrium, F ɛ (x) = = p 1 +p 3 p 1 +p 2 FX (x) 1 (1 p 1+p 3 p 1 +p 2 ) F X (x) p FX (x) 1 (1 p ) F X (x) This has the same functional form of Marshall-Olkin survival function.substituting for F X (x) we can obtain the survival function of ɛ n. F ɛ (x) = (α, which is the required MO-q-G function. procedure as in theorem (4.6.1). p [(x(q 1), α, α] α) + (1 p ) (x(q 1), α, α) The converse can be proved using the same 81
4.8 Conclusions In this chapter we introduced the q- analogue of gamma distribution and studied its properties. Estimation problems are addressed. Its application in stress- strength analysis is discussed. The goodness of fit of the new distribution also tested. It is generalized using Marshall -olkin procedure and minification amd max-min model developed. References Alice, T., Jose, K.K. (2003). Marshall-Olkin Pareto Processes. Far East Journal of Theoretical Statistics, 9, 117 132. Alice, T., Jose, K.K. (2004). Bivariate semi-pareto minification processes. Metrika, 59, 305 313. Alice, T., Jose, K.K. (2005a). Marshall-Olkin Semi-Weibull Minification Processes. Recent Advances in Statistical Theory and Applications, I, 6 17. Alice, T. and Jose, K.K. (2005b). Marshall-Olkin logistic processes.stars Int. Journal, 6, 1 11. Gross,A.J.,Clark,V. (1975). Survival distributions: Reliability applications in the Biomedical Sciences. Wiley, New York. Jose, K.K. and Naik, S.R., (2009). On the q-weibull Distribution and Its Applications. Comm. Stat. Theory Methods, 38, 912-926. Jose, K.K., Naik, S.R. and Ristic, M.M. (2008). Marshall-Olkin q-weibull Distribution and Max-Min Processes.Statistical Papers, doi:10.1007/s00362-008-0173-9. Kotz, S., Lumelskii, Y. and Pensky, M. (2007). Stress-strength Model and Its Generalizations. World Scientific Publishing, New York. Marshall, A.W., Olkin, I. (1997). A new method for adding a parameter to a family of distributions with application to the exponential and Weibull families. 82
Biometrika, 84, 641-652. Mathai, A.M. (1993). A Handbook of Generalized Special Functions for Statistical and Physical Sciences. Oxford University Press, Oxford. Mathai, A.M. (1997). Jacobians of Matrix Transformations and Functions of Matrix Argument. World Scientific Publishing, New York. Mathai, A.M. (2005). A pathway to matrix-variate gamma and normal densities. Linear Algebra and Its Applications, 396, 317-328. Mathai, A.M., and Haubold, H.J. (2007). Pathway model, superstatistics, Tsallis statistics and a generalized measure of entropy. Physica A, 375, 110-122. Mathai, A.M. and Provost, S.B. (2006). On q-logistic and related distributions. IEEE Transactions on Reliability, 55(2), 237-244. Ristic, M.M., Jose, K.K. and Ancy, J. (2007). A Marshall-Olkin gamma distribution and minification process, STARS: International Journal (Sciences), 1(2). Tsallis, C. (1988). Possible generalizations of Boltzmann-Gibbs statistics. Journal of Statistical Physics, 52, 479-487. Wilk, G. and Wlodarczyk, Z. (2000). Interpretation of the nonextensivity parameter q in some applications of Tsallis statistics and Lévy distributions. Phys. Rev. Lett, 84, 2770-2773. Wilk, G. and Wlodarczyk, Z. (2001). Non-exponential decays and nonextensivity. Physica A, 290, 55-58. Tavares, V.L. (1977). The exact distribution of extremes of a non-gaussian process. Stochastic Processes and Their Applications, 5, 151-156. Tavares, V.L. (1980). An exponential Markovian stationary process, J. Appl. Prob., 17, 1117-1120. 83