On nonlinear weighted least squares fitting of the three-parameter inverse Weibull distribution

Similar documents
Estimation Under Multivariate Inverse Weibull Distribution

PROPERTIES OF THE GENERALIZED NONLINEAR LEAST SQUARES METHOD APPLIED FOR FITTING DISTRIBUTION TO DATA

A Quasi Gamma Distribution

Quantile Approximation of the Chi square Distribution using the Quantile Mechanics

Key Words: survival analysis; bathtub hazard; accelerated failure time (AFT) regression; power-law distribution.

Method of Conditional Moments Based on Incomplete Data

A NOTE ON Q-ORDER OF CONVERGENCE

INVERTED KUMARASWAMY DISTRIBUTION: PROPERTIES AND ESTIMATION

The Log-generalized inverse Weibull Regression Model

The existence theorem for the solution of a nonlinear least squares problem

Statistical Inference on Constant Stress Accelerated Life Tests Under Generalized Gamma Lifetime Distributions

Alpha-Skew-Logistic Distribution

Parameters Estimation for a Linear Exponential Distribution Based on Grouped Data

Prediction-based adaptive control of a class of discrete-time nonlinear systems with nonlinear growth rate

ORDER RESTRICTED STATISTICAL INFERENCE ON LORENZ CURVES OF PARETO DISTRIBUTIONS. Myongsik Oh. 1. Introduction

BAYESIAN ESTIMATION OF THE EXPONENTI- ATED GAMMA PARAMETER AND RELIABILITY FUNCTION UNDER ASYMMETRIC LOSS FUNC- TION

19.1 Problem setup: Sparse linear regression

Analysis of Gamma and Weibull Lifetime Data under a General Censoring Scheme and in the presence of Covariates

Multistate Modeling and Applications

On Maximisation of the Likelihood for the Generalised Gamma Distribution

Step-Stress Models and Associated Inference

Fundamentals of Reliability Engineering and Applications

p-birnbaum SAUNDERS DISTRIBUTION: APPLICATIONS TO RELIABILITY AND ELECTRONIC BANKING HABITS

THE BEST LEAST ABSOLUTE DEVIATION LINEAR REGRESSION: PROPERTIES AND TWO EFFICIENT METHODS

STAT 6350 Analysis of Lifetime Data. Probability Plotting

UTILIZING PRIOR KNOWLEDGE IN ROBUST OPTIMAL EXPERIMENT DESIGN. EE & CS, The University of Newcastle, Australia EE, Technion, Israel.

Parametric Evaluation of Lifetime Data

Inequalities Relating Addition and Replacement Type Finite Sample Breakdown Points

Hacettepe Journal of Mathematics and Statistics Volume 45 (5) (2016), Abstract

SOLUTION FOR HOMEWORK 8, STAT 4372

STATISTICAL INFERENCE IN ACCELERATED LIFE TESTING WITH GEOMETRIC PROCESS MODEL. A Thesis. Presented to the. Faculty of. San Diego State University

The Inverse Weibull Inverse Exponential. Distribution with Application

Burr Type X Distribution: Revisited

A Mixture of Modified Inverse Weibull Distribution

Continuous Univariate Distributions

Machine Learning Lecture Notes

Some Theoretical Properties and Parameter Estimation for the Two-Sided Length Biased Inverse Gaussian Distribution

UPPER DEVIATIONS FOR SPLIT TIMES OF BRANCHING PROCESSES

THE WEIBULL GENERALIZED FLEXIBLE WEIBULL EXTENSION DISTRIBUTION

Research Article Multiple-Decision Procedures for Testing the Homogeneity of Mean for k Exponential Distributions

Convergence of generalized entropy minimizers in sequences of convex problems

Inference for P(Y<X) in Exponentiated Gumbel Distribution

Survival Analysis. Stat 526. April 13, 2018

Survival Analysis: Weeks 2-3. Lu Tian and Richard Olshen Stanford University

Breakdown points of Cauchy regression-scale estimators

Analysis of Middle Censored Data with Exponential Lifetime Distributions

Infinitely Imbalanced Logistic Regression

"ZERO-POINT" IN THE EVALUATION OF MARTENS HARDNESS UNCERTAINTY

Spectral gradient projection method for solving nonlinear monotone equations

PENALIZED LIKELIHOOD PARAMETER ESTIMATION FOR ADDITIVE HAZARD MODELS WITH INTERVAL CENSORED DATA

Beyond GLM and likelihood

Statistics for Engineers Lecture 4 Reliability and Lifetime Distributions

Estimates for probabilities of independent events and infinite series

Generalized Neyman Pearson optimality of empirical likelihood for testing parameter hypotheses

Further results involving Marshall Olkin log logistic distribution: reliability analysis, estimation of the parameter, and applications

The Marshall-Olkin Flexible Weibull Extension Distribution

Beta-Linear Failure Rate Distribution and its Applications

PARAMETER ESTIMATION AND ORDER SELECTION FOR LINEAR REGRESSION PROBLEMS. Yngve Selén and Erik G. Larsson

Some Properties of the Augmented Lagrangian in Cone Constrained Optimization

Quasi-Newton Methods

A derivative-free nonmonotone line search and its application to the spectral residual method

A NOTE ON A DISTRIBUTION OF WEIGHTED SUMS OF I.I.D. RAYLEIGH RANDOM VARIABLES

ON THE DYNAMICAL SYSTEMS METHOD FOR SOLVING NONLINEAR EQUATIONS WITH MONOTONE OPERATORS

An almost sure central limit theorem for the weight function sequences of NA random variables

Confidence regions and intervals in nonlinear regression

Weighted Sums of Orthogonal Polynomials Related to Birth-Death Processes with Killing

Comparison between five estimation methods for reliability function of weighted Rayleigh distribution by using simulation

On the acceleration of augmented Lagrangian method for linearly constrained optimization

The Newton-Raphson Algorithm

Weighted Inverse Weibull Distribution: Statistical Properties and Applications

On the Comparison of Fisher Information of the Weibull and GE Distributions

Trigonometric Recurrence Relations and Tridiagonal Trigonometric Matrices

From semi- to non-parametric inference in general time scale models

Nontrivial Solutions for Boundary Value Problems of Nonlinear Differential Equation

Statistical and Learning Techniques in Computer Vision Lecture 2: Maximum Likelihood and Bayesian Estimation Jens Rittscher and Chuck Stewart

MATH 205C: STATIONARY PHASE LEMMA

Robust Parameter Estimation in the Weibull and the Birnbaum-Saunders Distribution

arxiv: v1 [math.st] 26 Jun 2011

ON A HYBRID PROXIMAL POINT ALGORITHM IN BANACH SPACES

Lecture 22 Survival Analysis: An Introduction

1 Glivenko-Cantelli type theorems

FULL LIKELIHOOD INFERENCES IN THE COX MODEL

Estimating Gaussian Mixture Densities with EM A Tutorial

Optimum Test Plan for 3-Step, Step-Stress Accelerated Life Tests

Topp-Leone Inverse Weibull Distribution: Theory and Application

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.

COMPOSITE RELIABILITY MODELS FOR SYSTEMS WITH TWO DISTINCT KINDS OF STOCHASTIC DEPENDENCES BETWEEN THEIR COMPONENTS LIFE TIMES

Estimation of Gutenberg-Richter seismicity parameters for the Bundaberg region using piecewise extended Gumbel analysis

GLOBAL ATTRACTIVITY IN A NONLINEAR DIFFERENCE EQUATION

IMPROVEMENTS OF COMPOSITION RULE FOR THE CANAVATI FRACTIONAL DERIVATIVES AND APPLICATIONS TO OPIAL-TYPE INEQUALITIES

Exponentiated Rayleigh Distribution: A Bayes Study Using MCMC Approach Based on Unified Hybrid Censored Data

ECE 275A Homework 7 Solutions

On the Convolution Order with Reliability Applications

A MODIFIED WEIBULL HAZARD RATE AS GENERATOR OF A GENERALIZED MAXWELL DISTRIBUTION

Truncated Life Test Sampling Plan Under Odd-Weibull Distribution

POINT VALUES AND NORMALIZATION OF TWO-DIRECTION MULTIWAVELETS AND THEIR DERIVATIVES

Comparative Distributions of Hazard Modeling Analysis

Noninformative Priors for the Ratio of the Scale Parameters in the Inverted Exponential Distributions

A New Two Sample Type-II Progressive Censoring Scheme

Parameter addition to a family of multivariate exponential and weibull distribution

Transcription:

MATHEMATICAL COMMUNICATIONS 13 Math. Commun., Vol. 15, No. 1, pp. 13-24 (2010) On nonlinear weighted least squares fitting of the three-parameter inverse Weibull distribution Dragan Juić 1 and Darija Marović 1, 1 Department of Mathematics, University of Osije, Trg Ljudevita Gaja 6, HR-31 000 Osije, Croatia Received January 5, 2009; accepted April 14, 2009 Abstract. In this paper we consider nonlinear least squares fitting of the three-parameter inverse Weibull distribution to the given data (w i, t i, y i), i = 1,..., n, n 3. As the main result, we show that the least squares estimate exists provided that the data satisfy just the following two natural conditions: (i) 0 < t 1 < t 2 <... < t n and (ii) 0 < y 1 < y 2 <... < y n < 1. To this end, an illustrative numerical example is given. AMS subject classifications: 65D10, 62N05, 62J02, 62G07 Key words: three-parameter inverse Weibull distribution, data fitting, nonlinear least squares, least squares estimate, existence problem 1. The three-parameter inverse (or reverse) Weibull distribution Let X denote the three-parameter Weibull model with distribution function F W (t; α, β, ) = 1 e ( t α )β, t > α 0, t α. where α 0 is a location parameter, γ > 0 the scale, and β > 0 the shape parameter (see e.g. Murthy et al. [17]). Then a random variable T defined by has the distribution function T = α + 2 X α F (t; α, β, ) = e ( t α )β, t > α 0, t α (1) Presented at the Sixth Conference on Applied Mathematics and Scientific Computing, Zadar, Croatia, September 2009. This wor is supported by the Ministry of Science, Education and Sports, Republic of Croatia, through research grant 235-2352818-1034. Corresponding author. Email addresses: juicd@mathos.hr (D. Juić), darija@mathos.hr (D. Marović) http://www.mathos.hr/mc c 2010 Department of Mathematics, University of Osije

14 D. Juić and D. Marović and the density function f(t; α, β, ) = ( β ) β+1 t α e ( t α )β, t > α 0, t α. (2) The distribution of T is nown as the three-parameter inverse Weibull distribution (IWD). If α = 0, the resulting distribution is called a two-parameter IWD. Drapella [8] calls the IWD the complementary Weibull distribution, while Mudholar and Kollia [16] call it the reciprocal Weibull distribution. The IWD is also nown as a type 2 extreme value or the Fréchet distribution (Johnson et al. [10]). For more details on the IWD, see e.g. Johnson et al. [10] and Murthy et al. [17]. The IWD plays an important role in reliability and lifetime studies (see [15, 17, 18]). The reliability function R(t) and the hazard (failure rate) function h(t) for the IWD are given by R(t; α, β, ) = 1 F (t; α, β, ) = 1 e ( t α )β, t > α and t α )β f(t; α, β, ) h(t; α, β, ) = 1 F (t; α, β, ) = ββ (t α) β 1 e (, t > α. 1 e ( t α )β It can be shown that the hazard function is similar to that of the log-normal and inverse Gaussian distributions. The IWD is very flexible and by an appropriate choice of the shape parameter β the density curve can assume a wide variety of shapes (see Figure 1). The density function is strictly increasing on (α, t m ] and strictly decreasing on [t m, ), where t m = α + (1 + 1/β) 1/β. This implies that the density function is unimodal with the maximum value at t m. This is in contrast to the standard Weibull model where the shape is either decreasing (for β 1) or unimodal (for β > 1). When β = 1, the IWD becomes an inverse exponential distribution; when β = 2, it is identical to the inverse Rayleigh distribution; when β = 0.5, it approximates the inverse Gamma distribution. That is the reason why the IWD is one of the most widely used models in reliability and lifetime studies. f(t) β=0.5 β=3 β=2 β=1 t Figure 1. Plots of the inverse Weibull density for some values of β and by assuming α = 0 and = 1.2

Least squares fitting inverse Weibull distribution 15 There is no unique way to perform density reconstruction from the observed data and many different methods have been proposed in the literature. The maximum lielihood (ML) method is a traditional method since it possesses beneficial properties such as asymptotic normality and consistency. Assuming n independent observations t 1,..., t n from the density p(t; θ), the lielihood function of this sample is given by L(θ) = n p(t i; θ). The maximum lielihood estimate (MLE) of parameters θ is the value ˆθ that maximizes the lielihood function. But the MLE does not necessarily exist, and it is not necessarily unique. For the two-parameter IWB a standard MLE exists and it is unique (see e.g. Calabria and Pulcini [3]). Now we are going to show that for the three-parameter IWB the lielihood function L(α, β, ) = n f(t i ; α, β, ) = n β ( ) β+1 e ( t i α t i α )β is unbounded from above so that a standard MLE does not exist. In order to verify this, we may assume, without loss of generality, that 0 < t 1 < t 2 <... < t n. Fix (0, ). Then n L(t 1 β n+1 β (, β, ) = Since and β 0 i=2 t i t 1 + β n+1 ) β+1 e ( = 1 n ( ) β ( e β β n+1 n βn+1 )β ( ) β ( e β 0 β n+1 n ( ) β+1 ( e t i t 1 + β n+1 t i t 1 +βn+1 )β ( ) β+1 ( e t i t 1 + β n+1 i=2 βn+1 )β = e 1 t i t 1 +β n+1 )β = ( e ) n 1 n i=2 t i t 1 +β n+1 )β (3) 1 t i t 1, from (3) it follows that β 0 L(t 1 β n+1, β, ) =, and hence the MLE does not exist. In the literature, considerable effort has been devoted to such difficulties with the maximum lielihood approach (see e.g. Cheng and Iles [4], Smith and Naylor [21]). There are several other statistical methods for estimating model parameters such as the method of moments, the method of percentile and the Bayesian method. Unfortunately, none of these methods (excluding Bayesian) is appropriate for small data sets (see e.g. Lawless [15], Murthy et al. [17], Nelson [18]). A very popular method for parameter estimation is the least squares (LS) method. The method can be described as follows: Suppose we are given the data (w i, t i, y i ), i = 1,..., n, n > 3, where t i denotes the values of the independent variable, y i are the respective measured function values and w i > 0 are the data weights which describe the assumed relative accuracy of the data. The unnown parameters α, β and of function (1) have to be estimated by minimizing the functional S(α, β, ) = n [ ] 2 w i F (ti ; α, β, ) y i = n t i α w i y 2 i + n [ w i t i >α ] e ( 2 t i α )β y i

16 D. Juić and D. Marović on the set P := (α, β, ) R 3 : α 0; β, > 0 }. A point (α, β, ) P such that S(α, β, ) = inf S(α, β, ) (α,β,) P is called the least squares estimate (LSE), if it exists (see Björc [2], Gill et al. [9], Ross [19], Seber and Wild [20]). Numerical methods for solving the nonlinear LS problem are described in Dennis and Schnabel [7] and Gill et al. [9]. Before the iterative minimization of the sum of squares it is still necessary to as whether the least squares estimate (LSE) exists. In the case of nonlinear LS problems it is still extremely difficult to answer this question (see Bates and Watts [1], Björc [2], Demideno [5, 6], Hadeler et al [12], Juić et al. [11, 13, 14]). The problem of nonlinear weighted LS fitting of the three-parameter Weibull distribution to the given data (w i, t i, y i ), i = 1,..., n, is considererd by Juić et al. [11]. They showed that the LSE exists provided that the data satisfy just the following two conditions: (i) 0 < t 1 < t 2 <... < t n and (ii) 0 < y 1 < y 2 <... < y n < 1. Since the Weibull random variable T is nonegative and since numbers y i usually denote empirical CDF values, these two conditions are natural. Surprisingly, in spite of the many papers on Weibull models, we have not managed to find any paper dealing with the existence problem of a solution to a nonlinear LS problem for the three-parametric inverse Weibull distribution. The structure of the paper is as follows. In Section 2 we present our main result (Theorem 1) which guarantees the existence of the LSE for the three-parametric inverse Weibull distribution, provided the data satisfy conditions (i) and (ii). An illustrative numerical example is given in Section 3. 2. The existence theorem Before starting with the proof of Theorem 1, we need some preinary results. Lemma 1. Suppose we are given the data (w i, t i, y i ), i = 1,..., n, n > 3, such that (i) 0 < t 1 < t 2 <... < t n, (ii) 0 < y 1 < y 2 <... < y n < 1 and w i > 0, i = 1,..., n. Given any two real numbers θ 0 and A 0, let Σ θ,a := w i yi 2 + w i (y i A) 2. t i<θ t i>θ Then there exists a point in P at which functional S attains a value less than Σ θ,a. The summation (or ) is to be understood as follows: The sum over those t i >θ t i <θ indices i n for which t i > θ (or t i < θ). If there are no such points t i, the sum is empty; following the usual convention, we define it to be zero.

Least squares fitting inverse Weibull distribution 17 Proof. It is easy to verify by definition of Σ θ,a that Σ 0,A Σ t1,a for any A 0 and that Σ θ,a Σ θ,yn > Σ tn 1,y n for all θ > t n 1, A 0. Thus, it will suffice to consider the case θ [t 1, t n 1 ]. Assume that θ [t 1, t n 1 ]. Let 1,..., n 1} such that θ (t 1, t ], where t 0 = 0 by definition. Now, define y ξ 0 :=, if θ = t 1 2 (y 1 + y ), if θ t. Furthermore, let a point (τ 1, ξ 1 ) be defined in the following way: w i y i t ξ 1 := i >θ tl, if y l = ξ 1 and τ 1 := t w l 1 +t l i 2, if y l > ξ 1, where t i >θ l := mini : y i ξ 1 }. Since the sequences t i } n and y i} n are strictly increasing, it is easy to show that θ < τ 1 and ξ 0 < ξ 1. Now, we are going to construct a class of inverse Weibull distributions whose graph will contain points (θ, ξ 0 ) and (τ 1, ξ 1 ). For this purpose, we first define a function K : (0, ) (0, ) by formula K(β) := ( ln ξ1 ln ξ 0 ) 1/β 1 ( ln ξ1 ln ξ 0 ) 1/β. Function K is strictly increasing on R, β 0 K(β) = 0 and β K(β) =. Since τ 1 > θ > 0, there exists B > 0 such that θ K(B)(τ 1 θ) = 0, so that for all β (0, B), θ K(β)(τ 1 θ) > 0. Note that (α(β), β, (β)) := ( θ K(β)(τ 1 θ), β, ( ln ( 1 ξ 1 )) 1/β ( τ 1 θ + K(β)(τ 1 θ)) ) P for all β (0, B). Let us now associate with each real β (0, B) an inverse Weibull distribution ( ) ( ) β 1 τ ln 1 θ+k(β)(τ 1 θ) F (t; α(β), β, (β)) = e ξ 1 t θ+k(β)(τ 1 θ), t > θ K(β)(τ1 θ) 0, t θ K(β)(τ 1 θ). It is not difficult to show that F (θ; α(β), β, (β)) = ξ 0, F (τ 1 ; α(β), β, (β)) = ξ 1 (4)

18 D. Juić and D. Marović and F (t; α(β), β, (β)) = ξ 1, t > θ. (5) β 0 Note that only one of the following two cases can occur: (i) θ < t n 1, or (ii) θ = t n 1. Case (i): θ < t n 1. Let β 0 be an arbitrary but fixed point of (0, B) such that θ K(β)(τ 1 θ) > t 1 0. Since θ > t 1 0 and β 0 K(β) = 0, such β 0 exists. Then for all β (0, β 0 ) we have F (t i ; α(β), β, (β)) = 0, t i < θ. (6) The function t F (t; α(β), β, (β)), β (0, β 0 ), is strictly increasing on the interval (θ K(β)(τ 1 θ), ). Due to this fact and (5), we may assume that β 0 > 0 is sufficiently small, so that for all β (0, β 0 ), y i < F (t i ; α(β), β, (β)) < ξ 1, if θ < t i < τ 1 y i = F (t i ; α(β), β, (β)) = ξ 1, if t i = τ 1 (7) ξ 1 < F (t i ; α(β), β, (β)) < y i, if t i > τ 1. Since θ < t n 1, there are at least two indices i for which t i > θ. Furthermore, since the sequence y i } n is strictly increasing, the equality y i = ξ 1 can hold for at most one index i. Hence, for every β (0, β 0 ), it follows from (6) and (7) that for every β (0, β 0 ), S(α(β), β, (β)) = t i<θ < t i <θ t i <θ n [ ] 2 w i F (ti ; α(β), β, (β)) y i [ ] 2 [ ] 2 w i F (ti ; α(β), β, (β)) y i + w i F (ti ; α(β), β, (β)) y i w i y 2 i + t i >θ w i (y i ξ 1 ) 2 w i y 2 i + t i >θ w i (y i A) 2 = Σ θ,a. t i>θ The last inequality follows directly from a well-nown fact that the quadratic function x t w i>θ i(y i x) 2 attains its minimum t w i>θ i(y i ξ 1 ) 2 at point ξ 1 = t w i>θ iy i / t w i>θ i. Case (ii): θ = t n 1. First note that in this case it must be τ 1 = t n and ξ 1 = y n. Hence Σ tn 1,A = n 2 w iyi 2. Let β 1 be a point of (0, B) such that t n 1 K(β 1 )(t n t n 1 ) = t n 2. Then, since K is a strictly increasing function, for every β (β 1, B) we have that Thus F (t n 2 ; α(β), β, (β)) = e ln ( t n 1 K(β)(t n t n 1 ) < t n 2. 1 yn ) ( ) β tn t n 1 +K(β)(tn t n 1 ) t n 2 t n 1 +K(β)(tn t n 1 ), β (β1, B),

Least squares fitting inverse Weibull distribution 19 from where it follows that F (t n 2 ; α(β), β, (β)) 0 as β β 1 from the right. Then by definition of the it there exists a point ˆβ (β 1, B) such that 0 < F (t n 2 ; α( ˆβ), β, ( ˆβ)) < y n 2. (8) Without loss of generality, we may suppose that ˆβ is sufficiently close to β 1, so that 0 t n 3 < t n 1 K( ˆβ)(t n t n 1 ) < t n 2. Then As shown earlier (see (4)), F ( t i ; α( ˆβ), β, ( ˆβ) ) = 0, i = 1,..., n 3. (9) F ( t n 1 ; α( ˆβ), ˆβ, ( ˆβ) ) = y n 1 and F ( t n ; α( ˆβ), ˆβ, ( ˆβ) ) = y n. (10) From (8), (9) and (10) it follows that S(α( ˆβ), ˆβ, ( ˆβ)) < n 1 w iyi 2. This completes the proof of the lemma. Now we state our main result (Theorem 1) which guarantees the existence of the LSE for the three-parameter IWD. This theorem is also applicable in a classical nonlinear regression problem with the model function of the form (1). Theorem 1. Let the data (w i, t i, y i ), i = 1,..., n, n > 3, be such that (i) 0 < t 1 < t 2 <... < t n, (ii) 0 < y 1 < y 2 <... < y n < 1 and w i > 0, i = 1,..., n. Then, there exists the LSE for the three-parametric inverse Weibull distribution. Proof. Since functional S is nonnegative, there exists S := inf (α,β,) P S(α, β, ). It should be shown that there exists a point (α, β, ) P such that S(α, β, ) = S. Let (α, β, ) be a sequence in P, such that S = S(α, β, ) = n [ ] 2 w i F (ti ; α, β, ) y i (11) Without loss of generality, in further consideration we may assume that sequences (α ), (β ) and ( ) are monotone. This is possible because the sequence (α, β, ) has a subsequence (α l, β l, l ), such that all its component sequences are monotone; and since S(α l, β l, l ) = S(α, β, ) = S. Since each monotone sequence of real numbers converges in the extended real number system R, denote α := α, β := β, :=. Note that 0 α, β,, because (α, β, ) P.

20 D. Juić and D. Marović To complete the proof it is enough to show that (α, β, ) P, i.e. that 0 α < and β, (0, ). The continuity of the functional S will then imply that S = S(α, β, ) = S(α, β, ). It remains to show that (α, β, ) P. The proof will be done in five steps. In step 1 we will show that α. In step 2 we will show that 0. The proof that will be done in step 3. In step 4 we prove that β. Finally, in step 5 we show that β 0. Step 1. If α =, then F (t i ; α, β, ) = 0 for every sufficiently great N, and therefore from (11) it follows that S = n w iyi 2. Since n w iyi 2 > Σ tn 1,y n and since according to Lemma 1 there exists a point in P at which functional S attains a value smaller than Σ tn 1,y n, this means that in this way (α = ) functional S cannot attain its infimum. Thus, we have proved that α. Step 2. Let us show that 0. We prove this by contradiction. Suppose on the contrary that = 0. Then only one of the following two cases can occur: (i) = 0 and β = 0 or (ii) = 0 and β > 0. Now, we are going to show that functional S cannot attain its infimum in either of these two cases, which will prove that 0. Case (i): = 0 and β = 0. Since 0, for every sufficiently great N, 0 < β < 1. This means that sequence ( β ) is bounded. We may assume that it is convergent. Let β in this case we would have L [0, 1]. Since ( ) β = t α β (t α ) β = L, t > α, F (t; α, β, ) = 0, if t < α e L, if t > α (12) and hence from (11) it would follow that S w i yi 2 + w i (e L y i ) 2 = Σ α,e L. t i<α t i>α According to Lemma 1, there exists a point in P at which functional S attains a value smaller than Σ α,e L. This means that in this case functional S cannot attain its infimum. Case (ii): = 0 and β > 0. In this case we would have and therefore ( ) β = 0, t α t α F (t; α, β, ) = 1, t α. Arguing now as in case (i), it can be shown that S Σ α,1. Again, according to Lemma 1, we conclude that in this case functional S cannot attain its infimum. Thus, we have proved that 0. Step 3. Now, we are going to show that. We prove this by contradiction. Suppose =. Then, without loss of generality, we may assume that > 1 for

Least squares fitting inverse Weibull distribution 21 all N. Since in that case β 1, only one of the following can occur: (i) = and β or (ii) = and β L [1, ). Let us show that functional S cannot attain its infimum in either of these two cases, which will prove that 0. Case (i): = and β. In this case we have and therefore F (t; α, β, ) = Indeed, if t > α and β = 0, then ( t α ) β =, t > α e ( t α ) β = 0, t > α. ( ) β = t α β (t α ) β = 1 =. ( If t > α and β > 0, then obviously ) β t α =. Case (ii): = and β L [1, ). In this case, sequence (β ) must converge to 0, because by assumption. Therefore, 0, if t < α F (t; α, β, ) = e L, if t > α. Arguing now similarly to case (i) from step 2, in both cases it can be shown that in this way ( = 0) functional T cannot attain its infimum. So far, we have shown that 0 α < and 0 < <. By using this, in the next two steps we will show that 0 < β <. Step 4. Let us show that β. To see this, suppose on the contrary that β =. Then ( ) β, if α = < t < α + t α 0, if t > α + and therefore 0, if α F (t; α, β, ) = < t < α + 1, if t > α +. Arguing now similarly to case (i) from step 2, it can be shown that S Σ α +,1. By Lemma 1, there exists a point in P at which functional S attains a value smaller than Σ α +,1. Therefore, in this way (β = ) functional S cannot attain its infimum. Thus, we proved that β <. Step 5. It remains to be shown that β 0. If β 0, then and therefore ( t α ) β = 1, t > α 0, if t < α F (t; α, β, ) = e 1, if t > α. Arguing similarly to case (i) from step 2, it can be shown that S Σ α,1 e 1. According to Lemma 1, in this way functional S cannot attain its infimum. Thus, we proved that β > 0 and herewith we completed the proof.

22 D. Juić and D. Marović Remar 1. Given 1 p <, let S p (α, β, ) = n w i F (ti ; α, β, ) y i p. Arguing in a similar way as in proofs of Lemma 1 and Theorem 1, it can be easily shown that there exists a point (α p, β p, p) P such that S p (α p, β p, p) = inf (α,β,) P S p (α, β, ). 3. Numerical illustration The three-parameter IWB has been extensively used in modelling failure times. For example, consider a real data set from Murthy et al. [17, p. 291] concerning failure of the photocopier cleaning web. The observed 14 failure times (in days) are displayed in Table 1, where t i denotes the i th failure time. Suppose that the failure time T is a random variable following an inverse Weibull distribution (1). i 1 2 3 4 5 6 7 8 9 10 11 12 13 14 t i 99 269 166 159 194 100 95 245 56 36 66 69 26 31 t (i) 26 31 36 56 66 69 95 99 100 159 166 194 245 269 Table 1. Failure times t i and the order statistics t (i) There are many different ways of computing the empirical cumulative distribution function ˆF corresponding to the sample data t 1,..., t n. They all involve arranging the data in an ascending order so that t (1) < t (2) <... < t (n), which is also shown in Table 1. Most commonly used estimators can be expressed in the following form (see Lawless [15] and Nelson [18]): ˆF (t (i) ) = i c n + 1 2c =: y i, 0 c < 1. (13) Some alternatives are as follows: y i = i n+1 (mean ran estimator, c = 0), y i = i 0.5 n (median ran estimator, c = 0.5), y i = i 0.3 n+0.4 (Benard s median ran estimator, c = 0.3). Our data for least squares estimation are ( w i, t (i), y i ) ), i = 1,..., 14, where w i > 0 are data weights, t (i) are the values from Table 1 and y i are calculated by using (13). It is easy to verify that these data satisfy the conditions of Theorem 1, therefore an LSE exists. Numerical methods for minimizing the sum of squares require an initial approximation (α 0, β 0, 0 ) P, which needs to be as good as possible. We suggest to do this in the following way: For α 0 tae t (1) /2, and then calculate β 0 and 0 by transforming Weibull distribution F (t; α 0, β, ) to the form y = ln [ ln F (t; α 0, β, ) ] = β ln(t α 0 ) β ln and then fit a straight line y on x = ln(t α 0 ) using least squares. The above transformation was first proposed by Drapella [8]. A plot of y versus x is called the inverse Weibull probability paper (IWPP) plot.

Least squares fitting inverse Weibull distribution 23 In this numerical illustration we are going to use mean ran, median ran and Benard s approach. For all weights w i we too 1. Our results obtained by using the Levenberg-Marquart method (see e.g. Dennis and Schnabel [7]) are given in Table 2. Approach Parameter estimates Sum of squares (SS) mean ran α = 0, β = 1.11430, = 68.7775 SS = 0.0223877 median ran α = 0, β = 1.22757, = 71.0390 SS = 0.0249334 Benard α = 0, β = 1.17909, = 70.1115 SS = 0.0238659 Table 2. Least squares parameter estimates based on failure times Murthy et al. [17] used a mean ran estimator and fitted a two-parameter IWD to the data and obtained the following estimate of unnown parameters: ˆβ = 1.23, ˆ = 61.8. The corresponding sum of squares SS = 0.0528198 is greater than the one we obtained. The reason for this is that they used the so-called Weibull probability paper (WPP) plot. In Figure 2 we show the data ( t (i), ˆF (t (i) ) ), i = 1,..., 14, calculated by using a mean ran estimator, the graph of function F (t; α, β, ) and the graph of function F (t; 0, ˆβ, ˆ). 1 0.8 0.6 0.4 0.2 50 100 150 200 250 300 Figure 2. The data, F (t; α, β, ), - - - F (t; 0, ˆβ, ˆ) References [1] D. M. Bates, D. G. Watts, Nonlinear Regression Analysis and Its Applications, Wiley, New Yor, 1988. [2] Å. Björc, Numerical Methods for Least Squares Problems, SIAM, Philadelphia, 1996. [3] R. Calabria, G. Pulcini, On the maximum lielihood and least-squares estimation in the inverse Weibull distribution, Statist. Appl. 2(1990), 53-63. [4] R. C. H. Cheng, T. C. Iles, Embedded models in three-parameter distributions and their estimation J. R. Stat. Soc. Ser. B 52(1990), 135-149. [5] E. Z. Demideno, Criteria for global minimum of sum of squares in nonlinear regression, Comput. Statis. Data Anal. 51(2006), 1739-1753. [6] E. Z. Demideno, Is this the least squares estimate? Biometria 87(2000), 437-452. [7] J. E. Dennis, R. B. Schnabel, Numerical Methods for Unconstrained Optimization and Nonlinear Equations, SIAM, Philadelphia, 1996.

24 D. Juić and D. Marović [8] A. Drapella, Complementary Weibull distribution: Unnown or just forgotten, Quality and Reliability Engineering International 9(1993), 383-385. [9] P. E. Gill, W. Murray, M. H. Wright, Practical Optimization, Academic Press, London, 1981. [10] N. L. Johnson, S. Kotz, N. Balarishnan, Continuous Univariate Distributions, Vol.2, second edition, John Wiley & Sons, New Yor, 1995. [11] D. Juić, R. Scitovsi, M. Benšić, On the existence of the nonlinear weighted least squares estimate for a three-parameter Weibull distribution, Comput. Statis. Data Anal. 52(2008), 4502-4511. doi:10.1016/j.csda.2008.03.001 [12] K. P. Hadeler, D. Juić, K. Sabo, Least squares problems for Michaelis Menten inetics, Math. Meth. Appl. Sci. 30(2007), 1231-1241. [13] D. Juić, G. Krali, R. Scitovsi, Least squares fitting Gompertz curve, J. Comput. Appl. Math. 169(2004), 359-375. [14] D. Juić, R. Scitovsi, H. Späth, Partial linearization of one class of the nonlinear total least squares problem by using the inverse model function, Computing 62(1999), 163-178. [15] J. F. Lawless, Statistical Models and Methods for Lifetime Data, Wiley, New Yor, 1982. [16] G. S. Mudholar, G. D. Kollia, Generalized Weibull family: A structural analysis, Communications in Statistics: Theory and Methods 23(1994), 1149-1171. [17] D. N. P. Murthy, M. Xie, M., R. Jiang, Weibull Models, Wiley, New Yor, 2004. [18] W. Nelson, Applied life data analysis, Wiley, New Yor, 1982. [19] G. J. S. Ross, Nonlinear Estimation, Springer, New Yor, 1990. [20] G. A. F. Seber, C. J. Wild, Nonlinear Regression, Wiley, New Yor, 1989. [21] R. L. Smith, J. C. Naylor, A comparison of maximum lielihood and Bayesian estimators for the three-parameter Weibull distribution Biometria 73(1987), 67-90.