The RS Generalized Lambda Distribution Based Calibration Model

Similar documents
Simulated Power of the Discrete Cramér-von Mises Goodness-of-Fit Tests

Computing MLE Bias Empirically

ANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U)

Chapter 11: Simple Linear Regression and Correlation

Effective plots to assess bias and precision in method comparison studies

Limited Dependent Variables

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4)

Parametric fractional imputation for missing data analysis. Jae Kwang Kim Survey Working Group Seminar March 29, 2010

Linear Approximation with Regularization and Moving Least Squares

COMPARISON OF SOME RELIABILITY CHARACTERISTICS BETWEEN REDUNDANT SYSTEMS REQUIRING SUPPORTING UNITS FOR THEIR OPERATIONS

Chapter 13: Multiple Regression

See Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition)

Statistical inference for generalized Pareto distribution based on progressive Type-II censored data with random removals

Using T.O.M to Estimate Parameter of distributions that have not Single Exponential Family

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA

Non-Mixture Cure Model for Interval Censored Data: Simulation Study ABSTRACT

CHAPTER 5 NUMERICAL EVALUATION OF DYNAMIC RESPONSE

Lab 2e Thermal System Response and Effective Heat Transfer Coefficient

Power law and dimension of the maximum value for belief distribution with the max Deng entropy

Computation of Higher Order Moments from Two Multinomial Overdispersion Likelihood Models

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Global Sensitivity. Tuesday 20 th February, 2018

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore

ANOMALIES OF THE MAGNITUDE OF THE BIAS OF THE MAXIMUM LIKELIHOOD ESTIMATOR OF THE REGRESSION SLOPE

On an Extension of Stochastic Approximation EM Algorithm for Incomplete Data Problems. Vahid Tadayon 1

On Outlier Robust Small Area Mean Estimate Based on Prediction of Empirical Distribution Function

This column is a continuation of our previous column

Double Acceptance Sampling Plan for Time Truncated Life Tests Based on Transmuted Generalized Inverse Weibull Distribution

Goodness of fit and Wilks theorem

Lecture 12: Discrete Laplacian

Primer on High-Order Moment Estimators

Comparison of Regression Lines

ON A DETERMINATION OF THE INITIAL FUNCTIONS FROM THE OBSERVED VALUES OF THE BOUNDARY FUNCTIONS FOR THE SECOND-ORDER HYPERBOLIC EQUATION

MATH 829: Introduction to Data Mining and Analysis The EM algorithm (part 2)

Composite Hypotheses testing

Parameters Estimation of the Modified Weibull Distribution Based on Type I Censored Samples

System in Weibull Distribution

Statistical analysis using matlab. HY 439 Presented by: George Fortetsanakis

4.3 Poisson Regression

Lecture Notes on Linear Regression

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity

a. (All your answers should be in the letter!

Numerical Heat and Mass Transfer

Module 3 LOSSY IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation

LOW BIAS INTEGRATED PATH ESTIMATORS. James M. Calvin

Support Vector Machines. Vibhav Gogate The University of Texas at dallas

Lecture 16 Statistical Analysis in Biomaterials Research (Part II)

Kernel Methods and SVMs Extension

An R implementation of bootstrap procedures for mixed models

An (almost) unbiased estimator for the S-Gini index

Comparison of the Population Variance Estimators. of 2-Parameter Exponential Distribution Based on. Multiple Criteria Decision Making Method

A Robust Method for Calculating the Correlation Coefficient

Estimation of the Mean of Truncated Exponential Distribution

DERIVATION OF THE PROBABILITY PLOT CORRELATION COEFFICIENT TEST STATISTICS FOR THE GENERALIZED LOGISTIC DISTRIBUTION

x i1 =1 for all i (the constant ).

Chapter 5. Solution of System of Linear Equations. Module No. 6. Solution of Inconsistent and Ill Conditioned Systems

LECTURE 9 CANONICAL CORRELATION ANALYSIS

The Order Relation and Trace Inequalities for. Hermitian Operators

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

A Hybrid Variational Iteration Method for Blasius Equation

Feature Selection: Part 1

Chapter 8 Indicator Variables

Introduction to Generalized Linear Models

Assignment 5. Simulation for Logistics. Monti, N.E. Yunita, T.

Andreas C. Drichoutis Agriculural University of Athens. Abstract

A new Approach for Solving Linear Ordinary Differential Equations

Joint Statistical Meetings - Biopharmaceutical Section

Simulation and Probability Distribution

2E Pattern Recognition Solutions to Introduction to Pattern Recognition, Chapter 2: Bayesian pattern classification

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Thermodynamics and statistical mechanics in materials modelling II

Bayesian predictive Configural Frequency Analysis

STAT 511 FINAL EXAM NAME Spring 2001

3.1 Expectation of Functions of Several Random Variables. )' be a k-dimensional discrete or continuous random vector, with joint PMF p (, E X E X1 E X

Linear Regression Analysis: Terminology and Notation

PHYS 450 Spring semester Lecture 02: Dealing with Experimental Uncertainties. Ron Reifenberger Birck Nanotechnology Center Purdue University

Convexity preserving interpolation by splines of arbitrary degree

Open Systems: Chemical Potential and Partial Molar Quantities Chemical Potential

Lecture 3 Stat102, Spring 2007

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction

Uncertainty as the Overlap of Alternate Conditional Distributions

Interval Estimation of Stress-Strength Reliability for a General Exponential Form Distribution with Different Unknown Parameters

Which estimator of the dispersion parameter for the Gamma family generalized linear models is to be chosen?

LINEAR REGRESSION ANALYSIS. MODULE VIII Lecture Indicator Variables

NUMERICAL DIFFERENTIATION

Time-Varying Systems and Computations Lecture 6

ASYMPTOTIC PROPERTIES OF ESTIMATES FOR THE PARAMETERS IN THE LOGISTIC REGRESSION MODEL

Chapter 5 Multilevel Models

Hongyi Miao, College of Science, Nanjing Forestry University, Nanjing ,China. (Received 20 June 2013, accepted 11 March 2014) I)ϕ (k)

Appendix B: Resampling Algorithms

Maximum Likelihood Estimation of Binary Dependent Variables Models: Probit and Logit. 1. General Formulation of Binary Dependent Variables Models

On a direct solver for linear least squares problems

POWER AND SIZE OF NORMAL DISTRIBUTION AND ITS APPLICATIONS

The lower and upper bounds on Perron root of nonnegative irreducible matrices

USE OF DOUBLE SAMPLING SCHEME IN ESTIMATING THE MEAN OF STRATIFIED POPULATION UNDER NON-RESPONSE

Laboratory 1c: Method of Least Squares

Statistics Chapter 4

Durban Watson for Testing the Lack-of-Fit of Polynomial Regression Models without Replications

Negative Binomial Regression

Transcription:

Internatonal Journal of Statstcs and Probablty; Vol. 2, No. 1; 2013 ISSN 1927-7032 E-ISSN 1927-7040 Publshed by Canadan Center of Scence and Educaton The RS Generalzed Lambda Dstrbuton Based Calbraton Model Steve Su 1, Abeer Hasan 2 & We Nng 2 1 Covance Pty Ltd., Sydney, Australa; School of Mathematcs and Statstcs, Unversty of Western Australa, Crawley, Australa 2 Department of Mathematcs and Statstcs, Bowlng Green State Unversty, Bowlng Green, USA Correspondence: Steve Su, Covance Pty Ltd., Sydney, Australa; School of Mathematcs and Statstcs, Unversty of Western Australa, Crawley 6009, Australa. E-mal: dbarro2@gmal.com Receved: December 5, 2012 Accepted: January 10, 2013 Onlne Publshed: January 22, 2013 do:10.5539/sp.v2n1p101 URL: http://dx.do.org/10.5539/sp.v2n1p101 Abstract We propose a flexble lnear calbraton model wth errors from RS (Ramberg & Schmeser, 1974) generalzed lambda dstrbuton (GλD). We demonstrate the dervaton of the maxmum lkelhood estmates of RS GλD parameters and examne the estmaton performance usng a smulaton study for sample szes rangng from 30 to 200. The use of RS GλD calbraton model not only provdes statstcal modeller wth a rcher range of dstrbutonal shapes, but can also provde more precse parameter estmates compared to the standard Normal calbraton model or skewed Normal calbraton model proposed by Fgueredoa, Bolfarnea, Sandovala and Lmab (2010). Keywords: generalzed lambda dstrbuton, lnear calbraton model, skew normal dstrbuton, maxmum lkelhood estmaton 1. Introducton The statstcal calbraton model s a reverse regresson technque, where we use the response varable to predct the correspondng explanatory varable. There are number of applcatons of ths technque n scence. For example, we may use radometrc datng to ascertan the age of a tree and further verfy our result usng tree rngs. Our am, however, s to use radometrc datng to estmate age of new trees, and the problem s whether we should mnmze errors n the observaton or mnmze errors n age determnaton. There are many smlar problems n substance concentraton determnaton n bology and chemstry, physcal quanttes determnaton n physcs and blood pressure/cholsterol level measurement n medcne. The lterature on calbraton problem has a long hstory, and one of the earlest works can be found n Esenhart (1939). The usual calbraton experment s a two stage process nvolvng two random varables X and Y. The frst stage s known as the calbraton tral, where we observe the n values of the response varable y 1,, y n from a gven set of explanatory values x 1,, x n and we can estmate the lnk functon between X and Y. The second stage s known as the calbraton experment, where we observe k 1 value(s) of the response varable Y as y 01,, y 0k whch are mapped from some unknown value x 0 from the explanatory varable X. We can express these two stages by the followng equatons. y = α + βx + ε, = 1,, n; y 0 = α + βx 0 + ε 0, = 1,, k, (1.1) We usually assume that the errors ε 1,,ε n,ε 01,,ε 0k are..d and Normally dstrbuted wth mean 0 and varance σ 2. Also, x 1,, x n are known and α, β, x 0 and σ 2 are unknown parameters whch we need to estmate. As an extenson to Normal dstrbuton, Azzaln (1985) ntroduced the skewed Normal dstrbuton. The skewed Normal dstrbuton s defned as g(x; ξ, ω, λ) = 2 ( x ξ ) ( ( x ξ )) ω φ Φ λ, (1.2) ω ω where φ( ) and Φ( ) are the p.d.f. and c.d.f. of a standard normal dstrbuton respectvely. Specally, when ξ = 0 and ω = 1, we obtan the standard skewed Normal dstrbuton. 101

www.ccsenet.org/sp Internatonal Journal of Statstcs and Probablty Vol. 2, No. 1; 2013 Based on (1.2), Fgueredoa et al. (2010) defned a skew-normal calbraton model by assumng that ε and ε 0 are..d. and follow a skewed Normal dstrbuton wth ξ = 0 denoted by SN(0,ω,λ). Ths gves us the followng calbraton model: y x SN(α + βx ; ω; λ), = 1., n, y 0 x 0 SN(α + βx ; ω; λ), = 1,, k. (1.3) In (1.3), the condtonal dstrbuton of y gven x and y 0 gven x 0 are governed by skewed Normal dstrbutons. Ths skewed Normal calbraton model allows the modeller to cope wth some degree of skewness n the error dstrbuton. However, ths s stll lmted as the skewed Normal dstrbuton have lmted range of shapes. The skewed Normal dstrbuton stll cannot handle heavy taled, U shape, unform, trangular or exponental upward/downward patterns. These shapes however, can be captured usng GλD (generalzed lambda dstrbutons), and we propose a further extenson to the calbraton model by usng RS GλD. Our artcle s organzed as follows. In Secton 2, we ntroduce the GλD famly. In Secton 3, we outlne the RS GλD calbraton model and dscuss possble ways to estmate parameters of the model usng maxmum lkelhood estmaton. In Secton 4, we demonstrate the estmaton performance of our proposed model across a range of dfferent sample szes from 30 to 200. As a further test to our proposed model to the lterature, we compare the performance of RS GλD calbraton model aganst Normal and skewed Normal calbraton model wth respect to a real lfe dataset used by Fgueredoa et al. (2010) n Secton 5. A dscusson of our proposed method s gven n Secton 6. 2. Generalzed Lambda Dstrbutons The RS GλD (Ramberg & Schmeser, 1974) s an extenson of Tukey s lambda dstrbuton. It s defned by ts nverse dstrbuton functon: F 1 (u) = λ 1 + uλ 3 (1 u) λ 4 0 u 1 (2.1) From (2.1), λ 1,,λ 3,λ 4 are respectvely the locaton, nverse scale and shape 1 and shape 2 parameters. Karan and Dudewcz (2000) noted that GλD s defned only f 0 for 0 u 1. The condtons λ 3 u λ 3 1 + λ 4 (1 u) λ 4 1 for whch RS GλD s a vald p.d.f. are set out n Karan and Dudewcz (2000) and these are also programmed n GLDEX package n R (Su, 2010, 2007a). Fremer, Kolla, Mudholkar and Ln (1988) descrbe another dstrbuton known as FKML GλD. The FKML GλD can be wrtten as: F 1 (u) = λ 1 + u λ 3 1 λ 3 (1 u)λ 4 1 λ 4 0 u 1 (2.2) Under (2.2), λ 1,,λ 3,λ 4 are respectvely the locaton, nverse scale and shape 1 and shape 2 parameters. The fundamental motvaton for the development of FKML GλD s that the dstrbuton s defned over all λ 3 and λ 4 (Fremer et al., 1988). The only restrcton on FKML GλD s > 0. Ths s more convenent to deal wth computatonally than RS GλD and hence t s sometmes the preferred GλD for some researchers. We restrct our attenton n ths artcle to the more dffcult problem of fttng RS GλD calbraton model to data. Wthout loss of generalty, the method we outlned below can be easly adapted to buld FKML GλD calbraton model. 3. Statstcal Model 3.1 GλD Based Calbraton Model We consder the followng usual calbraton model: y = α + βx + ɛ, = 1,, n, (3.1) y 0 = α + βx 0 + ɛ, = 1,, k. (3.2) We assume that ɛ and ɛ are..d. GλD(0,,λ 3,λ 4 ). In general, we consder x 1,, x n to be known and fxed and α, β,,λ 3 and λ 4 are parameters we need to estmate. Our GλD calbraton model takes the followng form: y x GλD(α + βx,,λ 3,λ 4 ), (3.3) 102

www.ccsenet.org/sp Internatonal Journal of Statstcs and Probablty Vol. 2, No. 1; 2013 y 0 x 0 GλD(α + βx 0,,λ 3,λ 4 ). (3.4) Consequently, the lkelhood functon for RS GλD s: where L(θ, y, y 0 ) = n =1 + λ 4 (1 z ) λ 4 1 k =1 y = (α + βx ) + zλ 3 (1 z ) λ 4, y 0 = (α + βx 0 ) + zλ 3 (1 z ) λ 4, and 0 z, z 1, θ = (α, β, x 0,,λ 3,λ 4 ). 3.2 Estmaton of Parameters From (3.5), we obtan the followng log lkelhood functon: where log L(θ, y, y 0 ) = n log ( f 1 (θ, y )) + =1 f 1 (θ, y ) = f 2 (θ, y 0 ) = Takng the dervatve of (3.6), we obtan the followng: log L(θ) θ = + λ 4 (1 z ) λ 4 1, (3.5) k log ( f 2 (θ, y 0 ) ) (3.6) =1 + λ 4 (1 z ) λ 4 1, n =1 + λ 4 (1 z ) λ 4 1 1 f 1 θ + k =1 1 f 2 θ, (3.7) where θ = (α, β, x 0,,λ 3,λ 4 ). Theoretcally, the MLE of θ s the soluton of (3.7) when t s set to be equal to 0. The dervatves θ and θ are gven below. = z y z y = λ λ 3 (λ 3 1)z λ 3 2 + λ 4 (λ 4 1)(1 z ) λ 4 2 3 2 + λ 4 (1 z ) λ 4 1 ) 2 + λ 4 (1 z ) λ 4 1 zλ (1 z ) λ 4 2 = [λ 3(λ 3 1)z λ 3 2 λ 4 (λ 4 1)(1 z ) λ4 2 ](z λ 3 (1 z ) λ 4 ) + λ 4 (1 z ) λ 4 1 = ( ) [λ 3(λ 3 1)z λ3 2 λ 4 (λ 4 1)(1 z ) λ4 2 ](z λ 3 log z ) λ 3 + λ 4 (1 z ) λ 4 1 [λ 3 (λ 3 1)z λ3 2 λ 4 (λ 4 1)(1 z ) λ4 2 ]((1 z ) λ 3 log(1 z )) = λ 4 + λ 4 (1 z ) λ 4 1 α = ( λ2 2 )[λ 3(λ 3 1)z λ3 2 λ 4 (λ 4 1)(1 z ) λ4 2 ] + λ 4 (1 z ) λ 4 1 β = ( λ2 2 )[λ 3(λ 3 1)z λ3 2 λ 4 (λ 4 1)(1 z ) λ4 2 ] x + λ 4 (1 z ) λ 4 1 = [λ 3(λ 3 1)z λ 4 (λ 4 1)(1 z ) λ 4 2 ](z λ 3 + λ 4 (1 z ) λ 4 1 (1 z ) λ 4 ) 103

www.ccsenet.org/sp Internatonal Journal of Statstcs and Probablty Vol. 2, No. 1; 2013 = ( ) [λ 3(λ 3 1)z λ 3 λ 4 (λ 4 1)(1 z ) λ 4 2 ](z λ 3 + λ 4 (1 z ) λ 4 1 log z ) [λ 3 (λ 3 1)z λ 4 (λ 4 1)(1 z ) λ4 2 ]((1 z ) λ 3 log(1 z )) = λ 4 + λ 4 (1 z ) λ 4 1 α = ( λ2 2 )[λ 3(λ 3 1)z λ 4 (λ 4 1)(1 z ) λ4 2 ] + λ 4 (1 z ) λ 4 1 β = ( λ2 2 )[λ 3(λ 3 1)z = ( 2 x )[λ 3(λ 3 1)z 0 λ 4 (λ 4 1)(1 z ) λ 4 2 ] x 0 + λ 4 (1 z ) λ 4 1 λ 4 (λ 4 1)(1 z ) λ 4 2 ] β + λ 4 (1 z ) λ 4 1 It s dffcult to obtan the exact solutons of settng (3.7) to zero usng the above formulatons, owng to the fact that RS GλD s defned by ts nverse quantle functon and there s a hgh degree of complexty nvolved n solvng the above equatons. As an alternatve, we carry out the maxmum lkelhood estmaton by maxmsng (3.6) drectly usng Nelder-Mead optmsaton algorthm as s customary done for maxmum lkelhood estmaton problems nvolvng GλD (see Su, 2010, 2007a, 2007b). Ths s a preferred and more relable method of estmaton as opposed to tryng to satsfy the exact condtons to whch all of the above equatons equal to zero. The GLDEX package n R (Su, 2010, 2007a) facltates the Nelder-Mead optmsaton algorthm for GλD. Our algorthm s as follows: 1) Generate a set of ntal values for α, β, x 0,,λ 3,λ 4. There are a number of strateges that can be used to determne the best set of ntal values. One strategy s to generate ntal values α, β, x 0 usng Normal or skewed Normal calbraton model and then generate some low dscrepancy quas random numbers for,λ 3,λ 4 over a range of values and select the set of ntal values that maxmses (3.6). Alternatvely all ntal values can be randomly generated usng low dscrepancy quas random numbers. 2) Set λ 1 = α + βx 0. 3) Check that GλD(λ 1,,λ 3,λ 4 ) s a vald statstcal dstrbuton, ths can be done usng GLDEX package n R. 4) Check the mnmal support of GλD(λ 1,,λ 3,λ 4 ) s lower or equal to the lowest value of y 0. Smlarly, check that the maxmum support of GλD(λ 1,,λ 3,λ 4 ) s greater or equal to the largest value of y 0. Ths s to ensure that the ftted GλD wll span the entre dataset. If these condtons are not met, choose another set of ntal values and repeat from 2). 5) Conduct Nelder Mead optmsaton by maxmsng (3.6) drectly usng the above ntal values to obtan the requred estmates. 4. Smulatons We conduct smulatons to llustrate the performance of our RS GλD calbraton model for sample sze n = 30, 50, 100 and 200 wth α = 3,β = 1.5, x 0 = 15 or 40,λ 3 = 10,λ 4 = 1, and = 2, 5, 10. We further generate x 1, x 2,, x n from Un f orm(10, 30), and we set k = 1. We use the true parameters as our ntal values to kck start the optmsaton process to obtan our MLE estmate for x 0. We repeat ths process 1000 tmes, whch gve us 1000 ˆx 0m estmates of x 0. The mean ˆx 0, Bas(x 0 ) and MSE(x 0 ) are calculated as follows: ˆx 0 = 1 1000 ˆx 0m 1000 m=1 Bas(x 0 ) = 1 1000 (ˆx 0m x 0 ) 1000 m=1 MSE(x 0 ) = 1 1000 (ˆx 0m x 0 ) 2 1000 104 m=1

www.ccsenet.org/sp Internatonal Journal of Statstcs and Probablty Vol. 2, No. 1; 2013 The results of above smulatons are shown n Tables 1 and 2. As expected, the MSE decreases as we ncrease the sample sze or ncrease the value of nverse scale parameter. In terms of bas, we observe that the performance appear to be farly consstent across sample szes, ths gves confdence n the use of RS GλD calbraton model for smaller samples, even though there are are more parameters that need to be estmated from ths model. There also appears to be a tendency for RS GλD calbraton model to slghtly overestmate as nearly all the bas results are postve. Increasng the shape parameter λ 3 does not always result n ncrease n MSE, ths s because the shape parameter spaces of λ 3 and λ 4 for RS GλD are farly complex. Table 1. Smulatons results wth x 0 = 15, α = 3, β = 1.5, λ 4 = 1 = 2 = 5 = 10 n λ 3 ˆx 0 Bas MSE ˆx 0 Bas MSE ˆx 0 Bas MSE 30 10 15.1105 0.1105 0.0263 15.0386 0.0386 0.0042 15.0178 0.0178 0.0010 50 10 15.0944 0.0944 0.0232 15.0352 0.0352 0.0040 15.0172 0.0172 0.0010 100 10 15.0994 0.0994 0.0184 15.0396 0.0396 0.0035 15.0185 0.0185 0.0008 200 10 15.1053 0.1053 0.0166 15.0340 0.0340 0.0030 15.0173 0.0173 0.0007 30 5 15.1430 0.1430 0.0292 15.0578 0.0578 0.0056 15.0285 0.0285 0.0012 50 5 15.1445 0.1445 0.0270 15.0530 0.0530 0.0047 15.0292 0.0292 0.0012 100 5 15.1381 0.1381 0.0214 15.0531 0.0531 0.0043 15.0264 0.0264 0.0010 200 5 15.1429 0.1429 0.0187 15.0534 0.0534 0.0038 15.0227 0.0227 0.0009 30 1 15.0271 0.0271 0.0244 15.0014 0.0014 0.0061 15.0040 0.0040 0.0017 50 1 15.0367 0.0367 0.0169 15.0030 0.0030 0.0048 14.9993-0.0007 0.0014 100 1 15.0292 0.0292 0.0084 15.0093 0.0093 0.0030 15.0029 0.0029 0.0010 200 1 15.0262 0.0262 0.0052 15.0130 0.0130 0.0016 15.0022 0.0022 0.0007 Table 2. Smulatons results wth x 0 = 40, α = 3, β = 1.5, λ 4 = 1 = 2 = 5 = 10 n λ 3 ˆx 0 Bas MSE ˆx 0 Bas MSE ˆx 0 Bas MSE 30 10 40.1070 0.1070 0.0259 40.0375 0.0375 0.0049 40.0189 0.0189 0.0012 50 10 40.1051 0.1051 0.0235 40.0388 0.0388 0.0039 40.0177 0.0177 0.0009 100 10 40.1077 0.1077 0.0205 40.0353 0.0353 0.0031 40.0188 0.0188 0.0008 200 10 40.1088 0.1088 0.0169 40.0387 0.0387 0.0028 40.0184 0.0184 0.0008 30 5 40.1339 0.1339 0.0319 40.0557 0.0557 0.0064 40.0288 0.0288 0.0014 50 5 40.1391 0.1391 0.0302 40.0554 0.0554 0.0046 40.0280 0.0280 0.0013 100 5 40.1405 0.1405 0.0232 40.0479 0.0479 0.0039 40.0264 0.0264 0.0010 200 5 40.1538 0.1538 0.0236 40.0474 0.0474 0.0035 40.0205 0.0205 0.0007 30 1 40.0331 0.0331 0.0290 39.9984-0.0016 0.0058 40.0035 0.0035 0.0016 50 1 40.0348 0.0348 0.0159 40.0031 0.0031 0.0045 40.0022 0.0022 0.0013 100 1 40.0311 0.0311 0.0099 40.0078 0.0078 0.0024 39.9996-0.0004 0.0009 200 1 40.0217 0.0217 0.0036 40.0114 0.0114 0.0017 40.0031 0.0031 0.0007 Table 3. Smulatons results wth x 0 = 15,α = 3,β = 1.5, true error dstrbuton GEV(0.1860, 0.4016, 0.1511) s approxmated by RS GλD wth λ 1 = 0, 0.0374,λ 3 0.0027,λ 4 0.0212 n ˆx 0 Bas MSE 30 15.3140 0.3140 0.2149 50 15.3269 0.3269 0.2154 100 15.2815 0.2815 0.1774 200 15.2860 0.2860 0.1689 We further consdered usng RS GλD to approxmate generalzed extreme value dstrbuton (GEV) wth locaton, scale and shape parameters beng 0.1860, 0.4016, 0.1511 respectvely. We choose RS GλD wth λ 1 = 0, 0.0374,λ 3 0.0027,λ 4 0.0212 for ths demonstraton (Fgure 1). We then generate smulated data based on GEV and use our approxmated RS GλD to estmate x 0 wth α = 3,β= 1.5 and repeat ths over 1000 smulaton runs. The result of ths smulaton s gven n Table 3. We observe that the RS GλD calbraton model tends to overestmate the true x 0 by a small margn, but the bas appears to decrease as sample sze ncreases. 105

www.ccsenet.org/sp Internatonal Journal of Statstcs and Probablty Vol. 2, No. 1; 2013 Generalzed Extreme Value Dstrbuton approxmated by RS GλD GEV(0.1860, 0.4016, 0.1511) GλD(0, 0.0374, 0.0027, 0.0212) Densty 0.0 0.2 0.4 0.6 0.8 0 2 4 6 8 10 12 x Fgure 1. Approxmatng GEV usng RS GλD 5. Applcaton We apply the RS GλD calbraton model to a dataset whch measures teenager testcular volume (ml 3 ). Ths dataset s from Chpkevtch, Nshmura, Tu and Galea-Raas (1996) and conssts of 42 observatons. Fgueredoa et al. (2010) consdered two measurement methods from Chpkevtch et al. (1996): dmensonal measurement wth a calper (DM) and measurement by ultrasonography (US) and the data s gven n Table 4. In ther paper, Fgueredoa et al. (2010) consder the x 0 value of 16.4, whch s observed twce by ultrasonography. They subsequently treated ths value as unknown, wth correspondng y 0 values of y 01 = 10.3 and y 02 = 17.3. Then, they estmate x 0 usng ther skewed Normal calbraton model and compared ths wth the standard Normal calbraton model. We dd the same usng the RS GλD calbraton model and our results are shown n Table 5. Table 4. Measurements obtaned by dmensonal measurement wth a calper (DM) and by ultrasonography (US) from the rght tests for 42 teenagers, n ml 3 DM US DM US DM US DM US DM US DM US 5.9 5 17.3 16.4 7.2 6.7 4.8 5.7 17.3 17.6 5.9 5.3 6.8 7.4 7.9 10 16.3 20 3.1 2.6 4.4 4.1 16.3 18.8 5 5.7 11.4 12.7 12.2 13.9 4.4 6.1 4.1 2.7 10.3 9.4 6 6.2 11.1 10.2 10.8 9.1 8.8 10.4 15.3 16.5 13 14.1 7.9 9.1 3.9 4.5 8.4 9.3 13 14.8 4.5 5.6 22.1 20.9 10.3 16.4 9.7 11 10.6 11.5 8.2 9.6 11.3 9.2 9.7 9.7 19.8 15.7 8.8 8.5 11.6 13.7 2 3 6.1 5.4 8.1 8.9 Table 5. A comparson of lnear calbraton models RS GλD model SN model Normal model Parameter Estmate Stdev. Estmate Stdev. Estmate Stdev. α 0.014 0.497-0.69-0.32 0.56 β 0.855 0.035 0.86 0.07 0.92 0.05 σ - - 2.13-1.55 0.17 x 0 12.128 0.963 12.66 1.81 14.58 1.24 λ - - 2.16 1.73 - - 0.146 0.355 - - - - λ 3-0.030 0.061 - - - - λ 4-0.162 0.184 - - - - AIC 150.36 160.69 163.74 BIC 160.79 169.38 170.69 HQ 144.58 156.55 161.15 The theoretcal dervaton of the varablty of our estmates under RS GλD s not readly tractable as n the cases of skewed Normal and Normal dstrbutons. As we need to numercally derve our calculatons, small errors n 106

www.ccsenet.org/sp Internatonal Journal of Statstcs and Probablty Vol. 2, No. 1; 2013 numercal procedures could accumulate nto large errors even f we could evaluate the exact theoretcal soluton. As a workaround, we adopt the followng procedure. Once we obtaned the parameters of our model, α, β, x 0,, λ 3, λ 4, we conduct smulatons to estmate the varablty of our estmate. We use our estmated parameters from the RS GλD calbraton model and x (excludng x = 16.4) from the orgnal data to randomly generate y 0 and y accordng to (3.1) and (3.2). We then maxmse the lkelhood n (3.6) usng Nelder Mead Smplex algorthm wth ntal values beng our orgnal estmated parameters. We repeat the process 1000 tmes and calculate the sample standard devatons of our estmated parameters. Table 5 lsts the estmated parameters and ther standard devatons from RS GλD, skewed Normal and Normal calbraton models. We compute the Akake, Bayesan and Hannan-Qunn nformaton crteron (AIC, BIC, and HQ) to allow model selecton between three models. All three crteron favors the RS GλD calbraton model. In addton, the RS GλD model s much more effcent compared to the other models, wth the smallest varablty n ts parameter estmates. 6. Concludng Remarks We propose a new calbraton model wth RS GλD errors, whch s an extremely flexble model that can cope wth a wde range of dfferent error dstrbutons. Our method also lends to the development of FKML GλD calbraton model, whch may have better propertes wth regard to numercal convergence. Our smulatons studes suggest our proposed model perform well for small sample szes across a range of nverse scale and shape parameters of RS GλD. We further demonstrate that the RS GλD calbraton model can outperform skewed Normal or Normal calbraton model, wth lower AIC, BIC and HQ nformaton crteron and lower varablty n our parameter estmates n the context of a real lfe data. These smulaton results are promsng and future statstcal models should am to develop statstcal technque that are talored to data, rather than requrng emprcal data to satsfy a partcular statstcal model. One possble extenson of our model s the development of a mxture RS GλD calbraton model, whch would extend the flexblty of our model even further but also present a very challengng problem for data wth small samples. References Azzaln, A. (1985). A class of dstrbutons whch ncludes the normal one. Scandnavan Journal of Statstcs, 12, 171-178. Chpkevtch, E., Nshmura, R., Tu, D., & Galea-Raas, M. (1996). Clncal measurements of testcular volume n adolescents: Comparson of the relablty of 5 methods. The Journal of Urology, 156, 2050-2053. http://dx.do.org/10.1016/s0022-5347(01)65433-8 Esenhart, C. (1939). The nterpretaton of certan regresson methods and ther use n bologcal and ndustral research. Annals of Mathematcal Statstcs, 10, 162-186. http://dx.do.org/10.1214/aoms/1177732214 Fgueredoa, C., Bolfarnea, H., Sandovala, M., & Lmab, C. (2010). On the skew-normal calbraton model. Journal of Appled Statstcs, 37(3), 435-451. http://dx.do.org/10.1080/02664760802715906 Fremer, M., Kolla, G., Mudholkar, G. S., & Ln, C. T. (1988). A study of the generalsed tukey lambda famly. Communcatons n Statstcs-Theory and Methods, 17, 3547-3567. http://dx.do.org/10.1080/03610928808829820 Karan, Z. A., & Dudewcz, E. J. (2000). Fttng statstcal dstrbutons: The generalzed lambda dstrbuton and generalsed bootstrap methods. New York: Chapman and Hall. http://dx.do.org/10.1201/9781420038040 Ramberg, J. S., & Schmeser, B. W. (1974). An approxmate method for generatng asymmetrc random varables. Communcatons of the Assocaton for Computng Machnery, 17, 78-82. http://dx.do.org/10.1145/360827.360840 Su, S. (2007a). Fttng sngle and mxture of generalsed lambda dstrbutons to data va dscretzed and maxmum lkelhood methods: GLDEX n R. Journal of Statstcal Software, 21(9). Su, S. (2007b). Numercal maxmum log lkelhood estmaton for generalzed lambda dstrbutons. Computatonal Statstcs and Data Analyss, 51(8), 3983-3998. http://dx.do.org/10.1016/.csda.2006.06.008 Su, S. (2010). Handbook of dstrbuton fttng methods wth R. In E. Karan, & Z. Dudewcz (Eds.), Fttng GLD to data Usng the GLDEX 1.0.4 n R (Chap. 15). CRC Press. 107