TEST OF HOMOGENEITY OF PARALLEL SAMPLES FROM LOGNORMAL POPULATIONS WITH UNEQUAL VARIANCES

Similar documents
Machine Learning Basics: Estimators, Bias and Variance

Keywords: Estimator, Bias, Mean-squared error, normality, generalized Pareto distribution

Biostatistics Department Technical Report

are equal to zero, where, q = p 1. For each gene j, the pairwise null and alternative hypotheses are,

Non-Parametric Non-Line-of-Sight Identification 1

Testing equality of variances for multiple univariate normal populations

Block designs and statistics

Meta-Analytic Interval Estimation for Bivariate Correlations

RAFIA(MBA) TUTOR S UPLOADED FILE Course STA301: Statistics and Probability Lecture No 1 to 5

Limited Failure Censored Life Test Sampling Plan in Burr Type X Distribution

Probability Distributions

Support recovery in compressed sensing: An estimation theoretic approach

Lower Bounds for Quantized Matrix Completion

A Note on the Applied Use of MDL Approximations

Extension of CSRSM for the Parametric Study of the Face Stability of Pressurized Tunnels

Experimental Design For Model Discrimination And Precise Parameter Estimation In WDS Analysis

Research in Area of Longevity of Sylphon Scraies

Proc. of the IEEE/OES Seventh Working Conference on Current Measurement Technology UNCERTAINTIES IN SEASONDE CURRENT VELOCITIES

Model Fitting. CURM Background Material, Fall 2014 Dr. Doreen De Leon

Estimating Parameters for a Gaussian pdf

On the Maximum Likelihood Estimation of Weibull Distribution with Lifetime Data of Hard Disk Drives

DERIVING PROPER UNIFORM PRIORS FOR REGRESSION COEFFICIENTS

AN OPTIMAL SHRINKAGE FACTOR IN PREDICTION OF ORDERED RANDOM EFFECTS

On Conditions for Linearity of Optimal Estimation

Lost-Sales Problems with Stochastic Lead Times: Convexity Results for Base-Stock Policies

Tail Estimation of the Spectral Density under Fixed-Domain Asymptotics

Pseudo-marginal Metropolis-Hastings: a simple explanation and (partial) review of theory

Polygonal Designs: Existence and Construction

CS Lecture 13. More Maximum Likelihood

IN modern society that various systems have become more

An Introduction to Meta-Analysis

Topic 5a Introduction to Curve Fitting & Linear Regression

Distributed Subgradient Methods for Multi-agent Optimization

2 Q 10. Likewise, in case of multiple particles, the corresponding density in 2 must be averaged over all

Research Article On the Isolated Vertices and Connectivity in Random Intersection Graphs

Inspection; structural health monitoring; reliability; Bayesian analysis; updating; decision analysis; value of information

AN EFFICIENT CLASS OF CHAIN ESTIMATORS OF POPULATION VARIANCE UNDER SUB-SAMPLING SCHEME

1 Identical Parallel Machines

ANALYSIS OF HALL-EFFECT THRUSTERS AND ION ENGINES FOR EARTH-TO-MOON TRANSFER

Feature Extraction Techniques

MSEC MODELING OF DEGRADATION PROCESSES TO OBTAIN AN OPTIMAL SOLUTION FOR MAINTENANCE AND PERFORMANCE

Risk & Safety in Engineering. Dr. Jochen Köhler

STOPPING SIMULATED PATHS EARLY

General Properties of Radiation Detectors Supplements

Random Process Review

Pattern Recognition and Machine Learning. Artificial Neural networks

A general forulation of the cross-nested logit odel Michel Bierlaire, Dpt of Matheatics, EPFL, Lausanne Phone: Fax:

3.3 Variational Characterization of Singular Values

This article appeared in a journal published by Elsevier. The attached copy is furnished to the author for internal non-commercial research and

Inferences using type-ii progressively censored data with binomial removals

Sampling How Big a Sample?

ESTIMATING AND FORMING CONFIDENCE INTERVALS FOR EXTREMA OF RANDOM POLYNOMIALS. A Thesis. Presented to. The Faculty of the Department of Mathematics

Best Linear Unbiased and Invariant Reconstructors for the Past Records

BIVARIATE NONCENTRAL DISTRIBUTIONS: AN APPROACH VIA THE COMPOUNDING METHOD

Best Arm Identification: A Unified Approach to Fixed Budget and Fixed Confidence

A Simple Regression Problem

This model assumes that the probability of a gap has size i is proportional to 1/i. i.e., i log m e. j=1. E[gap size] = i P r(i) = N f t.

In this chapter, we consider several graph-theoretic and probabilistic models

A Simplified Analytical Approach for Efficiency Evaluation of the Weaving Machines with Automatic Filling Repair

Asynchronous Gossip Algorithms for Stochastic Optimization

The degree of a typical vertex in generalized random intersection graph models

C na (1) a=l. c = CO + Clm + CZ TWO-STAGE SAMPLE DESIGN WITH SMALL CLUSTERS. 1. Introduction

Inference in the Presence of Likelihood Monotonicity for Polytomous and Logistic Regression

M ath. Res. Lett. 15 (2008), no. 2, c International Press 2008 SUM-PRODUCT ESTIMATES VIA DIRECTED EXPANDERS. Van H. Vu. 1.

Moments of the product and ratio of two correlated chi-square variables

Approximation in Stochastic Scheduling: The Power of LP-Based Priority Policies

Intelligent Systems: Reasoning and Recognition. Artificial Neural Networks

Bayesian Approach for Fatigue Life Prediction from Field Inspection

Multi-Dimensional Hegselmann-Krause Dynamics

13.2 Fully Polynomial Randomized Approximation Scheme for Permanent of Random 0-1 Matrices

1 Bounding the Margin

Uniform Approximation and Bernstein Polynomials with Coefficients in the Unit Interval

Fairness via priority scheduling

Soft Computing Techniques Help Assign Weights to Different Factors in Vulnerability Analysis

Two New Unbiased Point Estimates Of A Population Variance

Computable Shell Decomposition Bounds

Optimal Jamming Over Additive Noise: Vector Source-Channel Case

When Short Runs Beat Long Runs

On Pearson families of distributions and its applications

SPECTRUM sensing is a core concept of cognitive radio

Pattern Recognition and Machine Learning. Artificial Neural networks

Forecasting Financial Indices: The Baltic Dry Indices

Empirical phi-divergence test-statistics for the equality of means of two populations

Interactive Markov Models of Evolutionary Algorithms

Spine Fin Efficiency A Three Sided Pyramidal Fin of Equilateral Triangular Cross-Sectional Area

Physics 2107 Oscillations using Springs Experiment 2

GEE ESTIMATORS IN MIXTURE MODEL WITH VARYING CONCENTRATIONS

About the definition of parameters and regimes of active two-port networks with variable loads on the basis of projective geometry

Sample Mean Deviation (d) Chart Under the Assumption of Moderateness and its Performance Analysis Under Normality Against Moderateness

4 = (0.02) 3 13, = 0.25 because = 25. Simi-

Analyzing Simulation Results

Optimal nonlinear Bayesian experimental design: an application to amplitude versus offset experiments

Kinetic Theory of Gases: Elementary Ideas

The proofs of Theorem 1-3 are along the lines of Wied and Galeano (2013).

Recursive Algebraic Frisch Scheme: a Particle-Based Approach

THERMAL ENDURANCE OF UNREINFORCED UNSATURATED POLYESTERS AND VINYL ESTER RESINS = (1) ln = COMPOSITES & POLYCON 2009

Prediction by random-walk perturbation

A Self-Organizing Model for Logical Regression Jerry Farlow 1 University of Maine. (1900 words)

Testing the lag length of vector autoregressive models: A power comparison between portmanteau and Lagrange multiplier tests

Boosting with log-loss

Transcription:

TEST OF HOMOGENEITY OF PARALLEL SAMPLES FROM LOGNORMAL POPULATIONS WITH UNEQUAL VARIANCES S. E. Ahed, R. J. Tokins and A. I. Volodin Departent of Matheatics and Statistics University of Regina Regina, Saskatchewan, CANADA S4S 0A October 8, 00 Abstract Consider ( ) independent rando saples fro lognoral populations with ean paraeter θ 1, θ respectively. A large saple test for the hoogeneity of the ean paraeters is developed. An estiator and a confidence interval are proposed for the coon ean paraeter. The asyptotic distribution of the proposed test-statistic under the null hypothesis as well as under local alternative is derived. Key Words and Phrases: Coon ean, Cobination of lognoral odels, Asyptotic tests and confidence interval, Asyptotic power. 1. INTRODUCTION Let the rando variable Y be distributed norally with ean µ and variance σ ; then X = e Y of X is given by will have a lognoral distribution. The probability distribution function f(x) = { } 1 xσ π exp (ln x µ), 0 < x < σ 1

Evidently, the lognoral odel is related to the noral distribution in the sae way that the Weibull is related to the extree value distribution. Both noral and lognoral odels have received considerable use in lifetie and reliability probles. In edical and engineering applications, one often encounters rando variables X such that logarith of X has a noral distribution. This lognoral odel is coonly used in edicine and econoics, the basic process under consideration leads to phenoena which are often a ultiplication of factors. Also, as stated in Cheng (1977), reliability studies indicate that any sei-conductor devices follow lifetie distributions, which are well represented by the lognoral. We refer to an edited volue by Crow and Shiizu (1988) for a coprehensive review of the subject s theory and applications. Suppose X j1, X j,, X jnj is a rando saple fro a two-paraeter lognoral distribution with ean θ j and variance τj, denoted by Λ(θ j, τj ), corresponding to the noral distribution with ean µ j and variance σj, j = 1,,,. Thus, θ j = e µ j+ σ j, τ j = e µ j+σj e σj 1. Define Y ji = ln(x ji ), 1 i. If µ j and σ j are unknown, then the axiu likelihood estiator (MLE) of (µ j, σ j ) is (ˆµ j, ˆσ j ) with ˆµ j = 1 Y ji, ˆσ j = 1 i=1 (Y ji ˆµ j ). (1.1) i=1 Hence, the axiu likelihood estiators (MLE) of (θ j, τ j ) are ˆθ j = eˆµ j+ 1 ˆσ j, ˆτ j = e ˆµ j+ˆσ j eˆσ j 1. (1.) The sapling distribution of ˆθ j is given in the following lea. Lea AT (Ahed and Tokins, 1995) For each j = 1,,,, nj (ˆθj θ j ) D N (0, ν j ) as tends to, D eans convergence in distribution and ν j = σj (1 + σj /) exp{µ j + σj }. We consider the estiation and testing of the lognoral eans when ultiple-saples are available. For exaple, the data ay have been acquired at a different tie or space. Also, in any experiental situations it is a coon practice to replicate an experient. In these situations, we often encounter the proble of pooling independent estiates of a paraeter obtained fro different sources. Each such estiate is reported

as a nuber with an estiated standard error. In this case two probles are to be considered. First, can all these estiates be considered to be hoogeneous, i.e., are they estiating the sae paraeter? Second, if the estiates are hoogeneous, what is the best way of cobining the to obtain a single estiate? We will address these questions in an orderly anner. The focal point of this investigation is to develop an inference procedure when several parallel saples are available. In a large-saple classical setup, we propose a test of the hoogeneity of the lognoral eans. Further, a point estiator and interval estiator of the coon paraeter is given.. MULTIPLE SAMPLE PROBLEMS In this section we discuss estiation and testing procedures to provide inforation about properties of the ean paraeters based on several rando saples taken fro lognoral populations..1. Pooling of Estiates In any real occasions it is desirable to cobine the individual saple estiates to obtain a cobined or pooled estiate of coon paraeter θ. It is known that the variance of a linear cobination l j ξ j of the rando variables ξ 1,..., ξ, subject to l j = 1 is iniized by choosing l j = σj σk σ j is the variance of the ξ j, j = 1,...,. Hence, the following theore is proposed. Theore.1 Let X j Λ(θ, τ j ), j = 1,,, and suppose that observations fro a saple of size are available for each population. Then a cobined saple estiate of θ which has iniu variance aong the class of the unbiased estiators of θ which are linear functions of ˆθ 1,... ˆθ is given by, θ = ˆθj ˆν j, ˆν j ˆν j = ˆσ j (1 + ˆσ j /) exp{ˆµ j + ˆσ j }. Then, θ n is approxiately norally distributed with ean θ and asyptotic variance [ 1 /ν j ] 1. 3

The proof of the above theore follows fro Lea AT (stated in the previous section). It is concluded fro Theore.1 that θ provides a good estiate for a coon θ based on several saples. More iportantly, it is not necessary for the ν j to be equal in this case. We refer to Rao (1981, pp. 389-391) for further discussions on the topic... Test of Hypothesis There are any situations when one ust ake a decision that is based on an unknown paraeter s value(s). On such occasions a test of the hypothesis about the paraeters ay be ore appropriate. We consider two classes of testing probles for lognoral data: (i) siple null versus global alternative, and (ii) test for hoogeneity...1. Test of siple null versus global alternative Using failiar atrix notation, let θ = (θ 1,, θ ) be a 1 be vector of paraeters and ˆθ = (ˆθ 1,, ˆθ ) be the axiu likelihood estiator vector of θ. Suppose it is desired to test the siple hypothesis H o : θ = θ o, θ o = (θ1, o θ, o, θ) o (.1) against the global alternative H a : θ θ o. Define n = n 1 + n +... + n. It is natural to construct a test statistic for the null hypothesis, which is defined by the noralized distance of ˆθ fro θ o. Hence, define T 1 = n(ˆθ θ o ) S 1 1 (ˆθ θ o ), n n S 1 = Diag ˆν 1,, ˆν. n1 n Note that the test statistics T 1 can be rewritten as By using Lea AT we arrive at the following theore. ) (ˆθj θj T 1 = (.) ˆν j Theore.: Let n and also /n approches to a constant for any j = 1,...,. Then under the null hypothesis in (.1), the test statistic T 1 follows a chisquare distribution with degrees of freedo. 4

Proof. Follows fro the Lea AT and the fact that T 1 is the su of squares of independent asyptotically noral rando variables. Thus, when the null hypothesis is true, the upper α-level critical value of T 1, by C n,α, ay be approxiated by the central χ distribution with degrees of freedo. Note that σ j is a function of θ j ; i.e., σ j = (ln θ j µ j ). Hence, under the null hypothesis σ j = σ o j = (ln θ o j ˆµ j ) and in this case we will have ˆν j = ˆν o j = σ o j (1 + σ o j /) exp{ˆµ j + σ o j }. Thus, we can define another test statistic for the proble at hand as follows: T 1 = n(ˆθ θ o ) S 1 1 (ˆθ θ o ), n S 1 = Diag ˆν 1, o n, ˆν o. n1 n Note that the test statistics T can be rewritten as ) (ˆθj θ o T1 j = n ˆν j o j Under H o, for large n, T 1 will have a χ distribution with degrees of freedo..3. Test of Hoogeneity In this section we focus on developing a testing ethodology for the hoogeneity of lognoral eans when saples are pooled. Let us suppose that two or ore saples are available for which a coon value of θ is assued. The statistical proble is to test the following null hypothesis of hoogeneity of the ean paraeters: the coon value of θ is unknown. Define θ = ( θ, θ) H o : θ 1 = θ = θ = θ, (.3) = θ1, 1 = (1,, 1). We propose the following test statistic to test the null hypothesis in relation (.): T = n(ˆθ θ1 ) S 1 (ˆθ θ1 ), n n S = Diag ν 1,, ν, n1 n 5

and ν j = σ j (1 + σ j /) exp{ˆµ j + σ j }, σ j = ln( θ j ˆµ j ). And again, we can rewrite test statistics T without of atrix notations as (ˆθj θ ) T =, (.4) ν j In an effort to derive the null distribution of T we consider the asyptotic distribution of soe rando variables related to the proposed test statistic. Define U n = n(ˆθ θ1 ). It can be seen that U n = n Cˆθ C = I 1ˆω J D, Theore.3: ( n1 D = Diag,, n ), J = I 1 1 ν 1 ν D, and ˆω = nj For large n, under the null hypothesis in (.), the test statistic T is distributed as a chi-square distribution with ( 1) degrees of freedo. Proof. Follows fro the Lea AT and the fact that T is the su of squares of asyptotically independent asyptotically noral rando variables. As the consequence of the above theore, under the null hypothesis and for large n, for given α, the critical value of T ay be approxiated by χ 1,α, the upper 100α% point of the chi-square distribution with ( 1) degrees of freedo..4. Power of the Tests It is iportant to note that, for a fixed alternative that is different fro the null hypothesis, the power of both test statistics proposed earlier will converge to one as n. This follows fro the fact that test statistics tends to infinity if θ θ 0 (cf. the siilar aguent given in Sen and Singer (1993, pages 37-38)). Thus, to study the asyptotic power properties of T 1 and T, we ust confine ourselves to a sequence of local alternatives {K n }. When θ is the paraeter of interest, such a sequence ay be specified by K n : θ = θ o + ξ. n 1 6 ν j

ξ is a vector of fixed real nubers. Evidently, θ approaches θ o at a rate to n 1/. Stochastic convergence of ˆθ to θ ensures that ˆθ p θ under local alternatives as well. Hence, nonnull distributions and the power of the proposed test statistic can be deterined under the local alternatives. Theore.4: Under the local alternatives and as n we have the following distributional result: n 1/ {ˆθ θ o } D N (ξ, Γ 1 ), ( ν1 Γ 1 = Diag,, ν ). ω 1 ω Theore.5: Under the local alternatives and as n we have the following distributional result: n 1/ {ˆθ θ1 } D N (Jξ, Γ ), and ( ν1 Γ = Diag,, ν ) C, ω 1 ω C = I 1 ω JD, ( ω1 D = Diag,, ω ), J = I 1 1 ν 1 ν D, and ω = ωj. ν j Proof of the both theores can be obtained using the general contiguity theory (cf. Roussas (197)). By thores.4 and.5, a su of squares of asyptotically independent and asyptotically noral rando variables with the unit variance and nonzero eans has the asyptotic distribution noncentral χ -square with the corresponding paraeter of noncentrality. Thus, under local alternatives, test statistics T 1 and T will have asyptotically a noncentral chi-square distribution with and 1 degree of freedo respectively, and noncentrality paraeters Θ 1 = ξγ 1 1 ξ, Θ = (Jξ) Γ 1 (Jξ), respectively. Hence, using a noncentral chi-square distribution, one can do the power calculations of the proposed test statistics..4. Interval Estiation 7

Let z α be the usual percentile point such that 1 Φ z α = α, Φ( ) is the cuulative distribution function for a standard noral rando variable. Noting that, P r ˆθ j z α ( ˆνj ) 1/ θ j ˆθ j + z α 1/ ˆνj converges to 1 α as. Hence intervals having 1 α coverage probability for θ j can be expressed as ˆθ j ± z α 1/ ˆνj. (.5) If the null hypothesis H o : θ 1 = θ = = θ is not rejected, it ay be of interest to obtain a 100(1 α)% confidence interval about the coon value of θ. A 100(1 α)% confidence interval about θ ay be obtained by using the cobined data. Note that, ( 1/ ( ) 1/ 1 P r z α θ θ ( / ν j )) θ 1 + z α ( / ν j ) converges to 1 α as n. Thus, a 100(1 α)% confidence interval about coon paraeter θ ay be obtained as follows: ( 1/ 1 θ ± z α. (.6) ( / ν j )) Clearly this interval will provide shorter confidence interval than that based on individual estiates, for any given α. 3. Concluding Rearks A large saple analysis is presented when lognoral eans are cobined. A test of the hoogeneity of the eans is presented, and a point and interval estiator of the coon ean paraeter is also provided. As a word of caution, the statistical procedures based on cobined estiates are sensitive to departure fro the null hypothesis. Therefore, soe other alternatives to pooled estiator should be considered. Furtherore, the proposed procedures involve nonlinear functions of asyptotic noral estiators that ay not be well approxiated by a noral law unless the saple sizes are large. Acknowledgeents The work was supported by grants fro the Natural Sciences and Engineering Research Council of Canada. 8

REFERENCES Ahed, S. E. and R. J. Tokins (1995). Estiating lognoral eans under uncertain prior inforation. Pakistan Journal of Statistics, 11, 67-9. Cheng, S. S. (1977). Optial replaceent rate of devices with lognoral failure distribution. IEEE Trans. Reliability R-6, 174-178. Crow, E. L. and Shiizu, K. (1988). Lognoral Distributions. Marcel Dekker: New York. Rao, C. R. (1981). Linear Statistical Inference and its Applications (second edition). Wiley Eastern Liited: New Delhi. Roussas, G. G. (197) Contiguity of Probability Measures, Cabridge University Press. Sen, P.K. and Singer, J.M. (1993) Large Saple Methods in Statistics, Chapan and Hall: New York - London. 9