ACTA UNIVERSITATIS LODZIENSIS. Grzegorz Kończak*

Similar documents
ON THE MODIFICATION OF THE EMPTY CELLS TEST

A C T A U N I V E R S I T A T I S L O D Z I E N S I S FOLIA OECONOMICA 228, Janusz L. W yw iał*

KRUSKAL-WALLIS TEST ÍN MULTIPLE COMPARISONS

STT 843 Key to Homework 1 Spring 2018

A C T A U N I V E R S I T A T I S L O D Z I E N S I S FOLIA OECONOMICA 225, Daniel Kosiorowski*

REMARKS ON CLASSICAL MEANS FOR ONE AND MANY SAMPLES. 1. Introduction

Storm Open Library 3.0

COMPARING INCOME DISTRIBUTIONS - METHODS AND THEIR APPLICATION TO WAGE DISTRIBUTIONS IN POLAND

PROOFS OF THE NORMALIZATION OF THE FUNCTION OF THE THICKNESS CLASSES ONE-DIMENSIONAL NORMAL DISTRIBUTIONS

NUMERICAL SIMULATION OF MHD-PROBLEMS ON THE BASIS OF VARIATIONAL APPROACH

Asymptotic Statistics-III. Changliang Zou

A C T A U N I V E R S I T A T I S L O D Z I E N S I S. Czeslaw D om ański*, Dariusz Parys** THE CLOSED BOOTSTRAP MULTIPLE TEST PROCEDURE

About One way of Encoding Alphanumeric and Symbolic Information

Final Examination Statistics 200C. T. Ferguson June 11, 2009

Preliminary Statistics. Lecture 3: Probability Models and Distributions

Multivariate Distributions

ESTIMATION OF VARIANCE OF LOGISTIC REGRESSION PREDICTORS FOR SMALL AREAS

FOREIGN OFFICE. ;<э. File HNP zibr. KM 2Щ-г_ (CLAIMS) N a m e o f File :

EXTREME VALUE DISTRIBUTIONS AND ROBUST ESTIMATION

ACTA UNIVERSITATIS LODZIENSIS. Andrzej Grzybowski*

Large Sample Properties of Estimators in the Classical Linear Regression Model

REMARKS ON ESTIMATING THE LEBESGUE FUNCTIONS OF AN ORTHONORMAL SYSTEM UDC B. S. KASlN

ACTA UNIVERSITATIS LODZIENSIS. Dariusz Parys* M ULTIPLE ENDPOINTS

Models, Testing, and Correction of Heteroskedasticity. James L. Powell Department of Economics University of California, Berkeley

POSITIVE FIXED POINTS AND EIGENVECTORS OF NONCOMPACT DECRESING OPERATORS WITH APPLICATIONS TO NONLINEAR INTEGRAL EQUATIONS** 1.

Chapter 2. Review of basic Statistical methods 1 Distribution, conditional distribution and moments

Bivariate Paired Numerical Data

HANDBOOK OF APPLICABLE MATHEMATICS

ADJUSTED POWER ESTIMATES IN. Ji Zhang. Biostatistics and Research Data Systems. Merck Research Laboratories. Rahway, NJ

APPLICATION OF FUNCTIONAL BASED ON SPATIAL QUANTILES

On the Distribution o f Vertex-Degrees in a Strata o f a Random Recursive Tree. Marian D O N D A J E W S K I and Jerzy S Z Y M A N S K I

P (x). all other X j =x j. If X is a continuous random vector (see p.172), then the marginal distributions of X i are: f(x)dx 1 dx n

Czechoslovak Mathematical Journal

Lecture 11. Multivariate Normal theory

Math 510 midterm 3 answers

ECON Fundamentals of Probability

ACTA UNIVERSITATIS LODZIENSIS. Jan Kowalik* KRIG IN G - A M ETH O D OF STATISTICAL LXTERPOLATION OF SPATIAL DATA

A C T A U N I V E R S I T A T I S L O D Z I E N S I S. Dorota Kwiatkówska-Ciotucha*, Urszula Załuska**, Paweł Hańcz ar***

ACM 116: Lectures 3 4

Qualifying Exam CS 661: System Simulation Summer 2013 Prof. Marvin K. Nakayama

Problem Selected Scores

C-CC514 NT, C-CC514 PL C-CC564 NT, C-CC564 PL C-CC574 NT, C-CC574 PL C-CC714 NT, C-CC714 PL C-CC764 NT, C-CC764 PL C-CC774 NT, C-CC774 PL

Department of Chemical Engineering, Slovak Technical Bratislava. Received 8 October 1974

Multiple Random Variables

A Short Course in Basic Statistics

ON ALGORITHMS INVARIANT T 0 NONLINEAR SCALING WITH INEXACT SEARCHES

Probability for Statistics and Machine Learning

Gaussian random variables inr n

Statistical Inference: Estimation and Confidence Intervals Hypothesis Testing

Modeling and testing long memory in random fields

tfb 1 U ъ /ъ /5 FOREIGN OFFICE llftil HNP XI?/ Чо /о/ F il e ... SI» ИJ ... -:v. ' ' :... N a m e of File : Re turned Re turned Re turned.

COMPLETE HYPERSURFACES WITH CONSTANT SCALAR CURVATURE AND CONSTANT MEAN CURVATURE IN Я4. Introduction

Asymptotic Statistics-VI. Changliang Zou

Chapter 5 Matrix Approach to Simple Linear Regression

Lecture 2: Linear Models. Bruce Walsh lecture notes Seattle SISG -Mixed Model Course version 23 June 2011

STA 2201/442 Assignment 2

Multivariate Regression

Statistics Ph.D. Qualifying Exam: Part I October 18, 2003

BTRY 4090: Spring 2009 Theory of Statistics

The purpose of this section is to derive the asymptotic distribution of the Pearson chi-square statistic. k (n j np j ) 2. np j.

3 Joint Distributions 71

Preliminaries. Copyright c 2018 Dan Nettleton (Iowa State University) Statistics / 38

This does not cover everything on the final. Look at the posted practice problems for other topics.

Multivariate Random Variable

Partitioned Covariance Matrices and Partial Correlations. Proposition 1 Let the (p + q) (p + q) covariance matrix C > 0 be partitioned as C = C11 C 12

Preface Introduction to Statistics and Data Analysis Overview: Statistical Inference, Samples, Populations, and Experimental Design The Role of

SINGULAR PERTURBATIONS FOR QUASIUNEAR HYPERBOUC EQUATIONS. a " m a *-+».(«, <, <,«.)

Modeling and Performance Analysis with Discrete-Event Simulation

Computer Science, Informatik 4 Communication and Distributed Systems. Simulation. Discrete-Event System Simulation. Dr.

Chapter 7 - Section 8, Morris H. DeGroot and Mark J. Schervish, Probability and Statistics, 3 rd

Gaussian vectors and central limit theorem

Lecture 3: Linear Models. Bruce Walsh lecture notes Uppsala EQG course version 28 Jan 2012

STA 2101/442 Assignment 3 1

Summary of basic probability theory Math 218, Mathematical Statistics D Joyce, Spring 2016

Probability Theory and Statistics. Peter Jochumzen

2. Matrix Algebra and Random Vectors

Exam P Review Sheet. for a > 0. ln(a) i=0 ari = a. (1 r) 2. (Note that the A i s form a partition)

Formulas for probability theory and linear models SF2941

Practice Problems Section Problems

ESTIMATION OF PARAMETERS OF TÖ R NQ UIST S FUNCTIONS WITH NEW TON-RAPH SO N S M ETHOD

Notes, March 4, 2013, R. Dudley Maximum likelihood estimation: actual or supposed

Economics 573 Problem Set 5 Fall 2002 Due: 4 October b. The sample mean converges in probability to the population mean.

Inferences on a Normal Covariance Matrix and Generalized Variance with Monotone Missing Data

Methods for Identifying Out-of-Trend Data in Analysis of Stability Measurements Part II: By-Time-Point and Multivariate Control Chart

Today. Probability and Statistics. Linear Algebra. Calculus. Naïve Bayes Classification. Matrix Multiplication Matrix Inversion

Random Process. Random Process. Random Process. Introduction to Random Processes

Structural Reliability

PLISKA STUDIA MATHEMATICA BULGARICA

A C T A U N I V E R S I T A T I S L O D Z I E N S I S FOLIA OECONOMICA 90, 1989 INFLUENTIAL OBSERVATIONS TN TUE GENERALIZED ANALYSIS OF VARIANCE MODEL

McGill University. Faculty of Science. Department of Mathematics and Statistics. Part A Examination. Statistics: Theory Paper

TUTORIAL 8 SOLUTIONS #

Random Vectors and Multivariate Normal Distributions

ON THE RESAMPLING METHOD IN SAMPLE MEDIAN ESTIMATION

Stat 206: Sampling theory, sample moments, mahalanobis

Topic 7 - Matrix Approach to Simple Linear Regression. Outline. Matrix. Matrix. Review of Matrices. Regression model in matrix form

Test Problems for Probability Theory ,

Multivariate Analysis and Likelihood Inference

Delta Method. Example : Method of Moments for Exponential Distribution. f(x; λ) = λe λx I(x > 0)

TABLE OF CONTENTS CHAPTER 1 COMBINATORIAL PROBABILITY 1

. Find E(V ) and var(v ).

Transcription:

ACTA UNIVERSITATIS LODZIENSIS FOLIA OECONOM ICA 206, 2007 Grzegorz Kończak* JO INT M ONITORING OF THE M EAN, DISPERSION AND ASYMMETRY IN PROCESS CONTROL Abstract. In this paper the quadratic form o f sample mean, sample variance and sample asymmetry is considered. This quadratic form can be used for testing hypothesis on the expected value, variance and asymmetry of observed variable. This statistic can be used for constructing a control chart for monitoring these three parameters in process control or in acceptance sampling by variables. It is very difficult to find the exact distribution o f the proposed statistic. The asymptotic distribution o f the proposed statistic is presented. The quantiles o f exact distributions have been derived using M onte Carlo study for the case o f normal distribution of the monitored variable. Key words: test, mean, variance, asymmetry, distribution. 1. INTRODUCTION The control charts arc developed by W. A. Shewhart and were first used to m onitor a process mean. The charts as X S and X R were used to m onitor the process mean and its variance in two separated charts. There were some propositions for joint m onitoring of these two parameters. F. F. G a n (2000) considered Hotelling T type control chart and F. F. G a n (1995) considered joint monitoring of sample mean and sample variance using exponentially weighted moving average control charts. G. Kończak and J. Wywial (2001) proposed a modification of control chart X S. They added to the standard chart a third chart to monitor asymmetry. * Ph.D., Department o f Statistics, University of Economica Katowice, e-mail: koncz- @ ae.katowi ce.pl.

The quadratic form of the sample mean, sample variance and sample asymmetry is considered in this paper. This quadratic form can be applied as the test statistic for the hypothesis on expected value, variance and asymmetry of normal distribution. The distribution function of this statistic leads to x2 distribution with 3 degrees of freedom, if sample size leads to infinity. The critical values for this statistic in the case of small size sample were found using M onte Carlo method. The table with approximated critical values of the test statistic has been prepared for three standard significance levels. Using these tables the hypothesis on expected value, variance and asymmetry of a diagnostic variable can be tested when sample size is greater or equal to 4. 2. BASIC DEFINITIONS Let X be the random variable which has moments of at least 6й1 order. Let us consider the random variable vector X X = (X-EX)2 (X - EX)3 (1) The first component of the vector X measures the mean of the random variable X, the second one dispersion and the third one asymmetry. Let us denote the expected value of the random variable X by and the r-th central m om ent of the variable X E(X) = Ц (2) The second central m oment is denoted by Щ Х -Е XY = rtr (3) E(X - EX)2 = г/г = a2 On the basis o f expressions (2)-(4) we have

The covariance m atrix o f this vector is as follows Пг Пз >74 I ( X ) = //3 tją - ц\ rj5-rj2rj3 (6) JIa - ЧгЧъ Пь ~ 7?. Let X {, X2, X n be the simple sample from the population. Basing on this sample we would like to test a hypothesis ' EX Mo Я : E(X - EX)2 = oo E(X - EX)3.»/3,0. where the alternative hypothesis is as follows Let us consider the following vector Vn X (8) where I is a sample m ean given by (9) S2 is a sample variance C3 is the sample third central moment

In the case of small size of the sample we can use following variance and third m om ent estimators S2= - 4 i(x-x)2 (12) n ~ 1 /-i <13> To find the covariance m atrix of vector V we use the formulas from H. Cramer (1958) cov(x, Ck(x)) =~(rik+i-kr]2rik_l) + 0(n-'1) (14) n COv(Ck(x), Ck(x)) = ^ (t]k+s - + ksrj1rjk_,>7,-1 - kt]k_ltji+l - + «7*+1'7.,- ) + 0(и~2) (15) where Cr is the r-th sample central m om ent given by i - l The covariance m atrix of the vector V takes form h Уз У a - 3 722 Уз У A- y l У 5 - ЬУгУъ У 4-3^1 í7j - 4/72/73-6 а д 4 + 9^2 -?/з2. (16) If the param eters ß and a of the random variable X are unknown the covariance matrix (16) can be estimated using the following estimator of Z : Г C2 C3 C4-3 C? Ź = - C3 C4 C\ C5-4 C 2C3 (17) n [ c 4-3 S 4 Cj 4C2C3 C6-6C2C4 + 9C - C32_

T o test the hypothesis (7) we can use the W ald type statistic Z Zn [X /л, Sn o',c 3 ji3] X - Ц - S i - «1 c 3 Рз (18) This statistic has asymptotic (if n leads to infinity) chi square distribution with 3 degrees of freedom. If the parameters of the random variables X are unknown we can construct the test statistic Z * based on the estimator of the covariance m atrix Z'n = [ X - ß, S l - a \ С3- Д 3] % -1 X - Ц ' Sn ^ 3 /4 (19) If n leads to infinity the distribution leads to T 2 Hotelling distribution (e.g. Fuchs, Kenett, 1998) Using the following dependecy z : - t \ n-*oo f p F T 2~ f - p + l pj-p+l (20) where / = n 1 and n 1 >p, the critical values for the statistics (19) can be found. 3. THE CASE OF THE NORM AL DISTRIBUTION Let us assume that the simple sample is taken from the normal distribution with the mean ц and the standard deviation a. We can write this as follows X: N(p, a2). In this case the expected value and the со variance m atrix of the vector V statistic are following:

and a 2 0 0 ' 0 2a 4 0 0 0 15a6 (22) The inverse m atrix o f the covariance m atrix is following: '30a4 0 O' E - = 30a6 0 15a2 0 0 0 2 (23) The statistic (18) takes form [30a \X -м )2+ 15a2(S2 - a2)2 + 2C32] 30a6 (24) or equally it can be written 7 _ Г3 0 ( Г - А)2 15(S2 er) 2C3 " I i* 2 30a4 30a6 30a2 ] (25) This statistic has assymptotic chi square distribution with 3 degrees of freedom (eg. Fuchs, Kenett, 1998). 4. M ONTE CARLO STUDY Because it is very difficult to find the exact distribution of the statistic Z there were made M onte Carlo study. For sample sizes of n = 4,5,...,20 there were found quantiles of this statistic. The quantiles were found for three significance levels 0.10, 0.05 and 0.01. The run of computer simulations was following: 1) there were generated values of random variables X t,x 2,...,Xn from normal distribution with mean ß = 100 and standard deviation a = 5, 2) for each n from 4 to 20 step 1 was repeated 10 000 times. The values of the statistic Z were calculated, 3) the quantiles 0.90, 0.95 and 0.99 from empirical distribution were accepted as estimates of quantiles of statistic Z.

The results of M onte Carlo study are presented in Tab. 1. The Fig. 1 presents the quantiles of the statistics Z for the significance levels 0.1, 0.05 and 0.01. These quantiles were estimated for sample sizes from 4 to 20. Table l Quantiles of the statistic Z - the results of M onte Carlo study Sample size Quantil q(l a) n a = 0.1 a = 0.05 a = 0.01 4 4.25 5.85 10.97 5 4.29 5.78 11.25 6 4.39 6.07 11.92 7 4.43 6.19 11.75 8 4.53 6.28 11.49 9 4.59 6.36 11.54 10 4.67 6.41 11.42 11 4.84 6.52 11.66 12 4.77 6.39 11.26 13 4.78 6.51 11.52 14 4.8 6.36 11.05 15 4.69 6.41 10.76 16 4.93 6.51 11.20 17 4.83 6.32 11.69 18 4.94 6.74 11.17 19 4.93 6.43 11.01 20 4.87 6.37 10.85 Source: own calculations.

- -»Штттт0)Ш*,щ * фщ 4-0 - ---- 1------- 1------- 1------- 1------- 1------- 1------- 1------- 1------- 1------- 1------- 1------- 1------- 1 1 1 1 1 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 n ----------- a = 0.1 a = 0.05 - - - a = 0.01 Fig. 1. Quantiles o f the statistic Z n Source: own calculations based on Tab. 1. 5. APPLICATIONS Acceptance sampling is commonly used by manufacturers to screen incoming lots of material for an excessive number of defective units. There are two types of acceptance sampling plans: for attributes and for variables. The proposed statistic can be used in acceptance sampling plans by variables. Let us assume that there is a mixture of two subsamples in the incoming lot. Let the characteristic of the first subsample has normal distribution with mean p, and standard deviation er, and the second subsample with mean ц г and standard deviation er2. We can then write that X, : N ( /i,,o -,) and X 2: N(/i2,o-2). Let the fraction of units from the first subsample be denoted by a and the fraction of units from the second subsample by 1 a. Let /,(x ) and f 2(x) are the density functions of random variables X, and X 2. The density functions of the mixture of random variables will be then as follows: f(x) = a/, (x) + ( 1 - a]f2(x) (26) The expected value o f this mixture will be given by E(X) = a/i, + (1 - a)/i2 (27)

and the variance D 2(X) = аст,2 + (1 - a)a\ + a(l - a)gu, - ц2)2 (28) The third central moment of the random variable X is following: C3(X) = а(1 - a)(ß2 - //,)[(2o - l)(ß2 - цх)2 -L Ъ(а\ - а,2)] (29) The Fig. 2 presents the density functions of random variables X 2 and their mixtures in the case where р{<ц2 and at > a2. Under these assumptions and if а = 0.5 we have C3(x) < 0. The third central moment will inform us that the incoming lot isn t homogeneous and the hypothesis (7) m ay be rejected. Fig. 2. The density function o f the mixture of two random variables Source: own calculations. 6. SUM M ARY The proposed statistic Z we can use to test the hypothesis on expected value, variance and asymmetry of random variable. We can use this test in quality control procedures especially in acceptance sampling. When we have a sample which is a mixture of some subsamples with different means and standard deviations it is possible that mean is equal to /.i and variance is equal to cr2. In such situations the third central moment can give inform ation to reject the sample.

REFERENCES Cramer H. (1958), M etody matematyczne w statystyce, PW N, Warszawa. Fuchs C., Kene 11 R. S. (1998), Multivariate Quality Control, Marcel Dekker, New Y ork-basel-h ong Kong. G a n F. F. (1995), Joint Monitoring o f Process Mean and Variance Using Exponentially Weighted Moving Average Control Charts, Technom etrics, 37, 446-453. G a n F. F. (2000), Joint Monitoring o f Process Mean and Variance Based on the Exponentially Weighted Moving Averages, [in:] Statistical Process Monitoring and Optimization, Marcel Dekker, N ew York-Base), 189-208. Kończak G., Wywiał J. (2001), O pewnej m odyfikacji karty kontrolnej Х -S, [in:] K. Jajuga, M. W a l e s i a k (red.), Klasyfikacja i analiza danych - teoria i zastosowania, Taksonom ia, 8, (AE, Wrocław), 140-148. Grzegorz Kończak ŁĄCZNE M ONITO ROW ANIE ŚREDNIEJ, W ARIANCJI I ASYM ETRII W PROCEDURACH KONTROLI JAKOŚCI W artykule analizowano rozkład formy kwadratowej średniej, wariancji oraz asymetrii z próby. Przyjęto założenie, że próba pochodzi z populacji o rozkładzie normalnym. Proponowana statystyka może być wykorzystana w procesach sterowania jakością do jednoczesnego m onitorowania poziomu przeciętnego, rozproszenia oraz asymetrii rozkładu badanej zmiennej. Trudne jest wyznaczenie rozkładu dokładnego proponowanej statystyki. Podano asymptotyczny rozkład rozważanej zmiennej oraz, wykorzystując symulacje komputerowe, wyznaczono kwantyle dla rozkładów dokładnych, uwzględniając różne liczebności prób.