Using the IML Procedure to Examine the Efficacy of a New Control Charting Technique

Similar documents
Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators

Since X n /n P p, we know that X n (n. Xn (n X n ) Using the asymptotic result above to obtain an approximation for fixed n, we obtain

R. van Zyl 1, A.J. van der Merwe 2. Quintiles International, University of the Free State

1 Introduction to reducing variance in Monte Carlo simulations

Simulation. Two Rule For Inverting A Distribution Function

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.

AAEC/ECON 5126 FINAL EXAM: SOLUTIONS

A new distribution-free quantile estimator

Lecture Note 8 Point Estimators and Point Estimation Methods. MIT Spring 2006 Herman Bennett

Comparing Two Populations. Topic 15 - Two Sample Inference I. Comparing Two Means. Comparing Two Pop Means. Background Reading

CONTROL CHARTS FOR THE LOGNORMAL DISTRIBUTION

Random Variables, Sampling and Estimation

Department of Mathematics

Goodness-Of-Fit For The Generalized Exponential Distribution. Abstract

Bayesian Control Charts for the Two-parameter Exponential Distribution

Stat 421-SP2012 Interval Estimation Section

Problem Set 4 Due Oct, 12

Estimation of Gumbel Parameters under Ranked Set Sampling

ECO 312 Fall 2013 Chris Sims LIKELIHOOD, POSTERIORS, DIAGNOSING NON-NORMALITY

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n.

1.010 Uncertainty in Engineering Fall 2008

Output Analysis (2, Chapters 10 &11 Law)

Data Analysis and Statistical Methods Statistics 651

Monte Carlo method and application to random processes

Chapter 6 Sampling Distributions

Confidence Interval for Standard Deviation of Normal Distribution with Known Coefficients of Variation

Maximum likelihood estimation from record-breaking data for the generalized Pareto distribution

Econ 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chi-square Distribution, Student s t distribution 1.

A proposed discrete distribution for the statistical modeling of

Power Comparison of Some Goodness-of-fit Tests

Soo King Lim Figure 1: Figure 2: Figure 3: Figure 4: Figure 5: Figure 6: Figure 7:

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4

Expectation and Variance of a random variable

1036: Probability & Statistics

Estimation for Complete Data

Mathematical Statistics - MS

c. Explain the basic Newsvendor model. Why is it useful for SC models? e. What additional research do you believe will be helpful in this area?

Clases 7-8: Métodos de reducción de varianza en Monte Carlo *

Discrete Mathematics for CS Spring 2008 David Wagner Note 22

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY

Chapter 6. Sampling and Estimation

Stat 319 Theory of Statistics (2) Exercises

Lecture 6 Chi Square Distribution (χ 2 ) and Least Squares Fitting

Extreme Value Charts and Analysis of Means (ANOM) Based on the Log Logistic Distribution


ENGI 4421 Probability and Statistics Faculty of Engineering and Applied Science Problem Set 1 Solutions Descriptive Statistics. None at all!

Lecture 6 Chi Square Distribution (χ 2 ) and Least Squares Fitting

x = Pr ( X (n) βx ) =

If, for instance, we were required to test whether the population mean μ could be equal to a certain value μ

Modified Lilliefors Test

CHAPTER 4 BIVARIATE DISTRIBUTION EXTENSION

Let us give one more example of MLE. Example 3. The uniform distribution U[0, θ] on the interval [0, θ] has p.d.f.

Sample Size Determination (Two or More Samples)

6 Sample Size Calculations

Lecture 7: Properties of Random Samples

Statistical inference: example 1. Inferential Statistics

Direction: This test is worth 250 points. You are required to complete this test within 50 minutes.

Review Questions, Chapters 8, 9. f(y) = 0, elsewhere. F (y) = f Y(1) = n ( e y/θ) n 1 1 θ e y/θ = n θ e yn

Confidence Intervals

1 Inferential Methods for Correlation and Regression Analysis

MOMENT-METHOD ESTIMATION BASED ON CENSORED SAMPLE

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures

Hypothesis Testing. Evaluation of Performance of Learned h. Issues. Trade-off Between Bias and Variance

Overview. p 2. Chapter 9. Pooled Estimate of. q = 1 p. Notation for Two Proportions. Inferences about Two Proportions. Assumptions

THE SYSTEMATIC AND THE RANDOM. ERRORS - DUE TO ELEMENT TOLERANCES OF ELECTRICAL NETWORKS

Minimax Estimation of the Parameter of Maxwell Distribution Under Different Loss Functions

This exam contains 19 pages (including this cover page) and 10 questions. A Formulae sheet is provided with the exam.

Goodness-Of-Fit For The Generalized Exponential Distribution. Abstract

Statistical Fundamentals and Control Charts

7-1. Chapter 4. Part I. Sampling Distributions and Confidence Intervals

Unbiased Estimation. February 7-12, 2008

There is no straightforward approach for choosing the warmup period l.

MONTE CARLO VARIANCE REDUCTION METHODS

Confidence Intervals For P(X less than Y) In The Exponential Case With Common Location Parameter

Using SAS to Evaluate Integrals and Reverse Functions in Power and Sample Size Calculations

6. Sufficient, Complete, and Ancillary Statistics

Direction: This test is worth 150 points. You are required to complete this test within 55 minutes.

1 Models for Matched Pairs

ESTIMATION AND PREDICTION BASED ON K-RECORD VALUES FROM NORMAL DISTRIBUTION

ENGI 4421 Confidence Intervals (Two Samples) Page 12-01

Statistical Theory MT 2008 Problems 1: Solution sketches

PSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 9

CHAPTER 10 INFINITE SEQUENCES AND SERIES

Econ 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara

Double Stage Shrinkage Estimator of Two Parameters. Generalized Exponential Distribution

Trimmed Mean as an Adaptive Robust Estimator of a Location Parameter for Weibull Distribution

Control Charts for Mean for Non-Normally Correlated Data


Element sampling: Part 2

CEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering

Output Analysis and Run-Length Control

Some Properties of the Exact and Score Methods for Binomial Proportion and Sample Size Calculation

( µ /σ)ζ/(ζ+1) µ /σ ( µ /σ)ζ/(ζ 1)

STAT 350 Handout 19 Sampling Distribution, Central Limit Theorem (6.6)

An Introduction to Randomized Algorithms

Statistical Theory MT 2009 Problems 1: Solution sketches

Chapter 6 Principles of Data Reduction

Non-life insurance mathematics. Nils F. Haavardsson, University of Oslo and DNB Skadeforsikring

Transcription:

Paper 2894-2018 Usig the IML Procedure to Examie the Efficacy of a New Cotrol Chartig Techique Austi Brow, M.S., Uiversity of Norther Colorado; Bryce Whitehead, M.S., Uiversity of Norther Colorado ABSTRACT For may years, cotrol charts have bee utilized to moitor processes, improve quality, ad icrease profitability. However, the body of literature tilts overwhelmigly toward charts moitorig ormally distributed processes. I practice, the uderlyig distributio of a process may ot follow a ormal distributio, ad may of those techiques may ot be most effective. Mukherjee, McCracke, ad Chakraborti (2015) suggested three cotrol charts for simultaeous moitorig of the locatio ad scale parameters for processes followig the shifted expoetial distributio. This study examies their proposed, Shifted Expoetial Maximum Likelihood Estimator-Max Chart, (SEMLE-max) ad suggests utilizatio of pealized maximum likelihood estimators (MLE) istead of traditioal MLEs because of ubiasedess ad miimum variace amog ubiased estimators. The ew chart, the Pealized SEMLEmax chart is costructed usig similar methodology, ad simulated data is used to compare average ru legths of the proposed chart to those obtaied by the origial chart. INTRODUCTION I the work published by Mukherjee, McCracke, ad Chakraborti (2015), ample justificatio was give to the creatio ad practical implemetatio of o-ormally distributed statistical process cotrol charts. Specifically, the authors focus was upo moitorig time-to-evet processes, as frequetly modeled by the shifted expoetial distributio: f(x) = 1 θ e 1 θ (x η), x > η, < η < I the publicatio, the authors itroduced three cotrol charts explicitly desiged for such processes. The cotrol chart of iterest i this study is their proposed, Shifted Expoetial Maximum Likelihood Estimator Max Chart, or SEMLE-max. I this chartig scheme, the maximum likelihood estimators (MLE) for the scale ad locatio parameters are used to build two plottig statistics based upo the stadard ormal distributio. I order to avoid issues arisig from joit moitorig of two processes, they authors suggest usig a sigle plottig statistic for each respective sample take, which is the absolute value of the maximum of the two respective statistics. The drawback to usig the traditioal MLEs i this circumstace is that they are biased estimators of the parameters. Sarha (1954) derived two ew estimators for the scale ad locatio parameters of the shifted expoetial distributio, usig a least squares techique, which are ubiased ad uiformly miimum variace for ubiased estimators. Zheg (2013) showed these estimators ca be derived from a pealized likelihood fuctio. Thus, i this study we recommed utilizig Sarhas estimators as opposed to the traditioal MLEs for the desirable qualities aforemetioed. We will use Mote Carlo simulatio methods i PROC IML to compare the i-cotrol ad out-of-cotrol average ru legths (IC-ARL & OOC-ARL) betwee the origial chart, ad the proposed modified chart. ESTIMATORS Similar to the derivatio of the traditioal MLEs for the scale ad locatio paramters ( θ ML = X X 1:, η ML = X 1: ), Zheg (2013) itroduced a pealty to the likelihood fuctio of the radom sample. The purpose of the pealty is with specific respect to the overestimatio of η, as P[η ML = X 1: > η] = 1. Thus, 1

the differece betwee the miimum order statistic ad η is defied to be the likelihood fuctio s pealty, ad this differece scales the likelihood fuctio, as follows: (X 1: η)l(θ, η) = (X 1: η) 1 θ e 1 θ (x i η) Usig the traditioal MLE derivatio techique, the pealized MLEs for η ad θ are: Ad: l(l(θ, η)) θ l(l(θ, η)) η i=1 = θ + 1 θ 2 (x i η ) = 0 i=1 θ = 1 (x i η ) i=1 1 = X 1: η + θ = 0 η = X 1: θ Pluggig η ito the equatio for θ, the estimators the become: These estimators are ubiased: η = X 1: X 1 θ = 1 (X X 1: ) E[η ] = E [ X 1: X 1 ] = 1 1 (E[X 1: ] E[X ]) = 1 1 ( (θ + η) η θ) = η E[θ ] = E [ 1 (X X 1: )] = 1 (E[X ] E[X 1: ]) = ( 1) (θ + η θ ) = θ Sarha (1954) also proved these estimators to be uiformly miimum variace amog the ubiased estimators. It should also be oted that θ is distributed as a gamma radom variable, with parameters θ/( 1) ad κ = 1, early idetically to the origial MLE, which was show i Mukherjee et all (2015). A closed form for the distributio of η could ot be derived, ad thus, its desity was estimated usig simulatio. CONSTRUCTION OF THE CONTROL CHART As oted i Mukherjee et al (2015), there are a array of issues associated with joitly moitorig two or more ukow parameters. Oe solutio proposed i their work was calculatig two idetically distributed statistics, ad desigatig the plottig statistic for a give sample as the maximum betwee the two. These two statistics are stadardized (as they are ot idetically distributed i their origial form) usig the iverse cumulative desity fuctio (CDF) of the stadard ormal distributio, i.e.: B 1 = Φ 1 (F(x)) B 2 = Φ 1 (G(x)) 2

Where Φ 1 deotes the iverse CDF of the stadard ormal distributio, F(x) deotes the CDF of η, ad G(x) deotes the CDF of θ. For a give sample of the process beig moitored, both B 1 ad B 2 will be calculated ad the maximum of their absolute value, say M i, becomes the plottig statistic. Mathematically, M i = max( B 1, B 2 ) Sice oly positive values are yielded by the plottig statistic, oly a upper cotrol limit (UCL) is ecessary to specify. For a desired i-cotrol average ru legth (IC-ARL), called ARL 0, Mukherjee et al (2015) showed the UCL ca be determied by: UCL = Φ 1 (0.5 (1 + 1 (ARL 0 ) 1 )) Thus, for a desired IC-ARL of 500, the UCL to be used is 3.29. Note, this IC-ARL will be used i comparig out-of-cotrol ARL (OOC-ARL) performace of the modified chart to the chart origially proposed i Mukherjee et al (2015). Therefore, for a give sample, the process is deemed OOC whe M i > UCL. Of ote, a OOC sigal ca assist i idicatig which of the two parameters has shifted away from its specified IC value. Whe M i = B 1 > UCL, this implies η may have shifted, ad coversely, whe M i = B 2 > UCL, this implies θ may have shifted. However, whe both B 1 ad B 2 are greater tha the UCL, it becomes difficult to determie which of the two values (or possibly both) is sigalig the OOC without physical ivestigatio. SIMULTATION METHODS & RESULTS Give the guidelies for the cotrol charts costructio as outlied i the previous sectio, the code for the simulatio was writte i similar steps. First, a radom sample from the OOC shifted expoetial distributio had to be simulated. This was performed usig the iverse probability itegral trasform. This trasformatio utilizes the kowledge of a radom variables cumulative desity fuctio (CDF) beig distributed uiformly betwee 0 & 1 (i.e., Y = F(x) UNIF(0,1)). Values from the uiform distributio are evaluated i a give radom variables iverse CDF. I this case, the shifted expoetial distributios CDF was set equal to Y ad solved for X as give by: Solvig for X: Y = 1 e 1 θ (x η) X = η θ l(1 Y) Thus, values radomly sampled from UNIF(0,1) are iput for Y, ad the resultig X values are distributed as the shifted expoetial for a give value of η ad θ. Usig two DO loops i the DATA step, 10,000 samples, each of sample size = 5, were simulated, ad the estimators for the respective parameters from the OOC distributio were obtaied usig PROC MEANS. At this poit, the plottig statistic for θ, B 2, was obtaied usig the PROBCHI ad QUANTILE fuctios i the DATA step. Here, the probability associated with a give observed value of θ (usig PROBCHI) was iputted ito the iverse CDF of the stadard ormal distributio (i.e., QUANTILE( NORMAL ) ), ad the absolute value of the resultig z-score is the value B 2 for a give sample. Seeig as the desity of η was estimated usig simulatio by ecessity, a hadful of extra procedures were implemeted i order to obtai its plottig statistic, B 1. Usig the same procedure described above, a radom sample from the IC shifted expoetial distributio was obtaied. This radom sample acts as the desity for η. Usig PROC IML, the OOC value of η from each respective sample was iteratively compared to the simulated IC distributio i order to obtai a pseudo-probability. Z-scores for this vector of pseudo-probabilities was obtaied i order to yield the plottig statistic B 1 i a early idetical maer as described for B 2. Now that values for B 1 ad B 2 for each of the respective 10,000 samples are obtaied, they ca be plotted agaist the specified UCL of 3.29 i order to evaluate the effectiveess of these ew estimators. This process was performed iteratively usig PROC IML. Whe a sample plotted above the UCL, thus sigalig a OOC process, the sample umber was 4 recorded, ad the mea differece betwee each 3

observed OOC poit was take to be the OOC-ARL for a give pair of OOC values of θ ad η. The etire aforemetioed process was performed 10 times for each pair of OOC values of θ ad η, ad mea of those results were take to be the OOC-ARL used for compariso agaist the Mukherjee et al (2015) chart s performace. The procedure described above was performed for 20 paris of OOC values of η ad θ, where the specified IC values were η = 0 ad θ = 1, respectively, ad the sample size take was = 5. The results of this study are give i the below table, deoted by ARL S, ad the results obtaied i the origial study are deoted ARL C. Table 1: Compariso of Simulated OOC-ARL η θ ARL S ARL C 0 0.5 170.54 141.33 0.25 0.5 175.81 164.64 0.5 0.5 153.01 163.90 1 0.5 51.10 85.95 1.25 0.5 3.87 1.21 0 0.75 578.82 462.96 0.25 0.75 492.24 599.91 0.5 0.75 331.09 390.97 1 0.75 25.06 30.64 1.25 0.75 2.77 1.14 0 1 483.97 498.67 0.25 1 396.12 359.83 0.5 1 218.77 141.72 1 1 15.90 13.23 1.25 1 4.24 1.10 0 1.25 109.60 136.85 0.25 1.25 128.53 91.62 0.5 1.25 66.25 46.44 1 1.25 13.41 7.78 1.25 1.25 4.01 1.09 CONCLUSION As show i Table 1, the utilizatio of the pealized MLEs give improved performace over that of the stadard estimators for the dowward shifts i θ coupled with icreases i η. Both chartig schemes, as the locatio parameter shifts upward away from its omial IC value, are highly effective i quickly detectig these large shifts, which is a similar coclusio draw i Mukherjee et al (2015). However, cosiderig the small scale, oe may assume the aalyzed OOC shifts are small, but the magitude of the shifts is quite large. While the proposed modificatios to the estimators yield improvemets i some cases, such as the istace whe OOC value of θ = 0.75 ad η = 0.5, a ))C-ARL of 331.09, while a improvemet over 390.97, is still far too slow for a 25% shift i θ ad a icreased η. 4

A big cotributor to this relatively slow performace is the costructio of the chart itself. While creatively thought out ad mathematically elegat, this techique leds itself oly to detectio of very extreme values, as evideced by the quick detectio rate at large OOC η. However, cosider the case of θ. Sice it ca be trasformed ito a χ 8 2 radom variable i the case whe = 5, it ca easily be determied what the z-score would be for a particular quatile of the χ 8 2 distributio. Say θ is perfectly estimated to be 0.50, a 50% reductio i the IC value of θ. The probability associated with this estimate is about 0.24, which is associated with a z-score of about -0.70, which certaily would ot sigal a shift i θ. I fact, settig the UCL to 3.29 suggests that i order for a OOC sigal to be detected, the lower-tailed probability associated with our estimators would have to be approximately 0.9995. Thus, as show by the results i Table 1, uless η has shifted substatially from 0, it is highly improbable to detect a OOC shift. Cosequetly, it would ot be recommeded to implemet this scheme with such a high UCL uless specific circumstaces warrat it. While a smaller UCL (for example, 2.78) would sacrifice the IC performace of the chartig scheme i both istaces, it would make up the differece by beig far more sesitive to shifts i either parameter. 5

REFERENCES Mukherjee, A., McCracke, A., & Chakraborti, S. (2015). Cotrol charts for simultaeous moitorig of parameters of a shifted expoetial distributio. Joural of Quality Techology, 47(2), 176-192. Sarha, A.E. (1954). Estimatio of the Mea ad Stadard Deviatio by Order Statistics. The Aals of Mathematical Statistics, Vol. 25, 317-328 Zheg, M. (2013). Pealized Maximum Likelihood Estimatio of Two-parameter Expoetial Distributios (Doctoral dissertatio, Master Thesis, Uiversity of Miesota). 6

CONTACT INFORMATION Your commets ad questios are valued ad ecouraged. Cotact the authors at: Austi Brow & Bryce Whitehead Uiversity of Norther Colorado 970-351-2807 austi.brow@uco.edu & bryce.whitehead@uco.edu www.uco.edu/cebs/asrm SAS ad all other SAS Istitute Ic. product or service ames are registered trademarks or trademarks of SAS Istitute Ic. i the USA ad other coutries. idicates USA registratio. Other brad ad product ames are trademarks of their respective compaies. 7

8