The Robustness of the Multivariate EWMA Control Chart

Similar documents
A Power Analysis of Variable Deletion Within the MEWMA Control Chart Statistic

Nonparametric Multivariate Control Charts Based on. A Linkage Ranking Algorithm

An Investigation of Combinations of Multivariate Shewhart and MEWMA Control Charts for Monitoring the Mean Vector and Covariance Matrix

Directionally Sensitive Multivariate Statistical Process Control Methods

Robustness of the EWMA control chart for individual observations

On the Distribution of Hotelling s T 2 Statistic Based on the Successive Differences Covariance Matrix Estimator

Songklanakarin Journal of Science and Technology SJST R1 Sukparungsee

Module B1: Multivariate Process Control

Multivariate Change Point Control Chart Based on Data Depth for Phase I Analysis

MYT decomposition and its invariant attribute

Design and Implementation of CUSUM Exceedance Control Charts for Unknown Location

The Effect of Level of Significance (α) on the Performance of Hotelling-T 2 Control Chart

An Adaptive Exponentially Weighted Moving Average Control Chart for Monitoring Process Variances

A Multivariate EWMA Control Chart for Skewed Populations using Weighted Variance Method

MCUSUM CONTROL CHART PROCEDURE: MONITORING THE PROCESS MEAN WITH APPLICATION

A multivariate exponentially weighted moving average control chart for monitoring process variability

Multivariate Charts for Multivariate. Poisson-Distributed Data. Busaba Laungrungrong

COMPARISON OF MCUSUM AND GENERALIZED VARIANCE S MULTIVARIATE CONTROL CHART PROCEDURE WITH INDUSTRIAL APPLICATION

ON CONSTRUCTING T CONTROL CHARTS FOR RETROSPECTIVE EXAMINATION. Gunabushanam Nedumaran Oracle Corporation 1133 Esters Road #602 Irving, TX 75061

DIAGNOSIS OF BIVARIATE PROCESS VARIATION USING AN INTEGRATED MSPC-ANN SCHEME

Generalized Variance Chart for Multivariate. Quality Control Process Procedure with. Application

Detecting Assignable Signals via Decomposition of MEWMA Statistic

A Multivariate Process Variability Monitoring Based on Individual Observations

A problem faced in the context of control charts generally is the measurement error variability. This problem is the result of the inability to

Research Article Robust Multivariate Control Charts to Detect Small Shifts in Mean

Confirmation Sample Control Charts

Monitoring General Linear Profiles Using Multivariate EWMA schemes

Chart for Monitoring Univariate Autocorrelated Processes

On the Hotelling s T, MCUSUM and MEWMA Control Charts Performance with Different Variability Sources: A Simulation Study

Monitoring Production Processes Using Multivariate Control Charts

Retrospective Hotelling s T 2 Control Chart for Automotive Stamped Parts: a Case Study

Distribution-Free Monitoring of Univariate Processes. Peihua Qiu 1 and Zhonghua Li 1,2. Abstract

THE DETECTION OF SHIFTS IN AUTOCORRELATED PROCESSES WITH MR AND EWMA CHARTS

A New Bootstrap Based Algorithm for Hotelling s T2 Multivariate Control Chart

Multivariate Process Control Chart for Controlling the False Discovery Rate

An exponentially weighted moving average scheme with variable sampling intervals for monitoring linear profiles

A Modified Poisson Exponentially Weighted Moving Average Chart Based on Improved Square Root Transformation

A MACRO FOR MULTIVARIATE STATISTICAL PROCESS CONTROL

Robust economic-statistical design of multivariate exponentially weighted moving average control chart under uncertainty with interval data

Bootstrap-Based T 2 Multivariate Control Charts

Self-Starting Control Chart for Simultaneously Monitoring Process Mean and Variance

Hotelling s T 2 charts with variable sample size and control limit

Received 17 November 2004; Revised 4 April 2005

A Study on the Power Functions of the Shewhart X Chart via Monte Carlo Simulation

System Monitoring with Real-Time Contrasts

On Monitoring Shift in the Mean Processes with. Vector Autoregressive Residual Control Charts of. Individual Observation

21.1 Traditional Monitoring Techniques Extensions of Statistical Process Control Multivariate Statistical Techniques

arxiv: v1 [stat.me] 14 Jan 2019

Quality Control & Statistical Process Control (SPC)

A New Demerit Control Chart for Monitoring the Quality of Multivariate Poisson Processes. By Jeh-Nan Pan Chung-I Li Min-Hung Huang

A first look at the performances of a Bayesian chart to monitor. the ratio of two Weibull percentiles

AN ANALYSIS OF SHEWHART QUALITY CONTROL CHARTS TO MONITOR BOTH THE MEAN AND VARIABILITY KEITH JACOB BARRS. A Research Project Submitted to the Faculty

Multi-Variate-Attribute Quality Control (MVAQC)

JINHO KIM ALL RIGHTS RESERVED

2.830J / 6.780J / ESD.63J Control of Manufacturing Processes (SMA 6303) Spring 2008

A STUDY ON IMPROVING THE PERFORMANCE OF CONTROL CHARTS UNDER NON-NORMAL DISTRIBUTIONS SUN TINGTING

MONITORING BIVARIATE PROCESSES WITH A SYNTHETIC CONTROL CHART BASED ON SAMPLE RANGES

Construction of An Efficient Multivariate Dynamic Screening System. Jun Li a and Peihua Qiu b. Abstract

Control charts continue to play a transformative role in all walks of life in the 21st century. The mean and the variance of a

Variations in a manufacturing process can be categorized into common cause and special cause variations. In the presence of

A Simulation Study Of The Impact Of Forecast Recovery For Control Charts Applied To ARMA Processes

Performance of Conventional X-bar Chart for Autocorrelated Data Using Smaller Sample Sizes

Statistical Research Unit Department of Economics University of Gothenburg

Nonparametric Monitoring of Multiple Count Data

Univariate and Multivariate Surveillance Methods for Detecting Increases in Incidence Rates

Run sum control charts for the monitoring of process variability

CONTROL charts are widely used in production processes

CONTROL CHARTS FOR MULTIVARIATE NONLINEAR TIME SERIES

CUMULATIVE SUM CHARTS FOR HIGH YIELD PROCESSES

Weighted Likelihood Ratio Chart for Statistical Monitoring of Queueing Systems

Application and Use of Multivariate Control Charts In a BTA Deep Hole Drilling Process

A New Model-Free CuSum Procedure for Autocorrelated Processes

A COMPARISON OF PHASE I CONTROL CHARTS. University of Pretoria, South Africa 1 4

covariance nor the comparison of these charts have been well studied. We will study these in this paper. The charts for covariance surveyed in Alt and

PRODUCT yield plays a critical role in determining the

Zero-Inflated Models in Statistical Process Control

First Semester Dr. Abed Schokry SQC Chapter 9: Cumulative Sum and Exponential Weighted Moving Average Control Charts

Monitoring aggregated Poisson data with probability control limits

A Theoretically Appropriate Poisson Process Monitor

Research Article A Nonparametric Two-Sample Wald Test of Equality of Variances

Introduction to Time Series (I)

Confidence Intervals for the Process Capability Index C p Based on Confidence Intervals for Variance under Non-Normality

A nonparametric two-sample wald test of equality of variances

Multivariate T-Squared Control Chart

An Economic Alternative to the c Chart

Monitoring Expense Report Errors: Control Charts Under Independence and Dependence. Darren Williams. (Under the direction of Dr.

Directionally Sensitive Multivariate Statistical Process Control Procedures with Application to Syndromic Surveillance

Re-weighted Robust Control Charts for Individual Observations

Optimal Cusum Control Chart for Censored Reliability Data with Log-logistic Distribution

Statistical Process Control for Multivariate Categorical Processes

MS-E2112 Multivariate Statistical Analysis (5cr) Lecture 1: Introduction, Multivariate Location and Scatter

The Model Building Process Part I: Checking Model Assumptions Best Practice

The Non-Central Chi-Square Chart with Double Sampling

APPLICATION OF Q CHARTS FOR SHORT-RUN AUTOCORRELATED DATA

A process capability index for discrete processes

Monitoring autocorrelated processes using a distribution-free tabular CUSUM chart with automated variance estimation

Supplementary Material for Wang and Serfling paper

The F distribution and its relationship to the chi squared and t distributions

Continuous Univariate Distributions

Diskussionsbeiträge des Fachgebietes Unternehmensforschung

Transcription:

The Robustness of the Multivariate EWMA Control Chart Zachary G. Stoumbos, Rutgers University, and Joe H. Sullivan, Mississippi State University Joe H. Sullivan, MSU, MS 39762 Key Words: Elliptically symmetric, skewed, statistical process control. Control charts are graphical tools widely used to monitor manufacturing processes to quickly detect any change in a process that may result in a change in product quality. The statistic plotted on a control chart is based on samples of n 1 observations (rational subgroups) that may be taken at regular sampling intervals. However, there are numerous practical applications using individual observations (n = 1), as in many chemical and process industries or when the rate of production is slow. See, for example, Montgomery (1997, pp. 221-222) and Ryan (2000, p. 133) for many applications using individual observations. The standard approach to control charts has been univariate, with the most frequently investigated charts being those designed to monitor the mean µ of a single normally distributed process variable X. However, with rapidly evolving data-acquisition technology, it is now common practice to simultaneously monitor several, usually correlated, quality variables. Using separate univariate charts does not account for correlation between variables, so a multivariate control chart is more suitable. Two multivariate control charts that have received attention in the statistical process control (SPC) literature are the Shewhart-type χ 2 chart, originating in the work of Hotelling (1947), and the multivariate exponentially weighted moving average (MEWMA) chart, proposed much more recently by Lowry et al. (1992). Both of these charts were originally developed for the problem of monitoring the mean vector, say µ, of a continuous multivariate process variable (vector), say x. The χ 2 chart can be relatively easily applied and is helpful in identifying large shifts in µ, but is ineffective at detecting small and moderate-sized shifts. Unlike the χ 2 chart, the MEWMA chart accumulates information from past observations, making it more sensitive in detecting small, sustained shifts. The survey articles of Alt and Smith (1988), Wierda (1994), Lowry and Montgomery (1995), and Mason et al. (1997) give good discussions on multivariate control charts and extensive lists of references. When monitoring the mean vector, control chart design is usually predicated on the approximate multivariate normality (multinormality) of process data. Nonnormality generally is not a major concern with large subgroups, because the central limit theorem applies and the sample mean vector x is approximately multinormal, for any practical distribution of the individual observations. However, with small samples quite common in SPC practices from a non-normal population, x may be far from multinormal. The problem is most severe with the smallest sized subgroups, individual observations. The univariate non-normality problem was recently studied by Stoumbos and Reynolds (2000), Reynolds and Stoumbos (2001), and Borror, Montgomery, and Runger (1999) with the conclusion that non-normality can seriously degrade the performance of the Shewhart X chart, but the EWMA and cumulative sum (CUSUM) charts can be designed to be robust. The more general multivariate problem has not been investigated. We investigate the performance of the MEWMA chart and of Hotelling s χ 2 chart for multivariate nonnormal observations. We show that the in-control (IC) performance of the individuals χ 2 chart suffers from an excessive number of false alarms with many types of non-normality. This greatly limits the χ 2 chart s practical usefulness because of excessive process disruptions and adjustments, loss of confidence in the control procedure, and, ultimately, lost productivity. We recommend the MEWMA chart as a robust and effective alternative to the individuals χ 2 chart. We show that an individuals MEWMA control chart can be designed to have very nearly the designed IC performance regardless of the distribution of the individual observations. The same design also provides excellent out-of-control (OOC) performance across a wide range of process shifts regardless of the distribution of the process data. Note that while the MEWMA chart offers excellent performance in detecting sustained shifts in the mean vector, if designed for robustness it may not necessarily detect all isolated outlying observations. Data with outliers can be regarded as having a non-normal distribution in the sense that the observations can be modeled as a mixture of observations from two (or more) distributions or, alternatively, from a distribution with heavy tails. For example both the multivariate gamma and multivariate t distributions, for some parameters, have an increased probability for observations that could be regarded as outliers. Thus, in designing the chart to be robust against all non-normal distributions, one natural consequence is some reduction in sensitivity to process outliers. The MEWMA chart and Hotelling s χ 2 chart are presented in the next section, and then multivariate generalizations of the univariate t and gamma distributions are discussed. The MEWMA Chart and Hotelling s χ 2 Chart Let x 1, x 2, be a sequence of p-variate process ob- 3362

servations, with mean vectors µ 1 = µ 0 + δ 1, µ 2 = µ 0 + δ 2,, respectively, an IC mean vector µ 0, and constant covariance matrix Σ X. We will study the case in which there is a single, sustained shift from µ 0, so that δ i = 0 for i T, and δ i = δ (δ 0) for all subsequent observations; i.e., T is the unknown, random time after which the process mean vector shifts from µ 0 to µ 0 + δ. The IC parameters µ 0 and Σ X are assumed known, estimated with negligible error from a sufficiently large sample, but T and δ are unknown. The magnitude or size of the T 1 shift δ we consider is λ = δ Σ X δ. In the context of multivariate normality, λ is called the noncentrality parameter. Define the vector-accumulations of the observations as z i = r( xi µ 0 ) + (1 r) z i 1, for i = 1, 2, K, where 0 < r 1 is the smoothing parameter, and the starting value is usually chosen to be z 0 = 0. Then, the control 2 T 1 statistic of the MEWMA chart is Ti = z i Σ Z z i, for i = 1, 2,, where Σ Z is the asymptotic covariance matrix 2i r(1 (1 r) ) r of z i, Σ Z = lim i Σ X = Σ X. 2 r 2 r The MEWMA control chart signals at the first sampling time i for which T i 2 > h, where h > 0 denotes the chart s upper control limit (UCL), usually determined so that the MEWMA chart achieves a desired IC average run length (ARL). There is no lower control limit. The ARL is the expected number of sample subgroups (which may be individual observations) taken until the chart generates a signal. For r = 1, the MEWMA chart reduces to Hotelling s χ 2 chart. A large value of r renders the MEWMA chart effective at detecting large shifts. Small and moderate sustained shifts are detected much more effectively with a chart having a small r, but a small r may delay slightly the detection of very large shifts. These issues and the choice of a value for r are discussed later. Before evaluating the chart performance, we discuss possible generalizations of the univariate t and gamma distributions to multivariate process observations. The Multivariate t and Gamma Distributions Following the univariate analyses of Stoumbos and Reynolds (2000), Reynolds and Stoumbos (2001), and Borror, Montgomery, and Runger (1999), we considered distributions that were either elliptically symmetrical, such as the multinormal and the multivariate t distribution, or skewed such as the multivariate gamma distribution. First, consider the multivariate generalization of the univariate t distribution, which is the ratio of a standard-normal random variable, divided by the square root of an independent chi-squared random variable, in turn, divided by its degrees of freedom. One generalization would be simply a vector of univariate t distributions. However, more commonly, a single chisquared random variable is used to divide a vector of standard-normal random variables, the definition we used. This latter distribution is called the multivariate t with common denominator by Johnson and Kotz (1972, p. 134), who say it is often called the general multivariate t distribution. If x has a standardized p- variate normal distribution (with zero mean vector) and covariance matrix R, which is also the correlation matrix since the elements are individually standardized, and S 2 has an independent chi-squared distribution with ν degrees of freedom, then y =µ + x S 2 / ν has a p- variate t distribution with ν degrees of freedom, and with E (y) = µ and Var(y) = ν R/(ν 2). This p-variate t distribution will be denoted here by t p (ν ). With an elliptically symmetrical distribution, the statistical performance of the MEWMA chart depends on the shift and the covariance matrix only through the noncentrality parameter (see Sullivan and Stoumbos (2002)). There is less agreement in the literature on a specific multivariate generalization of the gamma distribution, with a number of possible alternative definitions. Often, the multivariate gamma distribution is given as one-half the diagonal elements of a matrix having a Wishart distribution, the distribution we used. A symmetric matrix has a Wishart distribution with n degrees of freedom and scale matrix Σ if it can be written as X T X, where X is a data matrix of n observations from the multivariate normal distribution with zero mean vector and covariance matrix Σ. Note that Σ is the covariance matrix of the implicit multinormal distribution and is not the covariance matrix of the actual multivariate gamma observations. This distribution is described by Krishnaiah (1985) as a (central or noncentral) multivariate gamma distribution with n/2 as the shape parameter and Σ as the covariance matrix of the accompanying multivariate normal. The p-variate gamma distribution with shape parameter α and scale parameter β will be denoted here by Gam p (α, β ). We take β = 1 with no loss of generality. The statistical performance with multivariate gamma observations is not fully specified by λ (see Sullivan and Stoumbos (2002)). Some shifts increase the ARL. In general, we wish to evaluate a chart s performance in detecting a shift in the mean vector when the higher moments are fixed. Ideally, a control chart would be distribution-free in the sense that its performance would be independent of the distribution s shape as determined by the higher moments. The second moment has no effect on the chart performance, provided it is known accurately from a sufficiently large number of IC data points. Our study concerns how the distribu- 3363

tion s higher moments, or more generally, its characteristic function, affect the chart s ability to detect a shift in the mean vector. We elected to study the multivariate t distribution, which differs from the multinormal in the fourth and higher moments, and the multivariate gamma distribution, because it differs in the third and higher moments. Some distributions, such as the gamma with shape parameter of 1, depart from normality to a great extent. Our approach has been to study the performance of nearly normal distributions as well as those that differ greatly with respect to the third or fourth moments (small shape parameter or small degrees of freedom). Thus, we considered Gam p (α,1) observations having shape parameter values of 1 (very large skewness and kurtosis), 4, 16, 64, 256, and 1024 (1024 being nearly normal). For the t p (ν ) observations we considered degrees of freedom values 3 (kurtosis very different from normal), 4, 6, 10, 20, 40, 100, and 1000 (1000 being nearly normal). With degrees of freedom below 3, the second moment is undefined. These parameter values give a comprehensive view of the performance by including pathologically non-multinormal observations, observations nearly multinormal and a generous assortment of intermediate distributions. Technically, the mean vector cannot be specified independently of the covariance matrix for the multivariate gamma distribution. The diagonal elements of a Wishart matrix have as their mean and variance, respectively, nσ 2 ii and 2nσ ii. Therefore, in generating IC observations we subtracted the mean vector from the vector of multivariate observations and generated the OOC observations by adding a shift vector. Specifically, if x i is a multivariate observation from a specified process distribution, we calculated the vector accumulation as zi = r( xi µ 0 + δi ) + (1 r) zi 1. We also wanted to study the effect of a wide range of values of r. We used 10 values that are evenly spaced / 3 logarithmically, as given by the formula r = 10 k, for k = 0, 1,, 9. In the following section, we discuss issues associated with the computation of the ARL for the MEWMA chart and for Hotelling s χ 2 chart and with the determination of their control limits. The ARL is traditionally used as the performance measure for control chart comparisons. The control limits are generally adjusted so that all charts have approximately the same IC ARL, and the control chart with the lowest OOC ARL is considered best at detecting that particular shift. Following the studies of Lowry et al. (1992), Rigdon (1995), and Prabhu and Runger (1997), among others, in order to study the robustness of the MEWMA chart, the UCL (h) was determined to give an IC ARL 200 under multivariate normality. In-Control Performance Comparisons We start with a detailed discussion of chart performance for bivariate observations, and then discuss higher dimensions using less detail, as the discussion generalizes. Table 1, given at the end of this article, provides a detailed tabulation of ARL values for bivariate MEWMA and Hotelling s χ 2 charts. The columns correspond to λ = 0, 0.05, 0.1, 0.5, 1, 1.5, 2, and 3, with λ = 0 corresponding to IC ARL values. Table 1 is organized into panels (sub-tables), each corresponding to a different distribution. On the left are gamma distributions with shape parameters α = 1 and 16. At the bottom left is the multinormal distribution. On the right are t distributions with ν = 3, 6, and 100. Within each panel are rows for the same set of values for r. For example, the top left panel corresponds to the highly skewed Gam 2 (1,1) distribution. For this distribution, as expected, the IC ARL is greatly reduced when r is large. The IC ARL of the χ 2 chart is only 31.7. Similarly, with the t 2 (3) distribution the IC ARL is only 41.3, which indicates many more false alarms than with normal observations. On the other hand, as the shape parameter increases or the degrees of freedom increase, the distributions approach normality, and the IC ARL of the MEWMA chart approaches 200, regardless of the value of r. One of the major findings of our study is that, although the IC ARL of Hotelling s χ 2 chart is highly sensitive to violations of multivariate normality, for an appropriate choice of r, the IC ARL of the MEWMA chart is robust even to highly non-normal distributions. For example, consider the row for r = 0.0464, with IC ARL values of 198.7 and 207.3 for the highly skewed Gam 2 (1,1) distribution and the extremely heavy-tailed t 2 (3) distribution, respectively. Values larger than 0.0464 show some reduction in the IC ARL for these extreme distributions, with the difference decreasing as the distribution approaches normality. For example, with r = 0.1 and Gam 2 (16,1) or t 2 (20) the IC ARL is 195.8 or 192.6, respectively, both very close to the multinormal IC ARL. However, where the process data may follow a distribution that is extremely non-normal, the value of r = 0.1 may not be sufficiently small to completely guard against an excessive number of false alarms. Thus, for bivariate data, we recommend using a value for r between 0.02 and 0.05 when there is concern about non-normality. Out-of-Control Performance Comparisons We now proceed to the OOC chart performance. The multivariate t distribution is elliptically symmetrical, and the MEWMA chart s performance depends on the shift only through λ. This issue is discussed in Lowry, et al. (1992) for the multinormal case. Without loss of generality, the shift was in the direction e 1 = (1, 0,, 0) T, with the identity covariance matrix. 3364

With non-elliptically symmetrical distributions, the shift direction matters. As shown in Figure 1, the directional sensitivity is evident with skewed univariate data, a point not noted in some previous robustness studies. For example, with and r = 1 and an IC ARL of 200 with normal data, with Gam 1 (1,1) data the IC ARL decreases to 44.9. The OOC ARL increases to an average of 69.6 for a shift of ±1, and further increases to 105.7 for a shift of ±1.5, before decreasing to 5.8 for a shift of ±2. Note that the OOC ARL can exceed even the IC ARL with normal observations. 250 200 150 100 ARL 50-3 -2-1 1 2 3 Shift Figure 1. Univariate ARL versus Shift, r = 1, Gam 1 (1,1) We averaged the ARL over all possible shift directions, using a uniform prior distribution for the direction. The ARL for a specific direction can differ considerably from the average, sometimes exceeding the IC ARL. However, the purpose of this article is not so much to chronicle poor performance with non-normal data, as it is to identify the way to consistently achieve excellent performance, so reporting the average suffices for the latter purpose. While the performance is invariant to a change in the covariance matrix of the actual multivariate gamma observations, it is not invariant to the covariance matrix of the implicit Wishart distribution. Since we are not aware of any reasonable model for the distribution of this Wishart covariance matrix, especially in higher dimensions, we did not average the results over such a distribution. Instead, we used the identity matrix for the simulations, realizing the details, but not the general conclusions, depend on the choice. In addition to IC ARL values, Table 1 gives OOC ARL values for bivariate observations. The panel for multinormal observations confirms that the MEWMA chart is much more effective than Hotelling s χ 2 chart at detecting small and moderate-sized shifts. It is well documented in the SPC literature that the MEWMA chart can be designed to give excellent performance over a range of shifts. Therefore, the main objective of our analysis is to compare the non-normal ARL values with the respective ones for the normal distribution. We now consider the choice of r to best detect shifts of various sizes in the context of various distributions. For a given distribution and shift size, there is a unique optimal r, for which the corresponding MEWMA chart has the smallest OOC ARL among all MEWMA charts with the same IC ARL. Using a non-optimal r increases the ARL, the increase depending on the size of the shift and the distribution. For the values of λ and r used in the bivariate simulations, the increases are tabulated in Table 2 for multinormal observations. For example, for a 0.05 shift, the best r (among those shown) is 0.0046, with an ARL of 177.3. Increasing r to 0.0464 increases the ARL to 182.1, a very small increase of 4.8 for that combination of r and λ. For each value of r, there is a unique λ for which the increase is worst. For example, with r = 0.046 the increase is greatest (among those shown) at 11.0 for a shift size of 0.1. In practice, the exact shift size is seldom known in advance. Instead, a range of shift sizes should be effectively detected, and often the range is not bounded above. From this perspective, the r = 0.1 chart detects all shifts of 0.5 or greater with the minor increase in ARL of no more than 1.5. At 0.046, the upper end of our recommended range for r, the maximum increase is only about 1.8. Thus, although this value is smaller than typically recommended, the inertia problem only amounts to at most 1.8 observations, which seems very small. At 0.022, the lower end of our recommended range, the maximum increase is merely about 3.2. Since the chart is robust to non-normality, the same results generally apply regardless of the distribution of the observations. This is the basis for our recommendation, namely that performance is very near optimal regardless of (1) the distribution of the data, (2) the size of the subgroup, or (3) the magnitude of the shift. Table 2. Out-of-Control ARL Increase Shift Magnitude λ r 0.05 0.10 0.25 0.50 1.00 1.50 2.00 3.00 1.000 22.2 59.7 108.7 89.1 32.0 10.3 3.3 0.2 0.464 18.8 50.2 70.7 33.3 4.6 0.6 0 0 0.215 15.3 37.5 37.6 9.8 0.2 0 0.2 0.4 0.100 10.2 23.3 14.9 1.4 0 0.6 0.8 0.9 0.046 4.8 11.0 3.2 0 1.2 1.8 1.8 1.6 0.022 1.4 2.7 0 1.7 3.1 3.2 3.0 2.5 0.010 0.2 0 1.5 4.5 5.2 4.7 4.1 3.3 0.005 0 0.4 4.3 7.2 6.9 6.0 5.1 3.9 0.002 0.6 1.8 6.6 9.2 8.2 6.9 5.8 4.4 0.001 0.8 2.8 8.1 10.4 8.9 7.4 6.2 4.7 A major conclusion is that a suitable choice of r gives robust OOC as well as robust IC performance. Consider the values for r = 0.0464. For shifts of any size, the ARL is very nearly the same regardless of the distribution, depending only on the size of the shift. Even with extremely non-normal observations, this value of the smoothing parameter gives nearly the same performance regardless of the third or fourth moment of the distribution, distribution-free performance in a 3365

sense. Thus, we recommend using a value for r between 0.02 and 0.05 for a bivariate MEWMA chart, where there is concern about non-multinormal process data. The univariate case and some higher dimensional cases are discussed in the following section. Other Dimensions We also conducted simulations for various other dimensions besides the bivariate case, although space does not allow tabulation of these results. In the univariate case, for r = 1, the decrease in IC ARL with t 1 (3) data is only 70%, compared to 79% for t 2 (3) data. On the other hand, with λ = 1, the OOC ARL increases by 19% compared to a 29% decrease for t 2 (3) data. With ten-dimensional data from the highly skewed Gam 10 (1,1) distribution, the IC ARL decreases by 19% for r = 0.46, the upper end of the range we recommend for bivariate data. For r = 0.22, the decrease reduces to only 3.5%, and becomes negligible a smaller r. With t 10 (3) data, the changes in the IC ARL are, respectively, a 22% decrease and an increase of merely 0.9%. Thus, our recommended range of r between 0.02 and 0.05 offers robust performance to the MEWMA control chart up to p = 10, even with extremely non-multinormal process data. This is completely unlike Hotelling s χ 2 chart, which generates an excessive number of false alarms with non-multinormal data of all dimensions. Concluding Remarks This article has demonstrated that Hotelling s χ 2 chart is not effective at detecting small and moderatesized shifts in the mean vector of a multivariate quality variable x. Moreover, designing the individuals χ 2 chart under the assumption of multivariate normality, when the process observations actually follow a multivariate t or gamma distribution, results in a much smaller in-control ARL, which leads to a much larger number of false alarms than anticipated. The out-ofcontrol performance also deteriorates, with the ARL increasing compared with that of the same sized shift and multinormal observations. For practical applications, the MEWMA control chart offers much better performance than the χ 2 chart. Furthermore, unlike the χ 2 chart, the MEWMA chart that is based on the asymptotic covariance matrix can be designed to be robust to non-normality for both incontrol and out-of-control data across a wide range of shifts. Although in most cases, a smoothing parameter of r between 0.02 and 0.05 gives robust performance, for applications in excess of five dimensions, a value for r of 0.02 or smaller will ensure performance comparable to that with multinormal data, even for highly skewed and extremely heavy-tailed non-normal distributions. Furthermore, since the use of individual observations represents the case most sensitive to nonnormality, by extension, the MEWMA chart can be made to be robust for subgroups of any size. The need for robust, distribution-free multivariate SPC has been noted in a number of recent articles (see, e.g., Woodall and Montgomery (1999), Stoumbos et al. (2000), and references therein). In particular, Coleman (1997) noted that he would never believe the multivariate normal assumption for industrial data, and that even if he wanted to, he cannot believe that there are tests for multivariate normality with sufficient power for practical sample sizes that he would even bother to use them. He concluded that, distribution-free multivariate SPC is what we need. The performance of an appropriately designed MEWMA control chart comes very close to being distributionfree. In many practical applications, the MEWMA chart would be more appealing than multivariate nonparametric control charts, which are less powerful, more computationally intensive, and generally do not apply to skewed distributions (see, e.g., Stoumbos and Jones (2000) and Stoumbos et al. (2000)). Acknowledgement Zachary G. Stoumbos work was funded in part by the Law School Admission Council (LSAC) and by 2000 and 2001 Rutgers Faculty of Management Research Fellowships. The opinions and conclusions contained in this publication are those of the authors and do not necessarily reflect the position or policy of LSAC. References Alt, F.B. and Smith, N.D. (1988), Multivariate Process Control, in Handbook of Statistics, Vol. 7, Eds. P. R. Krishnaiah and C. R. Rao, Amsterdam: Elsevier Science. Borror, C. M., Montgomery, D. C., and Runger, G. C. (1999), Robustness of the EWMA Control Chart to Nonnormality, Journal of Quality Technology, 31, 309-316. Coleman, D. E. (1997), Individual Contributions, in A Discussion on Statistically-Based Process Monitoring and Control, Eds. D. C. Montgomery and W. H. Woodall, Journal of Quality Technology, 29, 148-149. Hotelling H. (1947), Multivariate Quality Control, in Techniques of Statistical Analysis, Eds. C. Eisenhart, M. W. Hastay, and W. A. Wallis, New York: McGraw-Hill. Johnson, N. L., and Kotz, S. (1972), Distributions in Statistics, Continuous Multivariate Distributions, Vol. 4, New York: Wiley. Kotz, S., and Johnson, N. L. (1985), Encyclopedia of Statistical Sciences, New York: Wiley. Krishnaiah, P. R. (1985), Multivariate Gamma Distributions, in Encyclopedia of Statistical Sciences, Eds. N. L. Johnson and S. Kotz, New York: Wiley. Lowry, C. A. and Montgomery D. C. (1995), A Review of Multivariate Control Charts IIE Transactions, 27, 800-810. Lowry, C. A, Woodall, W. H., Champ, C. W., and Rigdon, S. E. (1992), A Multivariate Exponentially 3366

Weighted Moving Average Control Chart, Technometrics, 34, 46-53. Mason, R. L., Champ, C. W., Tracy, N. D., Wierda, S. J., and Young J. C. (1997), Assessment of Multivariate Process Control Techniques, Journal of Quality Technology, 29, 140-143. Montgomery, D. C. (1997), Introduction to Statistical Quality Control (3rd ed.), New York: Wiley. Reynolds, M. R., Jr., and Stoumbos, Z. G. (2002), Control Charts and the Efficient Allocation of Sampling Resources, preprint. Prabhu, S. S., and Runger, G. C. (1997), Designing a Multivariate EWMA Control Chart, Journal of Quality Technology, 29, 8-15. Rigdon, S. E. (1995), An Integral Equation for the In- Control Average Run Length of a Multivariate Exponentially Weighted Moving Average Control Chart, Journal of Statistical Computation and Simulation, 52, 351-365. Ryan, T. P. (2000), Statistical Methods for Quality Improvement (2nd ed.), New York: Wiley. Stoumbos, Z. G., and Jones, L. A. (2000), On the Properties and Design of Individuals Control Charts Based on Simplicial Depth, Nonlinear Studies, 7, 147-178. Table 1. ARL Values for Bivariate MEWMA and Hotelling s χ 2 (r = 1) Charts Stoumbos, Z. G., Jones, L. A., Woodall, W. H., and Reynolds, M. R., Jr. (2000), On Shewhart-Type Nonparametric Multivariate Control Charts Based on Data Depth, in Frontiers in Statistical Quality Control 6, eds. H.-J. Lenz and P.-Th. Wilrich, Heidelberg, Germany: Springer-Verlag, pp. 207-227. Stoumbos, Z. G., and Reynolds, M. R., Jr. (2000), Robustness to Non-normality and Autocorrelation of Individuals Control Charts, Journal of Statistical Computation and Simulation, 66, 145-187. Stoumbos, Z. G., Reynolds, M. R., Jr., Ryan, T. P., and Woodall W. H. (2000), The State of Statistical Process Control as We Proceed into the 21st Century, Journal of the American Statistical Association, 95, 992-998. Stoumbos, Z. G., and Sullivan, J. H. (2002), Robustness to Non-normality of the Multivariate EWMA Control Chart, Journal of Quality Technology, 34, 260-276. Wierda, S. J. (1994), Multivariate Statistical Process Control Recent Results and Directions for Future Research, Statistica Neerlandica, 48, 147-168. Woodall W. H., and Montgomery, D. C. (1999), Research Issues and Ideas in Statistical Process Control, Journal of Quality Technology, 31, 376-386. Shift Magnitude λ Shift Magnitude λ r 0.00 0.05 0.10 0.50 1.00 1.50 2.00 3.00 r 0.00 0.05 0.10 0.50 1.00 1.50 2.00 3.00 Gam 2(1,1) t 2(3) 1.000 31.7 31.8 31.7 30.9 28.7 25.0 14.8 2.3 1.000 41.3 41.3 41.2 38.0 29.7 19.6 10.9 2.4 0.464 47.5 47.5 47.3 43.6 19.3 6.6 3.7 2.0 0.464 57.2 57.0 56.4 42.7 19.2 7.4 3.7 2.0 0.215 87.5 87.3 86.6 41.3 10.6 5.5 3.7 2.4 0.215 96.1 95.3 92.7 42.3 11.5 5.6 3.7 2.3 0.100 150.0 147.5 139.1 28.7 10.2 6.1 4.4 2.9 0.100 153.4 149.2 137.1 31.9 10.4 6.1 4.4 2.9 0.046 198.7 183.6 149.5 26.9 11.4 7.3 5.4 3.6 0.046 207.3 193.8 161.0 27.9 11.4 7.2 5.4 3.6 0.022 207.7 184.8 141.1 28.4 13.3 8.7 6.5 4.4 0.022 231.9 207.0 156.2 28.8 13.3 8.7 6.5 4.4 0.010 205.4 180.8 137.8 31.2 15.3 10.2 7.7 5.2 0.010 233.0 201.9 148.0 31.4 15.3 10.2 7.7 5.2 0.005 203.8 180.2 137.3 34.0 17.1 11.4 8.7 5.9 0.005 227.5 196.0 144.6 34.0 17.1 11.4 8.7 5.9 0.002 202.7 179.6 138.4 35.8 18.3 12.3 9.4 6.4 0.002 222.6 192.6 144.0 36.0 18.3 12.3 9.4 6.4 0.001 201.9 179.6 139.5 37.1 19.0 12.9 9.8 6.7 0.001 219.0 191.8 144.2 37.1 19.0 12.9 9.8 6.7 Gam 2(16,1) t 2(6) 1.000 105.4 105.3 105.0 94.5 49.1 17.3 7.1 2.2 1.000 54.8 54.7 54.5 46.3 29.5 15.8 7.8 2.2 0.464 139.6 138.4 135.9 62.6 15.0 6.1 3.6 2.0 0.464 77.6 77.0 75.4 44.1 15.4 6.3 3.6 2.0 0.215 177.8 173.3 160.7 36.6 10.3 5.5 3.7 2.4 0.215 123.7 121.2 114.6 36.0 10.5 5.5 3.7 2.4 0.100 195.8 185.2 158.2 28.0 10.1 6.1 4.4 2.9 0.100 167.9 160.7 141.2 28.5 10.2 6.1 4.4 2.9 0.046 200.3 183.3 146.7 26.6 11.4 7.3 5.4 3.6 0.046 192.2 178.2 144.6 26.8 11.4 7.2 5.4 3.6 0.022 201.1 180.1 138.7 28.3 13.2 8.7 6.5 4.4 0.022 200.5 180.2 139.5 28.3 13.2 8.7 6.5 4.4 0.010 200.8 177.6 136.1 31.1 15.3 10.2 7.7 5.2 0.010 202.2 179.7 136.9 31.1 15.3 10.2 7.7 5.2 0.005 200.4 176.8 136.0 33.9 17.1 11.5 8.7 5.9 0.005 202.1 178.8 137.1 33.8 17.0 11.4 8.7 5.9 0.002 200.2 177.6 137.5 35.8 18.3 12.4 9.4 6.4 0.002 201.9 179.4 137.9 35.8 18.3 12.3 9.4 6.4 0.001 200.0 178.3 138.6 37.0 19.0 12.9 9.8 6.7 0.001 201.9 179.4 138.9 37.0 19.0 12.9 9.8 6.7 Bivariate Normal t 2(100) 1.000 200.2 199.4 195.3 115.7 42.1 15.7 6.9 2.2 1.000 176.3 175.2 172.2 107.0 40.8 15.7 6.9 2.2 0.464 199.7 196.1 185.7 59.9 14.7 6.1 3.6 2.0 0.464 187.7 184.5 174.7 58.7 14.7 6.1 3.6 2.0 0.215 200.1 192.6 173.1 36.4 10.3 5.5 3.7 2.4 0.215 195.7 188.8 170.1 36.1 10.2 5.4 3.7 2.4 0.100 199.9 187.5 158.9 28.0 10.1 6.1 4.4 2.9 0.100 199.1 186.7 158.5 27.9 10.1 6.1 4.4 2.9 0.046 199.4 182.1 146.5 26.6 11.3 7.3 5.4 3.6 0.046 199.7 182.7 146.4 26.5 11.3 7.2 5.4 3.6 0.022 199.3 178.6 138.3 28.3 13.2 8.7 6.5 4.4 0.022 200.4 179.6 138.3 28.2 13.2 8.7 6.5 4.4 0.010 199.6 177.4 135.5 31.1 15.3 10.2 7.7 5.2 0.010 200.7 177.4 135.7 31.1 15.3 10.2 7.7 5.2 0.005 199.9 177.3 135.9 33.8 17.0 11.5 8.7 5.9 0.005 200.8 177.3 136.2 33.8 17.0 11.4 8.7 5.9 0.002 200.0 177.8 137.3 35.8 18.3 12.4 9.4 6.4 0.002 200.4 177.1 137.5 35.8 18.3 12.3 9.4 6.4 0.001 199.9 178.0 138.4 37.0 19.0 12.9 9.8 6.7 0.001 200.6 178.3 138.4 36.9 19.0 12.9 9.8 6.7 3367