0 0'0 2S ~~ Employment category

Size: px

Start display at page:

Download "0 0'0 2S ~~ Employment category"

Daniella Octavia Andrews
5 years ago
Views:

1 Analyze Phase O~----, ,------,,------, ,----- N = ' V~ 00 0' i-.~ fl' ~G ~~ ~O~ ()0 -S 0 -S ~~ 0 ~~ 0 ~G d> ~0~ ~0 0 0'0 2S ~~ (j «l FIGURE 10.7 Boxplots of salary by job category. Employment category Boxplots are particularly useful for comparing the distribution of values in several groups. Figure 10.7 shows boxplots for the salaries for several different job titles. The boxplot makes it easy to see the different properties of the distributions. The location, variability, and shapes of the distributions are obvious at a glance. This ease of interpretation is something that statistics alone cannot provide. Statistical Inference This section discusses the basic concept of statistical inference. The reader should also consult the glossary in the Appendix for additional information. Inferential statistics belong to the enumerative class of statistical methods. All statements made in this section are valid only for stable processes, that is, processes in statistical control. Although most applications of Six Sigma are analytic, there are times when enumerative statistics prove useful. The term inference is defined as (1) the act or process of deriving logical conclusions from premises known or assumed to be true, or (2) the act of reasoning from factual knowledge or evidence. Inferential statistics provide information that is used in the process of inference. As can be seen from the definitions, inference involves two domains: the premises and the evidence or factual knowledge. Additionally, there are two conceptual frameworks for addressing premises questions in inference: the design-based approach and the model-based approach. As discussed by Koch and Gillings (1983), a statistical analysis whose only assumptions are random selection of units or random allocation of units to experimental conditions results in design-based inferences; or, equivalently, randomization-based inferences. The objective is to structure sampling such that the sampled population has the same

332 C hap te r Ten characteristics as the target population. If this is accomplished then inferences from the sample are said to have internal validity.

2 332 C hap te r Ten characteristics as the target population. If this is accomplished then inferences from the sample are said to have internal validity. A limitation on design-based inferences for experimental studies is that formal conclusions are restricted to the finite population of subjects that actually received treatment, that is, they lack external validity. However, if sites and subjects are selected at random from larger eligible sets, then models with random effects provide one possible way of addressing both internal and external validity considerations. One important consideration for external validity is that the sample coverage includes all relevant subpopulations; another is that treatment differences be homogeneous across subpopulations. A common application of design-based inference is the survey. Alternatively, if assumptions external to the study design are required to extend inferences to the target population, then statistical analyses based on postulated probability distributional forms (e.g., binomial, normal, etc.) or other stochastic processes yield model-based inferences. A focus of distinction between design-based and modelbased studies is the population to which the results are generalized rather than the nature of the statistical methods applied. When using a model-based approach, external validity requires substantive justification for the model's assumptions, as well as statistical evaluation of the assumptions. Statistical inference is used to provide probabilistic statements regarding a scientific inference. Science attempts to provide answers to basic questions, such as can this machine meet our requirements? Is the quality of this lot within the terms of our contract? Does the new method of processing produce better results than the old? These questions are answered by conducting an experiment, which produces data. If the data vary, then statistical inference is necessary to interpret the answers to the questions posed. A statistical model is developed to describe the probabilistic structure relating the observed data to the quantity of interest (the parameters), that is, a scientific hypothesis is formulated. Rules are applied to the data and the scientific hypothesis is either rejected or not. In formal tests of a hypothesis, there are usually two mutually exclusive and exhaustive hypotheses formulated: a null hypothesis and an alternate hypothesis. Chi-Square, Student's T, and F Distributions In addition to the distributions present earlier in the Measure phase, these three distributions are used in Six Sigma to test hypotheses, construct confidence intervals, and compute control limits. Chi-Square Many characteristics encountered in Six Sigma have normal or approximately normal distributions. It can be shown that in these instances the distribution of sample variances has the form (except for a constant) of a chi-square distribution, symbolized X2. Tables have been constructed giving abscissa values for selected ordinates of the cumulative X2 distribution. One such table is given in Appendix 4. The X2 distribution varies with the quantity u, which for our purposes is equal to the sample size minus 1. For each value of u there is a different X2 distribution. Equation (10.3) gives the pdf for the X2. (10.3)

3 336 Chapter Ten F(2,2) F F FIGURE F distributions. denominator. Appendix 5 and 6 provide values for the 1 and 5% percentage points for the F distribution. The percentages refer to the areas to the right of the values given in the tables. Figure illustrates two F distributions. Point and Interval Estimation So far, we have introduced a number of important statistics including the sample mean, the sample standard deviation, and the sample variance. These sample statistics are called point estimators because they are single values used to represent population parameters. It is also possible to construct an interval about the statistics that has a predetermined probability of including the true population parameter. This interval is called a confidence interval. Interval estimation is an alternative to point estimation that gives us a better idea of the magnitude of the sampling error. Confidence intervals can be either one-sided or two-sided. A one-sided or confidence interval places an upper or lower bound on the value of a parameter with a specified level of confidence. A twosided confidence interval places both upper and lower bounds. In almost all practical applications of enumerative statistics, including Six Sigma applications, we make inferences about populations based on data from samples. In this chapter, we have talked about sample averages and standard deviations; we have even used these numbers to make statements about future performance, such as long term

4 Analyze Phase 337 yields or potential failures. A problem arises that is of considerable practical importance: any estimate that is based on a sample has some amount of sampling error. This is true even though the sample estimates are the "best estimates" in the sense that they are (usually) unbiased estimators of the population parameters. Estimates of the Mean For random samples with replacement, the sampling distribution of X has a mean /.1!..nd a standard deviation equal to (J/.};;. For large samples the sampling distribution of X is approximately normal and normal tables can be used to find the probability that a sample mean will be within a given distance of /.1. For example, in 95% of the samples we will observe..e mean within t..1.96(j/.};; of /.1. In other words, in 95% of the samples the interval from X -1.96(J/.};; to X (J/.};; will include /.1. This interval is called a "95% confidence interval for estimating /.1." It is usually shown using inequality symbols: X -1.96(J/.};; < /.1X (J/.};; The factor 1.96 is the Z value obtained from the normal in the Appendix 2. It corresponds to the Z value beyond which 2.5% of the population lie. Since the normal distribution is symmetric, 2.5% of the distribution lies above Z and 2.5% below -Z. The notation commonly used to denote Z values for confidence interval construction or hypothesis testing is Za/ z where 100(1 - a) is the desired confidence level in percent. For example, if we want 95% confidence, a = 0:05,100(1- a) = 95%, and ZO.025 = In hypothesis testing the value of a is known as the significance level. Example: Estimating Jl When 0' Is Known Supp~e that cr is known to be 2.8. Assume that we collect a sample of n = 16 and compute X = Using the e equation mentioned in previous section we find the 95% confidence interval for /.1 as follows: X-1.96cr/.};; < /.1 < X cr/.};; (2.8/.Ji6) < /.1 < (2.8/.Ji6) < /.1 < There is a 95% level of confidence associated with this interval. The numbers and are sometimes referred to as the confidence limits. Note that this is a two-sided confidence interval. There is a 2.5% probability that is lower than /.1 and a 2.5% probability that is greater than /.1. If we were only interested in, say, the probability that /.1 were greater than 14.33, then the onesided confidence interval would be /.1 > and the one-sided confidence level would be 97.5%. Example of Using Microsoft Excel to Calculate the Confidence Interval for the Mean When Sigma Is Known Microsoft Excel has a built-in capability to calculate confidence intervals for the mean. The dialog box in Fig shows the input. The formula result near the bottom of

5 338 C hap te r Ten ONFIDENCE Alpha 1.05 ~ ;;;; 0.05 Standard_de v 1'""12-.. s ,!)"...;;;; 2.8 Size 116 ~ = 10 = RelJJrns the confidence interval fur a popularon meajl. See Help fur the equation used. Size Is the sample Si28. Formula result = OK cancel FIGURE Example of finding the confidence interval when sigma is known using Microsoft Excel. the screen gives the interval width as To find the lower confidence limit subtract the width from the mean. To find the upper confidence limit add the width to the mean. Example: Estimating Jl When 0' Is Unknown When cr is not known and we wish to replace cr with s in calculating confidence intervals for /-l, we must replace Z a/ 2 with t a/2 and obtain the percentiles from tables for student's t distribution instead of the normal tables. Let's revisit the example above and assume that instead of knowing cr, it was estij!lcl.ted from the sample, that is, based on the sample of n = 16, we computed s = 2.8 and X = Then the 95% confidence interval becomes: x s/J;; < /-l < X s/J;; (2.8/Ji6) < /-l < (2.8/Ji6) < /-l < It can be seen that this interval is wider than the one obtained for known cr. The t a/ 2 value found for 15 df is (see Table 3 in the Appendix), which is greater than Z a/2 = 1.96 above. Example of Using Microsoft Excel to Calculate the Confidence Interval for the Mean When Sigma Is Unknown Microsoft Excel has no built-in capability to calculate confidence intervals for the mean when sigma is not known. However, it does have the ability to calculate t-values when given probabilities and degrees of freedom. This information can be entered into an equation and used to find the desired confidence limits. Figure illustrates the approach. The formula bar shows the formula for the 95% upper confidence limit for the mean in cell B7.

6 A n a I y z e P has e 339 B7 = =$B$:1 + TINV($8$4,$8$3-1)* \--..., J A $8$2/SQRT( $8$3) 1 Mean 2 sigma 3 n 4 Alpha Lower Confi dence 6 Limit Upper Confi dence 7 Limit FIGURE Example of finding the confidence interval when sigma is unknown using Microsoft Excel. Hypothesis Testing Statistical inference generally involves four steps: 1. Formulating a hypothesis about the population or "state of nature" 2. Collecting a sample of observations from the population 3. Calculating statistics based on the sample 4. Either accepting or rejecting the hypothesis based on a predetermined acceptance criterion There are two types of error associated with statistical inference: Type I error (a error)-the probability that a hypothesis that is actually true will be rejected. The value of a is known as the significance level of the test. Type II error (~ error)-the probability that a hypothesis that is actually false will be accepted. Type II errors are often plotted in what is known as an operating characteristics curve. Confidence intervals are usually constructed as part of a statistical test of hypotheses. The hypothesis test is designed to help us make an inference about the true population value at a desired level of confidence. We will look at a few examples of how hypothesis testing can be used in Six Sigma applications. Example: Hypothesis Test of Sample Mean Experiment: The nominal specification for filling a bottle with a test chemical is 30 cc. The plan is to draw a sample of n = 25 units from a stable process and, using the sample mean and standard deviation, construct a two-sided confidence interval (an interval that extends on either side of the sample average) that has a 95% probability of including the true population mean. If the interval includes 30, conclude that the lot mean is 30, otherwise conclude that the lot mean is not 30.

7 340 C hap te r Ten Result: A sample of 25 bottles was measured and the following statistics computed x = 28 cc s = 6 cc The appropriate test statistic is t, given by the formula t= X-Il = =-1.67 s/$z 6/Es Table 3 in the Appendix gives values for the t statistic at various degrees of freedom. There are n -1 degrees of freedom (d ). For our example we need the t 975 column and the row for 24 df. This gives a t value of Since the absolute value of this t value is greater than our test statistic, we fail to reject the hypothesis that the lot mean is 30 cc. Using statistical notation this is shown as: Ho:1l = 30 cc (the null hypothesis) H 1 :11 is not equal to 30 cc (the alternate hypothesis) a =.05 (Type I error or level of significance) Critical region: ::S; to::s; Test statistic: t = Since t lies inside the critical region, fail to reject H o ' and accept the hypothesis that the lot mean is 30 cc for the data at hand. Example: Hypothesis Test of Two Sample Variances The variance of machine X's output, based on a sample of n = 25 taken from a stable process, is 100. Machine Y's variance, based on a sample of 10, is 50. The manufacturing representative from the supplier of machine X contends that the result is a mere "statistical fluke." Assuming that a "statistical fluke" is something that has less than 1 chance in 100, test the hypothesis that both variances are actually equal The test statistic used to test for equality of two sample variances is the F statistic, which, for this example, is given by the equation S2 100 F = s~ = 50 = 2,numerator df = 24,denominator df = 9 Using Table 5 in the Appendix for F 99 we find that for 24 df in the numerator and 9 df in the denominator F = Based on this we conclude that the manufacturer of machine X could be right, the result could be a statistical fluke. This example demonstrates the volatile nature of the sampling error of sample variances and standard deviations. Example: Hypothesis Test of a Standard Deviation Compared to a Standard Value A machine is supposed to produce parts in the range of inch plus or minus inch. Based on this, your statistician computes that the absolute worst standard deviation tolerable is inch. In looking over your capability charts you find that the best machine in the shop has a standard deviation of , based on a sample of 25 units.

Probability Methods in Civil Engineering Prof. Dr. Rajib Maity Department of Civil Engineering Indian Institution of Technology, Kharagpur

Probability Methods in Civil Engineering Prof. Dr. Rajib Maity Department of Civil Engineering Indian Institution of Technology, Kharagpur Lecture No. # 36 Sampling Distribution and Parameter Estimation