Quality Control Using Inferential Statistics in Weibull-based Reliability Analyses

Size: px
Start display at page:

Download "Quality Control Using Inferential Statistics in Weibull-based Reliability Analyses"

Transcription

1 GRAPHITE TESTING FOR NUCLEAR APPLICATIONS: THE SIGNIFICANCE OF TEST SPECIMEN VOLUME AND GEOMETRY AND THE STATISTICAL SIGNIFICANCE OF TEST SPECIMEN POPULATION 1 STP 1578, 2014 / available online at / doi: /STP Stephen F. Duffy 1 and Ankurben Parikh 2 Quality Control Using Inferential Statistics in Weibull-based Reliability Analyses Reference Duffy, Stephen F. and Parikh, Ankurben, Quality Control Using Inferential Statistics in Weibull-based Reliability Analyses, Graphite Testing for Nuclear Applications: The Significance of Test Specimen Volume and Geometry and the Statistical Significance of Test Specimen Population, STP 1578, Nassia Tzelepi and Mark Carroll, pp. 1 18, doi: / STP , ASTM International, West Conshohocken, PA ABSTRACT Design codes and fitness-for-service protocols have recognized the need to characterize the tensile strength of graphite as a random variable through the use of probability density functions. Characterizing probability density functions require more tests than typically needed to simply define an average value for tensile strength. ASTM and the needs of nuclear design codes should dovetail on this issue. The two-parameter Weibull distribution (an extreme-value distribution) is adopted for the tensile strength of this material. The failure data from bend tests or tensile tests are used to determine the Weibull modulus (m) and Weibull characteristic strength (r h ). To determine an estimate of the true Weibull distribution parameters, maximum likelihood estimators are used. The quality of the estimated parameters relative to the true distribution parameters depends fundamentally on the number of samples taken to failure. The statistical concepts of confidence intervals and hypothesis testing are presented pertaining to their use in assessing the goodness of the estimated distribution parameters. The inferential statistics tools enable the calculation of likelihood confidence rings. The concept of how the true distribution parameters lie within a likelihood ring with a specified confidence is presented. A material acceptance criterion is Manuscript received August 15, 2013; accepted for publication March 10, 2014; published online July 18, Ph.D., P.E., Cleveland State Univ., Cleveland, OH 44115, United States of America. 2 Cleveland State Univ., Cleveland, OH 44115, United States of America. 3 ASTM Symposium on Graphite Testing for Nuclear Applications: The Significance of Test Specimen Volume and Geometry and the Statistical Significance of Test Specimen Population on Sept 19 20, 2013 in Seattle, WA. Copyright VC 2014 by ASTM International, 100 Barr Harbor Drive, PO Box C700, West Conshohocken, PA

2 2 STP 1578 On Graphite Testing for Nuclear Applications defined here, and the criterion depends on establishing an acceptable probability of failure of the component under design, as well as an acceptable level of confidence associated with the estimated distribution parameter determined using failure data from a specific type of strength test. Keywords graphite, Weibull, confidence bounds, likelihood ratio rings Nomenclature H 0 ¼ null hypothesis H 1 ¼ alternative hypothesis L ¼ likelihood function L ¼ natural log of the likelihood function m ¼ Weibull modulus ~m ¼ estimated Weibull modulus n ¼ sample size P f ¼ probability of failure T ¼ test statistic a ¼ significance level b ¼ probability of a Type II error c ¼ confidence level H 0 ¼ vector of all the maximum likelihood estimator parameter estimates H c 0 ¼ vector of all point estimates that are not maximum likelihood estimator parameter estimates r h ¼ Weibull characteristic strength ~r h ¼ estimated Weibull characteristic strength Introduction This work presents the mathematical concepts behind statistical tools that, when combined properly, lead to a simple quality control program for components fabricated from graphite. The data on mechanistic strength (which is treated as a random variable) should be used to accept or reject a graphite material for a given application. The two-parameter Weibull distribution is used to characterize the tensile strength of graphite. The Weibull distribution is an extreme-value distribution, and this facet makes it the preferred distribution for characterizing a material s minimum strength. Estimates of the true Weibull distribution parameters should be determined using maximum likelihood estimators (MLEs). The quality of the estimated parameters relative to the true distribution parameters depends fundamentally on the number of samples taken to failure. The statistical concepts of confidence intervals and hypothesis testing are employed to assess quality. Quality is defined by how

3 DUFFY AND PARIKH, DOI /STP close the estimated parameters are to the true distribution parameters. The quality of the distribution parameters can have a direct effect on whether a certain grade or type of graphite material is acceptable for a given application. Both inferential statistics concepts (i.e., confidence intervals and hypothesis testing) enable the calculation of likelihood confidence rings. Work showing how the true distribution parameters lie within a likelihood ring with a specified confidence is presented here. The size of the ring has direct bearing on the quality of the estimated parameters. One must specify and associate an acceptable level of confidence with the estimated distribution parameters. This work shows how to construct likelihood ratio confidence rings that establish an acceptance region based on a given level of confidence. Material performance curves are presented that are based on an acceptable component probability of failure. Combining the two elements (i.e., the material performance curve and a likelihood ratio ring) allows the design engineer to determine whether a material is suited for the component design at hand. The result is a simple approach to a quality assurance criterion. Point Estimates and Confidence Bounds Data related to the tensile strength of graphite can be generated through the use of tensile tests outlined in ASTM C565 [1], ASTM C749 [2], ASTM C781 [3], and ASTM D7775 [4]. Bend tests are preferred for their simplicity, and flexural test procedures are outlined in ASTM C651 [5]. Given data on the tensile strength of graphite, the first step is to ascertain values for the Weibull distribution parameters using this information. MLEs, outlined in ASTM D7846 [6], are used to compute point estimates. The next question is, have we estimated parameters to the best of our ability? This is directly related to the fundamental question asked repeatedly, How many samples must be tested? The typical answer to this question seems to be about 30. However, the appropriate question to ask is, How many samples must be tested to establish a given confidence level for component reliability? The work outlined here answers this question quantitatively, utilizing interval estimates along with hypothesis testing. The methods outlined here are currently being implemented in the ASME Boiler and Pressure Vessel Code [7]. Confidence intervals are used to indicate the potential variability in estimated parameters. Every time a sample is taken from the same population, a point estimate of the distribution parameters can be calculated. Successive samples produce different point estimate values, and thus the point estimates are treated as random variables. Thus interval estimates bracketing the true distribution parameters are as necessary as point estimates. If the interval that brackets the true distribution parameter contains the estimated parameter, then the estimate is consistent with the true value. Increasing the sample size will always narrow the interval bounds and provide point estimates that approach the true distribution parameters. Interval bounds on parameter estimates represent the range of values for the distribution parameters that are both reasonable and plausible.

4 4 STP 1578 On Graphite Testing for Nuclear Applications Inferences and Hypothesis Testing As noted above, there is a need to know whether a sample is large enough that the point estimates of the distribution parameters are in the same statistical neighborhood as the true population distribution parameters. The techniques for making this kind of assessment utilize inferential statistics. The type of inference focused on here is the bounds on the true population parameters (which are never known) given a particular sample. Here a particular type of bound that is referred to as a likelihood ratio confidence bound is employed. The basic issue is this: consider an infinitely large population with a known frequency distribution but unknown distribution parameters. Because of diminished knowledge of the overall population available from a small sample taken from the infinitely large population, that sample will generate a frequency distribution that is different from that of the parent population different in the sense that the distribution parameters estimated from the small sample (a subset) will not be the same as the parameters of the parent population. The population and the sample can be characterized by the same frequency distribution, but the two will have different distribution parameters. As the sample size increases, the frequency distribution associated with the sample more closely resembles that of the parent population (i.e., the estimated parameters approach the true distribution parameters of the parent population). Hypothesis testing is used to establish whether the true distribution parameters lie close to the point estimates. Two statistical hypotheses are proposed concerning the estimated parameter values. The first, H 1, is referred to as the alternative hypothesis. The latter hypothesis, H 0, is referred to as the null hypothesis. Both hypotheses are then tested with the samples taken from the parent population. The goal of the analyst is to decide whether there is enough evidence (data) to refute the null hypothesis H 0. That decision is made based on the value of a test statistic. Here that test statistic is the ratio of two likelihood functions whose probability is known under the assumption that H 0, the null hypothesis, is true. If the test statistic takes on a value rarely encountered using the data collected, then the test statistic indicates that the null hypothesis is unlikely and H 0 is rejected. The value of the test statistic at which the rejection is made defines a rejection region. The probability that the test statistic falls into the rejection region by chance is referred to as the significance level, denoted by a. The significance level is defined as the probability of mistakenly rejecting a hypothesis when the hypothesis is valid. Rejecting Hypotheses Making a decision regarding a hypothesis is associated with a statistical event with an attending probability, so an ability to assess the probability of making incorrect decisions is required. Fisher [8] established a method for quantifying the amount of evidence required in order for an event to be deemed unlikely to occur by chance.

5 DUFFY AND PARIKH, DOI /STP He originally defined this quantity as the significance level. Significance levels are different than confidence levels, but the two are related. The significance level and the confidence level are functionally related through the following simple expression, with the confidence level denoted by c: (1) c ¼ 1 a The confidence level is associated with a range, or more specifically with bounds or an interval, within which a true population parameter resides. The confidence level and, through the equation above, the significance level are chosen a priori based on the design application at hand. Given a significance level a defined by Eq 1, a rejection region can be established. This is known as the critical region for the test statistic selected. For our purposes, the observed tensile strength data for a graphite material are used to determine whether the computed value of the test statistic associated with a hypothesis (not the parameter estimates) lies within or outside the rejection region. The amount of data helps define the size of the critical region. If the test statistic is within the rejection region, then we say the hypothesis is rejected at the 100a % significance level. If a is quite small, then the probability of rejecting the null hypothesis when it is true can be made quite small. Type I and Type II Errors Consider that for a true distribution parameter h there is a need to test the null hypothesis that h ¼ h 0 (in this context h 0 is a stipulated value) against the alternative that h = h 0 at a significance level a. Under these two hypotheses, a confidence interval can be constructed that contains the true population parameters with a probability of c ¼ (1 a). In addition, this interval also contains the value h 0. Mistakes can be made in rejecting the null hypothesis given above. In hypothesis testing, two types of errors are possible. 1. Type I error: the rejection of the null hypothesis (H 0 ) when it is true. The probability of committing a Type I error is denoted by a. 2. Type II error: the failure to reject the hypothesis (H 0 ) when the alternative hypothesis (H 1 ) is true. The probability of committing a Type II error is denoted by b. In either situation, judgment of the null hypothesis H 0 is incorrect. Now consider the situations in which correct decisions have been made. In the first case, the null hypothesis is not rejected and the null hypothesis is true. The probability of making this choice is c ¼ (1 a). This is the same probability associated with the confidence interval for the true population distribution parameters discussed. For the second situation, the probability of correctly rejecting the null hypothesis is the statistical complement of a Type II error, that is, (1 b). In statistics this is known as the power of the test of the null hypothesis. This quantity is used

6 6 STP 1578 On Graphite Testing for Nuclear Applications to determine the sample size. Maximizing the probability of making a correct decision requires high values of (1 a) and (1 b). So decision protocols must be designed so as to minimize the probability of either type of error in an optimal fashion. The probability of a Type I error is controlled by making a a suitable number, say, 1 in 10 or 1 in 20, or something smaller depending on the consequence of making a Type I error. Minimizing the probability of making a Type II error is not straightforward. b is dependent on the alternative hypothesis, on the sample size n, and on the true value of the distribution parameters tested. As discussed in the next section, the alternative hypothesis is greatly influenced by the test statistic chosen to help quantify the decision. Hence when hypothesis testing is applied to the distribution parameters, a statement of equality is made in the null hypothesis H 0. Achieving statistical significance for this is akin to accepting that the observed results (the point estimates of the distribution parameters) are plausible values if the null hypothesis is not rejected. The alternative hypothesis does not in general specify particular values for the true population parameters. However, as shown in a section that follows, the alternative hypothesis helps us establish bounds on the true distribution parameters. This is important when a confidence ring is formulated for a parameter distribution pair. The size of the ring can be enlarged or reduced based on two controllable parameters, the significance level a and the sample size n. The Likelihood Ratio Test Statistic The test statistic used to influence decision making regarding the alternative hypothesis is a ratio of the natural log of two likelihood functions. In simple terms, one likelihood function is associated with a null hypothesis, and the other is associated with an alternative hypothesis. For a probability density function with a single distribution parameter, the general approaches to testing the null and alternative hypotheses are defined, respectively, as (2) (3) H 0 : h ¼ h 0 H 1 : h ¼ h 1 Note that in the expression for the alternative hypothesis H 1, the fact that h equals h 1 implies that h is not equal to h 0, and the alternative hypothesis is consistent with the discussion in the previous section. As they are used here, the hypotheses describe two complementary notions regarding the distribution parameters, and these notions compete with each other. In this sense the hypotheses can be better described mathematically as (4) (5) H 0 : h 2 H 0 ¼ð 1 h 0 ; 2 h 0 ; :::; r h 0 Þ H 1 : h 2 H c 0 ¼ð 1 h 1 ; 2 h 1 ; :::; r h 1 Þ where r corresponds to the number of parameters in the probability density function. Conceptually h 0 and h 1 are scalar values, whereas H 0 and its complement H c 0

7 DUFFY AND PARIKH, DOI /STP are vectors of distribution parameters. A likelihood function associated with each hypothesis can be formulated, (6) L 0 ¼ Yn i¼1 f ðx i jh 2 H 0 Þ for the null hypothesis and (7) L 1 ¼ Yn i¼1 f ðx i jh 2 H c 0 Þ for the alternative function. The likelihood function L 0 associated with the null hypothesis is evaluated using the maximum likelihood parameter estimates. The sample population (i.e., graphite failure data) is assumed to be characterized by a two-parameter Weibull distribution. There are methods to test the validity of this assumption. However, the material strength is characterized by a random variable, so it makes sense to use a minimum extreme-value distribution such as the Weibull distribution. Because this is a proof-of-concept effort focused on likelihood ratio rings, goodness-of-fit tests that can be used to discriminate between alternative underlying population distributions are left to others to pursue. A vector of distribution parameters whose components are the MLE parameter estimates is identified as (8) ð 1 h 0 ; 2 h 0 Þ¼ð h ~ 1 ; h ~ 2 Þ ¼ð~m; ~r h Þ where: ~m ¼ maximum likelihood estimate of the Weibull modulus, and ~r h ¼ maximum likelihood estimate of the characteristic strength. Now (9) H 0 : h 2 H 0 ¼ð~m; ~r h Þ that is, H 0 contains MLE parameter estimates, and (10) H 1 : h 2 H c 0 with H c 0 representing a vector of point estimates that are not MLE parameter estimates. In essence, we are testing the null hypothesis that the true distribution parameters are equal to the MLE parameter estimates, with an alternative hypothesis that the true distribution parameters are not equal to the MLE parameter estimates. The likelihood functions are now expressed as (11) ~L 0 ¼ Yn i¼1 f ðx i j ~m; ~r h Þ

8 8 STP 1578 On Graphite Testing for Nuclear Applications (12) L 1 ¼ Yn f ðx i jm; r h Þ i¼1 A test statistic is introduced that is defined as the natural log of the ratio of the likelihood functions, T ¼ 2ln L 1 (13) ~L 0 The Neyman Pearson lemma [9] states that this likelihood ratio test is the most powerful test statistic available for testing the null hypothesis. We can rewrite this last expression as T ¼ 2 L ~ (14) L where (15) (16) ( ) Y L ¼ ln ~L n 0 ¼ ln f ðx i j ~m; ~r h Þ i¼1 ( ) Y ~L ¼ ln ~L n 1 ¼ ln f ðx i jm; r h Þ i¼1 The natural log of the likelihood ratio of a null hypothesis to an alternative hypothesis is our test statistic, and its distribution can be determined in the limit as the sample size approaches infinity. The test statistic is then used to form decision regions where the null hypothesis can be accepted or rejected. A convenient result attributable to Wilks [10] indicates that as the sample size n approaches infinity, the value 2 ln(t) will be asymptotically v 2 -distributed for a nested composite hypothesis. If one hypothesis can be derived as a limiting sequence of another, we say that the two hypotheses are nested. In our case the sample ( r X 1, r X 2,..., r X n ) representing the rth sample is drawn from a Weibull distribution under H 0. These same samples are used in the alternative hypothesis H 1, and because their parent distribution is assumed to be a Weibull distribution under both hypotheses, the two hypotheses are nested and conditions are satisfied for the application of Wilks s theorem. The test statistic is designed in such a way that the probability of a Type I error does not exceed a, a value that we control. Thus the probability of a Type I error is fixed, and we search for the test statistic that maximizes (1 b), where again b is the probability of a Type II error. Where inferences are being made on parameters from a population characterized by a two-parameter Weibull distribution, the degree of freedom for the v 2 distribution is one, and the values of the v 2 distribution are easily calculated. One can compute the likelihood ratio T and compare 2ln (T) to the v 2 value corresponding to a desired significance level to define a rejection region. This is outlined in the next section. The value of the ratio of the two

9 DUFFY AND PARIKH, DOI /STP likelihood functions defined above (L 1 /~L 0 ) approaches 1 in the optimal critical region (i.e., the value of the test statistic T should be small). This is a result of minimizing a and maximizing (1 b). The ratio is high in the complementary region. A high ratio corresponds to a high probability of a correct decision under H 0. The likelihood ratio test implies that the null hypothesis should be rejected if the value of the ratio is too small. How small is too small depends on the significance level of the test (i.e., on what probability of Type I error is considered tolerable). Lower values of the likelihood ratio mean that the observed result is much less likely to occur under the null hypothesis than under the alternative hypothesis. Higher values of the likelihood ratio mean that the observed outcome is more or equally likely (or nearly as likely) to occur under the null hypothesis, and the null hypothesis cannot be rejected. The likelihood ratio test and its close relationship to the v 2 test can be used to determine what sample size will provide a reasonable approximation of the true population parameters. The Likelihood Ratio Ring The likelihood ratio confidence bounds are based on the inequality T ¼ 2 L ~ L ¼ 2ln L ð m; r (17) hþ v 2 a;1 L ð~m; ~r h Þ The equality in Eq 17 can be expressed as (18) Lðm; r h Þ ¼ L ð~m; ~r h Þexp v2 a;1 2 Here, ~m and ~r h are maximum likelihood estimates of the distribution parameters based on the data obtained from a sample. These parameter estimates are random variables (they vary from sample to sample), as are the test statistic T and v 2. Equation 17 stipulates a relationship between random variables. The true distribution parameters m and r h are fixed values, but they are unknown to us unless the population is completely sampled. However, if a is designated, then a value for v 2 (i.e., a realization) is established. Once this realization is established for v 2, a realization for the test statistic T can be established through Eq 17. For a given significance level, confidence bounds m 0 and r 0 h can be computed that satisfy Eq 18 (i.e., these bounds satisfy the following expression). (19) L m 0 ; r 0 h L ð ~m; ~rh Þexp v2 a;1 ¼ 0 2 With a given value of m 0, a pair of values can be found for r 0 h. This procedure is repeated for a range of m 0 values until there are enough values to produce a smooth

10 10 STP 1578 On Graphite Testing for Nuclear Applications FIG. 1 Log likelihood frequency plot of L(m,r h ) with likelihood confidence ring and associated test statistic T. ring. These parameters map a contour ring in a plane perpendicular to the log likelihood axis (see Fig. 1). A change in the significance level results in a different-sized ring. From the geometry in Fig. 1 we can see that the true distribution parameters that are unknown to us will lie within the ring. Aspects of Likelihood Confidence Rings In order to present aspects of the likelihood confidence rings, Monte Carlo simulation is utilized to obtain test data. Using Monte Carlo simulation allows us the convenience of knowing what the true distribution parameters are for a particular dataset. Here, it is arbitrarily assumed that the Weibull modulus is 17 and the Weibull characteristic strength is 400 MPa. Figure 2 shows the likelihood confidence ring for a 90 % confidence level and a sample size of 10, along with the true distribution parameters and the estimated distribution parameters. If the true distribution parameter pair were unknown, we would be 90 % confident that the true distribution parameters were within the ring. If the Monte Carlo simulation process were continued nine more times (i.e., if we were in possession of ten simulated datasets), then on average one of those datasets would produce a likelihood confidence ring that did not contain the true distribution parameter pair. In Fig. 3 the effect of holding the sample size fixed and varying the confidence level is presented. The result is a series of nested likelihood confidence rings. Here

11 DUFFY AND PARIKH, DOI /STP FIG. 2 Confidence ring contour for a sample size of 10 (m ¼ 17, r h ¼ 400). we have one dataset and multiple rings associated with increments of the confidence level from 50 % to 95 %. Note that as the confidence level increases, the size of the likelihood confidence ring expands. For a given number of test specimens in a dataset, the area encompassed by the likelihood confidence ring expands as we become more and more confident that the true distribution parameters are contained in the ring. FIG. 3 Dependence of likelihood confidence rings on c for a sample size of 30 (m ¼ 17, r h ¼ 400).

12 12 STP 1578 On Graphite Testing for Nuclear Applications FIG. 4 Likelihood confidence rings for sample sizes ranging from 10 to 100 (m ¼ 17, r h ¼ 400). The next figure, Fig. 4, depicts the effect of varying the sample size and holding the confidence level fixed at c ¼ 90 %. The sample size was increased from n ¼ 10 to n ¼ 100. Note that all the likelihood confidence rings encompass the true distribution parameters used to generate each sample. In addition, the area within the rings grows smaller as the sample size increases. As the sample size increases, we gain information on the population and thereby reduce the region that could contain the true distribution parameters for a given level of confidence. Figure 5 depicts a sampling procedure in which the size of the sample is held fixed (i.e., n ¼ 10) and the sampling process and ring generation have been repeated 100 times. For a fixed confidence level of 90 %, one would expect that ten rings would not encompass the true distribution parameters. Indeed that is the case. The 90 likelihood confidence rings that encompassed the true distribution parameters are outlined in blue. The ten likelihood confidence rings that did not contain the distribution parameters are outlined in dark orange. Confidence Rings and Material Acceptance The material acceptance approach outlined here depends on several things. First one must have the ability to compute the probability of failure of the component under design. This probability is designated (P f ) component and is quantified using a

13 DUFFY AND PARIKH, DOI /STP FIG likelihood confidence rings. For all rings, n ¼ 10 and c ¼ 0.9 (m ¼ 17, r h ¼ 400). hazard rate format that is, the probability of failure is expressed as a fraction with a numerator of 1. The method for computing this quantity is available in the ASME Boiler and Pressure Vessel Code [7]. The component probability of failure is modeled assuming the underlying strength is characterized by a two-parameter Weibull distribution. Thus a component probability of failure curve can be depicted in an m r h graph as shown in Fig. 6. Points along the curve represent parameter pairs equal to a specified probability of failure. This curve is referred to as a material performance curve. We overlay this graph with point estimates of the Weibull distribution parameters obtained from tensile strength data that a typical material supplier might provide. Point estimates from these data that plot to the right of the material performance curve represent a lower probability of failure. Conversely, point estimates to the left of this curve are associated with performance curves with a higher probability of failure. Thus the material performance curve defines two regions of the m r h space, an acceptable performance region and a rejection region relative to a specified component probability of failure. The material performance curve is easily married to a likelihood confidence ring (discussed in previous sections). This allows the component fabricator to decide whether the material supplier is providing a material with high enough

14 14 STP 1578 On Graphite Testing for Nuclear Applications FIG. 6 Generic material performance curve. quality predicated on the component design and the failure data. Keep in mind that parameter estimates are estimates of the true distribution parameters of the population, values that are never known in real-life applications. However, through the use of the likelihood confidence ring method we can define a region in some close proximity of the estimated point parameter pair, knowing with some level of assurance that the true distribution parameters are contained within that region. If that region in its entirety falls to the right of the test specimen performance curve, the component fabricator can accept the material with a known level of quality (i.e., the significance level). Not surprisingly, we define this procedure as the quality acceptance criterion. We have combined the two concepts, the likelihood confidence ring and the material performance curve, in one figure (Fig. 7). Here the material performance curve given in Fig. 6 is overlain with the likelihood confidence ring from Fig. 2. This is a graphical representation of the quality assurance process. Rings that reside completely to the right of the material performance curve would represent acceptable materials. Those rings to the left would represent unacceptable materials and would be rejected. In the specific case presented, the material performance curve cuts through the likelihood confidence ring. In this case there are certain regions of the likelihood confidence ring that produce a safe design space, and there is a region of the likelihood confidence ring that produces an unsafe design space. In this situation we know the distribution parameters, and they are purposely to the right of the material performance curve. But given the sample size, the ring did not reside entirely in the safe region. Moreover, in normal designs we never know the true

15 DUFFY AND PARIKH, DOI /STP FIG. 7 Material performance curve with likelihood confidence ring contour, n ¼ 10 (m ¼ 17, r h ¼ 400). distribution parameters, so we do not know where the true distribution parameter pair resides inside the likelihood confidence ring. When the likelihood confidence ring resides totally to the left of the performance curve, the choice to reject the material is quite clear. When the likelihood confidence ring lies completely to the right of the material performance curve, then once again, the choice is quite clear: accept the material. When the material performance curve slices through the likelihood confidence ring, we can shift the material performance curve to the left, as depicted in Fig. 8. This shift represents a reduction of component reliability or, alternatively, an increase in the component probability of failure. Alternatively, the confidence bound associated with likelihood confidence ring can be reduced so the ring shrinks enough such that the ring is completely to the right of the material performance curve. This is depicted in Fig. 9. An interesting aspect of this approach is that it seems that the likelihood confidence rings give a good indication of which side of the material performance curve the true distribution parameters lie on. If the material performance curve slices through a likelihood confidence ring for a specified confidence level, then as the ring size is diminished the ring becomes tangent to one side of the curve or the other. When this paper was written it was our experience that the side of the component reliability curve that the ring becomes tangent to matches with the side on which the true distribution parameters lie. It is emphasized that this is anecdotal. An example in which the true distribution parameters were chosen to the left of the material performance curve is depicted in Fig. 10. The true distribution parameters

16 16 STP 1578 On Graphite Testing for Nuclear Applications FIG. 8 Two parallel material performance curves with likelihood confidence ring (m ¼ 17, r h ¼ 400). FIG. 9 Material performance curves with likelihood confidence rings for changing values of c (m ¼ 17, r h ¼ 400).

17 DUFFY AND PARIKH, DOI /STP FIG. 10 Material performance curves with likelihood confidence rings for changing values of c (m ¼ 6, r h ¼ 350). are known because we are conducting a Monte Carlo simulation exercise to produce failure data. As the confidence level decreases in Fig. 10, the rings become tangent to the curve on the rejection side. Summary This effort focused on graphite materials and the details associated with calculating point estimates for the Weibull distribution parameters associated with the tensile strength. One can easily generate point estimates from failure data using maximum likelihood estimators. More information regarding the population (i.e., more failure data) always improves the quality of point estimates; the question becomes how much data is sufficient given the application. The work outlined here speaks directly to this issue. Hypothesis testing and the relationship it maintains with parameter estimation were outlined. A test statistic was adopted that allows one to map out an acceptance region in the m r h parameter distribution space. The theoretical support for the equations used to generate the likelihood rings was outlined. Inferential statistics allowed us to generate confidence bounds on the true distribution parameters utilizing the test data at hand. These bounds are dependent on the size of the sample used to calculate point estimates. The effort focused on a particular type of confidence bound known as likelihood confidence rings. Component reliability curves were discussed. The concepts of the likelihood confidence rings and the component probability of failure curves were combined graphically. This combination gives rise to a material qualification process. This

18 18 STP 1578 On Graphite Testing for Nuclear Applications process combines information regarding the reliability of the component and the parameter estimates to assess the quality of the material. References [1] ASTM C565-93(2010)e1: Test Methods for Tension Testing of Carbon and Graphite Mechanical Materials, Annual Book of ASTM Standards, ASTM International, West Conshohocken, PA, [2] ASTM C749-08(2010)e1: Test Method for Tensile Stress-Strain of Carbon and Graphite, Annual Book of ASTM Standards, ASTM International, West Conshohocken, PA, [3] ASTM C781-08: Practice for Testing Graphite and Boronated Graphite Materials for Hightemperature Gas-cooled Nuclear Reactor Components, Annual Book of ASTM Standards, ASTM International, West Conshohocken, PA, [4] ASTM D e1: Guide for Measurements on Small Graphite Specimens, Annual Book of ASTM Standards, ASTM International, West Conshohocken, PA, [5] ASTM C651-11: Test Method for Flexural Strength of Manufactured Carbon and Graphite Articles Using Four-point Loading at Room Temperature, Annual Book of ASTM Standards, ASTM International, West Conshohocken, PA, [6] ASTM D : Reporting Uniaxial Strength Data and Estimating Weibull Distribution Parameters for Advanced Graphites, Annual Book of ASTM Standards, ASTM International, West Conshohocken, PA, [7] ASME, Article HHA-II-3000, Section III, Division 5, High Temperature Reactors, Rules for Construction of Nuclear Facility Components, ASME Boiler and Pressure Vessel Code, ASME, New York, [8] Fisher, R. A., Theory of Statistical Estimation, Proc. Cambridge Philos. Soc., Vol. 22, 1925, pp [9] Neyman, J. and Pearson, E., On the Problem of the Most Efficient Tests of Statistical Hypotheses, Philos. Trans. R. Soc. London Series A, Vol. 231, 1933, pp [10] Wilks, S. S., The Large Sample Distribution of the Likelihood Ratio for Testing Composite Hypotheses, Ann. Math. Stat., Vol. 9, No. 1, 1938, pp

Quality Control Using Inferential Statistics In Weibull Based Reliability Analyses S. F. Duffy 1 and A. Parikh 2

Quality Control Using Inferential Statistics In Weibull Based Reliability Analyses S. F. Duffy 1 and A. Parikh 2 Quality Control Using Inferential Statistics In Weibull Based Reliability Analyses S. F. Duffy 1 and A. Parikh 2 1 Cleveland State University 2 N & R Engineering www.inl.gov ASTM Symposium on Graphite

More information

QUALITY CONTROL USING INFERENTIAL STATISTICS IN WEIBULL ANALYSES FOR COMPONENTS FABRICATED FROM MONOLITHIC CERAMICS ANKURBEN PARIKH

QUALITY CONTROL USING INFERENTIAL STATISTICS IN WEIBULL ANALYSES FOR COMPONENTS FABRICATED FROM MONOLITHIC CERAMICS ANKURBEN PARIKH QUALITY CONTROL USING INFERENTIAL STATISTICS IN WEIBULL ANALYSES FOR COMPONENTS FABRICATED FROM MONOLITHIC CERAMICS ANKURBEN PARIKH Bachelor of Engineering in Civil - Irrigation and Water Management The

More information

Statistical Failure Analysis with Application to the Design of Nuclear Components Fabricated From Silicon Carbide

Statistical Failure Analysis with Application to the Design of Nuclear Components Fabricated From Silicon Carbide Statistical Failure Analysis with Application to the Design of Nuclear Components Fabricated From Silicon Carbide Stephen F Duffy a, b and Eric H Baker b Presented at 3 rd SiC Workshop Jeju Island, South

More information

Political Science 236 Hypothesis Testing: Review and Bootstrapping

Political Science 236 Hypothesis Testing: Review and Bootstrapping Political Science 236 Hypothesis Testing: Review and Bootstrapping Rocío Titiunik Fall 2007 1 Hypothesis Testing Definition 1.1 Hypothesis. A hypothesis is a statement about a population parameter The

More information

Standard Test Methods for Glass-Bonded Mica Used as Electrical Insulation 1

Standard Test Methods for Glass-Bonded Mica Used as Electrical Insulation 1 Designation: D 1039 94 (Reapproved 1999) e1 An American National Standard Standard Test Methods for Glass-Bonded Mica Used as Electrical Insulation 1 This standard is issued under the fixed designation

More information

Standard Practice for Heat Aging of Plastics Without Load 1

Standard Practice for Heat Aging of Plastics Without Load 1 Designation: D 3045 92 (Reapproved 2003) Standard Practice for Heat Aging of Plastics Without Load 1 This standard is issued under the fixed designation D 3045; the number immediately following the designation

More information

Parameter estimation and forecasting. Cristiano Porciani AIfA, Uni-Bonn

Parameter estimation and forecasting. Cristiano Porciani AIfA, Uni-Bonn Parameter estimation and forecasting Cristiano Porciani AIfA, Uni-Bonn Questions? C. Porciani Estimation & forecasting 2 Temperature fluctuations Variance at multipole l (angle ~180o/l) C. Porciani Estimation

More information

7.2 One-Sample Correlation ( = a) Introduction. Correlation analysis measures the strength and direction of association between

7.2 One-Sample Correlation ( = a) Introduction. Correlation analysis measures the strength and direction of association between 7.2 One-Sample Correlation ( = a) Introduction Correlation analysis measures the strength and direction of association between variables. In this chapter we will test whether the population correlation

More information

Parameter Estimation and Fitting to Data

Parameter Estimation and Fitting to Data Parameter Estimation and Fitting to Data Parameter estimation Maximum likelihood Least squares Goodness-of-fit Examples Elton S. Smith, Jefferson Lab 1 Parameter estimation Properties of estimators 3 An

More information

A Comparison of Equivalence Criteria and Basis Values for HEXCEL 8552 IM7 Unidirectional Tape computed from the NCAMP shared database

A Comparison of Equivalence Criteria and Basis Values for HEXCEL 8552 IM7 Unidirectional Tape computed from the NCAMP shared database Report No: Report Date: A Comparison of Equivalence Criteria and Basis Values for HEXCEL 8552 IM7 Unidirectional Tape computed from the NCAMP shared database NCAMP Report Number: Report Date: Elizabeth

More information

Hypothesis Testing. 1 Definitions of test statistics. CB: chapter 8; section 10.3

Hypothesis Testing. 1 Definitions of test statistics. CB: chapter 8; section 10.3 Hypothesis Testing CB: chapter 8; section 0.3 Hypothesis: statement about an unknown population parameter Examples: The average age of males in Sweden is 7. (statement about population mean) The lowest

More information

Probability and Statistics

Probability and Statistics Probability and Statistics Kristel Van Steen, PhD 2 Montefiore Institute - Systems and Modeling GIGA - Bioinformatics ULg kristel.vansteen@ulg.ac.be CHAPTER 4: IT IS ALL ABOUT DATA 4a - 1 CHAPTER 4: IT

More information

HYPOTHESIS TESTING: FREQUENTIST APPROACH.

HYPOTHESIS TESTING: FREQUENTIST APPROACH. HYPOTHESIS TESTING: FREQUENTIST APPROACH. These notes summarize the lectures on (the frequentist approach to) hypothesis testing. You should be familiar with the standard hypothesis testing from previous

More information

Finite-Element Analysis of Stress Concentration in ASTM D 638 Tension Specimens

Finite-Element Analysis of Stress Concentration in ASTM D 638 Tension Specimens Monika G. Garrell, 1 Albert J. Shih, 2 Edgar Lara-Curzio, 3 and Ronald O. Scattergood 4 Journal of Testing and Evaluation, Vol. 31, No. 1 Paper ID JTE11402_311 Available online at: www.astm.org Finite-Element

More information

HYPOTHESIS TESTING. Hypothesis Testing

HYPOTHESIS TESTING. Hypothesis Testing MBA 605 Business Analytics Don Conant, PhD. HYPOTHESIS TESTING Hypothesis testing involves making inferences about the nature of the population on the basis of observations of a sample drawn from the population.

More information

hypothesis a claim about the value of some parameter (like p)

hypothesis a claim about the value of some parameter (like p) Testing hypotheses hypothesis a claim about the value of some parameter (like p) significance test procedure to assess the strength of evidence provided by a sample of data against the claim of a hypothesized

More information

Chapter Three. Hypothesis Testing

Chapter Three. Hypothesis Testing 3.1 Introduction The final phase of analyzing data is to make a decision concerning a set of choices or options. Should I invest in stocks or bonds? Should a new product be marketed? Are my products being

More information

Physics 403. Segev BenZvi. Classical Hypothesis Testing: The Likelihood Ratio Test. Department of Physics and Astronomy University of Rochester

Physics 403. Segev BenZvi. Classical Hypothesis Testing: The Likelihood Ratio Test. Department of Physics and Astronomy University of Rochester Physics 403 Classical Hypothesis Testing: The Likelihood Ratio Test Segev BenZvi Department of Physics and Astronomy University of Rochester Table of Contents 1 Bayesian Hypothesis Testing Posterior Odds

More information

STAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots. March 8, 2015

STAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots. March 8, 2015 STAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots March 8, 2015 The duality between CI and hypothesis testing The duality between CI and hypothesis

More information

Statistical Methods for Particle Physics Lecture 1: parameter estimation, statistical tests

Statistical Methods for Particle Physics Lecture 1: parameter estimation, statistical tests Statistical Methods for Particle Physics Lecture 1: parameter estimation, statistical tests http://benasque.org/2018tae/cgi-bin/talks/allprint.pl TAE 2018 Benasque, Spain 3-15 Sept 2018 Glen Cowan Physics

More information

STAT 515 fa 2016 Lec Statistical inference - hypothesis testing

STAT 515 fa 2016 Lec Statistical inference - hypothesis testing STAT 515 fa 2016 Lec 20-21 Statistical inference - hypothesis testing Karl B. Gregory Wednesday, Oct 12th Contents 1 Statistical inference 1 1.1 Forms of the null and alternate hypothesis for µ and p....................

More information

Wooldridge, Introductory Econometrics, 4th ed. Appendix C: Fundamentals of mathematical statistics

Wooldridge, Introductory Econometrics, 4th ed. Appendix C: Fundamentals of mathematical statistics Wooldridge, Introductory Econometrics, 4th ed. Appendix C: Fundamentals of mathematical statistics A short review of the principles of mathematical statistics (or, what you should have learned in EC 151).

More information

Maan Jawad Global Engineering & Technology Camas, Washington, U.S.A.

Maan Jawad Global Engineering & Technology Camas, Washington, U.S.A. Proceedings of the ASME 018 Symposium on Elevated Temperature Application of Materials for Fossil, Nuclear, and Petrochemical Industries ETAM018 April 3-5, 018, Seattle, WA, USA ETAM018-6737 ALLOWABLE

More information

Topic 10: Hypothesis Testing

Topic 10: Hypothesis Testing Topic 10: Hypothesis Testing Course 003, 2016 Page 0 The Problem of Hypothesis Testing A statistical hypothesis is an assertion or conjecture about the probability distribution of one or more random variables.

More information

Lecture Notes on Inductive Definitions

Lecture Notes on Inductive Definitions Lecture Notes on Inductive Definitions 15-312: Foundations of Programming Languages Frank Pfenning Lecture 2 September 2, 2004 These supplementary notes review the notion of an inductive definition and

More information

Partitioning the Parameter Space. Topic 18 Composite Hypotheses

Partitioning the Parameter Space. Topic 18 Composite Hypotheses Topic 18 Composite Hypotheses Partitioning the Parameter Space 1 / 10 Outline Partitioning the Parameter Space 2 / 10 Partitioning the Parameter Space Simple hypotheses limit us to a decision between one

More information

23. MORE HYPOTHESIS TESTING

23. MORE HYPOTHESIS TESTING 23. MORE HYPOTHESIS TESTING The Logic Behind Hypothesis Testing For simplicity, consider testing H 0 : µ = µ 0 against the two-sided alternative H A : µ µ 0. Even if H 0 is true (so that the expectation

More information

T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS

T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS In our work on hypothesis testing, we used the value of a sample statistic to challenge an accepted value of a population parameter. We focused only

More information

STAT T&E COE-Report Reliability Test Planning for Mean Time Between Failures. Best Practice. Authored by: Jennifer Kensler, PhD STAT T&E COE

STAT T&E COE-Report Reliability Test Planning for Mean Time Between Failures. Best Practice. Authored by: Jennifer Kensler, PhD STAT T&E COE Reliability est Planning for Mean ime Between Failures Best Practice Authored by: Jennifer Kensler, PhD SA &E COE March 21, 2014 he goal of the SA &E COE is to assist in developing rigorous, defensible

More information

Lecture Notes on Inductive Definitions

Lecture Notes on Inductive Definitions Lecture Notes on Inductive Definitions 15-312: Foundations of Programming Languages Frank Pfenning Lecture 2 August 28, 2003 These supplementary notes review the notion of an inductive definition and give

More information

APPLICATION AND POWER OF PARAMETRIC CRITERIA FOR TESTING THE HOMOGENEITY OF VARIANCES. PART IV

APPLICATION AND POWER OF PARAMETRIC CRITERIA FOR TESTING THE HOMOGENEITY OF VARIANCES. PART IV DOI 10.1007/s11018-017-1213-4 Measurement Techniques, Vol. 60, No. 5, August, 2017 APPLICATION AND POWER OF PARAMETRIC CRITERIA FOR TESTING THE HOMOGENEITY OF VARIANCES. PART IV B. Yu. Lemeshko and T.

More information

Statistical Data Analysis Stat 3: p-values, parameter estimation

Statistical Data Analysis Stat 3: p-values, parameter estimation Statistical Data Analysis Stat 3: p-values, parameter estimation London Postgraduate Lectures on Particle Physics; University of London MSci course PH4515 Glen Cowan Physics Department Royal Holloway,

More information

Class 19. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

Class 19. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700 Class 19 Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science Copyright 2017 by D.B. Rowe 1 Agenda: Recap Chapter 8.3-8.4 Lecture Chapter 8.5 Go over Exam. Problem Solving

More information

401 Review. 6. Power analysis for one/two-sample hypothesis tests and for correlation analysis.

401 Review. 6. Power analysis for one/two-sample hypothesis tests and for correlation analysis. 401 Review Major topics of the course 1. Univariate analysis 2. Bivariate analysis 3. Simple linear regression 4. Linear algebra 5. Multiple regression analysis Major analysis methods 1. Graphical analysis

More information

Testing Hypothesis. Maura Mezzetti. Department of Economics and Finance Università Tor Vergata

Testing Hypothesis. Maura Mezzetti. Department of Economics and Finance Università Tor Vergata Maura Department of Economics and Finance Università Tor Vergata Hypothesis Testing Outline It is a mistake to confound strangeness with mystery Sherlock Holmes A Study in Scarlet Outline 1 The Power Function

More information

EC2001 Econometrics 1 Dr. Jose Olmo Room D309

EC2001 Econometrics 1 Dr. Jose Olmo Room D309 EC2001 Econometrics 1 Dr. Jose Olmo Room D309 J.Olmo@City.ac.uk 1 Revision of Statistical Inference 1.1 Sample, observations, population A sample is a number of observations drawn from a population. Population:

More information

Psychology 282 Lecture #4 Outline Inferences in SLR

Psychology 282 Lecture #4 Outline Inferences in SLR Psychology 282 Lecture #4 Outline Inferences in SLR Assumptions To this point we have not had to make any distributional assumptions. Principle of least squares requires no assumptions. Can use correlations

More information

Lectures 5 & 6: Hypothesis Testing

Lectures 5 & 6: Hypothesis Testing Lectures 5 & 6: Hypothesis Testing in which you learn to apply the concept of statistical significance to OLS estimates, learn the concept of t values, how to use them in regression work and come across

More information

Machine Learning, Midterm Exam: Spring 2008 SOLUTIONS. Q Topic Max. Score Score. 1 Short answer questions 20.

Machine Learning, Midterm Exam: Spring 2008 SOLUTIONS. Q Topic Max. Score Score. 1 Short answer questions 20. 10-601 Machine Learning, Midterm Exam: Spring 2008 Please put your name on this cover sheet If you need more room to work out your answer to a question, use the back of the page and clearly mark on the

More information

8.1-4 Test of Hypotheses Based on a Single Sample

8.1-4 Test of Hypotheses Based on a Single Sample 8.1-4 Test of Hypotheses Based on a Single Sample Example 1 (Example 8.6, p. 312) A manufacturer of sprinkler systems used for fire protection in office buildings claims that the true average system-activation

More information

Institute of Actuaries of India

Institute of Actuaries of India Institute of Actuaries of India Subject CT3 Probability & Mathematical Statistics May 2011 Examinations INDICATIVE SOLUTION Introduction The indicative solution has been written by the Examiners with the

More information

FHWA Bridge Design Guidance No. 1 Revision Date: July 21, Load Rating Evaluation of Gusset Plates in Truss Bridges

FHWA Bridge Design Guidance No. 1 Revision Date: July 21, Load Rating Evaluation of Gusset Plates in Truss Bridges FHWA Bridge Design Guidance No. 1 Revision Date: July 21, 2008 Load Rating Evaluation of Gusset Plates in Truss Bridges By Firas I. Sheikh Ibrahim, PhD, PE Part B Gusset Plate Resistance in Accordance

More information

DEVELOPMENT OF TEST GUIDANCE FOR COMPACT TENSION FRACTURE TOUGHNESS SPECIMENS CONTAINING NOTCHES INSTEAD OF FATIGUE PRE-CRACKS

DEVELOPMENT OF TEST GUIDANCE FOR COMPACT TENSION FRACTURE TOUGHNESS SPECIMENS CONTAINING NOTCHES INSTEAD OF FATIGUE PRE-CRACKS Transactions, SMiRT-23 Division II, Paper ID 287 Fracture Mechanics and Structural Integrity DEVELOPMENT OF TEST GUIDANCE FOR COMPACT TENSION FRACTURE TOUGHNESS SPECIMENS CONTAINING NOTCHES INSTEAD OF

More information

Available online at ScienceDirect. 5th Fatigue Design Conference, Fatigue Design 2013

Available online at   ScienceDirect. 5th Fatigue Design Conference, Fatigue Design 2013 Available online at www.sciencedirect.com ScienceDirect Procedia Engineering 66 ( 2013 ) 626 634 5th Fatigue Design Conference, Fatigue Design 2013 Fatigue Damage Analysis on Aluminium Alloy Specimens

More information

Life Prediction of Structural Components

Life Prediction of Structural Components Life Prediction of Structural Components Steve Duffy Cleveland State University, Connecticut Reserve Technologies, LLC Lesley A. Janosik NASA Glenn Research Center Andy Wereszczak Oak Ridge National Laboratory

More information

Parameter Estimation, Sampling Distributions & Hypothesis Testing

Parameter Estimation, Sampling Distributions & Hypothesis Testing Parameter Estimation, Sampling Distributions & Hypothesis Testing Parameter Estimation & Hypothesis Testing In doing research, we are usually interested in some feature of a population distribution (which

More information

Materials Selection and Design Materials Selection - Practice

Materials Selection and Design Materials Selection - Practice Materials Selection and Design Materials Selection - Practice Each material is characterized by a set of attributes that include its mechanical, thermal, electrical, optical, and chemical properties; its

More information

Topic 17: Simple Hypotheses

Topic 17: Simple Hypotheses Topic 17: November, 2011 1 Overview and Terminology Statistical hypothesis testing is designed to address the question: Do the data provide sufficient evidence to conclude that we must depart from our

More information

Supplementary Notes on Inductive Definitions

Supplementary Notes on Inductive Definitions Supplementary Notes on Inductive Definitions 15-312: Foundations of Programming Languages Frank Pfenning Lecture 2 August 29, 2002 These supplementary notes review the notion of an inductive definition

More information

Chapter 1 Statistical Inference

Chapter 1 Statistical Inference Chapter 1 Statistical Inference causal inference To infer causality, you need a randomized experiment (or a huge observational study and lots of outside information). inference to populations Generalizations

More information

Chapter 12: Estimation

Chapter 12: Estimation Chapter 12: Estimation Estimation In general terms, estimation uses a sample statistic as the basis for estimating the value of the corresponding population parameter. Although estimation and hypothesis

More information

Probability Methods in Civil Engineering Prof. Dr. Rajib Maity Department of Civil Engineering Indian Institution of Technology, Kharagpur

Probability Methods in Civil Engineering Prof. Dr. Rajib Maity Department of Civil Engineering Indian Institution of Technology, Kharagpur Probability Methods in Civil Engineering Prof. Dr. Rajib Maity Department of Civil Engineering Indian Institution of Technology, Kharagpur Lecture No. # 36 Sampling Distribution and Parameter Estimation

More information

STATS 200: Introduction to Statistical Inference. Lecture 29: Course review

STATS 200: Introduction to Statistical Inference. Lecture 29: Course review STATS 200: Introduction to Statistical Inference Lecture 29: Course review Course review We started in Lecture 1 with a fundamental assumption: Data is a realization of a random process. The goal throughout

More information

Laboratory 4 Bending Test of Materials

Laboratory 4 Bending Test of Materials Department of Materials and Metallurgical Engineering Bangladesh University of Engineering Technology, Dhaka MME 222 Materials Testing Sessional.50 Credits Laboratory 4 Bending Test of Materials. Objective

More information

The problem of base rates

The problem of base rates Psychology 205: Research Methods in Psychology William Revelle Department of Psychology Northwestern University Evanston, Illinois USA October, 2015 1 / 14 Outline Inferential statistics 2 / 14 Hypothesis

More information

Maximum-Likelihood Estimation: Basic Ideas

Maximum-Likelihood Estimation: Basic Ideas Sociology 740 John Fox Lecture Notes Maximum-Likelihood Estimation: Basic Ideas Copyright 2014 by John Fox Maximum-Likelihood Estimation: Basic Ideas 1 I The method of maximum likelihood provides estimators

More information

Statistical Methods in Particle Physics Lecture 1: Bayesian methods

Statistical Methods in Particle Physics Lecture 1: Bayesian methods Statistical Methods in Particle Physics Lecture 1: Bayesian methods SUSSP65 St Andrews 16 29 August 2009 Glen Cowan Physics Department Royal Holloway, University of London g.cowan@rhul.ac.uk www.pp.rhul.ac.uk/~cowan

More information

Class 24. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

Class 24. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700 Class 4 Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science Copyright 013 by D.B. Rowe 1 Agenda: Recap Chapter 9. and 9.3 Lecture Chapter 10.1-10.3 Review Exam 6 Problem Solving

More information

Unit 19 Formulating Hypotheses and Making Decisions

Unit 19 Formulating Hypotheses and Making Decisions Unit 19 Formulating Hypotheses and Making Decisions Objectives: To formulate a null hypothesis and an alternative hypothesis, and to choose a significance level To identify the Type I error and the Type

More information

Composite Hypotheses. Topic Partitioning the Parameter Space The Power Function

Composite Hypotheses. Topic Partitioning the Parameter Space The Power Function Toc 18 Simple hypotheses limit us to a decision between one of two possible states of nature. This limitation does not allow us, under the procedures of hypothesis testing to address the basic question:

More information

Single Sample Means. SOCY601 Alan Neustadtl

Single Sample Means. SOCY601 Alan Neustadtl Single Sample Means SOCY601 Alan Neustadtl The Central Limit Theorem If we have a population measured by a variable with a mean µ and a standard deviation σ, and if all possible random samples of size

More information

Probability and Statistics. Joyeeta Dutta-Moscato June 29, 2015

Probability and Statistics. Joyeeta Dutta-Moscato June 29, 2015 Probability and Statistics Joyeeta Dutta-Moscato June 29, 2015 Terms and concepts Sample vs population Central tendency: Mean, median, mode Variance, standard deviation Normal distribution Cumulative distribution

More information

Week 12 Hypothesis Testing, Part II Comparing Two Populations

Week 12 Hypothesis Testing, Part II Comparing Two Populations Week 12 Hypothesis Testing, Part II Week 12 Hypothesis Testing, Part II Week 12 Objectives 1 The principle of Analysis of Variance is introduced and used to derive the F-test for testing the model utility

More information

4 Hypothesis testing. 4.1 Types of hypothesis and types of error 4 HYPOTHESIS TESTING 49

4 Hypothesis testing. 4.1 Types of hypothesis and types of error 4 HYPOTHESIS TESTING 49 4 HYPOTHESIS TESTING 49 4 Hypothesis testing In sections 2 and 3 we considered the problem of estimating a single parameter of interest, θ. In this section we consider the related problem of testing whether

More information

Basics of Uncertainty Analysis

Basics of Uncertainty Analysis Basics of Uncertainty Analysis Chapter Six Basics of Uncertainty Analysis 6.1 Introduction As shown in Fig. 6.1, analysis models are used to predict the performances or behaviors of a product under design.

More information

Statistical Process Control (contd... )

Statistical Process Control (contd... ) Statistical Process Control (contd... ) ME522: Quality Engineering Vivek Kumar Mehta November 11, 2016 Note: This lecture is prepared with the help of material available online at https://onlinecourses.science.psu.edu/

More information

Standard Test Method for Open Hole Tensile Strength of Polymer Matrix Composite Laminates 1

Standard Test Method for Open Hole Tensile Strength of Polymer Matrix Composite Laminates 1 This document is not an ASTM standard and is intended only to provide the user of an ASTM standard an indication of what changes have been made to the previous version. Because it may not be technically

More information

Introduction to Statistical Inference

Introduction to Statistical Inference Introduction to Statistical Inference Ping Yu Department of Economics University of Hong Kong Ping Yu (HKU) Statistics 1 / 30 1 Point Estimation 2 Hypothesis Testing Ping Yu (HKU) Statistics 2 / 30 The

More information

Practice Problems Section Problems

Practice Problems Section Problems Practice Problems Section 4-4-3 4-4 4-5 4-6 4-7 4-8 4-10 Supplemental Problems 4-1 to 4-9 4-13, 14, 15, 17, 19, 0 4-3, 34, 36, 38 4-47, 49, 5, 54, 55 4-59, 60, 63 4-66, 68, 69, 70, 74 4-79, 81, 84 4-85,

More information

The University of Hong Kong Department of Statistics and Actuarial Science STAT2802 Statistical Models Tutorial Solutions Solutions to Problems 71-80

The University of Hong Kong Department of Statistics and Actuarial Science STAT2802 Statistical Models Tutorial Solutions Solutions to Problems 71-80 The University of Hong Kong Department of Statistics and Actuarial Science STAT2802 Statistical Models Tutorial Solutions Solutions to Problems 71-80 71. Decide in each case whether the hypothesis is simple

More information

Statistical Methods for Particle Physics (I)

Statistical Methods for Particle Physics (I) Statistical Methods for Particle Physics (I) https://agenda.infn.it/conferencedisplay.py?confid=14407 Glen Cowan Physics Department Royal Holloway, University of London g.cowan@rhul.ac.uk www.pp.rhul.ac.uk/~cowan

More information

Note on Bivariate Regression: Connecting Practice and Theory. Konstantin Kashin

Note on Bivariate Regression: Connecting Practice and Theory. Konstantin Kashin Note on Bivariate Regression: Connecting Practice and Theory Konstantin Kashin Fall 2012 1 This note will explain - in less theoretical terms - the basics of a bivariate linear regression, including testing

More information

Standard Guide for Determination of the Thermal Resistance of Low-Density Blanket-Type Mineral Fiber Insulation 1

Standard Guide for Determination of the Thermal Resistance of Low-Density Blanket-Type Mineral Fiber Insulation 1 Designation: C 653 97 Standard Guide for Determination of the Thermal Resistance of Low-Density Blanket-Type Mineral Fiber Insulation 1 This standard is issued under the fixed designation C 653; the number

More information

Testing for Equivalence. Purpose of Equivalence CMH17 Statistical Analysis Methods

Testing for Equivalence. Purpose of Equivalence CMH17 Statistical Analysis Methods Testing for Equivalence Purpose of Equivalence CMH17 Statistical Analysis Methods September 14-16, 2010 Safety Awareness Workshop Purpose of Equivalence To demonstrate material equivalency: to assure that

More information

Statistics Primer. ORC Staff: Jayme Palka Peter Boedeker Marcus Fagan Trey Dejong

Statistics Primer. ORC Staff: Jayme Palka Peter Boedeker Marcus Fagan Trey Dejong Statistics Primer ORC Staff: Jayme Palka Peter Boedeker Marcus Fagan Trey Dejong 1 Quick Overview of Statistics 2 Descriptive vs. Inferential Statistics Descriptive Statistics: summarize and describe data

More information

Probability and Statistics. Terms and concepts

Probability and Statistics. Terms and concepts Probability and Statistics Joyeeta Dutta Moscato June 30, 2014 Terms and concepts Sample vs population Central tendency: Mean, median, mode Variance, standard deviation Normal distribution Cumulative distribution

More information

STA 732: Inference. Notes 2. Neyman-Pearsonian Classical Hypothesis Testing B&D 4

STA 732: Inference. Notes 2. Neyman-Pearsonian Classical Hypothesis Testing B&D 4 STA 73: Inference Notes. Neyman-Pearsonian Classical Hypothesis Testing B&D 4 1 Testing as a rule Fisher s quantification of extremeness of observed evidence clearly lacked rigorous mathematical interpretation.

More information

Hypothesis testing. Chapter Formulating a hypothesis. 7.2 Testing if the hypothesis agrees with data

Hypothesis testing. Chapter Formulating a hypothesis. 7.2 Testing if the hypothesis agrees with data Chapter 7 Hypothesis testing 7.1 Formulating a hypothesis Up until now we have discussed how to define a measurement in terms of a central value, uncertainties, and units, as well as how to extend these

More information

Testing Goodness-of-Fit for Exponential Distribution Based on Cumulative Residual Entropy

Testing Goodness-of-Fit for Exponential Distribution Based on Cumulative Residual Entropy This article was downloaded by: [Ferdowsi University] On: 16 April 212, At: 4:53 Publisher: Taylor & Francis Informa Ltd Registered in England and Wales Registered Number: 172954 Registered office: Mortimer

More information

Hypothesis Testing. Part I. James J. Heckman University of Chicago. Econ 312 This draft, April 20, 2006

Hypothesis Testing. Part I. James J. Heckman University of Chicago. Econ 312 This draft, April 20, 2006 Hypothesis Testing Part I James J. Heckman University of Chicago Econ 312 This draft, April 20, 2006 1 1 A Brief Review of Hypothesis Testing and Its Uses values and pure significance tests (R.A. Fisher)

More information

Chapter 2 Complex Modulus Variation by Manipulation of Mechanical Test Method and Print Direction

Chapter 2 Complex Modulus Variation by Manipulation of Mechanical Test Method and Print Direction Chapter 2 Complex Modulus Variation by Manipulation of Mechanical Test Method and Print Direction Megan L. Liu, Katherine K. Reichl, and Daniel J. Inman Abstract 3D printing technologies have made creating

More information

Many natural processes can be fit to a Poisson distribution

Many natural processes can be fit to a Poisson distribution BE.104 Spring Biostatistics: Poisson Analyses and Power J. L. Sherley Outline 1) Poisson analyses 2) Power What is a Poisson process? Rare events Values are observational (yes or no) Random distributed

More information

Hypothesis Testing. File: /General/MLAB-Text/Papers/hyptest.tex

Hypothesis Testing. File: /General/MLAB-Text/Papers/hyptest.tex File: /General/MLAB-Text/Papers/hyptest.tex Hypothesis Testing Gary D. Knott, Ph.D. Civilized Software, Inc. 12109 Heritage Park Circle Silver Spring, MD 20906 USA Tel. (301) 962-3711 Email: csi@civilized.com

More information

Hypothesis Testing. BS2 Statistical Inference, Lecture 11 Michaelmas Term Steffen Lauritzen, University of Oxford; November 15, 2004

Hypothesis Testing. BS2 Statistical Inference, Lecture 11 Michaelmas Term Steffen Lauritzen, University of Oxford; November 15, 2004 Hypothesis Testing BS2 Statistical Inference, Lecture 11 Michaelmas Term 2004 Steffen Lauritzen, University of Oxford; November 15, 2004 Hypothesis testing We consider a family of densities F = {f(x; θ),

More information

Ch. 5 Hypothesis Testing

Ch. 5 Hypothesis Testing Ch. 5 Hypothesis Testing The current framework of hypothesis testing is largely due to the work of Neyman and Pearson in the late 1920s, early 30s, complementing Fisher s work on estimation. As in estimation,

More information

Information measures in simple coding problems

Information measures in simple coding problems Part I Information measures in simple coding problems in this web service in this web service Source coding and hypothesis testing; information measures A(discrete)source is a sequence {X i } i= of random

More information

3.2.2 principal material coordinate system, n a coordinate system with axes that are normal to the planes of symmetry inherent to a material

3.2.2 principal material coordinate system, n a coordinate system with axes that are normal to the planes of symmetry inherent to a material Designation: Standard Test Method for Open Hole Tensile Strength of Polymer Matrix Composite Laminates 1 This standard is issued under the fixed designation D 5766/D 5766M; the number immediately following

More information

Section 10.1 (Part 2 of 2) Significance Tests: Power of a Test

Section 10.1 (Part 2 of 2) Significance Tests: Power of a Test 1 Section 10.1 (Part 2 of 2) Significance Tests: Power of a Test Learning Objectives After this section, you should be able to DESCRIBE the relationship between the significance level of a test, P(Type

More information

IENG581 Design and Analysis of Experiments INTRODUCTION

IENG581 Design and Analysis of Experiments INTRODUCTION Experimental Design IENG581 Design and Analysis of Experiments INTRODUCTION Experiments are performed by investigators in virtually all fields of inquiry, usually to discover something about a particular

More information

Topic 10: Hypothesis Testing

Topic 10: Hypothesis Testing Topic 10: Hypothesis Testing Course 003, 2017 Page 0 The Problem of Hypothesis Testing A statistical hypothesis is an assertion or conjecture about the probability distribution of one or more random variables.

More information

Lab Exercise #5: Tension and Bending with Strain Gages

Lab Exercise #5: Tension and Bending with Strain Gages Lab Exercise #5: Tension and Bending with Strain Gages Pre-lab assignment: Yes No Goals: 1. To evaluate tension and bending stress models and Hooke s Law. a. σ = Mc/I and σ = P/A 2. To determine material

More information

Hypothesis for Means and Proportions

Hypothesis for Means and Proportions November 14, 2012 Hypothesis Tests - Basic Ideas Often we are interested not in estimating an unknown parameter but in testing some claim or hypothesis concerning a population. For example we may wish

More information

Sampling, Confidence Interval and Hypothesis Testing

Sampling, Confidence Interval and Hypothesis Testing Sampling, Confidence Interval and Hypothesis Testing Christopher Grigoriou Executive MBA HEC Lausanne 2007-2008 1 Sampling : Careful with convenience samples! World War II: A statistical study to decide

More information

Fundamental Probability and Statistics

Fundamental Probability and Statistics Fundamental Probability and Statistics "There are known knowns. These are things we know that we know. There are known unknowns. That is to say, there are things that we know we don't know. But there are

More information

Hypothesis Tests Solutions COR1-GB.1305 Statistics and Data Analysis

Hypothesis Tests Solutions COR1-GB.1305 Statistics and Data Analysis Hypothesis Tests Solutions COR1-GB.1305 Statistics and Data Analysis Introduction 1. An analyst claims to have a reliable model for Twitter s quarterly revenues. His model predicted that the most recent

More information

Analysis of the AIC Statistic for Optimal Detection of Small Changes in Dynamic Systems

Analysis of the AIC Statistic for Optimal Detection of Small Changes in Dynamic Systems Analysis of the AIC Statistic for Optimal Detection of Small Changes in Dynamic Systems Jeremy S. Conner and Dale E. Seborg Department of Chemical Engineering University of California, Santa Barbara, CA

More information

Some History of Optimality

Some History of Optimality IMS Lecture Notes- Monograph Series Optimality: The Third Erich L. Lehmann Symposium Vol. 57 (2009) 11-17 @ Institute of Mathematical Statistics, 2009 DOl: 10.1214/09-LNMS5703 Erich L. Lehmann University

More information

Harvard University. Rigorous Research in Engineering Education

Harvard University. Rigorous Research in Engineering Education Statistical Inference Kari Lock Harvard University Department of Statistics Rigorous Research in Engineering Education 12/3/09 Statistical Inference You have a sample and want to use the data collected

More information

Hypothesis Test. The opposite of the null hypothesis, called an alternative hypothesis, becomes

Hypothesis Test. The opposite of the null hypothesis, called an alternative hypothesis, becomes Neyman-Pearson paradigm. Suppose that a researcher is interested in whether the new drug works. The process of determining whether the outcome of the experiment points to yes or no is called hypothesis

More information

Basic Concepts of Inference

Basic Concepts of Inference Basic Concepts of Inference Corresponds to Chapter 6 of Tamhane and Dunlop Slides prepared by Elizabeth Newton (MIT) with some slides by Jacqueline Telford (Johns Hopkins University) and Roy Welsch (MIT).

More information