Statistical Hypothesis Testing: Problems and Alternatives

Size: px
Start display at page:

Download "Statistical Hypothesis Testing: Problems and Alternatives"

Transcription

1 FORUM Statistical Hypothesis Testing: Problems and Alternatives NORMAN S. MATLOFF Division of Computer Science, University of California at Davis, Davis, California Environ. Entomol. 20(5): (1991) ABSTRACT Scientists must make decisions based on experimental data and often use statistical hypothesis tests, either formally or via P values, as the basis for such decisions. However, these tests are designed for answering questions which are almost never of scientific relevance. This paper demonstrates this problem and discusses specific alternative methods. KEY WORDS hypothesis testing, confidence intervals, simultaneous inference THE ANALYSISOF EXPERIMENTALDATAis inher- ently a statistical problem because the data either explicitly or implicitly arise via sampling from some population. Suppose, for example, we feed a group of animals of a certain breed an ordinary diet for a given period of time, then feed the same animals a zinc-enriched diet, taking before-and-after measurements of some attribute X. Then we wish to extrapolate the results to the (conceptual) population of all animals of this breed. If, say, the mean of X was 24.7 units higher in these animals after the zinc-enriched diet, we would like to be able to say that the mean of X would be substantially higher when computed among all animals of this breed. In most cases, the scientist is called upon to make a decision of some kind; e.g., he or she might be asked to make a recommendation as to whether the zinc-enriched diet produces important benefits. A glance at the literature in almost any scientific field immediately reveals that statistical hypothesis tests, typically in the form of "p values" (i.e., observed significance levels), playa dominant role in this decision-making. Tables summarizing experimental results often are presented using the "star system," in which asterisks are used to flag the results that have small p values. However, hypothesis tests can produce misleading results. In the zinc example described above, for instance, suppose that zinc enrichment produces only a very minor effect of no scientific importance. If we collect data on a large sample of animals, we will probably get a small P value. As is customary, this small P value would be called "significant" in spite of the fact the difference caused by zinc-enrichment is not of scientific significance. In other words, statistical significance is not the same as scientific significance. The divergence between these two concepts stems from the fact that a hypothesis test, either a formal one or one implied by the reporting of P values, is addressing the wrong question. Specifically, the test is asking whether a certain condition holds exactly, and this exactness is almost never of scientific interest. For instance, the hypothesis being tested in the diet example described above would probably be Ho: J.tdiff = 0, where J.tdlff is the population mean difference between the zinc-enriched and nonzinc-enriched values of X. But Ho is not the question of scientific interest. If JA.diff is nonzero but close to 0, then Ho is false in a mathematical sense yet is essentially true from the scientist's point of view. Unfortunately, the test is designed to address only the mathematical sense. The problem can be even worse in other settings, such as goodness-of-fit tests. For instance, consider a genetics problem in which the model states that phenotype ratios are 9:3:3:1. A goodness-of-fit test is designed to answer whether these ratios hold exactly. However, we know a priori this is not true; no model can completely capture all possible genetical mechanisms. What we are interested in instead is whether a given model holds to a sufficiently close approximation that the model will be scientifically useful. Yet, again, a standard goodness-of-fit test is not designed to answer such a question. It is designed to answer whether the model holds exactly, and thus a large enough sample will pounce on even the most minor deviations from that model. Even in cases in which the exactness of a certain null hypothesis Ho seems to be of scientific interest, our test may pick up "departures" from Ho that are due to "noise" rather than to real differences. For example, a measuring instrument being used in the experiment may have a small but nonzero bias. With a large enough sample, we would reject H 0 even if it is true. So, a hypothesis test can be very misleading with large samples. But the other side of the coin is that a test can be equally misleading with small samples. Here, even a departure from Ho that is large enough Xj91jI246_1250$02.00jO 1991 Entomological Society of America

2 October 1991 MATLOFF: STATISTICAL HYPOTHESIS TESTING 1247 to be of scientific interest may easily be missed because of the small sample size. If so, no asterisks will be reported next to the result, and it will be announced that no significant difference was found, even if the treatment was of substantial value. These problems could be ameliorated to some extent if the power of the test, typically denoted 1 - {3,were computed at various types and degrees of departures from Ho. But this is difficult to do in complex models and in any case is still only indirectly answering the questions of real scientific interest. Recognition of problems such as those cited above is not new. They have been mentioned occasionally, such as by Snedecor & Cochran (1967), Graybill (1976), Freedman et al. (1978), Jones & Matloff (1986), Matloff (1988), and Moore & McCabe (1989). The usual alternative cited is to create confidence intervals (CIs). However, although the use of CIs is straightforward in simple settings comparing two means such as that of the zinc example, it is less clear how this approach can be extended to other types of settings such as the goodness-offit problem mentioned earlier. This paper discusses the drawbacks of hypothesis testing in more depth, using a different approach than do the references above. It will also present a generally applicable tool for decision-making that does not involve testing. The concepts are illustrated using the genetics example mentioned above, which will now be described more formally. Let p, denote the population proportion of phenotype i; i = 1, 2, 3, 4. The null hypothesis is then 933 P, = 16' P2 = 16' P3 = 16' 1 P. = 16' (1.1) Should We Be "Guided by the Stars"? As already mentioned, it is common in the presentation of experimental results to use asterisks to flag statistics that have small P values. A statistic is described as being "One-star," "two-star," and so on. It was argued qualitatively in the introduction that such descriptions are useless and even dangerous. This section presents a quantitative discussion of this problem. Suppose we perform a x 2 goodness-of-fit test on the genetics model hypothesized in equation (1.1) above, based on a random sample of size n. Assume that we report our results with the following "star system": P value> 0.05, no stars; P < 0.05, one star; P < 0.01, two stars; P < 0.005, three stars; p < 0.001, four stars. How many stars can we expect under various conditions, such as type or degree of departure from Ho and size of n? To investigate this question, a computer simulation was conducted under the five sets of phenotype proportions p, shown in Table 1. By comparison, Ho in decimal form is Ho: PI = , P2 = , P3 = , P. = (2.1) Thus, settings I through V represent progressively larger departures from H o. Specifically, these departures from Ho are taken in the direction toward the setting in which p, = c" where c, is 1 and all the other c, are O. Let h, be the value of p, hypothesized by Ho. Then setting] has p, = AJb; + (1 - AJ)c" i = 1, 2,3, 4, and] = I, II, III, IV, V. The values used for AJ were 0.95, 0.90, 0.85, 0.75, and 0.65 for] = I, II, III, IV, V, respectively. For each set of proportions p" sample sizes n = 50, 100, 250, 500, 750, and 1,000 were simulated. For each set of proportions p, and each value of n, 10,000 samples were generated, recording the number of stars for each sample. The goal was to find the expected value of the number of stars for each setting and each value of n. The results are shown in Fig. 1, which certainly shows the effects of large and small sample sizes discussed in the introduction. The proportions p, for curve II are only slightly different from those of H 0, yet for samples of size 750, the expected number of stars is one. In other words, we will probably characterize the p, as being "significantly different" from the specification in H0' even though the difference is actually rather small. On the other hand, the proportions p, for curve V represent a very substantial departure from H 0, yet the curve shows that this departure will not have a high probability of being detected if our sample size is only 50. In fact, in a separate computation not shown in Fig. 1, it was found that only 37% of the samples of size 50 yielded one or more stars. Thus, a substantial departure is highly likely to be overlooked in small samples. A horizontal line at height 1.0 intersects four of the five curves within the range of sample sizes studied. Thus, if we are told only that a certain statistic is "one-star," this information by itself is useless. "One-starness" is a property that would be consistent with any of the proportion sets p, from II to V. In other words, the knowledge that our statistic rates one star does not tell us whether the departure from Ho is substantial or is very minor and possibly of no scientific interest. Thus the number of stars by itself is noninformative for scientific purposes. As mentioned in the introduction, the number of stars by itself is relevant only to the question of whether Ho is exactly true-a question which is almost always not of interest to us, especially be-

3 1248 ENVIRONMENTAL ENTOMOLOGY Vol. 20, no. 5 Table 1. Alternative phenotype proportions J values PI P2 pg P4 4.0 I III II III mean IV V stars II 1.0 cause we usually know a priori that H 0 cannot be exactly true. Of course, if we were to be armed with charts similar to Fig. 1 when we read the scientific literature, we could view the number of stars appended to a statistic in proper perspective. This kind of chart, combined with our knowledge of the sample size n, could in theory impart some real meaning to the star counts. For any given setting, such a chart would give us an indication of how the value of n affects the number of stars reported. But this would be impractical. A different chart would need to be constructed for each setting, because the size of n has quite different effects in different settings. Most readers do not have the time and resources to generate such charts. Moreover, note that even Fig. 1 is woefully incomplete. It presents curves for only a few proportion sets p" and only sets of a particular kind-sets in which p, is increasing toward 1.0 and P2' Po, and P. are decreasing to 0.0. We have no idea what the curves for other types of departures look like. And in any case, the use of such charts would still only be a patch up repair to the problems stemming from the fact that hypothesis tests do not address questions of scientific interest. Alternatives The major point developed in the preceding section was that P values by themselves have very little scientific meaning. They are designed only to assess the exact validity of Ho, a question which is almost never of scientific interest. Thus, instead, we should base our analyses on methods that do allow direct examination of questions of scientific interest. Consider again the zinc example. Suppose our sample consists of n animals. Using the n differences X.fte< - Xbefore, we can form a CI for J.tdrH, which is denoted here as (a, b). Consider the following four situations: (1) a is near 0, and b - a is large; (2) a is near 0, and b - a is small; (3) a is far from 0, and b - a is large; (4) a is far from 0, and b - a is small. As can be seen in the situations listed above, a CI for J.tdiff actually carries two major pieces of information, which are called here "the estimate (E) part" and "the accuracy (A) part." The E part estimates the size of the effect of zinc. Here, the E part is the distance from a to O. The A part provides an indication of the accuracy of the E n Fig. 1. Mean number of asterisks part. Here the A part is the size of b - a. Of course, the words "near," "far," "large," and "small" have to be defined in this context of the experiment itself. For example, "near" would be defined to be within the range that the investigator, or the reader of the investigator's report, considers to represent an unimportant effect. In situation 1, we see that, although the E part of the CI suggests that the effect of zinc might not be large, the A part tells us that the effect might on the other hand actually be quite substantial. This contrasts with situation 2, in which we have strong evidence that the effect is small. The main point here is that the "star system" includes neither an E part nor an A part and thus excludes vital information. In situation 2, for instance, in which the CI is telling us that we can be fairly confident that zinc does not have an important effect, a hypothesis test would indicate a "significant" effect. Reliance on hypothesis tests in such a situation is clearly contrary to good scientific method and in fact defeats the purpose of requiring statistical analyses when reporting experimental results. In the preceding section, it was shown that hypothesis tests can be misleading in both small and large samples. The dangers of hypothesis tests in small samples occur mainly in situation I, in which a possibly substantial effect might be missed because of insufficient sample size. There is no such danger in basing our analysis on CIs, because in situation 1, the A part of the CI would warn us that the sample size was too small. Similarly, the danger of hypothesis testing in large samples lies mainly in situation 2, in which a small effect of no scientific interest might be declared as "highly significant." Again, this danger is avoided if one uses CIs, because the E part would show us that the effect is minor. The question to be addressed now is how to extend CI analysis such as that above to more complex settings. This will be done through the formation of simultaneous CIs, analogous to the Scheffe or Duncan methodology that is familiar to many scientists. In fact, the methodology presented here is an extension of Scheffe's techniques. Following is the technical environment which is I

4 October 1991 MATLOFF: STATISTICAL HYPOTHESIS TESTING 1249 assumed (some readers may wish to skip this paragraph and proceed directly to the example). Suppose we have statistics T"..., T w which have an approximately multivariate normal distribution, with the estimated covariance between T, and TJ being denoted by C.,. It is required that the matrix C be nonsingular. Let 'Y, be the mean of T,. The values 'Y. are the parameters of interest, such as population means or proportions, and the statistics T, are estimates of these parameters. We might be interested in various linear combinations of the form The derivation of Scheffe's method can be generalized (Rao 1973, Hochberg & Tamhane 1987). The result is that the CIs w ~ at T, ± VdX2w;a,-, hold simultaneously with approximate confidence level 1 - a, where X 2 w;a is the upper-a percentile of ax distribution having w degrees of freedom, and w w w d = ~ C u r2, + 2 ~ ~ a,ajcij' (3.3) '-1 (=1 }=1 (3.1) (3.2) Example 1. Consider the genetics problem mentioned above. Here we have the following correspondences: T, is PI' the estimate of p,; 'Y, is PI; w is 3 (not 4, because P. is redundant, because the p, sum to 1); C'J is -P,PJ/n for i "* j and P.(l - PI) for j = i. We can get simultaneous CIs for p" P2, and P3 using equation (3.2). For example, to get the interval for p" we use a] = 1 and a 2 = a 3 = O. These CIs could then be used to assess goodness of fit of the genetics model (equation 1.1). Again, it is crucial that we pay attention to both the E and A parts of the CIs. For example, if the E part shows that the model's value for Ph , is just slightly outside our CI for Ph then we may decide that the model is still scientifically useful (compared with the star system, which would summarily discard the model). On the other hand, if the A part shows that the CI contains but is very wide, then this is a signal that we do not have enough data to properly assess the validity of the model. Example 2. Consider a two-way analysis of variance setting, with r rows and c columns. Let X'Jk be the kth observation at level i of the first treatment and level j of the second treatment, i = I,..., r; j = 1,..., c; k = 1,..., n. To preserve the single-subscript notation above, instead of denoting the (i, j)th population cell mean as IL,!, we will call that mean 'Y(l-Ile+1' In other words, the means in the top row are denoted 'Y, through 'Ye, the means in the second row are called 'Ye+l through 'Y2o, and so on. Of course, the (i, j)th sample cell mean is accordingly T(I-IIc+1' The value of w here is rc. C qq is the sample variance of observations in the q-th cell, and C". is 0 if P "* q. Suppose we want to investigate the additivity (i.e., noninteraction), meaning that we want to investigate how closely the relationship /Lbd - /Led = /Lbi - /Lei a 3 = 1,a. = -1, as = -1, ag = 1. (3.4) holds for all b, d, e, f. We can do this by forming simultaneous CIs for the quantities /Lbd - /Led - (/Lbi - /Lei)' (3.5) For example, say T = 4 and c = 5. Then for b = 1, d = 3, e = 2, and f = 4, the nonzero values of a, would be Again, we should not discard the idea of additivity here if the E parts of the CIs show that 0 is slightly outside some of the CIs for (3.5); i.e., that there are slight discrepancies from equation 3.4. In such a case, the additive model might still be a very valuable description. On the other hand, if the A parts of the CIs show that the CIs contain 0 but are very wide, this is a signal that we do not have sufficient data to assess the additivity. This is very important, because it can be shown that hypothesis tests for interaction typically have very low power. The methodology here is based on large-sample approximations. The reader may wonder how large n must be to make the approximations work well. This is an important question, although it should be pointed out that even "exact" statistical methodology is actually approximate in practice; no population has an exact normal distribution, nor are variances exactly homogeneous, and independence assumptions are often violated to at least some degree. However, toward this end, a simulation was conducted to assess how well equation 3.2 works. The setting used was that of example 1 above. The simulation generated 10,000 samples of size 100. A count was kept of how many samples there were in which all three of the confidence intervals for Ph P2' and P3 contained their respective parameters. The value of a was 0.05, so that the intervals had nominal confidence level 95%. The result was that 94.2% of the samples were such that all three CIs contained their respective parameters. Summary It has been shown that the use of hypothesis tests, including informal P values, is generally non informative and can be misleading. One of the prob-

5 1250 ENVIRONMENTAL ENTOMOLOGY Vol. 20, no. 5 lems is that a test addresses the question of the exact validity of some model, yet this exactness is usually not of scientific interest. Another problem is that hypothesis tests are very sensitive to sample sizes, yet give the investigator no feedback as to the degree of the effect of sample size in his or her particular setting. These problems can be solved by using CI analysis. In contrast to analysis based on hypothesis tests, the E parts of the CIs address questions of direct scientific interest, and the A parts give explicit feedback as to whether the investigator has collected sufficient data for making reasonable conclusions. Basic texts in statistics usually do not pro vide much information about forming CIs in nonsimple settings. However, it is shown here that simultaneous CIs can be easily obtained for many such settings. References Cited Freedman, D., R. Pisani & R. Purves Statistics, 1st ed. Norton, New York. Graybill, F Theory and application of the linear model. Duxbury, Boston,Mass. Hochberg, Y. & A. Tamhane Multiple comparison procedures. Wiley, New York. Jones, D. & N. MatIotr Statistical hypothesis testing in biology: a contradiction in terms. J. Econ. Entomol. 79: Matlotr, N Probability modeling and computer simulation, with applications to engineering and computer science. Prindle, Weber & Schmidt-Kent, Boston, Mass. Moore, D. & G. McCabe Introduction to the practice of statistics, Freeman, San Francisco. Rao, C Linear statistical inference and its applications, 2nd ed. Wiley, New York. Snedecor, G. & W. Cochran Statistical methods, 6th ed. Iowa State University Press, Ames. Received for publication 18 September 1990; accepted 23 April 1991.

Testing Research and Statistical Hypotheses

Testing Research and Statistical Hypotheses Testing Research and Statistical Hypotheses Introduction In the last lab we analyzed metric artifact attributes such as thickness or width/thickness ratio. Those were continuous variables, which as you

More information

Sleep data, two drugs Ch13.xls

Sleep data, two drugs Ch13.xls Model Based Statistics in Biology. Part IV. The General Linear Mixed Model.. Chapter 13.3 Fixed*Random Effects (Paired t-test) ReCap. Part I (Chapters 1,2,3,4), Part II (Ch 5, 6, 7) ReCap Part III (Ch

More information

Chapter 7 Class Notes Comparison of Two Independent Samples

Chapter 7 Class Notes Comparison of Two Independent Samples Chapter 7 Class Notes Comparison of Two Independent Samples In this chapter, we ll compare means from two independently sampled groups using HTs (hypothesis tests). As noted in Chapter 6, there are two

More information

Chapter Seven: Multi-Sample Methods 1/52

Chapter Seven: Multi-Sample Methods 1/52 Chapter Seven: Multi-Sample Methods 1/52 7.1 Introduction 2/52 Introduction The independent samples t test and the independent samples Z test for a difference between proportions are designed to analyze

More information

Multiple Comparison Methods for Means

Multiple Comparison Methods for Means SIAM REVIEW Vol. 44, No. 2, pp. 259 278 c 2002 Society for Industrial and Applied Mathematics Multiple Comparison Methods for Means John A. Rafter Martha L. Abell James P. Braselton Abstract. Multiple

More information

Hypothesis testing I. - In particular, we are talking about statistical hypotheses. [get everyone s finger length!] n =

Hypothesis testing I. - In particular, we are talking about statistical hypotheses. [get everyone s finger length!] n = Hypothesis testing I I. What is hypothesis testing? [Note we re temporarily bouncing around in the book a lot! Things will settle down again in a week or so] - Exactly what it says. We develop a hypothesis,

More information

Tutorial 3: Power and Sample Size for the Two-sample t-test with Equal Variances. Acknowledgements:

Tutorial 3: Power and Sample Size for the Two-sample t-test with Equal Variances. Acknowledgements: Tutorial 3: Power and Sample Size for the Two-sample t-test with Equal Variances Anna E. Barón, Keith E. Muller, Sarah M. Kreidler, and Deborah H. Glueck Acknowledgements: The project was supported in

More information

CH.9 Tests of Hypotheses for a Single Sample

CH.9 Tests of Hypotheses for a Single Sample CH.9 Tests of Hypotheses for a Single Sample Hypotheses testing Tests on the mean of a normal distributionvariance known Tests on the mean of a normal distributionvariance unknown Tests on the variance

More information

Solution: chapter 2, problem 5, part a:

Solution: chapter 2, problem 5, part a: Learning Chap. 4/8/ 5:38 page Solution: chapter, problem 5, part a: Let y be the observed value of a sampling from a normal distribution with mean µ and standard deviation. We ll reserve µ for the estimator

More information

Answer keys for Assignment 10: Measurement of study variables (The correct answer is underlined in bold text)

Answer keys for Assignment 10: Measurement of study variables (The correct answer is underlined in bold text) Answer keys for Assignment 10: Measurement of study variables (The correct answer is underlined in bold text) 1. A quick and easy indicator of dispersion is a. Arithmetic mean b. Variance c. Standard deviation

More information

Journeys of an Accidental Statistician

Journeys of an Accidental Statistician Journeys of an Accidental Statistician A partially anecdotal account of A Unified Approach to the Classical Statistical Analysis of Small Signals, GJF and Robert D. Cousins, Phys. Rev. D 57, 3873 (1998)

More information

Tutorial 2: Power and Sample Size for the Paired Sample t-test

Tutorial 2: Power and Sample Size for the Paired Sample t-test Tutorial 2: Power and Sample Size for the Paired Sample t-test Preface Power is the probability that a study will reject the null hypothesis. The estimated probability is a function of sample size, variability,

More information

Tutorial 5: Power and Sample Size for One-way Analysis of Variance (ANOVA) with Equal Variances Across Groups. Acknowledgements:

Tutorial 5: Power and Sample Size for One-way Analysis of Variance (ANOVA) with Equal Variances Across Groups. Acknowledgements: Tutorial 5: Power and Sample Size for One-way Analysis of Variance (ANOVA) with Equal Variances Across Groups Anna E. Barón, Keith E. Muller, Sarah M. Kreidler, and Deborah H. Glueck Acknowledgements:

More information

Design of Engineering Experiments Part 2 Basic Statistical Concepts Simple comparative experiments

Design of Engineering Experiments Part 2 Basic Statistical Concepts Simple comparative experiments Design of Engineering Experiments Part 2 Basic Statistical Concepts Simple comparative experiments The hypothesis testing framework The two-sample t-test Checking assumptions, validity Comparing more that

More information

401 Review. 6. Power analysis for one/two-sample hypothesis tests and for correlation analysis.

401 Review. 6. Power analysis for one/two-sample hypothesis tests and for correlation analysis. 401 Review Major topics of the course 1. Univariate analysis 2. Bivariate analysis 3. Simple linear regression 4. Linear algebra 5. Multiple regression analysis Major analysis methods 1. Graphical analysis

More information

Introductory Econometrics. Review of statistics (Part II: Inference)

Introductory Econometrics. Review of statistics (Part II: Inference) Introductory Econometrics Review of statistics (Part II: Inference) Jun Ma School of Economics Renmin University of China October 1, 2018 1/16 Null and alternative hypotheses Usually, we have two competing

More information

AP Statistics Cumulative AP Exam Study Guide

AP Statistics Cumulative AP Exam Study Guide AP Statistics Cumulative AP Eam Study Guide Chapters & 3 - Graphs Statistics the science of collecting, analyzing, and drawing conclusions from data. Descriptive methods of organizing and summarizing statistics

More information

T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS

T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS In our work on hypothesis testing, we used the value of a sample statistic to challenge an accepted value of a population parameter. We focused only

More information

Inferences About the Difference Between Two Means

Inferences About the Difference Between Two Means 7 Inferences About the Difference Between Two Means Chapter Outline 7.1 New Concepts 7.1.1 Independent Versus Dependent Samples 7.1. Hypotheses 7. Inferences About Two Independent Means 7..1 Independent

More information

Liquid-in-glass thermometer

Liquid-in-glass thermometer Liquid-in-glass thermometer Objectives The objective of this experiment is to introduce some basic concepts in measurement, and to develop good measurement habits. In the first section, we will develop

More information

Quantitative Methods for Economics, Finance and Management (A86050 F86050)

Quantitative Methods for Economics, Finance and Management (A86050 F86050) Quantitative Methods for Economics, Finance and Management (A86050 F86050) Matteo Manera matteo.manera@unimib.it Marzio Galeotti marzio.galeotti@unimi.it 1 This material is taken and adapted from Guy Judge

More information

TABLES OF P-VALUES FOR t- AND CHI-SQUARE REFERENCE DISTRIBUTIONS. W. W. Piegorsch

TABLES OF P-VALUES FOR t- AND CHI-SQUARE REFERENCE DISTRIBUTIONS. W. W. Piegorsch TABLES OF P-VALUES FOR t- AND CHI-SQUARE REFERENCE DISTRIBUTIONS by W. W. Piegorsch University of South Carolina Statistics Technical Report No. 194 62Q05-3 August 2002 Department of Statistics University

More information

Tutorial 1: Power and Sample Size for the One-sample t-test. Acknowledgements:

Tutorial 1: Power and Sample Size for the One-sample t-test. Acknowledgements: Tutorial 1: Power and Sample Size for the One-sample t-test Anna E. Barón, Keith E. Muller, Sarah M. Kreidler, and Deborah H. Glueck Acknowledgements: The project was supported in large part by the National

More information

Do not copy, post, or distribute

Do not copy, post, or distribute 14 CORRELATION ANALYSIS AND LINEAR REGRESSION Assessing the Covariability of Two Quantitative Properties 14.0 LEARNING OBJECTIVES In this chapter, we discuss two related techniques for assessing a possible

More information

Applied Multivariate and Longitudinal Data Analysis

Applied Multivariate and Longitudinal Data Analysis Applied Multivariate and Longitudinal Data Analysis Chapter 2: Inference about the mean vector(s) Ana-Maria Staicu SAS Hall 5220; 919-515-0644; astaicu@ncsu.edu 1 In this chapter we will discuss inference

More information

6 Pattern Mixture Models

6 Pattern Mixture Models 6 Pattern Mixture Models A common theme underlying the methods we have discussed so far is that interest focuses on making inference on parameters in a parametric or semiparametric model for the full data

More information

Agonistic Display in Betta splendens: Data Analysis I. Betta splendens Research: Parametric or Non-parametric Data?

Agonistic Display in Betta splendens: Data Analysis I. Betta splendens Research: Parametric or Non-parametric Data? Agonistic Display in Betta splendens: Data Analysis By Joanna Weremjiwicz, Simeon Yurek, and Dana Krempels Once you have collected data with your ethogram, you are ready to analyze that data to see whether

More information

Chapter 13. Multiple Regression and Model Building

Chapter 13. Multiple Regression and Model Building Chapter 13 Multiple Regression and Model Building Multiple Regression Models The General Multiple Regression Model y x x x 0 1 1 2 2... k k y is the dependent variable x, x,..., x 1 2 k the model are the

More information

CHAPTER 3. THE IMPERFECT CUMULATIVE SCALE

CHAPTER 3. THE IMPERFECT CUMULATIVE SCALE CHAPTER 3. THE IMPERFECT CUMULATIVE SCALE 3.1 Model Violations If a set of items does not form a perfect Guttman scale but contains a few wrong responses, we do not necessarily need to discard it. A wrong

More information

A Better Way to Do R&R Studies

A Better Way to Do R&R Studies The Evaluating the Measurement Process Approach Last month s column looked at how to fix some of the Problems with Gauge R&R Studies. This month I will show you how to learn more from your gauge R&R data

More information

Algorithmisches Lernen/Machine Learning

Algorithmisches Lernen/Machine Learning Algorithmisches Lernen/Machine Learning Part 1: Stefan Wermter Introduction Connectionist Learning (e.g. Neural Networks) Decision-Trees, Genetic Algorithms Part 2: Norman Hendrich Support-Vector Machines

More information

In most cases, a plot of d (j) = {d (j) 2 } against {χ p 2 (1-q j )} is preferable since there is less piling up

In most cases, a plot of d (j) = {d (j) 2 } against {χ p 2 (1-q j )} is preferable since there is less piling up THE UNIVERSITY OF MINNESOTA Statistics 5401 September 17, 2005 Chi-Squared Q-Q plots to Assess Multivariate Normality Suppose x 1, x 2,..., x n is a random sample from a p-dimensional multivariate distribution

More information

Business Statistics. Lecture 10: Course Review

Business Statistics. Lecture 10: Course Review Business Statistics Lecture 10: Course Review 1 Descriptive Statistics for Continuous Data Numerical Summaries Location: mean, median Spread or variability: variance, standard deviation, range, percentiles,

More information

A Brief Introduction to Proofs

A Brief Introduction to Proofs A Brief Introduction to Proofs William J. Turner October, 010 1 Introduction Proofs are perhaps the very heart of mathematics. Unlike the other sciences, mathematics adds a final step to the familiar scientific

More information

Questions 3.83, 6.11, 6.12, 6.17, 6.25, 6.29, 6.33, 6.35, 6.50, 6.51, 6.53, 6.55, 6.59, 6.60, 6.65, 6.69, 6.70, 6.77, 6.79, 6.89, 6.

Questions 3.83, 6.11, 6.12, 6.17, 6.25, 6.29, 6.33, 6.35, 6.50, 6.51, 6.53, 6.55, 6.59, 6.60, 6.65, 6.69, 6.70, 6.77, 6.79, 6.89, 6. Chapter 7 Reading 7.1, 7.2 Questions 3.83, 6.11, 6.12, 6.17, 6.25, 6.29, 6.33, 6.35, 6.50, 6.51, 6.53, 6.55, 6.59, 6.60, 6.65, 6.69, 6.70, 6.77, 6.79, 6.89, 6.112 Introduction In Chapter 5 and 6, we emphasized

More information

A Monte-Carlo study of asymptotically robust tests for correlation coefficients

A Monte-Carlo study of asymptotically robust tests for correlation coefficients Biometrika (1973), 6, 3, p. 661 551 Printed in Great Britain A Monte-Carlo study of asymptotically robust tests for correlation coefficients BY G. T. DUNCAN AND M. W. J. LAYAKD University of California,

More information

STAT 525 Fall Final exam. Tuesday December 14, 2010

STAT 525 Fall Final exam. Tuesday December 14, 2010 STAT 525 Fall 2010 Final exam Tuesday December 14, 2010 Time: 2 hours Name (please print): Show all your work and calculations. Partial credit will be given for work that is partially correct. Points will

More information

Permutation Tests. Noa Haas Statistics M.Sc. Seminar, Spring 2017 Bootstrap and Resampling Methods

Permutation Tests. Noa Haas Statistics M.Sc. Seminar, Spring 2017 Bootstrap and Resampling Methods Permutation Tests Noa Haas Statistics M.Sc. Seminar, Spring 2017 Bootstrap and Resampling Methods The Two-Sample Problem We observe two independent random samples: F z = z 1, z 2,, z n independently of

More information

Hypothesis Testing. ) the hypothesis that suggests no change from previous experience

Hypothesis Testing. ) the hypothesis that suggests no change from previous experience Hypothesis Testing Definitions Hypothesis a claim about something Null hypothesis ( H 0 ) the hypothesis that suggests no change from previous experience Alternative hypothesis ( H 1 ) the hypothesis that

More information

Introduction to Metalogic

Introduction to Metalogic Philosophy 135 Spring 2008 Tony Martin Introduction to Metalogic 1 The semantics of sentential logic. The language L of sentential logic. Symbols of L: Remarks: (i) sentence letters p 0, p 1, p 2,... (ii)

More information

The entire data set consists of n = 32 widgets, 8 of which were made from each of q = 4 different materials.

The entire data set consists of n = 32 widgets, 8 of which were made from each of q = 4 different materials. One-Way ANOVA Summary The One-Way ANOVA procedure is designed to construct a statistical model describing the impact of a single categorical factor X on a dependent variable Y. Tests are run to determine

More information

What is A + B? What is A B? What is AB? What is BA? What is A 2? and B = QUESTION 2. What is the reduced row echelon matrix of A =

What is A + B? What is A B? What is AB? What is BA? What is A 2? and B = QUESTION 2. What is the reduced row echelon matrix of A = STUDENT S COMPANIONS IN BASIC MATH: THE ELEVENTH Matrix Reloaded by Block Buster Presumably you know the first part of matrix story, including its basic operations (addition and multiplication) and row

More information

Sampling Distributions: Central Limit Theorem

Sampling Distributions: Central Limit Theorem Review for Exam 2 Sampling Distributions: Central Limit Theorem Conceptually, we can break up the theorem into three parts: 1. The mean (µ M ) of a population of sample means (M) is equal to the mean (µ)

More information

ST505/S697R: Fall Homework 2 Solution.

ST505/S697R: Fall Homework 2 Solution. ST505/S69R: Fall 2012. Homework 2 Solution. 1. 1a; problem 1.22 Below is the summary information (edited) from the regression (using R output); code at end of solution as is code and output for SAS. a)

More information

Unit 27 One-Way Analysis of Variance

Unit 27 One-Way Analysis of Variance Unit 27 One-Way Analysis of Variance Objectives: To perform the hypothesis test in a one-way analysis of variance for comparing more than two population means Recall that a two sample t test is applied

More information

Chapter 1 Statistical Inference

Chapter 1 Statistical Inference Chapter 1 Statistical Inference causal inference To infer causality, you need a randomized experiment (or a huge observational study and lots of outside information). inference to populations Generalizations

More information

Adaptive Designs: Why, How and When?

Adaptive Designs: Why, How and When? Adaptive Designs: Why, How and When? Christopher Jennison Department of Mathematical Sciences, University of Bath, UK http://people.bath.ac.uk/mascj ISBS Conference Shanghai, July 2008 1 Adaptive designs:

More information

Tutorial 4: Power and Sample Size for the Two-sample t-test with Unequal Variances

Tutorial 4: Power and Sample Size for the Two-sample t-test with Unequal Variances Tutorial 4: Power and Sample Size for the Two-sample t-test with Unequal Variances Preface Power is the probability that a study will reject the null hypothesis. The estimated probability is a function

More information

IENG581 Design and Analysis of Experiments INTRODUCTION

IENG581 Design and Analysis of Experiments INTRODUCTION Experimental Design IENG581 Design and Analysis of Experiments INTRODUCTION Experiments are performed by investigators in virtually all fields of inquiry, usually to discover something about a particular

More information

A TIME SERIES PARADOX: UNIT ROOT TESTS PERFORM POORLY WHEN DATA ARE COINTEGRATED

A TIME SERIES PARADOX: UNIT ROOT TESTS PERFORM POORLY WHEN DATA ARE COINTEGRATED A TIME SERIES PARADOX: UNIT ROOT TESTS PERFORM POORLY WHEN DATA ARE COINTEGRATED by W. Robert Reed Department of Economics and Finance University of Canterbury, New Zealand Email: bob.reed@canterbury.ac.nz

More information

Multiple Regression Analysis: Inference ECONOMETRICS (ECON 360) BEN VAN KAMMEN, PHD

Multiple Regression Analysis: Inference ECONOMETRICS (ECON 360) BEN VAN KAMMEN, PHD Multiple Regression Analysis: Inference ECONOMETRICS (ECON 360) BEN VAN KAMMEN, PHD Introduction When you perform statistical inference, you are primarily doing one of two things: Estimating the boundaries

More information

Probability Methods in Civil Engineering Prof. Dr. Rajib Maity Department of Civil Engineering Indian Institution of Technology, Kharagpur

Probability Methods in Civil Engineering Prof. Dr. Rajib Maity Department of Civil Engineering Indian Institution of Technology, Kharagpur Probability Methods in Civil Engineering Prof. Dr. Rajib Maity Department of Civil Engineering Indian Institution of Technology, Kharagpur Lecture No. # 36 Sampling Distribution and Parameter Estimation

More information

Lecture 10: F -Tests, ANOVA and R 2

Lecture 10: F -Tests, ANOVA and R 2 Lecture 10: F -Tests, ANOVA and R 2 1 ANOVA We saw that we could test the null hypothesis that β 1 0 using the statistic ( β 1 0)/ŝe. (Although I also mentioned that confidence intervals are generally

More information

Inferential statistics

Inferential statistics Inferential statistics Inference involves making a Generalization about a larger group of individuals on the basis of a subset or sample. Ahmed-Refat-ZU Null and alternative hypotheses In hypotheses testing,

More information

THE POWER OF THE MATRIX. Shayle R. Searle

THE POWER OF THE MATRIX. Shayle R. Searle THE POWER OF THE MATRIX BU-1563-M March 2001 Shayle R. Searle Keywords: matrix proofs, linear models. Abstract: The first linear models book by Graybill (1961) was one of the early major works to address

More information

Slope Fields: Graphing Solutions Without the Solutions

Slope Fields: Graphing Solutions Without the Solutions 8 Slope Fields: Graphing Solutions Without the Solutions Up to now, our efforts have been directed mainly towards finding formulas or equations describing solutions to given differential equations. Then,

More information

1. Least squares with more than one predictor

1. Least squares with more than one predictor Statistics 1 Lecture ( November ) c David Pollard Page 1 Read M&M Chapter (skip part on logistic regression, pages 730 731). Read M&M pages 1, for ANOVA tables. Multiple regression. 1. Least squares with

More information

Unified approach to the classical statistical analysis of small signals

Unified approach to the classical statistical analysis of small signals PHYSICAL REVIEW D VOLUME 57, NUMBER 7 1 APRIL 1998 Unified approach to the classical statistical analysis of small signals Gary J. Feldman * Department of Physics, Harvard University, Cambridge, Massachusetts

More information

appstats27.notebook April 06, 2017

appstats27.notebook April 06, 2017 Chapter 27 Objective Students will conduct inference on regression and analyze data to write a conclusion. Inferences for Regression An Example: Body Fat and Waist Size pg 634 Our chapter example revolves

More information

Chapter 9. Correlation and Regression

Chapter 9. Correlation and Regression Chapter 9 Correlation and Regression Lesson 9-1/9-2, Part 1 Correlation Registered Florida Pleasure Crafts and Watercraft Related Manatee Deaths 100 80 60 40 20 0 1991 1993 1995 1997 1999 Year Boats in

More information

Inferences about a Mean Vector

Inferences about a Mean Vector Inferences about a Mean Vector Edps/Soc 584, Psych 594 Carolyn J. Anderson Department of Educational Psychology I L L I N O I S university of illinois at urbana-champaign c Board of Trustees, University

More information

INTRODUCTION TO ANALYSIS OF VARIANCE

INTRODUCTION TO ANALYSIS OF VARIANCE CHAPTER 22 INTRODUCTION TO ANALYSIS OF VARIANCE Chapter 18 on inferences about population means illustrated two hypothesis testing situations: for one population mean and for the difference between two

More information

HST.582J / 6.555J / J Biomedical Signal and Image Processing Spring 2007

HST.582J / 6.555J / J Biomedical Signal and Image Processing Spring 2007 MIT OpenCourseWare http://ocw.mit.edu HST.582J / 6.555J / 16.456J Biomedical Signal and Image Processing Spring 2007 For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms.

More information

Significance Testing with Incompletely Randomised Cases Cannot Possibly Work

Significance Testing with Incompletely Randomised Cases Cannot Possibly Work Human Journals Short Communication December 2018 Vol.:11, Issue:2 All rights are reserved by Stephen Gorard FRSA FAcSS Significance Testing with Incompletely Randomised Cases Cannot Possibly Work Keywords:

More information

SRMR in Mplus. Tihomir Asparouhov and Bengt Muthén. May 2, 2018

SRMR in Mplus. Tihomir Asparouhov and Bengt Muthén. May 2, 2018 SRMR in Mplus Tihomir Asparouhov and Bengt Muthén May 2, 2018 1 Introduction In this note we describe the Mplus implementation of the SRMR standardized root mean squared residual) fit index for the models

More information

280 CHAPTER 9 TESTS OF HYPOTHESES FOR A SINGLE SAMPLE Tests of Statistical Hypotheses

280 CHAPTER 9 TESTS OF HYPOTHESES FOR A SINGLE SAMPLE Tests of Statistical Hypotheses 280 CHAPTER 9 TESTS OF HYPOTHESES FOR A SINGLE SAMPLE 9-1.2 Tests of Statistical Hypotheses To illustrate the general concepts, consider the propellant burning rate problem introduced earlier. The null

More information

STAT 536: Genetic Statistics

STAT 536: Genetic Statistics STAT 536: Genetic Statistics Tests for Hardy Weinberg Equilibrium Karin S. Dorman Department of Statistics Iowa State University September 7, 2006 Statistical Hypothesis Testing Identify a hypothesis,

More information

Black White Total Observed Expected χ 2 = (f observed f expected ) 2 f expected (83 126) 2 ( )2 126

Black White Total Observed Expected χ 2 = (f observed f expected ) 2 f expected (83 126) 2 ( )2 126 Psychology 60 Fall 2013 Practice Final Actual Exam: This Wednesday. Good luck! Name: To view the solutions, check the link at the end of the document. This practice final should supplement your studying;

More information

Practical Statistics for the Analytical Scientist Table of Contents

Practical Statistics for the Analytical Scientist Table of Contents Practical Statistics for the Analytical Scientist Table of Contents Chapter 1 Introduction - Choosing the Correct Statistics 1.1 Introduction 1.2 Choosing the Right Statistical Procedures 1.2.1 Planning

More information

OPTIMIZATION OF FIRST ORDER MODELS

OPTIMIZATION OF FIRST ORDER MODELS Chapter 2 OPTIMIZATION OF FIRST ORDER MODELS One should not multiply explanations and causes unless it is strictly necessary William of Bakersville in Umberto Eco s In the Name of the Rose 1 In Response

More information

BIO 682 Nonparametric Statistics Spring 2010

BIO 682 Nonparametric Statistics Spring 2010 BIO 682 Nonparametric Statistics Spring 2010 Steve Shuster http://www4.nau.edu/shustercourses/bio682/index.htm Lecture 5 Williams Correction a. Divide G value by q (see S&R p. 699) q = 1 + (a 2-1)/6nv

More information

Summary: the confidence interval for the mean (σ 2 known) with gaussian assumption

Summary: the confidence interval for the mean (σ 2 known) with gaussian assumption Summary: the confidence interval for the mean (σ known) with gaussian assumption on X Let X be a Gaussian r.v. with mean µ and variance σ. If X 1, X,..., X n is a random sample drawn from X then the confidence

More information

A-LEVEL STATISTICS. SS04 Report on the Examination June Version: 1.0

A-LEVEL STATISTICS. SS04 Report on the Examination June Version: 1.0 A-LEVEL STATISTICS SS04 Report on the Examination 6380 June 2016 Version: 1.0 Further copies of this Report are available from aqa.org.uk Copyright 2016 AQA and its licensors. All rights reserved. AQA

More information

Notes: Pythagorean Triples

Notes: Pythagorean Triples Math 5330 Spring 2018 Notes: Pythagorean Triples Many people know that 3 2 + 4 2 = 5 2. Less commonly known are 5 2 + 12 2 = 13 2 and 7 2 + 24 2 = 25 2. Such a set of integers is called a Pythagorean Triple.

More information

STAT 285 Fall Assignment 1 Solutions

STAT 285 Fall Assignment 1 Solutions STAT 285 Fall 2014 Assignment 1 Solutions 1. An environmental agency sets a standard of 200 ppb for the concentration of cadmium in a lake. The concentration of cadmium in one lake is measured 17 times.

More information

Statistical Inference: Estimation and Confidence Intervals Hypothesis Testing

Statistical Inference: Estimation and Confidence Intervals Hypothesis Testing Statistical Inference: Estimation and Confidence Intervals Hypothesis Testing 1 In most statistics problems, we assume that the data have been generated from some unknown probability distribution. We desire

More information

Gov 2002: 3. Randomization Inference

Gov 2002: 3. Randomization Inference Gov 2002: 3. Randomization Inference Matthew Blackwell September 10, 2015 Where are we? Where are we going? Last week: This week: What can we identify using randomization? Estimators were justified via

More information

ESTIMATION OF CONSERVATISM OF CHARACTERS BY CONSTANCY WITHIN BIOLOGICAL POPULATIONS

ESTIMATION OF CONSERVATISM OF CHARACTERS BY CONSTANCY WITHIN BIOLOGICAL POPULATIONS ESTIMATION OF CONSERVATISM OF CHARACTERS BY CONSTANCY WITHIN BIOLOGICAL POPULATIONS JAMES S. FARRIS Museum of Zoology, The University of Michigan, Ann Arbor Accepted March 30, 1966 The concept of conservatism

More information

Rejection regions for the bivariate case

Rejection regions for the bivariate case Rejection regions for the bivariate case The rejection region for the T 2 test (and similarly for Z 2 when Σ is known) is the region outside of an ellipse, for which there is a (1-α)% chance that the test

More information

ECS 120 Lesson 18 Decidable Problems, the Halting Problem

ECS 120 Lesson 18 Decidable Problems, the Halting Problem ECS 120 Lesson 18 Decidable Problems, the Halting Problem Oliver Kreylos Friday, May 11th, 2001 In the last lecture, we had a look at a problem that we claimed was not solvable by an algorithm the problem

More information

Post-Selection Inference

Post-Selection Inference Classical Inference start end start Post-Selection Inference selected end model data inference data selection model data inference Post-Selection Inference Todd Kuffner Washington University in St. Louis

More information

Uncorrected author proof---not final version!

Uncorrected author proof---not final version! The Localization Hypothesis and Machines Abstract In a recent article in Artificial Life, Chu and Ho suggested that Rosen s central result about the simulability of living systems might be flawed. This

More information

Mathematical Statistics

Mathematical Statistics Mathematical Statistics MAS 713 Chapter 8 Previous lecture: 1 Bayesian Inference 2 Decision theory 3 Bayesian Vs. Frequentist 4 Loss functions 5 Conjugate priors Any questions? Mathematical Statistics

More information

Lectures 5 & 6: Hypothesis Testing

Lectures 5 & 6: Hypothesis Testing Lectures 5 & 6: Hypothesis Testing in which you learn to apply the concept of statistical significance to OLS estimates, learn the concept of t values, how to use them in regression work and come across

More information

Chapter 7 Comparison of two independent samples

Chapter 7 Comparison of two independent samples Chapter 7 Comparison of two independent samples 7.1 Introduction Population 1 µ σ 1 1 N 1 Sample 1 y s 1 1 n 1 Population µ σ N Sample y s n 1, : population means 1, : population standard deviations N

More information

Chapter 6. Logistic Regression. 6.1 A linear model for the log odds

Chapter 6. Logistic Regression. 6.1 A linear model for the log odds Chapter 6 Logistic Regression In logistic regression, there is a categorical response variables, often coded 1=Yes and 0=No. Many important phenomena fit this framework. The patient survives the operation,

More information

Soc 3811 Basic Social Statistics Second Midterm Exam Spring Your Name [50 points]: ID #: ANSWERS

Soc 3811 Basic Social Statistics Second Midterm Exam Spring Your Name [50 points]: ID #: ANSWERS Soc 3811 Basic Social Statistics Second idterm Exam Spring 010 our Name [50 points]: ID #: INSTRUCTIONS: ANSERS (A) rite your name on the line at top front of every sheet. (B) If you use a page of notes

More information

Lecture Testing Hypotheses: The Neyman-Pearson Paradigm

Lecture Testing Hypotheses: The Neyman-Pearson Paradigm Math 408 - Mathematical Statistics Lecture 29-30. Testing Hypotheses: The Neyman-Pearson Paradigm April 12-15, 2013 Konstantin Zuev (USC) Math 408, Lecture 29-30 April 12-15, 2013 1 / 12 Agenda Example:

More information

Final Exam - Solutions

Final Exam - Solutions Ecn 102 - Analysis of Economic Data University of California - Davis March 17, 2010 Instructor: John Parman Final Exam - Solutions You have until 12:30pm to complete this exam. Please remember to put your

More information

Sample Size Determination

Sample Size Determination Sample Size Determination 018 The number of subjects in a clinical study should always be large enough to provide a reliable answer to the question(s addressed. The sample size is usually determined by

More information

ME3620. Theory of Engineering Experimentation. Spring Chapter IV. Decision Making for a Single Sample. Chapter IV

ME3620. Theory of Engineering Experimentation. Spring Chapter IV. Decision Making for a Single Sample. Chapter IV Theory of Engineering Experimentation Chapter IV. Decision Making for a Single Sample Chapter IV 1 4 1 Statistical Inference The field of statistical inference consists of those methods used to make decisions

More information

20.0 Experimental Design

20.0 Experimental Design 20.0 Experimental Design Answer Questions 1 Philosophy One-Way ANOVA Egg Sample Multiple Comparisons 20.1 Philosophy Experiments are often expensive and/or dangerous. One wants to use good techniques that

More information

LECTURE 4 PRINCIPAL COMPONENTS ANALYSIS / EXPLORATORY FACTOR ANALYSIS

LECTURE 4 PRINCIPAL COMPONENTS ANALYSIS / EXPLORATORY FACTOR ANALYSIS LECTURE 4 PRINCIPAL COMPONENTS ANALYSIS / EXPLORATORY FACTOR ANALYSIS NOTES FROM PRE- LECTURE RECORDING ON PCA PCA and EFA have similar goals. They are substantially different in important ways. The goal

More information

Mixtures of multiple testing procedures for gatekeeping applications in clinical trials

Mixtures of multiple testing procedures for gatekeeping applications in clinical trials Research Article Received 29 January 2010, Accepted 26 May 2010 Published online 18 April 2011 in Wiley Online Library (wileyonlinelibrary.com) DOI: 10.1002/sim.4008 Mixtures of multiple testing procedures

More information

Samples and Populations Confidence Intervals Hypotheses One-sided vs. two-sided Statistical Significance Error Types. Statistiek I.

Samples and Populations Confidence Intervals Hypotheses One-sided vs. two-sided Statistical Significance Error Types. Statistiek I. Statistiek I Sampling John Nerbonne CLCG, Rijksuniversiteit Groningen http://www.let.rug.nl/nerbonne/teach/statistiek-i/ John Nerbonne 1/41 Overview 1 Samples and Populations 2 Confidence Intervals 3 Hypotheses

More information

Chapter Three. Hypothesis Testing

Chapter Three. Hypothesis Testing 3.1 Introduction The final phase of analyzing data is to make a decision concerning a set of choices or options. Should I invest in stocks or bonds? Should a new product be marketed? Are my products being

More information

Problem Set 2. MAS 622J/1.126J: Pattern Recognition and Analysis. Due: 5:00 p.m. on September 30

Problem Set 2. MAS 622J/1.126J: Pattern Recognition and Analysis. Due: 5:00 p.m. on September 30 Problem Set 2 MAS 622J/1.126J: Pattern Recognition and Analysis Due: 5:00 p.m. on September 30 [Note: All instructions to plot data or write a program should be carried out using Matlab. In order to maintain

More information

Political Science 236 Hypothesis Testing: Review and Bootstrapping

Political Science 236 Hypothesis Testing: Review and Bootstrapping Political Science 236 Hypothesis Testing: Review and Bootstrapping Rocío Titiunik Fall 2007 1 Hypothesis Testing Definition 1.1 Hypothesis. A hypothesis is a statement about a population parameter The

More information

Chapter 2 Sampling for Biostatistics

Chapter 2 Sampling for Biostatistics Chapter 2 Sampling for Biostatistics Angela Conley and Jason Pfefferkorn Abstract Define the terms sample, population, and statistic. Introduce the concept of bias in sampling methods. Demonstrate how

More information

Data Analysis: Agonistic Display in Betta splendens I. Betta splendens Research: Parametric or Non-parametric Data?

Data Analysis: Agonistic Display in Betta splendens I. Betta splendens Research: Parametric or Non-parametric Data? Data Analysis: Agonistic Display in Betta splendens By Joanna Weremjiwicz, Simeon Yurek, and Dana Krempels Once you have collected data with your ethogram, you are ready to analyze that data to see whether

More information