Statistical Inference: Uses, Abuses, and Misconceptions

Size: px
Start display at page:

Download "Statistical Inference: Uses, Abuses, and Misconceptions"

Transcription

1 Statistical Inference: Uses, Abuses, and Misconceptions Michael W. Trosset Indiana Statistical Consulting Center Department of Statistics ISCC is part of IU s Department of Statistics, chaired by Stanley Wasserman. WIM

2 Experiments are performed for the purpose of obtaining information about a population that is imperfectly understood. The population represents all of the experimental outcomes that might occur (the sample space), together with a probability distribution P that describes the behavior of the experiment. The observed outcomes are a random sample of the population. Notation: X 1,..., X n P, x = {x 1,..., x n } Statistical inference is concerned with how one draws conclusions about P from x. WIM

3 Example Consider an experiment to investigate the effect of a gasoline additive on miles per gallon (mpg). For car i = 1,..., n, let Car i improves iff X i > 0. B i = mpg before additive, A i = mpg after additive, X i = A i B i = improvement in mpg. Write the experiment as X 1,..., X n P and observe x = {x 1,..., x n }. What might we like to know about P? For example... What is P (X i > 0)? Is P (X i > 0) > 0.5? What is EX i? Is EX i > 0? WIM

4 Two very different concerns are often conflated: 1. What would we like to know about P? 2. How can we use x to draw inferences about the attributes of P that interest us? WIM

5 1. Describing a Population Suppose that P is known. Then inference is not necessary and we can focus on deciding what attributes of P are interesting and/or important. We might begin by displaying/examining/reporting the probability density function (pdf) of P, say f. For example... WIM

6 f(x) WIM x

7 Population Attributes To summarize a probability distribution, statisticians usually begin by computing a measure of centrality (e.g., median, mean) and a measure of dispersion (e.g., interquartile range, standard deviation). These quantities are often useful, but may not reveal important features of the population. In the preceding example, the quartiles are q 1. = 0.59, q2. = 0.14, and q3. = The median is q 2. = 0.14 and the interquartile range is q3 q 1. = = The mean is µ = 0.3, the variance is σ 2 = 1.714, and the standard deviation is σ. = If one only computes measures of centrality and dispersion, then one may overlook the shape of this unusual pdf. In fact, f is a bimodal mixture. Ninety percent of the population is Normal(0, 1), but ten percent of the population is Normal(3, 0.04). For the latter subpopulation, the additive provides a definite benefit. WIM

8 Effect Size It is often recommended that researchers examine effect size. If one construes this advice to mean that one should examine the magnitude of the effect, i.e., the location of P relative to some baseline, by quantifying the salient attributes of P, then this is exemplary advice. Unfortunately, Jacob Cohen s well-intentioned efforts to help researchers design experiments have led many researchers to obsess on specific measures, e.g., d = µ/σ. Cohen was not concerned with reporting salient features of P, but on performing specific calculations related to the choice of sample size. These calculations depend on specifying a value of d and Cohen offered rough guidelines for how to do so when µ and σ are not known. WIM

9 Cohen s rough guidelines for choosing a sample size when little is known about P have led many researchers to embrace uncritical statements like d = 0.5 is an effect of medium size. In fact, Cohen anticipated and warned against such abuse: The terms small, medium, and large are relative, not only to each other but to... the specific content and research method being employed in any given investigation. In the face of this relativity, there is a certain risk inherent in offering conventional operational definitions for these terms... J. Cohen, Statistical Power Analysis for the Behavioral Sciences, 1988, p. 25. Researchers should be interested in the size of an effect, but they should also think about how to quantify and interpret its magnitude in the context of their discipline. For example... Instead of reporting whether µ < 0 or µ > 0, report the value of µ and think critically about its material significance (importance). WIM

10 2. Inference In practice, P is unknown and we draw x to obtain information about P. We may construct... Graphical displays of x: stem-and-leaf plots, box plots Estimates of P : empirical distributions & cdfs, quantile-quantile plots Estimates of f: histograms, average shifted histograms, kernel density estimates To illustrate, consider the forearm lengths (in inches) of n = 140 adult males... WIM

11 WIM

12 Box Plot Normal Q Q Plot PDF Estimate Sample Quantiles Density Theoretical Quantiles N = 140 Bandwidth = WIM

13 In practice, most inference is parametric. For example, assume that P Normal(µ, σ 2 ) and draw inferences about µ. Or... Let n = 100 and assume that in which case X 1,..., X n Bernoulli(p), Y = n X i Binomial(n; p). i=1 We observe y = 32. What might we infer about p? Point estimation is concerned with guessing the true value of p. Hypothesis testing is concerned with making a binary decision about p, e.g., is p = 0.5 or is p 0.5? Set estimation is concerned with constructing a set of plausible values of p. WIM

14 Point Estimation Plug-In Approach. Estimate p = EX i, the population mean, with the sample mean, ˆp = x = y n = = Maximum Likelihood Approach. Estimate p with In this case, p = ˆp = p = argmax a P (Y = 32 p = a). WIM

15 Decision-Theoretic Approaches. Let δ denote an estimator and let L (δ(y), p) = [δ(y) p] 2 denote the squared error loss when p is true and y is observed. Let R(p; δ) = E p L (δ(y ), p) denote the risk (expected loss) of δ when p is true. For example, δ(y) = y/n has risk function R(p; δ) = p(1 p)/n. Ideally, we would like to use an estimator that simultaneously minimizes the risk for every possible value of p. Unfortunately, this is impossible. We can proceed in one of two ways, either by restricting the class of estimators that we are willing to use, or by minimizing some real-valued attribute of the risk function. The Bayes Principle tells us to choose δ to minimize a weighted average of the possible risks: 1 0 R(p; δ)w(p) dp, WIM

16 where w is a pdf that weights the p (0, 1). The optimal estimator, δ w, is called the Bayes estimator with respect to w. Example: If w is the pdf of Beta(a, b), then δ w (y) = δ a,b (y) = a + y a + b + n = ( ) a + b a a + b + n a + b + ( ) n y a + b + n n. For n = 100 and y = 32 we obtain... a a b a+b δ a,b (32) WIM

17 w(p) WIM p

18 Hypothesis Testing Hypothesis testing is concerned with binary decisions. Partition the parameter set, here (0, 1), into null and alternative hypotheses, e.g., H 0 : p = 0.5 vs H 1 : p 0.5 or H 0 : p 0.5 vs H 1 : p < 0.5. H 0 is privileged, i.e., we retain H 0 unless we find compelling evidence against it. A test statistic measures evidence against H 0. Under H 0 : p = 0.5, EY = np = 50; hence, Y 50 measures evidence against H 0 : p = 0.5. We observed y 50 = = 18. Under H 0 : p = 0.5, p = P H0 ( Y 50 18) = P H0 (Y 32) + P H0 (Y 68). = We reject H 0 iff p is sufficiently small, i.e., p α for some predetermined α. The computed quantity p is the significance probability; the fixed quantity α is the significance level. WIM

19 Statistical significance is not material significance! Example: Let X i Normal(µ, σ 2 ) denote car i s improvement in mpg. A government agency will sanction advertising the additive iff H 0 : µ 1 is rejected at significance level α = A large corporation manufactures an additive that increases mileage by an average of µ = 1.01 miles per gallon. The corporation funds a large study of n = 900 vehicles in which x = 1.01 and s = 0.1 are observed. This results in a test statistic of t = x µ 0 s/ = n 0.1/ 900 = 3 and a significance probability of p = P (T n 3). = < 0.05 = α, so H 0 is decisively rejected and advertising is authorized. WIM

20 2. An amateur automotive mechanic invents an additive that increases mileage by an average of µ = 1.21 miles per gallon. The mechanic funds a small study of n = 9 vehicles in which x = 1.21 and s = 0.4 are observed. This results in a test statistic of t = x µ 0 s/ n and a significance probability of = / 9 = p = P (T n 1.575). = > 0.05 = α, so H 0 is retained and advertising is not authorized. WIM

21 Significance levels are error rates! Suppose that H 0 is true. A test that rejects H 0 iff p α will falsely reject H 0 with probability α. For example, suppose that I give a fair penny to each of 600 students. Each student tosses his/her penny 100 times and tests H 0 : p = 0.5 at significance level α = Let X i = 1 if student i rejects H 0 and X i = 0 if student i retains H 0. Because H 0 is true and p = 0.5, X i Bernoulli(α = 0.05) and the total number of students who falsely reject H 0 is Y = 600 i=1 Binomial(600; 0.05). It follows that EY = 30, P (Y 20). = 0.98, etc. WIM

22 Set Estimation Imagine testing H 0 : p = p 0 at significance level α for every p 0 (0, 1). The p 0 that are retained constitute a set of plausible values of p. This is an example of a confidence set. For Y Binomial(100; p), y = 32, and α = 0.1, the confidence set is [0.245, 0.404]. Notice that this interval does not contain p 0 = 0.5, which we have already rejected as implausible. Let C( x) denote the set of plausible values. Because x is a random sample, C( x) may or may not cover p. The probability that the random set C( X) covers p is 1 α, the confidence coefficient. Confidence is not a guarantee! If 600 researchers construct confidence sets with confidence coefficient 0.95, then we expect that 30 confidence sets will fail to cover the parameter of interest. The probability that at least 20 confidence sets will fail to cover is approximately WIM

23 Bayesian Inference A confidence coefficient tells us the probability that C( X) will cover p. This fact provides information about how the procedure for constructing C( x) will behave when used repeatedly; it does not tell us whether or not the observed C( x) actually covers p. In contrast, Bayesian methods endeavor to quantify the evidence in x. Suppose that X 1,..., X n P θ, θ Ω. Let p( x θ) denote the joint pdf of x and let p(θ) denote a pdf on Ω. Then, with squared error loss, the Bayes estimate of θ turns out to be θ p( x θ) p(θ) dθ Ω δ p ( x) =. p( x θ) p(θ) dθ Formally, δ p ( x) is the mean of the conditional pdf Ω Ω p( x θ) p(θ) p( x, θ) = p( x θ) p(θ) dθ p( x) = p(θ x). WIM

24 Let us conceive of the experiment as occurring in two stages: 1. Nature draws θ from the marginal pdf p(θ). At this stage, absent a sample from p( x θ) and knowing only p(θ), we would estimate θ to be the mean of p(θ). 2. We draw x from the conditional pdf p( x θ). To incorporate x into our estimate of θ, we replace p(θ) with p(θ x). Thus, we modify our initial knowledge of θ in light of the information about θ provided by x. In this context, p(θ) is the prior distribution and p(θ x) is the posterior distribution. But what if θ is not random? What if θ is a physical constant, e.g., the speed of light? WIM

25 The Basic Tenet of Bayesianity asserts that statisticians should treat known quantities as fixed and unknown quantities probabilistically. The basic tenet implies the following: Inference should be conditional on x. Uncertainty about θ should be modeled by p(θ). (This requirement may necessitate a subjectivist interpretation of probability.) Bayesians insist that inference should focus on the posterior distribution, which reveals how the prior distribution is modified by the data. One may display/report the entire posterior or certain of its attributes. Instead of a confidence set, one may compute a region of highest posterior density (HPD), i.e., the smallest subset of Ω for which the posterior probability is 1 α. L. J. Savage famously quipped that The only use I know for a confidence interval is to have confidence in it. In contrast, Bayesians can legitimately claim that the (possibly subjective) probability that θ lies in a region of HPD is 1 α. WIM

26 3. More About Hypothesis Testing True State of Nature H 0 H 1 Statistical retain H 0 Type II error Test reject H 0 Type I error Hypothesis testing is not intended for the purpose of choosing the more plausible hypothesis. It is only intended to determine if there is sufficient evidence to reject the null hypothesis. Legal Analogy: In a criminal trial, the defendant is innocent until proven guilty. The jury must decide if there is sufficient evidence to convict, i.e., if the prosecution s case is beyond a reasonable doubt. WIM

27 Power Function For simplicity, we restrict attention to a simple example. Assume that X 1,..., X n Normal(θ, 1) and that we want to test H 0 : θ 0 versus H 1 : θ > 0. Then the power function of a test φ is power φ (θ) = P (φ rejects H 0 X 1,..., X n Normal(θ, 1)). For any reasonable test, the power function should look something like this... WIM

28 P(reject H0) !1.0! population mean WIM

29 Given a significance level, α, we require that φ satisfy power φ (θ) α for θ 0. Among such level α tests, we prefer tests for which power φ (θ) is large for θ > 0. WIM

30 Let α = The usual test of H 0 : θ 0 versus H 1 : θ > 0 rejects H 0 iff x is sufficiently large, specifically, iff The power function of this test is power (θ) = P It follows that... = P = P x 0 1/ n = n x > ( ( n X > X Normal θ, 1 )) n ( X > ( X Normal θ, 1 )) n n n) θ ( X θ 1/ n > (1.645/ 1/ n ( X Normal θ, 1 )) n = P ( Z > θ n Z Normal(0, 1) ). WIM

31 power (0) = P (Z > 1.645) = 0.05 Interpretation: If θ = 0, then the probability of rejecting H 0 : θ 0, i.e., the probability of committing a Type I error, is power (0.5) = P (Z > n/2) increases as n increases Interpretation: If θ = 0, then the probability of rejecting H : θ 0 increases as n increases. Equivalently, the probability of retaining H 0, i.e., the probability of committing a Type II error, decreases as n increases. Notice that the above are properties of the test, not the data. Power is not an observed quantity. WIM

32 Power Analysis Typically a power analysis is performed when designing an experiment, to inform the choice of sample size. The test that rejects H 0 : θ 0 iff n x > satisfies power (0) = 0.05 n. To control power (0.5) = P ( Z > n/2 ), we must adjust the sample size. To achieve power (0.5) = 0.8: To achieve power (0.5) = 0.9: n/2 = qnorm(.2) qnorm(.95)-qnorm(.2) = n/2 n = 4 [qnorm(.95)-qnorm(.2)] 2. = 24.7 n = 4 [qnorm(.95)-qnorm(.1)] 2. = 34.3 WIM

33 Observed Power Suppose that we observe x = 0.5. Is this sufficient evidence to reject H 0 : θ 0 in favor of H 1 : θ > 0? The answer depends on the sample size. We need n/2 > 1.645, or n > ( ) 2. = 10.8, to reject H0. Remark: This answer confirms our intuition: the larger the sample, the more compelling the evidence that x = 0.5 was not drawn under θ 0, i.e., that H 0 is false. Suppose that n 10, in which case we retain H 0. Can we ascertain if the data provide affirmative evidence that H 0 is true, as opposed to merely providing insufficient evidence that H 0 is false? (Analogously, the defendant is found not guilty. Is s/he innocent, or was the charge merely not proven?) Remark: Hypothesis testing was not designed to answer this question. (Nor do criminal trials in Anglo-Saxon law distinguish between innocent and not proven. Scottish law does draw this distinction.) WIM

34 The answer should be obvious. Observing x > 0 tends to suggest that H 0 is false, i.e., observing x = 0.5 provides more evidence that H 0 is false than evidence that H 0 is true. Nevertheless, some authors have suggested that we can quantify the evidence that favors H 0 by computing the observed power, viz., power ( x) = power (0.5) = P ( Z > n/2 ). Recall, however, that this quantity increases as n increases. Hence, the more data that we collect (and therefore the more that x = 0.5 favors H 1 ), the larger power ( x) becomes. Large values of power ( x) should NOT be construed as evidence favoring H 0. J.M. Hoenig & D.M. Heisey, The abuse of power: the pervasive fallacy of power calculations for data analysis, The American Statistician, 55(1):19 24, Feb WIM

ECO220Y Review and Introduction to Hypothesis Testing Readings: Chapter 12

ECO220Y Review and Introduction to Hypothesis Testing Readings: Chapter 12 ECO220Y Review and Introduction to Hypothesis Testing Readings: Chapter 12 Winter 2012 Lecture 13 (Winter 2011) Estimation Lecture 13 1 / 33 Review of Main Concepts Sampling Distribution of Sample Mean

More information

Chapter Three. Hypothesis Testing

Chapter Three. Hypothesis Testing 3.1 Introduction The final phase of analyzing data is to make a decision concerning a set of choices or options. Should I invest in stocks or bonds? Should a new product be marketed? Are my products being

More information

Hypothesis Testing. ECE 3530 Spring Antonio Paiva

Hypothesis Testing. ECE 3530 Spring Antonio Paiva Hypothesis Testing ECE 3530 Spring 2010 Antonio Paiva What is hypothesis testing? A statistical hypothesis is an assertion or conjecture concerning one or more populations. To prove that a hypothesis is

More information

Section 9.1 (Part 2) (pp ) Type I and Type II Errors

Section 9.1 (Part 2) (pp ) Type I and Type II Errors Section 9.1 (Part 2) (pp. 547-551) Type I and Type II Errors Because we are basing our conclusion in a significance test on sample data, there is always a chance that our conclusions will be in error.

More information

Mathematical Statistics

Mathematical Statistics Mathematical Statistics MAS 713 Chapter 8 Previous lecture: 1 Bayesian Inference 2 Decision theory 3 Bayesian Vs. Frequentist 4 Loss functions 5 Conjugate priors Any questions? Mathematical Statistics

More information

Section 6.2 Hypothesis Testing

Section 6.2 Hypothesis Testing Section 6.2 Hypothesis Testing GIVEN: an unknown parameter, and two mutually exclusive statements H 0 and H 1 about. The Statistician must decide either to accept H 0 or to accept H 1. This kind of problem

More information

Political Science 236 Hypothesis Testing: Review and Bootstrapping

Political Science 236 Hypothesis Testing: Review and Bootstrapping Political Science 236 Hypothesis Testing: Review and Bootstrapping Rocío Titiunik Fall 2007 1 Hypothesis Testing Definition 1.1 Hypothesis. A hypothesis is a statement about a population parameter The

More information

The Components of a Statistical Hypothesis Testing Problem

The Components of a Statistical Hypothesis Testing Problem Statistical Inference: Recall from chapter 5 that statistical inference is the use of a subset of a population (the sample) to draw conclusions about the entire population. In chapter 5 we studied one

More information

Statistical Inference. Hypothesis Testing

Statistical Inference. Hypothesis Testing Statistical Inference Hypothesis Testing Previously, we introduced the point and interval estimation of an unknown parameter(s), say µ and σ 2. However, in practice, the problem confronting the scientist

More information

Probability and Statistics

Probability and Statistics Probability and Statistics Kristel Van Steen, PhD 2 Montefiore Institute - Systems and Modeling GIGA - Bioinformatics ULg kristel.vansteen@ulg.ac.be CHAPTER 4: IT IS ALL ABOUT DATA 4a - 1 CHAPTER 4: IT

More information

Business Statistics: Lecture 8: Introduction to Estimation & Hypothesis Testing

Business Statistics: Lecture 8: Introduction to Estimation & Hypothesis Testing Business Statistics: Lecture 8: Introduction to Estimation & Hypothesis Testing Agenda Introduction to Estimation Point estimation Interval estimation Introduction to Hypothesis Testing Concepts en terminology

More information

DIFFERENT APPROACHES TO STATISTICAL INFERENCE: HYPOTHESIS TESTING VERSUS BAYESIAN ANALYSIS

DIFFERENT APPROACHES TO STATISTICAL INFERENCE: HYPOTHESIS TESTING VERSUS BAYESIAN ANALYSIS DIFFERENT APPROACHES TO STATISTICAL INFERENCE: HYPOTHESIS TESTING VERSUS BAYESIAN ANALYSIS THUY ANH NGO 1. Introduction Statistics are easily come across in our daily life. Statements such as the average

More information

20 Hypothesis Testing, Part I

20 Hypothesis Testing, Part I 20 Hypothesis Testing, Part I Bob has told Alice that the average hourly rate for a lawyer in Virginia is $200 with a standard deviation of $50, but Alice wants to test this claim. If Bob is right, she

More information

Chapter 9. Hypothesis testing. 9.1 Introduction

Chapter 9. Hypothesis testing. 9.1 Introduction Chapter 9 Hypothesis testing 9.1 Introduction Confidence intervals are one of the two most common types of statistical inference. Use them when our goal is to estimate a population parameter. The second

More information

Introductory Econometrics. Review of statistics (Part II: Inference)

Introductory Econometrics. Review of statistics (Part II: Inference) Introductory Econometrics Review of statistics (Part II: Inference) Jun Ma School of Economics Renmin University of China October 1, 2018 1/16 Null and alternative hypotheses Usually, we have two competing

More information

Sampling distribution of t. 2. Sampling distribution of t. 3. Example: Gas mileage investigation. II. Inferential Statistics (8) t =

Sampling distribution of t. 2. Sampling distribution of t. 3. Example: Gas mileage investigation. II. Inferential Statistics (8) t = 2. The distribution of t values that would be obtained if a value of t were calculated for each sample mean for all possible random of a given size from a population _ t ratio: (X - µ hyp ) t s x The result

More information

INTERVAL ESTIMATION AND HYPOTHESES TESTING

INTERVAL ESTIMATION AND HYPOTHESES TESTING INTERVAL ESTIMATION AND HYPOTHESES TESTING 1. IDEA An interval rather than a point estimate is often of interest. Confidence intervals are thus important in empirical work. To construct interval estimates,

More information

Lecture 1: Probability Fundamentals

Lecture 1: Probability Fundamentals Lecture 1: Probability Fundamentals IB Paper 7: Probability and Statistics Carl Edward Rasmussen Department of Engineering, University of Cambridge January 22nd, 2008 Rasmussen (CUED) Lecture 1: Probability

More information

Hypothesis tests

Hypothesis tests 6.1 6.4 Hypothesis tests Prof. Tesler Math 186 February 26, 2014 Prof. Tesler 6.1 6.4 Hypothesis tests Math 186 / February 26, 2014 1 / 41 6.1 6.2 Intro to hypothesis tests and decision rules Hypothesis

More information

Preliminary Statistics. Lecture 5: Hypothesis Testing

Preliminary Statistics. Lecture 5: Hypothesis Testing Preliminary Statistics Lecture 5: Hypothesis Testing Rory Macqueen (rm43@soas.ac.uk), September 2015 Outline Elements/Terminology of Hypothesis Testing Types of Errors Procedure of Testing Significance

More information

Unit 19 Formulating Hypotheses and Making Decisions

Unit 19 Formulating Hypotheses and Making Decisions Unit 19 Formulating Hypotheses and Making Decisions Objectives: To formulate a null hypothesis and an alternative hypothesis, and to choose a significance level To identify the Type I error and the Type

More information

Section 5.4: Hypothesis testing for μ

Section 5.4: Hypothesis testing for μ Section 5.4: Hypothesis testing for μ Possible claims or hypotheses: Ball bearings have μ = 1 cm Medicine decreases blood pressure For testing hypotheses, we set up a null (H 0 ) and alternative (H a )

More information

Hypotheses Test Procedures. Is the claim wrong?

Hypotheses Test Procedures. Is the claim wrong? Hypotheses Test Procedures MATH 2300 Sections 9.1 and 9.2 Is the claim wrong? An oil company representative claims that the average price for gasoline in Lubbock is $2.30 per gallon. You think the average

More information

Statistical Preliminaries. Stony Brook University CSE545, Fall 2016

Statistical Preliminaries. Stony Brook University CSE545, Fall 2016 Statistical Preliminaries Stony Brook University CSE545, Fall 2016 Random Variables X: A mapping from Ω to R that describes the question we care about in practice. 2 Random Variables X: A mapping from

More information

Inference for a Population Proportion

Inference for a Population Proportion Al Nosedal. University of Toronto. November 11, 2015 Statistical inference is drawing conclusions about an entire population based on data in a sample drawn from that population. From both frequentist

More information

(1) Introduction to Bayesian statistics

(1) Introduction to Bayesian statistics Spring, 2018 A motivating example Student 1 will write down a number and then flip a coin If the flip is heads, they will honestly tell student 2 if the number is even or odd If the flip is tails, they

More information

Hypothesis Testing. 1 Definitions of test statistics. CB: chapter 8; section 10.3

Hypothesis Testing. 1 Definitions of test statistics. CB: chapter 8; section 10.3 Hypothesis Testing CB: chapter 8; section 0.3 Hypothesis: statement about an unknown population parameter Examples: The average age of males in Sweden is 7. (statement about population mean) The lowest

More information

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A. 1. Let P be a probability measure on a collection of sets A. (a) For each n N, let H n be a set in A such that H n H n+1. Show that P (H n ) monotonically converges to P ( k=1 H k) as n. (b) For each n

More information

Econ 325: Introduction to Empirical Economics

Econ 325: Introduction to Empirical Economics Econ 325: Introduction to Empirical Economics Chapter 9 Hypothesis Testing: Single Population Ch. 9-1 9.1 What is a Hypothesis? A hypothesis is a claim (assumption) about a population parameter: population

More information

Bayesian Inference. Chapter 2: Conjugate models

Bayesian Inference. Chapter 2: Conjugate models Bayesian Inference Chapter 2: Conjugate models Conchi Ausín and Mike Wiper Department of Statistics Universidad Carlos III de Madrid Master in Business Administration and Quantitative Methods Master in

More information

Lecture 2: Statistical Decision Theory (Part I)

Lecture 2: Statistical Decision Theory (Part I) Lecture 2: Statistical Decision Theory (Part I) Hao Helen Zhang Hao Helen Zhang Lecture 2: Statistical Decision Theory (Part I) 1 / 35 Outline of This Note Part I: Statistics Decision Theory (from Statistical

More information

Math 494: Mathematical Statistics

Math 494: Mathematical Statistics Math 494: Mathematical Statistics Instructor: Jimin Ding jmding@wustl.edu Department of Mathematics Washington University in St. Louis Class materials are available on course website (www.math.wustl.edu/

More information

Math Review Sheet, Fall 2008

Math Review Sheet, Fall 2008 1 Descriptive Statistics Math 3070-5 Review Sheet, Fall 2008 First we need to know about the relationship among Population Samples Objects The distribution of the population can be given in one of the

More information

Hypothesis Testing. Econ 690. Purdue University. Justin L. Tobias (Purdue) Testing 1 / 33

Hypothesis Testing. Econ 690. Purdue University. Justin L. Tobias (Purdue) Testing 1 / 33 Hypothesis Testing Econ 690 Purdue University Justin L. Tobias (Purdue) Testing 1 / 33 Outline 1 Basic Testing Framework 2 Testing with HPD intervals 3 Example 4 Savage Dickey Density Ratio 5 Bartlett

More information

The Empirical Rule, z-scores, and the Rare Event Approach

The Empirical Rule, z-scores, and the Rare Event Approach Overview The Empirical Rule, z-scores, and the Rare Event Approach Look at Chebyshev s Rule and the Empirical Rule Explore some applications of the Empirical Rule How to calculate and use z-scores Introducing

More information

Statistics Boot Camp. Dr. Stephanie Lane Institute for Defense Analyses DATAWorks 2018

Statistics Boot Camp. Dr. Stephanie Lane Institute for Defense Analyses DATAWorks 2018 Statistics Boot Camp Dr. Stephanie Lane Institute for Defense Analyses DATAWorks 2018 March 21, 2018 Outline of boot camp Summarizing and simplifying data Point and interval estimation Foundations of statistical

More information

Two examples of the use of fuzzy set theory in statistics. Glen Meeden University of Minnesota.

Two examples of the use of fuzzy set theory in statistics. Glen Meeden University of Minnesota. Two examples of the use of fuzzy set theory in statistics Glen Meeden University of Minnesota http://www.stat.umn.edu/~glen/talks 1 Fuzzy set theory Fuzzy set theory was introduced by Zadeh in (1965) as

More information

Lecture Testing Hypotheses: The Neyman-Pearson Paradigm

Lecture Testing Hypotheses: The Neyman-Pearson Paradigm Math 408 - Mathematical Statistics Lecture 29-30. Testing Hypotheses: The Neyman-Pearson Paradigm April 12-15, 2013 Konstantin Zuev (USC) Math 408, Lecture 29-30 April 12-15, 2013 1 / 12 Agenda Example:

More information

Preliminary Statistics Lecture 5: Hypothesis Testing (Outline)

Preliminary Statistics Lecture 5: Hypothesis Testing (Outline) 1 School of Oriental and African Studies September 2015 Department of Economics Preliminary Statistics Lecture 5: Hypothesis Testing (Outline) Gujarati D. Basic Econometrics, Appendix A.8 Barrow M. Statistics

More information

Estimating the accuracy of a hypothesis Setting. Assume a binary classification setting

Estimating the accuracy of a hypothesis Setting. Assume a binary classification setting Estimating the accuracy of a hypothesis Setting Assume a binary classification setting Assume input/output pairs (x, y) are sampled from an unknown probability distribution D = p(x, y) Train a binary classifier

More information

Dover- Sherborn High School Mathematics Curriculum Probability and Statistics

Dover- Sherborn High School Mathematics Curriculum Probability and Statistics Mathematics Curriculum A. DESCRIPTION This is a full year courses designed to introduce students to the basic elements of statistics and probability. Emphasis is placed on understanding terminology and

More information

Review: Statistical Model

Review: Statistical Model Review: Statistical Model { f θ :θ Ω} A statistical model for some data is a set of distributions, one of which corresponds to the true unknown distribution that produced the data. The statistical model

More information

Class 26: review for final exam 18.05, Spring 2014

Class 26: review for final exam 18.05, Spring 2014 Probability Class 26: review for final eam 8.05, Spring 204 Counting Sets Inclusion-eclusion principle Rule of product (multiplication rule) Permutation and combinations Basics Outcome, sample space, event

More information

Inverse Sampling for McNemar s Test

Inverse Sampling for McNemar s Test International Journal of Statistics and Probability; Vol. 6, No. 1; January 27 ISSN 1927-7032 E-ISSN 1927-7040 Published by Canadian Center of Science and Education Inverse Sampling for McNemar s Test

More information

MATH4427 Notebook 4 Fall Semester 2017/2018

MATH4427 Notebook 4 Fall Semester 2017/2018 MATH4427 Notebook 4 Fall Semester 2017/2018 prepared by Professor Jenny Baglivo c Copyright 2009-2018 by Jenny A. Baglivo. All Rights Reserved. 4 MATH4427 Notebook 4 3 4.1 K th Order Statistics and Their

More information

Sequential Monitoring of Clinical Trials Session 4 - Bayesian Evaluation of Group Sequential Designs

Sequential Monitoring of Clinical Trials Session 4 - Bayesian Evaluation of Group Sequential Designs Sequential Monitoring of Clinical Trials Session 4 - Bayesian Evaluation of Group Sequential Designs Presented August 8-10, 2012 Daniel L. Gillen Department of Statistics University of California, Irvine

More information

CHAPTER 9, 10. Similar to a courtroom trial. In trying a person for a crime, the jury needs to decide between one of two possibilities:

CHAPTER 9, 10. Similar to a courtroom trial. In trying a person for a crime, the jury needs to decide between one of two possibilities: CHAPTER 9, 10 Hypothesis Testing Similar to a courtroom trial. In trying a person for a crime, the jury needs to decide between one of two possibilities: The person is guilty. The person is innocent. To

More information

7. Estimation and hypothesis testing. Objective. Recommended reading

7. Estimation and hypothesis testing. Objective. Recommended reading 7. Estimation and hypothesis testing Objective In this chapter, we show how the election of estimators can be represented as a decision problem. Secondly, we consider the problem of hypothesis testing

More information

STAT Exam Jam Solutions. Contents

STAT Exam Jam Solutions. Contents s Contents 1 First Day 2 Question 1: PDFs, CDFs, and Finding E(X), V (X).......................... 2 Question 2: Bayesian Inference...................................... 3 Question 3: Binomial to Normal

More information

2016 SISG Module 17: Bayesian Statistics for Genetics Lecture 3: Binomial Sampling

2016 SISG Module 17: Bayesian Statistics for Genetics Lecture 3: Binomial Sampling 2016 SISG Module 17: Bayesian Statistics for Genetics Lecture 3: Binomial Sampling Jon Wakefield Departments of Statistics and Biostatistics University of Washington Outline Introduction and Motivating

More information

2 Belief, probability and exchangeability

2 Belief, probability and exchangeability 2 Belief, probability and exchangeability We first discuss what properties a reasonable belief function should have, and show that probabilities have these properties. Then, we review the basic machinery

More information

Topic 17: Simple Hypotheses

Topic 17: Simple Hypotheses Topic 17: November, 2011 1 Overview and Terminology Statistical hypothesis testing is designed to address the question: Do the data provide sufficient evidence to conclude that we must depart from our

More information

Exam Empirical Methods VU University Amsterdam, Faculty of Exact Sciences h, February 12, 2015

Exam Empirical Methods VU University Amsterdam, Faculty of Exact Sciences h, February 12, 2015 Exam Empirical Methods VU University Amsterdam, Faculty of Exact Sciences 18.30 21.15h, February 12, 2015 Question 1 is on this page. Always motivate your answers. Write your answers in English. Only the

More information

Fundamentals to Biostatistics. Prof. Chandan Chakraborty Associate Professor School of Medical Science & Technology IIT Kharagpur

Fundamentals to Biostatistics. Prof. Chandan Chakraborty Associate Professor School of Medical Science & Technology IIT Kharagpur Fundamentals to Biostatistics Prof. Chandan Chakraborty Associate Professor School of Medical Science & Technology IIT Kharagpur Statistics collection, analysis, interpretation of data development of new

More information

Frequentist Statistics and Hypothesis Testing Spring

Frequentist Statistics and Hypothesis Testing Spring Frequentist Statistics and Hypothesis Testing 18.05 Spring 2018 http://xkcd.com/539/ Agenda Introduction to the frequentist way of life. What is a statistic? NHST ingredients; rejection regions Simple

More information

Mathematical statistics

Mathematical statistics October 20 th, 2018 Lecture 17: Tests of Hypotheses Overview Week 1 Week 2 Week 4 Week 7 Week 10 Week 14 Probability reviews Chapter 6: Statistics and Sampling Distributions Chapter 7: Point Estimation

More information

Statistical Inference

Statistical Inference Statistical Inference Bernhard Klingenberg Institute of Statistics Graz University of Technology Steyrergasse 17/IV, 8010 Graz www.statistics.tugraz.at February 12, 2008 Outline Estimation: Review of concepts

More information

8: Hypothesis Testing

8: Hypothesis Testing Some definitions 8: Hypothesis Testing. Simple, compound, null and alternative hypotheses In test theory one distinguishes between simple hypotheses and compound hypotheses. A simple hypothesis Examples:

More information

23. MORE HYPOTHESIS TESTING

23. MORE HYPOTHESIS TESTING 23. MORE HYPOTHESIS TESTING The Logic Behind Hypothesis Testing For simplicity, consider testing H 0 : µ = µ 0 against the two-sided alternative H A : µ µ 0. Even if H 0 is true (so that the expectation

More information

Summary of Chapters 7-9

Summary of Chapters 7-9 Summary of Chapters 7-9 Chapter 7. Interval Estimation 7.2. Confidence Intervals for Difference of Two Means Let X 1,, X n and Y 1, Y 2,, Y m be two independent random samples of sizes n and m from two

More information

Introduction to Bayesian Statistics

Introduction to Bayesian Statistics Bayesian Parameter Estimation Introduction to Bayesian Statistics Harvey Thornburg Center for Computer Research in Music and Acoustics (CCRMA) Department of Music, Stanford University Stanford, California

More information

Tests about a population mean

Tests about a population mean October 2 nd, 2017 Overview Week 1 Week 2 Week 4 Week 7 Week 10 Week 12 Chapter 1: Descriptive statistics Chapter 6: Statistics and Sampling Distributions Chapter 7: Point Estimation Chapter 8: Confidence

More information

Hypothesis Testing. Part I. James J. Heckman University of Chicago. Econ 312 This draft, April 20, 2006

Hypothesis Testing. Part I. James J. Heckman University of Chicago. Econ 312 This draft, April 20, 2006 Hypothesis Testing Part I James J. Heckman University of Chicago Econ 312 This draft, April 20, 2006 1 1 A Brief Review of Hypothesis Testing and Its Uses values and pure significance tests (R.A. Fisher)

More information

Testing Simple Hypotheses R.L. Wolpert Institute of Statistics and Decision Sciences Duke University, Box Durham, NC 27708, USA

Testing Simple Hypotheses R.L. Wolpert Institute of Statistics and Decision Sciences Duke University, Box Durham, NC 27708, USA Testing Simple Hypotheses R.L. Wolpert Institute of Statistics and Decision Sciences Duke University, Box 90251 Durham, NC 27708, USA Summary: Pre-experimental Frequentist error probabilities do not summarize

More information

ORF 245 Fundamentals of Statistics Chapter 9 Hypothesis Testing

ORF 245 Fundamentals of Statistics Chapter 9 Hypothesis Testing ORF 245 Fundamentals of Statistics Chapter 9 Hypothesis Testing Robert Vanderbei Fall 2014 Slides last edited on November 24, 2014 http://www.princeton.edu/ rvdb Coin Tossing Example Consider two coins.

More information

CENTRAL LIMIT THEOREM (CLT)

CENTRAL LIMIT THEOREM (CLT) CENTRAL LIMIT THEOREM (CLT) A sampling distribution is the probability distribution of the sample statistic that is formed when samples of size n are repeatedly taken from a population. If the sample statistic

More information

Decision theory. 1 We may also consider randomized decision rules, where δ maps observed data D to a probability distribution over

Decision theory. 1 We may also consider randomized decision rules, where δ maps observed data D to a probability distribution over Point estimation Suppose we are interested in the value of a parameter θ, for example the unknown bias of a coin. We have already seen how one may use the Bayesian method to reason about θ; namely, we

More information

This does not cover everything on the final. Look at the posted practice problems for other topics.

This does not cover everything on the final. Look at the posted practice problems for other topics. Class 7: Review Problems for Final Exam 8.5 Spring 7 This does not cover everything on the final. Look at the posted practice problems for other topics. To save time in class: set up, but do not carry

More information

280 CHAPTER 9 TESTS OF HYPOTHESES FOR A SINGLE SAMPLE Tests of Statistical Hypotheses

280 CHAPTER 9 TESTS OF HYPOTHESES FOR A SINGLE SAMPLE Tests of Statistical Hypotheses 280 CHAPTER 9 TESTS OF HYPOTHESES FOR A SINGLE SAMPLE 9-1.2 Tests of Statistical Hypotheses To illustrate the general concepts, consider the propellant burning rate problem introduced earlier. The null

More information

Confidence Intervals and Hypothesis Tests

Confidence Intervals and Hypothesis Tests Confidence Intervals and Hypothesis Tests STA 281 Fall 2011 1 Background The central limit theorem provides a very powerful tool for determining the distribution of sample means for large sample sizes.

More information

ME3620. Theory of Engineering Experimentation. Spring Chapter IV. Decision Making for a Single Sample. Chapter IV

ME3620. Theory of Engineering Experimentation. Spring Chapter IV. Decision Making for a Single Sample. Chapter IV Theory of Engineering Experimentation Chapter IV. Decision Making for a Single Sample Chapter IV 1 4 1 Statistical Inference The field of statistical inference consists of those methods used to make decisions

More information

Chapter 5: HYPOTHESIS TESTING

Chapter 5: HYPOTHESIS TESTING MATH411: Applied Statistics Dr. YU, Chi Wai Chapter 5: HYPOTHESIS TESTING 1 WHAT IS HYPOTHESIS TESTING? As its name indicates, it is about a test of hypothesis. To be more precise, we would first translate

More information

Basic Concepts of Inference

Basic Concepts of Inference Basic Concepts of Inference Corresponds to Chapter 6 of Tamhane and Dunlop Slides prepared by Elizabeth Newton (MIT) with some slides by Jacqueline Telford (Johns Hopkins University) and Roy Welsch (MIT).

More information

MTH U481 : SPRING 2009: PRACTICE PROBLEMS FOR FINAL

MTH U481 : SPRING 2009: PRACTICE PROBLEMS FOR FINAL MTH U481 : SPRING 2009: PRACTICE PROBLEMS FOR FINAL 1). Two urns are provided as follows: urn 1 contains 2 white chips and 4 red chips, while urn 2 contains 5 white chips and 3 red chips. One chip is chosen

More information

2. A Basic Statistical Toolbox

2. A Basic Statistical Toolbox . A Basic Statistical Toolbo Statistics is a mathematical science pertaining to the collection, analysis, interpretation, and presentation of data. Wikipedia definition Mathematical statistics: concerned

More information

Chapter 8 of Devore , H 1 :

Chapter 8 of Devore , H 1 : Chapter 8 of Devore TESTING A STATISTICAL HYPOTHESIS Maghsoodloo A statistical hypothesis is an assumption about the frequency function(s) (i.e., PDF or pdf) of one or more random variables. Stated in

More information

Hypothesis testing: theory and methods

Hypothesis testing: theory and methods Statistical Methods Warsaw School of Economics November 3, 2017 Statistical hypothesis is the name of any conjecture about unknown parameters of a population distribution. The hypothesis should be verifiable

More information

Statistical Methods for Astronomy

Statistical Methods for Astronomy Statistical Methods for Astronomy Probability (Lecture 1) Statistics (Lecture 2) Why do we need statistics? Useful Statistics Definitions Error Analysis Probability distributions Error Propagation Binomial

More information

Bayesian Inference. Introduction

Bayesian Inference. Introduction Bayesian Inference Introduction The frequentist approach to inference holds that probabilities are intrinsicially tied (unsurprisingly) to frequencies. This interpretation is actually quite natural. What,

More information

Fundamentals. CS 281A: Statistical Learning Theory. Yangqing Jia. August, Based on tutorial slides by Lester Mackey and Ariel Kleiner

Fundamentals. CS 281A: Statistical Learning Theory. Yangqing Jia. August, Based on tutorial slides by Lester Mackey and Ariel Kleiner Fundamentals CS 281A: Statistical Learning Theory Yangqing Jia Based on tutorial slides by Lester Mackey and Ariel Kleiner August, 2011 Outline 1 Probability 2 Statistics 3 Linear Algebra 4 Optimization

More information

Statistical Inference

Statistical Inference Statistical Inference Classical and Bayesian Methods Class 6 AMS-UCSC Thu 26, 2012 Winter 2012. Session 1 (Class 6) AMS-132/206 Thu 26, 2012 1 / 15 Topics Topics We will talk about... 1 Hypothesis testing

More information

Precept 4: Hypothesis Testing

Precept 4: Hypothesis Testing Precept 4: Hypothesis Testing Soc 500: Applied Social Statistics Ian Lundberg Princeton University October 6, 2016 Learning Objectives 1 Introduce vectorized R code 2 Review homework and talk about RMarkdown

More information

An Informal Introduction to Statistics in 2h. Tim Kraska

An Informal Introduction to Statistics in 2h. Tim Kraska An Informal Introduction to Statistics in 2h Tim Kraska Goal of this Lecture This is not a replacement for a proper introduction to probability and statistics Instead, it only tries to convey the very

More information

CHAPTER 8. Test Procedures is a rule, based on sample data, for deciding whether to reject H 0 and contains:

CHAPTER 8. Test Procedures is a rule, based on sample data, for deciding whether to reject H 0 and contains: CHAPTER 8 Test of Hypotheses Based on a Single Sample Hypothesis testing is the method that decide which of two contradictory claims about the parameter is correct. Here the parameters of interest are

More information

MTMS Mathematical Statistics

MTMS Mathematical Statistics MTMS.01.099 Mathematical Statistics Lecture 12. Hypothesis testing. Power function. Approximation of Normal distribution and application to Binomial distribution Tõnu Kollo Fall 2016 Hypothesis Testing

More information

Non-parametric Inference and Resampling

Non-parametric Inference and Resampling Non-parametric Inference and Resampling Exercises by David Wozabal (Last update. Juni 010) 1 Basic Facts about Rank and Order Statistics 1.1 10 students were asked about the amount of time they spend surfing

More information

Introduction to Probabilistic Machine Learning

Introduction to Probabilistic Machine Learning Introduction to Probabilistic Machine Learning Piyush Rai Dept. of CSE, IIT Kanpur (Mini-course 1) Nov 03, 2015 Piyush Rai (IIT Kanpur) Introduction to Probabilistic Machine Learning 1 Machine Learning

More information

Solution: First note that the power function of the test is given as follows,

Solution: First note that the power function of the test is given as follows, Problem 4.5.8: Assume the life of a tire given by X is distributed N(θ, 5000 ) Past experience indicates that θ = 30000. The manufacturere claims the tires made by a new process have mean θ > 30000. Is

More information

LECTURE 5. Introduction to Econometrics. Hypothesis testing

LECTURE 5. Introduction to Econometrics. Hypothesis testing LECTURE 5 Introduction to Econometrics Hypothesis testing October 18, 2016 1 / 26 ON TODAY S LECTURE We are going to discuss how hypotheses about coefficients can be tested in regression models We will

More information

STAT 4385 Topic 01: Introduction & Review

STAT 4385 Topic 01: Introduction & Review STAT 4385 Topic 01: Introduction & Review Xiaogang Su, Ph.D. Department of Mathematical Science University of Texas at El Paso xsu@utep.edu Spring, 2016 Outline Welcome What is Regression Analysis? Basics

More information

F79SM STATISTICAL METHODS

F79SM STATISTICAL METHODS F79SM STATISTICAL METHODS SUMMARY NOTES 9 Hypothesis testing 9.1 Introduction As before we have a random sample x of size n of a population r.v. X with pdf/pf f(x;θ). The distribution we assign to X is

More information

Math 101: Elementary Statistics Tests of Hypothesis

Math 101: Elementary Statistics Tests of Hypothesis Tests of Hypothesis Department of Mathematics and Computer Science University of the Philippines Baguio November 15, 2018 Basic Concepts of Statistical Hypothesis Testing A statistical hypothesis is an

More information

Chapter 7: Hypothesis Testing - Solutions

Chapter 7: Hypothesis Testing - Solutions Chapter 7: Hypothesis Testing - Solutions 7.1 Introduction to Hypothesis Testing The problem with applying the techniques learned in Chapter 5 is that typically, the population mean (µ) and standard deviation

More information

Review. December 4 th, Review

Review. December 4 th, Review December 4 th, 2017 Att. Final exam: Course evaluation Friday, 12/14/2018, 10:30am 12:30pm Gore Hall 115 Overview Week 2 Week 4 Week 7 Week 10 Week 12 Chapter 6: Statistics and Sampling Distributions Chapter

More information

Space Telescope Science Institute statistics mini-course. October Inference I: Estimation, Confidence Intervals, and Tests of Hypotheses

Space Telescope Science Institute statistics mini-course. October Inference I: Estimation, Confidence Intervals, and Tests of Hypotheses Space Telescope Science Institute statistics mini-course October 2011 Inference I: Estimation, Confidence Intervals, and Tests of Hypotheses James L Rosenberger Acknowledgements: Donald Richards, William

More information

Introduction: MLE, MAP, Bayesian reasoning (28/8/13)

Introduction: MLE, MAP, Bayesian reasoning (28/8/13) STA561: Probabilistic machine learning Introduction: MLE, MAP, Bayesian reasoning (28/8/13) Lecturer: Barbara Engelhardt Scribes: K. Ulrich, J. Subramanian, N. Raval, J. O Hollaren 1 Classifiers In this

More information

Purposes of Data Analysis. Variables and Samples. Parameters and Statistics. Part 1: Probability Distributions

Purposes of Data Analysis. Variables and Samples. Parameters and Statistics. Part 1: Probability Distributions Part 1: Probability Distributions Purposes of Data Analysis True Distributions or Relationships in the Earths System Probability Distribution Normal Distribution Student-t Distribution Chi Square Distribution

More information

The t-distribution. Patrick Breheny. October 13. z tests The χ 2 -distribution The t-distribution Summary

The t-distribution. Patrick Breheny. October 13. z tests The χ 2 -distribution The t-distribution Summary Patrick Breheny October 13 Patrick Breheny Biostatistical Methods I (BIOS 5710) 1/25 Introduction Introduction What s wrong with z-tests? So far we ve (thoroughly!) discussed how to carry out hypothesis

More information

Unobservable Parameter. Observed Random Sample. Calculate Posterior. Choosing Prior. Conjugate prior. population proportion, p prior:

Unobservable Parameter. Observed Random Sample. Calculate Posterior. Choosing Prior. Conjugate prior. population proportion, p prior: Pi Priors Unobservable Parameter population proportion, p prior: π ( p) Conjugate prior π ( p) ~ Beta( a, b) same PDF family exponential family only Posterior π ( p y) ~ Beta( a + y, b + n y) Observed

More information

Linear Models: Comparing Variables. Stony Brook University CSE545, Fall 2017

Linear Models: Comparing Variables. Stony Brook University CSE545, Fall 2017 Linear Models: Comparing Variables Stony Brook University CSE545, Fall 2017 Statistical Preliminaries Random Variables Random Variables X: A mapping from Ω to ℝ that describes the question we care about

More information