Statistical Inference: Uses, Abuses, and Misconceptions
|
|
- Primrose Hudson
- 5 years ago
- Views:
Transcription
1 Statistical Inference: Uses, Abuses, and Misconceptions Michael W. Trosset Indiana Statistical Consulting Center Department of Statistics ISCC is part of IU s Department of Statistics, chaired by Stanley Wasserman. WIM
2 Experiments are performed for the purpose of obtaining information about a population that is imperfectly understood. The population represents all of the experimental outcomes that might occur (the sample space), together with a probability distribution P that describes the behavior of the experiment. The observed outcomes are a random sample of the population. Notation: X 1,..., X n P, x = {x 1,..., x n } Statistical inference is concerned with how one draws conclusions about P from x. WIM
3 Example Consider an experiment to investigate the effect of a gasoline additive on miles per gallon (mpg). For car i = 1,..., n, let Car i improves iff X i > 0. B i = mpg before additive, A i = mpg after additive, X i = A i B i = improvement in mpg. Write the experiment as X 1,..., X n P and observe x = {x 1,..., x n }. What might we like to know about P? For example... What is P (X i > 0)? Is P (X i > 0) > 0.5? What is EX i? Is EX i > 0? WIM
4 Two very different concerns are often conflated: 1. What would we like to know about P? 2. How can we use x to draw inferences about the attributes of P that interest us? WIM
5 1. Describing a Population Suppose that P is known. Then inference is not necessary and we can focus on deciding what attributes of P are interesting and/or important. We might begin by displaying/examining/reporting the probability density function (pdf) of P, say f. For example... WIM
6 f(x) WIM x
7 Population Attributes To summarize a probability distribution, statisticians usually begin by computing a measure of centrality (e.g., median, mean) and a measure of dispersion (e.g., interquartile range, standard deviation). These quantities are often useful, but may not reveal important features of the population. In the preceding example, the quartiles are q 1. = 0.59, q2. = 0.14, and q3. = The median is q 2. = 0.14 and the interquartile range is q3 q 1. = = The mean is µ = 0.3, the variance is σ 2 = 1.714, and the standard deviation is σ. = If one only computes measures of centrality and dispersion, then one may overlook the shape of this unusual pdf. In fact, f is a bimodal mixture. Ninety percent of the population is Normal(0, 1), but ten percent of the population is Normal(3, 0.04). For the latter subpopulation, the additive provides a definite benefit. WIM
8 Effect Size It is often recommended that researchers examine effect size. If one construes this advice to mean that one should examine the magnitude of the effect, i.e., the location of P relative to some baseline, by quantifying the salient attributes of P, then this is exemplary advice. Unfortunately, Jacob Cohen s well-intentioned efforts to help researchers design experiments have led many researchers to obsess on specific measures, e.g., d = µ/σ. Cohen was not concerned with reporting salient features of P, but on performing specific calculations related to the choice of sample size. These calculations depend on specifying a value of d and Cohen offered rough guidelines for how to do so when µ and σ are not known. WIM
9 Cohen s rough guidelines for choosing a sample size when little is known about P have led many researchers to embrace uncritical statements like d = 0.5 is an effect of medium size. In fact, Cohen anticipated and warned against such abuse: The terms small, medium, and large are relative, not only to each other but to... the specific content and research method being employed in any given investigation. In the face of this relativity, there is a certain risk inherent in offering conventional operational definitions for these terms... J. Cohen, Statistical Power Analysis for the Behavioral Sciences, 1988, p. 25. Researchers should be interested in the size of an effect, but they should also think about how to quantify and interpret its magnitude in the context of their discipline. For example... Instead of reporting whether µ < 0 or µ > 0, report the value of µ and think critically about its material significance (importance). WIM
10 2. Inference In practice, P is unknown and we draw x to obtain information about P. We may construct... Graphical displays of x: stem-and-leaf plots, box plots Estimates of P : empirical distributions & cdfs, quantile-quantile plots Estimates of f: histograms, average shifted histograms, kernel density estimates To illustrate, consider the forearm lengths (in inches) of n = 140 adult males... WIM
11 WIM
12 Box Plot Normal Q Q Plot PDF Estimate Sample Quantiles Density Theoretical Quantiles N = 140 Bandwidth = WIM
13 In practice, most inference is parametric. For example, assume that P Normal(µ, σ 2 ) and draw inferences about µ. Or... Let n = 100 and assume that in which case X 1,..., X n Bernoulli(p), Y = n X i Binomial(n; p). i=1 We observe y = 32. What might we infer about p? Point estimation is concerned with guessing the true value of p. Hypothesis testing is concerned with making a binary decision about p, e.g., is p = 0.5 or is p 0.5? Set estimation is concerned with constructing a set of plausible values of p. WIM
14 Point Estimation Plug-In Approach. Estimate p = EX i, the population mean, with the sample mean, ˆp = x = y n = = Maximum Likelihood Approach. Estimate p with In this case, p = ˆp = p = argmax a P (Y = 32 p = a). WIM
15 Decision-Theoretic Approaches. Let δ denote an estimator and let L (δ(y), p) = [δ(y) p] 2 denote the squared error loss when p is true and y is observed. Let R(p; δ) = E p L (δ(y ), p) denote the risk (expected loss) of δ when p is true. For example, δ(y) = y/n has risk function R(p; δ) = p(1 p)/n. Ideally, we would like to use an estimator that simultaneously minimizes the risk for every possible value of p. Unfortunately, this is impossible. We can proceed in one of two ways, either by restricting the class of estimators that we are willing to use, or by minimizing some real-valued attribute of the risk function. The Bayes Principle tells us to choose δ to minimize a weighted average of the possible risks: 1 0 R(p; δ)w(p) dp, WIM
16 where w is a pdf that weights the p (0, 1). The optimal estimator, δ w, is called the Bayes estimator with respect to w. Example: If w is the pdf of Beta(a, b), then δ w (y) = δ a,b (y) = a + y a + b + n = ( ) a + b a a + b + n a + b + ( ) n y a + b + n n. For n = 100 and y = 32 we obtain... a a b a+b δ a,b (32) WIM
17 w(p) WIM p
18 Hypothesis Testing Hypothesis testing is concerned with binary decisions. Partition the parameter set, here (0, 1), into null and alternative hypotheses, e.g., H 0 : p = 0.5 vs H 1 : p 0.5 or H 0 : p 0.5 vs H 1 : p < 0.5. H 0 is privileged, i.e., we retain H 0 unless we find compelling evidence against it. A test statistic measures evidence against H 0. Under H 0 : p = 0.5, EY = np = 50; hence, Y 50 measures evidence against H 0 : p = 0.5. We observed y 50 = = 18. Under H 0 : p = 0.5, p = P H0 ( Y 50 18) = P H0 (Y 32) + P H0 (Y 68). = We reject H 0 iff p is sufficiently small, i.e., p α for some predetermined α. The computed quantity p is the significance probability; the fixed quantity α is the significance level. WIM
19 Statistical significance is not material significance! Example: Let X i Normal(µ, σ 2 ) denote car i s improvement in mpg. A government agency will sanction advertising the additive iff H 0 : µ 1 is rejected at significance level α = A large corporation manufactures an additive that increases mileage by an average of µ = 1.01 miles per gallon. The corporation funds a large study of n = 900 vehicles in which x = 1.01 and s = 0.1 are observed. This results in a test statistic of t = x µ 0 s/ = n 0.1/ 900 = 3 and a significance probability of p = P (T n 3). = < 0.05 = α, so H 0 is decisively rejected and advertising is authorized. WIM
20 2. An amateur automotive mechanic invents an additive that increases mileage by an average of µ = 1.21 miles per gallon. The mechanic funds a small study of n = 9 vehicles in which x = 1.21 and s = 0.4 are observed. This results in a test statistic of t = x µ 0 s/ n and a significance probability of = / 9 = p = P (T n 1.575). = > 0.05 = α, so H 0 is retained and advertising is not authorized. WIM
21 Significance levels are error rates! Suppose that H 0 is true. A test that rejects H 0 iff p α will falsely reject H 0 with probability α. For example, suppose that I give a fair penny to each of 600 students. Each student tosses his/her penny 100 times and tests H 0 : p = 0.5 at significance level α = Let X i = 1 if student i rejects H 0 and X i = 0 if student i retains H 0. Because H 0 is true and p = 0.5, X i Bernoulli(α = 0.05) and the total number of students who falsely reject H 0 is Y = 600 i=1 Binomial(600; 0.05). It follows that EY = 30, P (Y 20). = 0.98, etc. WIM
22 Set Estimation Imagine testing H 0 : p = p 0 at significance level α for every p 0 (0, 1). The p 0 that are retained constitute a set of plausible values of p. This is an example of a confidence set. For Y Binomial(100; p), y = 32, and α = 0.1, the confidence set is [0.245, 0.404]. Notice that this interval does not contain p 0 = 0.5, which we have already rejected as implausible. Let C( x) denote the set of plausible values. Because x is a random sample, C( x) may or may not cover p. The probability that the random set C( X) covers p is 1 α, the confidence coefficient. Confidence is not a guarantee! If 600 researchers construct confidence sets with confidence coefficient 0.95, then we expect that 30 confidence sets will fail to cover the parameter of interest. The probability that at least 20 confidence sets will fail to cover is approximately WIM
23 Bayesian Inference A confidence coefficient tells us the probability that C( X) will cover p. This fact provides information about how the procedure for constructing C( x) will behave when used repeatedly; it does not tell us whether or not the observed C( x) actually covers p. In contrast, Bayesian methods endeavor to quantify the evidence in x. Suppose that X 1,..., X n P θ, θ Ω. Let p( x θ) denote the joint pdf of x and let p(θ) denote a pdf on Ω. Then, with squared error loss, the Bayes estimate of θ turns out to be θ p( x θ) p(θ) dθ Ω δ p ( x) =. p( x θ) p(θ) dθ Formally, δ p ( x) is the mean of the conditional pdf Ω Ω p( x θ) p(θ) p( x, θ) = p( x θ) p(θ) dθ p( x) = p(θ x). WIM
24 Let us conceive of the experiment as occurring in two stages: 1. Nature draws θ from the marginal pdf p(θ). At this stage, absent a sample from p( x θ) and knowing only p(θ), we would estimate θ to be the mean of p(θ). 2. We draw x from the conditional pdf p( x θ). To incorporate x into our estimate of θ, we replace p(θ) with p(θ x). Thus, we modify our initial knowledge of θ in light of the information about θ provided by x. In this context, p(θ) is the prior distribution and p(θ x) is the posterior distribution. But what if θ is not random? What if θ is a physical constant, e.g., the speed of light? WIM
25 The Basic Tenet of Bayesianity asserts that statisticians should treat known quantities as fixed and unknown quantities probabilistically. The basic tenet implies the following: Inference should be conditional on x. Uncertainty about θ should be modeled by p(θ). (This requirement may necessitate a subjectivist interpretation of probability.) Bayesians insist that inference should focus on the posterior distribution, which reveals how the prior distribution is modified by the data. One may display/report the entire posterior or certain of its attributes. Instead of a confidence set, one may compute a region of highest posterior density (HPD), i.e., the smallest subset of Ω for which the posterior probability is 1 α. L. J. Savage famously quipped that The only use I know for a confidence interval is to have confidence in it. In contrast, Bayesians can legitimately claim that the (possibly subjective) probability that θ lies in a region of HPD is 1 α. WIM
26 3. More About Hypothesis Testing True State of Nature H 0 H 1 Statistical retain H 0 Type II error Test reject H 0 Type I error Hypothesis testing is not intended for the purpose of choosing the more plausible hypothesis. It is only intended to determine if there is sufficient evidence to reject the null hypothesis. Legal Analogy: In a criminal trial, the defendant is innocent until proven guilty. The jury must decide if there is sufficient evidence to convict, i.e., if the prosecution s case is beyond a reasonable doubt. WIM
27 Power Function For simplicity, we restrict attention to a simple example. Assume that X 1,..., X n Normal(θ, 1) and that we want to test H 0 : θ 0 versus H 1 : θ > 0. Then the power function of a test φ is power φ (θ) = P (φ rejects H 0 X 1,..., X n Normal(θ, 1)). For any reasonable test, the power function should look something like this... WIM
28 P(reject H0) !1.0! population mean WIM
29 Given a significance level, α, we require that φ satisfy power φ (θ) α for θ 0. Among such level α tests, we prefer tests for which power φ (θ) is large for θ > 0. WIM
30 Let α = The usual test of H 0 : θ 0 versus H 1 : θ > 0 rejects H 0 iff x is sufficiently large, specifically, iff The power function of this test is power (θ) = P It follows that... = P = P x 0 1/ n = n x > ( ( n X > X Normal θ, 1 )) n ( X > ( X Normal θ, 1 )) n n n) θ ( X θ 1/ n > (1.645/ 1/ n ( X Normal θ, 1 )) n = P ( Z > θ n Z Normal(0, 1) ). WIM
31 power (0) = P (Z > 1.645) = 0.05 Interpretation: If θ = 0, then the probability of rejecting H 0 : θ 0, i.e., the probability of committing a Type I error, is power (0.5) = P (Z > n/2) increases as n increases Interpretation: If θ = 0, then the probability of rejecting H : θ 0 increases as n increases. Equivalently, the probability of retaining H 0, i.e., the probability of committing a Type II error, decreases as n increases. Notice that the above are properties of the test, not the data. Power is not an observed quantity. WIM
32 Power Analysis Typically a power analysis is performed when designing an experiment, to inform the choice of sample size. The test that rejects H 0 : θ 0 iff n x > satisfies power (0) = 0.05 n. To control power (0.5) = P ( Z > n/2 ), we must adjust the sample size. To achieve power (0.5) = 0.8: To achieve power (0.5) = 0.9: n/2 = qnorm(.2) qnorm(.95)-qnorm(.2) = n/2 n = 4 [qnorm(.95)-qnorm(.2)] 2. = 24.7 n = 4 [qnorm(.95)-qnorm(.1)] 2. = 34.3 WIM
33 Observed Power Suppose that we observe x = 0.5. Is this sufficient evidence to reject H 0 : θ 0 in favor of H 1 : θ > 0? The answer depends on the sample size. We need n/2 > 1.645, or n > ( ) 2. = 10.8, to reject H0. Remark: This answer confirms our intuition: the larger the sample, the more compelling the evidence that x = 0.5 was not drawn under θ 0, i.e., that H 0 is false. Suppose that n 10, in which case we retain H 0. Can we ascertain if the data provide affirmative evidence that H 0 is true, as opposed to merely providing insufficient evidence that H 0 is false? (Analogously, the defendant is found not guilty. Is s/he innocent, or was the charge merely not proven?) Remark: Hypothesis testing was not designed to answer this question. (Nor do criminal trials in Anglo-Saxon law distinguish between innocent and not proven. Scottish law does draw this distinction.) WIM
34 The answer should be obvious. Observing x > 0 tends to suggest that H 0 is false, i.e., observing x = 0.5 provides more evidence that H 0 is false than evidence that H 0 is true. Nevertheless, some authors have suggested that we can quantify the evidence that favors H 0 by computing the observed power, viz., power ( x) = power (0.5) = P ( Z > n/2 ). Recall, however, that this quantity increases as n increases. Hence, the more data that we collect (and therefore the more that x = 0.5 favors H 1 ), the larger power ( x) becomes. Large values of power ( x) should NOT be construed as evidence favoring H 0. J.M. Hoenig & D.M. Heisey, The abuse of power: the pervasive fallacy of power calculations for data analysis, The American Statistician, 55(1):19 24, Feb WIM
ECO220Y Review and Introduction to Hypothesis Testing Readings: Chapter 12
ECO220Y Review and Introduction to Hypothesis Testing Readings: Chapter 12 Winter 2012 Lecture 13 (Winter 2011) Estimation Lecture 13 1 / 33 Review of Main Concepts Sampling Distribution of Sample Mean
More informationChapter Three. Hypothesis Testing
3.1 Introduction The final phase of analyzing data is to make a decision concerning a set of choices or options. Should I invest in stocks or bonds? Should a new product be marketed? Are my products being
More informationHypothesis Testing. ECE 3530 Spring Antonio Paiva
Hypothesis Testing ECE 3530 Spring 2010 Antonio Paiva What is hypothesis testing? A statistical hypothesis is an assertion or conjecture concerning one or more populations. To prove that a hypothesis is
More informationSection 9.1 (Part 2) (pp ) Type I and Type II Errors
Section 9.1 (Part 2) (pp. 547-551) Type I and Type II Errors Because we are basing our conclusion in a significance test on sample data, there is always a chance that our conclusions will be in error.
More informationMathematical Statistics
Mathematical Statistics MAS 713 Chapter 8 Previous lecture: 1 Bayesian Inference 2 Decision theory 3 Bayesian Vs. Frequentist 4 Loss functions 5 Conjugate priors Any questions? Mathematical Statistics
More informationSection 6.2 Hypothesis Testing
Section 6.2 Hypothesis Testing GIVEN: an unknown parameter, and two mutually exclusive statements H 0 and H 1 about. The Statistician must decide either to accept H 0 or to accept H 1. This kind of problem
More informationPolitical Science 236 Hypothesis Testing: Review and Bootstrapping
Political Science 236 Hypothesis Testing: Review and Bootstrapping Rocío Titiunik Fall 2007 1 Hypothesis Testing Definition 1.1 Hypothesis. A hypothesis is a statement about a population parameter The
More informationThe Components of a Statistical Hypothesis Testing Problem
Statistical Inference: Recall from chapter 5 that statistical inference is the use of a subset of a population (the sample) to draw conclusions about the entire population. In chapter 5 we studied one
More informationStatistical Inference. Hypothesis Testing
Statistical Inference Hypothesis Testing Previously, we introduced the point and interval estimation of an unknown parameter(s), say µ and σ 2. However, in practice, the problem confronting the scientist
More informationProbability and Statistics
Probability and Statistics Kristel Van Steen, PhD 2 Montefiore Institute - Systems and Modeling GIGA - Bioinformatics ULg kristel.vansteen@ulg.ac.be CHAPTER 4: IT IS ALL ABOUT DATA 4a - 1 CHAPTER 4: IT
More informationBusiness Statistics: Lecture 8: Introduction to Estimation & Hypothesis Testing
Business Statistics: Lecture 8: Introduction to Estimation & Hypothesis Testing Agenda Introduction to Estimation Point estimation Interval estimation Introduction to Hypothesis Testing Concepts en terminology
More informationDIFFERENT APPROACHES TO STATISTICAL INFERENCE: HYPOTHESIS TESTING VERSUS BAYESIAN ANALYSIS
DIFFERENT APPROACHES TO STATISTICAL INFERENCE: HYPOTHESIS TESTING VERSUS BAYESIAN ANALYSIS THUY ANH NGO 1. Introduction Statistics are easily come across in our daily life. Statements such as the average
More information20 Hypothesis Testing, Part I
20 Hypothesis Testing, Part I Bob has told Alice that the average hourly rate for a lawyer in Virginia is $200 with a standard deviation of $50, but Alice wants to test this claim. If Bob is right, she
More informationChapter 9. Hypothesis testing. 9.1 Introduction
Chapter 9 Hypothesis testing 9.1 Introduction Confidence intervals are one of the two most common types of statistical inference. Use them when our goal is to estimate a population parameter. The second
More informationIntroductory Econometrics. Review of statistics (Part II: Inference)
Introductory Econometrics Review of statistics (Part II: Inference) Jun Ma School of Economics Renmin University of China October 1, 2018 1/16 Null and alternative hypotheses Usually, we have two competing
More informationSampling distribution of t. 2. Sampling distribution of t. 3. Example: Gas mileage investigation. II. Inferential Statistics (8) t =
2. The distribution of t values that would be obtained if a value of t were calculated for each sample mean for all possible random of a given size from a population _ t ratio: (X - µ hyp ) t s x The result
More informationINTERVAL ESTIMATION AND HYPOTHESES TESTING
INTERVAL ESTIMATION AND HYPOTHESES TESTING 1. IDEA An interval rather than a point estimate is often of interest. Confidence intervals are thus important in empirical work. To construct interval estimates,
More informationLecture 1: Probability Fundamentals
Lecture 1: Probability Fundamentals IB Paper 7: Probability and Statistics Carl Edward Rasmussen Department of Engineering, University of Cambridge January 22nd, 2008 Rasmussen (CUED) Lecture 1: Probability
More informationHypothesis tests
6.1 6.4 Hypothesis tests Prof. Tesler Math 186 February 26, 2014 Prof. Tesler 6.1 6.4 Hypothesis tests Math 186 / February 26, 2014 1 / 41 6.1 6.2 Intro to hypothesis tests and decision rules Hypothesis
More informationPreliminary Statistics. Lecture 5: Hypothesis Testing
Preliminary Statistics Lecture 5: Hypothesis Testing Rory Macqueen (rm43@soas.ac.uk), September 2015 Outline Elements/Terminology of Hypothesis Testing Types of Errors Procedure of Testing Significance
More informationUnit 19 Formulating Hypotheses and Making Decisions
Unit 19 Formulating Hypotheses and Making Decisions Objectives: To formulate a null hypothesis and an alternative hypothesis, and to choose a significance level To identify the Type I error and the Type
More informationSection 5.4: Hypothesis testing for μ
Section 5.4: Hypothesis testing for μ Possible claims or hypotheses: Ball bearings have μ = 1 cm Medicine decreases blood pressure For testing hypotheses, we set up a null (H 0 ) and alternative (H a )
More informationHypotheses Test Procedures. Is the claim wrong?
Hypotheses Test Procedures MATH 2300 Sections 9.1 and 9.2 Is the claim wrong? An oil company representative claims that the average price for gasoline in Lubbock is $2.30 per gallon. You think the average
More informationStatistical Preliminaries. Stony Brook University CSE545, Fall 2016
Statistical Preliminaries Stony Brook University CSE545, Fall 2016 Random Variables X: A mapping from Ω to R that describes the question we care about in practice. 2 Random Variables X: A mapping from
More informationInference for a Population Proportion
Al Nosedal. University of Toronto. November 11, 2015 Statistical inference is drawing conclusions about an entire population based on data in a sample drawn from that population. From both frequentist
More information(1) Introduction to Bayesian statistics
Spring, 2018 A motivating example Student 1 will write down a number and then flip a coin If the flip is heads, they will honestly tell student 2 if the number is even or odd If the flip is tails, they
More informationHypothesis Testing. 1 Definitions of test statistics. CB: chapter 8; section 10.3
Hypothesis Testing CB: chapter 8; section 0.3 Hypothesis: statement about an unknown population parameter Examples: The average age of males in Sweden is 7. (statement about population mean) The lowest
More informationFall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.
1. Let P be a probability measure on a collection of sets A. (a) For each n N, let H n be a set in A such that H n H n+1. Show that P (H n ) monotonically converges to P ( k=1 H k) as n. (b) For each n
More informationEcon 325: Introduction to Empirical Economics
Econ 325: Introduction to Empirical Economics Chapter 9 Hypothesis Testing: Single Population Ch. 9-1 9.1 What is a Hypothesis? A hypothesis is a claim (assumption) about a population parameter: population
More informationBayesian Inference. Chapter 2: Conjugate models
Bayesian Inference Chapter 2: Conjugate models Conchi Ausín and Mike Wiper Department of Statistics Universidad Carlos III de Madrid Master in Business Administration and Quantitative Methods Master in
More informationLecture 2: Statistical Decision Theory (Part I)
Lecture 2: Statistical Decision Theory (Part I) Hao Helen Zhang Hao Helen Zhang Lecture 2: Statistical Decision Theory (Part I) 1 / 35 Outline of This Note Part I: Statistics Decision Theory (from Statistical
More informationMath 494: Mathematical Statistics
Math 494: Mathematical Statistics Instructor: Jimin Ding jmding@wustl.edu Department of Mathematics Washington University in St. Louis Class materials are available on course website (www.math.wustl.edu/
More informationMath Review Sheet, Fall 2008
1 Descriptive Statistics Math 3070-5 Review Sheet, Fall 2008 First we need to know about the relationship among Population Samples Objects The distribution of the population can be given in one of the
More informationHypothesis Testing. Econ 690. Purdue University. Justin L. Tobias (Purdue) Testing 1 / 33
Hypothesis Testing Econ 690 Purdue University Justin L. Tobias (Purdue) Testing 1 / 33 Outline 1 Basic Testing Framework 2 Testing with HPD intervals 3 Example 4 Savage Dickey Density Ratio 5 Bartlett
More informationThe Empirical Rule, z-scores, and the Rare Event Approach
Overview The Empirical Rule, z-scores, and the Rare Event Approach Look at Chebyshev s Rule and the Empirical Rule Explore some applications of the Empirical Rule How to calculate and use z-scores Introducing
More informationStatistics Boot Camp. Dr. Stephanie Lane Institute for Defense Analyses DATAWorks 2018
Statistics Boot Camp Dr. Stephanie Lane Institute for Defense Analyses DATAWorks 2018 March 21, 2018 Outline of boot camp Summarizing and simplifying data Point and interval estimation Foundations of statistical
More informationTwo examples of the use of fuzzy set theory in statistics. Glen Meeden University of Minnesota.
Two examples of the use of fuzzy set theory in statistics Glen Meeden University of Minnesota http://www.stat.umn.edu/~glen/talks 1 Fuzzy set theory Fuzzy set theory was introduced by Zadeh in (1965) as
More informationLecture Testing Hypotheses: The Neyman-Pearson Paradigm
Math 408 - Mathematical Statistics Lecture 29-30. Testing Hypotheses: The Neyman-Pearson Paradigm April 12-15, 2013 Konstantin Zuev (USC) Math 408, Lecture 29-30 April 12-15, 2013 1 / 12 Agenda Example:
More informationPreliminary Statistics Lecture 5: Hypothesis Testing (Outline)
1 School of Oriental and African Studies September 2015 Department of Economics Preliminary Statistics Lecture 5: Hypothesis Testing (Outline) Gujarati D. Basic Econometrics, Appendix A.8 Barrow M. Statistics
More informationEstimating the accuracy of a hypothesis Setting. Assume a binary classification setting
Estimating the accuracy of a hypothesis Setting Assume a binary classification setting Assume input/output pairs (x, y) are sampled from an unknown probability distribution D = p(x, y) Train a binary classifier
More informationDover- Sherborn High School Mathematics Curriculum Probability and Statistics
Mathematics Curriculum A. DESCRIPTION This is a full year courses designed to introduce students to the basic elements of statistics and probability. Emphasis is placed on understanding terminology and
More informationReview: Statistical Model
Review: Statistical Model { f θ :θ Ω} A statistical model for some data is a set of distributions, one of which corresponds to the true unknown distribution that produced the data. The statistical model
More informationClass 26: review for final exam 18.05, Spring 2014
Probability Class 26: review for final eam 8.05, Spring 204 Counting Sets Inclusion-eclusion principle Rule of product (multiplication rule) Permutation and combinations Basics Outcome, sample space, event
More informationInverse Sampling for McNemar s Test
International Journal of Statistics and Probability; Vol. 6, No. 1; January 27 ISSN 1927-7032 E-ISSN 1927-7040 Published by Canadian Center of Science and Education Inverse Sampling for McNemar s Test
More informationMATH4427 Notebook 4 Fall Semester 2017/2018
MATH4427 Notebook 4 Fall Semester 2017/2018 prepared by Professor Jenny Baglivo c Copyright 2009-2018 by Jenny A. Baglivo. All Rights Reserved. 4 MATH4427 Notebook 4 3 4.1 K th Order Statistics and Their
More informationSequential Monitoring of Clinical Trials Session 4 - Bayesian Evaluation of Group Sequential Designs
Sequential Monitoring of Clinical Trials Session 4 - Bayesian Evaluation of Group Sequential Designs Presented August 8-10, 2012 Daniel L. Gillen Department of Statistics University of California, Irvine
More informationCHAPTER 9, 10. Similar to a courtroom trial. In trying a person for a crime, the jury needs to decide between one of two possibilities:
CHAPTER 9, 10 Hypothesis Testing Similar to a courtroom trial. In trying a person for a crime, the jury needs to decide between one of two possibilities: The person is guilty. The person is innocent. To
More information7. Estimation and hypothesis testing. Objective. Recommended reading
7. Estimation and hypothesis testing Objective In this chapter, we show how the election of estimators can be represented as a decision problem. Secondly, we consider the problem of hypothesis testing
More informationSTAT Exam Jam Solutions. Contents
s Contents 1 First Day 2 Question 1: PDFs, CDFs, and Finding E(X), V (X).......................... 2 Question 2: Bayesian Inference...................................... 3 Question 3: Binomial to Normal
More information2016 SISG Module 17: Bayesian Statistics for Genetics Lecture 3: Binomial Sampling
2016 SISG Module 17: Bayesian Statistics for Genetics Lecture 3: Binomial Sampling Jon Wakefield Departments of Statistics and Biostatistics University of Washington Outline Introduction and Motivating
More information2 Belief, probability and exchangeability
2 Belief, probability and exchangeability We first discuss what properties a reasonable belief function should have, and show that probabilities have these properties. Then, we review the basic machinery
More informationTopic 17: Simple Hypotheses
Topic 17: November, 2011 1 Overview and Terminology Statistical hypothesis testing is designed to address the question: Do the data provide sufficient evidence to conclude that we must depart from our
More informationExam Empirical Methods VU University Amsterdam, Faculty of Exact Sciences h, February 12, 2015
Exam Empirical Methods VU University Amsterdam, Faculty of Exact Sciences 18.30 21.15h, February 12, 2015 Question 1 is on this page. Always motivate your answers. Write your answers in English. Only the
More informationFundamentals to Biostatistics. Prof. Chandan Chakraborty Associate Professor School of Medical Science & Technology IIT Kharagpur
Fundamentals to Biostatistics Prof. Chandan Chakraborty Associate Professor School of Medical Science & Technology IIT Kharagpur Statistics collection, analysis, interpretation of data development of new
More informationFrequentist Statistics and Hypothesis Testing Spring
Frequentist Statistics and Hypothesis Testing 18.05 Spring 2018 http://xkcd.com/539/ Agenda Introduction to the frequentist way of life. What is a statistic? NHST ingredients; rejection regions Simple
More informationMathematical statistics
October 20 th, 2018 Lecture 17: Tests of Hypotheses Overview Week 1 Week 2 Week 4 Week 7 Week 10 Week 14 Probability reviews Chapter 6: Statistics and Sampling Distributions Chapter 7: Point Estimation
More informationStatistical Inference
Statistical Inference Bernhard Klingenberg Institute of Statistics Graz University of Technology Steyrergasse 17/IV, 8010 Graz www.statistics.tugraz.at February 12, 2008 Outline Estimation: Review of concepts
More information8: Hypothesis Testing
Some definitions 8: Hypothesis Testing. Simple, compound, null and alternative hypotheses In test theory one distinguishes between simple hypotheses and compound hypotheses. A simple hypothesis Examples:
More information23. MORE HYPOTHESIS TESTING
23. MORE HYPOTHESIS TESTING The Logic Behind Hypothesis Testing For simplicity, consider testing H 0 : µ = µ 0 against the two-sided alternative H A : µ µ 0. Even if H 0 is true (so that the expectation
More informationSummary of Chapters 7-9
Summary of Chapters 7-9 Chapter 7. Interval Estimation 7.2. Confidence Intervals for Difference of Two Means Let X 1,, X n and Y 1, Y 2,, Y m be two independent random samples of sizes n and m from two
More informationIntroduction to Bayesian Statistics
Bayesian Parameter Estimation Introduction to Bayesian Statistics Harvey Thornburg Center for Computer Research in Music and Acoustics (CCRMA) Department of Music, Stanford University Stanford, California
More informationTests about a population mean
October 2 nd, 2017 Overview Week 1 Week 2 Week 4 Week 7 Week 10 Week 12 Chapter 1: Descriptive statistics Chapter 6: Statistics and Sampling Distributions Chapter 7: Point Estimation Chapter 8: Confidence
More informationHypothesis Testing. Part I. James J. Heckman University of Chicago. Econ 312 This draft, April 20, 2006
Hypothesis Testing Part I James J. Heckman University of Chicago Econ 312 This draft, April 20, 2006 1 1 A Brief Review of Hypothesis Testing and Its Uses values and pure significance tests (R.A. Fisher)
More informationTesting Simple Hypotheses R.L. Wolpert Institute of Statistics and Decision Sciences Duke University, Box Durham, NC 27708, USA
Testing Simple Hypotheses R.L. Wolpert Institute of Statistics and Decision Sciences Duke University, Box 90251 Durham, NC 27708, USA Summary: Pre-experimental Frequentist error probabilities do not summarize
More informationORF 245 Fundamentals of Statistics Chapter 9 Hypothesis Testing
ORF 245 Fundamentals of Statistics Chapter 9 Hypothesis Testing Robert Vanderbei Fall 2014 Slides last edited on November 24, 2014 http://www.princeton.edu/ rvdb Coin Tossing Example Consider two coins.
More informationCENTRAL LIMIT THEOREM (CLT)
CENTRAL LIMIT THEOREM (CLT) A sampling distribution is the probability distribution of the sample statistic that is formed when samples of size n are repeatedly taken from a population. If the sample statistic
More informationDecision theory. 1 We may also consider randomized decision rules, where δ maps observed data D to a probability distribution over
Point estimation Suppose we are interested in the value of a parameter θ, for example the unknown bias of a coin. We have already seen how one may use the Bayesian method to reason about θ; namely, we
More informationThis does not cover everything on the final. Look at the posted practice problems for other topics.
Class 7: Review Problems for Final Exam 8.5 Spring 7 This does not cover everything on the final. Look at the posted practice problems for other topics. To save time in class: set up, but do not carry
More information280 CHAPTER 9 TESTS OF HYPOTHESES FOR A SINGLE SAMPLE Tests of Statistical Hypotheses
280 CHAPTER 9 TESTS OF HYPOTHESES FOR A SINGLE SAMPLE 9-1.2 Tests of Statistical Hypotheses To illustrate the general concepts, consider the propellant burning rate problem introduced earlier. The null
More informationConfidence Intervals and Hypothesis Tests
Confidence Intervals and Hypothesis Tests STA 281 Fall 2011 1 Background The central limit theorem provides a very powerful tool for determining the distribution of sample means for large sample sizes.
More informationME3620. Theory of Engineering Experimentation. Spring Chapter IV. Decision Making for a Single Sample. Chapter IV
Theory of Engineering Experimentation Chapter IV. Decision Making for a Single Sample Chapter IV 1 4 1 Statistical Inference The field of statistical inference consists of those methods used to make decisions
More informationChapter 5: HYPOTHESIS TESTING
MATH411: Applied Statistics Dr. YU, Chi Wai Chapter 5: HYPOTHESIS TESTING 1 WHAT IS HYPOTHESIS TESTING? As its name indicates, it is about a test of hypothesis. To be more precise, we would first translate
More informationBasic Concepts of Inference
Basic Concepts of Inference Corresponds to Chapter 6 of Tamhane and Dunlop Slides prepared by Elizabeth Newton (MIT) with some slides by Jacqueline Telford (Johns Hopkins University) and Roy Welsch (MIT).
More informationMTH U481 : SPRING 2009: PRACTICE PROBLEMS FOR FINAL
MTH U481 : SPRING 2009: PRACTICE PROBLEMS FOR FINAL 1). Two urns are provided as follows: urn 1 contains 2 white chips and 4 red chips, while urn 2 contains 5 white chips and 3 red chips. One chip is chosen
More information2. A Basic Statistical Toolbox
. A Basic Statistical Toolbo Statistics is a mathematical science pertaining to the collection, analysis, interpretation, and presentation of data. Wikipedia definition Mathematical statistics: concerned
More informationChapter 8 of Devore , H 1 :
Chapter 8 of Devore TESTING A STATISTICAL HYPOTHESIS Maghsoodloo A statistical hypothesis is an assumption about the frequency function(s) (i.e., PDF or pdf) of one or more random variables. Stated in
More informationHypothesis testing: theory and methods
Statistical Methods Warsaw School of Economics November 3, 2017 Statistical hypothesis is the name of any conjecture about unknown parameters of a population distribution. The hypothesis should be verifiable
More informationStatistical Methods for Astronomy
Statistical Methods for Astronomy Probability (Lecture 1) Statistics (Lecture 2) Why do we need statistics? Useful Statistics Definitions Error Analysis Probability distributions Error Propagation Binomial
More informationBayesian Inference. Introduction
Bayesian Inference Introduction The frequentist approach to inference holds that probabilities are intrinsicially tied (unsurprisingly) to frequencies. This interpretation is actually quite natural. What,
More informationFundamentals. CS 281A: Statistical Learning Theory. Yangqing Jia. August, Based on tutorial slides by Lester Mackey and Ariel Kleiner
Fundamentals CS 281A: Statistical Learning Theory Yangqing Jia Based on tutorial slides by Lester Mackey and Ariel Kleiner August, 2011 Outline 1 Probability 2 Statistics 3 Linear Algebra 4 Optimization
More informationStatistical Inference
Statistical Inference Classical and Bayesian Methods Class 6 AMS-UCSC Thu 26, 2012 Winter 2012. Session 1 (Class 6) AMS-132/206 Thu 26, 2012 1 / 15 Topics Topics We will talk about... 1 Hypothesis testing
More informationPrecept 4: Hypothesis Testing
Precept 4: Hypothesis Testing Soc 500: Applied Social Statistics Ian Lundberg Princeton University October 6, 2016 Learning Objectives 1 Introduce vectorized R code 2 Review homework and talk about RMarkdown
More informationAn Informal Introduction to Statistics in 2h. Tim Kraska
An Informal Introduction to Statistics in 2h Tim Kraska Goal of this Lecture This is not a replacement for a proper introduction to probability and statistics Instead, it only tries to convey the very
More informationCHAPTER 8. Test Procedures is a rule, based on sample data, for deciding whether to reject H 0 and contains:
CHAPTER 8 Test of Hypotheses Based on a Single Sample Hypothesis testing is the method that decide which of two contradictory claims about the parameter is correct. Here the parameters of interest are
More informationMTMS Mathematical Statistics
MTMS.01.099 Mathematical Statistics Lecture 12. Hypothesis testing. Power function. Approximation of Normal distribution and application to Binomial distribution Tõnu Kollo Fall 2016 Hypothesis Testing
More informationNon-parametric Inference and Resampling
Non-parametric Inference and Resampling Exercises by David Wozabal (Last update. Juni 010) 1 Basic Facts about Rank and Order Statistics 1.1 10 students were asked about the amount of time they spend surfing
More informationIntroduction to Probabilistic Machine Learning
Introduction to Probabilistic Machine Learning Piyush Rai Dept. of CSE, IIT Kanpur (Mini-course 1) Nov 03, 2015 Piyush Rai (IIT Kanpur) Introduction to Probabilistic Machine Learning 1 Machine Learning
More informationSolution: First note that the power function of the test is given as follows,
Problem 4.5.8: Assume the life of a tire given by X is distributed N(θ, 5000 ) Past experience indicates that θ = 30000. The manufacturere claims the tires made by a new process have mean θ > 30000. Is
More informationLECTURE 5. Introduction to Econometrics. Hypothesis testing
LECTURE 5 Introduction to Econometrics Hypothesis testing October 18, 2016 1 / 26 ON TODAY S LECTURE We are going to discuss how hypotheses about coefficients can be tested in regression models We will
More informationSTAT 4385 Topic 01: Introduction & Review
STAT 4385 Topic 01: Introduction & Review Xiaogang Su, Ph.D. Department of Mathematical Science University of Texas at El Paso xsu@utep.edu Spring, 2016 Outline Welcome What is Regression Analysis? Basics
More informationF79SM STATISTICAL METHODS
F79SM STATISTICAL METHODS SUMMARY NOTES 9 Hypothesis testing 9.1 Introduction As before we have a random sample x of size n of a population r.v. X with pdf/pf f(x;θ). The distribution we assign to X is
More informationMath 101: Elementary Statistics Tests of Hypothesis
Tests of Hypothesis Department of Mathematics and Computer Science University of the Philippines Baguio November 15, 2018 Basic Concepts of Statistical Hypothesis Testing A statistical hypothesis is an
More informationChapter 7: Hypothesis Testing - Solutions
Chapter 7: Hypothesis Testing - Solutions 7.1 Introduction to Hypothesis Testing The problem with applying the techniques learned in Chapter 5 is that typically, the population mean (µ) and standard deviation
More informationReview. December 4 th, Review
December 4 th, 2017 Att. Final exam: Course evaluation Friday, 12/14/2018, 10:30am 12:30pm Gore Hall 115 Overview Week 2 Week 4 Week 7 Week 10 Week 12 Chapter 6: Statistics and Sampling Distributions Chapter
More informationSpace Telescope Science Institute statistics mini-course. October Inference I: Estimation, Confidence Intervals, and Tests of Hypotheses
Space Telescope Science Institute statistics mini-course October 2011 Inference I: Estimation, Confidence Intervals, and Tests of Hypotheses James L Rosenberger Acknowledgements: Donald Richards, William
More informationIntroduction: MLE, MAP, Bayesian reasoning (28/8/13)
STA561: Probabilistic machine learning Introduction: MLE, MAP, Bayesian reasoning (28/8/13) Lecturer: Barbara Engelhardt Scribes: K. Ulrich, J. Subramanian, N. Raval, J. O Hollaren 1 Classifiers In this
More informationPurposes of Data Analysis. Variables and Samples. Parameters and Statistics. Part 1: Probability Distributions
Part 1: Probability Distributions Purposes of Data Analysis True Distributions or Relationships in the Earths System Probability Distribution Normal Distribution Student-t Distribution Chi Square Distribution
More informationThe t-distribution. Patrick Breheny. October 13. z tests The χ 2 -distribution The t-distribution Summary
Patrick Breheny October 13 Patrick Breheny Biostatistical Methods I (BIOS 5710) 1/25 Introduction Introduction What s wrong with z-tests? So far we ve (thoroughly!) discussed how to carry out hypothesis
More informationUnobservable Parameter. Observed Random Sample. Calculate Posterior. Choosing Prior. Conjugate prior. population proportion, p prior:
Pi Priors Unobservable Parameter population proportion, p prior: π ( p) Conjugate prior π ( p) ~ Beta( a, b) same PDF family exponential family only Posterior π ( p y) ~ Beta( a + y, b + n y) Observed
More informationLinear Models: Comparing Variables. Stony Brook University CSE545, Fall 2017
Linear Models: Comparing Variables Stony Brook University CSE545, Fall 2017 Statistical Preliminaries Random Variables Random Variables X: A mapping from Ω to ℝ that describes the question we care about
More information