Statistical Testing I. De gustibus non est disputandum

Size: px
Start display at page:

Download "Statistical Testing I. De gustibus non est disputandum"

Transcription

1 Statistical Testing I De gustibus non est disputandum

2 The Pepsi Challenge "Take the Pepsi Challenge" was the motto of a marketing campaign by the Pepsi-Cola Company in the 1980's. A total of 100 Coca-Cola drinkers were asked to blindly taste unmarked cups of Diet Pepsi and Diet Coke, and to select their favorite. A subsequent Pepsi TV commercial stated "... in recent blind taste tests, more than half of all Diet Coke drinkers surveyed said they preferred the taste of Diet Pepsi". Assume that, out of the 100 Diet Coke drinkers, 56 preferred Diet Pepsi. Would this result support the claim that more than half of all Diet Coke drinkers prefer Diet Pepsi to Diet Coke?

3 "Scientific Method" "The validity of knowledge is tied to the probability of falsification." Karl Popper ( ) "Scientific propositions can be falsified empirically. On the other hand, unscientific claims are always 'right' and cannot be falsified at all."

4 Statistical Testing New Knowledge Through Falsification current knowledge new knowledge H 0 falsification H A

5 Decision Making - Scientific questions are often formulated in the form of mutually exclusive hypotheses (i.e. H 0 versus H A ) about one or more population parameters. - A statistical test is a decision rule that allows a researcher to either reject H 0 ("statistically significant result") or maintain H 0 on the basis of sample data.

6 Statistical Testing Null Hypothesis The null hypothesis usually implies the opposite of what a researcher expects (or wishes) to be true. It often represents conservatism or common opinion. H 0 : The expected diastolic blood pressure of patients with a particular disease equals that of control individuals.

7 Statistical Testing Alternative Hypothesis The alternative hypothesis usually implies what a researcher expects (or wishes) to be true. The alternative hypothesis is regarded as established when the null hypothesis is rejected. H A : The expected diastolic blood pressure of patients with a particular disease differs from that of control individuals.

8 Blood Pressure and Myocardial Infarction A study was carried out to assess whether the expected diastolic blood pressure (DBP) of patients with myocardial infarction (MI) differs from the expected DBP of control individuals, namely 80 mmhg. H 0 : µ=µ 0 H A : µ µ 0

9 Statistical Testing Procedure - All information from the sample data is collapsed in a single numerical quantity, called the test statistic (T). - The maintenance region of the test comprises all values of T for which H 0 is maintained. - The rejection region comprises all values of T for which H 0 is rejected. - The maintenance and rejection regions are demarcated by the critical values.

10 Statistical Testing Procedure critical value critical value H 0 rejection region maintenance region rejection region T T in maintenance region T in rejection region maintain H 0 reject H 0

11 Statistical Testing Possible Errors A type I error is made when H 0 is rejected although it is true. A type II error is made when H 0 is maintained although it is wrong. truth decision maintain H 0 reject H 0 H 0 correct type I error H A type II error correct

12 Statistical Testing Significance Level - A statistical test has significance level α if the probability of making a type I error is at most α. - Before data collection, the critical values of a test are chosen such that the test has a pre-specified significance level (e.g. 0.05). - The choice of critical values depends upon the prespecified significance level and the nature of H 0, but not the nature of H A.

13 Blood Pressure and Myocardial Infarction H 0 : µ=µ 0 H A : µ µ 0 The significance level of a test of H 0 versus H A limits the probability of erroneously claiming a difference in expected DBP between MI patients and control individuals.

14 Statistical Testing Critical Values H 0 α/2 α/2 c α/2 c 1-α/2 T

15 One-sample t-test Procedure Random Variable Hypotheses Test Statistic Rejection Region X N(µ,σ 2 ) both parameters unknown H : µ 0 X µ = 0 H A : µ µ 0 S/ µ 0 T = n T t α/2,n-1 or T t 1-α/2,n-1 =-t α/2,n-1 'degrees of freedom' (ν)

16 Blood Pressure and Myocardial Infarction A study was carried out to assess whether the expected diastolic blood pressure (DBP) of patients with myocardial infarction (MI) differs from the expected DBP of control individuals, namely 80 mmhg. The following DBP values were observed in 9 patients with MI: 92, 87, 79, 87, 99, 82, 74, 83, 103 x = mmhg s = 9.34 mmhg t = t.975, 8 0 = 2.306

17 t-distribution Quantiles

18 Statistical Testing Power - The probability of making a type II error (i.e. to adhere to H 0 if, in fact, H A is true) is designated as β. - The converse probability 1-β, i.e. the probability of avoiding a type II error, is called the power of a test. - The power of a statistical test depends upon the nature of H A, but not the nature of H 0.

19 Statistical Testing Error Probabilities truth decision H 0 H A maintain H 0 1-α β reject H 0 α 1-β

20 Statistical Testing Critical Values H 0 H A α/2 α/2 c α/2 c 1-α/2 β T

21 Blood Pressure and Myocardial Infarction H 0 : µ=80 H A : µ 80 σ=10 mmhg µ P µ (T , T 2.306) α=0.05 H H A 81 (79) 85 (75) 90 (70) β 1-β 1-β

22 Statistical Testing Effect Size and Power H 0 H A α/2 α/2 c α/2 c 1-α/2 β T

23 Statistical Testing Significance and Power H 0 H A α'/2 β' α'/2 T c α'/2 c 1-α'/2

24 t-distribution Quantiles

25 Blood Pressure and Myocardial Infarction H 0 : µ=80 H A : µ 80 σ=10 mmhg µ P µ (T , T 2.896) α=0.02 H H A 81 (79) 85 (75) 90 (70) β 1-β 1-β

26 Alternative Hypotheses Two-Sided A two-sided alternative hypothesis does not specify a direction of the expected findings and usually - reflects a lack of prior knowledge about realistic alternatives to the null hypothesis - reads "is different from" or "deviates from" H A : The expected diastolic blood pressure of patients with a particular disease differs from that of control individuals.

27 Alternative Hypotheses Two-Sided H A (?) H 0 H A α/2 α/2 c α/2 c 1-α/2 β T

28 Alternative Hypotheses One-Sided H 0 H A β c 1-α α T

29 Alternative Hypotheses One-Sided A one-sided alternative hypothesis specifies the direction of the expected findings and usually - reflects common sense or suitable knowledge from previous scientific experiments - reads "is larger than", "is heavier than" or "is longer than" H A : The expected diastolic blood pressure of patients with a particular disease exceeds that of control individuals.

30 Clinical Studies In a clinical study, researchers often wish to compare the respective probability of therapeutic success between a new medication (π M ) and placebo (π P ). H A : π M >π P H 0 : π M π P significance level power upper limit for the probability to declare a useless medication effective probability to recognise an effective medication as effective

31 One-sample t-test One-Sided Random Variable Hypotheses Test Statistic X N(µ,σ 2 ) both parameters unknown or X S/ H : µ 0 0 H : µ < µ µ 0 A 0 µ 0 H A : µ > µ 0 H : µ µ 0 T = n Rejection Region or T t α,n-1 T t 1-α,n-1

32 t-distribution Quantiles

33 Blood Pressure and Myocardial Infarction H 0 : µ 80 H A : µ>80 σ=10 mmhg µ P µ (T 1.860) α=0.02 P µ ( T 2.306) H H A β 1-β

34 One-sample t-test Sample Size Which sample size, n, is required to detect, at significance level α, a given effect µ-µ 0 with power 1-β? one-sided two-sided n σ z 1 α + z 1 µ µ 0 β 2 n σ z 1 + z α/2 µ µ 0 1 β 2

35 One-sample t-test Sample Size (one-sided) 1000 σ = 10 α = β = 0.90, 0.80, 0.70 n µ µ 0

36 One-sample t-test Sample Size (two-sided) 1000 σ = 10 α = β = 0.90, 0.80, 0.70 n µ µ 0

37 The Pepsi Challenge H 0 : Pepsi does not taste better than Coke (π 0.5). H A : Pepsi tastes better than Coke (π>0.5). P P i 100 i ( T 59) = = = i 59 i i 100 i ( T 58) = = = i 58 i c 0.05 = 59 Conclusion: The number of Diet Coke drinkers who preferred Diet Pepsi (i.e. 56) was not significantly higher than the number who preferred Diet Coke (i.e. 44).

38 Statistics and Truth Egon Pearson ( ) Jerzy Neyman ( ) "No test based upon the theory of probability can by itself provide any valuable evidence of the truth or falsehood of a hypothesis." Neyman J, Pearson E (1933) Phil Trans R Soc A, 231:

39 Statistics and Truth Ronald A. Fisher ( ) "It would, therefore, add greatly to the clarity with which the tests of significance are regarded if it were generally understood that the tests of significance, when used accurately, are capable of rejecting or invalidating hypotheses, in so far as they are contradicted by the data: but that they are never capable of establishing them as certainly true."

40 p Value H 0 p T t obs The p-value is the probability of obtaining the observed, or an even less probable, value of T than t obs when the null hypothesis is correct.

41 p Value Evidence Against H 0 p value evidence none "moderate" "strong" "very strong"

42 Blood Pressure and Myocardial Infarction H 0 : µ 80 H A : µ>80 p = P(T>2.354) = H 0 : µ=80 H A : µ 80 p = P( T >2.354) = The Pepsi Challenge H 0 : π 0.5 H A : π>0.5 p 100 = P i = 56 i ( ) 100 i 100 i X 56 = =

43 Pravastatin and Cardiovascular Disease major cardiovascular outcome placebo (n=2078) Pravastatin (n=2081) p non-fatal MI or death from CHD CABG or PTCA <0.001 Stroke CAGB: coronary artery bypass grafting, PTCA: percutaneous transluminal coronary angioplasty Sacks FM et al. (1996) N Engl J Med 335:

44 Negative Findings Negative findings are as important as positive findings because they reduce ignorance and may suggest interesting new hypotheses and lines of investigation. They are also necessary to guide future research in the field of interest (publication bias).

45 Summary -Statistical problems are usually defined as mutally exclusive hypotheses about population parameters. -Statistical tests are decision rules to either maintain or reject a given null hypothesis on the basis of sample data. -When performing a statistical test, two types of error can occur through falsely rejecting either the null hypothesis or the alternative hypothesis. -The probability of making a type I error is limited by the significance level of the test; the probability of avoiding a type II error is called the power of the test. -The p value is a measure of the discrepancy between the data and the null hypothesis.

Sample Size and Power I: Binary Outcomes. James Ware, PhD Harvard School of Public Health Boston, MA

Sample Size and Power I: Binary Outcomes. James Ware, PhD Harvard School of Public Health Boston, MA Sample Size and Power I: Binary Outcomes James Ware, PhD Harvard School of Public Health Boston, MA Sample Size and Power Principles: Sample size calculations are an essential part of study design Consider

More information

Statistics for Managers Using Microsoft Excel/SPSS Chapter 8 Fundamentals of Hypothesis Testing: One-Sample Tests

Statistics for Managers Using Microsoft Excel/SPSS Chapter 8 Fundamentals of Hypothesis Testing: One-Sample Tests Statistics for Managers Using Microsoft Excel/SPSS Chapter 8 Fundamentals of Hypothesis Testing: One-Sample Tests 1999 Prentice-Hall, Inc. Chap. 8-1 Chapter Topics Hypothesis Testing Methodology Z Test

More information

5.2 Tests of Significance

5.2 Tests of Significance 5.2 Tests of Significance Example 5.7. Diet colas use artificial sweeteners to avoid sugar. Colas with artificial sweeteners gradually lose their sweetness over time. Manufacturers therefore test new colas

More information

Fundamental Probability and Statistics

Fundamental Probability and Statistics Fundamental Probability and Statistics "There are known knowns. These are things we know that we know. There are known unknowns. That is to say, there are things that we know we don't know. But there are

More information

The Purpose of Hypothesis Testing

The Purpose of Hypothesis Testing Section 8 1A:! An Introduction to Hypothesis Testing The Purpose of Hypothesis Testing See s Candy states that a box of it s candy weighs 16 oz. They do not mean that every single box weights exactly 16

More information

Parameter Estimation, Sampling Distributions & Hypothesis Testing

Parameter Estimation, Sampling Distributions & Hypothesis Testing Parameter Estimation, Sampling Distributions & Hypothesis Testing Parameter Estimation & Hypothesis Testing In doing research, we are usually interested in some feature of a population distribution (which

More information

4 Hypothesis testing. 4.1 Types of hypothesis and types of error 4 HYPOTHESIS TESTING 49

4 Hypothesis testing. 4.1 Types of hypothesis and types of error 4 HYPOTHESIS TESTING 49 4 HYPOTHESIS TESTING 49 4 Hypothesis testing In sections 2 and 3 we considered the problem of estimating a single parameter of interest, θ. In this section we consider the related problem of testing whether

More information

http://www.math.uah.edu/stat/hypothesis/.xhtml 1 of 5 7/29/2009 3:14 PM Virtual Laboratories > 9. Hy pothesis Testing > 1 2 3 4 5 6 7 1. The Basic Statistical Model As usual, our starting point is a random

More information

Testing a secondary endpoint after a group sequential test. Chris Jennison. 9th Annual Adaptive Designs in Clinical Trials

Testing a secondary endpoint after a group sequential test. Chris Jennison. 9th Annual Adaptive Designs in Clinical Trials Testing a secondary endpoint after a group sequential test Christopher Jennison Department of Mathematical Sciences, University of Bath, UK http://people.bath.ac.uk/mascj 9th Annual Adaptive Designs in

More information

8. MORE PROBABILITY; INDEPENDENCE

8. MORE PROBABILITY; INDEPENDENCE 8. MORE PROBABILITY; INDEPENDENCE Combining Events: The union A B is the event consisting of all outcomes in A or in B or in both. The intersection A B is the event consisting of all outcomes in both A

More information

Power and sample size calculations

Power and sample size calculations Power and sample size calculations Susanne Rosthøj Biostatistisk Afdeling Institut for Folkesundhedsvidenskab Københavns Universitet sr@biostat.ku.dk April 8, 2014 Planning an investigation How many individuals

More information

Review. December 4 th, Review

Review. December 4 th, Review December 4 th, 2017 Att. Final exam: Course evaluation Friday, 12/14/2018, 10:30am 12:30pm Gore Hall 115 Overview Week 2 Week 4 Week 7 Week 10 Week 12 Chapter 6: Statistics and Sampling Distributions Chapter

More information

Chapter 9. Hypothesis testing. 9.1 Introduction

Chapter 9. Hypothesis testing. 9.1 Introduction Chapter 9 Hypothesis testing 9.1 Introduction Confidence intervals are one of the two most common types of statistical inference. Use them when our goal is to estimate a population parameter. The second

More information

What is a Hypothesis?

What is a Hypothesis? What is a Hypothesis? A hypothesis is a claim (assumption) about a population parameter: population mean Example: The mean monthly cell phone bill in this city is μ = $42 population proportion Example:

More information

Tests about a population mean

Tests about a population mean October 2 nd, 2017 Overview Week 1 Week 2 Week 4 Week 7 Week 10 Week 12 Chapter 1: Descriptive statistics Chapter 6: Statistics and Sampling Distributions Chapter 7: Point Estimation Chapter 8: Confidence

More information

Partitioning the Parameter Space. Topic 18 Composite Hypotheses

Partitioning the Parameter Space. Topic 18 Composite Hypotheses Topic 18 Composite Hypotheses Partitioning the Parameter Space 1 / 10 Outline Partitioning the Parameter Space 2 / 10 Partitioning the Parameter Space Simple hypotheses limit us to a decision between one

More information

Basic Concepts of Inference

Basic Concepts of Inference Basic Concepts of Inference Corresponds to Chapter 6 of Tamhane and Dunlop Slides prepared by Elizabeth Newton (MIT) with some slides by Jacqueline Telford (Johns Hopkins University) and Roy Welsch (MIT).

More information

Econ 325: Introduction to Empirical Economics

Econ 325: Introduction to Empirical Economics Econ 325: Introduction to Empirical Economics Chapter 9 Hypothesis Testing: Single Population Ch. 9-1 9.1 What is a Hypothesis? A hypothesis is a claim (assumption) about a population parameter: population

More information

1 MA421 Introduction. Ashis Gangopadhyay. Department of Mathematics and Statistics. Boston University. c Ashis Gangopadhyay

1 MA421 Introduction. Ashis Gangopadhyay. Department of Mathematics and Statistics. Boston University. c Ashis Gangopadhyay 1 MA421 Introduction Ashis Gangopadhyay Department of Mathematics and Statistics Boston University c Ashis Gangopadhyay 1.1 Introduction 1.1.1 Some key statistical concepts 1. Statistics: Art of data analysis,

More information

Null Hypothesis Significance Testing p-values, significance level, power, t-tests Spring 2017

Null Hypothesis Significance Testing p-values, significance level, power, t-tests Spring 2017 Null Hypothesis Significance Testing p-values, significance level, power, t-tests 18.05 Spring 2017 Understand this figure f(x H 0 ) x reject H 0 don t reject H 0 reject H 0 x = test statistic f (x H 0

More information

Philosophy and History of Statistics

Philosophy and History of Statistics Philosophy and History of Statistics YES, they ARE important!!! Dr Mick Wilkinson Fellow of the Royal Statistical Society The plan (Brief) history of statistics Philosophy of science Variability and Probability

More information

E509A: Principle of Biostatistics. GY Zou

E509A: Principle of Biostatistics. GY Zou E509A: Principle of Biostatistics (Week 4: Inference for a single mean ) GY Zou gzou@srobarts.ca Example 5.4. (p. 183). A random sample of n =16, Mean I.Q is 106 with standard deviation S =12.4. What

More information

Probability and Probability Distributions. Dr. Mohammed Alahmed

Probability and Probability Distributions. Dr. Mohammed Alahmed Probability and Probability Distributions 1 Probability and Probability Distributions Usually we want to do more with data than just describing them! We might want to test certain specific inferences about

More information

Lecture 8: Summary Measures

Lecture 8: Summary Measures Lecture 8: Summary Measures Dipankar Bandyopadhyay, Ph.D. BMTRY 711: Analysis of Categorical Data Spring 2011 Division of Biostatistics and Epidemiology Medical University of South Carolina Lecture 8:

More information

Statistical Inference. Hypothesis Testing

Statistical Inference. Hypothesis Testing Statistical Inference Hypothesis Testing Previously, we introduced the point and interval estimation of an unknown parameter(s), say µ and σ 2. However, in practice, the problem confronting the scientist

More information

Sampling Distributions

Sampling Distributions Sampling Distributions Sampling Distribution of the Mean & Hypothesis Testing Remember sampling? Sampling Part 1 of definition Selecting a subset of the population to create a sample Generally random sampling

More information

Chapter 5: HYPOTHESIS TESTING

Chapter 5: HYPOTHESIS TESTING MATH411: Applied Statistics Dr. YU, Chi Wai Chapter 5: HYPOTHESIS TESTING 1 WHAT IS HYPOTHESIS TESTING? As its name indicates, it is about a test of hypothesis. To be more precise, we would first translate

More information

CHAPTER 8. Test Procedures is a rule, based on sample data, for deciding whether to reject H 0 and contains:

CHAPTER 8. Test Procedures is a rule, based on sample data, for deciding whether to reject H 0 and contains: CHAPTER 8 Test of Hypotheses Based on a Single Sample Hypothesis testing is the method that decide which of two contradictory claims about the parameter is correct. Here the parameters of interest are

More information

CH.9 Tests of Hypotheses for a Single Sample

CH.9 Tests of Hypotheses for a Single Sample CH.9 Tests of Hypotheses for a Single Sample Hypotheses testing Tests on the mean of a normal distributionvariance known Tests on the mean of a normal distributionvariance unknown Tests on the variance

More information

Hypothesis Test. The opposite of the null hypothesis, called an alternative hypothesis, becomes

Hypothesis Test. The opposite of the null hypothesis, called an alternative hypothesis, becomes Neyman-Pearson paradigm. Suppose that a researcher is interested in whether the new drug works. The process of determining whether the outcome of the experiment points to yes or no is called hypothesis

More information

How is the Statistical Power of Hypothesis Tests affected by Dose Uncertainty?

How is the Statistical Power of Hypothesis Tests affected by Dose Uncertainty? How is the Statistical Power of Hypothesis Tests affected by Dose Uncertainty? by Eduard Hofer Workshop on Uncertainties in Radiation Dosimetry and their Impact on Risk Analysis, Washington, DC, May 2009

More information

a Sample By:Dr.Hoseyn Falahzadeh 1

a Sample By:Dr.Hoseyn Falahzadeh 1 In the name of God Determining ee the esize eof a Sample By:Dr.Hoseyn Falahzadeh 1 Sample Accuracy Sample accuracy: refers to how close a random sample s statistic is to the true population s value it

More information

Chapter Seven: Multi-Sample Methods 1/52

Chapter Seven: Multi-Sample Methods 1/52 Chapter Seven: Multi-Sample Methods 1/52 7.1 Introduction 2/52 Introduction The independent samples t test and the independent samples Z test for a difference between proportions are designed to analyze

More information

Sample Size Determination

Sample Size Determination Sample Size Determination 018 The number of subjects in a clinical study should always be large enough to provide a reliable answer to the question(s addressed. The sample size is usually determined by

More information

Business Statistics: Lecture 8: Introduction to Estimation & Hypothesis Testing

Business Statistics: Lecture 8: Introduction to Estimation & Hypothesis Testing Business Statistics: Lecture 8: Introduction to Estimation & Hypothesis Testing Agenda Introduction to Estimation Point estimation Interval estimation Introduction to Hypothesis Testing Concepts en terminology

More information

Mathematical statistics

Mathematical statistics November 15 th, 2018 Lecture 21: The two-sample t-test Overview Week 1 Week 2 Week 4 Week 7 Week 10 Week 14 Probability reviews Chapter 6: Statistics and Sampling Distributions Chapter 7: Point Estimation

More information

Comparison of Two Population Means

Comparison of Two Population Means Comparison of Two Population Means Esra Akdeniz March 15, 2015 Independent versus Dependent (paired) Samples We have independent samples if we perform an experiment in two unrelated populations. We have

More information

Mathematical statistics

Mathematical statistics October 20 th, 2018 Lecture 17: Tests of Hypotheses Overview Week 1 Week 2 Week 4 Week 7 Week 10 Week 14 Probability reviews Chapter 6: Statistics and Sampling Distributions Chapter 7: Point Estimation

More information

Extending the results of clinical trials using data from a target population

Extending the results of clinical trials using data from a target population Extending the results of clinical trials using data from a target population Issa Dahabreh Center for Evidence-Based Medicine, Brown School of Public Health Disclaimer Partly supported through PCORI Methods

More information

MTMS Mathematical Statistics

MTMS Mathematical Statistics MTMS.01.099 Mathematical Statistics Lecture 12. Hypothesis testing. Power function. Approximation of Normal distribution and application to Binomial distribution Tõnu Kollo Fall 2016 Hypothesis Testing

More information

Mathematical Statistics

Mathematical Statistics Mathematical Statistics MAS 713 Chapter 8 Previous lecture: 1 Bayesian Inference 2 Decision theory 3 Bayesian Vs. Frequentist 4 Loss functions 5 Conjugate priors Any questions? Mathematical Statistics

More information

Psych 230. Psychological Measurement and Statistics

Psych 230. Psychological Measurement and Statistics Psych 230 Psychological Measurement and Statistics Pedro Wolf December 9, 2009 This Time. Non-Parametric statistics Chi-Square test One-way Two-way Statistical Testing 1. Decide which test to use 2. State

More information

One- and Two-Sample Tests of Hypotheses

One- and Two-Sample Tests of Hypotheses One- and Two-Sample Tests of Hypotheses 1- Introduction and Definitions Often, the problem confronting the scientist or engineer is producing a conclusion about some scientific system. For example, a medical

More information

Null Hypothesis Significance Testing p-values, significance level, power, t-tests

Null Hypothesis Significance Testing p-values, significance level, power, t-tests Null Hypothesis Significance Testing p-values, significance level, power, t-tests 18.05 Spring 2014 January 1, 2017 1 /22 Understand this figure f(x H 0 ) x reject H 0 don t reject H 0 reject H 0 x = test

More information

Mathematical statistics

Mathematical statistics November 1 st, 2018 Lecture 18: Tests about a population mean Overview 9.1 Hypotheses and test procedures test procedures errors in hypothesis testing significance level 9.2 Tests about a population mean

More information

Power and sample size calculations

Power and sample size calculations Power and sample size calculations Susanne Rosthøj Biostatistisk Afdeling Institut for Folkesundhedsvidenskab Københavns Universitet sr@biostat.ku.dk October 28 2013 Planning an investigation How many

More information

Lecture 7: Hypothesis Testing and ANOVA

Lecture 7: Hypothesis Testing and ANOVA Lecture 7: Hypothesis Testing and ANOVA Goals Overview of key elements of hypothesis testing Review of common one and two sample tests Introduction to ANOVA Hypothesis Testing The intent of hypothesis

More information

Adaptive designs beyond p-value combination methods. Ekkehard Glimm, Novartis Pharma EAST user group meeting Basel, 31 May 2013

Adaptive designs beyond p-value combination methods. Ekkehard Glimm, Novartis Pharma EAST user group meeting Basel, 31 May 2013 Adaptive designs beyond p-value combination methods Ekkehard Glimm, Novartis Pharma EAST user group meeting Basel, 31 May 2013 Outline Introduction Combination-p-value method and conditional error function

More information

Effect of investigator bias on the significance level of the Wilcoxon rank-sum test

Effect of investigator bias on the significance level of the Wilcoxon rank-sum test Biostatistics 000, 1, 1,pp. 107 111 Printed in Great Britain Effect of investigator bias on the significance level of the Wilcoxon rank-sum test PAUL DELUCCA Biometrician, Merck & Co., Inc., 1 Walnut Grove

More information

The Design of Group Sequential Clinical Trials that Test Multiple Endpoints

The Design of Group Sequential Clinical Trials that Test Multiple Endpoints The Design of Group Sequential Clinical Trials that Test Multiple Endpoints Christopher Jennison Department of Mathematical Sciences, University of Bath, UK http://people.bath.ac.uk/mascj Bruce Turnbull

More information

Hypothesis for Means and Proportions

Hypothesis for Means and Proportions November 14, 2012 Hypothesis Tests - Basic Ideas Often we are interested not in estimating an unknown parameter but in testing some claim or hypothesis concerning a population. For example we may wish

More information

Statistics 135 Fall 2008 Final Exam

Statistics 135 Fall 2008 Final Exam Name: SID: Statistics 135 Fall 2008 Final Exam Show your work. The number of points each question is worth is shown at the beginning of the question. There are 10 problems. 1. [2] The normal equations

More information

First we look at some terms to be used in this section.

First we look at some terms to be used in this section. 8 Hypothesis Testing 8.1 Introduction MATH1015 Biostatistics Week 8 In Chapter 7, we ve studied the estimation of parameters, point or interval estimates. The construction of CI relies on the sampling

More information

Power and sample size calculations

Power and sample size calculations Patrick Breheny October 20 Patrick Breheny University of Iowa Biostatistical Methods I (BIOS 5710) 1 / 26 Planning a study Introduction What is power? Why is it important? Setup One of the most important

More information

Purposes of Data Analysis. Variables and Samples. Parameters and Statistics. Part 1: Probability Distributions

Purposes of Data Analysis. Variables and Samples. Parameters and Statistics. Part 1: Probability Distributions Part 1: Probability Distributions Purposes of Data Analysis True Distributions or Relationships in the Earths System Probability Distribution Normal Distribution Student-t Distribution Chi Square Distribution

More information

Introductory Econometrics. Review of statistics (Part II: Inference)

Introductory Econometrics. Review of statistics (Part II: Inference) Introductory Econometrics Review of statistics (Part II: Inference) Jun Ma School of Economics Renmin University of China October 1, 2018 1/16 Null and alternative hypotheses Usually, we have two competing

More information

Quantitative Analysis and Empirical Methods

Quantitative Analysis and Empirical Methods Hypothesis testing Sciences Po, Paris, CEE / LIEPP Introduction Hypotheses Procedure of hypothesis testing Two-tailed and one-tailed tests Statistical tests with categorical variables A hypothesis A testable

More information

Inferential statistics

Inferential statistics Inferential statistics Inference involves making a Generalization about a larger group of individuals on the basis of a subset or sample. Ahmed-Refat-ZU Null and alternative hypotheses In hypotheses testing,

More information

Correlation and Simple Linear Regression

Correlation and Simple Linear Regression Correlation and Simple Linear Regression Sasivimol Rattanasiri, Ph.D Section for Clinical Epidemiology and Biostatistics Ramathibodi Hospital, Mahidol University E-mail: sasivimol.rat@mahidol.ac.th 1 Outline

More information

Hypotheses Test Procedures. Is the claim wrong?

Hypotheses Test Procedures. Is the claim wrong? Hypotheses Test Procedures MATH 2300 Sections 9.1 and 9.2 Is the claim wrong? An oil company representative claims that the average price for gasoline in Lubbock is $2.30 per gallon. You think the average

More information

Hypothesis Testing. BS2 Statistical Inference, Lecture 11 Michaelmas Term Steffen Lauritzen, University of Oxford; November 15, 2004

Hypothesis Testing. BS2 Statistical Inference, Lecture 11 Michaelmas Term Steffen Lauritzen, University of Oxford; November 15, 2004 Hypothesis Testing BS2 Statistical Inference, Lecture 11 Michaelmas Term 2004 Steffen Lauritzen, University of Oxford; November 15, 2004 Hypothesis testing We consider a family of densities F = {f(x; θ),

More information

AP Statistics Ch 12 Inference for Proportions

AP Statistics Ch 12 Inference for Proportions Ch 12.1 Inference for a Population Proportion Conditions for Inference The statistic that estimates the parameter p (population proportion) is the sample proportion p ˆ. p ˆ = Count of successes in the

More information

8.1-4 Test of Hypotheses Based on a Single Sample

8.1-4 Test of Hypotheses Based on a Single Sample 8.1-4 Test of Hypotheses Based on a Single Sample Example 1 (Example 8.6, p. 312) A manufacturer of sprinkler systems used for fire protection in office buildings claims that the true average system-activation

More information

Announcements. Unit 3: Foundations for inference Lecture 3: Decision errors, significance levels, sample size, and power.

Announcements. Unit 3: Foundations for inference Lecture 3: Decision errors, significance levels, sample size, and power. Announcements Announcements Unit 3: Foundations for inference Lecture 3:, significance levels, sample size, and power Statistics 101 Mine Çetinkaya-Rundel October 1, 2013 Project proposal due 5pm on Friday,

More information

Comparing the effects of two treatments on two ordinal outcome variables

Comparing the effects of two treatments on two ordinal outcome variables Working Papers in Statistics No 2015:16 Department of Statistics School of Economics and Management Lund University Comparing the effects of two treatments on two ordinal outcome variables VIBEKE HORSTMANN,

More information

Gov 2002: 3. Randomization Inference

Gov 2002: 3. Randomization Inference Gov 2002: 3. Randomization Inference Matthew Blackwell September 10, 2015 Where are we? Where are we going? Last week: This week: What can we identify using randomization? Estimators were justified via

More information

20 Hypothesis Testing, Part I

20 Hypothesis Testing, Part I 20 Hypothesis Testing, Part I Bob has told Alice that the average hourly rate for a lawyer in Virginia is $200 with a standard deviation of $50, but Alice wants to test this claim. If Bob is right, she

More information

Lecture Testing Hypotheses: The Neyman-Pearson Paradigm

Lecture Testing Hypotheses: The Neyman-Pearson Paradigm Math 408 - Mathematical Statistics Lecture 29-30. Testing Hypotheses: The Neyman-Pearson Paradigm April 12-15, 2013 Konstantin Zuev (USC) Math 408, Lecture 29-30 April 12-15, 2013 1 / 12 Agenda Example:

More information

CHL 5225H Advanced Statistical Methods for Clinical Trials: Multiplicity

CHL 5225H Advanced Statistical Methods for Clinical Trials: Multiplicity CHL 5225H Advanced Statistical Methods for Clinical Trials: Multiplicity Prof. Kevin E. Thorpe Dept. of Public Health Sciences University of Toronto Objectives 1. Be able to distinguish among the various

More information

STAT 515 fa 2016 Lec Statistical inference - hypothesis testing

STAT 515 fa 2016 Lec Statistical inference - hypothesis testing STAT 515 fa 2016 Lec 20-21 Statistical inference - hypothesis testing Karl B. Gregory Wednesday, Oct 12th Contents 1 Statistical inference 1 1.1 Forms of the null and alternate hypothesis for µ and p....................

More information

Overview of statistical methods used in analyses with your group between 2000 and 2013

Overview of statistical methods used in analyses with your group between 2000 and 2013 Department of Epidemiology and Public Health Unit of Biostatistics Overview of statistical methods used in analyses with your group between 2000 and 2013 January 21st, 2014 PD Dr C Schindler Swiss Tropical

More information

1 Descriptive statistics. 2 Scores and probability distributions. 3 Hypothesis testing and one-sample t-test. 4 More on t-tests

1 Descriptive statistics. 2 Scores and probability distributions. 3 Hypothesis testing and one-sample t-test. 4 More on t-tests Overall Overview INFOWO Statistics lecture S3: Hypothesis testing Peter de Waal Department of Information and Computing Sciences Faculty of Science, Universiteit Utrecht 1 Descriptive statistics 2 Scores

More information

9-7: THE POWER OF A TEST

9-7: THE POWER OF A TEST CD9-1 9-7: THE POWER OF A TEST In the initial discussion of statistical hypothesis testing the two types of risks that are taken when decisions are made about population parameters based only on sample

More information

PHIL12A Section answers, 28 Feb 2011

PHIL12A Section answers, 28 Feb 2011 PHIL12A Section answers, 28 Feb 2011 Julian Jonker 1 How much do you know? Give formal proofs for the following arguments. 1. (Ex 6.18) 1 A B 2 A B 1 A B 2 A 3 A B Elim: 2 4 B 5 B 6 Intro: 4,5 7 B Intro:

More information

Hypothesis Testing Chap 10p460

Hypothesis Testing Chap 10p460 Hypothesis Testing Chap 1p46 Elements of a statistical test p462 - Null hypothesis - Alternative hypothesis - Test Statistic - Rejection region Rejection Region p462 The rejection region (RR) specifies

More information

Adaptive Designs: Why, How and When?

Adaptive Designs: Why, How and When? Adaptive Designs: Why, How and When? Christopher Jennison Department of Mathematical Sciences, University of Bath, UK http://people.bath.ac.uk/mascj ISBS Conference Shanghai, July 2008 1 Adaptive designs:

More information

NONPARAMETRIC TESTS. LALMOHAN BHAR Indian Agricultural Statistics Research Institute Library Avenue, New Delhi-12

NONPARAMETRIC TESTS. LALMOHAN BHAR Indian Agricultural Statistics Research Institute Library Avenue, New Delhi-12 NONPARAMETRIC TESTS LALMOHAN BHAR Indian Agricultural Statistics Research Institute Library Avenue, New Delhi-1 lmb@iasri.res.in 1. Introduction Testing (usually called hypothesis testing ) play a major

More information

Section 9.5. Testing the Difference Between Two Variances. Bluman, Chapter 9 1

Section 9.5. Testing the Difference Between Two Variances. Bluman, Chapter 9 1 Section 9.5 Testing the Difference Between Two Variances Bluman, Chapter 9 1 This the last day the class meets before spring break starts. Please make sure to be present for the test or make appropriate

More information

Hypothesis Testing One Sample Tests

Hypothesis Testing One Sample Tests STATISTICS Lecture no. 13 Department of Econometrics FEM UO Brno office 69a, tel. 973 442029 email:jiri.neubauer@unob.cz 12. 1. 2010 Tests on Mean of a Normal distribution Tests on Variance of a Normal

More information

Inference for Distributions Inference for the Mean of a Population

Inference for Distributions Inference for the Mean of a Population Inference for Distributions Inference for the Mean of a Population PBS Chapter 7.1 009 W.H Freeman and Company Objectives (PBS Chapter 7.1) Inference for the mean of a population The t distributions The

More information

PHP2510: Principles of Biostatistics & Data Analysis. Lecture X: Hypothesis testing. PHP 2510 Lec 10: Hypothesis testing 1

PHP2510: Principles of Biostatistics & Data Analysis. Lecture X: Hypothesis testing. PHP 2510 Lec 10: Hypothesis testing 1 PHP2510: Principles of Biostatistics & Data Analysis Lecture X: Hypothesis testing PHP 2510 Lec 10: Hypothesis testing 1 In previous lectures we have encountered problems of estimating an unknown population

More information

Application of Variance Homogeneity Tests Under Violation of Normality Assumption

Application of Variance Homogeneity Tests Under Violation of Normality Assumption Application of Variance Homogeneity Tests Under Violation of Normality Assumption Alisa A. Gorbunova, Boris Yu. Lemeshko Novosibirsk State Technical University Novosibirsk, Russia e-mail: gorbunova.alisa@gmail.com

More information

7.2 One-Sample Correlation ( = a) Introduction. Correlation analysis measures the strength and direction of association between

7.2 One-Sample Correlation ( = a) Introduction. Correlation analysis measures the strength and direction of association between 7.2 One-Sample Correlation ( = a) Introduction Correlation analysis measures the strength and direction of association between variables. In this chapter we will test whether the population correlation

More information

Summary: the confidence interval for the mean (σ 2 known) with gaussian assumption

Summary: the confidence interval for the mean (σ 2 known) with gaussian assumption Summary: the confidence interval for the mean (σ known) with gaussian assumption on X Let X be a Gaussian r.v. with mean µ and variance σ. If X 1, X,..., X n is a random sample drawn from X then the confidence

More information

Answer keys for Assignment 10: Measurement of study variables (The correct answer is underlined in bold text)

Answer keys for Assignment 10: Measurement of study variables (The correct answer is underlined in bold text) Answer keys for Assignment 10: Measurement of study variables (The correct answer is underlined in bold text) 1. A quick and easy indicator of dispersion is a. Arithmetic mean b. Variance c. Standard deviation

More information

Chapter Six: Two Independent Samples Methods 1/51

Chapter Six: Two Independent Samples Methods 1/51 Chapter Six: Two Independent Samples Methods 1/51 6.3 Methods Related To Differences Between Proportions 2/51 Test For A Difference Between Proportions:Introduction Suppose a sampling distribution were

More information

The t-distribution. Patrick Breheny. October 13. z tests The χ 2 -distribution The t-distribution Summary

The t-distribution. Patrick Breheny. October 13. z tests The χ 2 -distribution The t-distribution Summary Patrick Breheny October 13 Patrick Breheny Biostatistical Methods I (BIOS 5710) 1/25 Introduction Introduction What s wrong with z-tests? So far we ve (thoroughly!) discussed how to carry out hypothesis

More information

The Popperian image of an ideal scientific claim is that of a clear-cut experimental falsification but, in practice, such black and white outcomes

The Popperian image of an ideal scientific claim is that of a clear-cut experimental falsification but, in practice, such black and white outcomes The Popperian image of an ideal scientific claim is that of a clear-cut experimental falsification but, in practice, such black and white outcomes rarely occur. Standard techniques for gauging at least

More information

Testing and Model Selection

Testing and Model Selection Testing and Model Selection This is another digression on general statistics: see PE App C.8.4. The EViews output for least squares, probit and logit includes some statistics relevant to testing hypotheses

More information

hypotheses. P-value Test for a 2 Sample z-test (Large Independent Samples) n > 30 P-value Test for a 2 Sample t-test (Small Samples) n < 30 Identify α

hypotheses. P-value Test for a 2 Sample z-test (Large Independent Samples) n > 30 P-value Test for a 2 Sample t-test (Small Samples) n < 30 Identify α Chapter 8 Notes Section 8-1 Independent and Dependent Samples Independent samples have no relation to each other. An example would be comparing the costs of vacationing in Florida to the cost of vacationing

More information

Optimal testing of multiple hypotheses with common effect direction

Optimal testing of multiple hypotheses with common effect direction Biometrika Advance Access published April 21, 2009 Biometrika (2009), pp. 1 12 C 2009 Biometrika Trust Printed in Great Britain doi: 10.1093/biomet/asp006 Optimal testing of multiple hypotheses with common

More information

Mixtures of multiple testing procedures for gatekeeping applications in clinical trials

Mixtures of multiple testing procedures for gatekeeping applications in clinical trials Research Article Received 29 January 2010, Accepted 26 May 2010 Published online 18 April 2011 in Wiley Online Library (wileyonlinelibrary.com) DOI: 10.1002/sim.4008 Mixtures of multiple testing procedures

More information

Chapter 9. Correlation and Regression

Chapter 9. Correlation and Regression Chapter 9 Correlation and Regression Lesson 9-1/9-2, Part 1 Correlation Registered Florida Pleasure Crafts and Watercraft Related Manatee Deaths 100 80 60 40 20 0 1991 1993 1995 1997 1999 Year Boats in

More information

Chapter 7: Hypothesis Testing

Chapter 7: Hypothesis Testing Chapter 7: Hypothesis Testing *Mathematical statistics with applications; Elsevier Academic Press, 2009 The elements of a statistical hypothesis 1. The null hypothesis, denoted by H 0, is usually the nullification

More information

Epidemiology Principles of Biostatistics Chapter 10 - Inferences about two populations. John Koval

Epidemiology Principles of Biostatistics Chapter 10 - Inferences about two populations. John Koval Epidemiology 9509 Principles of Biostatistics Chapter 10 - Inferences about John Koval Department of Epidemiology and Biostatistics University of Western Ontario What is being covered 1. differences in

More information

PHILOSOPHY OF MODELING

PHILOSOPHY OF MODELING PHILOSOPHY OF MODELING 1B-1 Overview Modeling Scientific method Karl Popper Scientific method Problems in modeling Stages in model development Model complexity Model complexity Conclusions "After all,

More information

Hypothesis testing (cont d)

Hypothesis testing (cont d) Hypothesis testing (cont d) Ulrich Heintz Brown University 4/12/2016 Ulrich Heintz - PHYS 1560 Lecture 11 1 Hypothesis testing Is our hypothesis about the fundamental physics correct? We will not be able

More information

David Merritt, Cosmology and Convention, Studies in History and Philosophy of Modern Physics 57 (2017) 41 52

David Merritt, Cosmology and Convention, Studies in History and Philosophy of Modern Physics 57 (2017) 41 52 David Merritt, Cosmology and Convention, Studies in History and Philosophy of Modern Physics 57 (2017) 41 52 Main claims: 1. Dark matter and dark energy arise through the use of Popper s deprecated conventionalist

More information

NATURE OF SCIENCE & LIFE. Professor Andrea Garrison Biology 11

NATURE OF SCIENCE & LIFE. Professor Andrea Garrison Biology 11 NATURE OF SCIENCE & LIFE Professor Andrea Garrison Biology 11 Nature Science Process of asking questions 2 Nature Science Process of asking questions Questions that involve logical reasoning 3 Nature Science

More information

Summary of Chapters 7-9

Summary of Chapters 7-9 Summary of Chapters 7-9 Chapter 7. Interval Estimation 7.2. Confidence Intervals for Difference of Two Means Let X 1,, X n and Y 1, Y 2,, Y m be two independent random samples of sizes n and m from two

More information