How is the Statistical Power of Hypothesis Tests affected by Dose Uncertainty?

Similar documents
Class 24. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

QUESTION ONE Let 7C = Total Cost MC = Marginal Cost AC = Average Cost

Hypothesis testing. Data to decisions

Chapter Six: Two Independent Samples Methods 1/51

HYPOTHESIS TESTING. Hypothesis Testing

Statistical Inference

This exam contains 13 pages (including this cover page) and 10 questions. A Formulae sheet is provided with the exam.

PSY 305. Module 3. Page Title. Introduction to Hypothesis Testing Z-tests. Five steps in hypothesis testing

Hypothesis for Means and Proportions

We know from STAT.1030 that the relevant test statistic for equality of proportions is:

Chapter 3. Comparing two populations

MATH4427 Notebook 4 Fall Semester 2017/2018

Analysis of Categorical Data Three-Way Contingency Table

Two Sample Problems. Two sample problems

Power and sample size calculations

Section 9.1 (Part 2) (pp ) Type I and Type II Errors

CIVL /8904 T R A F F I C F L O W T H E O R Y L E C T U R E - 8

James H. Steiger. Department of Psychology and Human Development Vanderbilt University. Introduction Factors Influencing Power

Non-parametric methods

STAT 515 fa 2016 Lec Statistical inference - hypothesis testing

Business Statistics: Lecture 8: Introduction to Estimation & Hypothesis Testing

Partitioning the Parameter Space. Topic 18 Composite Hypotheses

Hypotheses Testing. 1-Single Mean

Statistics for IT Managers

Introductory Econometrics. Review of statistics (Part II: Inference)

Statistical Inference. Hypothesis Testing

Political Science 236 Hypothesis Testing: Review and Bootstrapping

ECON Introductory Econometrics. Lecture 2: Review of Statistics

Calculate the volume of the sphere. Give your answer correct to two decimal places. (3)

Computer Exercise 3 Answers Hypothesis Testing

PSY 307 Statistics for the Behavioral Sciences. Chapter 20 Tests for Ranked Data, Choosing Statistical Tests

Chapter 22. Comparing Two Proportions. Bin Zou STAT 141 University of Alberta Winter / 15

Chapter 4. Probability-The Study of Randomness

E509A: Principle of Biostatistics. GY Zou

Definition 3.1 A statistical hypothesis is a statement about the unknown values of the parameters of the population distribution.

T test for two Independent Samples. Raja, BSc.N, DCHN, RN Nursing Instructor Acknowledgement: Ms. Saima Hirani June 07, 2016

MTMS Mathematical Statistics

Hypothesis testing Goodness of fit Multicollinearity Prediction. Applied Statistics. Lecturer: Serena Arima

Statistical Testing I. De gustibus non est disputandum

Tests about a population mean

Summary: the confidence interval for the mean (σ 2 known) with gaussian assumption

Study Ch. 9.3, #47 53 (45 51), 55 61, (55 59)

Frequentist Statistics and Hypothesis Testing Spring

Chapter 7 Comparison of two independent samples

Statistics 135: Fall 2004 Final Exam

CBA4 is live in practice mode this week exam mode from Saturday!

Review of basic probability and statistics

Generative classifiers: The Gaussian classifier. Ata Kaban School of Computer Science University of Birmingham

Inferences About Two Proportions

Department of Economics. Business Statistics. Chapter 12 Chi-square test of independence & Analysis of Variance ECON 509. Dr.

# of 6s # of times Test the null hypthesis that the dice are fair at α =.01 significance

AMS7: WEEK 7. CLASS 1. More on Hypothesis Testing Monday May 11th, 2015

INTERVAL ESTIMATION AND HYPOTHESES TESTING

Lecture 5: ANOVA and Correlation

Introduction to Statistical Data Analysis III

Review. December 4 th, Review

Inferences About Two Population Proportions

Introductory Econometrics

401 Review. 6. Power analysis for one/two-sample hypothesis tests and for correlation analysis.

hypothesis a claim about the value of some parameter (like p)

Lecture 28 Chi-Square Analysis

VTU Edusat Programme 16

Superiority by a Margin Tests for One Proportion

CHAPTER EIGHT TESTS OF HYPOTHESES

Development of a water calorimeter for medium energy x-rays

Many natural processes can be fit to a Poisson distribution

Analysis of Variance: Part 1

6.4 Type I and Type II Errors

An Analysis of College Algebra Exam Scores December 14, James D Jones Math Section 01

For tests of hypotheses made about population, we consider. only tests having a hypothesis of. Note that under the assumption of

Guidance document on How to perform a shredder campaign Background information

STAT Section 3.4: The Sign Test. The sign test, as we will typically use it, is a method for analyzing paired data.

Probability Methods in Civil Engineering Prof. Dr. Rajib Maity Department of Civil Engineering Indian Institution of Technology, Kharagpur

STAC51: Categorical data Analysis

CHL 5225 H Crossover Trials. CHL 5225 H Crossover Trials

Chapter 7: Hypothesis Testing

Acknowledge error Smaller samples, less spread

Detection theory 101 ELEC-E5410 Signal Processing for Communications

Statistics Revision Questions Nov 2016 [175 marks]

Relax and good luck! STP 231 Example EXAM #2. Instructor: Ela Jackiewicz

Lecture Slides. Elementary Statistics. by Mario F. Triola. and the Triola Statistics Series

ANOVA - analysis of variance - used to compare the means of several populations.

The Purpose of Hypothesis Testing

Statistical Process Control (contd... )

Lecture Slides. Section 13-1 Overview. Elementary Statistics Tenth Edition. Chapter 13 Nonparametric Statistics. by Mario F.

Topic 10: Hypothesis Testing

9-7: THE POWER OF A TEST

Modeling Uncertainty in the Earth Sciences Jef Caers Stanford University

Revision, DP16, May 2017

Beyond p values and significance. "Accepting the null hypothesis" Power Utility of a result. Cohen Empirical Methods CS650

Sign test. Josemari Sarasola - Gizapedia. Statistics for Business. Josemari Sarasola - Gizapedia Sign test 1 / 13

Mathematical statistics

Sequential Analysis & Testing Multiple Hypotheses,

Sample Size Determination

APPENDIX 1 NEYMAN PEARSON CRITERIA

LECTURE 5 HYPOTHESIS TESTING

Preliminary Statistics Lecture 2: Probability Theory (Outline) prelimsoas.webs.com

Psych 230. Psychological Measurement and Statistics

ISQS 5349 Final Exam, Spring 2017.

The Difference in Proportions Test

Transcription:

How is the Statistical Power of Hypothesis Tests affected by Dose Uncertainty? by Eduard Hofer Workshop on Uncertainties in Radiation Dosimetry and their Impact on Risk Analysis, Washington, DC, May 2009

The Ideal Case: Hypothesis Test Observed disease vector: (population of n individuals) Given dose vector: y = (y 1, y 2,..., y n ) x = (x 1,x 2,...,x n ) Calculate the value of the test statistic and reject the null hypothesis if this value exceeds the critical value 2

Calculation of Statistical Power k disease vectors are generated by MC simulation under the alternative hypothesis using the given given dose vector dose vector y 1 = (y 1,1, y 1,2,...,y 1,n ) x = (x 1, x 2,..., x n ) y 2 = (y 2,1, y 2,2,...,y 2,n ) y k = (y k,1, y k,2,...,y k,n ) Calculate the test statistic for each disease vector and compare to critical value Estimate of the statistical power: Fraction of disease vectors with value of the test statistic above critical value 3

The Realistic Case Hypothesis Test Observed disease vector (population of n individuals) m possibly true dose vectors y = (y 1, y 2,..., y n ) x 1 = (x 1,1,x 1,2,...,x 1,n ) x 2 = (x 2,1,x 2,2,...,x 2,n )... x m = (x m,1,x m,2,...,x m,n ) Calculate the value of the test statistic for each dose vector (= m test results) and compare it to the corresponding critical value 4

What is obtained from the m test results? An estimate of the subjective probability for rejection of the null hypothesis This is the fraction of the m dose vectors with value of the test statistic beyond the critical value. The decision of whether to reject or not to reject the null hypothesis A large number of disease vectors is generated under the null hypothesis. The test is performed with each using each of the m dose vectors. This gives an empirical distribution of subjective probabilities for rejection of the null hypothesis although it is true. The null hypothesis is rejected for the observed disease vector y if the subjective probability exceeds the (1-a)* 100% quantile of the empirical distribution mentioned above, where a is the chosen false positive probability. The value of this quantile is denoted by d. 5

Calculation of Statistical Power 1. We do not know which of the m dose vectors is true, i.e.the disease generating one. 2. We generate a sample of k disease vectors under the alternative hypothesis using each of the m dose vectors separately as only one of them can be true. 3. As the hypothesis test with the observed disease vector will use all m dose vectors, the calculation of the power of the test will also have to use all m dose vectors with each of the generated samples of k disease vectors. 6

Assumed disease generating dose vector x j : x j,1 x j,2... x j,n generates (under the alternative hypothesis) the disease vectors y 1,j = (y 1,j,1 y 1,j,2... y 1,j,n ) y 2,j = (y 2,j,1 y 2,j,2... y 2,j ) y k,j = (y k,j,1 y k,j,2... y k,j,n ) Perform the hypothesis test with all m dose vectors for each of these k disease vectors and decide for rejection of the null hypothesis if the subjective probability for rejection is above d., The fraction of the k disease vectors with subjective probability for rejection of the null hypothesis above d, is an estimate of the statistical power (assuming Xj is true, i.e. disease generating). 7

This gives m possibly true values of the statistical power (one for each of the m possibly true dose vectors) It follows: 1. An empirical subjective probability distribution of the statistical power is obtained. 2. From this distribution we read an estimate of the subjective probability for the power to be at least as large as desired. This is all that can be said about the statistical power in the realistic case where dose values are uncertain! 8

Practical Examples We consider: A population of n = 3000 individuals (males and females in about equal proportions); Four random sets of m = 100 possibly true dose vectors each: Set Description 1 Distribution information on k95 factors and on median dose values as in HTDS; Shared uncertainties resulting in a sample correlation of 0.6 between the 100 dose values of any two individuals; 2 Like set 1 but no uncertainties shared; 3 Distribution information on k95 factors as in HTDS but on median dose values changed (m*1.3,s/8); Shared uncertainties resulting in a sample correlation of 0.6; 4 Like set 3 but no uncertainties shared; A linear dose response relationship, assuming a coefficient of 0.0169 Gy -1 and background probabilities of 0.003 for men and 0.007 for women; A false positive probability of a = 0.05. 9

Dose vector set 1: Shared uncertainties resulting in sample correlation of 0.6; Distribution of k95 factors and of median dose values as in HTDS; The false positive fraction is 0.0506 for both types of test. 1 0.8 0.6 0.4 0.2 0 0.694 1 8 15 22 29 36 43 50 57 64 71 78 85 92 99 Cumulative Subjective Probability [% hypothesis test with median dose values only hypothesis test with the m dose values 10

Dose vector set 2: No uncertainties shared;distribution of k95 factors and of median dose values as in HTDS; The false positive fraction is 0.0479 for both types of test. 1 0.8 0.6 0.4 0.2 0 0.798 1 8 15 22 29 36 43 50 57 64 71 78 85 92 99 Cumulative Subjective Probability [%] hypothesis test with median dose values only hypothesis test with the m dose values 11

Density Function Comparison of four distributions of Median Values (95% quantile of medians as in HTDS) 0.008 0.007 (6.1292;0.1211) 0.006 (4.7349;0.9687) 0.005 (5.93;0.2422) 0.004 (5.5316;0.4844) 0.003 0.002 0.001 0 0 200 400 600 800 1000 1200 1400 Median Dose Values [mgy] 12

Dose vector set 3: Shared uncertainties resulting in sample correlation of 0.6; Distribution of k95 factors as in HTDS but for median dose values changed (m*1.3,s/8) ; The false positive fraction is 0.0453 for both types of test. 1 0.8 0.6 0.4 0.2 0 0.328 1 8 15 22 29 36 43 50 57 64 71 78 85 92 99 Cumulative Subjective Probability [%] hypothesis test with median dose values only hypothesis test with the m dose values 13

Statistical Power (using100 dose vectors in the hypothesis test) Comparison of Statistical Power for Vector Set 3 1 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 14 Statistical Power (using only the vector of median values in the hypothesis test)

Power (test using vectors) minus Power (test using medians) 0.6 0.5 0.4 0.3 0.2 0.1 0 0 500 1000 1500 2000 2500 3000 Standard Deviation of the Disease Generating Dose Vector [mgy] 15

Statistical Power (using m dose vectors in the test) 1.2 1 0.8 0.6 0.4 0.2 0 0 500 1000 1500 2000 2500 3000 Standard Deviation of the Disease Generating Dose Vector [mgy] 16

Dose vector set 4: No uncertainties shared; Distribution of k95 factors as in HTDS but for median dose values changed (m*1.3,s/8) ; The false positive fraction is 0.057617 for both types of test. 1 0.8 0.6 0.4 0.2 0 0.396 1 12 23 34 45 56 67 78 89 100 Cumulative Subjective Probability [% ] hypothesis test with median dose values only hypothesis test with the m dose values 17

Subjective Probability for the Power to be at least 0.8 Set No only medians used dose vectors used "false positive" rate δ 1 0.35 0.37 0.051 0.41 2 0.52 0.74 0.048 0.28 3 0.02 0.37 0.045 0.14 4 0 1 0.058 0.13 18

Conclusions Two ways of arriving at a subjective probability distribution for the statistical power are compared: 1) All m dose vectors from the uncertainty analysis of the dose reconstruction are used in the hypothesis test. 2) The hypothesis test uses only the vector of median dose values. The comparison shows that the first way of testing can be expected to be of superior statistical power. 19