Confidence Intervals. First ICFA Instrumentation School/Workshop. Harrison B. Prosper Florida State University

Similar documents
Confidence Intervals Lecture 2

Statistics of Small Signals

E. Santovetti lesson 4 Maximum likelihood Interval estimation

Journeys of an Accidental Statistician

Statistical Methods in Particle Physics

Statistical Methods for Particle Physics Lecture 4: discovery, exclusion limits

Physics 403. Segev BenZvi. Credible Intervals, Confidence Intervals, and Limits. Department of Physics and Astronomy University of Rochester

Frequentist Confidence Limits and Intervals. Roger Barlow SLUO Lectures on Statistics August 2006

Statistics for Particle Physics. Kyle Cranmer. New York University. Kyle Cranmer (NYU) CERN Academic Training, Feb 2-5, 2009

Hypothesis testing (cont d)

Statistical Methods for Discovery and Limits in HEP Experiments Day 3: Exclusion Limits

Lecture 5. G. Cowan Lectures on Statistical Data Analysis Lecture 5 page 1

Statistical Methods in Particle Physics Lecture 2: Limits and Discovery

Statistics for the LHC Lecture 2: Discovery

Harrison B. Prosper. CMS Statistics Committee

Confidence Limits and Intervals 3: Various other topics. Roger Barlow SLUO Lectures on Statistics August 2006

Practical Statistics for Particle Physicists

Unified approach to the classical statistical analysis of small signals

Foundations of Statistical Inference

Some Statistical Tools for Particle Physics

Statistics for the LHC Lecture 1: Introduction

Confidence intervals and the Feldman-Cousins construction. Edoardo Milotti Advanced Statistics for Data Analysis A.Y

Confidence intervals fundamental issues

Solution: chapter 2, problem 5, part a:

Second Workshop, Third Summary

Parameter estimation and forecasting. Cristiano Porciani AIfA, Uni-Bonn

Physics 1140 Fall 2013 Introduction to Experimental Physics

2. A Basic Statistical Toolbox

Part 3 Robust Bayesian statistics & applications in reliability networks

Topics in Statistical Data Analysis for HEP Lecture 1: Bayesian Methods CERN-JINR European School of High Energy Physics Bautzen, June 2009

Use of the likelihood principle in physics. Statistics II

Statistics and Data Analysis

Statistical Methods in Particle Physics Lecture 1: Bayesian methods

Primer on statistics:

Signal detection theory

Discovery and Significance. M. Witherell 5/10/12

Detection ASTR ASTR509 Jasper Wall Fall term. William Sealey Gosset

(Dated: July 1, 2004) Abstract. In Run I, the DØ Collaboration adopted a Bayesian method for calculating confidence limits that

Statistical techniques for data analysis in Cosmology

CREDIBILITY OF CONFIDENCE INTERVALS

Chapter 3: Interval Estimation

Stat 535 C - Statistical Computing & Monte Carlo Methods. Arnaud Doucet.

Statistical Models with Uncertain Error Parameters (G. Cowan, arxiv: )

Constructing Ensembles of Pseudo-Experiments

Lecture 24. Introduction to Error Analysis. Experimental Nuclear Physics PHYS 741

Statistical Data Analysis Stat 3: p-values, parameter estimation

Method of Feldman and Cousins for the construction of classical confidence belts

Statistical Methods for Particle Physics

Advanced Statistics Course Part I

Statistical Methods for Particle Physics (I)

Bayes Theorem (Recap) Adrian Bevan.

A Calculator for Confidence Intervals

The Jeffreys-Lindley Paradox and Discovery Criteria in High Energy Physics

Search and Discovery Statistics in HEP

Bayesian Inference in Astronomy & Astrophysics A Short Course

HYPOTHESIS TESTING: FREQUENTIST APPROACH.

Parameter estimation! and! forecasting! Cristiano Porciani! AIfA, Uni-Bonn!

CPSC 340: Machine Learning and Data Mining

Data Analysis and Monte Carlo Methods

Introduction to Bayesian Data Analysis

Confidence Distribution

Some Topics in Statistical Data Analysis

FYST17 Lecture 8 Statistics and hypothesis testing. Thanks to T. Petersen, S. Maschiocci, G. Cowan, L. Lyons

9/12/17. Types of learning. Modeling data. Supervised learning: Classification. Supervised learning: Regression. Unsupervised learning: Clustering

P Values and Nuisance Parameters

Hypothesis Testing. Part I. James J. Heckman University of Chicago. Econ 312 This draft, April 20, 2006

Harrison B. Prosper. CMS Statistics Committee

Computing Likelihood Functions for High-Energy Physics Experiments when Distributions are Defined by Simulators with Nuisance Parameters

32. STATISTICS. 32. Statistics 1

Harrison B. Prosper. Bari Lectures

Lecture 1: Probability Fundamentals

Practical Statistics part II Composite hypothesis, Nuisance Parameters

Asymptotic formulae for likelihood-based tests of new physics

Discussion of Dempster by Shafer. Dempster-Shafer is fiducial and so are you.

Definition and Treatment of Systematic Uncertainties in High Energy Physics and Astrophysics

Error analysis for efficiency

Data Mining Chapter 4: Data Analysis and Uncertainty Fall 2011 Ming Li Department of Computer Science and Technology Nanjing University

Principles of Statistical Inference

Statistical Methods in Particle Physics Day 4: Discovery and limits

Statistics: Learning models from data

Lies, damned lies, and statistics in Astronomy

Parameter estimation and forecasting. Cristiano Porciani AIfA, Uni-Bonn

arxiv: v1 [hep-ex] 9 Jul 2013

Counting Statistics and Error Propagation!

arxiv:physics/ v2 [physics.data-an] 16 Dec 1999

Statistics Challenges in High Energy Physics Search Experiments

Statistical Methods for Particle Physics Lecture 3: systematic uncertainties / further topics

32. STATISTICS. 32. Statistics 1

Bayesian Inference. Anders Gorm Pedersen. Molecular Evolution Group Center for Biological Sequence Analysis Technical University of Denmark (DTU)

Physics 736. Experimental Methods in Nuclear-, Particle-, and Astrophysics. - Statistics and Error Analysis -

OVERVIEW OF PRINCIPLES OF STATISTICS

Introduction to Statistical Methods for High Energy Physics

STAT 425: Introduction to Bayesian Analysis

Preliminary Statistics Lecture 2: Probability Theory (Outline) prelimsoas.webs.com

Statistical Tools in Collider Experiments. Multivariate analysis in high energy physics

Reasoning Under Uncertainty: Introduction to Probability

Bayesian Model Specification: Toward a Theory of Applied Statistics

Machine Learning Lecture 2

Statistical Methods for Astronomy

Statistical Methods for Particle Physics Lecture 3: Systematics, nuisance parameters

Transcription:

Confidence Intervals First ICFA Instrumentation School/Workshop At Morelia,, Mexico, November 18-29, 2002 Harrison B. Prosper Florida State University

Outline Lecture 1 Introduction Confidence Intervals - Frequency Interpretation Poisson Example Summary Lecture 2 Deductive and Inductive Reasoning Confidence Intervals - Bayesian Interpretation Poisson Example Summary November 21, 2002 First ICFA Instrumentation School/Workshop Harrison B. Prosper 2

Introduction We physicists often talk about calculating errors, but what we really mean, of course, is quantifying our uncertainty A measurement is not uncertain, but it has an error about which we are uncertain! ε = mˆ m mˆ = measurement m = true value November 21, 2002 First ICFA Instrumentation School/Workshop Harrison B. Prosper 3

Introduction - i One way to quantify uncertainty is the standard deviation or, even better, the root mean square deviation of the distribution of measurements. ε 2 = ( mˆ = + mˆ 2 m) 2 mˆ ( mˆ m) 2 2 In 1937 Jerzy Neyman invented another measure of uncertainty called a confidence interval. rms = std. dev. = bias = ε mˆ 2 2 mˆ m mˆ 2 November 21, 2002 First ICFA Instrumentation School/Workshop Harrison B. Prosper 4

Introduction - ii Consider the following questions What is the mass of the top quark? What is the mass of the tau neutrino? What is the mass of the Higgs boson? Here are possible answers Here are possible answers m t = 174.3 ± 5.1 GeV m < 18.2 MeV m H > 114.3 GeV November 21, 2002 First ICFA Instrumentation School/Workshop Harrison B. Prosper 5

Introduction iii These answers are unsatisfactory because they do not specify how much confidence we should place in them. Here are better answers m t = 174.3 ± 5.1 GeV, with CL = 0.683 m < 18.2 MeV, with CL = 0.950 m H > 114.3 GeV, with CL = 0.950 CL = Confidence Level November 21, 2002 First ICFA Instrumentation School/Workshop Harrison B. Prosper 6

Introduction - iv Note that the statements m t = 174.3 ± 5.1 GeV, CL = 0.683 m < 18.2 MeV, CL = 0.950 m H > 114.3 GeV, CL = 0.950 are just an asymmetric way of writing m t lies in [169.2, 179.4] GeV, CL = 0.683 m lies in [0, 18.2] MeV, CL = 0.950 m H lies in [114.3, ) GeV, CL = 0.950 November 21, 2002 First ICFA Instrumentation School/Workshop Harrison B. Prosper 7

Introduction - v The goal of these lectures is to explain the precise meaning of statements of the form lies in [L, U], with CL = L = lower limit U = upper limit For example m t lies in [169.2, 179.4] GeV, with CL = 0.683 November 21, 2002 First ICFA Instrumentation School/Workshop Harrison B. Prosper 8

What is a Confidence Level? A confidence level is a probability that quantifies in some way the reliability of a given statement But, what exactly is probability? Bayesian: The degree of belief in, or plausibility of, a statement (Bayes, Laplace, Jeffreys, Jaynes) Frequentist: The relative frequency with which something happens (Boole, Venn, Fisher, Neyman) November 21, 2002 First ICFA Instrumentation School/Workshop Harrison B. Prosper 9

Probability: An Example Consider the statement S = It will rain in Morelia on Monday And the probability assignment Pr{S} = 0.01 Bayesian interpretation The plausibility of the statement S is 0.01 Frequentist interpretation The relative frequency with which it rains on Mondays in Morelia is 0.01 November 21, 2002 First ICFA Instrumentation School/Workshop Harrison B. Prosper 10

Confidence Level: Interpretation Since probability can be interpreted in (at least) two different ways, the interpretation of statements such as m t lies in [169.2, 179.4] GeV, with CL = 0.683 depends on which interpretation of probability is being used. A great deal of confusion arises in our field because of our tendency to forget this fact November 21, 2002 First ICFA Instrumentation School/Workshop Harrison B. Prosper 11

Confidence Intervals Frequency Interpretation

The basic idea Confidence Intervals Imagine a set of ensembles of experiments, each member of which is associated with a fixed value of the parameter to be measured (for example, the top quark mass). Each experiment E, within an ensemble, yields an interval [l(e), u(e)] )], which either contains or does not contain. November 21, 2002 First ICFA Instrumentation School/Workshop Harrison B. Prosper 13

Coverage Probability For a given ensemble, the fraction of experiments with intervals containing the value associated with that ensemble is called the coverage probability of the ensemble. In general, the coverage probability will vary from one ensemble to another. November 21, 2002 First ICFA Instrumentation School/Workshop Harrison B. Prosper 14

Example Ensemble with = 1 with Pr = 0.4 E 4 E 1 E 3 E 4 E 2 E 1 E 3 E 5 E 2 E 5 Ensemble with = 2 with Pr = 0.8 E 5 E 4 E 2 E 3 E 1 Ensemble with = 3 with Pr = 0.6 November 21, 2002 First ICFA Instrumentation School/Workshop Harrison B. Prosper 15

Confidence Level Frequency Interpretation If our experiment is selected at random from the ensemble to which it belongs (presumably the one associated with the true value of ) then the probability that its interval [l(e), u(e)] contains is equal to the coverage probability of that ensemble. The crucial point is this: We try to construct the set of ensembles so that the coverage probability over the set is never less than some pre-specified value, called the confidence level. November 21, 2002 First ICFA Instrumentation School/Workshop Harrison B. Prosper 16

Points to Note Confidence Level - ii In the frequency interpretation, the confidence level is a property of the set of ensembles; In fact, it is the minimum coverage probability over the set. Consequently, if the set of ensembles is unspecified or unknown the confidence level is undefined. November 21, 2002 First ICFA Instrumentation School/Workshop Harrison B. Prosper 17

Confidence Intervals Formal Definition Any set of intervals [ l ( E ), u ( E )] E Experiment l(e) Lower limit u(e) Upper limit with a minimum coverage probability equal to is a set of confidence intervals at 100 % confidence level (CL). (Neyman,1937) Confidence intervals are defined not by how they are constructed, but by their frequency properties. November 21, 2002 First ICFA Instrumentation School/Workshop Harrison B. Prosper 18

Confidence Intervals: An Example Experiment: To measure the mean rate of UHECRs above 10 20 ev per unit solid angle. Assume the probability of N events to be a given by a Poisson distribution θ n e θ P( n θ) =, θ = < n >, n! std.dev. = θ November 21, 2002 First ICFA Instrumentation School/Workshop Harrison B. Prosper 19

Confidence Intervals Example - ii Goal: Compute a set of intervals [ l( n), u( n)] for N = 0, 1, 2, with CL = 0.683 for a set of ensembles, each member of which is characterized by a different mean event count. November 21, 2002 First ICFA Instrumentation School/Workshop Harrison B. Prosper 20

Why 68.3%? It is just a useful convention! It comes from the fact that for a Gaussian distribution the confidence intervals given by [x-, x+ ] are associated with a set of ensembles whose confidence level is 0.683. (x = measurement, = std. dev.) The main reason for this convention is the Central Limit Theorem Most sensible distributions become more and more Gaussian as the data increase. November 21, 2002 First ICFA Instrumentation School/Workshop Harrison B. Prosper 21

Parameter space Confidence Interval General Algorithm θ α L For each value find an interval in N with probability content of at least β =1 α L α R θ =u(n) α R θ=l(n) Count N November 21, 2002 First ICFA Instrumentation School/Workshop Harrison B. Prosper 22

Example: Interval in N for = 10 0.15 Poisson Distribution (Mean = 10) 0.1 Probability 0.05 0 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 Count November 21, 2002 First ICFA Instrumentation School/Workshop Harrison B. Prosper 25

Confidence Intervals Specific Algorithms Neyman Region: fixed probabilities on either side Feldman Cousins Region: containing largest likelihood ratios P(n )/ P(n n) Mode Centered Region: containing largest probabilities P(n ) November 21, 2002 First ICFA Instrumentation School/Workshop Harrison B. Prosper 26

Neyman Construction Define C ( N θ) = P( z θ) L z N Left cumulative distribution function N C ( N θ ) = P( z θ ) R z N Right cumulative distribution function N Valid for both continuous and discrete distributions. November 21, 2002 First ICFA Instrumentation School/Workshop Harrison B. Prosper 27

Neyman Construction - ii Solve C L ( N u) = α L α L N C R ( N where β = 1 α α L R l) = α R α R N Remember: Left is UP and Right is LOW! November 21, 2002 First ICFA Instrumentation School/Workshop Harrison B. Prosper 28

Central Confidence Intervals Choose α L = α = ( 1 0. 683) / 2 R and solve α L = n i= 0 P( i u) α R = i= n P( i l) = 1 for the interval [ l( n), u( n)] n 1 i= 0 P( i l) November 21, 2002 First ICFA Instrumentation School/Workshop Harrison B. Prosper 29

P( n Central Confidence Intervals - ii Poisson Distribution θ ) = e θ n! θ n Parameter θ 20 15 10 Central Intervals - Poisson u(n) l(n) 5 0 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 Upper Limits Lower Limits Count n November 21, 2002 First ICFA Instrumentation School/Workshop Harrison B. Prosper 30

Comparison of Confidence Intervals 20 Intervals - Poisson Distribution Central 15 Feldman-Cousins Mode-Centered Parameter 10 N± N 5 0 0 2 4 6 8 10 12 14 16 18 20 Count November 21, 2002 First ICFA Instrumentation School/Workshop Harrison B. Prosper 31

Comparison of Confidence Interval Widths 10 Width of Intervals Central Feldman-Cousins Mode-Centered Interval Width 8 6 4 2 N± N 0 0 2 4 6 8 10 12 14 16 18 20 Count Central Feldman-Cousins Mode Root(N) November 21, 2002 First ICFA Instrumentation School/Workshop Harrison B. Prosper 32

Comparison of Coverage Probabilities 1 Coverage Probability Central Feldman-Cousins Mode-Centered Probability 0.8 0.6 0.4 0.2 0 N± N 0 5 10 15 20 Poisson Parameter Central Feldman-Cousins Mode Root(N) 68.3% November 21, 2002 First ICFA Instrumentation School/Workshop Harrison B. Prosper 33

Summary The interpretation of confidence intervals and confidence levels depends on which interpretation of probability one is using The coverage probability of an ensemble of experiments is the fraction of experiments that produce intervals containing the value of the parameter associated with that ensemble The confidence level is the minimum coverage probability over a set of ensembles. The confidence level is undefined if the set of ensembles is unspecified or unknown November 21, 2002 First ICFA Instrumentation School/Workshop Harrison B. Prosper 34