Confidence Distribution

Similar documents
STAT 499/962 Topics in Statistics Bayesian Inference and Decision Theory Jan 2018, Handout 01

A union of Bayesian, frequentist and fiducial inferences by confidence distribution and artificial data sampling

Bios 6649: Clinical Trials - Statistical Design and Monitoring

the unification of statistics its uses in practice and its role in Objective Bayesian Analysis:

Confidence distributions in statistical inference

STATS 200: Introduction to Statistical Inference. Lecture 29: Course review

Confidence Distribution, the Frequentist Distribution Estimator of a Parameter: A Review

On Generalized Fiducial Inference

BFF Four: Are we Converging?

Introduction into Bayesian statistics

New Bayesian methods for model comparison

Fiducial Inference and Generalizations

A BAYESIAN MATHEMATICAL STATISTICS PRIMER. José M. Bernardo Universitat de València, Spain

Data Mining Chapter 4: Data Analysis and Uncertainty Fall 2011 Ming Li Department of Computer Science and Technology Nanjing University

STAT 425: Introduction to Bayesian Analysis

Parameter estimation and forecasting. Cristiano Porciani AIfA, Uni-Bonn

Two examples of the use of fuzzy set theory in statistics. Glen Meeden University of Minnesota.

P Values and Nuisance Parameters

Bayesian parameter estimation with weak data and when combining evidence: the case of climate sensitivity

Data Fusion with Confidence Curves: The II-CC-FF Paradigm

Statistical Methods in Particle Physics

Answers and expectations

Bayesian Econometrics

Estimation of reliability parameters from Experimental data (Parte 2) Prof. Enrico Zio

Sequential Monitoring of Clinical Trials Session 4 - Bayesian Evaluation of Group Sequential Designs

Frequentist Statistics and Hypothesis Testing Spring

Bayesian Inference in Astronomy & Astrophysics A Short Course

Political Science 236 Hypothesis Testing: Review and Bootstrapping

CD posterior combining prior and data through confidence distributions

Noninformative Priors for the Ratio of the Scale Parameters in the Inverted Exponential Distributions

Harvard University. Harvard University Biostatistics Working Paper Series

Long-Run Covariability

Bayesian Inference. Chapter 1. Introduction and basic concepts

Group Sequential Tests for Delayed Responses. Christopher Jennison. Lisa Hampson. Workshop on Special Topics on Sequential Methodology

Combining diverse information sources with the II-CC-FF paradigm

Bayesian inference: what it means and why we care

Applied Bayesian Statistics STAT 388/488

Parameter Estimation. William H. Jefferys University of Texas at Austin Parameter Estimation 7/26/05 1

COPYRIGHTED MATERIAL CONTENTS. Preface Preface to the First Edition

Physics 403. Segev BenZvi. Credible Intervals, Confidence Intervals, and Limits. Department of Physics and Astronomy University of Rochester

Stat 5101 Lecture Notes

Can we do statistical inference in a non-asymptotic way? 1

TABLE OF CONTENTS CHAPTER 1 COMBINATORIAL PROBABILITY 1

My talk concerns estimating a fixed but unknown, continuously valued parameter, linked to data by a statistical model. I focus on contrasting

Statistical Methods for Discovery and Limits in HEP Experiments Day 3: Exclusion Limits

Statistical Methods for Particle Physics Lecture 4: discovery, exclusion limits

Bios 6649: Clinical Trials - Statistical Design and Monitoring

Bayesian inference for sample surveys. Roderick Little Module 2: Bayesian models for simple random samples

(1) Introduction to Bayesian statistics

Contents. Part I: Fundamentals of Bayesian Inference 1

Bayesian Inference and the Parametric Bootstrap. Bradley Efron Stanford University

(4) One-parameter models - Beta/binomial. ST440/550: Applied Bayesian Statistics

Statistics: Learning models from data

Unobservable Parameter. Observed Random Sample. Calculate Posterior. Choosing Prior. Conjugate prior. population proportion, p prior:

Bayesian Inference: Posterior Intervals

Treatment and analysis of data Applied statistics Lecture 4: Estimation

A Very Brief Summary of Statistical Inference, and Examples

Previous lecture. P-value based combination. Fixed vs random effects models. Meta vs. pooled- analysis. New random effects testing.

2. A Basic Statistical Toolbox

Propagation of Uncertainties in Measurements: Generalized/ Fiducial Inference

Statistical Methods for Astronomy

Previous lecture. Single variant association. Use genome-wide SNPs to account for confounding (population substructure)

Hypothesis Testing - Frequentist

Chapter 4 HOMEWORK ASSIGNMENTS. 4.1 Homework #1

Deciding, Estimating, Computing, Checking

Deciding, Estimating, Computing, Checking. How are Bayesian posteriors used, computed and validated?

Foundations of Statistical Inference

Bootstrap and Parametric Inference: Successes and Challenges

Group Sequential Tests for Delayed Responses

Statistical Models with Uncertain Error Parameters (G. Cowan, arxiv: )

Principles of Statistical Inference

Principles of Statistical Inference

18.05 Practice Final Exam

Inference for a Population Proportion

Principles of Bayesian Inference

Principles of Bayesian Inference

BEGINNING BAYES IN R. Bayes with discrete models

Poisson CI s. Robert L. Wolpert Department of Statistical Science Duke University, Durham, NC, USA

Statistical Methods for Astronomy

Journeys of an Accidental Statistician

Bayesian Models in Machine Learning

Principles of Bayesian Inference

Bayesian Inference. Chapter 2: Conjugate models

Statistical methods for decision making in mine action

Bayesian Statistics from Subjective Quantum Probabilities to Objective Data Analysis

Hypothesis Testing. Econ 690. Purdue University. Justin L. Tobias (Purdue) Testing 1 / 33

Statistical Inference

Bayesian inference. Rasmus Waagepetersen Department of Mathematics Aalborg University Denmark. April 10, 2017

Statistics of Small Signals

A Very Brief Summary of Bayesian Inference, and Examples

Principles of Bayesian Inference

Plausible Values for Latent Variables Using Mplus

A Bayesian solution for a statistical auditing problem

Review. DS GA 1002 Statistical and Mathematical Models. Carlos Fernandez-Granda

Imperfect Data in an Uncertain World

Decision theory. 1 We may also consider randomized decision rules, where δ maps observed data D to a probability distribution over

Probability and Estimation. Alan Moses

Monte Carlo in Bayesian Statistics

HYPOTHESIS TESTING: FREQUENTIST APPROACH.

Using prior knowledge in frequentist tests

Transcription:

Confidence Distribution Xie and Singh (2013): Confidence distribution, the frequentist distribution estimator of a parameter: A Review Céline Cunen, 15/09/2014

Outline of Article Introduction The concept of Confidence Distribution (CD) A classical Definition and the History of the CD Concept A modern definition and interpretation Illustrative examples Basic parametric examples Significant (p-value) functions Bootstrap distributions Likelihood functions Asymptotically third-order accurate confidence distributions CD, Bootstrap, Fiducial and Bayesian approaches CD-random variable, Bootstrap estimator and fiducial-less interpretation CD, fiducial distribution and Belief function CD and Bayesian inference Inferences using a CD Confidence Interval Point estimation Hypothesis testing Optimality (comparison) of CDs Combining CDs from independent sources Combination of CDs and a unified framework for Meta-Analysis Incorporation of Expert opinions in clinical trials CD-based new methodologies, examples and applications CD-based likelihood caluculations Confidence curve CD-based simulation methods Additional examples and applications of CD-developments Summary

CD: a sample-dependent distribution that can represent confidence intervals of all levels for a parameter of interest

CD: a broad concept = covers all approaches that can build confidence intervals at all levels

Interpretation A distribution on the parameter space A Distribution estimator = contains information for many types of inference An estimator FOR the parameter of interest, instead of an inherent distribution OF the parameter Purely frequentist: The parameter is a fixed, non-random quantity 95% CI: the true parameter value will be covered by the CIs 95% of the time

Fiducial Inference Fisher's biggest blunder? Fiducial Faith A fiducial distribution: describes the level of faith attached to different values of the unknown parameter Sometimes normalized likelihood function are interpreted as distributions of the parameter ~ the parameters are both fixed and random quantities Interpretation of 95% CI: there is a 95% probability that the parameter lies inside the CI.

2.2 Modern definition A function Hn () is called a confidence distribution for a parameter θ if: R1: H n () is a cummulative distribution function on the parameter space R2: at the true parameter value θ = θ 0 H n (θ 0 ) as a function of the sample x follows the uniform distribution U[0,1] R2 is important!

2.3.1: Basic parametric examples A sample: With σ² known The function satisfies the requirements in the CD-definition R1: it clearly is a cdf R2:

2.3.2 p-value functions One-sided test H0 : θ b vs H 1 : θ > b P-value function: Usually CDs or (acds): Because cdf And because when b=θ 0, H 0 is true and p-values are uniformly distributed when H 0 is true.

2.3.3 Bootstrap distributions True (unknown) parameter Original sample, with estimator Bootstrap sample: a sample of equal size as the original sample, sampled with replacement from the original sample Compute the estimator on each bootstrap samples (get many ) --> the Bootstrap Distribution The Bootstrap Distribution is an acd!

2.3.4 Likelihood functions Under some mild conditions: normalized likelihood functions are density functions of asymptotic normal CDs Method for obtaining CDs from likelihoods CD-based inference likelihood inference

3.1 CD-random Variable,... The CD-random variable: The CD is not a distribution of θ! CD-random variable = Bootstrap estimator, and this is useful because it: Help understanding CD-inference and develop new methods Clarifies the interpretation of CDs: CDs are not distributions of θ, so therefore it is not a problem that a transformation g(θ) of θ does not generally lead to a CD for g(θ)

3.2 CD, Fiducial Distribution... Methods from fiducial reasoning (and from CDs) are supposed to have good statistical performance in the frequentist sense CIs should have the exact coverage property For tests: the actual rate of type I error is equal to the specified level of the test Many fiducial distributions are CDs Fiducial reasoning: provides a procedure for finding CDs

3.3 CD and Bayesian Inference Bayesian credible intervals to not possess the exact coverage property But asymptotically they can obtain it Then: posterior distributions are acds Bayes methods can produce CDs! Benefits with CBs compared to bayesian methods: Nuisance parameters

4 Inferences using a CD CDs contain information for any type of frequentist inference

4.1 Confidence Intervals CDs allows us to construct CIs for all levels of α CI constructed from CDs have the exact coverage property:

4.2 Point Estimation Median: Mean: Mode: Under some conditions, these are consistent estimators (= converging in probability to the true value)

4.3 Hypothesis Testing One-sided test: The support on C: Reject H 0 if the support on C is less than α The rejection region corresponds to a level α test Support = p-value (often) Two-sided test Same story, but with a more complicated rejection region

5 Optimality of CDs Can have multiple CDs for the same parameter A better CD = a CD more concentrated around the true parameter value

6.1 Combination of CDs... k independent studies, estimate the same parameter of interest θ Study i with sample xi and CD H i (.) Propose a general recipe for combining k independent CDs:

Combining k p-values - Fisher's method k p-values from k independent studies Under H0 : Remember that So that the teststatistic Will be distributed under H0

Example 2 studies aim to estimate the μ parameter from a normal model with known sigmas Study 1: n1 =30, Study 2: n2 =40, With CDs: We choose And then

6.2 Incorporation of Expert Opinions How can one incorporate existing knowledge in an analysis? Bayesian approaches: prior = existing knowledge In a frequentist setting: CD! CD-approach: CD e : summarizes the existing information/opinions CD d : from the data Combines these two by the methods in the last section Advantages: Easy to implement/ computationally cheap No need for priors on nuisance parameter! Avoids the discrepant posterior phenomenon

7 New methodologies, Examples... Different CD-related methods: Obtaining likelihood functions from CDs Presenting CDs: the confidence curve CD-based simulation methods

8 Summary CDs is a broad concept which contains many well-known notions and results Most types of frequentist inference can be derived from CDs Advantages of the CD-approach: Handles nuisance parameters well Easy to combine information Problems (need of further study): Multivariate CDs Cases where it difficult to obtain CDs Model uncertainty/ diagnosis/ selection