BFF Four: Are we Converging?

Similar documents
Principles of Statistical Inference

Principles of Statistical Inference

Default priors and model parametrization

A union of Bayesian, frequentist and fiducial inferences by confidence distribution and artificial data sampling

Confidence Distribution

Confidence distributions in statistical inference

My talk concerns estimating a fixed but unknown, continuously valued parameter, linked to data by a statistical model. I focus on contrasting

Noninformative Priors for the Ratio of the Scale Parameters in the Inverted Exponential Distributions

Fiducial Inference and Generalizations

STAT 499/962 Topics in Statistics Bayesian Inference and Decision Theory Jan 2018, Handout 01

Conditional Inference by Estimation of a Marginal Distribution

Combining diverse information sources with the II-CC-FF paradigm

Bayesian and frequentist inference

THE MINIMAL BELIEF PRINCIPLE: A NEW METHOD FOR PARAMETRIC INFERENCE

Discussion on Fygenson (2007, Statistica Sinica): a DS Perspective

Confidence Intervals Lecture 2

Bayesian inference. Fredrik Ronquist and Peter Beerli. October 3, 2007

Estimation of reliability parameters from Experimental data (Parte 2) Prof. Enrico Zio

Uncertain Inference and Artificial Intelligence

A simple analysis of the exact probability matching prior in the location-scale model

Likelihood inference in the presence of nuisance parameters

Validity and the foundations of statistical inference

Data Fusion with Confidence Curves: The II-CC-FF Paradigm

STAT 425: Introduction to Bayesian Analysis

Confidence Distribution, the Frequentist Distribution Estimator of a Parameter: A Review

Bayesian Model Specification: Toward a Theory of Applied Statistics

Review: Statistical Model

Remarks on Improper Ignorance Priors

Inferential models: A framework for prior-free posterior probabilistic inference

Lecture Notes 1: Decisions and Data. In these notes, I describe some basic ideas in decision theory. theory is constructed from

Propagation of Uncertainties in Measurements: Generalized/ Fiducial Inference

ICES REPORT Model Misspecification and Plausibility

Approximating models. Nancy Reid, University of Toronto. Oxford, February 6.

Statistical Methods in Particle Physics Lecture 1: Bayesian methods

Conditional confidence interval procedures for the location and scale parameters of the Cauchy and logistic distributions

Probabilistic Inference for Multiple Testing

The New Palgrave Dictionary of Economics Online

Constructing Ensembles of Pseudo-Experiments

CONVERTING OBSERVED LIKELIHOOD FUNCTIONS TO TAIL PROBABILITIES. D.A.S. Fraser Mathematics Department York University North York, Ontario M3J 1P3

Bayesian Inference and the Parametric Bootstrap. Bradley Efron Stanford University

Default priors for Bayesian and frequentist inference

Introduction to Bayesian Statistics

The Fundamental Principle of Data Science

ASSESSING A VECTOR PARAMETER

Data Mining Chapter 4: Data Analysis and Uncertainty Fall 2011 Ming Li Department of Computer Science and Technology Nanjing University

The binomial model. Assume a uniform prior distribution on p(θ). Write the pdf for this distribution.

Statistical Approaches to Learning and Discovery. Week 4: Decision Theory and Risk Minimization. February 3, 2003

Bayesian model selection: methodology, computation and applications

PARAMETER CURVATURE REVISITED AND THE BAYES-FREQUENTIST DIVERGENCE.

Overall Objective Priors

Bayesian nonparametric estimation of finite population quantities in absence of design information on nonsampled units

Measuring nuisance parameter effects in Bayesian inference

Generalized Fiducial Inference

Bayesian analysis in finite-population models

MAXIMUM LIKELIHOOD, SET ESTIMATION, MODEL CRITICISM

Hypothesis Testing. Part I. James J. Heckman University of Chicago. Econ 312 This draft, April 20, 2006

Generalized Fiducial Inference: A Review

Module 22: Bayesian Methods Lecture 9 A: Default prior selection

CS 540: Machine Learning Lecture 2: Review of Probability & Statistics

Measurement error as missing data: the case of epidemiologic assays. Roderick J. Little

Introduction to Bayesian Methods

Confidence is epistemic probability

Statistical Methods in Particle Physics

Chapter 4 HOMEWORK ASSIGNMENTS. 4.1 Homework #1

g-priors for Linear Regression

Parameter estimation and forecasting. Cristiano Porciani AIfA, Uni-Bonn

Harvard University. Harvard University Biostatistics Working Paper Series

Australian & New Zealand Journal of Statistics

Bayesian tests of hypotheses

Discussion of Dempster by Shafer. Dempster-Shafer is fiducial and so are you.

FULL LIKELIHOOD INFERENCES IN THE COX MODEL

arxiv: v1 [math.st] 10 Aug 2007

Better Bootstrap Confidence Intervals

Stat 5101 Lecture Notes

Marginal inferential models: prior-free probabilistic inference on interest parameters

BAYESIAN ANALYSIS OF BIVARIATE COMPETING RISKS MODELS

A Very Brief Summary of Statistical Inference, and Examples

Lecture 5. G. Cowan Lectures on Statistical Data Analysis Lecture 5 page 1

Likelihood and p-value functions in the composite likelihood context

Valid Prior-Free Probabilistic Inference and its Applications in Medical Statistics

Second order ancillary: A differential view from continuity

Chapter 9: Interval Estimation and Confidence Sets Lecture 16: Confidence sets and credible sets

CD posterior combining prior and data through confidence distributions

Likelihood Inference in the Presence of Nuisance Parameters

Likelihood Inference in the Presence of Nuisance Parameters

Bayesian Models in Machine Learning

The Surprising Conditional Adventures of the Bootstrap

Bayesian inference for sample surveys. Roderick Little Module 2: Bayesian models for simple random samples

Some History of Optimality

Statistics for Particle Physics. Kyle Cranmer. New York University. Kyle Cranmer (NYU) CERN Academic Training, Feb 2-5, 2009

Fiducial and Confidence Distributions for Real Exponential Families

DEFNITIVE TESTING OF AN INTEREST PARAMETER: USING PARAMETER CONTINUITY

Objective Bayesian Statistical Inference

Miscellany : Long Run Behavior of Bayesian Methods; Bayesian Experimental Design (Lecture 4)

A Very Brief Summary of Bayesian Inference, and Examples

Approximate Bayesian computation for spatial extremes via open-faced sandwich adjustment

BFF4: Fourth Bayesian, Fiducial, and Frequentist Workshop Program

P Values and Nuisance Parameters

STATS 200: Introduction to Statistical Inference. Lecture 29: Course review

Noninformative Priors Do Not Exist: A Discussion with José M. Bernardo

Transcription:

BFF Four: Are we Converging? Nancy Reid May 2, 2017

Classical Approaches: A Look Way Back Nature of Probability BFF one to three: a look back Comparisons Are we getting there? BFF Four Harvard, May 2017 2/34

Posterior Distribution Bayes 1763 Stigler 2013 BFF Four Harvard, May 2017 3/34

Posterior Distribution Bayes 1763 π(θ y 0 ) = f (y 0 ; θ)π(θ)/m(y 0 ) probability distribution for θ y 0 is fixed probability comes from π(θ) BFF Four Harvard, May 2017 4/34

Fiducial Probability Fisher 1930 df = F(T, θ)dθ θ fiducial probability density for θ, given statistic T probability comes from (dist n of) T It is not to be lightly supposed that men of the mental calibre of Laplace and Gauss... could fall into error on a question of prime theoretical importance, without an uncommonly good reason BFF Four Harvard, May 2017 5/34

Confidence Distribution Cox 1958; Efron 1993 Much controversy has centred on the distinction between fiducial and confidence estimation... The fiducial approach leads to a distribution for the unknown parameter... the method of confidence intervals, as usually formulated, gives only one interval at some preselected level of probability... in... simple cases... there seems no reason why we should not work with confidence distributions for the unknown parameter These can either be defined directly, or... introduced in terms of the set of all confidence intervals BFF Four Harvard, May 2017 6/34

Confidence Distribution Cox 1958; Efron 1993 assigns probability 0.05 to θ lying between the upper endpoints of the 0.90 and 0.95 confidence intervals, etc. Of course this is logically incorrect, but it has powerful intuitive appeal... no nuisance parameters [this] is exactly Fisher s fiducial distribution Seidenfeld 1992; Zabell 1992 BFF Four Harvard, May 2017 7/34

Structural Probability Fraser 1966 a re-formulation of fiducial probability for transformation models This transformation re-formulation leads to a frequency interpretation a change in the parameter value can be offset by a change in the sample y y + a; θ θ a a local location version leads to: Likelihood z } { f (y 0, θ) dy df = F (y, θ)dθ = F (y, θ) = f (y 0, θ) θ θ f (y 0, θ) dθ BFF Four Harvard, May 2017 y0 8/34

Significance Function Fraser 1991 Log likelihood 8 6 4 2 0 Pivot 3 2 1 0 1 2 3 0 2 4 6 8 ψ 0 2 4 6 8 ψ from likelihood to significance significance records probability left of the observed data point likelihood records probability at the observed data point the significance function is a plot of this probability as a function of θ the full spectrum of confidence intervals is obtained... suggesting the alternate name confidence distribution function BFF Four Harvard, May 2017 9/34

Nature of Probability Cox 2006; R & Cox 2015; Zabell, 1992 probability to describe physical haphazard variability aleatory probabilities represent features of the real world in somewhat idealized form subject to empirical test and improvement conclusions of statistical analysis expressed in terms of interpretable parameters enhanced understanding of the data generating process probability to describe the uncertainty of knowledge epistemic measures rational, supposedly impersonal, degree of belief, given relevant information Jeffreys measures a particular person s degree of belief, subject typically to some constraints of self-consistency Ramsey, de Finetti, Savage often linked with personal decision making necessarily? BFF Four Harvard, May 2017 10/34

... nature of probability Bayes posterior describes uncertainty of knowledge probability comes from the prior confidence intervals or p-values refer to empirical probabilities in what sense are confidence distribution functions, significance functions, structural or fiducial probabilities to be interpreted? empirically? degree of belief? literature is not very clear we may avoid the need for a different version of probability by appeal to a notion of calibration imho BFF Four Harvard, May 2017 11/34

BFF 1 4 posterior distribution fiducial probability confidence distribution structural probability significance function belief functions objective Bayes generalized fiducial inference confidence distributions and confidence curves approximate significance functions inferential models BFF Four Harvard, May 2017 12/34

What has changed? computation BFF Four Harvard, May 2017 13/34

Objective Bayes Berger, BFF4, e.g. noninformative, default, matching, reference,... priors we may avoid the need for a different version of probability by appeal to a notion of calibration Cox 2006, R & Cox 2015 as with other measuring devices within this scheme of repetition, probability is defined as a hypothetical frequency it is unacceptable if a procedure yielding high-probability regions in some non-frequency sense are poorly calibrated such procedures, used repeatedly, give misleading conclusions Bayesian Analysis, V1(3) 2006 BFF Four Harvard, May 2017 14/34

... objective Bayes pragmatic solution as a starting point some versions may not be correctly calibrated requires checking in each example calibrated versions must be targetted on the parameter of interest only in very special cases can calibration be achieved for more than one parameter in the model, from the same prior the simplicity of a fully Bayesian approach to inference is lost Gelman 2008; PPM LW BFF Four Harvard, May 2017 15/34

Confidence Distribution Xie & Singh; Hjort & Schweder any function H : Y Θ (0, 1) which is a cumulative distribution function of θ for any y Y has correct coverage: H(Y, θ) U(0, 1) Y f ( ; θ) CDs, or approximate CDs, are readily obtained from pivotal quantitites pivotal quantity: g(y, θ) with sampling distribution known n(ȳ µ)/s sufficiently general to encompass bootstrap distribution and many standard likelihood quantities recent examples include robust meta-analysis and identification of change-points Xie et al. 2011; Cunen et al. 2017; Hannig & Xie 2012 BFF Four Harvard, May 2017 16/34

... confidence distribution confidence 0.0 0.2 0.4 0.6 0.8 1.0 cc 0.5 0.6 0.7 0.8 0.9 1.0-1 0 1 2 3 4 µ -1 0 1 2 3 4 µ significance 0.0 0.2 0.4 0.6 0.8 1.0 pivot -4-2 0 2 4-1 0 1 2 3 4-1 0 1 2 3 4 µ BFF Four Harvard, May 2017 17/34 µ

Generalized Fiducial Hannig et al 2016 Fisher: g(y, θ) has a known distribution; invert this to create distribution for θ when y 0 is obtained Fraser: use data-generating equation to make the inversion more direct, e.g. Y i = µ + e i, e i = y 0 i µ Hannig et al. Y = G(U, θ), U has known distribution suppose we can invert this for any y 0 : θ = Q y 0(U) fiducial distribution of θ is Q y 0(U ) U independent copy of U inverse only exists if θ and Y have same dimension might get this by reduction of a sample to sufficient statistics more generally, fiducial density takes the form r(θ; y 0 ) f (y 0, θ)j(y 0, θ) BFF Four Harvard, May 2017 18/34

Significance Function Fraser & R,... focus on parameter of interest y R n, θ R p, ψ R focus on dimension reduction many arguments point to the need for this n p using an approximation location model Likelihood df = θ F (y, θ)dθ = θ F(y, θ) f (y 0 {}}{, θ) f (y 0, θ) = f (y 0, θ) dy dθ p 1 using a tangent exponential model combination of structural model and likelihood asymptotics leads for example to construction of default priors y 0 Fraser et al 2010 BFF Four Harvard, May 2017 19/34

Inferential Models Martin & Liu Hannig et al. Y = G(U, θ), U has known distribution suppose we can invert this for any y 0 : θ = Q y 0(U) fiducial distribution of θ is Q y 0(U ) U independent copy of U use a random set S to predict U this random set is converted to a belief function about θ need to ensure the belief function is valid and efficient valid = calibrated? BFF Four Harvard, May 2017 20/34

Comparisons: conditioning objective Bayes generalized fiducial inference confidence distributions and confidence curves approximate significance functions inferential models yes yes and no JASA 16 needs to be built in ahead of time yes; via approximate location model needs to be built in ahead of time BFF Four Harvard, May 2017 21/34

Comparisons: Eliminating Nuisance Parameters objective Bayes generalized fiducial inference confidence distributions and confidence curves approximate significance functions inferential models marginalization rarely works... depends on the problem use profile log-likelihood, or similar marginalization? focus parameter via Laplace approximation marginalization invoked ahead of time BFF Four Harvard, May 2017 22/34

Comparisons: Calibration objective Bayes generalized fiducial inference confidence distributions and confidence curves approximate significance functions inferential models often yes typically approximate typically approximate yes BFF Four Harvard, May 2017 23/34

Comparisons: Nature of Probability Bayes / objective Bayes epistemic / empirical generalized fiducial inference empirical confidence distributions and confidence curves empirical but not prescriptive approximate significance functions empirical inferential models?epistemic? BFF Four Harvard, May 2017 24/34

What s the end goal? Applications something that works gives sensible answers not too sensitive to model assumptions computable in reasonable time provides interpretable parameters Foundations peeling back the layers what does works mean? what probability do we mean Goldilocks conditioning Meng & Liu, 2016 how does this impact applied work? BFF Four Harvard, May 2017 25/34

Role of Foundations Cox & R, 2015 avoid apparent discoveries based on spurious patterns to shed light on the structure of the problem calibrated inferences about interpretable parameters realistic assessment of precision understanding when/why methods work/fail BFF Four Harvard, May 2017 26/34

Well, are we? objective Bayes confidence distributions generalized fiducial Larry W: the perpetual motion machine of Bayesian inference Min-ge, Regina: everything fits Nils: CDs are the gold standard Jan: bring it on... I ll figure it out inferential models significance functions Ryan: it s the only solution Chuanhai: it might take 100 years Don: it s the best solution... you can t solve everything at once BFF Four Harvard, May 2017 27/34

Some warning signs BFF Four Harvard, May 2017 28/34

Some warning signs BFF Four Harvard, May 2017 29/34

Some warning signs BFF Four Harvard, May 2017 30/34

Some warning signs deep neural networks easily fit random labels these observations rule out... possible explanations for the generalization performance of state-of-the-art neural networks. BFF Four Harvard, May 2017 31/34

Some warning signs BFF Four Harvard, May 2017 32/34

Summary Bayes, fiducial, structural, confidence, belief BFF 1-4: Develop links to bridge gaps among different statistical paradigms targetting parameters limit distributions calibration in repeated sampling relevant repetitions for the data at hand NR: Why is conditional inference so hard? DRC: I expect we re all missing something, but I don t know what it is StatSci Interview 1996 BFF Four Harvard, May 2017 33/34

References Cox, D.R. (1958). Ann. Math. Statist. Cox, D.R. (2006). Principles of Statistical Inference. Cunen et al. (2017). J. Statist. Plann. Infer. Efron, B. (1993). Biometrika Fisher, R.A. (1930). Proc. Cam. Phil. Soc. Fraser, D.A.S. (1966). Biometrika Fraser, D.A.S. (1991). J. Amer. Statist. Assoc. Fraser, D.A.S. et al. (2010). J. R. Statist. Soc. B Fraser, D.A.S. and Reid, N. (1993). Statist. Sinica Gelman, A. (2008). Ann. Appl. Statist. Hannig, J. and Xie, M. (2012). Elect. J. Statist. Hannig, J. et al. (2016). J. Amer. Statist. Assoc. Hjort, N. and Schweder, T. Confidence, Likelihood, Probability: Inference with Confidence Distributions Meng, X.-L. and Liu, K. (2016). Ann. Rev. Stat. and its Applic. Norman, S. et al. (2016). Ann. Rev. Stat. and its Applic. Reid, N. and Cox, D.R. (2015). Intern. Statist. Rev. Stein, C. (1959). Ann. Math. Statist. Stigler, S. (2013). Statistical Science Xie, M. and Singh, K. (2013) Internat. Statist. Rev. Xie, M. et al. (2011). J. Amer. Statist. Assoc. Zabell, X. (1992). Statistical Science BFF Four Harvard, May 2017 34/34