On Extended Admissible Procedures and their Nonstandard Bayes Risk

Similar documents
ST5215: Advanced Statistical Theory

Tame definable topological dynamics

Some Finitely Additive (Statistical) Decision Theory or How Bruno de Finetti might have channeled Abraham Wald

LECTURE NOTES 57. Lecture 9

HAPPY BIRTHDAY CHARLES

STAT 830 Decision Theory and Bayesian Methods

STA 732: Inference. Notes 10. Parameter Estimation from a Decision Theoretic Angle. Other resources

g 2 (x) (1/3)M 1 = (1/3)(2/3)M.

More Model Theory Notes

Evaluating the Performance of Estimators (Section 7.3)

Sequential Decisions

Week 5 Lectures 13-15

6c Lecture 14: May 14, 2014

Finite-Dimensional Cones 1

3 Hausdorff and Connected Spaces

MATH 4200 HW: PROBLEM SET FOUR: METRIC SPACES

Ends of Finitely Generated Groups from a Nonstandard Perspective

Lecture 2: Basic Concepts of Statistical Decision Theory

Bayesian Persuasion Online Appendix

Monotone equilibria in nonatomic supermodular games. A comment

Statistical Approaches to Learning and Discovery. Week 4: Decision Theory and Risk Minimization. February 3, 2003

Lecture notes on statistical decision theory Econ 2110, fall 2013

Definably amenable groups in NIP

Lecture Notes 1: Decisions and Data. In these notes, I describe some basic ideas in decision theory. theory is constructed from

NOTES ON VECTOR-VALUED INTEGRATION MATH 581, SPRING 2017

Exact Minimax Strategies for Predictive Density Estimation, Data Compression, and Model Selection

LECTURE 15: COMPLETENESS AND CONVEXITY

Quantum Minimax Theorem

Math General Topology Fall 2012 Homework 8 Solutions

Section 2: Classes of Sets

Perfect Conditional -Equilibria of Multi-Stage Games with Infinite Sets of Signals and Actions (Preliminary and Incomplete)

Reflections of planar convex bodies

Rearrangements and polar factorisation of countably degenerate functions G.R. Burton, School of Mathematical Sciences, University of Bath, Claverton D

LECTURE 3: SMOOTH FUNCTIONS

An Introduction to Non-Standard Analysis and its Applications

FUNDAMENTALS OF REAL ANALYSIS

The small ball property in Banach spaces (quantitative results)

Decision theory. 1 We may also consider randomized decision rules, where δ maps observed data D to a probability distribution over

Sequential Decisions

18.S097 Introduction to Proofs IAP 2015 Lecture Notes 1 (1/5/2015)

(convex combination!). Use convexity of f and multiply by the common denominator to get. Interchanging the role of x and y, we obtain that f is ( 2M ε

Games on Manifolds. Noah Stein

VAUGHT S THEOREM: THE FINITE SPECTRUM OF COMPLETE THEORIES IN ℵ 0. Contents

arxiv: v1 [math.fa] 14 Jul 2018

Abstract Measure Theory

Mircea Balaj. Comment.Math.Univ.Carolinae 42,4 (2001)

Introduction. Itaï Ben-Yaacov C. Ward Henson. September American Institute of Mathematics Workshop. Continuous logic Continuous model theory

Chapter 5 Random vectors, Joint distributions. Lectures 18-23

Axioms of separation

Topology Homework Assignment 1 Solutions

Convexity in R N Supplemental Notes 1

Existence of Optima. 1

Introduction to Topology

Lecture 4 Lebesgue spaces and inequalities

1 Lesson 1: Brunn Minkowski Inequality

Model Theory MARIA MANZANO. University of Salamanca, Spain. Translated by RUY J. G. B. DE QUEIROZ

Announcements. Proposals graded

Chapter ML:IV. IV. Statistical Learning. Probability Basics Bayes Classification Maximum a-posteriori Hypotheses

Short Introduction to Admissible Recursion Theory

Convergence of a Generalized Midpoint Iteration

SMALL SUBSETS OF THE REALS AND TREE FORCING NOTIONS

5 Set Operations, Functions, and Counting

NONSTANDARD ANALYSIS AND AN APPLICATION TO THE SYMMETRIC GROUP ON NATURAL NUMBERS

R N Completeness and Compactness 1

MASTERS EXAMINATION IN MATHEMATICS SOLUTIONS

Testing Statistical Hypotheses

Computer Vision Group Prof. Daniel Cremers. 2. Regression (cont.)

Spring 2014 Advanced Probability Overview. Lecture Notes Set 1: Course Overview, σ-fields, and Measures

Foundations. September 4, Foundations. A Model Theoretic Perspective. John T. Baldwin. sociology. Thesis I. Thesis II. A Concept Analysis

DEFINITION. A theory is a set of sentences. A type is a set of formulas x in some fixed variable x. A structure A is a model of a theory if all the

Nonstandard methods in Lie theory

STAT 7032 Probability Spring Wlodek Bryc

Stat 643 Problems. a) Show that V is a sub 5-algebra of T. b) Let \ be an integrable random variable. Find EÐ\lVÑÞ

ON THE REGULARITY OF SAMPLE PATHS OF SUB-ELLIPTIC DIFFUSIONS ON MANIFOLDS

Preference and Utility

MAT3500/ Mandatory assignment 2013 Solutions

Exploring the infinite : an introduction to proof and analysis / Jennifer Brooks. Boca Raton [etc.], cop Spis treści

Connectedness. Proposition 2.2. The following are equivalent for a topological space (X, T ).

Topology, Math 581, Fall 2017 last updated: November 24, Topology 1, Math 581, Fall 2017: Notes and homework Krzysztof Chris Ciesielski

Elementary Point-Set Topology

Bayesian statistics: Inference and decision theory

LECTURE 5 NOTES. n t. t Γ(a)Γ(b) pt+a 1 (1 p) n t+b 1. The marginal density of t is. Γ(t + a)γ(n t + b) Γ(n + a + b)

Finite dimensional topological vector spaces

Hyperreal-Valued Probability Measures Approximating a Real-Valued Measure

4 Invariant Statistical Decision Problems

METRIC SPACES. Contents

Contents: 1. Minimization. 2. The theorem of Lions-Stampacchia for variational inequalities. 3. Γ -Convergence. 4. Duality mapping.

Finite dimensional topological vector spaces

Game Theory: Lecture 2

i=1 β i,i.e. = β 1 x β x β 1 1 xβ d

Handbook of Logic and Proof Techniques for Computer Science

An inverse of Sanov s theorem

A Very Brief Summary of Bayesian Inference, and Examples

Economics 574 Appendix to 13 Ways

Theorems. Theorem 1.11: Greatest-Lower-Bound Property. Theorem 1.20: The Archimedean property of. Theorem 1.21: -th Root of Real Numbers

A Discrete Duality Between Nonmonotonic Consequence Relations and Convex Geometries

Lecture 4. f X T, (x t, ) = f X,T (x, t ) f T (t )

Parametric Inference Maximum Likelihood Inference Exponential Families Expectation Maximization (EM) Bayesian Inference Statistical Decison Theory

EXTENSION OF LYAPUNOV S CONVEXITY THEOREM TO SUBRANGES

Economics 204 Fall 2012 Problem Set 3 Suggested Solutions

Transcription:

On Extended Admissible Procedures and their Nonstandard Bayes Risk Haosui "Kevin" Duanmu and Daniel M. Roy University of Toronto + Fourth Bayesian, Fiducial, and Frequentist Conference Harvard University

Statistical Decision Theory (Wald) Defn (risk). Defn (convex extension). Defn. minimax iff Defn. Bayes iff Defn. admissible iff and

Thm. (Wald) If the parameter space admissible implies Bayes. Proof. Let. Then is finite, then Define the risk set of : Let be admissible and define lower quantant. Claim. and intersect at one point,.

Generalizing admissibility and Bayes Defn. extended admissible iff Defn. extended Bayes iff Thm (BG54). Extended Bayes implies extended admissible. Proof by picture. Via contrapositive: not extended admissible implies not extended Bayes.

Set-theoretic relationships : minimax : admissible : extended admissible : Bayes : extended Bayes Thm. 5.5.3 (BG54). Assuming only that is finite, the diagram (right) cannot be simplified. Thm. 5.5.1 (BG54). Assuming only bounded risk, the diagram (left) cannot be simplified. Thm. 5.5.3 (BG54). Assuming only bounded risk and minimax=maximin for every derived game, the diagram (right) cannot be simplified.

Results beyond bounded risk / finite Thm. (Wald, LeCam, Brown) Assume admits strictly positive densities with respect to a -finite measure. Assume the action space is a closed convex subset of Euclidean space. Assume the loss is lower semicontinuous and strictly convex in for every, and satisfies Then, for every admissible decision procedure, there exists a sequence priors with support on a finite set, such that of where is a Bayes procedure with respect to. Defn. Suppose admits densities with respect to a -finite measure. An estimator is generalized Bayes with respect to a -finite measure on if it minimizes the unnormalized posterior risk for - a.e.. Thm. (Berger Srinivasan) Assume is a multi-dimensional exponential family, and that the loss is jointly continuous, strictly convex in for every, and satisfies Then every admissible estimator is generalized Bayes.

First-order logic Example. (bounded quantifier formulas) Example. (normal-location model) Define. Then is Bayes w.r.t. prior, for all. In logic, Nonstandard analysis Three mechanisms: extension, transfer, -saturation. Defn (internal). An element is internal if for some standard set. Defn ( -saturation). Let be a collection of internal sets, where has cardinality less than. Then is nonempty if is nonempty for every finite.

Some elementary applications Example (transfer). What do elements of look like? They satisfy exactly the first order properties satisfied by the standard reals,. How about the relation? By transfer, we know it's a total order. Example. (normal-location model) By transfer, holds, i.e., Key point: Without saturation, we get nothing new. Example (saturation). Assume -saturation. Let. Then and. In contrast,. An element is an infinite positive real and so. is an infinitesimal, i.e., for all. Note that, hence, by transfer. Defn. For, write if is infinitesimal. Example (normal-location model). Recall the MLE estimator, infinite,.. For

Recap: Bayes optimality Defn. Let and. 1. A prior is a probability measure. 2. The average risk of with respect to a prior is 3. is Bayes (optimal) among if, for some prior, and for all. Thm. Bayes among extended admissible among. Nonstandard Bayes optimality Defn. Let and. 1. A nonstandard prior is a probability measure. 2. The internal average risk of with respect to a nonstandard prior is 3. is nonstandard Bayes among if, for some nonstandard prior, and for all. Theorem (Haosui Roy). nonstandard Bayes among extended admissible among.

Example (normal-location model). Choose infinite. By transfer, is Bayes w.r.t.. By transfer,. By transfer,. But implies is nonstandard Bayes among. Hence it is extended admissible among.

*finite (aka hyperfinite) sets Defn. A set is hyperfinite if there exists an internal bijection between and for some. This is unique and is called the internal cardinality of. Lemma. There is a hypefinite set such that. Proof. For every finite set, let be the sentence By saturation, there is a hyperfinite set containing Main result Theorem (Haosui Roy). extended admissible among nonstandard Bayes among. as a subset. Proof. By saturation, exists hyperfinite containing. Let. Define the hyperdiscretized risk set induced by : Note is not convex over. where ranges over finite subsets of. Note: and is convex over and

For and, define Claim. If is extended admissible among, then and do not intersect.

A standard result via nonstandard theory Defn Let. If there exists such that then is called the standard part of, denoted. is a partial function from to, called the standard part map. Example. Consider a set :, in general, is not an internal set. Internal probability measures standard probability measures Theorem (Cutland Neves Oliveira Sousa-Pinto). Let be a compact Hausdorff Borel measurable space. Let be an internal probability measure on. Define by Then is a standard Borel probability measure. is called the push down of. Example. Let. Let be an internal probability measure concentrating on. Then is the degenerate measure on. Theorem (Haosui Roy). Suppose is compact Hausdorff and risk functions are continuous. Let be an internal probability measure on and let be its push-down. Let be a standard decision procedure. Then. Theorem (Haosui Roy). Suppose is compact Hausdorff and risk functions are continuous. Then is extended admissible among if and only if is Bayes among.

A nonstandard Blyth's method Defn. For, write when for all. Defn. Let be a metric space, and let. An internal probability measure on is -regular if, for every and non-infinitesimal,. Theorem. Suppose is metric, risk functions are continuous, and let and. If there exists such that is within of the optimal Bayes risk among with respect to some -regular nonstandard prior, then is admissible among. Example (normal-location problem). Choose Recall that is "flat" on : infinite. Bayes risk of is. Bayes risk of is. Thus, excess risk is. For, is -regular. Thus is admissible for, as is well known. For, is not -regular. Thus theorem is silent. Thm (Stein). is admissible only for.

Summary of main results Theorem (Haosui Roy). extended admissible among nonstandard Bayes among. Lemma (Haosui Roy). extended Bayes among nonstandard Bayes among. Lemma (Haosui Roy). generalized Bayes among nonstandard Bayes among. Conclusion By working in a saturated models of the reals, a notion of Bayes optimality aligns perfectly with extended admissibility. Our results come without conditions other than saturation, and so they can be used to study infinite dimensional nondominated models with unbounded risk beyond the remit of existing results The nonstandard Blyth method points the way towards necessary conditions for admissibility. There's hope that more of frequentist and Bayesian theory can be aligned using similar techniques.

Another example Example. Let and. Define by for and. Let, for. Consider the loss function. Every nonrandomized decision procedure with a pair. Note: Loss is merely lower semicontinous. Model also not continuous. Thm. is an admissible non-bayes estimator. corresponds Thm. is nonstandard Bayes with respect to any prior concentrating on some infinitesimal. Lem. is a generalized Bayesian estimator with respect to the improper prior.