LECTURE 7 NOTES. x n. d x if. E [g(x n )] E [g(x)]

Size: px
Start display at page:

Download "LECTURE 7 NOTES. x n. d x if. E [g(x n )] E [g(x)]"

Transcription

1 LECTURE 7 NOTES 1. Convergence of random variables. Before delving into the large samle roerties of the MLE, we review some concets from large samle theory. 1. Convergence in robability: x n x if, for any δ > 0, P( x n x > δ) 0 2. Convergence in distribution: x n d x if E [g(x n )] E [g(x)] for all bounded, continuous functions g : X R. If x n and x are real d valued, x n x is equivalent to F n (t) F (t) at the continuity oints of F, where F n is the CDF of x n and F is the CDF of x. Convergence in robability is the stronger notion of convergence. In articular, x n x imlies xn d x. There is a artial converse: if xn d c, where c is a constant, then x n c. For convenience, we say o P and O P for convergence and boundedness in robability. Recall for a non-negative sequence of scalars {a n }, 1. a n o(1) means a n 0 as n grows. 2. a n o(b n ) for a non-negative sequence {b n } means an b n Further, o(1). 1. a n O(1) means {a n } is bounded; i.e. there is some (large) M > 0 such that a n 2. a n O(b n ) (for a non-negative sequence {b n }) means an b n O(1). There are robabilistic analogues of the receding statements. Let x n be a sequence of non-negative real-valued random variables. 1. x n o P (1) means x n 0 as n grows. 2. x n o P (b n ) for a non-negative sequence {b n } means xn b n o P (1). 3. x n o P (y n ) for a sequence of non-negative random variables {y n } imlies xn y n o P (1). 1

2 2 STAT 201B Further, 1. x n O P (1) means {x n } is bounded in robability; i.e. for any ɛ > 0, there is M > 0 such that su n P(x n > M) ɛ. 2. x n O P (b n ) means xn b n O P (1). 3. x n O P (y n ) means xn y n O P (1). The aforementioned convergence roerties are reserved under transformations. For any continuous fu- Theorem 1.1 (Continuous maing theorem). nction g, 1. x n x imlies g(xn ) g(x). d 2. x n x imlies g(xn ) d g(x). Theorem 1.2 (Slutsky s theorem). Let x n d x and xn d c. We have 1. x n + y n d x + c, 2. x n y n d cx. More generally, g(x n, y n ) d g(x, c), for any continuous function g. Proof sketch. The key ste in the roof of Slutsky s theorem shows that x n d x and yn d c imlies (xn, y n ) d (x, c). We aeal to the continuous maing theorem to obtain the stated conclusion. d d We remark that x n x and yn y (convergence of the marginal distributions) generally does not imly x n + y n x + y; convergence of the joint d distribution is crucial. Lemma 1.3 (delta method). grows. For any differentiable g, Let a n (x n c) d x, where a n as n a n (g(x n ) g(c)) d g(c) T x. Proof. By Taylor s theorem, g(x n ) g(c) = g(c) T (x n c) + o( x n c 2 ) for any sequence {x n } c. Since a n (x n c) d x, by Slutsky s theorem, we deduce x n c d 0, which imlies x n c 0. Thus g(x n ) g(c) = g(c) T (x n c) + o P ( x n c 2 ),

3 LECTURE 7 NOTES 3 which imlies a n (g(x n ) g(c)) = a n g(c) T (x n c) + o P (a n x n c 2 ) = a n g(c) T (x n c) + o P (1). Rearranging, a n (g(x n ) g(c)) a n g(c) T (x n c) 0. By the continuous maing theorem, a n g(c) T (x n c) d g(c) T x, which allows us to conclude a n (g(x n ) g(c)) d g(c) T x. Corollary 1.4. Let n(x n θ) d N (0, Σ). We have n(g(xn ) g(θ)) d N ( 0, g(θ) T Σ g(θ) ). Corollary 1.4 is by far the most common alication of the delta method to the extent that some sources go as far as to call it the delta method. 2. Consistency. Most asymtotic statements concern a sequence of estimators {ˆθ n } indexed by samle size n. In the usual asymtotic framework, we observe i.i.d. random variables {x i } and consider the sequence of estimators ˆθ n = ˆθ ( {x} i [n] ) that consists of the same estimation rocedure erformed on growing samles. For examle, in the coin tossing investigation, the sequence of MLE s of consists of the means of growing samles: ˆ n = 1 x i. n i [n] In statistical arlance, statements about asymtotic roerties of an estimator (instead of a sequence of estimators) are common. Such statements are actually about the underlying estimation rocedure, and should be interreted as such. Definition 2.1. A sequence of oint estimators {ˆθ n } of θ Θ is consistent if ˆθ converges in robability to θ.

4 4 STAT 201B We remark that since convergence in mean squared imlies convergence in robability, a sufficient condition for consistency is ] 2 ] [ ] bias θ [ˆθn + varθ [ˆθn = Eθ ˆθ n θ Consistency is the basic notion of correctness for a oint estimator. It ensures an estimator converges to its target as the samle size grows. For simle estimators, such as those that are smooth functions of the observations {x i } i [n], consistency is often a direct consequence of the law of large numbers (LLN). In the coin tossing investigation, the (weak) LLN imlies ˆ n. Further, by the continuous maing theorem, the estimator g(ˆθ n ) is a consistent estimator of g(θ ) as long as ˆθ n is a consistent estimator of θ. i.i.d. Examle 2.2. Let x i Ber( ). Recall the MLE of is x n. By the law of large numbers, x n, which shows that the MLE is consistent. i.i.d. Examle 2.3. Let x i unif(0, θ ). Recall the MLE of θ is x (n). It is ossible to show the MLE is consistent by direct calculation. Indeed, ( P θ θ x (n) > ɛ ) ( = P θ maxi [n] x i < θ ɛ ) = ( P θ xi < θ ɛ ) which converges to zero as n grows. i [n] = i [n] θ ɛ θ, We turn our attention to establishing the consistency of the MLE. The MLE is an examle of an extremum estimator: it is the maximizer of the log-likelihood l x (θ). Since the log-likelihood deends on the observations x, it is a random function (of θ). The basic idea is to show that 1. the log-likelihood converges uniformly to a oulation objective, 2. the true arameter θ is the maximizer of the oulation objective. The first condition is known as a uniform law of large numbers. Definition 2.4. A set of (real-valued) functions F satisfies a uniform law of large numbers (ULLN) if 1 su f F n i [n] f(x i) E [ f(x 1 ) ] 0.

5 LECTURE 7 NOTES 5 We remark that as long as E [ f ] is finite, the (weak) LLN ensures 1 n i [n] f(x i) E [ f(x 1 ) ] 0 for each f F. A ULLN is a stronger statement that says the convergence is uniform over F. Why is the stronger statement necessary? Consider the MLE ˆθ n of θ in a arametric model. The LLN ensures 1 n i [n] l x i (θ) [ E θ lx1 (θ) ] for any θ Θ. Unfortunately, ointwise convergence of the log-likelihood is not even enough to ensure the convergence of the maximum, much less the maximizer, which is required to conclude the consistency of the MLE. Theorem 2.5. Let {x i } be i.i.d. random variables, and f θ (x) be their density for some θ Θ. Assume 1. f θ (x) a.e. = f θ (x) if and only if θ = θ, 1 2. su θ Θ (θ) lx1 (θ) ] 0, then θ is the unique maximizer of lx1 (θ) ] on Θ and lx1 (θ ) ] lx1 (ˆθ n ) ] 0. Proof. The roof consists of two arts: 1. show that θ is the unique maximizer of lx1 (θ) ] on Θ. 2. show that the maximum of 1 (θ)) converges to the maximum of lx1 (θ) ]. We begin by showing lx1 (θ ) ] lx1 (θ) ] > 0 for any θ θ. By the linearity of the exectation, lx1 (θ ) ] [ E θ lx1 (θ) ] = log fθ (x 1 ) ] [ E θ log fθ (x 1 ) ] [ [ ]] fθ (x 1 ) = E θ log, f θ (x 1 )

6 6 STAT 201B which is the Kullback-Leibler divergence between F θ and F θ. The KL divergence is always non-negative. Indeed, by Jensen s inequality, [ [ ]] [ [ ]] fθ (x 1 ) fθ (x 1 ) E θ log = E θ log f θ (x 1 ) f θ (x 1 ) ( ) log f θ (x 1 )dx 1 = log 1. X Further, since log x is strictly concave, the inequality is strict unless [ ] f θ (x 1 ) a.s. fθ (x 1 ) = E θ, f θ (x 1 ) f θ (x 1 ) which, by the first assumtion, is not ossible at any θ θ. Thus θ is the unique maximizer of lx1 (θ) ]. To comlete the roof, we show that (2.1) lx1 (θ ) ] lx1 (ˆθ n ) ] 0. We exand lx1 (ˆθ n ) ] lx1 (θ ) ] : lx1 (θ ) ] [ E θ lx1 (ˆθ n ) ] = lx1 (θ ) ] 1 (θ )) + 1 (θ ) 1 (ˆθ n ) }{{} + 1 n i [n] l [ x i (ˆθ n ) E θ lx1 (ˆθ n ) ] lx1 (θ ) ] 1 (θ )) + 0 since ˆθ n maximizes i [n] lx i (θ) 1 n i [n] l x i (ˆθ n ) lx1 (ˆθ n ) ], The first term is o P (1) by the LLN, and the second term is also o P (1) by the uniform convergence assumtion, which establishes (2.1). Theorem 2.5 catures the essence of most results on the consistency of the MLE. The key ingredients are identifiability and uniform convergence. The first assumtion of Theorem 2.5 is an identifiability assumtion. It ensures the maximizer of E θ [ lx1 (θ) ] ] is unique by disallowing two arameters θ 1, θ 2 to corresond to the same density. The second assumtion is a ULLN for the set of log-likelihood functions {log f θ (x) : θ Θ}. Here is an examle of such a ULLN. Lemma 2.6. Assume 1. l x (θ) is a continuous function (of θ) for any x X

7 LECTURE 7 NOTES 7 2. there is an enveloe function b such that E [ b(x) ] < and l x (θ) b(x) for any θ Θ. If Θ is comact, then 1 su θ Θ (θ) E [ l x1 (θ) ] 0. We remark that Theorem 2.5 establishes the excess risk of the MLE lx1 (θ ) ] [ E θ lx1 (ˆθ n ) ] vanishes. However, sans other assumtions, we cannot conclude ˆθ n θ. Most results on the consistency of the MLE state a laundry list of assumtions, most of which are required to establish uniform convergence and to conclude ˆθ n θ from (2.1). For comleteness, we state such a result. Corollary 2.7. Let {x i } be i.i.d. random variables, and f θ (x) be their density for some θ in a comact arameter sace Θ. If 1. f θ (x) = f θ (x) if and only if θ = θ, 2. l x (θ) is continuous for any x X 3. there is a function b : X R such that l x (θ) b(x) for any θ Θ and b(x1 ) ] <. then the MLE is consistent; i.e. arg max θ Θ i [n] l x i (θ) θ. Proof. First, we show that lx1 (θ) ] is continuous under the stated assumtions. For any sequence {θ n } converging to θ, {l x (θ n )} l x (θ) by the continuity of l x. Since l x is dominated by b and b(x1 ) ] is finite, lx1 (θ n ) ] [ E θ lx1 (θ) ] by the dominated convergence theorem. To comlete the roof, we show that ˆθ n θ. For any δ > 0, consider su θ Θ: θ θ 2 δ lx1 (θ) ]. Since lx1 (θ) ] is continuous and {θ Θ : θ θ 2 δ} is comact, the su is attained at some θ {θ Θ : θ θ 2 δ}. By Theorem 2.5, θ is the unique maximizer of lx1 (θ) ] on Θ. Thus ɛ := lx1 (θ ) ] lx1 ( θ) ] > 0.

8 8 STAT 201B 1 When su θ Θ n i [n] g [ x i (θ) E θ lx1 (θ) ] < ɛ su θ Θ 1 n i [n] l x i (θ) 1 n 2, i [n] l x i (θ ) > lx1 (θ ) ] ɛ 2 = lx1 ( θ) ] + ɛ 2 su θ Θ: θ θ 2 δ 1 n i [n] l x i (θ) which imlies ˆθn θ 2 δ. By Lemma 2.6, ( 1 P θ su θ Θ n i [n] g [ x i (θ) E θ lx1 (θ) ] ) < ɛ 2 We conclude P θ ( ˆθn θ 2 δ ) 1. 1, The takeaways from the lengthy receding section on the consistency of the MLE are 1. as long as 1 (θ) converges to E [ l x1 (θ) ] uniformly on θ Θ, the risk of the MLE converges to the risk of θ. Establishing uniform convergence is non-trivial, and we do not dwell on the toic. 2. to show consistency ˆθ n θ requires additional technical conditions on the arametric model. There are many aroaches to establishing consistency: we followed Wald s original aroach. To wra u, we consider the consequences of an incorrect model; i.e. the arametric model F := {f θ (x) : X R : θ Θ} does not include the generative distribution of the observations. We know that the MLE converges to the maximizer of E F [ log fθ (x) ] or equivalently, the minimizer of E F [ log f(x) ] EF [ log fθ (x) ], where F is the generative distribution and f(x) is its density. We recognize the difference as the KL divergence between F and F θ : [ ]] f(x) = E F [log. f θ (x) Thus the MLE generally converges to a θ Θ that is the best aroximation of F in the arametric model. That is F θ has the smallest KL divergence to F among the distributions in the arametric model. Yuekai Sun Berkeley, California December 4, 2015

Maximum Likelihood Asymptotic Theory. Eduardo Rossi University of Pavia

Maximum Likelihood Asymptotic Theory. Eduardo Rossi University of Pavia Maximum Likelihood Asymtotic Theory Eduardo Rossi University of Pavia Slutsky s Theorem, Cramer s Theorem Slutsky s Theorem Let {X N } be a random sequence converging in robability to a constant a, and

More information

1 Extremum Estimators

1 Extremum Estimators FINC 9311-21 Financial Econometrics Handout Jialin Yu 1 Extremum Estimators Let θ 0 be a vector of k 1 unknown arameters. Extremum estimators: estimators obtained by maximizing or minimizing some objective

More information

Elements of Asymptotic Theory. James L. Powell Department of Economics University of California, Berkeley

Elements of Asymptotic Theory. James L. Powell Department of Economics University of California, Berkeley Elements of Asymtotic Theory James L. Powell Deartment of Economics University of California, Berkeley Objectives of Asymtotic Theory While exact results are available for, say, the distribution of the

More information

Elements of Asymptotic Theory. James L. Powell Department of Economics University of California, Berkeley

Elements of Asymptotic Theory. James L. Powell Department of Economics University of California, Berkeley Elements of Asymtotic Theory James L. Powell Deartment of Economics University of California, Berkeley Objectives of Asymtotic Theory While exact results are available for, say, the distribution of the

More information

Elementary Analysis in Q p

Elementary Analysis in Q p Elementary Analysis in Q Hannah Hutter, May Szedlák, Phili Wirth November 17, 2011 This reort follows very closely the book of Svetlana Katok 1. 1 Sequences and Series In this section we will see some

More information

Lecture 2: Consistency of M-estimators

Lecture 2: Consistency of M-estimators Lecture 2: Instructor: Deartment of Economics Stanford University Preared by Wenbo Zhou, Renmin University References Takeshi Amemiya, 1985, Advanced Econometrics, Harvard University Press Newey and McFadden,

More information

Lecture 3 January 16

Lecture 3 January 16 Stats 3b: Theory of Statistics Winter 28 Lecture 3 January 6 Lecturer: Yu Bai/John Duchi Scribe: Shuangning Li, Theodor Misiakiewicz Warning: these notes may contain factual errors Reading: VDV Chater

More information

General Linear Model Introduction, Classes of Linear models and Estimation

General Linear Model Introduction, Classes of Linear models and Estimation Stat 740 General Linear Model Introduction, Classes of Linear models and Estimation An aim of scientific enquiry: To describe or to discover relationshis among events (variables) in the controlled (laboratory)

More information

On Wald-Type Optimal Stopping for Brownian Motion

On Wald-Type Optimal Stopping for Brownian Motion J Al Probab Vol 34, No 1, 1997, (66-73) Prerint Ser No 1, 1994, Math Inst Aarhus On Wald-Tye Otimal Stoing for Brownian Motion S RAVRSN and PSKIR The solution is resented to all otimal stoing roblems of

More information

Econometrics I. September, Part I. Department of Economics Stanford University

Econometrics I. September, Part I. Department of Economics Stanford University Econometrics I Deartment of Economics Stanfor University Setember, 2008 Part I Samling an Data Poulation an Samle. ineenent an ientical samling. (i.i..) Samling with relacement. aroximates samling without

More information

Hypothesis Testing. 1 Definitions of test statistics. CB: chapter 8; section 10.3

Hypothesis Testing. 1 Definitions of test statistics. CB: chapter 8; section 10.3 Hypothesis Testing CB: chapter 8; section 0.3 Hypothesis: statement about an unknown population parameter Examples: The average age of males in Sweden is 7. (statement about population mean) The lowest

More information

Lecture 3 Consistency of Extremum Estimators 1

Lecture 3 Consistency of Extremum Estimators 1 Lecture 3 Consistency of Extremum Estimators 1 This lecture shows how one can obtain consistency of extremum estimators. It also shows how one can find the robability limit of extremum estimators in cases

More information

Asymptotically Optimal Simulation Allocation under Dependent Sampling

Asymptotically Optimal Simulation Allocation under Dependent Sampling Asymtotically Otimal Simulation Allocation under Deendent Samling Xiaoing Xiong The Robert H. Smith School of Business, University of Maryland, College Park, MD 20742-1815, USA, xiaoingx@yahoo.com Sandee

More information

Lecture 6. 2 Recurrence/transience, harmonic functions and martingales

Lecture 6. 2 Recurrence/transience, harmonic functions and martingales Lecture 6 Classification of states We have shown that all states of an irreducible countable state Markov chain must of the same tye. This gives rise to the following classification. Definition. [Classification

More information

Lecture 23 Maximum Likelihood Estimation and Bayesian Inference

Lecture 23 Maximum Likelihood Estimation and Bayesian Inference Lecture 23 Maximum Likelihood Estimation and Bayesian Inference Thais Paiva STA 111 - Summer 2013 Term II August 7, 2013 1 / 31 Thais Paiva STA 111 - Summer 2013 Term II Lecture 23, 08/07/2013 Lecture

More information

Elementary theory of L p spaces

Elementary theory of L p spaces CHAPTER 3 Elementary theory of L saces 3.1 Convexity. Jensen, Hölder, Minkowski inequality. We begin with two definitions. A set A R d is said to be convex if, for any x 0, x 1 2 A x = x 0 + (x 1 x 0 )

More information

Introduction to Banach Spaces

Introduction to Banach Spaces CHAPTER 8 Introduction to Banach Saces 1. Uniform and Absolute Convergence As a rearation we begin by reviewing some familiar roerties of Cauchy sequences and uniform limits in the setting of metric saces.

More information

Chapter 7: Special Distributions

Chapter 7: Special Distributions This chater first resents some imortant distributions, and then develos the largesamle distribution theory which is crucial in estimation and statistical inference Discrete distributions The Bernoulli

More information

4. Score normalization technical details We now discuss the technical details of the score normalization method.

4. Score normalization technical details We now discuss the technical details of the score normalization method. SMT SCORING SYSTEM This document describes the scoring system for the Stanford Math Tournament We begin by giving an overview of the changes to scoring and a non-technical descrition of the scoring rules

More information

Statics and dynamics: some elementary concepts

Statics and dynamics: some elementary concepts 1 Statics and dynamics: some elementary concets Dynamics is the study of the movement through time of variables such as heartbeat, temerature, secies oulation, voltage, roduction, emloyment, rices and

More information

LECTURE 5 NOTES. n t. t Γ(a)Γ(b) pt+a 1 (1 p) n t+b 1. The marginal density of t is. Γ(t + a)γ(n t + b) Γ(n + a + b)

LECTURE 5 NOTES. n t. t Γ(a)Γ(b) pt+a 1 (1 p) n t+b 1. The marginal density of t is. Γ(t + a)γ(n t + b) Γ(n + a + b) LECTURE 5 NOTES 1. Bayesian point estimators. In the conventional (frequentist) approach to statistical inference, the parameter θ Θ is considered a fixed quantity. In the Bayesian approach, it is considered

More information

B8.1 Martingales Through Measure Theory. Concept of independence

B8.1 Martingales Through Measure Theory. Concept of independence B8.1 Martingales Through Measure Theory Concet of indeendence Motivated by the notion of indeendent events in relims robability, we have generalized the concet of indeendence to families of σ-algebras.

More information

Sums of independent random variables

Sums of independent random variables 3 Sums of indeendent random variables This lecture collects a number of estimates for sums of indeendent random variables with values in a Banach sace E. We concentrate on sums of the form N γ nx n, where

More information

Analysis of some entrance probabilities for killed birth-death processes

Analysis of some entrance probabilities for killed birth-death processes Analysis of some entrance robabilities for killed birth-death rocesses Master s Thesis O.J.G. van der Velde Suervisor: Dr. F.M. Sieksma July 5, 207 Mathematical Institute, Leiden University Contents Introduction

More information

Introduction to Probability and Statistics

Introduction to Probability and Statistics Introduction to Probability and Statistics Chater 8 Ammar M. Sarhan, asarhan@mathstat.dal.ca Deartment of Mathematics and Statistics, Dalhousie University Fall Semester 28 Chater 8 Tests of Hyotheses Based

More information

Lecture: Condorcet s Theorem

Lecture: Condorcet s Theorem Social Networs and Social Choice Lecture Date: August 3, 00 Lecture: Condorcet s Theorem Lecturer: Elchanan Mossel Scribes: J. Neeman, N. Truong, and S. Troxler Condorcet s theorem, the most basic jury

More information

ECON 4130 Supplementary Exercises 1-4

ECON 4130 Supplementary Exercises 1-4 HG Set. 0 ECON 430 Sulementary Exercises - 4 Exercise Quantiles (ercentiles). Let X be a continuous random variable (rv.) with df f( x ) and cdf F( x ). For 0< < we define -th quantile (or 00-th ercentile),

More information

Convex Analysis and Economic Theory Winter 2018

Convex Analysis and Economic Theory Winter 2018 Division of the Humanities and Social Sciences Ec 181 KC Border Conve Analysis and Economic Theory Winter 2018 Toic 16: Fenchel conjugates 16.1 Conjugate functions Recall from Proosition 14.1.1 that is

More information

Convergence of random variables, and the Borel-Cantelli lemmas

Convergence of random variables, and the Borel-Cantelli lemmas Stat 205A Setember, 12, 2002 Convergence of ranom variables, an the Borel-Cantelli lemmas Lecturer: James W. Pitman Scribes: Jin Kim (jin@eecs) 1 Convergence of ranom variables Recall that, given a sequence

More information

IMPROVED BOUNDS IN THE SCALED ENFLO TYPE INEQUALITY FOR BANACH SPACES

IMPROVED BOUNDS IN THE SCALED ENFLO TYPE INEQUALITY FOR BANACH SPACES IMPROVED BOUNDS IN THE SCALED ENFLO TYPE INEQUALITY FOR BANACH SPACES OHAD GILADI AND ASSAF NAOR Abstract. It is shown that if (, ) is a Banach sace with Rademacher tye 1 then for every n N there exists

More information

ON THE NORM OF AN IDEMPOTENT SCHUR MULTIPLIER ON THE SCHATTEN CLASS

ON THE NORM OF AN IDEMPOTENT SCHUR MULTIPLIER ON THE SCHATTEN CLASS PROCEEDINGS OF THE AMERICAN MATHEMATICAL SOCIETY Volume 00, Number 0, Pages 000 000 S 000-9939XX)0000-0 ON THE NORM OF AN IDEMPOTENT SCHUR MULTIPLIER ON THE SCHATTEN CLASS WILLIAM D. BANKS AND ASMA HARCHARRAS

More information

Problem Set 2 Solution

Problem Set 2 Solution Problem Set 2 Solution Aril 22nd, 29 by Yang. More Generalized Slutsky heorem [Simle and Abstract] su L n γ, β n Lγ, β = su L n γ, β n Lγ, β n + Lγ, β n Lγ, β su L n γ, β n Lγ, β n + su Lγ, β n Lγ, β su

More information

Mollifiers and its applications in L p (Ω) space

Mollifiers and its applications in L p (Ω) space Mollifiers and its alications in L () sace MA Shiqi Deartment of Mathematics, Hong Kong Batist University November 19, 2016 Abstract This note gives definition of mollifier and mollification. We illustrate

More information

Real Analysis 1 Fall Homework 3. a n.

Real Analysis 1 Fall Homework 3. a n. eal Analysis Fall 06 Homework 3. Let and consider the measure sace N, P, µ, where µ is counting measure. That is, if N, then µ equals the number of elements in if is finite; µ = otherwise. One usually

More information

GENERICITY OF INFINITE-ORDER ELEMENTS IN HYPERBOLIC GROUPS

GENERICITY OF INFINITE-ORDER ELEMENTS IN HYPERBOLIC GROUPS GENERICITY OF INFINITE-ORDER ELEMENTS IN HYPERBOLIC GROUPS PALLAVI DANI 1. Introduction Let Γ be a finitely generated grou and let S be a finite set of generators for Γ. This determines a word metric on

More information

Lecture 4: Law of Large Number and Central Limit Theorem

Lecture 4: Law of Large Number and Central Limit Theorem ECE 645: Estimation Theory Sring 2015 Instructor: Prof. Stanley H. Chan Lecture 4: Law of Large Number and Central Limit Theorem (LaTeX reared by Jing Li) March 31, 2015 This lecture note is based on ECE

More information

MATH 829: Introduction to Data Mining and Analysis Consistency of Linear Regression

MATH 829: Introduction to Data Mining and Analysis Consistency of Linear Regression 1/9 MATH 829: Introduction to Data Mining and Analysis Consistency of Linear Regression Dominique Guillot Deartments of Mathematical Sciences University of Delaware February 15, 2016 Distribution of regression

More information

Approximating min-max k-clustering

Approximating min-max k-clustering Aroximating min-max k-clustering Asaf Levin July 24, 2007 Abstract We consider the roblems of set artitioning into k clusters with minimum total cost and minimum of the maximum cost of a cluster. The cost

More information

Location of solutions for quasi-linear elliptic equations with general gradient dependence

Location of solutions for quasi-linear elliptic equations with general gradient dependence Electronic Journal of Qualitative Theory of Differential Equations 217, No. 87, 1 1; htts://doi.org/1.14232/ejqtde.217.1.87 www.math.u-szeged.hu/ejqtde/ Location of solutions for quasi-linear ellitic equations

More information

System Reliability Estimation and Confidence Regions from Subsystem and Full System Tests

System Reliability Estimation and Confidence Regions from Subsystem and Full System Tests 009 American Control Conference Hyatt Regency Riverfront, St. Louis, MO, USA June 0-, 009 FrB4. System Reliability Estimation and Confidence Regions from Subsystem and Full System Tests James C. Sall Abstract

More information

Improved Bounds on Bell Numbers and on Moments of Sums of Random Variables

Improved Bounds on Bell Numbers and on Moments of Sums of Random Variables Imroved Bounds on Bell Numbers and on Moments of Sums of Random Variables Daniel Berend Tamir Tassa Abstract We rovide bounds for moments of sums of sequences of indeendent random variables. Concentrating

More information

1 Martingales. Martingales. (Ω, B, P ) is a probability space.

1 Martingales. Martingales. (Ω, B, P ) is a probability space. Martingales January 8, 206 Debdee Pati Martingales (Ω, B, P ) is a robability sace. Definition. (Filtration) filtration F = {F n } n 0 is a collection of increasing sub-σfields such that for m n, we have

More information

On Doob s Maximal Inequality for Brownian Motion

On Doob s Maximal Inequality for Brownian Motion Stochastic Process. Al. Vol. 69, No., 997, (-5) Research Reort No. 337, 995, Det. Theoret. Statist. Aarhus On Doob s Maximal Inequality for Brownian Motion S. E. GRAVERSEN and G. PESKIR If B = (B t ) t

More information

A Note on Massless Quantum Free Scalar Fields. with Negative Energy Density

A Note on Massless Quantum Free Scalar Fields. with Negative Energy Density Adv. Studies Theor. Phys., Vol. 7, 13, no. 1, 549 554 HIKARI Ltd, www.m-hikari.com A Note on Massless Quantum Free Scalar Fields with Negative Energy Density M. A. Grado-Caffaro and M. Grado-Caffaro Scientific

More information

Lecture 2: August 31

Lecture 2: August 31 0-704: Information Processing and Learning Fall 206 Lecturer: Aarti Singh Lecture 2: August 3 Note: These notes are based on scribed notes from Spring5 offering of this course. LaTeX template courtesy

More information

On split sample and randomized confidence intervals for binomial proportions

On split sample and randomized confidence intervals for binomial proportions On slit samle and randomized confidence intervals for binomial roortions Måns Thulin Deartment of Mathematics, Usala University arxiv:1402.6536v1 [stat.me] 26 Feb 2014 Abstract Slit samle methods have

More information

Towards understanding the Lorenz curve using the Uniform distribution. Chris J. Stephens. Newcastle City Council, Newcastle upon Tyne, UK

Towards understanding the Lorenz curve using the Uniform distribution. Chris J. Stephens. Newcastle City Council, Newcastle upon Tyne, UK Towards understanding the Lorenz curve using the Uniform distribution Chris J. Stehens Newcastle City Council, Newcastle uon Tyne, UK (For the Gini-Lorenz Conference, University of Siena, Italy, May 2005)

More information

On Isoperimetric Functions of Probability Measures Having Log-Concave Densities with Respect to the Standard Normal Law

On Isoperimetric Functions of Probability Measures Having Log-Concave Densities with Respect to the Standard Normal Law On Isoerimetric Functions of Probability Measures Having Log-Concave Densities with Resect to the Standard Normal Law Sergey G. Bobkov Abstract Isoerimetric inequalities are discussed for one-dimensional

More information

MATH 2710: NOTES FOR ANALYSIS

MATH 2710: NOTES FOR ANALYSIS MATH 270: NOTES FOR ANALYSIS The main ideas we will learn from analysis center around the idea of a limit. Limits occurs in several settings. We will start with finite limits of sequences, then cover infinite

More information

Econometrics I, Estimation

Econometrics I, Estimation Econometrics I, Estimation Department of Economics Stanford University September, 2008 Part I Parameter, Estimator, Estimate A parametric is a feature of the population. An estimator is a function of the

More information

simple if it completely specifies the density of x

simple if it completely specifies the density of x 3. Hypothesis Testing Pure significance tests Data x = (x 1,..., x n ) from f(x, θ) Hypothesis H 0 : restricts f(x, θ) Are the data consistent with H 0? H 0 is called the null hypothesis simple if it completely

More information

The following document is intended for online publication only (authors webpage).

The following document is intended for online publication only (authors webpage). The following document is intended for online ublication only (authors webage). Sulement to Identi cation and stimation of Distributional Imacts of Interventions Using Changes in Inequality Measures, Part

More information

is a Borel subset of S Θ for each c R (Bertsekas and Shreve, 1978, Proposition 7.36) This always holds in practical applications.

is a Borel subset of S Θ for each c R (Bertsekas and Shreve, 1978, Proposition 7.36) This always holds in practical applications. Stat 811 Lecture Notes The Wald Consistency Theorem Charles J. Geyer April 9, 01 1 Analyticity Assumptions Let { f θ : θ Θ } be a family of subprobability densities 1 with respect to a measure µ on a measurable

More information

16.2. Infinite Series. Introduction. Prerequisites. Learning Outcomes

16.2. Infinite Series. Introduction. Prerequisites. Learning Outcomes Infinite Series 6.2 Introduction We extend the concet of a finite series, met in Section 6., to the situation in which the number of terms increase without bound. We define what is meant by an infinite

More information

A construction of bent functions from plateaued functions

A construction of bent functions from plateaued functions A construction of bent functions from lateaued functions Ayça Çeşmelioğlu, Wilfried Meidl Sabancı University, MDBF, Orhanlı, 34956 Tuzla, İstanbul, Turkey. Abstract In this resentation, a technique for

More information

ANALYTIC NUMBER THEORY AND DIRICHLET S THEOREM

ANALYTIC NUMBER THEORY AND DIRICHLET S THEOREM ANALYTIC NUMBER THEORY AND DIRICHLET S THEOREM JOHN BINDER Abstract. In this aer, we rove Dirichlet s theorem that, given any air h, k with h, k) =, there are infinitely many rime numbers congruent to

More information

Topic 7: Using identity types

Topic 7: Using identity types Toic 7: Using identity tyes June 10, 2014 Now we would like to learn how to use identity tyes and how to do some actual mathematics with them. By now we have essentially introduced all inference rules

More information

Sharp gradient estimate and spectral rigidity for p-laplacian

Sharp gradient estimate and spectral rigidity for p-laplacian Shar gradient estimate and sectral rigidity for -Lalacian Chiung-Jue Anna Sung and Jiaing Wang To aear in ath. Research Letters. Abstract We derive a shar gradient estimate for ositive eigenfunctions of

More information

A Note on the Positive Nonoscillatory Solutions of the Difference Equation

A Note on the Positive Nonoscillatory Solutions of the Difference Equation Int. Journal of Math. Analysis, Vol. 4, 1, no. 36, 1787-1798 A Note on the Positive Nonoscillatory Solutions of the Difference Equation x n+1 = α c ix n i + x n k c ix n i ) Vu Van Khuong 1 and Mai Nam

More information

Best approximation by linear combinations of characteristic functions of half-spaces

Best approximation by linear combinations of characteristic functions of half-spaces Best aroximation by linear combinations of characteristic functions of half-saces Paul C. Kainen Deartment of Mathematics Georgetown University Washington, D.C. 20057-1233, USA Věra Kůrková Institute of

More information

Estimation of the large covariance matrix with two-step monotone missing data

Estimation of the large covariance matrix with two-step monotone missing data Estimation of the large covariance matrix with two-ste monotone missing data Masashi Hyodo, Nobumichi Shutoh 2, Takashi Seo, and Tatjana Pavlenko 3 Deartment of Mathematical Information Science, Tokyo

More information

PArtially observable Markov decision processes

PArtially observable Markov decision processes Solving Continuous-State POMDPs via Density Projection Enlu Zhou, Member, IEEE, Michael C. Fu, Fellow, IEEE, and Steven I. Marcus, Fellow, IEEE Abstract Research on numerical solution methods for artially

More information

Quantitative estimates of propagation of chaos for stochastic systems with W 1, kernels

Quantitative estimates of propagation of chaos for stochastic systems with W 1, kernels oname manuscrit o. will be inserted by the editor) Quantitative estimates of roagation of chaos for stochastic systems with W, kernels Pierre-Emmanuel Jabin Zhenfu Wang Received: date / Acceted: date Abstract

More information

Convex Optimization methods for Computing Channel Capacity

Convex Optimization methods for Computing Channel Capacity Convex Otimization methods for Comuting Channel Caacity Abhishek Sinha Laboratory for Information and Decision Systems (LIDS), MIT sinhaa@mit.edu May 15, 2014 We consider a classical comutational roblem

More information

Applications of the course to Number Theory

Applications of the course to Number Theory Alications of the course to Number Theory Rational Aroximations Theorem (Dirichlet) If ξ is real and irrational then there are infinitely many distinct rational numbers /q such that ξ q < q. () 2 Proof

More information

f(x θ)dx with respect to θ. Assuming certain smoothness conditions concern differentiating under the integral the integral sign, we first obtain

f(x θ)dx with respect to θ. Assuming certain smoothness conditions concern differentiating under the integral the integral sign, we first obtain 0.1. INTRODUCTION 1 0.1 Introduction R. A. Fisher, a pioneer in the development of mathematical statistics, introduced a measure of the amount of information contained in an observaton from f(x θ). Fisher

More information

A note on the random greedy triangle-packing algorithm

A note on the random greedy triangle-packing algorithm A note on the random greedy triangle-acking algorithm Tom Bohman Alan Frieze Eyal Lubetzky Abstract The random greedy algorithm for constructing a large artial Steiner-Trile-System is defined as follows.

More information

t 0 Xt sup X t p c p inf t 0

t 0 Xt sup X t p c p inf t 0 SHARP MAXIMAL L -ESTIMATES FOR MARTINGALES RODRIGO BAÑUELOS AND ADAM OSȨKOWSKI ABSTRACT. Let X be a suermartingale starting from 0 which has only nonnegative jums. For each 0 < < we determine the best

More information

Inference for Empirical Wasserstein Distances on Finite Spaces: Supplementary Material

Inference for Empirical Wasserstein Distances on Finite Spaces: Supplementary Material Inference for Emirical Wasserstein Distances on Finite Saces: Sulementary Material Max Sommerfeld Axel Munk Keywords: otimal transort, Wasserstein distance, central limit theorem, directional Hadamard

More information

Lecture 6 : Dimensionality Reduction

Lecture 6 : Dimensionality Reduction CPS290: Algorithmic Founations of Data Science February 3, 207 Lecture 6 : Dimensionality Reuction Lecturer: Kamesh Munagala Scribe: Kamesh Munagala In this lecture, we will consier the roblem of maing

More information

PETER J. GRABNER AND ARNOLD KNOPFMACHER

PETER J. GRABNER AND ARNOLD KNOPFMACHER ARITHMETIC AND METRIC PROPERTIES OF -ADIC ENGEL SERIES EXPANSIONS PETER J. GRABNER AND ARNOLD KNOPFMACHER Abstract. We derive a characterization of rational numbers in terms of their unique -adic Engel

More information

arxiv: v3 [physics.data-an] 23 May 2011

arxiv: v3 [physics.data-an] 23 May 2011 Date: October, 8 arxiv:.7v [hysics.data-an] May -values for Model Evaluation F. Beaujean, A. Caldwell, D. Kollár, K. Kröninger Max-Planck-Institut für Physik, München, Germany CERN, Geneva, Switzerland

More information

ON UNIFORM BOUNDEDNESS OF DYADIC AVERAGING OPERATORS IN SPACES OF HARDY-SOBOLEV TYPE. 1. Introduction

ON UNIFORM BOUNDEDNESS OF DYADIC AVERAGING OPERATORS IN SPACES OF HARDY-SOBOLEV TYPE. 1. Introduction ON UNIFORM BOUNDEDNESS OF DYADIC AVERAGING OPERATORS IN SPACES OF HARDY-SOBOLEV TYPE GUSTAVO GARRIGÓS ANDREAS SEEGER TINO ULLRICH Abstract We give an alternative roof and a wavelet analog of recent results

More information

HAUSDORFF MEASURE OF p-cantor SETS

HAUSDORFF MEASURE OF p-cantor SETS Real Analysis Exchange Vol. 302), 2004/2005,. 20 C. Cabrelli, U. Molter, Deartamento de Matemática, Facultad de Cs. Exactas y Naturales, Universidad de Buenos Aires and CONICET, Pabellón I - Ciudad Universitaria,

More information

Commutators on l. D. Dosev and W. B. Johnson

Commutators on l. D. Dosev and W. B. Johnson Submitted exclusively to the London Mathematical Society doi:10.1112/0000/000000 Commutators on l D. Dosev and W. B. Johnson Abstract The oerators on l which are commutators are those not of the form λi

More information

ON THE ADVERSARIAL ROBUSTNESS OF ROBUST ESTIMATORS

ON THE ADVERSARIAL ROBUSTNESS OF ROBUST ESTIMATORS O THE ADVERSARIAL ROBUSTESS OF ROBUST ESTIMATORS BY ERHA BAYAKTAR,, LIFEG LAI, University of Michigan and University of California, Davis Motivated by recent data analytics alications, we study the adversarial

More information

On-Line Appendix. Matching on the Estimated Propensity Score (Abadie and Imbens, 2015)

On-Line Appendix. Matching on the Estimated Propensity Score (Abadie and Imbens, 2015) On-Line Aendix Matching on the Estimated Proensity Score Abadie and Imbens, 205 Alberto Abadie and Guido W. Imbens Current version: August 0, 205 The first art of this aendix contains additional roofs.

More information

Research Note REGRESSION ANALYSIS IN MARKOV CHAIN * A. Y. ALAMUTI AND M. R. MESHKANI **

Research Note REGRESSION ANALYSIS IN MARKOV CHAIN * A. Y. ALAMUTI AND M. R. MESHKANI ** Iranian Journal of Science & Technology, Transaction A, Vol 3, No A3 Printed in The Islamic Reublic of Iran, 26 Shiraz University Research Note REGRESSION ANALYSIS IN MARKOV HAIN * A Y ALAMUTI AND M R

More information

Measure and Integration: Solutions of CW2

Measure and Integration: Solutions of CW2 Measure and Integration: s of CW2 Fall 206 [G. Holzegel] December 9, 206 Problem of Sheet 5 a) Left (f n ) and (g n ) be sequences of integrable functions with f n (x) f (x) and g n (x) g (x) for almost

More information

MATH 250: THE DISTRIBUTION OF PRIMES. ζ(s) = n s,

MATH 250: THE DISTRIBUTION OF PRIMES. ζ(s) = n s, MATH 50: THE DISTRIBUTION OF PRIMES ROBERT J. LEMKE OLIVER For s R, define the function ζs) by. Euler s work on rimes ζs) = which converges if s > and diverges if s. In fact, though we will not exloit

More information

Consistency and asymptotic normality

Consistency and asymptotic normality Consistency an asymtotic normality Class notes for Econ 842 Robert e Jong March 2006 1 Stochastic convergence The asymtotic theory of minimization estimators relies on various theorems from mathematical

More information

EXISTENCE AND UNIQUENESS OF SOLUTIONS FOR NONLOCAL p-laplacian PROBLEMS

EXISTENCE AND UNIQUENESS OF SOLUTIONS FOR NONLOCAL p-laplacian PROBLEMS Electronic Journal of ifferential Equations, Vol. 2016 (2016), No. 274,. 1 9. ISSN: 1072-6691. URL: htt://ejde.math.txstate.edu or htt://ejde.math.unt.edu EXISTENCE AN UNIQUENESS OF SOLUTIONS FOR NONLOCAL

More information

COMMUNICATION BETWEEN SHAREHOLDERS 1

COMMUNICATION BETWEEN SHAREHOLDERS 1 COMMUNICATION BTWN SHARHOLDRS 1 A B. O A : A D Lemma B.1. U to µ Z r 2 σ2 Z + σ2 X 2r ω 2 an additive constant that does not deend on a or θ, the agents ayoffs can be written as: 2r rθa ω2 + θ µ Y rcov

More information

Notes, March 4, 2013, R. Dudley Maximum likelihood estimation: actual or supposed

Notes, March 4, 2013, R. Dudley Maximum likelihood estimation: actual or supposed 18.466 Notes, March 4, 2013, R. Dudley Maximum likelihood estimation: actual or supposed 1. MLEs in exponential families Let f(x,θ) for x X and θ Θ be a likelihood function, that is, for present purposes,

More information

Estimating Time-Series Models

Estimating Time-Series Models Estimating ime-series Models he Box-Jenkins methodology for tting a model to a scalar time series fx t g consists of ve stes:. Decide on the order of di erencing d that is needed to roduce a stationary

More information

Using the Divergence Information Criterion for the Determination of the Order of an Autoregressive Process

Using the Divergence Information Criterion for the Determination of the Order of an Autoregressive Process Using the Divergence Information Criterion for the Determination of the Order of an Autoregressive Process P. Mantalos a1, K. Mattheou b, A. Karagrigoriou b a.deartment of Statistics University of Lund

More information

Probabilistic Graphical Models

Probabilistic Graphical Models Parameter Estimation December 14, 2015 Overview 1 Motivation 2 3 4 What did we have so far? 1 Representations: how do we model the problem? (directed/undirected). 2 Inference: given a model and partially

More information

STK4900/ Lecture 7. Program

STK4900/ Lecture 7. Program STK4900/9900 - Lecture 7 Program 1. Logistic regression with one redictor 2. Maximum likelihood estimation 3. Logistic regression with several redictors 4. Deviance and likelihood ratio tests 5. A comment

More information

δ -method and M-estimation

δ -method and M-estimation Econ 2110, fall 2016, Part IVb Asymptotic Theory: δ -method and M-estimation Maximilian Kasy Department of Economics, Harvard University 1 / 40 Example Suppose we estimate the average effect of class size

More information

Estimating function analysis for a class of Tweedie regression models

Estimating function analysis for a class of Tweedie regression models Title Estimating function analysis for a class of Tweedie regression models Author Wagner Hugo Bonat Deartamento de Estatística - DEST, Laboratório de Estatística e Geoinformação - LEG, Universidade Federal

More information

WALD S EQUATION AND ASYMPTOTIC BIAS OF RANDOMLY STOPPED U-STATISTICS

WALD S EQUATION AND ASYMPTOTIC BIAS OF RANDOMLY STOPPED U-STATISTICS PROCEEDINGS OF HE AMERICAN MAHEMAICAL SOCIEY Volume 125, Number 3, March 1997, Pages 917 925 S 0002-9939(97)03574-0 WALD S EQUAION AND ASYMPOIC BIAS OF RANDOMLY SOPPED U-SAISICS VICOR H. DE LA PEÑA AND

More information

Economics 583: Econometric Theory I A Primer on Asymptotics

Economics 583: Econometric Theory I A Primer on Asymptotics Economics 583: Econometric Theory I A Primer on Asymptotics Eric Zivot January 14, 2013 The two main concepts in asymptotic theory that we will use are Consistency Asymptotic Normality Intuition consistency:

More information

NONLINEAR OPTIMIZATION WITH CONVEX CONSTRAINTS. The Goldstein-Levitin-Polyak algorithm

NONLINEAR OPTIMIZATION WITH CONVEX CONSTRAINTS. The Goldstein-Levitin-Polyak algorithm - (23) NLP - NONLINEAR OPTIMIZATION WITH CONVEX CONSTRAINTS The Goldstein-Levitin-Polya algorithm We consider an algorithm for solving the otimization roblem under convex constraints. Although the convexity

More information

Universal Finite Memory Coding of Binary Sequences

Universal Finite Memory Coding of Binary Sequences Deartment of Electrical Engineering Systems Universal Finite Memory Coding of Binary Sequences Thesis submitted towards the degree of Master of Science in Electrical and Electronic Engineering in Tel-Aviv

More information

Biostat Methods STAT 5500/6500 Handout #12: Methods and Issues in (Binary Response) Logistic Regression

Biostat Methods STAT 5500/6500 Handout #12: Methods and Issues in (Binary Response) Logistic Regression Biostat Methods STAT 5500/6500 Handout #12: Methods and Issues in (Binary Resonse) Logistic Regression Recall general χ 2 test setu: Y 0 1 Trt 0 a b Trt 1 c d I. Basic logistic regression Previously (Handout

More information

On Z p -norms of random vectors

On Z p -norms of random vectors On Z -norms of random vectors Rafa l Lata la Abstract To any n-dimensional random vector X we may associate its L -centroid body Z X and the corresonding norm. We formulate a conjecture concerning the

More information

Lecture 4 September 15

Lecture 4 September 15 IFT 6269: Probabilistic Graphical Models Fall 2017 Lecture 4 September 15 Lecturer: Simon Lacoste-Julien Scribe: Philippe Brouillard & Tristan Deleu 4.1 Maximum Likelihood principle Given a parametric

More information

arxiv: v1 [physics.data-an] 26 Oct 2012

arxiv: v1 [physics.data-an] 26 Oct 2012 Constraints on Yield Parameters in Extended Maximum Likelihood Fits Till Moritz Karbach a, Maximilian Schlu b a TU Dortmund, Germany, moritz.karbach@cern.ch b TU Dortmund, Germany, maximilian.schlu@cern.ch

More information

Statistics 612: L p spaces, metrics on spaces of probabilites, and connections to estimation

Statistics 612: L p spaces, metrics on spaces of probabilites, and connections to estimation Statistics 62: L p spaces, metrics on spaces of probabilites, and connections to estimation Moulinath Banerjee December 6, 2006 L p spaces and Hilbert spaces We first formally define L p spaces. Consider

More information