LECTURE 8: ASYMPTOTICS I

Similar documents
This section is optional.

Lecture 19: Convergence

Distribution of Random Samples & Limit theorems

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 19 11/17/2008 LAWS OF LARGE NUMBERS II THE STRONG LAW OF LARGE NUMBERS

7.1 Convergence of sequences of random variables

7.1 Convergence of sequences of random variables

ECONOMETRIC THEORY. MODULE XIII Lecture - 34 Asymptotic Theory and Stochastic Regressors

Lecture 20: Multivariate convergence and the Central Limit Theorem

Lecture 8: Convergence of transformations and law of large numbers

This exam contains 19 pages (including this cover page) and 10 questions. A Formulae sheet is provided with the exam.

Convergence of random variables. (telegram style notes) P.J.C. Spreij

Probability 2 - Notes 10. Lemma. If X is a random variable and g(x) 0 for all x in the support of f X, then P(g(X) 1) E[g(X)].

2.2. Central limit theorem.

Large Sample Theory. Convergence. Central Limit Theorems Asymptotic Distribution Delta Method. Convergence in Probability Convergence in Distribution

SDS 321: Introduction to Probability and Statistics

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/15.070J Fall 2013 Lecture 21 11/27/2013

MIT Spring 2016

An Introduction to Asymptotic Theory

1 Convergence in Probability and the Weak Law of Large Numbers

EE 4TM4: Digital Communications II Probability Theory

Probability and Statistics

Parameter, Statistic and Random Samples

Introduction to Probability. Ariel Yadin

Advanced Stochastic Processes.

Mathematics 170B Selected HW Solutions.

MA Advanced Econometrics: Properties of Least Squares Estimators

Sequences and Series of Functions

Notes 19 : Martingale CLT

1 = δ2 (0, ), Y Y n nδ. , T n = Y Y n n. ( U n,k + X ) ( f U n,k + Y ) n 2n f U n,k + θ Y ) 2 E X1 2 X1

Chapter 3. Strong convergence. 3.1 Definition of almost sure convergence

32 estimating the cumulative distribution function

Let us give one more example of MLE. Example 3. The uniform distribution U[0, θ] on the interval [0, θ] has p.d.f.

Asymptotic distribution of products of sums of independent random variables

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4

Kernel density estimator

ST5215: Advanced Statistical Theory

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/15.070J Fall 2013 Lecture 6 9/23/2013. Brownian motion. Introduction

Central limit theorem and almost sure central limit theorem for the product of some partial sums

ECE 330:541, Stochastic Signals and Systems Lecture Notes on Limit Theorems from Probability Fall 2002

Elements of Statistical Methods Lots of Data or Large Samples (Ch 8)

1 Introduction to reducing variance in Monte Carlo simulations

Econ 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chi-square Distribution, Student s t distribution 1.

4. Partial Sums and the Central Limit Theorem

Math 525: Lecture 5. January 18, 2018

Lecture 01: the Central Limit Theorem. 1 Central Limit Theorem for i.i.d. random variables

Lecture 7: Properties of Random Samples

Asymptotic Results for the Linear Regression Model

Economics 326 Methods of Empirical Research in Economics. Lecture 18: The asymptotic variance of OLS and heteroskedasticity

1+x 1 + α+x. x = 2(α x2 ) 1+x

STA Object Data Analysis - A List of Projects. January 18, 2018

Central Limit Theorem using Characteristic functions

Lecture Chapter 6: Convergence of Random Sequences

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/15.070J Fall 2013 Lecture 3 9/11/2013. Large deviations Theory. Cramér s Theorem

Chapter 2 The Monte Carlo Method

Lecture 15: Density estimation

The variance of a sum of independent variables is the sum of their variances, since covariances are zero. Therefore. V (xi )= n n 2 σ2 = σ2.

TAMS24: Notations and Formulas

Expectation and Variance of a random variable

5. Limit Theorems, Part II: Central Limit Theorem. ECE 302 Fall 2009 TR 3 4:15pm Purdue University, School of ECE Prof.

Exercise 4.3 Use the Continuity Theorem to prove the Cramér-Wold Theorem, Theorem. (1) φ a X(1).

AMS570 Lecture Notes #2

STAT Homework 1 - Solutions

Limit Theorems. Convergence in Probability. Let X be the number of heads observed in n tosses. Then, E[X] = np and Var[X] = np(1-p).

Problems from 9th edition of Probability and Statistical Inference by Hogg, Tanis and Zimmerman:

Discrete Mathematics for CS Spring 2008 David Wagner Note 22

MATH301 Real Analysis (2008 Fall) Tutorial Note #7. k=1 f k (x) converges pointwise to S(x) on E if and

Topic 9: Sampling Distributions of Estimators

We are mainly going to be concerned with power series in x, such as. (x)} converges - that is, lims N n

17. Joint distributions of extreme order statistics Lehmann 5.1; Ferguson 15

Topic 9: Sampling Distributions of Estimators

Notes 27 : Brownian motion: path properties

Precise Rates in Complete Moment Convergence for Negatively Associated Sequences

Measure and Measurable Functions

1.3 Convergence Theorems of Fourier Series. k k k k. N N k 1. With this in mind, we state (without proof) the convergence of Fourier series.

Solution to Chapter 2 Analytical Exercises

lim za n n = z lim a n n.

2.4.2 A Theorem About Absolutely Convergent Series

Lecture 27: Optimal Estimators and Functional Delta Method

MAT1026 Calculus II Basic Convergence Tests for Series

1.010 Uncertainty in Engineering Fall 2008

Lecture 4. We also define the set of possible values for the random walk as the set of all x R d such that P(S n = x) > 0 for some n.

January 25, 2017 INTRODUCTION TO MATHEMATICAL STATISTICS

Stat 421-SP2012 Interval Estimation Section

Binomial Distribution

Regression with an Evaporating Logarithmic Trend

EFFECTIVE WLLN, SLLN, AND CLT IN STATISTICAL MODELS

Solution. 1 Solutions of Homework 1. Sangchul Lee. October 27, Problem 1.1

Parameter, Statistic and Random Samples

2 Banach spaces and Hilbert spaces

Fall 2013 MTH431/531 Real analysis Section Notes

Economics 620, Lecture 8: Asymptotics I

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/15.070J Fall 2013 Lecture 2 9/9/2013. Large Deviations for i.i.d. Random Variables

n outcome is (+1,+1, 1,..., 1). Let the r.v. X denote our position (relative to our starting point 0) after n moves. Thus X = X 1 + X 2 + +X n,

HOMEWORK I: PREREQUISITES FROM MATH 727

Exponential Families and Bayesian Inference

SOME THEORY AND PRACTICE OF STATISTICS by Howard G. Tucker

Lecture 33: Bootstrap

Lecture 3. Properties of Summary Statistics: Sampling Distribution

Mathematical Statistics - MS

The standard deviation of the mean

Transcription:

LECTURE 8: ASYMPTOTICS I We are iterested i the properties of estimators as. Cosider a sequece of radom variables {, X 1}. N. M. Kiefer, Corell Uiversity, Ecoomics 60 1

Defiitio: (Weak covergece) A sequece of radom variables {X, 1}is said to coverge weakly to a costat c if lim P( X c > ε) 0 P for every give ε > 0. This is writte plim X c or X P C ad is also called covergece i probability. N. M. Kiefer, Corell Uiversity, Ecoomics 60

Defiitio: (Strog covergece) A sequece of radom variables is said to coverge strogly to a costat c if or P(lim X c) 1. lim P(sup x c > ε) N > N 0 Strog covergece is also called almost sure covergece or covergece with probability oe ad is writte X C w.p.1 or X C. N. M. Kiefer, Corell Uiversity, Ecoomics 60 3

LAWS OF LARGE NUMBERS: Let {X, 1}be observatios ad suppose we look at the sequece X i 1 X i /. whe does ξ where ξ is some parameter? Weak Law of Large Numbers: (WLLN) Let E(X i ) µ, V(X i ) σ, cov(x i X j ) 0. The - µ 0 i probability N. M. Kiefer, Corell Uiversity, Ecoomics 60 4

P Proof: Recall Chebyshev's iequality: ( X E(X) µ k ) σ / k µ ad V(X) where σ Proof of Chebyshev s iequality σ + ( x µ ) df µ λσ µ + λσ ( x µ ) ( x µ ) df df + µ + λσ µ λσ ( x µ ) df Put i the smallest value of x i the first ad last itegral, ad drop the middle to get:. σ λ σ P( x µ λσ ) N. M. Kiefer, Corell Uiversity, Ecoomics 60 5

Sice we are iterested i X N, ote that E(X N ) µ ad V (X N ) σ /. Cosequetly, lim P ( X µ > ε ) lim σ / ε 0. N. M. Kiefer, Corell Uiversity, Ecoomics 60 6

Notes: 1. E(Xi) µi is O.K. Cosider -1 X - µ with µ µ i.. V(Xi) σ i is O.K.. As log as lim σ i / 0, our proof applies. 3. Existece of σ ca be dropped if we assume idepedet ad idetically distributed observatios. (I this case, the proof is differet.) Strog Law of Large Numbers: (SLLN) Xi are idepedet with E(Xi) µ, V(X ) σ ad i i i σ / i <. The X µ 0 i almost surely (a.s.). N. M. Kiefer, Corell Uiversity, Ecoomics 60 7

Note: Agai, we ca drop assumptio o the existece of σ i if we assume idepedet ad idetically distributed observatios. Some properties of plim: 1. plim XY plim X plim Y. plim (X+Y) plim X + plim Y 3. Slutsky's theorem: If the fuctio g is cotiuous at plim X, the plim g(x) g(plim X). N. M. Kiefer, Corell Uiversity, Ecoomics 60 8

CENTRAL LIMIT THEOREM: (Asymptotic Normality) Defiitio: The momet geeratig fuctio is defied as mt () E(e tx ) tx e f( xdx ). The ame comes from the fact that r dm tx r r x efxdx () E(X ) whe r - dt evaluated at t 0. N. M. Kiefer, Corell Uiversity, Ecoomics 60 9

Note the followig series expasio: tx 1 mt ( ) E(e ) E(1 + Xt + ( Xt) +...)! 1 + α t + 1 α 1 t +... where α r EX r (For example: α µ α µ +, σ ). 1 1 Property 1: The momet geeratig fuctio of i 1 X i whe Xi are idepedet is the product of the momet geeratig fuctios of Xi. (Exercise: Prove this.) Property : Let X ad Y be radom variables with cotiuous desities f(x) ad g(y). If the momet geeratig fuctio of X is equal to that of Y i a iterval -h < t < h, the f g. N. M. Kiefer, Corell Uiversity, Ecoomics 60 10

Example: The momet geeratig fuctio for X ~ q(0,1) 1 1 x tx tx mt () E(e ) e e dx π 1 t e ( ) / 1 x t e dx? π Uses of Asymptotic Distributios: Suppose X - µ 0 i probability. (What ca be said about the distributio of X - µ?) I order to get distributio theory, we eed to orm the radom variable; we usually look at 1/ ( X - µ). N. M. Kiefer, Corell Uiversity, Ecoomics 60 11

Note that the radom variable sequece {1/(X - µ), 1}does ot coverge i probability. (why ot?) We might be able to make probability statemets like 1 / lim P ( ( X µ ) < z) F(z) for some distributio F. The we could use F as a approximate distributio for 1/ (X - µ). This, of course, implies a approximate distributio for X It is a little easier to work with Y 1/ (X - µ)/σ. N. M. Kiefer, Corell Uiversity, Ecoomics 60 1

Cetral Limit Theorem: (CLT) (Lidberg- Levy) The distributio of Y (as defied above) as is Proof: Let m z Φ(z) 1 x e π (stadard ormal) X i µ / dx () t be the momet geeratig fuctio of (Xi - µ). That is, m X i µ σ t () t 1 + + o(t ) where o(t ) is the remaider term such that o(t )/ t 0 as t 0. N. M. Kiefer, Corell Uiversity, Ecoomics 60 13

N. M. Kiefer, Corell Uiversity, Ecoomics 60 14 We kow that Y X X i i1 ( ) ( ). µ σ µ σ The momet geeratig fuctio of Y is m t t Y () m 1 + t + o t X i µ σ l m l 1 + t + o t l 1 + t Y () t

As, m Y (t) e t / which is the momet geeratig fuctio of a stadard ormal radom variable. Poit of the Cetral Limit Theorem: The distributio fuctio of X for large ca be approximated by that of a ormal with mea µ ad variace σ /. Qualificatio: We really should use characteristic fuctio C(t) Ee itx i the proof. N. M. Kiefer, Corell Uiversity, Ecoomics 60 15

Notes: 1. Idetical meas ad variaces ca be dropped straightforwardly. We eed some restrictios o the variace sequece though. I this case, we work with ( X µ ) i1 i i Y. 1/ ( ) i σ 1 i. Versios of the Cetral Limit Theorem with radom vectors are also available. 3. The basic requiremets is that each term i the sum should make a egligible cotributio. N. M. Kiefer, Corell Uiversity, Ecoomics 60 16

Examples: 1. Estimatio of mea µ from a sample of ormal radom variables: I this case, we estimate µ by X, ad the asymptotic approximatio for the distributio of X or (X µ) is exact.. Cosider 1/ ( β $ β) where β$ is the LS esimator. 1 / 1 ( $ 1/ β β) ( X X) X ε -1 1 / [X X / ] [ X ε / ] Where [ XX / ] is the sample secod momet matrix of the regressors. N. M. Kiefer, Corell Uiversity, Ecoomics 60 17

Uder the assumptio that regressors are well-behaved (i.e. cotributio of ay particular observatio to [X ε/] is egligible), we ca apply a Cetral Limit Theorem ad coclude that 1 / ( β $ β) [X X/] -1 1 / [ X ε/ ] D q (, 0 σ Q 1 ). N. M. Kiefer, Corell Uiversity, Ecoomics 60 18

Defiitio: (Covergece i distributio) : A sequece of radom variables {Z, 1}with distributio fuctios {F(z) P(Z z), 1} is said to coverge i distributio to a radom variable Z with distributio fuctio F(z) if ad oly if lim F(z) F(z) at all poits of cotiuity of F(z). D Notatio: Z Z. N. M. Kiefer, Corell Uiversity, Ecoomics 60 19

Some properties of covergece i probability (plim) ad covergece i distributio: 1. X ad Y are radom variable sequeces. If plim (X - Y) 0 ad Y D Y, the X D Y as well.. If Y D Y ad X c i probability (i.e. plim X c), the a. X + Y c + Y b. XY D D cy c. Y / X Y / c, c 0. D 3. If X D X ad g is a cotiuous fuctio, the g(x) D g(x). N. M. Kiefer, Corell Uiversity, Ecoomics 60 0