STA Object Data Analysis - A List of Projects. January 18, 2018

Similar documents
Lecture 20: Multivariate convergence and the Central Limit Theorem

Lecture 19: Convergence

Large Sample Theory. Convergence. Central Limit Theorems Asymptotic Distribution Delta Method. Convergence in Probability Convergence in Distribution

7.1 Convergence of sequences of random variables

Convergence of random variables. (telegram style notes) P.J.C. Spreij

7.1 Convergence of sequences of random variables

Chapter 3. Strong convergence. 3.1 Definition of almost sure convergence

Notes 19 : Martingale CLT

An Introduction to Asymptotic Theory

1 = δ2 (0, ), Y Y n nδ. , T n = Y Y n n. ( U n,k + X ) ( f U n,k + Y ) n 2n f U n,k + θ Y ) 2 E X1 2 X1

Distribution of Random Samples & Limit theorems

1 Convergence in Probability and the Weak Law of Large Numbers

Notes 5 : More on the a.s. convergence of sums

STAT Homework 1 - Solutions

Let us give one more example of MLE. Example 3. The uniform distribution U[0, θ] on the interval [0, θ] has p.d.f.

Probability 2 - Notes 10. Lemma. If X is a random variable and g(x) 0 for all x in the support of f X, then P(g(X) 1) E[g(X)].

LECTURE 8: ASYMPTOTICS I

Lecture 8: Convergence of transformations and law of large numbers

The Central Limit Theorem

Probability and Statistics

Econ 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chi-square Distribution, Student s t distribution 1.

This section is optional.

HOMEWORK I: PREREQUISITES FROM MATH 727

STATISTICAL METHODS FOR BUSINESS

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/15.070J Fall 2013 Lecture 21 11/27/2013

Notes 27 : Brownian motion: path properties

17. Joint distributions of extreme order statistics Lehmann 5.1; Ferguson 15

Parameter, Statistic and Random Samples

This exam contains 19 pages (including this cover page) and 10 questions. A Formulae sheet is provided with the exam.

2.2. Central limit theorem.

Mathematical Statistics - MS

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 19 11/17/2008 LAWS OF LARGE NUMBERS II THE STRONG LAW OF LARGE NUMBERS

January 25, 2017 INTRODUCTION TO MATHEMATICAL STATISTICS

Mathematics 170B Selected HW Solutions.

2.1. Convergence in distribution and characteristic functions.

Probability and Random Processes

Random Variables, Sampling and Estimation

ECE 330:541, Stochastic Signals and Systems Lecture Notes on Limit Theorems from Probability Fall 2002

4. Partial Sums and the Central Limit Theorem

Definition 4.2. (a) A sequence {x n } in a Banach space X is a basis for X if. unique scalars a n (x) such that x = n. a n (x) x n. (4.

Introduction to Probability. Ariel Yadin

Lecture 3 : Random variables and their distributions

32 estimating the cumulative distribution function

Sieve Estimators: Consistency and Rates of Convergence

ECONOMETRIC THEORY. MODULE XIII Lecture - 34 Asymptotic Theory and Stochastic Regressors

Complete Convergence for Asymptotically Almost Negatively Associated Random Variables

Bull. Korean Math. Soc. 36 (1999), No. 3, pp. 451{457 THE STRONG CONSISTENCY OF NONLINEAR REGRESSION QUANTILES ESTIMATORS Seung Hoe Choi and Hae Kyung

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators

Solutions of Homework 2.

Lecture Chapter 6: Convergence of Random Sequences

Entropy Rates and Asymptotic Equipartition

Law of the sum of Bernoulli random variables

Topic 9: Sampling Distributions of Estimators

Lecture 01: the Central Limit Theorem. 1 Central Limit Theorem for i.i.d. random variables

Lecture 15: Density estimation

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/15.070J Fall 2013 Lecture 6 9/23/2013. Brownian motion. Introduction

Kernel density estimator

The variance of a sum of independent variables is the sum of their variances, since covariances are zero. Therefore. V (xi )= n n 2 σ2 = σ2.

Summary and Discussion on Simultaneous Analysis of Lasso and Dantzig Selector

EE 4TM4: Digital Communications II Probability Theory

Asymptotic Results for the Linear Regression Model

Singular Continuous Measures by Michael Pejic 5/14/10

Lecture 3 The Lebesgue Integral

Advanced Analysis. Min Yan Department of Mathematics Hong Kong University of Science and Technology

Advanced Stochastic Processes.

A RANK STATISTIC FOR NON-PARAMETRIC K-SAMPLE AND CHANGE POINT PROBLEMS

Unbiased Estimation. February 7-12, 2008

CEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering

MA Advanced Econometrics: Properties of Least Squares Estimators

An alternative proof of a theorem of Aldous. concerning convergence in distribution for martingales.

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/15.070J Fall 2013 Lecture 2 9/9/2013. Large Deviations for i.i.d. Random Variables

Lecture 33: Bootstrap

Solution to Chapter 2 Analytical Exercises

62. Power series Definition 16. (Power series) Given a sequence {c n }, the series. c n x n = c 0 + c 1 x + c 2 x 2 + c 3 x 3 +

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4

Basics of Probability Theory (for Theory of Computation courses)

Estimation of the essential supremum of a regression function

Appendix to: Hypothesis Testing for Multiple Mean and Correlation Curves with Functional Data

REAL ANALYSIS II: PROBLEM SET 1 - SOLUTIONS

Precise Rates in Complete Moment Convergence for Negatively Associated Sequences

Entropy and Ergodic Theory Lecture 5: Joint typicality and conditional AEP

It is always the case that unions, intersections, complements, and set differences are preserved by the inverse image of a function.

IIT JAM Mathematical Statistics (MS) 2006 SECTION A

Central Limit Theorem using Characteristic functions

Probability for mathematicians INDEPENDENCE TAU

First Year Quantitative Comp Exam Spring, Part I - 203A. f X (x) = 0 otherwise

STAT Homework 2 - Solutions

MATH 413 FINAL EXAM. f(x) f(y) M x y. x + 1 n

Detailed proofs of Propositions 3.1 and 3.2

Exercise 4.3 Use the Continuity Theorem to prove the Cramér-Wold Theorem, Theorem. (1) φ a X(1).

ki, X(n) lj (n) = (ϱ (n) ij ) 1 i,j d.

Joint Probability Distributions and Random Samples. Jointly Distributed Random Variables. Chapter { }

lim za n n = z lim a n n.

Web-based Supplementary Materials for A Modified Partial Likelihood Score Method for Cox Regression with Covariate Error Under the Internal

MAXIMAL INEQUALITIES AND STRONG LAW OF LARGE NUMBERS FOR AANA SEQUENCES

Chapter 5. Inequalities. 5.1 The Markov and Chebyshev inequalities

Lecture 15: Learning Theory: Concentration Inequalities

Discrete Mathematics for CS Spring 2008 David Wagner Note 22

ST5215: Advanced Statistical Theory

Transcription:

STA 6557 Jauary 8, 208 Object Data Aalysis - A List of Projects. Schoeberg Mea glaucomatous shape chages of the Optic Nerve Head regio i aimal models 2. Aalysis of VW- Kedall ati-mea shapes with a applicatio to medical imagig - Yufa Wag 3. Distributios of Trajectories of Objects i the Solar System. See Ch. data ad (https://e.wikipedia.org/wiki/list of tras-neptuia objects) 4. VW regressio for projective shapes of shells of Acrosterigma magum mollusk i the N Florida coastal waters. 5. 3D Projective Shape Aalysis i Face Recogitio from Digital Camera Images - Seug-Hee Choi. 6. Topological Data Aalysis 7. Procipal Compoets o Object Spaces

2 REVIEW ( BUILDING THE NECESSARY MATH STAT LANGUAGE ) We begi by reviewig a list of simple defiitios ad otatios for otios that are at the basis of statistical iferece, such as sample space, evets, σ-algebra, probability measure probability space radom variable radom vector mea vector, covariace matrix characteristic fuctio 2. The Multivariate Normal Distributio REMARK 2.. The multivariate ormal distributios play a cetral role i Large Sample Theory, the key to Noparametric Data Aalysis (see Bhattacharya ad Rao (976) or Tsybakov(2009)).. A r.vec. X : Ω R p has a multivariate ormal distributio if a R p the r.v. a T X has a ormal distributio. Assume X is as i ad E(X) = µ, Cov(X) = Σ, ( otatio X N p (µ, Σ) ). The characteristic fuctio of X is φ X (t) = e itt µ 2 tt Σt. If X N p (µ, Σ), where Σ is positive defiite the the joit p.d.f. of X is give ϕ X (x) = (2π) p/2 Σ /2 exp( 2 (x µ)t Σ (x µ)) () 2

If Z = (Z,..., Z p ) N p (0, I p ), the the margials Z,..., Z p are idepedet. X N p (µ, Σ), where Σ is positive defiite the (X µ) T Σ (X µ) has a χ 2 p distributio. 2.2 Covergece i Distributio ad The Cetral Limit Theorem. The mai referece for this sectio is Ferguso (996). 2.3 Covergece i distributio ( weak covergece ) DEFINITION 2.. A sequece of radom vectors Y, N, coverges i distributio (or i law) to a radom vector Y, if the sequece of joit c.d.f. s F Y (y) coverges to F Y (y) at all vectors y where F Y is cotiuous. Notatio Y d Y, or Y L Y. The followig is a combiatio of the Helly-Bray theorem with the cotiuity theorem, ad is take straight from Ferguso (996), icludig the proof. THEOREM 2.. Assume X, N, is a sequece of radom vectors i R p. The followig are equivalet: a. X L X. b. Ef(X ) Ef(X) = R p fdp X as for all real-valued cotiuous fuctios f o R p, with compact support. c. X coverges weakly to X, that is Ef(X ) Ef(X) as (2) for all bouded real-valued cotiuous fuctios f o R p. d. Ef(X ) Ef(X) as for all measurable fuctios f o R p, such that X takes values o the set of poits of cotiuity of f with probability. e. The sequece of characteristic fuctios of X coverges to the characteristic fuctio of X. 3

2.4 The Cetral Limit Theorem This sectio is from Bhattacharya et al.(206)(see STA5334). The most importat covergece theorem i law is the followig theorem, abbreviated as CLT. THEOREM 2.2 (Cetral Limit Theorem). If Y is a sequece of i.i.d. radom vectors i R p with commo mea vector zero ad a fiite commo covariace matrix Σ, the 2 (Y + + Y ) L N p (0, Σ). (3) that For a proof of the CLT, we cosider the sequece X = Ȳ = 2 (Y + + Y ). Note φ X (t) = E(exp(i 2 t T X j )) = (E(exp(i 2 t T X ))) = (φ X ( 2 t)) j= (4) Let σ 2 (t) = E((t T X ) 2 ). From equatio (4) it follows that lim φ X (t) = exp( σ2 (t) ). (5) 2 The right had side of equatio (5) is φ U (), where U N (0, σ 2 (t)). Fially ote that if X N d (0, Σ), the t T X N(0, σ 2 (t)), ad from Theorem?? it follows that X L X, doe. We will sometimes use the alterative otatio Φ 0,Σ for N p (0, Σ), ad deote the correspodig distributio fuctio by Φ 0,Σ (x), ad, wheever Σ is positive defiite, the joit desity fuctio by φ 0,Σ (x) = (2π)p Σ e 2 xt Σ x. The CLT says that if Σ is positive defiite, the P( 2 (Y + + Y ) x) Φ 0,Σ (x) = (2π)p Σ... e 2 yt Σ y dy (6) y x 4

for all x. A old result of Polya implies that actually the covergece i (6) is uiform over all x: sup P( 2 (Y + + Y ) x) Φ 0,Σ (x) 0 as. (7) x A immediate applicatio of the CLT is that the r th sample momet (X r + + X r )/ of a uivariate distributio is asymptotically ormal if EX 2r <. (Here r is a positive iteger). The italicized statemet meas: ( ) Xi r EX r L N(0, σ 2 ), (8) i= where σ 2 = var(x r ) = EX 2r (EX r ) 2. REMARK 2.2. The LLN shows i particular that for a sequece of i.i.d. radom variables Z, i= Z i µ (= EZ ), i.e., the differece betwee the two sides i goes to zero as. Uder the additioal assumptio EZ 2 <, Chebyshev s iequality stregthes this approximatio by showig that the differece is of the order of 2 i probability: i= This meas that, give ε < 0 there exists A > 0 such that P( Z i µ = O p ( ). (9) Z i µ > A ) < ε (0) i= for all sufficietly large. The CLT is of course a more precise statemet tha (9). I particular, it provides a computatio of the left side of (0) as ( 2Φ 0, A ) + δ = 2 A/σ e y2 /2 dy + δ, () σ 2π where δ 0 as (uiformly w.r.t. A).] A useful extesio of the CLT i the uivariate case is 5

THEOREM 2.3. (Lideberg s CLT). Let X, X 2,... be idepedet radom variables with distributio fuctios F, F 2,..., respectively, such that EX = µ ad Var X = σ 2 <, with at least oe σ > 0. Let S = X + + X ad s = Var(S ) = σ 2 + + σ. 2 The the ormalized partial sums S ES s coverge i distributio to a radom variable with ormal distributio N(0, ) (i.e. the ormal covergece holds,) if the followig Lideberg coditio is satisfied: ε > 0, lim (x µ s 2 k ) 2 df k (x) = 0. x µ k >εs k= PROBLEM 2.. Assume we have k sequeces U (i) P a i ( i k) ad a fuctio g of k vector variables cotiuous at (a, a 2,..., a k ). Show that g(u (),..., U (k) ) coverges i probability to (a, a 2,..., a k ). PROBLEM 2.2. Prove that the sample covariace matrix is a cosistet estimator of the covariace matrix. PROBLEM 2.3. Let X ( ) be a sequece of i.i.d. radom variables, ad assume that the ifimum ad supremum of values of X are m ad M, respectively. That is, P(m X M) =, P(X < a) > 0 a > m, P(X > b) > 0 b < M (Here X real-valued, but m ad/or M may be ifiite). Prove that max{x,..., X } P M ad mi{x,..., X } P m. PROBLEM 2.4. Prove that if Y coverges i distributio to a costat c (i.e., the distributio of Y coverges to the Dirac measure δ {c} ), the Y P c. PROBLEM 2.5. Let P ( ), P be probability measures o (R, B ) such that P coverges weakly to P. (a) Give a example to show that P (B) eed ot coverge to P(B) for all Borel sets B. (b) Give a example to show that the distributio fuctio F of P may ot coverge to the distributio fuctio F of P at every poit x. 6