Week 6. Intuition: Let p (x, y) be the joint density of (X, Y ) and p (x) be the marginal density of X. Then. h dy =
|
|
- Bethanie Harrell
- 5 years ago
- Views:
Transcription
1 Week 6 Lecture Kerel regressio ad miimax rates Model: Observe Y i = f (X i ) + ξ i were ξ i, i =,,,, iid wit Eξ i = 0 We ofte assume X i are iid or X i = i/ Nadaraya Watso estimator Let w i (x) = K ( X i x) Te Nadaraya-Watso estimator is ˆf NW (x) = { P i= Yiwi P i= wi, i= w i 0 0, oterwise Ituitio: Let p (x, y) be te joit desity of (X, Y ) ad p (x) be te margial desity of X Te yp (x, y) dy yˆp (x, y) dy f (x) = E (Y X = x) = p (x) ˆp (x) = i= K ( ) X i x ( ) yk Yi y dy i= K ( X i x) i= K ( X i x) [ ( ) Yi K Yi y dy ( ) ] Y i y K Yi y dy = i= K ( ) X i x i= = Y ik ( ) X i x i= K ( X i x) were te last idetity follows from te fact tat K (y) = ad yk (y) dy = 0 Remark: Te Nadaraya-Watso estimator is a solutio of te followig miimizatio problem ˆf NW (x) = arg mi θ (Y i θ) w i (x) Startig from tis equatio we may itroduce te local polyomial metod i= Rate of covergece Assume tat F = { f, sup x f (m) (x) } M, 0 < ɛ p (x) M ad Eξi < We may sow ( if sup E f (x) f (x)) C m/(m+) bf f F
2 Upper boud: But to simplify our calculatios we may assume X i are iid U (0, ) Let ˆf NW (x) = ( ) Xi x Y i K Bias: i= ( ) NW E ˆf (x) f (x) = t x f (t) K f (x) C m Variace: ( ) V ar ˆf NW (x) = ( ( )) V ar Y Xi x ik [ ( )] E Y Xi x ik [ E ( f (X i ) + ξ i ) K ( Xi x )] Te sup f F E ( ˆf NW Questios: ) (x) f (x) C m/(m+) Wat about lower boud? Similarly to desity estimatio case Adaptive estimatio? Estimatio uder sup-orm? Estimatio at a poit? Adaptive estimatio? Very similar results to desity estimatio! Asymptotic equivalece Geeral Teory: Le Cam (986) Too ard to read!? Desity estimatio Poisso E : Y, Y,, Y iid wit desity f (y), y [0, ] F : Y, Y,, Y N iid wit desity f (y), N P oisso (), y [0, ] Gaussia wite oise G : dy (t) = f (t) dt + ɛdw (t), ɛ = /
3 Gaussia regressio Spectral desity estimatio H : y i = f (i/) + z i, z i N (0, ) I : Y, Y,, Y, a statioary cetered Gaussia sequece wit spectral desity f ad more models: expoetial family regressio, geeral locatio models Brow ad Low (996): G ad H are asymptotically equivalet uder te assumptio f Hölder(α, M) wit α > / Nussbaum (996), Brow, Carter, Low ad Zag (004): E ad H (replacig f by f, ad ɛ by ɛ) are asymptotically equivalet uder te assumptio f Hölder(α, M) wit α > / ad f is bouded away from 0 3 Low ad Zou (005): E ad F are asymptotically equivalet uder te assumptio f a compact subset of C([0, ] d ) 4 Golubev, Nussbaum ad Zou (009): I ad H (replacig f by log f) are asymptotically equivalet uder te assumptio f Hölder(α, M) wit α > / ad f is bouded away from 0 5 Grama ad Nussbaum (998, 00): more geeral models 3
4 Lecture Gaussioizatio Desity estimatio ad Gaussia regressio Variace stabilizig trasformatios See Hoyle (973) for a review of te extesive literature See also Efro (98) ad Bar-Lev ad Eis (990) For Poisso distributios Bartlett (936) was te first to propose te root trasform X i a omoscedastic liear model were X Poisso(λ) Ascombe (948) proposed improvig te variace stabilizig properties by istead usig X Te costat 3 8 is cose to optimally stabilize te variace usig te Taylor expasio Ascombe s variace stabilizig trasformatio as also bee briefly discussed i Dooo (993) for desity estimatio I te cotext of oparametric desity estimatio, i compariso to variace stabilizatio, mea matcig is more importat A mea-matcig root trasform is eeded for miimizig te bias as well as stabilizig te variace Te goal of mea matcig is to coose a costat c so tat te mea of X + c is closest to λ Te followig lemma gives te expasios of te mea ad variace of root trasform of te form X + c were c is a costat It ca be see easily tat c = 4 is te optimal coice for miimizig te bias E( X + c) λ i te first order Lemma Let X Poisso(λ) wit λ > 0 ad let c 0 be a costat Te E( X + c) = λ + 4c 8 Var( X + c) = c 3 λ 6c 4c λ 3 + O(λ 5 ) λ + 3c 5c + 7 λ + O(λ 3 ) 8 I particular, for c = 4 ( ) E X + 4 ( ) Var X + 4 = λ 64 λ 3 + O(λ 5 ) = λ λ + O(λ 3 ) Desity estimatio troug regressio Desity estimatio E : X, X,, X iid wit desity f (x), x [0, ] Poisso (very close to desity estimatio) F : X, X,, X N iid wit desity f (x), N P oisso (), x [0, ] 4
5 Te first step of te procedure is biig Let T be some positive iteger (Te coice of T will be discussed later) Divide {X i } ito T equal legt subitervals betwee 0 ad Let Q,, Q T be te umber of observatios i eac of te subitervals Te Q i s joitly ave a multiomial distributio Note tat if te sample size is Poissoized, tat is, it is ot fixed but a Poisso radom variable wit mea ad idepedet of te X i s, te te couts {Q i : i =,, T } are idepedet Poisso radom variables wit Q i Poisso(mp i ) were p i = T i T i T f(x)dx We te apply te mea-matcig root trasform discussed i Sectio?? Set Y i = Q i + 4, were Q i = Card({k : X k I i }), i =,, T, ad treat Y = (Y, Y,, Y T ) as te ew equi-spaced sample for a oparametric regressio problem Troug biig ad te root trasform te desity estimatio problem as ow bee trasferred to a equi-spaced, early costat variace oparametric regressio problem Ay good oparametric regressio procedure, suc as a kerel, splie or wavelet procedure, ca be applied to yield a estimator f of f Te fial desity estimator ca be obtaied by ormalizig te square of f Algoritmically, te root-uroot desity estimatio procedure ca be summarized as follows Biig: Divide {X i } ito T equal legt itervals betwee 0 ad Let Q, Q,, Q T be te umber of observatios i eac of te itervals Root Trasform: Let Y i = Q i + 4, i =,, T, ad treat Y = (Y, Y,, Y T ) as te ew equi-spaced sample for a oparametric regressio problem 3 Noparametric Regressio: Apply your favorite oparametric regressio procedure to te bied ad root trasformed data Y to obtai a estimate f of f 4 Uroot: Te desity fuctio f is estimated by f = ( f) 5 Normalizatio: Te estimator ˆf give i Step 4 may ot itegrate to Set f(t) = f(t)/ f(t)dt 0 ad use f as te fial estimator Teoretical property 5
6 We sall use te quatile couplig iequality of Komlós, Major ad Tusády (975) to approximate te bied ad root trasformed data by idepedet ormal variables Te followig lemma is a direct cosequece of te results give i Komlós, Major ad Tusády (975) ad Zou (006) Lemma Let λ > 0 ad let X Poisso(λ) Tere exists a stadard ormal radom variable Z N(0, ) ad costats c, c, c 3 > 0 ot depedig o λ suc tat weever te evet A = { X λ c λ} occurs, X λ λz < c Z + c 3 Propositio 3 Let m = /T We ca write Y i as Y i = mp i + ɛ i + Z i + ξ i, i =,,, T, were Z i iid N(0, ), ɛ i are costats satisfyig ɛ i 64 (mp i) 3 (+o()) ad cosequetly for some costat C > 0 T ɛ i C m 4, i= ad ξ i are idepedet ad stocastically small radom variables satisfyig E ξ i l C l (mp i ) l ad P ( ξ i > a) C l (a mp i ) l were l > 0, a > 0 ad C l > 0 is a costat depedig o l oly 3 Gaussia regressio ad Expoetial family regressio Mea-matcig variace stabilizig trasformatio Let X, X,, X m be a radom sample from a distributio i a atural expoetial families wit te probability desity/mass fuctio q(x η) = e ηx ψ(η) (x) Here η is called te atural parameter Te mea ad variace are respectively µ(η) = ψ (η), ad σ (η) = ψ (η) A special subclass of iterest is te oe wit a quadratic variace fuctio (QVF), σ V (µ) = a 0 + a µ + a µ I tis case we sall write X i NQ(µ) Te NEF-QVF families cosist of six distributios, tree cotiuous: ormal, gamma, ad NEF-GHS distributios ad tree discrete: biomial, egative biomial, ad Poisso See, eg, Morris (98) ad Brow (986) 6
7 Set X = m i= X i Accordig to te Cetral Limit Teorem, L m(x/m µ(η)) N(0, V (µ(η))), as m A variace stabilizig trasformatio (VST) is a fuctio G : R R suc tat G (µ) = V (µ) Te stadard delta metod te yields L m{g(x/m) G(µ(η))} N(0, ) It is kow tat te variace stabilizig properties ca ofte be furter improved by usig a trasformatio of te form H m (X) = G( X + a m + b ) wit suitable coice of costats a ad b See, eg, Ascombe (948) I tis paper we sall use te VST as a tool for oparametric regressio i expoetial families For tis purpose, it is more importat to optimally matc te meas ta to optimally stabilize te variace Tat is, we wis to coose te costats a ad b suc tat E{H m (X)} optimally matces G(µ(η)) To derive te optimal coice of a ad b, we eed te followig expasios for te mea ad variace of te trasformed variable H m (X) Lemma 4 Let Θ 0 be a compact subset of te atural parameter space Θ Assume tat η Θ 0 ad te variace σ (η) is positive o Θ 0 Te for costats a ad b ad E{H m (X)} G(µ(η)) = σ(η) (a bµ(η) µ (η) 4µ (η) ) m + O(m ) V ar{h m (X)} = m + O(m ) Moreover, tere exist costats a ad b suc tat E{G( X + a m + b )} G(µ(η)) = O(m ) if ad oly if te expoetial family as a quadratic variace fuctio Note tat if ad oly if a bµ(η) µ (η) 4µ (η) = 0 σ (η) = µ (η) = a 0 + 4aµ(η) bµ (η) for some costat a 0 Hece te solutio of te differetial equatio is exactly te subclass of atural expoetial family wit a quadratic variace fuctio (QVF) Te followig are te specific expressios of te mea-matcig VST H m for te five distributios (oter ta ormal) i te NEF-QVF families 7
8 Poisso: a = /4, b = 0, ad H m (X) = (X + 4 )/m Biomial(r, p): a = /4, b = r, ad H m(x) = ( ) r arcsi X+/4 rm+/ Negative Biomial(r, p): a = /4, b = r, ad ( H m (X) = ) X + /4 r l mr / + + X + /4 mr / Gamma(r, λ) (wit r kow): a = 0, b = r, ad H m(x) = r l( NEF-GHS(r, λ) (wit r kow): a = 0, b = r, ad H m (X) = ( ) X r l rm / + X + (mr /) X rm / ) Expoetial family regressio troug Gaussia regressio Suppose we observe Y i id NQ(f(t i )), i =,,, t i = i ad wis to estimate te mea fuctio f(t) Similar to desity estimatio we ave Biig: Divide {Y i } ito T equal legt itervals betwee 0 ad Let Q, Q,, Q T be te sum of te observatios i eac of te itervals VST: Let Y j = mh m (Q j ), j =,, T, ad treat Y = (Y, Y,, Y T ) as te ew equi-spaced sample for a oparametric Gaussia regressio problem 3 Gaussia Regressio: Apply your favorite oparametric regressio procedure to te bied ad trasformed data Y to obtai a estimate Ĝ (f) of G (f) 4 Iverse VST: Estimate te mea fuctio f by f = G (Ĝ (f) ) We defie G (a) = 0 we a < 0 i te case of Negative Biomial ad NEF-GHS distributios 3 Gaussia regressio ad Robust estimatio Cosider te oparametric regressio model Y i = f( i ) + ξ i, i =,, 8
9 were te errors ξ i are idepedet ad idetically distributed wit some desity Quatile couplig for media Let X,, X be iid radom variables wit desity fuctio Deote te sample media by X med Assumptio (A): 0 (x) =, (0) > 0, ad (x) is Lipscitz at x = 0 Here te Lipscitz coditio at 0 meas tat tere is a costat C > 0 suc tat (x) (0) C x i a ope eigborood of 0 Teorem 5 Let Z be a stadard ormal radom variable ad let X,, X be iid wit desity fuctio were = k + for some iteger k Let Assumptio ((A) old ) Te for every tere is a mappig X med (Z) : R R suc tat L Xmed (Z) = L (X med ) ad 4 (0) X med Z C + C 4 (0) Xmed, we Xmed ε were C, ε > 0 deped o but ot o Corollary 6 Uder te assumptio of Teorem 5, te mappig X med (Z) i Teorem 5 satisfies 4 (0) X med Z C ( + Z ), we Z ε were C, ε > 0 do ot deped o Remark 7 We = k is eve, te sample media X med is usually take to be ( ) X (k) + X (k+) / Similar quatile couplig iequalities ca be obtaied For eac i, let X i,med be te media of te origial sample wit X i removed Te X med = i= X i,med Let G be te distributio of te media of iid observatios wit desity ad defie (Z i ) i L ( Φ G (X i,med ), i ) Let X ( ) i,med = G Φ(Z i) Te L X i,med, i = L (X i,med, i ) Now a direct applicatio of Teorem 5 gives X med Z C ( + ( ) ) 4 (0) X(k) + X(k+) we X (k) + X(k+) ε, ad Z = i= Z i Te couplig result give i Teorem 5 i fact olds uiformly over a ric collectio of distributios For 0 < ɛ < ad ɛ > 0 defie { 0 H ɛ,ɛ = : (x) =, ɛ (0), (x) (0) x } for all x < ɛ ɛ ɛ It ca be sow tat Teorem 5 olds uiformly for te wole family of H ɛ,ɛ 9
10 Teorem 8 Let X,, X be iid wit desity H ɛ,ɛ For every = k ( + wit ) iteger k, tere is a mappig X med (Z) : R R suc tat L Xmed (Z) = L (X med ) ad for two costats C ɛ,ɛ, ε ɛ,ɛ > 0 depedig oly o ɛ ad ɛ 4 (0) X med Z C ɛ,ɛ + C ɛ,ɛ 4 (0) Xmed, we Xmed ε ɛ,ɛ uiformly over all H ɛ,ɛ Remark 9 Te quatile couplig iequalities i Corollary 6 ad Remark 7 also old uiformly over H ɛ,ɛ by replacig C ad ε tere wit two costats depedig ɛ ad ɛ 0
LECTURE 2 LEAST SQUARES CROSS-VALIDATION FOR KERNEL DENSITY ESTIMATION
Jauary 3 07 LECTURE LEAST SQUARES CROSS-VALIDATION FOR ERNEL DENSITY ESTIMATION Noparametric kerel estimatio is extremely sesitive to te coice of badwidt as larger values of result i averagig over more
More informationNonparametric regression: minimax upper and lower bounds
Capter 4 Noparametric regressio: miimax upper ad lower bouds 4. Itroductio We cosider oe of te two te most classical o-parametric problems i tis example: estimatig a regressio fuctio o a subset of te real
More informationLecture 7: Properties of Random Samples
Lecture 7: Properties of Radom Samples 1 Cotiued From Last Class Theorem 1.1. Let X 1, X,...X be a radom sample from a populatio with mea µ ad variace σ
More informationDistribution of Random Samples & Limit theorems
STAT/MATH 395 A - PROBABILITY II UW Witer Quarter 2017 Néhémy Lim Distributio of Radom Samples & Limit theorems 1 Distributio of i.i.d. Samples Motivatig example. Assume that the goal of a study is to
More informationKernel density estimator
Jauary, 07 NONPARAMETRIC ERNEL DENSITY ESTIMATION I this lecture, we discuss kerel estimatio of probability desity fuctios PDF Noparametric desity estimatio is oe of the cetral problems i statistics I
More information7.1 Convergence of sequences of random variables
Chapter 7 Limit theorems Throughout this sectio we will assume a probability space (Ω, F, P), i which is defied a ifiite sequece of radom variables (X ) ad a radom variable X. The fact that for every ifiite
More informationUnbiased Estimation. February 7-12, 2008
Ubiased Estimatio February 7-2, 2008 We begi with a sample X = (X,..., X ) of radom variables chose accordig to oe of a family of probabilities P θ where θ is elemet from the parameter space Θ. For radom
More informationResampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n.
Jauary 1, 2019 Resamplig Methods Motivatio We have so may estimators with the property θ θ d N 0, σ 2 We ca also write θ a N θ, σ 2 /, where a meas approximately distributed as Oce we have a cosistet estimator
More information7.1 Convergence of sequences of random variables
Chapter 7 Limit Theorems Throughout this sectio we will assume a probability space (, F, P), i which is defied a ifiite sequece of radom variables (X ) ad a radom variable X. The fact that for every ifiite
More informationSequences and Series of Functions
Chapter 6 Sequeces ad Series of Fuctios 6.1. Covergece of a Sequece of Fuctios Poitwise Covergece. Defiitio 6.1. Let, for each N, fuctio f : A R be defied. If, for each x A, the sequece (f (x)) coverges
More informationSimulation. Two Rule For Inverting A Distribution Function
Simulatio Two Rule For Ivertig A Distributio Fuctio Rule 1. If F(x) = u is costat o a iterval [x 1, x 2 ), the the uiform value u is mapped oto x 2 through the iversio process. Rule 2. If there is a jump
More information1 of 7 7/16/2009 6:06 AM Virtual Laboratories > 6. Radom Samples > 1 2 3 4 5 6 7 6. Order Statistics Defiitios Suppose agai that we have a basic radom experimet, ad that X is a real-valued radom variable
More information17. Joint distributions of extreme order statistics Lehmann 5.1; Ferguson 15
17. Joit distributios of extreme order statistics Lehma 5.1; Ferguso 15 I Example 10., we derived the asymptotic distributio of the maximum from a radom sample from a uiform distributio. We did this usig
More informationIt is often useful to approximate complicated functions using simpler ones. We consider the task of approximating a function by a polynomial.
Taylor Polyomials ad Taylor Series It is ofte useful to approximate complicated fuctios usig simpler oes We cosider the task of approximatig a fuctio by a polyomial If f is at least -times differetiable
More informationConvergence of random variables. (telegram style notes) P.J.C. Spreij
Covergece of radom variables (telegram style otes).j.c. Spreij this versio: September 6, 2005 Itroductio As we kow, radom variables are by defiitio measurable fuctios o some uderlyig measurable space
More informationLecture 13: Maximum Likelihood Estimation
ECE90 Sprig 007 Statistical Learig Theory Istructor: R. Nowak Lecture 3: Maximum Likelihood Estimatio Summary of Lecture I the last lecture we derived a risk (MSE) boud for regressio problems; i.e., select
More informationGeneralized Semi- Markov Processes (GSMP)
Geeralized Semi- Markov Processes (GSMP) Summary Some Defiitios Markov ad Semi-Markov Processes The Poisso Process Properties of the Poisso Process Iterarrival times Memoryless property ad the residual
More informationLecture 9: Regression: Regressogram and Kernel Regression
STAT 425: Itroductio to Noparametric Statistics Witer 208 Lecture 9: Regressio: Regressogram ad erel Regressio Istructor: Ye-Ci Ce Referece: Capter 5 of All of oparametric statistics 9 Itroductio Let X,
More informationExponential Families and Bayesian Inference
Computer Visio Expoetial Families ad Bayesia Iferece Lecture Expoetial Families A expoetial family of distributios is a d-parameter family f(x; havig the followig form: f(x; = h(xe g(t T (x B(, (. where
More informationECE 901 Lecture 12: Complexity Regularization and the Squared Loss
ECE 90 Lecture : Complexity Regularizatio ad the Squared Loss R. Nowak 5/7/009 I the previous lectures we made use of the Cheroff/Hoeffdig bouds for our aalysis of classifier errors. Hoeffdig s iequality
More informationON LOCAL LINEAR ESTIMATION IN NONPARAMETRIC ERRORS-IN-VARIABLES MODELS 1
Teory of Stocastic Processes Vol2 28, o3-4, 2006, pp*-* SILVELYN ZWANZIG ON LOCAL LINEAR ESTIMATION IN NONPARAMETRIC ERRORS-IN-VARIABLES MODELS Local liear metods are applied to a oparametric regressio
More informationChapter 6 Principles of Data Reduction
Chapter 6 for BST 695: Special Topics i Statistical Theory. Kui Zhag, 0 Chapter 6 Priciples of Data Reductio Sectio 6. Itroductio Goal: To summarize or reduce the data X, X,, X to get iformatio about a
More informationLaw of the sum of Bernoulli random variables
Law of the sum of Beroulli radom variables Nicolas Chevallier Uiversité de Haute Alsace, 4, rue des frères Lumière 68093 Mulhouse icolas.chevallier@uha.fr December 006 Abstract Let be the set of all possible
More informationFall 2013 MTH431/531 Real analysis Section Notes
Fall 013 MTH431/531 Real aalysis Sectio 8.1-8. Notes Yi Su 013.11.1 1. Defiitio of uiform covergece. We look at a sequece of fuctios f (x) ad study the coverget property. Notice we have two parameters
More informationWeek 10. f2 j=2 2 j k ; j; k 2 Zg is an orthonormal basis for L 2 (R). This function is called mother wavelet, which can be often constructed
Wee 0 A Itroductio to Wavelet regressio. De itio: Wavelet is a fuctio such that f j= j ; j; Zg is a orthoormal basis for L (R). This fuctio is called mother wavelet, which ca be ofte costructed from father
More informationLecture 6 Chi Square Distribution (χ 2 ) and Least Squares Fitting
Lecture 6 Chi Square Distributio (χ ) ad Least Squares Fittig Chi Square Distributio (χ ) Suppose: We have a set of measuremets {x 1, x, x }. We kow the true value of each x i (x t1, x t, x t ). We would
More informationMASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 19 11/17/2008 LAWS OF LARGE NUMBERS II THE STRONG LAW OF LARGE NUMBERS
MASSACHUSTTS INSTITUT OF TCHNOLOGY 6.436J/5.085J Fall 2008 Lecture 9 /7/2008 LAWS OF LARG NUMBRS II Cotets. The strog law of large umbers 2. The Cheroff boud TH STRONG LAW OF LARG NUMBRS While the weak
More informationThe Central Limit Theorem
Chapter The Cetral Limit Theorem Deote by Z the stadard ormal radom variable with desity 2π e x2 /2. Lemma.. Ee itz = e t2 /2 Proof. We use the same calculatio as for the momet geeratig fuctio: exp(itx
More informationIntroduction to Extreme Value Theory Laurens de Haan, ISM Japan, Erasmus University Rotterdam, NL University of Lisbon, PT
Itroductio to Extreme Value Theory Laures de Haa, ISM Japa, 202 Itroductio to Extreme Value Theory Laures de Haa Erasmus Uiversity Rotterdam, NL Uiversity of Lisbo, PT Itroductio to Extreme Value Theory
More information1 = δ2 (0, ), Y Y n nδ. , T n = Y Y n n. ( U n,k + X ) ( f U n,k + Y ) n 2n f U n,k + θ Y ) 2 E X1 2 X1
8. The cetral limit theorems 8.1. The cetral limit theorem for i.i.d. sequeces. ecall that C ( is N -separatig. Theorem 8.1. Let X 1, X,... be i.i.d. radom variables with EX 1 = ad EX 1 = σ (,. Suppose
More information32 estimating the cumulative distribution function
32 estimatig the cumulative distributio fuctio 4.6 types of cofidece itervals/bads Let F be a class of distributio fuctios F ad let θ be some quatity of iterest, such as the mea of F or the whole fuctio
More informationDefinition 4.2. (a) A sequence {x n } in a Banach space X is a basis for X if. unique scalars a n (x) such that x = n. a n (x) x n. (4.
4. BASES I BAACH SPACES 39 4. BASES I BAACH SPACES Sice a Baach space X is a vector space, it must possess a Hamel, or vector space, basis, i.e., a subset {x γ } γ Γ whose fiite liear spa is all of X ad
More informationLecture Stat Maximum Likelihood Estimation
Lecture Stat 461-561 Maximum Likelihood Estimatio A.D. Jauary 2008 A.D. () Jauary 2008 1 / 63 Maximum Likelihood Estimatio Ivariace Cosistecy E ciecy Nuisace Parameters A.D. () Jauary 2008 2 / 63 Parametric
More informationEFFECTIVE WLLN, SLLN, AND CLT IN STATISTICAL MODELS
EFFECTIVE WLLN, SLLN, AND CLT IN STATISTICAL MODELS Ryszard Zieliński Ist Math Polish Acad Sc POBox 21, 00-956 Warszawa 10, Polad e-mail: rziel@impagovpl ABSTRACT Weak laws of large umbers (W LLN), strog
More informationMa 4121: Introduction to Lebesgue Integration Solutions to Homework Assignment 5
Ma 42: Itroductio to Lebesgue Itegratio Solutios to Homework Assigmet 5 Prof. Wickerhauser Due Thursday, April th, 23 Please retur your solutios to the istructor by the ed of class o the due date. You
More informationBerry-Esseen bounds for self-normalized martingales
Berry-Essee bouds for self-ormalized martigales Xiequa Fa a, Qi-Ma Shao b a Ceter for Applied Mathematics, Tiaji Uiversity, Tiaji 30007, Chia b Departmet of Statistics, The Chiese Uiversity of Hog Kog,
More informationLecture 6 Chi Square Distribution (χ 2 ) and Least Squares Fitting
Lecture 6 Chi Square Distributio (χ ) ad Least Squares Fittig Chi Square Distributio (χ ) Suppose: We have a set of measuremets {x 1, x, x }. We kow the true value of each x i (x t1, x t, x t ). We would
More informationStatistical Inference Based on Extremum Estimators
T. Rotheberg Fall, 2007 Statistical Iferece Based o Extremum Estimators Itroductio Suppose 0, the true value of a p-dimesioal parameter, is kow to lie i some subset S R p : Ofte we choose to estimate 0
More informationAn Introduction to Asymptotic Theory
A Itroductio to Asymptotic Theory Pig Yu School of Ecoomics ad Fiace The Uiversity of Hog Kog Pig Yu (HKU) Asymptotic Theory 1 / 20 Five Weapos i Asymptotic Theory Five Weapos i Asymptotic Theory Pig Yu
More informationSince X n /n P p, we know that X n (n. Xn (n X n ) Using the asymptotic result above to obtain an approximation for fixed n, we obtain
Assigmet 9 Exercise 5.5 Let X biomial, p, where p 0, 1 is ukow. Obtai cofidece itervals for p i two differet ways: a Sice X / p d N0, p1 p], the variace of the limitig distributio depeds oly o p. Use the
More informationThis section is optional.
4 Momet Geeratig Fuctios* This sectio is optioal. The momet geeratig fuctio g : R R of a radom variable X is defied as g(t) = E[e tx ]. Propositio 1. We have g () (0) = E[X ] for = 1, 2,... Proof. Therefore
More informationLECTURE 8: ASYMPTOTICS I
LECTURE 8: ASYMPTOTICS I We are iterested i the properties of estimators as. Cosider a sequece of radom variables {, X 1}. N. M. Kiefer, Corell Uiversity, Ecoomics 60 1 Defiitio: (Weak covergece) A sequece
More informationChapter 3. Strong convergence. 3.1 Definition of almost sure convergence
Chapter 3 Strog covergece As poited out i the Chapter 2, there are multiple ways to defie the otio of covergece of a sequece of radom variables. That chapter defied covergece i probability, covergece i
More informationPAPER : IIT-JAM 2010
MATHEMATICS-MA (CODE A) Q.-Q.5: Oly oe optio is correct for each questio. Each questio carries (+6) marks for correct aswer ad ( ) marks for icorrect aswer.. Which of the followig coditios does NOT esure
More informationECONOMETRIC THEORY. MODULE XIII Lecture - 34 Asymptotic Theory and Stochastic Regressors
ECONOMETRIC THEORY MODULE XIII Lecture - 34 Asymptotic Theory ad Stochastic Regressors Dr. Shalabh Departmet of Mathematics ad Statistics Idia Istitute of Techology Kapur Asymptotic theory The asymptotic
More informationEconomics 241B Relation to Method of Moments and Maximum Likelihood OLSE as a Maximum Likelihood Estimator
Ecoomics 24B Relatio to Method of Momets ad Maximum Likelihood OLSE as a Maximum Likelihood Estimator Uder Assumptio 5 we have speci ed the distributio of the error, so we ca estimate the model parameters
More information62. Power series Definition 16. (Power series) Given a sequence {c n }, the series. c n x n = c 0 + c 1 x + c 2 x 2 + c 3 x 3 +
62. Power series Defiitio 16. (Power series) Give a sequece {c }, the series c x = c 0 + c 1 x + c 2 x 2 + c 3 x 3 + is called a power series i the variable x. The umbers c are called the coefficiets of
More informationEcon 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chi-square Distribution, Student s t distribution 1.
Eco 325/327 Notes o Sample Mea, Sample Proportio, Cetral Limit Theorem, Chi-square Distributio, Studet s t distributio 1 Sample Mea By Hiro Kasahara We cosider a radom sample from a populatio. Defiitio
More informationAdvanced Stochastic Processes.
Advaced Stochastic Processes. David Gamarik LECTURE 2 Radom variables ad measurable fuctios. Strog Law of Large Numbers (SLLN). Scary stuff cotiued... Outlie of Lecture Radom variables ad measurable fuctios.
More informationLecture 15: Density estimation
Lecture 15: Desity estimatio Why do we estimate a desity? Suppose that X 1,...,X are i.i.d. radom variables from F ad that F is ukow but has a Lebesgue p.d.f. f. Estimatio of F ca be doe by estimatig f.
More informationMASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/15.070J Fall 2013 Lecture 21 11/27/2013
MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/15.070J Fall 2013 Lecture 21 11/27/2013 Fuctioal Law of Large Numbers. Costructio of the Wieer Measure Cotet. 1. Additioal techical results o weak covergece
More informationLecture 20: Multivariate convergence and the Central Limit Theorem
Lecture 20: Multivariate covergece ad the Cetral Limit Theorem Covergece i distributio for radom vectors Let Z,Z 1,Z 2,... be radom vectors o R k. If the cdf of Z is cotiuous, the we ca defie covergece
More informationMATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4
MATH 30: Probability ad Statistics 9. Estimatio ad Testig of Parameters Estimatio ad Testig of Parameters We have bee dealig situatios i which we have full kowledge of the distributio of a radom variable.
More informationMath 341 Lecture #31 6.5: Power Series
Math 341 Lecture #31 6.5: Power Series We ow tur our attetio to a particular kid of series of fuctios, amely, power series, f(x = a x = a 0 + a 1 x + a 2 x 2 + where a R for all N. I terms of a series
More informationNotes On Nonparametric Density Estimation. James L. Powell Department of Economics University of California, Berkeley
Notes O Noparametric Desity Estimatio James L Powell Departmet of Ecoomics Uiversity of Califoria, Berkeley Uivariate Desity Estimatio via Numerical Derivatives Cosider te problem of estimatig te desity
More informationApplication to Random Graphs
A Applicatio to Radom Graphs Brachig processes have a umber of iterestig ad importat applicatios. We shall cosider oe of the most famous of them, the Erdős-Réyi radom graph theory. 1 Defiitio A.1. Let
More informationIIT JAM Mathematical Statistics (MS) 2006 SECTION A
IIT JAM Mathematical Statistics (MS) 6 SECTION A. If a > for ad lim a / L >, the which of the followig series is ot coverget? (a) (b) (c) (d) (d) = = a = a = a a + / a lim a a / + = lim a / a / + = lim
More informationDiscrete Mathematics for CS Spring 2008 David Wagner Note 22
CS 70 Discrete Mathematics for CS Sprig 2008 David Wager Note 22 I.I.D. Radom Variables Estimatig the bias of a coi Questio: We wat to estimate the proportio p of Democrats i the US populatio, by takig
More informationProbability and statistics: basic terms
Probability ad statistics: basic terms M. Veeraraghava August 203 A radom variable is a rule that assigs a umerical value to each possible outcome of a experimet. Outcomes of a experimet form the sample
More information4. Partial Sums and the Central Limit Theorem
1 of 10 7/16/2009 6:05 AM Virtual Laboratories > 6. Radom Samples > 1 2 3 4 5 6 7 4. Partial Sums ad the Cetral Limit Theorem The cetral limit theorem ad the law of large umbers are the two fudametal theorems
More informationUniformly Consistency of the Cauchy-Transformation Kernel Density Estimation Underlying Strong Mixing
Appl. Mat. If. Sci. 7, No. L, 5-9 (203) 5 Applied Matematics & Iformatio Scieces A Iteratioal Joural c 203 NSP Natural Scieces Publisig Cor. Uiformly Cosistecy of te Caucy-Trasformatio Kerel Desity Estimatio
More informationThe Poisson Process *
OpeStax-CNX module: m11255 1 The Poisso Process * Do Johso This work is produced by OpeStax-CNX ad licesed uder the Creative Commos Attributio Licese 1.0 Some sigals have o waveform. Cosider the measuremet
More informationExercise 4.3 Use the Continuity Theorem to prove the Cramér-Wold Theorem, Theorem. (1) φ a X(1).
Assigmet 7 Exercise 4.3 Use the Cotiuity Theorem to prove the Cramér-Wold Theorem, Theorem 4.12. Hit: a X d a X implies that φ a X (1) φ a X(1). Sketch of solutio: As we poited out i class, the oly tricky
More informationHOMEWORK I: PREREQUISITES FROM MATH 727
HOMEWORK I: PREREQUISITES FROM MATH 727 Questio. Let X, X 2,... be idepedet expoetial radom variables with mea µ. (a) Show that for Z +, we have EX µ!. (b) Show that almost surely, X + + X (c) Fid the
More informationLecture 2: Poisson Sta*s*cs Probability Density Func*ons Expecta*on and Variance Es*mators
Lecture 2: Poisso Sta*s*cs Probability Desity Fuc*os Expecta*o ad Variace Es*mators Biomial Distribu*o: P (k successes i attempts) =! k!( k)! p k s( p s ) k prob of each success Poisso Distributio Note
More information5. Likelihood Ratio Tests
1 of 5 7/29/2009 3:16 PM Virtual Laboratories > 9. Hy pothesis Testig > 1 2 3 4 5 6 7 5. Likelihood Ratio Tests Prelimiaries As usual, our startig poit is a radom experimet with a uderlyig sample space,
More informationLecture 7 Testing Nonlinear Inequality Restrictions 1
Eco 75 Lecture 7 Testig Noliear Iequality Restrictios I Lecture 6, we discussed te testig problems were te ull ypotesis is de ed by oliear equality restrictios: H : ( ) = versus H : ( ) 6= : () We sowed
More informationECE 901 Lecture 13: Maximum Likelihood Estimation
ECE 90 Lecture 3: Maximum Likelihood Estimatio R. Nowak 5/7/009 The focus of this lecture is to cosider aother approach to learig based o maximum likelihood estimatio. Ulike earlier approaches cosidered
More informationElements of Statistical Methods Lots of Data or Large Samples (Ch 8)
Elemets of Statistical Methods Lots of Data or Large Samples (Ch 8) Fritz Scholz Sprig Quarter 2010 February 26, 2010 x ad X We itroduced the sample mea x as the average of the observed sample values x
More informationPartial match queries: a limit process
Partial match queries: a limit process Nicolas Brouti Ralph Neiiger Heig Sulzbach Partial match queries: a limit process 1 / 17 Searchig geometric data ad quadtrees 1 Partial match queries: a limit process
More informationLecture 33: Bootstrap
Lecture 33: ootstrap Motivatio To evaluate ad compare differet estimators, we eed cosistet estimators of variaces or asymptotic variaces of estimators. This is also importat for hypothesis testig ad cofidece
More informationEstimation of the essential supremum of a regression function
Estimatio of the essetial supremum of a regressio fuctio Michael ohler, Adam rzyżak 2, ad Harro Walk 3 Fachbereich Mathematik, Techische Uiversität Darmstadt, Schlossgartestr. 7, 64289 Darmstadt, Germay,
More informationAn Introduction to Randomized Algorithms
A Itroductio to Radomized Algorithms The focus of this lecture is to study a radomized algorithm for quick sort, aalyze it usig probabilistic recurrece relatios, ad also provide more geeral tools for aalysis
More informationA survey on penalized empirical risk minimization Sara A. van de Geer
A survey o pealized empirical risk miimizatio Sara A. va de Geer We address the questio how to choose the pealty i empirical risk miimizatio. Roughly speakig, this pealty should be a good boud for the
More informationProbability 2 - Notes 10. Lemma. If X is a random variable and g(x) 0 for all x in the support of f X, then P(g(X) 1) E[g(X)].
Probability 2 - Notes 0 Some Useful Iequalities. Lemma. If X is a radom variable ad g(x 0 for all x i the support of f X, the P(g(X E[g(X]. Proof. (cotiuous case P(g(X Corollaries x:g(x f X (xdx x:g(x
More informationLimit Theorems. Convergence in Probability. Let X be the number of heads observed in n tosses. Then, E[X] = np and Var[X] = np(1-p).
Limit Theorems Covergece i Probability Let X be the umber of heads observed i tosses. The, E[X] = p ad Var[X] = p(-p). L O This P x p NM QP P x p should be close to uity for large if our ituitio is correct.
More informationMAS111 Convergence and Continuity
MAS Covergece ad Cotiuity Key Objectives At the ed of the course, studets should kow the followig topics ad be able to apply the basic priciples ad theorems therei to solvig various problems cocerig covergece
More information4 Conditional Distribution Estimation
4 Coditioal Distributio Estimatio 4. Estimators Te coditioal distributio (CDF) of y i give X i = x is F (y j x) = P (y i y j X i = x) = E ( (y i y) j X i = x) : Tis is te coditioal mea of te radom variable
More informationEntropy and Ergodic Theory Lecture 5: Joint typicality and conditional AEP
Etropy ad Ergodic Theory Lecture 5: Joit typicality ad coditioal AEP 1 Notatio: from RVs back to distributios Let (Ω, F, P) be a probability space, ad let X ad Y be A- ad B-valued discrete RVs, respectively.
More informationMASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/15.070J Fall 2013 Lecture 6 9/23/2013. Brownian motion. Introduction
MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/5.070J Fall 203 Lecture 6 9/23/203 Browia motio. Itroductio Cotet.. A heuristic costructio of a Browia motio from a radom walk. 2. Defiitio ad basic properties
More informationGamma Distribution and Gamma Approximation
Gamma Distributio ad Gamma Approimatio Xiaomig Zeg a Fuhua (Frak Cheg b a Xiame Uiversity, Xiame 365, Chia mzeg@jigia.mu.edu.c b Uiversity of Ketucky, Leigto, Ketucky 456-46, USA cheg@cs.uky.edu Abstract
More informationEstimation of the Mean and the ACVF
Chapter 5 Estimatio of the Mea ad the ACVF A statioary process {X t } is characterized by its mea ad its autocovariace fuctio γ ), ad so by the autocorrelatio fuctio ρ ) I this chapter we preset the estimators
More informationLecture 19: Convergence
Lecture 19: Covergece Asymptotic approach I statistical aalysis or iferece, a key to the success of fidig a good procedure is beig able to fid some momets ad/or distributios of various statistics. I may
More informationLecture 7: Density Estimation: k-nearest Neighbor and Basis Approach
STAT 425: Itroductio to Noparametric Statistics Witer 28 Lecture 7: Desity Estimatio: k-nearest Neighbor ad Basis Approach Istructor: Ye-Chi Che Referece: Sectio 8.4 of All of Noparametric Statistics.
More informationNANYANG TECHNOLOGICAL UNIVERSITY SYLLABUS FOR ENTRANCE EXAMINATION FOR INTERNATIONAL STUDENTS AO-LEVEL MATHEMATICS
NANYANG TECHNOLOGICAL UNIVERSITY SYLLABUS FOR ENTRANCE EXAMINATION FOR INTERNATIONAL STUDENTS AO-LEVEL MATHEMATICS STRUCTURE OF EXAMINATION PAPER. There will be oe 2-hour paper cosistig of 4 questios.
More informationLet us give one more example of MLE. Example 3. The uniform distribution U[0, θ] on the interval [0, θ] has p.d.f.
Lecture 5 Let us give oe more example of MLE. Example 3. The uiform distributio U[0, ] o the iterval [0, ] has p.d.f. { 1 f(x =, 0 x, 0, otherwise The likelihood fuctio ϕ( = f(x i = 1 I(X 1,..., X [0,
More informationWe are mainly going to be concerned with power series in x, such as. (x)} converges - that is, lims N n
Review of Power Series, Power Series Solutios A power series i x - a is a ifiite series of the form c (x a) =c +c (x a)+(x a) +... We also call this a power series cetered at a. Ex. (x+) is cetered at
More informationFirst Year Quantitative Comp Exam Spring, Part I - 203A. f X (x) = 0 otherwise
First Year Quatitative Comp Exam Sprig, 2012 Istructio: There are three parts. Aswer every questio i every part. Questio I-1 Part I - 203A A radom variable X is distributed with the margial desity: >
More information1 Approximating Integrals using Taylor Polynomials
Seughee Ye Ma 8: Week 7 Nov Week 7 Summary This week, we will lear how we ca approximate itegrals usig Taylor series ad umerical methods. Topics Page Approximatig Itegrals usig Taylor Polyomials. Defiitios................................................
More informationMA Advanced Econometrics: Properties of Least Squares Estimators
MA Advaced Ecoometrics: Properties of Least Squares Estimators Karl Whela School of Ecoomics, UCD February 5, 20 Karl Whela UCD Least Squares Estimators February 5, 20 / 5 Part I Least Squares: Some Fiite-Sample
More informationJournal of Multivariate Analysis. Superefficient estimation of the marginals by exploiting knowledge on the copula
Joural of Multivariate Aalysis 102 (2011) 1315 1319 Cotets lists available at ScieceDirect Joural of Multivariate Aalysis joural homepage: www.elsevier.com/locate/jmva Superefficiet estimatio of the margials
More informationPrecise Rates in Complete Moment Convergence for Negatively Associated Sequences
Commuicatios of the Korea Statistical Society 29, Vol. 16, No. 5, 841 849 Precise Rates i Complete Momet Covergece for Negatively Associated Sequeces Dae-Hee Ryu 1,a a Departmet of Computer Sciece, ChugWoo
More informationECE 901 Lecture 14: Maximum Likelihood Estimation and Complexity Regularization
ECE 90 Lecture 4: Maximum Likelihood Estimatio ad Complexity Regularizatio R Nowak 5/7/009 Review : Maximum Likelihood Estimatio We have iid observatios draw from a ukow distributio Y i iid p θ, i,, where
More informationLecture 11 and 12: Basic estimation theory
Lecture ad 2: Basic estimatio theory Sprig 202 - EE 94 Networked estimatio ad cotrol Prof. Kha March 2 202 I. MAXIMUM-LIKELIHOOD ESTIMATORS The maximum likelihood priciple is deceptively simple. Louis
More informationAsymptotic distribution of products of sums of independent random variables
Proc. Idia Acad. Sci. Math. Sci. Vol. 3, No., May 03, pp. 83 9. c Idia Academy of Scieces Asymptotic distributio of products of sums of idepedet radom variables YANLING WANG, SUXIA YAO ad HONGXIA DU ollege
More informationApproximations and more PMFs and PDFs
Approximatios ad more PMFs ad PDFs Saad Meimeh 1 Approximatio of biomial with Poisso Cosider the biomial distributio ( b(k,,p = p k (1 p k, k λ: k Assume that is large, ad p is small, but p λ at the limit.
More informationLecture 27: Optimal Estimators and Functional Delta Method
Stat210B: Theoretical Statistics Lecture Date: April 19, 2007 Lecture 27: Optimal Estimators ad Fuctioal Delta Method Lecturer: Michael I. Jorda Scribe: Guilherme V. Rocha 1 Achievig Optimal Estimators
More informationAppendix to: Hypothesis Testing for Multiple Mean and Correlation Curves with Functional Data
Appedix to: Hypothesis Testig for Multiple Mea ad Correlatio Curves with Fuctioal Data Ao Yua 1, Hog-Bi Fag 1, Haiou Li 1, Coli O. Wu, Mig T. Ta 1, 1 Departmet of Biostatistics, Bioiformatics ad Biomathematics,
More informationLecture 01: the Central Limit Theorem. 1 Central Limit Theorem for i.i.d. random variables
CSCI-B609: A Theorist s Toolkit, Fall 06 Aug 3 Lecture 0: the Cetral Limit Theorem Lecturer: Yua Zhou Scribe: Yua Xie & Yua Zhou Cetral Limit Theorem for iid radom variables Let us say that we wat to aalyze
More information