10/ Statistical Machine Learning Homework #1 Solutions
|
|
- Ashley French
- 6 years ago
- Views:
Transcription
1 Caregie Mello Uiversity Departet of Statistics & Data Sciece 0/36-70 Statistical Macie Learig Hoework # Solutios Proble [40 pts.] DUE: February, 08 Let X,..., X P were X i [0, ] ad P as desity p. Let p be te istogra estiator usig bis. Let /. Recall tat te L error is ( p(x) p(x)) p (x)dx p(x)p(x)dx + p (x)dx. As usual, we ay igore te last ter so we defie te loss to be L() p (x)dx p(x)p(x)dx. (a) Suppose we used te direct estiator of te loss, aely, we replace te itegral wit te average to get L() p (x)dx p(x i ). i Sow tat tis fails i te sese tat it is iiized by takig 0. (b) Recall tat te leave-oe-out estiator of te risk is L() p (x)dx i p (i) (X i ), Sow tat L() were Z j is te uber of observatios i bi j. (c) Sow tat L() L() P 0. ( ) + ( ) j Z j Solutio. Defie for j,...,. θ j i (X i B j ) ad Z j θ j
2 0/36-70 Statistical Macie Learig: Hoework (a) (5 pts.) L() p (x)dx i p(x i ) θ j (x B j) dx k i θ j θk (x B j B k ) dx θ j (x B j ) dx θ j (x B j )dx θ j θ j θ j Zj θ j θ j θ j (X i B j ) θ j i (X i B j ) Z j is bouded below by, so L() as 0. Terefore, tis loss is iiized by takig 0. (b) (0 pts.) Fro part (a) we ave p (x)dx θ j. () Ad te secod ter i te leave-oe-out loss is i p ( i) (X i ) ( ) ( ) ( ) i (X i B j ) (X k B j ) k i (X i B j )( θ j (X i B j )) i ( θ j θ j ). ()
3 0/36-70 Statistical Macie Learig: Hoework Takig te differece of () ad (), we get L() θ j ( ) ( ) θ j + ( ) + ( ) θ j ( ) + ( ) ( θ j θ j ) ( ) θ j Z j. (c) (5 pts.) It suffices to sow i p ( i) (X i ) p(x)p(x)dx P 0. (3) Caveat: It is ot a valid lie of reasoig to directly say (3) olds by te weak law of large ubers sice te p ( i) are fuctios of te saple size. p(x)p(x)dx Bj θ j (x B j)p(x)dx θ j p(x)dx θ j p(x)dx B j θ j P (X B j) θ j θ j θ j by Fubii/Toelli teore Ad followig fro () we ave i p ( i) (X i ) ( ) ( ) ( θ j θ j ) θ j ( ) θ j θ j + ( ) θ j + O( ). θ j ( ) 3
4 0/36-70 Statistical Macie Learig: Hoework Terefore, i p ( i) (X i ) p(x)p(x)dx θ j P 0 θ j θ j + O( ) θ j ( θ j θ j ) + O( ) sice θ j by te weak law of large ubers. i (X i B j ) P P (X B j ) θ j 4
5 0/36-70 Statistical Macie Learig: Hoework Proble Let p be te kerel desity estiator (i oe diesio) wit badwidt. Let s (x) Var( p (x)). (a) Sow tat were p (x) E[ p (x)]. p (x) p(x) s (x) N(0, ) Hit: Recall tat te Lyapuov cetral liit teore says te followig: Suppose tat Y, Y,... are idepedet. Let µ i E[Y i ] ad σ i Var(Y i). Let s i σ i. If li s +δ i for soe δ > 0. Te s i (Y i µ u ) N(0, ). E[ Y i µ i +δ ] 0 (b) Assue tat te sootess is β. Suppose tat te badwidt is cose optially. Sow tat p (x) p(x) N(b(x), ) s (x) for soe costat b(x) wic is, i geeral, ot 0. Solutio. (a) Caveat: Te classical Cetral Liit Teore caot be applied ere, as is a fuctio of ad tus te K( x X i ) are ot idetically distributed. However, as te it suggests, te Lyapuov CLT still olds for o-idetically distributed rado variables. Clai. Let p >. Te p E K x X i p (x) Θ p. Proof. See appedix Now E K x X i p (x) +δ +δ +δ E K x X i p (x) Θ +δ +δ, 5
6 0/36-70 Statistical Macie Learig: Hoework ad Terefore, s +δ i E K s i E K E K Θ. x X i p (x) x X i p (x) x X i p (x) +δ Θ(() + δ ) Θ +δ +δ Θ(() δ ) 0, as ad, for ay δ > 0. So, by te Lyapuov CLT, p (x) p (x) s (x) N(0, ). (b) First ote p (x) p(x) s (x) p (x) p (x) s (x) p (x) p (x) s (x) + p (x) p(x) s (x) + Bias(p (x)) Var( p (x)). Fro Teore 5, te optial badwidt is Θ( /5 ). Now fro part (a), we ave ad fro Lea 3, Terefore, p (x) p(x) s (x) Var( p (x)) Θ Bias(p (x)) O( ). p (x) p (x) s (x) p (x) p (x) s (x) p (x) p (x) s (x) + Bias(p (x)) Var( p (x)) O( ) + Θ( () / ) + O( /5 ) Θ( /5 ) p (x) p (x) + O() s (x) N(0,) N(b(x), ). 6
7 0/36-70 Statistical Macie Learig: Hoework Proble 3 Let X,..., X P were X i [0, ]. Assue tat P as desity p wic as bouded cotiuous derivative. Let p (x) be te kerel desity estiator. Sow tat E[ p (0)] p(0) + O(). Solutio. E[ p(0)] E E K 0 0 i K X i X i 0 K(u )p(u)du / / p(0) p(0) K(t)p(t)dt K(t) p(0) + t +p(0) + t 0 / + O(), K(t)dt + O() 0 / +p(0) + o( ) dt / tk(t)dt + O( ) t K(t)dt 0 σk /< let t u were we assued K( ) is supported o [, ],, ad / 0 tk(t)dt is bouded. 7
8 0/36-70 Statistical Macie Learig: Hoework Proble 4 Let p be a desity o te real lie. Assue tat p is -ties cotiuously differetiable ad tat p () <. Let K be a iger order kerel. Tis eas tat K(y)dy, y j K(y)dy 0 for j, y K(y)dy < ad K (y)dy <. Sow tat te kerel estiator wit badwidt satisfies E ( p(x) p(x)) dx C + for soe C > 0. Wat is te optial badwidt ad wat is te correspodig rate of covergece (usig tis badwidt)? Solutio. See Capter 6 of []. 8
9 0/36-70 Statistical Macie Learig: Hoework Proble 5 [0 pts.] Let X,..., X P were X i [0, ] ad P as desity p. Let φ, φ,... be a ortooral basis for L [0, ]. Hece 0 φ j (x)dx for all j ad 0 φ j(x)φ k (x)dx 0 for j k. Assue tat te basis is uiforly bouded, i.e. sup j sup 0 x φ j (x) C <. We ay expad p as p(x) β jφ j (x) were β j φ j (x)p(x)dx. Defie k p(x) β j φ j (x) were β j (/) i φ j(x i ). (a) Sow tat te risk is bouded by for soe costat c > 0. ck + jk+ β j (b) Defie te Sobolev ellipsoid E(, L) of order as te set of desities of te for p(x) β jφ j (x) were β j j < L. Sow tat te risk for ay desity i E(, L) is bouded by c[(k/)+(/k) ]. Usig tis boud, fid te optial value of k ad fid te correspodig risk. Solutio. (a) (0 pts.) First ote, E[ β j ] E φ j (X i ) i E[φ j (x)] p(x)φ j(x)dx 0 β j. 9
10 0/36-70 Statistical Macie Learig: Hoework So β j is ubiased. Now, R( p(x)) E ( p(x) p(x)) dx E k β j φ j (x) β j φ j (x) dx E k ( β j β j )φ j (x) β j φ j (x) dx jk+ k E ( β j β j ) + β j jk+ k Var( β j ) + k Var(φ j(x i )) + C k + βj. jk+ βj jk+ βj jk+ sice φ i φ j δ ij (b) (0 pts.) sup R( p(x)) C k p E(,L) + C k C k C k βj jk+ + k jk+ β j k + jk+ β j j k + L k ax{c, L } k + k fro part (a) 0
11 0/36-70 Statistical Macie Learig: Hoework Proble 6 [40 pts.] Recall tat te total variatio distace betwee two distributios P ad Q is TV(P, Q) sup A P (A) Q(A). I soe sese, tis would be te ideal loss fuctio to use for desity estiatio. We oly use L because it is easier to deal wit. Here you will explore soe properties of TV. (a) Suppose tat P ad Q ave desities p ad q. Sow tat TV(P, Q) (/) p(x) q(x) dx. (b) Let T be ay appig. Let X ad Y be rado variables. Te sup A P (T (X) A) P (T (Y ) A) sup P (X A) P (Y A). A (c) Let K be a kerel. Recall tat te covolutio of a desity p wit K is (p K)(x) p(z)k(x z)dz. Sow tat Hece, sootig reduces L distace. p K q K K p q. Solutio. (a) (5 pts.) For ay easurable B R, p q p(x) q(x) dx (p(x) q(x))dx + B (q(x) p(x))dx R/B B p(x)dx B q(x)dx + R/B q(x)dx R/B p(x)dx B p(x)dx B q(x)dx + ( B q(x)dx) ( B p(x)dx) ( B p(x)dx B q(x)dx) P (B) Q(B) p q P (B) Q(B) for ay easurable B R. By otig, parallel reasoig sows p q q p, So togeter we ave, p q Q(B) P (B) for ay easurable B R. p q P (B) Q(B)
12 0/36-70 Statistical Macie Learig: Hoework ad tus for ay easurable B R. Now cosider te set B is easurable ad p q p(x) q(x) dx p q sup P (B) Q(B), (4) B R B {x R p(x) > q(x)}. B (p(x) q(x))dx + p(x))dx R/B (q(x) B p(x)dx B q(x)dx + R/B q(x)dx R/B p(x)dx B p(x)dx B q(x)dx + ( B q(x)dx) ( B p(x)dx) ( B p(x)dx B q(x)dx) P (B ) Q(B ). P (B ) Q(B ). We ave foud a set B R suc tat terefore, Cobiig (4) ad (5), we ave p q P (B ) Q(B ), p q sup P (B) Q(B). (5) B R T V (P, Q) p q. (b) (0 pts.) Let F be te σ-field geerated by te sets A o te saple space Ω, ad C T (F) {T (A) A F}. Defie T (C) {ω Ω T (ω) C}, i.e. te pre-iage appig. By defiitio, Te, T (C) {T (C) C C} F. sup P (T (X) C) P (T (Y ) C) sup P (X A) P (Y A) C C A T (C) sup P (X A) P (Y A). A F
13 0/36-70 Statistical Macie Learig: Hoework (c) (5 pts.) p K q K p(z)k(x z)dz q(z)k(x z)dz dx (p(z) q(z))k(x z)dz dx p(z) q(z) K(x z) dzdx p(z) q(z) K(x z) dxdz Fubii s teore ( p(z) q(z) K(x z) dx)dz ( p(z) q(z) K(x) dx)dz ivariat to traslatio K(x) dx p(z) q(z) dz K p q 3
14 0/36-70 Statistical Macie Learig: Hoework Refereces [] L. Wassera, All of Noparaetric Statistics, Spriger Texts i Statistics, 006. Appedix Proof of Clai. Fro p a p b p a b p p a p + p b p, we ave Te, p E[ Z i p ] p (x) p E[ Z i p (x) p ] p E[ Z i p ] + p p (x) p. E[ Z i p ] p K p x u p(u)du p K p ( v )p(x + v)dv. So as 0, coose ay [a, b] suc tat K p ( v ) > 0 for soe v [a, b], te K p ( v )p(x+v)dv b a K p ( v )p(x + v)dv b a K p ( v )p(x)dv > 0 by te Bouded Covergece Teore. Also, K p ( v )p(x + v)dv K p ( v ) sup x p(x)dv <, ece K p ( v )p(x + v)dv Θ(), ad accordigly, E[ Z i p ] Θ p. Te Hece p (x) E[Z i ] E[ Z i ] O(). Θ p p E[ Z i p ] p (x) p E[ Z i p (x) p ] p E[ Z i p ] + p p (x) p Θ p wic iplies E[ Z i p (x) p ] Θ p. 4
ECE 901 Lecture 4: Estimation of Lipschitz smooth functions
ECE 9 Lecture 4: Estiatio of Lipschitz sooth fuctios R. Nowak 5/7/29 Cosider the followig settig. Let Y f (X) + W, where X is a rado variable (r.v.) o X [, ], W is a r.v. o Y R, idepedet of X ad satisfyig
More informationLECTURE 2 LEAST SQUARES CROSS-VALIDATION FOR KERNEL DENSITY ESTIMATION
Jauary 3 07 LECTURE LEAST SQUARES CROSS-VALIDATION FOR ERNEL DENSITY ESTIMATION Noparametric kerel estimatio is extremely sesitive to te coice of badwidt as larger values of result i averagig over more
More informationLecture 7: Density Estimation: k-nearest Neighbor and Basis Approach
STAT 425: Itroductio to Noparametric Statistics Witer 28 Lecture 7: Desity Estimatio: k-nearest Neighbor ad Basis Approach Istructor: Ye-Chi Che Referece: Sectio 8.4 of All of Noparametric Statistics.
More informationOptimal Estimator for a Sample Set with Response Error. Ed Stanek
Optial Estiator for a Saple Set wit Respose Error Ed Staek Itroductio We develop a optial estiator siilar to te FP estiator wit respose error tat was cosidered i c08ed63doc Te first 6 pages of tis docuet
More informationLecture 15: Density estimation
Lecture 15: Desity estimatio Why do we estimate a desity? Suppose that X 1,...,X are i.i.d. radom variables from F ad that F is ukow but has a Lebesgue p.d.f. f. Estimatio of F ca be doe by estimatig f.
More informationLecture 10: Bounded Linear Operators and Orthogonality in Hilbert Spaces
Lecture : Bouded Liear Operators ad Orthogoality i Hilbert Spaces 34 Bouded Liear Operator Let ( X, ), ( Y, ) i i be ored liear vector spaces ad { } X Y The, T is said to be bouded if a real uber c such
More informationProbability Theory. Exercise Sheet 4. ETH Zurich HS 2017
ETH Zurich HS 2017 D-MATH, D-PHYS Prof. A.-S. Szita Coordiator Yili Wag Probability Theory Exercise Sheet 4 Exercise 4.1 Let X ) N be a sequece of i.i.d. rado variables i a probability space Ω, A, P ).
More informationA PROBABILITY PROBLEM
A PROBABILITY PROBLEM A big superarket chai has the followig policy: For every Euros you sped per buy, you ear oe poit (suppose, e.g., that = 3; i this case, if you sped 8.45 Euros, you get two poits,
More informationNonparametric regression: minimax upper and lower bounds
Capter 4 Noparametric regressio: miimax upper ad lower bouds 4. Itroductio We cosider oe of te two te most classical o-parametric problems i tis example: estimatig a regressio fuctio o a subset of te real
More informationStanford Statistics 311/Electrical Engineering 377
I. Uiversal predictio ad codig a. Gae: sequecex ofdata, adwattopredict(orcode)aswellasifwekew distributio of data b. Two versios: probabilistic ad adversarial. I either case, let p ad q be desities or
More informationLecture 6 Chi Square Distribution (χ 2 ) and Least Squares Fitting
Lecture 6 Chi Square Distributio (χ ) ad Least Squares Fittig Chi Square Distributio (χ ) Suppose: We have a set of measuremets {x 1, x, x }. We kow the true value of each x i (x t1, x t, x t ). We would
More informationLIMITS AND DERIVATIVES
Capter LIMITS AND DERIVATIVES. Overview.. Limits of a fuctio Let f be a fuctio defied i a domai wic we take to be a iterval, say, I. We sall study te cocept of it of f at a poit a i I. We say f ( ) is
More informationChapter 6 Principles of Data Reduction
Chapter 6 for BST 695: Special Topics i Statistical Theory. Kui Zhag, 0 Chapter 6 Priciples of Data Reductio Sectio 6. Itroductio Goal: To summarize or reduce the data X, X,, X to get iformatio about a
More informationUnbiased Estimation. February 7-12, 2008
Ubiased Estimatio February 7-2, 2008 We begi with a sample X = (X,..., X ) of radom variables chose accordig to oe of a family of probabilities P θ where θ is elemet from the parameter space Θ. For radom
More information1 Approximating Integrals using Taylor Polynomials
Seughee Ye Ma 8: Week 7 Nov Week 7 Summary This week, we will lear how we ca approximate itegrals usig Taylor series ad umerical methods. Topics Page Approximatig Itegrals usig Taylor Polyomials. Defiitios................................................
More informationLecture 6 Chi Square Distribution (χ 2 ) and Least Squares Fitting
Lecture 6 Chi Square Distributio (χ ) ad Least Squares Fittig Chi Square Distributio (χ ) Suppose: We have a set of measuremets {x 1, x, x }. We kow the true value of each x i (x t1, x t, x t ). We would
More informationHOMEWORK #10 SOLUTIONS
Math 33 - Aalysis I Sprig 29 HOMEWORK # SOLUTIONS () Prove that the fuctio f(x) = x 3 is (Riema) itegrable o [, ] ad show that x 3 dx = 4. (Without usig formulae for itegratio that you leart i previous
More informationLearning Theory for Conditional Risk Minimization: Supplementary Material
Learig Theory for Coditioal Risk Miiizatio: Suppleetary Material Alexader Zii IST Austria azii@istacat Christoph H Lapter IST Austria chl@istacat Proofs Proof of Theore After the applicatio of (6) ad (8)
More informationLIMITS AND DERIVATIVES NCERT
. Overview.. Limits of a fuctio Let f be a fuctio defied i a domai wic we take to be a iterval, say, I. We sall study te cocept of it of f at a poit a i I. We say f ( ) is te epected value of f at a give
More informationEECS564 Estimation, Filtering, and Detection Hwk 2 Solns. Winter p θ (z) = (2θz + 1 θ), 0 z 1
EECS564 Estimatio, Filterig, ad Detectio Hwk 2 Sols. Witer 25 4. Let Z be a sigle observatio havig desity fuctio where. p (z) = (2z + ), z (a) Assumig that is a oradom parameter, fid ad plot the maximum
More informationMATH 10550, EXAM 3 SOLUTIONS
MATH 155, EXAM 3 SOLUTIONS 1. I fidig a approximate solutio to the equatio x 3 +x 4 = usig Newto s method with iitial approximatio x 1 = 1, what is x? Solutio. Recall that x +1 = x f(x ) f (x ). Hece,
More informationLecture 19: Convergence
Lecture 19: Covergece Asymptotic approach I statistical aalysis or iferece, a key to the success of fidig a good procedure is beig able to fid some momets ad/or distributios of various statistics. I may
More informationStatistics for Applications Fall Problem Set 7
18.650. Statistics for Applicatios Fall 016. Proble Set 7 Due Friday, Oct. 8 at 1 oo Proble 1 QQ-plots Recall that the Laplace distributio with paraeter λ > 0 is the cotiuous probaλ bility easure with
More informationAN EFFICIENT ESTIMATION METHOD FOR THE PARETO DISTRIBUTION
Joural of Statistics: Advaces i Theory ad Applicatios Volue 3, Nuber, 00, Pages 6-78 AN EFFICIENT ESTIMATION METHOD FOR THE PARETO DISTRIBUTION Departet of Matheatics Brock Uiversity St. Catharies, Otario
More informationf(1), and so, if f is continuous, f(x) = f(1)x.
2.2.35: Let f be a additive fuctio. i Clearly fx = fx ad therefore f x = fx for all Z+ ad x R. Hece, for ay, Z +, f = f, ad so, if f is cotiuous, fx = fx. ii Suppose that f is bouded o soe o-epty ope set.
More informationLECTURE 8: ASYMPTOTICS I
LECTURE 8: ASYMPTOTICS I We are iterested i the properties of estimators as. Cosider a sequece of radom variables {, X 1}. N. M. Kiefer, Corell Uiversity, Ecoomics 60 1 Defiitio: (Weak covergece) A sequece
More informationIntegrals of Functions of Several Variables
Itegrals of Fuctios of Several Variables We ofte resort to itegratios i order to deterie the exact value I of soe quatity which we are uable to evaluate by perforig a fiite uber of additio or ultiplicatio
More informationMath Solutions to homework 6
Math 175 - Solutios to homework 6 Cédric De Groote November 16, 2017 Problem 1 (8.11 i the book): Let K be a compact Hermitia operator o a Hilbert space H ad let the kerel of K be {0}. Show that there
More informationSolutions to HW Assignment 1
Solutios to HW: 1 Course: Theory of Probability II Page: 1 of 6 Uiversity of Texas at Austi Solutios to HW Assigmet 1 Problem 1.1. Let Ω, F, {F } 0, P) be a filtered probability space ad T a stoppig time.
More informationNotes 27 : Brownian motion: path properties
Notes 27 : Browia motio: path properties Math 733-734: Theory of Probability Lecturer: Sebastie Roch Refereces:[Dur10, Sectio 8.1], [MP10, Sectio 1.1, 1.2, 1.3]. Recall: DEF 27.1 (Covariace) Let X = (X
More information8.1 Introduction. 8. Nonparametric Inference Using Orthogonal Functions
8. Noparametric Iferece Usig Orthogoal Fuctios 1. Itroductio. Noparametric Regressio 3. Irregular Desigs 4. Desity Estimatio 5. Compariso of Methods 8.1 Itroductio Use a orthogoal basis to covert oparametric
More informationChapter 7 Isoperimetric problem
Chapter 7 Isoperimetric problem Recall that the isoperimetric problem (see the itroductio its coectio with ido s proble) is oe of the most classical problem of a shape optimizatio. It ca be formulated
More information7.1 Convergence of sequences of random variables
Chapter 7 Limit Theorems Throughout this sectio we will assume a probability space (, F, P), i which is defied a ifiite sequece of radom variables (X ) ad a radom variable X. The fact that for every ifiite
More informationSolution: APPM 1360 Final Spring 2013
APPM 36 Fial Sprig 3. For this proble let the regio R be the regio eclosed by the curve y l( ) ad the lies, y, ad y. (a) (6 pts) Fid the area of the regio R. (b) (6 pts) Suppose the regio R is revolved
More informationLecture 19. Curve fitting I. 1 Introduction. 2 Fitting a constant to measured data
Lecture 9 Curve fittig I Itroductio Suppose we are preseted with eight poits of easured data (x i, y j ). As show i Fig. o the left, we could represet the uderlyig fuctio of which these data are saples
More informationA New Type of q-szász-mirakjan Operators
Filoat 3:8 07, 567 568 https://doi.org/0.98/fil7867c Published by Faculty of Scieces ad Matheatics, Uiversity of Niš, Serbia Available at: http://www.pf.i.ac.rs/filoat A New Type of -Szász-Miraka Operators
More informationLecture 01: the Central Limit Theorem. 1 Central Limit Theorem for i.i.d. random variables
CSCI-B609: A Theorist s Toolkit, Fall 06 Aug 3 Lecture 0: the Cetral Limit Theorem Lecturer: Yua Zhou Scribe: Yua Xie & Yua Zhou Cetral Limit Theorem for iid radom variables Let us say that we wat to aalyze
More informationLecture 20: Multivariate convergence and the Central Limit Theorem
Lecture 20: Multivariate covergece ad the Cetral Limit Theorem Covergece i distributio for radom vectors Let Z,Z 1,Z 2,... be radom vectors o R k. If the cdf of Z is cotiuous, the we ca defie covergece
More informationContents Two Sample t Tests Two Sample t Tests
Cotets 3.5.3 Two Saple t Tests................................... 3.5.3 Two Saple t Tests Setup: Two Saples We ow focus o a sceario where we have two idepedet saples fro possibly differet populatios. Our
More informationNotes 19 : Martingale CLT
Notes 9 : Martigale CLT Math 733-734: Theory of Probability Lecturer: Sebastie Roch Refereces: [Bil95, Chapter 35], [Roc, Chapter 3]. Sice we have ot ecoutered weak covergece i some time, we first recall
More informationMath 210A Homework 1
Math 0A Homework Edward Burkard Exercise. a) State the defiitio of a aalytic fuctio. b) What are the relatioships betwee aalytic fuctios ad the Cauchy-Riema equatios? Solutio. a) A fuctio f : G C is called
More informationBertrand s postulate Chapter 2
Bertrad s postulate Chapter We have see that the sequece of prie ubers, 3, 5, 7,... is ifiite. To see that the size of its gaps is ot bouded, let N := 3 5 p deote the product of all prie ubers that are
More information1 = δ2 (0, ), Y Y n nδ. , T n = Y Y n n. ( U n,k + X ) ( f U n,k + Y ) n 2n f U n,k + θ Y ) 2 E X1 2 X1
8. The cetral limit theorems 8.1. The cetral limit theorem for i.i.d. sequeces. ecall that C ( is N -separatig. Theorem 8.1. Let X 1, X,... be i.i.d. radom variables with EX 1 = ad EX 1 = σ (,. Suppose
More informationExpectation and Variance of a random variable
Chapter 11 Expectatio ad Variace of a radom variable The aim of this lecture is to defie ad itroduce mathematical Expectatio ad variace of a fuctio of discrete & cotiuous radom variables ad the distributio
More informationThis exam contains 19 pages (including this cover page) and 10 questions. A Formulae sheet is provided with the exam.
Probability ad Statistics FS 07 Secod Sessio Exam 09.0.08 Time Limit: 80 Miutes Name: Studet ID: This exam cotais 9 pages (icludig this cover page) ad 0 questios. A Formulae sheet is provided with the
More information1 Convergence in Probability and the Weak Law of Large Numbers
36-752 Advaced Probability Overview Sprig 2018 8. Covergece Cocepts: i Probability, i L p ad Almost Surely Istructor: Alessadro Rialdo Associated readig: Sec 2.4, 2.5, ad 4.11 of Ash ad Doléas-Dade; Sec
More informationNotes 5 : More on the a.s. convergence of sums
Notes 5 : More o the a.s. covergece of sums Math 733-734: Theory of Probability Lecturer: Sebastie Roch Refereces: Dur0, Sectios.5; Wil9, Sectio 4.7, Shi96, Sectio IV.4, Dur0, Sectio.. Radom series. Three-series
More informationCONCENTRATION INEQUALITIES
CONCENTRATION INEQUALITIES MAXIM RAGINSKY I te previous lecture, te followig result was stated witout proof. If X 1,..., X are idepedet Beroulliθ radom variables represetig te outcomes of a sequece of
More information7.1 Convergence of sequences of random variables
Chapter 7 Limit theorems Throughout this sectio we will assume a probability space (Ω, F, P), i which is defied a ifiite sequece of radom variables (X ) ad a radom variable X. The fact that for every ifiite
More informationIntroduction to Probability. Ariel Yadin
Itroductio to robability Ariel Yadi Lecture 2 *** Ja. 7 ***. Covergece of Radom Variables As i the case of sequeces of umbers, we would like to talk about covergece of radom variables. There are may ways
More informationSTAT Homework 1 - Solutions
STAT-36700 Homework 1 - Solutios Fall 018 September 11, 018 This cotais solutios for Homework 1. Please ote that we have icluded several additioal commets ad approaches to the problems to give you better
More informationDefine a Markov chain on {1,..., 6} with transition probability matrix P =
Pla Group Work 0. The title says it all Next Tie: MCMC ad Geeral-state Markov Chais Midter Exa: Tuesday 8 March i class Hoework 4 due Thursday Uless otherwise oted, let X be a irreducible, aperiodic Markov
More informationMATH301 Real Analysis (2008 Fall) Tutorial Note #7. k=1 f k (x) converges pointwise to S(x) on E if and
MATH01 Real Aalysis (2008 Fall) Tutorial Note #7 Sequece ad Series of fuctio 1: Poitwise Covergece ad Uiform Covergece Part I: Poitwise Covergece Defiitio of poitwise covergece: A sequece of fuctios f
More informationST5215: Advanced Statistical Theory
ST525: Advaced Statistical Theory Departmet of Statistics & Applied Probability Tuesday, September 7, 2 ST525: Advaced Statistical Theory Lecture : The law of large umbers The Law of Large Numbers The
More informationStatistical Theory MT 2009 Problems 1: Solution sketches
Statistical Theory MT 009 Problems : Solutio sketches. Which of the followig desities are withi a expoetial family? Explai your reasoig. (a) Let 0 < θ < ad put f(x, θ) = ( θ)θ x ; x = 0,,,... (b) (c) where
More informationSTAT Homework 2 - Solutions
STAT-36700 Homework - Solutios Fall 08 September 4, 08 This cotais solutios for Homework. Please ote that we have icluded several additioal commets ad approaches to the problems to give you better isight.
More informationSupplementary Material
Suppleetary Material Wezhuo Ya a0096049@us.edu.s Departet of Mechaical Eieeri, Natioal Uiversity of Siapore, Siapore 117576 Hua Xu pexuh@us.edu.s Departet of Mechaical Eieeri, Natioal Uiversity of Siapore,
More information1 The Haar functions and the Brownian motion
1 The Haar fuctios ad the Browia motio 1.1 The Haar fuctios ad their completeess The Haar fuctios The basic Haar fuctio is 1 if x < 1/2, ψx) = 1 if 1/2 x < 1, otherwise. 1.1) It has mea zero 1 ψx)dx =,
More informationMathematics 170B Selected HW Solutions.
Mathematics 17B Selected HW Solutios. F 4. Suppose X is B(,p). (a)fidthemometgeeratigfuctiom (s)of(x p)/ p(1 p). Write q = 1 p. The MGF of X is (pe s + q), sice X ca be writte as the sum of idepedet Beroulli
More informationCONVERGENCE TO FISHER INFORMATION AND THE CENTRAL LIMIT THEOREM
Techical Scieces ad Applied Matheatics CONVGNC TO FISH INFOMATION AND TH CNTAL LIMIT THOM Bogda Gheorghe MUNTANU, Doru LUCULSCU Heri Coadă Air Forces Acadey, Braşov Abstract: I this paper, we give coditios
More informationx x x Using a second Taylor polynomial with remainder, find the best constant C so that for x 0,
Math Activity 9( Due with Fial Eam) Usig first ad secod Taylor polyomials with remaider, show that for, 8 Usig a secod Taylor polyomial with remaider, fid the best costat C so that for, C 9 The th Derivative
More informationLecture 11 and 12: Basic estimation theory
Lecture ad 2: Basic estimatio theory Sprig 202 - EE 94 Networked estimatio ad cotrol Prof. Kha March 2 202 I. MAXIMUM-LIKELIHOOD ESTIMATORS The maximum likelihood priciple is deceptively simple. Louis
More informationMa 530 Infinite Series I
Ma 50 Ifiite Series I Please ote that i additio to the material below this lecture icorporated material from the Visual Calculus web site. The material o sequeces is at Visual Sequeces. (To use this li
More informationLøsningsførslag i 4M
Norges tekisk aturviteskapelige uiversitet Istitutt for matematiske fag Side 1 av 6 Løsigsførslag i 4M Oppgave 1 a) A sketch of the graph of the give f o the iterval [ 3, 3) is as follows: The Fourier
More informationSupplementary Material for Fast Stochastic AUC Maximization with O(1/n)-Convergence Rate
Supplemetary Material for Fast Stochastic AUC Maximizatio with O/-Covergece Rate Migrui Liu Xiaoxua Zhag Zaiyi Che Xiaoyu Wag 3 iabao Yag echical Lemmas ized versio of Hoeffdig s iequality, ote that We
More informationLecture 7 Testing Nonlinear Inequality Restrictions 1
Eco 75 Lecture 7 Testig Noliear Iequality Restrictios I Lecture 6, we discussed te testig problems were te ull ypotesis is de ed by oliear equality restrictios: H : ( ) = versus H : ( ) 6= : () We sowed
More informationLecture Outline. 2 Separating Hyperplanes. 3 Banach Mazur Distance An Algorithmist s Toolkit October 22, 2009
18.409 A Algorithist s Toolkit October, 009 Lecture 1 Lecturer: Joatha Keler Scribes: Alex Levi (009) 1 Outlie Today we ll go over soe of the details fro last class ad ake precise ay details that were
More informationReliability Equivalence Analysis of a Parallel-Series System Subject to Degradation Facility
Sciece Joural of Applied Mateatics ad Statistics 5; 3(3): 6-64 Publised olie Jue 6 5 (ttp://www.sciecepublisiggroup.co/j/sjas) doi:.648/j.sjas.533.9 ISSN: 376-949 (Prit); ISSN: 376-953 (Olie) Reliability
More informationDifferentiation Techniques 1: Power, Constant Multiple, Sum and Difference Rules
Differetiatio Teciques : Power, Costat Multiple, Sum ad Differece Rules 97 Differetiatio Teciques : Power, Costat Multiple, Sum ad Differece Rules Model : Fidig te Equatio of f '() from a Grap of f ()
More informationSolutions: Homework 3
Solutios: Homework 3 Suppose that the radom variables Y,...,Y satisfy Y i = x i + " i : i =,..., IID where x,...,x R are fixed values ad ",...," Normal(0, )with R + kow. Fid ˆ = MLE( ). IND Solutio: Observe
More informationMDIV. Multiple divisor functions
MDIV. Multiple divisor fuctios The fuctios τ k For k, defie τ k ( to be the umber of (ordered factorisatios of ito k factors, i other words, the umber of ordered k-tuples (j, j 2,..., j k with j j 2...
More informations = and t = with C ij = A i B j F. (i) Note that cs = M and so ca i µ(a i ) I E (cs) = = c a i µ(a i ) = ci E (s). (ii) Note that s + t = M and so
3 From the otes we see that the parts of Theorem 4. that cocer us are: Let s ad t be two simple o-egative F-measurable fuctios o X, F, µ ad E, F F. The i I E cs ci E s for all c R, ii I E s + t I E s +
More informationECE 330:541, Stochastic Signals and Systems Lecture Notes on Limit Theorems from Probability Fall 2002
ECE 330:541, Stochastic Sigals ad Systems Lecture Notes o Limit Theorems from robability Fall 00 I practice, there are two ways we ca costruct a ew sequece of radom variables from a old sequece of radom
More informationChapter 2 The Monte Carlo Method
Chapter 2 The Mote Carlo Method The Mote Carlo Method stads for a broad class of computatioal algorithms that rely o radom sampligs. It is ofte used i physical ad mathematical problems ad is most useful
More informationMath 128A: Homework 1 Solutions
Math 8A: Homework Solutios Due: Jue. Determie the limits of the followig sequeces as. a) a = +. lim a + = lim =. b) a = + ). c) a = si4 +6) +. lim a = lim = lim + ) [ + ) ] = [ e ] = e 6. Observe that
More informationAn Orthogonality Property of Legendre Polynomials
A Orthogoality Property of Legedre Polyomials L. Bos, A. Naraya, N. Leveberg 3 ad F. Piazzo 4 April 7, 05 Abstract We give a remarable secod othogoality property of the classical Legedre polyomials o the
More informationStability analysis of numerical methods for stochastic systems with additive noise
Stability aalysis of umerical metods for stoctic systems wit additive oise Yosiiro SAITO Abstract Stoctic differetial equatios (SDEs) represet pysical peomea domiated by stoctic processes As for determiistic
More informationSDS 321: Introduction to Probability and Statistics
SDS 321: Itroductio to Probability ad Statistics Lecture 23: Cotiuous radom variables- Iequalities, CLT Puramrita Sarkar Departmet of Statistics ad Data Sciece The Uiversity of Texas at Austi www.cs.cmu.edu/
More informationStatistical Theory MT 2008 Problems 1: Solution sketches
Statistical Theory MT 008 Problems : Solutio sketches. Which of the followig desities are withi a expoetial family? Explai your reasoig. a) Let 0 < θ < ad put fx, θ) = θ)θ x ; x = 0,,,... b) c) where α
More informationLecture 8: Convergence of transformations and law of large numbers
Lecture 8: Covergece of trasformatios ad law of large umbers Trasformatio ad covergece Trasformatio is a importat tool i statistics. If X coverges to X i some sese, we ofte eed to check whether g(x ) coverges
More information4 Conditional Distribution Estimation
4 Coditioal Distributio Estimatio 4. Estimators Te coditioal distributio (CDF) of y i give X i = x is F (y j x) = P (y i y j X i = x) = E ( (y i y) j X i = x) : Tis is te coditioal mea of te radom variable
More information42 Dependence and Bases
42 Depedece ad Bases The spa s(a) of a subset A i vector space V is a subspace of V. This spa ay be the whole vector space V (we say the A spas V). I this paragraph we study subsets A of V which spa V
More informationWe are mainly going to be concerned with power series in x, such as. (x)} converges - that is, lims N n
Review of Power Series, Power Series Solutios A power series i x - a is a ifiite series of the form c (x a) =c +c (x a)+(x a) +... We also call this a power series cetered at a. Ex. (x+) is cetered at
More informationREGRESSION WITH QUADRATIC LOSS
REGRESSION WITH QUADRATIC LOSS MAXIM RAGINSKY Regressio with quadratic loss is aother basic problem studied i statistical learig theory. We have a radom couple Z = X, Y ), where, as before, X is a R d
More informationIntroduction to Extreme Value Theory Laurens de Haan, ISM Japan, Erasmus University Rotterdam, NL University of Lisbon, PT
Itroductio to Extreme Value Theory Laures de Haa, ISM Japa, 202 Itroductio to Extreme Value Theory Laures de Haa Erasmus Uiversity Rotterdam, NL Uiversity of Lisbo, PT Itroductio to Extreme Value Theory
More informationAPPLIED MULTIVARIATE ANALYSIS
ALIED MULTIVARIATE ANALYSIS FREQUENTLY ASKED QUESTIONS AMIT MITRA & SHARMISHTHA MITRA DEARTMENT OF MATHEMATICS & STATISTICS INDIAN INSTITUTE OF TECHNOLOGY KANUR X = X X X [] The variace covariace atrix
More informationBinomial transform of products
Jauary 02 207 Bioial trasfor of products Khristo N Boyadzhiev Departet of Matheatics ad Statistics Ohio Norther Uiversity Ada OH 4580 USA -boyadzhiev@ouedu Abstract Give the bioial trasfors { b } ad {
More informationWe have also learned that, thanks to the Central Limit Theorem and the Law of Large Numbers,
Cofidece Itervals III What we kow so far: We have see how to set cofidece itervals for the ea, or expected value, of a oral probability distributio, both whe the variace is kow (usig the stadard oral,
More informationMA541 : Real Analysis. Tutorial and Practice Problems - 1 Hints and Solutions
MA54 : Real Aalysis Tutorial ad Practice Problems - Hits ad Solutios. Suppose that S is a oempty subset of real umbers that is bouded (i.e. bouded above as well as below). Prove that if S sup S. What ca
More informationLecture 9: Regression: Regressogram and Kernel Regression
STAT 425: Itroductio to Noparametric Statistics Witer 208 Lecture 9: Regressio: Regressogram ad erel Regressio Istructor: Ye-Ci Ce Referece: Capter 5 of All of oparametric statistics 9 Itroductio Let X,
More informationChapter 10: Power Series
Chapter : Power Series 57 Chapter Overview: Power Series The reaso series are part of a Calculus course is that there are fuctios which caot be itegrated. All power series, though, ca be itegrated because
More information17. Joint distributions of extreme order statistics Lehmann 5.1; Ferguson 15
17. Joit distributios of extreme order statistics Lehma 5.1; Ferguso 15 I Example 10., we derived the asymptotic distributio of the maximum from a radom sample from a uiform distributio. We did this usig
More informationOn Some Properties of Tensor Product of Operators
Global Joural of Pure ad Applied Matheatics. ISSN 0973-1768 Volue 12, Nuber 6 (2016), pp. 5139-5147 Research Idia Publicatios http://www.ripublicatio.co/gjpa.ht O Soe Properties of Tesor Product of Operators
More informationLecture 7: Properties of Random Samples
Lecture 7: Properties of Radom Samples 1 Cotiued From Last Class Theorem 1.1. Let X 1, X,...X be a radom sample from a populatio with mea µ ad variace σ
More informationSequences and Series of Functions
Chapter 6 Sequeces ad Series of Fuctios 6.1. Covergece of a Sequece of Fuctios Poitwise Covergece. Defiitio 6.1. Let, for each N, fuctio f : A R be defied. If, for each x A, the sequece (f (x)) coverges
More informationMTH 142 Exam 3 Spr 2011 Practice Problem Solutions 1
MTH 42 Exam 3 Spr 20 Practice Problem Solutios No calculators will be permitted at the exam. 3. A pig-pog ball is lauched straight up, rises to a height of 5 feet, the falls back to the lauch poit ad bouces
More informationMath 113, Calculus II Winter 2007 Final Exam Solutions
Math, Calculus II Witer 7 Fial Exam Solutios (5 poits) Use the limit defiitio of the defiite itegral ad the sum formulas to compute x x + dx The check your aswer usig the Evaluatio Theorem Solutio: I this
More informationThe Hypergeometric Coupon Collection Problem and its Dual
Joural of Idustrial ad Systes Egieerig Vol., o., pp -7 Sprig 7 The Hypergeoetric Coupo Collectio Proble ad its Dual Sheldo M. Ross Epstei Departet of Idustrial ad Systes Egieerig, Uiversity of Souther
More informationMaximum Likelihood Estimation
ECE 645: Estimatio Theory Sprig 2015 Istructor: Prof. Staley H. Cha Maximum Likelihood Estimatio (LaTeX prepared by Shaobo Fag) April 14, 2015 This lecture ote is based o ECE 645(Sprig 2015) by Prof. Staley
More information